Conversation

@vytas7 commented Dec 20, 2020

This is an early prototype of what implementing the https://github.com/pythonspeed/cachegrind-benchmarking approach could look like for Falcon.

The prototype builds upon this excellent article by @itamarst: https://pythonspeed.com/articles/consistent-benchmarking-in-ci/ (thanks @njsmith for kindly pointing out this idea).

Closes #1450

To do if we want to proceed with this:

  • ASGI performance metric
  • Barebones "Hello, World!" performance metric
  • Media performance metric
  • Routing performance metric
  • URL params and headers performance metric
  • Clean up tox environments, requirements, etc.
  • Add Cython support
  • Run Cython gates for master only (?)
  • Add PyPy support, master only (?); probably only informational?
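
The core of the cachegrind-benchmarking approach is to run a benchmark script under Valgrind's Cachegrind tool and read the deterministic event counts back from the output file. A minimal sketch of what that could look like (the benchmark script name and output path here are assumptions, not part of this PR; the `events:`/`summary:` parsing follows Cachegrind's documented file format):

```python
import subprocess
import sys


def run_under_cachegrind(script, out_file="cachegrind.out"):
    """Run *script* under Cachegrind, then return its instruction count."""
    subprocess.run(
        ["valgrind", "--tool=cachegrind",
         f"--cachegrind-out-file={out_file}", sys.executable, script],
        check=True,
    )
    return parse_instruction_count(out_file)


def parse_instruction_count(out_file):
    """Extract the total instruction count (Ir) from a cachegrind.out file.

    Cachegrind files contain an ``events:`` header naming the counter
    columns and a ``summary:`` line with the totals in the same order.
    """
    events, summary = [], []
    with open(out_file) as f:
        for line in f:
            if line.startswith("events:"):
                events = line.split()[1:]
            elif line.startswith("summary:"):
                summary = [int(tok) for tok in line.split()[1:]]
    return dict(zip(events, summary))["Ir"]
```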

@vytas7 vytas7 marked this pull request as draft December 20, 2020 19:08
codecov bot commented Dec 20, 2020

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 100.00%. Comparing base (29b05ed) to head (30d2886).
⚠️ Report is 386 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff            @@
##            master     #1825   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           54        54           
  Lines         5154      5154           
  Branches       831       831           
=========================================
  Hits          5154      5154           

☔ View full report in Codecov by Sentry.

@itamarst

Excited to see this, and to see if it turns out to be useful. I'm going to be setting up something similar for one of my projects, so will share what I come up with if this is still in progress, or maybe steal from you, depending 😀

@itamarst

Note that I've found a bug in the cachegrind.py script calculation, so you'll want to pull a new version once I've updated it (tomorrow hopefully).

@vytas7 commented Dec 21, 2020

Heh, thanks for the heads-up @itamarst!
I did notice that the cachegrind.py metric was a lot (1–2 orders of magnitude) noisier than the raw instruction count from Valgrind, if that is what is being revised.
But maybe it is simply noisier by nature, so I didn't pay too much attention to it.

@vytas7 commented Dec 21, 2020

@itamarst Btw, another thing that really surprised me when toying with this approach was how much "rogue" Python hash seeds can affect performance 🙂

@itamarst

Yeah, you really want to set a fixed PYTHONHASHSEED for consistency.
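
To illustrate why: without a fixed `PYTHONHASHSEED`, each interpreter run gets a random string-hash seed, so dict/set memory layout (and hence the instruction count) differs from run to run. A quick sketch showing that pinning the seed makes hashes reproducible across runs:

```shell
# With PYTHONHASHSEED pinned, string hashes are identical across
# separate interpreter invocations; unset, they would vary randomly.
a=$(PYTHONHASHSEED=0 python3 -c 'print(hash("falcon"))')
b=$(PYTHONHASHSEED=0 python3 -c 'print(hash("falcon"))')
echo "$a $b"
```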

The issue was that I was counting L3 hits wrong: if something hit RAM, I also counted that as hitting L3 (i.e. L3 hits were too high). I am not sure this will have much impact on the noisiness, though.
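
The corrected accounting can be sketched as follows, using Cachegrind's counter names (`Ir`/`Dr`/`Dw` are instruction/data reads and writes, `*1m*` are L1 misses, `*Lm*` are last-level misses). The 1/5/35 weights are the cost model used by the cachegrind.py script, stated here as an assumption; the fix is that an access that missed all the way to RAM must no longer also be counted as an L3 hit:

```python
def combined_score(c):
    """Estimate a CPU-cycle-like score from Cachegrind counters in *c*."""
    ram_hits = c["ILmr"] + c["DLmr"] + c["DLmw"]      # missed the last level
    l1_misses = c["I1mr"] + c["D1mr"] + c["D1mw"]
    l3_hits = l1_misses - ram_hits                    # served by L3 only (the fix)
    l1_hits = (c["Ir"] + c["Dr"] + c["Dw"]) - l1_misses
    return l1_hits + 5 * l3_hits + 35 * ram_hits
```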

@itamarst

OK, https://github.com/pythonspeed/cachegrind-benchmarking has been updated.

@vytas7 commented Dec 22, 2020

Thanks @itamarst , I'll check that out!

@vytas7 commented Dec 26, 2020

@itamarst thanks again for the update.
The noisiness is gone, and the variation in the least-squares linear regression fitting error is now unbelievably low 💯 .
In fact, I'm now getting exactly the same cost per iteration in two different CI runs (matching to at least 9 significant digits!).
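
The per-iteration cost mentioned above can be recovered with an ordinary least-squares fit: measure total instruction counts for several iteration counts, fit a line, and take the slope as the cost of one iteration, while the intercept absorbs fixed startup overhead. A minimal dependency-free sketch (not the PR's actual code):

```python
def fit_cost_per_iteration(iterations, counts):
    """Fit counts ~= slope * iterations + intercept by least squares."""
    n = len(iterations)
    mean_x = sum(iterations) / n
    mean_y = sum(counts) / n
    sxx = sum((x - mean_x) ** 2 for x in iterations)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(iterations, counts))
    slope = sxy / sxx                  # cost of one iteration
    intercept = mean_y - slope * mean_x  # fixed (startup) overhead
    return slope, intercept
```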

@itamarst

I got benchmarks working. Beyond what is in the original repository:

  1. I store the results as a pretty-printed, key-sorted JSON file on disk. The expectation is that a developer runs this locally and checks in the result, but mine is a single-person project.
  2. On every PR, I run the benchmarks again and add the diff as a GitHub comment: https://github.com/pythonspeed/filprofiler/blob/master/.github/workflows/main.yml#L120
  3. In addition to setting PYTHONHASHSEED and figuring out an equivalent fixed seed for Rust, I ended up using Conda environments to reduce noise between my machine and the GitHub Actions machines (so that, e.g., it's the same Python binary rather than different dot-versions or different compilers). The result is quite consistent on my machine, and a little noisy between my machine and the VMs, but better than it would be without Conda.

Example output: pythonspeed/filprofiler#110
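
Step 1 above might look roughly like the following (function names are illustrative, not filprofiler's actual code): pretty-printed, key-sorted JSON keeps the checked-in file diff-friendly, and a changed-entry comparison is what would feed the PR comment in step 2:

```python
import json


def save_results(results, path):
    """Write benchmark results as pretty-printed, key-sorted JSON."""
    with open(path, "w") as f:
        json.dump(results, f, indent=2, sort_keys=True)
        f.write("\n")


def diff_results(old, new):
    """Map each changed benchmark name to its (old, new) scores."""
    return {
        name: (old.get(name), score)
        for name, score in sorted(new.items())
        if old.get(name) != score
    }
```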

@kgriffs mentioned this pull request Aug 4, 2021
@vytas7 commented Mar 13, 2022

This has been rotting for so long due to my lack of bandwidth that I'm now thinking of waiting a couple more weeks and then migrating this to Ubuntu 22.04 + CPython 3.10, so as to keep the same gauge for a longer time.

I'm hoping to circle back on this shortly after we release 3.1.



Development

Successfully merging this pull request may close these issues.

Add basic performance testing to the gate

3 participants