🔪 flaky and Zombienet tests by ggwpez · Pull Request #8600 · paritytech/polkadot-sdk

ggwpez · 2025-05-21T15:58:51Z

Commenting out all flaky tests and tracking them here #48

Changes:

Disable flaky Rust tests by adding a new disabled feature. #[ignore] attribute is not possible since CI runs with --ignored
Disable all Zombienet tests
Waiting for CI what other tests fail.

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

substrate/client/consensus/babe/src/tests.rs

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

paritytech-workflow-stopper · 2025-05-22T10:43:00Z

All GitHub workflows were cancelled due to failure one of the required jobs.
Failed workflow url: https://github.com/paritytech/polkadot-sdk/actions/runs/15184421552
Failed job name: fmt

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

ggwpez · 2025-05-22T20:49:08Z

Initially this was disabling the flaky ones, why all zombienet tests now? Some of them were passing afaik.

Yes but i kept disabling more and more, so now I disabled all since there is no way to check if they are not flaky without running the CI many times.

Maybe we can go the other way and initially disable all and then enable the non-flaky ones one-by-one.

bkchr · 2025-05-22T21:06:54Z

Maybe we could just disable the zombienet tests in the merge queue?

athei · 2025-05-22T22:28:48Z

Maybe we could just disable the zombienet tests in the merge queue?

In the PR they are just noise. We always merge with them red because they almost always are. I see no reason to keep them.

Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.

* master: (99 commits) Snowbridge: Remove asset location check for compatibility (#8473) add poke_deposit extrinsic to pallet-bounties (#8382) litep2p/peerset: Reject non-reserved peers in the reserved-only mode (#8650) Charge deposit based on key length (#8648) [pallet-revive] make subscription task panic on error (#8587) tx/metrics: Add metrics for the RPC v2 `transactionWatch_v1_submitAndWatch` (#8345) Bridges: Fix - Improve try-state for pallet-xcm-bridge-hub (#8615) Introduce CreateBare, deprecated CreateInherent (#7597) Use hashbrown hashmap/hashset in validation context (#8606) ci: rm gitlab config (#8622) 🔪 flaky and Zombienet tests (#8600) cumulus: adjust unincluded segment size metric buckets (#8617) Benchmark storage access on block validation (#8069) Revert 7934 es/remove tj changes (#8611) collator-protocol: add more collation observability (#8230) `fatxpool`: add fallback for ready at light (#8533) txpool: fix tx removal from unlocks set (#8500) XCMP weight metering: account for the MQ page position (#8344) fix epmb solution duplicate issue + add remote mining apparatus to epm (#8585) Fix generated address returned by Substrate RPC runtime call (#8504) ...

alindima · 2025-05-28T11:52:53Z

I see this PR disabled all zombienet tests, which I think was a bad idea.
They aren't just noise. Certainly not all of them were flaky and most of them were flaky for infrastructure reasons (running on spot instances or undersized hardware), based on my experience of running them locally to check. Us in the Parachains team have been working on porting them to zombienet-sdk and trying to make them more reliable.

I'd suggest to just not make them mandatory for merging the PR. We should still run them but take the results with a grain of salt. There are certainly some tests that are failing almost all the time (and we could disable these altogether maybe), but at least we also had plenty of tests that were failing extremely rarely (or never).
They are valuable even as informational only. Maybe not for everyone, depending on which area of the code you're usually touching

bkchr · 2025-05-28T11:59:14Z

Us in the Parachains team have been working on porting them to zombienet-sdk and trying to make them more reliable.

Porting to zombienet-sdk will not change anything in regards to their flakiness.

They are valuable even as informational only.

No. we already have merged prs in the past with failing tests and then realizing that they actually broke. It is also not really easy too check them etc.

The only proper fix is to make them rock stable. Stuff like spot instances need to be detected by zombienet itself and not counted as an error. But we also had this discussion already multiple times and these tests are still flaky and it doesn't improve.

alindima · 2025-05-28T12:06:38Z

Porting to zombienet-sdk will not change anything in regards to their flakiness.

We don't only do a direct translation. We try to test them thoroughly and investigate their flakyness. zombienet-sdk sometimes enables us to make more precise assertions.

No. we already have merged prs in the past with failing tests and then realizing that they actually broke. It is also not really easy too check them etc.

Right, there's a chance of merging some buggy code. But now we can almost be sure that we'll merge some buggy code

alexggh · 2025-05-28T12:14:18Z

The only proper fix is to make them rock stable.

I don't think anyone disagrees here, but are we in a better situation with all of them disabled ?

pepoviola · 2025-05-28T13:11:16Z

Hi, I'm also agree on this

The only proper fix is to make them rock stable.

And we are working to reach that stability, just as update:

We are working on re-enable the dashboard (tracking tests and make it easy to visualize them)
Split the deployment and test phases to easily categorize (and re-try) tests when the deployment phase fail (related to infra issues).
Audit failing test , get the root cause (infra or logic related) and fix them.

Also, in order to get more information of the test but without causing friction we propose to only run them when certain label is present (so people form parachains/node sdk can run them easily) and also make them not required.

Any other feedback/opinion is wellcome.

athei · 2025-05-28T13:49:47Z

I'd suggest to just not make them mandatory for merging the PR. We should still run them but take the results with a grain of salt.

They were never mandatory. That would have been insane. But you still had to wait for them to finish running. And I am not checking the results and nobody is checking the result of a job that fails > 50% of the time. I just merge just as anybody else. They provide ZERO value.

It makes no sense to have optional tests. Just make them solid or don't add them. We need to do better. But until then they just don't run in a PR.

alindima · 2025-05-28T14:03:01Z

They were never mandatory. That would have been insane. But you still had to wait for them to finish running. And I am not checking the results and nobody is checking the result of a job that fails > 50% of the time. I just merge just as anybody else. They provide ZERO value.

I for one (and I assume others too) got a good memory of which particular tests are flaky (or at least you could check on other PRs which are the most common ones). Definitely not all of them are (especially to the 50% ratio that you mention).
If I'm writing a particularly large PR that adds a lot of code it's useful to have the CI run and for the couple of flaky tests that are reported I'd run them locally to make sure they are passing.
Me and others have been doing this for a while. It's frustrating but certainly better than just not running any of them.

I can understand that this is not valuable to everyone, but for some it is. Having the label is a good compromise for the time being. At least I can test my code and the code I'm reviewing.

ggwpez · 2025-05-28T14:12:37Z

I can understand that this is not valuable to everyone, but for some it is. Having the label is a good compromise for the time being. At least I can test my code and the code I'm reviewing.

Yea lets go with the label for now.

I for one (and I assume others too) got a good memory of which particular tests are flaky (or at least you could check on other PRs which are the most common ones).

But to all other people working on Polkadot SDK it is confusing AF. This is a team project!
New developers come in and think they made a mistake and get frustrated because CI is red.
Disabling all ZN tests is my way of escalating the issue since it has not been taken seriously for the last two years!

alexggh · 2025-05-28T14:34:30Z

Disabling all ZN tests is my way of escalating the issue since it has not been taken seriously for the last two years!

You should've made it the other way around make them mandatory :D, so that everyone stops and fix them, but I guess we tried that as well :D.

Anyways, let's go with the label for now, but my instinct tells me this has the reverse impact where they get even lower priority.

# Description Zombienet tests are no longer automatically triggered with polkadot-sdk CI because of their flakiness (see #8600). This PR allows to conditionally trigger Zombienet tests if label 'T18-zombienet_tests' is set. This is to have an option to trigger them anyway until they are stabilized. It will help us to monitor the current status of the tests while stabilizing. --------- Co-authored-by: Javier Viola <[email protected]> Co-authored-by: Javier Viola <[email protected]>

Commenting out all flaky tests and tracking them here #48 Changes: - Disable flaky Rust tests by adding a new disabled feature. `#[ignore]` attribute is not possible since CI runs with `--ignored` - Disable all Zombienet tests - [ ] Waiting for CI what other tests fail. --------- Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.

# Description Zombienet tests are no longer automatically triggered with polkadot-sdk CI because of their flakiness (see #8600). This PR allows to conditionally trigger Zombienet tests if label 'T18-zombienet_tests' is set. This is to have an option to trigger them anyway until they are stabilized. It will help us to monitor the current status of the tests while stabilizing. --------- Co-authored-by: Javier Viola <[email protected]> Co-authored-by: Javier Viola <[email protected]>

Commenting out all flaky tests and tracking them here #48 Changes: - Disable flaky Rust tests by adding a new disabled feature. `#[ignore]` attribute is not possible since CI runs with `--ignored` - Disable all Zombienet tests - [ ] Waiting for CI what other tests fail. --------- Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.

# Description Zombienet tests are no longer automatically triggered with polkadot-sdk CI because of their flakiness (see #8600). This PR allows to conditionally trigger Zombienet tests if label 'T18-zombienet_tests' is set. This is to have an option to trigger them anyway until they are stabilized. It will help us to monitor the current status of the tests while stabilizing. --------- Co-authored-by: Javier Viola <[email protected]> Co-authored-by: Javier Viola <[email protected]>

🔪 flaky tests

5ae7d5d

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

ggwpez added the T10-tests This PR/Issue is related to tests. label May 21, 2025

ggwpez added 3 commits May 21, 2025 16:59

Merge branch 'master' into oty-shortcircuit-ci

439ec49

properly ignore

32b50db

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Merge branch 'master' into oty-shortcircuit-ci

688f9db

bkchr reviewed May 21, 2025

View reviewed changes

substrate/client/consensus/babe/src/tests.rs Show resolved Hide resolved

ggwpez added 2 commits May 21, 2025 19:39

taplo

85d7581

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Disable flaky ZB tests

e15bc89

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

ggwpez requested review from a team as code owners May 21, 2025 17:40

ggwpez added 5 commits May 21, 2025 19:42

Dont delete, just add them to the list

d5329a5

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

more

5a0d9a9

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Merge branch 'master' into oty-shortcircuit-ci

dec6fe0

comment flaky test

a53a357

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

RIP zombienet test

8122824

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

bkchr approved these changes May 21, 2025

View reviewed changes

sandreim approved these changes May 22, 2025

View reviewed changes

clippy

88513bf

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

fmt

7e9806a

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

pepoviola added the R0-no-crate-publish-required The change does not require any crates to be re-published. label May 22, 2025

pepoviola approved these changes May 22, 2025

View reviewed changes

Green tests are suspicious, try again

48ef853

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

alvicsam approved these changes May 22, 2025

View reviewed changes

ggwpez added 5 commits May 22, 2025 18:13

more flaky

e4dba82

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Merge branch 'master' into oty-shortcircuit-ci

5b6b9e5

more flaky

5f3fe49

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Disable ALL zombienet tests

eab0a29

Signed-off-by: Oliver Tale-Yazdi <[email protected]>

Merge branch 'master' into oty-shortcircuit-ci

422020f

ggwpez changed the title ~~🔪 flaky tests~~ 🔪 flaky and Zombienet tests May 22, 2025

athei added this pull request to the merge queue May 22, 2025

Merged via the queue into master with commit e723cfa May 22, 2025
182 checks passed

athei deleted the oty-shortcircuit-ci branch May 22, 2025 23:05

alvicsam mentioned this pull request May 23, 2025

ci: rm gitlab config #8622

Merged

github-merge-queue bot pushed a commit that referenced this pull request May 23, 2025

ci: rm gitlab config (#8622)

dd97d10

Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.

lrubasze mentioned this pull request May 28, 2025

Add T18-zombienet_tests label paritytech/labels#44

Merged

lrubasze mentioned this pull request May 28, 2025

ci: trigger zombienet tests if 'T18-zombienet_tests' label set #8696

Merged

bkontur mentioned this pull request Jun 9, 2025

Bridges zombienet tests #8800

Open

3 tasks

pgherveou pushed a commit that referenced this pull request Jun 11, 2025

ci: rm gitlab config (#8622)

5a9efc9

Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.

alvicsam added a commit that referenced this pull request Oct 17, 2025

ci: rm gitlab config (#8622)

191dcce

Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.

github-actions bot mentioned this pull request Oct 24, 2025

Update polkadot-sdk from stable2503 to stable2506 moondance-labs/tanssi#1340

Closed

Conversation

ggwpez commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

paritytech-workflow-stopper bot commented May 22, 2025

Uh oh!

ggwpez commented May 22, 2025

Uh oh!

bkchr commented May 22, 2025

Uh oh!

athei commented May 22, 2025

Uh oh!

Uh oh!

alindima commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bkchr commented May 28, 2025

Uh oh!

alindima commented May 28, 2025

Uh oh!

alexggh commented May 28, 2025

Uh oh!

pepoviola commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

athei commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alindima commented May 28, 2025

Uh oh!

ggwpez commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexggh commented May 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

ggwpez commented May 21, 2025 •

edited

Loading

alindima commented May 28, 2025 •

edited

Loading

pepoviola commented May 28, 2025 •

edited

Loading

athei commented May 28, 2025 •

edited

Loading

ggwpez commented May 28, 2025 •

edited

Loading