Conversation
Signed-off-by: Oliver Tale-Yazdi <[email protected]>
All GitHub workflows were cancelled due to the failure of one of the required jobs.
Yes, but I kept disabling more and more, so now I have disabled all of them, since there is no way to check that they are not flaky without running the CI many times. Maybe we can go the other way: initially disable all and then enable the non-flaky ones one by one.
Maybe we could just disable the zombienet tests in the merge queue?
In the PR they are just noise. We always merge with them red because they almost always are. I see no reason to keep them.
Since zombienet [has been disabled](#8600) to improve stability, it makes no sense to keep the GitLab configuration.
* master: (99 commits)
  Snowbridge: Remove asset location check for compatibility (#8473)
  add poke_deposit extrinsic to pallet-bounties (#8382)
  litep2p/peerset: Reject non-reserved peers in the reserved-only mode (#8650)
  Charge deposit based on key length (#8648)
  [pallet-revive] make subscription task panic on error (#8587)
  tx/metrics: Add metrics for the RPC v2 `transactionWatch_v1_submitAndWatch` (#8345)
  Bridges: Fix - Improve try-state for pallet-xcm-bridge-hub (#8615)
  Introduce CreateBare, deprecated CreateInherent (#7597)
  Use hashbrown hashmap/hashset in validation context (#8606)
  ci: rm gitlab config (#8622)
  🔪 flaky and Zombienet tests (#8600)
  cumulus: adjust unincluded segment size metric buckets (#8617)
  Benchmark storage access on block validation (#8069)
  Revert 7934 es/remove tj changes (#8611)
  collator-protocol: add more collation observability (#8230)
  `fatxpool`: add fallback for ready at light (#8533)
  txpool: fix tx removal from unlocks set (#8500)
  XCMP weight metering: account for the MQ page position (#8344)
  fix epmb solution duplicate issue + add remote mining apparatus to epm (#8585)
  Fix generated address returned by Substrate RPC runtime call (#8504)
  ...
I see this PR disabled all zombienet tests, which I think was a bad idea. I'd suggest just not making them mandatory for merging the PR. We should still run them but take the results with a grain of salt. There are certainly some tests that fail almost all the time (and maybe we could disable these altogether), but we also had plenty of tests that failed extremely rarely (or never).
Porting to zombienet-sdk will not change anything in regards to their flakiness.
No. We have already merged PRs in the past with failing tests and then realized that they actually broke things. It is also not really easy to check them, etc. The only proper fix is to make them rock solid. Things like spot instances need to be detected by zombienet itself and not counted as an error. But we have already had this discussion multiple times, and these tests are still flaky and it doesn't improve.
We don't just do a direct translation. We try to test them thoroughly and investigate their flakiness. zombienet-sdk sometimes enables us to make more precise assertions.
Right, there's a chance of merging some buggy code. But now we can almost be sure that we'll merge some buggy code.
I don't think anyone disagrees here, but are we in a better situation with all of them disabled?
Hi, I also agree with this.
And we are working to reach that stability. Just as an update:
Also, in order to get more information from the tests without causing friction, we propose to run them only when a certain label is present (so people from parachains/node sdk can run them easily) and also to make them not required. Any other feedback/opinion is welcome.
They were never mandatory. That would have been insane. But you still had to wait for them to finish running. And I am not checking the results, and nobody is checking the result of a job that fails > 50% of the time. I just merge, like anybody else. They provide ZERO value. It makes no sense to have optional tests. Just make them solid or don't add them. We need to do better. But until then they just don't run in a PR.
I for one (and I assume others too) have a good memory of which particular tests are flaky (or at least you could check on other PRs which are the most common ones). Definitely not all of them are (especially not at the 50% ratio that you mention). I can understand that this is not valuable to everyone, but for some it is. Having the label is a good compromise for the time being. At least I can test my code and the code I'm reviewing.
Yeah, let's go with the label for now.
But to all other people working on Polkadot SDK it is confusing AF. This is a team project!
You should've done it the other way around and made them mandatory :D, so that everyone stops and fixes them, but I guess we tried that as well :D. Anyway, let's go with the label for now, but my instinct tells me this has the reverse impact where they get even lower priority.
# Description

Zombienet tests are no longer triggered automatically by polkadot-sdk CI because of their flakiness (see #8600). This PR allows Zombienet tests to be triggered conditionally when the label 'T18-zombienet_tests' is set. This keeps an option to run them until they are stabilized, and it will help us monitor the current status of the tests while stabilizing them.

---------

Co-authored-by: Javier Viola <[email protected]>
Co-authored-by: Javier Viola <[email protected]>
Commenting out all flaky tests and tracking them here #48

Changes:
- Disable flaky Rust tests by adding a new disabled feature. `#[ignore]` attribute is not possible since CI runs with `--ignored`
- Disable all Zombienet tests
- [ ] Waiting for CI what other tests fail.

---------

Signed-off-by: Oliver Tale-Yazdi <[email protected]>
Commenting out all flaky tests and tracking them here #48

Changes:
- Disable flaky Rust tests by adding a new disabled feature. `#[ignore]` attribute is not possible since CI runs with `--ignored`
- Disable all Zombienet tests
- [ ] Waiting for CI what other tests fail.
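
For readers unfamiliar with the feature-gating approach mentioned in the changes, here is a minimal sketch of how a test can be hidden behind a Cargo feature that is off by default. It is not the code from the PR: the feature name `flaky-tests`, the function `checked_sum`, and the test names are hypothetical and only illustrate the idea. A test marked `#[ignore]` is still executed when the test binary is invoked with `--ignored`, whereas an item behind `#[cfg(feature = "...")]` is not compiled at all unless the feature is enabled.

```rust
// Assumed Cargo.toml entry (hypothetical feature name, off by default):
//
// [features]
// flaky-tests = []

/// Hypothetical function under test: sums the values, returning `None` on overflow.
pub fn checked_sum(values: &[u32]) -> Option<u32> {
    values.iter().try_fold(0u32, |acc, v| acc.checked_add(*v))
}

#[cfg(test)]
mod tests {
    use super::checked_sum;

    // Stable test: always compiled and run by `cargo test`.
    #[test]
    fn sums_small_values() {
        assert_eq!(checked_sum(&[1, 2, 3]), Some(6));
    }

    // Stand-in for a flaky test. It is compiled only when the hypothetical
    // `flaky-tests` feature is enabled, so neither `cargo test` nor
    // `cargo test -- --ignored` will run it by default.
    #[cfg(feature = "flaky-tests")]
    #[test]
    fn detects_overflow() {
        assert_eq!(checked_sum(&[u32::MAX, 1]), None);
    }
}
```

Under these assumptions, `cargo test --features flaky-tests` would opt back in to the gated test, which makes it possible to re-enable tests one at a time once they are believed to be stable.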