[SPARK-32506][TESTS] Flaky test: StreamingLinearRegressionWithTests #29380

huaxingao · 2020-08-06T17:58:24Z

What changes were proposed in this pull request?

The test creates 10 batches of data to train the model and expects to see error on test data improves as model is trained. If the difference between the 2nd error and the 10th error is smaller than 2, the assertion fails:

FAIL: test_train_prediction (pyspark.mllib.tests.test_streaming_algorithms.StreamingLinearRegressionWithTests)
Test that error on test data improves as model is trained.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/runner/work/spark/spark/python/pyspark/mllib/tests/test_streaming_algorithms.py", line 466, in test_train_prediction
    eventually(condition, timeout=180.0)
  File "/home/runner/work/spark/spark/python/pyspark/testing/utils.py", line 81, in eventually
    lastValue = condition()
  File "/home/runner/work/spark/spark/python/pyspark/mllib/tests/test_streaming_algorithms.py", line 461, in condition
    self.assertGreater(errors[1] - errors[-1], 2)
AssertionError: 1.672640157855923 not greater than 2

I saw this quite a few time on Jenkins but was not able to reproduce this on my local. These are the ten errors I got:

4.517395047937127
4.894265404350079
3.0392090466559876
1.8786361640757654
0.8973106042078115
0.3715780507684368
0.20815690742907672
0.17333033743125845
0.15686783249863873
0.12584413600569616

I am thinking of having 15 batches of data instead of 10, so the model can be trained for a longer time. Hopefully the 15th error - 2nd error will always be larger than 2 on Jenkins. These are the 15 errors I got on my local:

4.517395047937127
4.894265404350079
3.0392090466559876
1.8786361640757658
0.8973106042078115
0.3715780507684368
0.20815690742907672
0.17333033743125845
0.15686783249863873
0.12584413600569616
0.11883853835108477
0.09400261862100823
0.08887491447353497
0.05984929624986607
0.07583948141520978

Why are the changes needed?

Fix flaky test

Does this PR introduce any user-facing change?

No

How was this patch tested?

Manually tested

huaxingao · 2020-08-06T18:09:06Z

cc @srowen @viirya

srowen

Seems fine, or loosening the tolerances

SparkQA · 2020-08-06T18:23:16Z

Test build #127152 has finished for PR 29380 at commit df3f21e.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

viirya

Ok for me too. If it's still flaky, simply loosening the tolerances like @srowen said.

huaxingao · 2020-08-06T19:17:30Z

@srowen @viirya Thanks for taking a look. I will merge to master. Does this need to go into 3.0.1 too?

dongjoon-hyun

+1. Looks reasonable.
Also, +1 for having this on branch-3.0.

### What changes were proposed in this pull request? The test creates 10 batches of data to train the model and expects to see error on test data improves as model is trained. If the difference between the 2nd error and the 10th error is smaller than 2, the assertion fails: ``` FAIL: test_train_prediction (pyspark.mllib.tests.test_streaming_algorithms.StreamingLinearRegressionWithTests) Test that error on test data improves as model is trained. ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/runner/work/spark/spark/python/pyspark/mllib/tests/test_streaming_algorithms.py", line 466, in test_train_prediction eventually(condition, timeout=180.0) File "/home/runner/work/spark/spark/python/pyspark/testing/utils.py", line 81, in eventually lastValue = condition() File "/home/runner/work/spark/spark/python/pyspark/mllib/tests/test_streaming_algorithms.py", line 461, in condition self.assertGreater(errors[1] - errors[-1], 2) AssertionError: 1.672640157855923 not greater than 2 ``` I saw this quite a few time on Jenkins but was not able to reproduce this on my local. These are the ten errors I got: ``` 4.517395047937127 4.894265404350079 3.0392090466559876 1.8786361640757654 0.8973106042078115 0.3715780507684368 0.20815690742907672 0.17333033743125845 0.15686783249863873 0.12584413600569616 ``` I am thinking of having 15 batches of data instead of 10, so the model can be trained for a longer time. Hopefully the 15th error - 2nd error will always be larger than 2 on Jenkins. These are the 15 errors I got on my local: ``` 4.517395047937127 4.894265404350079 3.0392090466559876 1.8786361640757658 0.8973106042078115 0.3715780507684368 0.20815690742907672 0.17333033743125845 0.15686783249863873 0.12584413600569616 0.11883853835108477 0.09400261862100823 0.08887491447353497 0.05984929624986607 0.07583948141520978 ``` ### Why are the changes needed? Fix flaky test ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Manually tested Closes #29380 from huaxingao/flaky_test. Authored-by: Huaxin Gao <[email protected]> Signed-off-by: Huaxin Gao <[email protected]> (cherry picked from commit 75c2c53) Signed-off-by: Huaxin Gao <[email protected]>

huaxingao · 2020-08-06T20:56:27Z

Merged to master/3.0. Thanks everyone!

HyukjinKwon · 2020-08-07T00:58:46Z

Nice, @huaxingao!

cloud-fan · 2020-08-07T14:31:15Z

thanks for fixing it!

[SPARK-32506] Flaky test: StreamingLinearRegressionWithTests

df3f21e

probot-autolabeler bot added MLLIB PYTHON labels Aug 6, 2020

srowen reviewed Aug 6, 2020

View reviewed changes

viirya reviewed Aug 6, 2020

View reviewed changes

dongjoon-hyun approved these changes Aug 6, 2020

View reviewed changes

dongjoon-hyun changed the title ~~[SPARK-32506] Flaky test: StreamingLinearRegressionWithTests~~ [SPARK-32506][TESTS] Flaky test: StreamingLinearRegressionWithTests Aug 6, 2020

huaxingao closed this in 75c2c53 Aug 6, 2020

huaxingao deleted the flaky_test branch August 6, 2020 20:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-32506][TESTS] Flaky test: StreamingLinearRegressionWithTests #29380

[SPARK-32506][TESTS] Flaky test: StreamingLinearRegressionWithTests #29380

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

srowen left a comment

Uh oh!

SparkQA commented Aug 6, 2020

Uh oh!

viirya left a comment

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

dongjoon-hyun left a comment

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

HyukjinKwon commented Aug 7, 2020

Uh oh!

cloud-fan commented Aug 7, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

[SPARK-32506][TESTS] Flaky test: StreamingLinearRegressionWithTests #29380

[SPARK-32506][TESTS] Flaky test: StreamingLinearRegressionWithTests #29380

Uh oh!

Conversation

huaxingao commented Aug 6, 2020

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

srowen left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Aug 6, 2020

Uh oh!

viirya left a comment

Choose a reason for hiding this comment

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

huaxingao commented Aug 6, 2020

Uh oh!

HyukjinKwon commented Aug 7, 2020

Uh oh!

cloud-fan commented Aug 7, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants