Skip to content

Conversation

@BryanCutler
Copy link
Member

What changes were proposed in this pull request?

This PR breaks down the large ml/tests.py file that contains all Python ML unit tests into several smaller test files to be easier to read and maintain.

The tests are broken down as follows:

pyspark
├── __init__.py
...
├── ml
│   ├── __init__.py
...
│   ├── tests
│   │   ├── __init__.py
│   │   ├── test_algorithms.py
│   │   ├── test_base.py
│   │   ├── test_evaluation.py
│   │   ├── test_feature.py
│   │   ├── test_image.py
│   │   ├── test_linalg.py
│   │   ├── test_param.py
│   │   ├── test_persistence.py
│   │   ├── test_pipeline.py
│   │   ├── test_stat.py
│   │   ├── test_training_summary.py
│   │   ├── test_tuning.py
│   │   └── test_wrapper.py
...
├── testing
...
│   ├── mlutils.py
...

How was this patch tested?

Ran tests manually by module to ensure test count was the same, and ran python/run-tests --modules=pyspark-ml to verify all passing with Python 2.7 and Python 3.6.

@BryanCutler
Copy link
Member Author

Dist by line count:

348 ./test_algorithms.py
 84 ./test_base.py
 71 ./test_evaluation.py
314 ./test_feature.py
118 ./test_image.py
392 ./test_linalg.py
367 ./test_param.py
369 ./test_persistence.py
 77 ./test_pipeline.py
 56 ./test_stat.py
254 ./test_training_summary.py
552 ./test_tuning.py
116 ./test_wrapper.py

@BryanCutler
Copy link
Member Author

cc @HyukjinKwon

@SparkQA
Copy link

SparkQA commented Nov 17, 2018

Test build #98945 has finished for PR 23063 at commit a4f8f12.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member

Let me leave a cc for @holdenk, @MLnick, @jkbradley and @mengxr FYI.

@HyukjinKwon
Copy link
Member

Will merge this one tomorrow if this is not merged till then.

@HyukjinKwon
Copy link
Member

Merged to master.

@asfgit asfgit closed this in 034ae30 Nov 18, 2018
@BryanCutler BryanCutler deleted the python-test-breakup-ml-SPARK-26033 branch November 19, 2018 05:45
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
…r files

## What changes were proposed in this pull request?

This PR breaks down the large ml/tests.py file that contains all Python ML unit tests into several smaller test files to be easier to read and maintain.

The tests are broken down as follows:
```
pyspark
├── __init__.py
...
├── ml
│   ├── __init__.py
...
│   ├── tests
│   │   ├── __init__.py
│   │   ├── test_algorithms.py
│   │   ├── test_base.py
│   │   ├── test_evaluation.py
│   │   ├── test_feature.py
│   │   ├── test_image.py
│   │   ├── test_linalg.py
│   │   ├── test_param.py
│   │   ├── test_persistence.py
│   │   ├── test_pipeline.py
│   │   ├── test_stat.py
│   │   ├── test_training_summary.py
│   │   ├── test_tuning.py
│   │   └── test_wrapper.py
...
├── testing
...
│   ├── mlutils.py
...
```

## How was this patch tested?

Ran tests manually by module to ensure test count was the same, and ran `python/run-tests --modules=pyspark-ml` to verify all passing with Python 2.7 and Python 3.6.

Closes apache#23063 from BryanCutler/python-test-breakup-ml-SPARK-26033.

Authored-by: Bryan Cutler <[email protected]>
Signed-off-by: hyukjinkwon <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants