[SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well #24504

HyukjinKwon · 2019-05-01T13:05:05Z

What changes were proposed in this pull request?

Looks updating documentation from 0.8.0 to 0.12.1 was missed.

How was this patch tested?

N/A

HyukjinKwon · 2019-05-01T13:05:12Z

cc @BryanCutler

HyukjinKwon · 2019-05-01T13:05:42Z

docs/sql-pyspark-pandas-with-arrow.md

I realised that other Arrow optimization might likely be placed in other places .. :).

Make sense. The filename is sql-pyspark...

SparkQA · 2019-05-01T13:17:29Z

Test build #105055 has finished for PR 24504 at commit 275a4ef.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

viirya

This looks good.

There is another section:

Supported SQL Types
Currently, all Spark SQL data types are supported by Arrow-based conversion except MapType, ArrayType of TimestampType, and nested StructType. BinaryType is supported only when installed PyArrow is equal to or higher than 0.10.0.

As currently supported version is 0.12.1, the last sentence looks redundant?

HyukjinKwon · 2019-05-01T14:24:35Z

Yea, thanks for checking it.

SparkQA · 2019-05-01T14:40:14Z

Test build #105058 has finished for PR 24504 at commit 3482d31.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-05-01T14:55:18Z

Test build #105061 has finished for PR 24504 at commit ffdb362.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

BryanCutler

LGTM

BryanCutler · 2019-05-01T17:14:15Z

merged to master, thanks @HyukjinKwon !

…ow version in PySpark as well ## What changes were proposed in this pull request? Looks updating documentation from 0.8.0 to 0.12.1 was missed. ## How was this patch tested? N/A Closes apache#24504 from HyukjinKwon/SPARK-27276-followup. Authored-by: HyukjinKwon <[email protected]> Signed-off-by: Bryan Cutler <[email protected]>

* [SPARK-27276][PYTHON][SQL] Increase minimum version of pyarrow to 0.12.1 and remove prior workarounds This increases the minimum support version of pyarrow to 0.12.1 and removes workarounds in pyspark to remain compatible with prior versions. This means that users will need to have at least pyarrow 0.12.1 installed and available in the cluster or an `ImportError` will be raised to indicate an upgrade is needed. Existing tests using: Python 2.7.15, pyarrow 0.12.1, pandas 0.24.2 Python 3.6.7, pyarrow 0.12.1, pandas 0.24.0 Closes apache#24298 from BryanCutler/arrow-bump-min-pyarrow-SPARK-27276. Authored-by: Bryan Cutler <[email protected]> Signed-off-by: HyukjinKwon <[email protected]> * Fix pandas infer_dtype warning * [SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well ## What changes were proposed in this pull request? Looks updating documentation from 0.8.0 to 0.12.1 was missed. ## How was this patch tested? N/A Closes apache#24504 from HyukjinKwon/SPARK-27276-followup. Authored-by: HyukjinKwon <[email protected]> Signed-off-by: Bryan Cutler <[email protected]> Co-authored-by: Bryan Cutler <[email protected]> Co-authored-by: HyukjinKwon <[email protected]>

…ow version in PySpark as well ## What changes were proposed in this pull request? Looks updating documentation from 0.8.0 to 0.12.1 was missed. ## How was this patch tested? N/A Closes apache#24504 from HyukjinKwon/SPARK-27276-followup. Authored-by: HyukjinKwon <[email protected]> Signed-off-by: Bryan Cutler <[email protected]>

HyukjinKwon commented May 1, 2019

View reviewed changes

viirya approved these changes May 1, 2019

View reviewed changes

Update documentation about Arrow version in PySpark as well

ffdb362

HyukjinKwon force-pushed the SPARK-27276-followup branch from 11a50df to ffdb362 Compare May 1, 2019 14:42

HyukjinKwon changed the title ~~[SPARK-27276][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well~~ [SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well May 1, 2019

srowen approved these changes May 1, 2019

View reviewed changes

dongjoon-hyun approved these changes May 1, 2019

View reviewed changes

BryanCutler approved these changes May 1, 2019

View reviewed changes

BryanCutler closed this in 9623420 May 1, 2019

HyukjinKwon deleted the SPARK-27276-followup branch March 3, 2020 01:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well #24504

[SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well #24504

Uh oh!

HyukjinKwon commented May 1, 2019

Uh oh!

HyukjinKwon commented May 1, 2019

Uh oh!

HyukjinKwon May 1, 2019

Uh oh!

viirya May 1, 2019

Uh oh!

SparkQA commented May 1, 2019

Uh oh!

viirya left a comment

Uh oh!

HyukjinKwon commented May 1, 2019

Uh oh!

SparkQA commented May 1, 2019

Uh oh!

SparkQA commented May 1, 2019

Uh oh!

BryanCutler left a comment

Uh oh!

BryanCutler commented May 1, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well #24504

[SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arrow version in PySpark as well #24504

Uh oh!

Conversation

HyukjinKwon commented May 1, 2019

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

HyukjinKwon commented May 1, 2019

Uh oh!

HyukjinKwon May 1, 2019

Choose a reason for hiding this comment

Uh oh!

viirya May 1, 2019

Choose a reason for hiding this comment

Uh oh!

SparkQA commented May 1, 2019

Uh oh!

viirya left a comment

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon commented May 1, 2019

Uh oh!

SparkQA commented May 1, 2019

Uh oh!

SparkQA commented May 1, 2019

Uh oh!

BryanCutler left a comment

Choose a reason for hiding this comment

Uh oh!

BryanCutler commented May 1, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants