Skip to content

Commit 71b5c5b

Browse files
panbingkunYikun
authored andcommitted
[SPARK-41251][PS][INFRA] Upgrade pandas from 1.5.1 to 1.5.2
### What changes were proposed in this pull request? This PR proposes upgrading pandas to 1.5.2, for pandas API on Spark. New version of pandas (1.5.2) was released at Nov 22, 2022, brings some bug fix, the release notes as follows: https://pandas.pydata.org/pandas-docs/dev/whatsnew/v1.5.2.html ### Why are the changes needed? We should follow the behavior of latest pandas, and support it. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Pass GA. Closes #38787 from panbingkun/upgrade_pandas_1.5.2. Authored-by: panbingkun <pbk1982@gmail.com> Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
1 parent 033dbe6 commit 71b5c5b

2 files changed

Lines changed: 3 additions & 3 deletions

File tree

dev/infra/Dockerfile

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -64,8 +64,8 @@ RUN Rscript -e "devtools::install_version('roxygen2', version='7.2.0', repos='ht
6464
# See more in SPARK-39735
6565
ENV R_LIBS_SITE "/usr/local/lib/R/site-library:${R_LIBS_SITE}:/usr/lib/R/library"
6666

67-
RUN pypy3 -m pip install numpy 'pandas<=1.5.1' scipy coverage matplotlib
68-
RUN python3.9 -m pip install numpy pyarrow 'pandas<=1.5.1' scipy unittest-xml-reporting plotly>=4.8 sklearn 'mlflow>=1.0' coverage matplotlib openpyxl 'memory-profiler==0.60.0'
67+
RUN pypy3 -m pip install numpy 'pandas<=1.5.2' scipy coverage matplotlib
68+
RUN python3.9 -m pip install numpy pyarrow 'pandas<=1.5.2' scipy unittest-xml-reporting plotly>=4.8 sklearn 'mlflow>=1.0' coverage matplotlib openpyxl 'memory-profiler==0.60.0'
6969

7070
# Add Python deps for Spark Connect.
7171
RUN python3.9 -m pip install grpcio protobuf

python/pyspark/pandas/supported_api_gen.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -98,7 +98,7 @@ def generate_supported_api(output_rst_file_path: str) -> None:
9898
9999
Write supported APIs documentation.
100100
"""
101-
pandas_latest_version = "1.5.1"
101+
pandas_latest_version = "1.5.2"
102102
if LooseVersion(pd.__version__) != LooseVersion(pandas_latest_version):
103103
msg = (
104104
"Warning: Latest version of pandas (%s) is required to generate the documentation; "

0 commit comments

Comments
 (0)