Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
5835 commits
Select commit Hold shift + click to select a range
ff83dd9
[SPARK-49402][PYTHON][FOLLOW-UP] Fix Binder integration in PySpark do…
HyukjinKwon Aug 27, 2024
315d987
[SPARK-49402][PYTHON][FOLLOW-UP] Fix Binder integration in PySpark do…
HyukjinKwon Aug 27, 2024
39f760c
[SPARK-49402][PYTHON][FOLLOW-UP] Fix Binder integration in PySpark do…
HyukjinKwon Aug 27, 2024
51bfb88
[SPARK-49402][PYTHON][FOLLOW-UP] Fix Binder integration in PySpark do…
HyukjinKwon Aug 27, 2024
e5a5d19
[SPARK-49402][PYTHON][FOLLOW-UP] Fix Binder integration in PySpark do…
HyukjinKwon Aug 27, 2024
93fedc5
[SPARK-49402][PYTHON][FOLLOW-UP] Set upperfound for NumPy
HyukjinKwon Aug 27, 2024
dcfefd0
[SPARK-49402][PYTHON][FOLLOW-UP] Manually load ~/.profile in Spark Co…
HyukjinKwon Aug 28, 2024
2ad11b6
[SPARK-46037][SQL] Correctness fix for Shuffled Hash Join build left …
cloud-fan Aug 29, 2024
dce7c9a
[SPARK-43242][CORE][3.5] Fix throw 'Unexpected type of BlockId' in sh…
CavemanIV Aug 30, 2024
30a75d1
[MINOR] Add `artifacts` to `.gitignore`
zhengruifeng Aug 30, 2024
d5caaaa
[SPARK-49480][CORE] Fix NullPointerException from `SparkThrowableHelp…
jshmchenxi Sep 2, 2024
38ad0e7
[SPARK-49476][SQL] Fix nullability of base64 function
Kimahriman Sep 2, 2024
a1cd99d
Preparing Spark release v3.5.3-rc1
itholic Sep 3, 2024
0616ef2
Preparing development version 3.5.4-SNAPSHOT
itholic Sep 3, 2024
d8adf4b
[SPARK-49476][SQL][3.5][FOLLOWUP] Fix base64 proto test
Kimahriman Sep 3, 2024
7febde1
[SPARK-49275][SQL][3.5] Fix return type nullness of the xpath expression
chenhao-db Sep 4, 2024
7718777
[MINOR][DOCS] Fix site.SPARK_VERSION pattern in RDD Programming Guide
yaooqinn Sep 4, 2024
d83bf8c
[SPARK-49509][CORE] Use `Platform.allocateDirectBuffer` instead of `B…
cxzl25 Sep 4, 2024
e5ec16e
[SPARK-48965][SQL] Use the correct schema in `Dataset#toJSON`
bersprockets Sep 5, 2024
8fbeaf5
[SPARK-49408][SQL] Use IndexedSeq in ProjectingInternalRow
wzx140 Sep 5, 2024
d8f9d8d
[SPARK-49152][SQL][FOLLOWUP] DelegatingCatalogExtension should also u…
cloud-fan Sep 5, 2024
1f8c71f
Preparing Spark release v3.5.3-rc2
itholic Sep 6, 2024
46214da
Preparing development version 3.5.4-SNAPSHOT
itholic Sep 6, 2024
3f22ef1
[SPARK-49246][SQL][FOLLOW-UP] The behavior of SaveAsTable should not …
cloud-fan Sep 9, 2024
6292cfc
Preparing Spark release v3.5.3-rc3
itholic Sep 9, 2024
e923790
Preparing development version 3.5.4-SNAPSHOT
itholic Sep 9, 2024
ba374c6
fix import
cloud-fan Sep 9, 2024
32232e9
Preparing Spark release v3.5.3-rc3
itholic Sep 9, 2024
67421bb
Preparing development version 3.5.4-SNAPSHOT
itholic Sep 9, 2024
1206b52
[SPARK-49501][SQL] Fix double-escaping of table location
cstavr Sep 9, 2024
96eebeb
[SPARK-49595][CONNECT][SQL] Fix `DataFrame.unpivot/melt` in Spark Con…
xinrong-meng Sep 11, 2024
560efed
[SPARK-49261][SQL] Don't replace literals in aggregate expressions wi…
bersprockets Sep 12, 2024
e693e18
[SPARK-49628][SQL] ConstantFolding should copy stateful expression be…
cloud-fan Sep 13, 2024
e7ca790
[SPARK-49699][SS] Disable PruneFilters for streaming workloads
n-young-db Sep 20, 2024
dd76a82
[SPARK-49760][YARN] Correct handling of `SPARK_USER` env variable ove…
cnauroth Sep 24, 2024
b513297
[SPARK-49750][DOC] Mention delegation token support in K8s mode
pan3793 Sep 24, 2024
f1c69a5
[SPARK-49791][SQL] Make DelegatingCatalogExtension more extendable
cloud-fan Sep 26, 2024
b51db8b
[SPARK-49791][SQL][FOLLOWUP][3.5] Fix `import` statement
dongjoon-hyun Sep 26, 2024
1040657
[SPARK-49211][SQL][FOLLOW-UP] Support catalog in QualifiedTableName
amaliujia Sep 27, 2024
50c1783
[SPARK-49803][SQL][TESTS] Increase `spark.test.docker.connectionTimeo…
dongjoon-hyun Sep 27, 2024
be254c5
[SPARK-49804][K8S] Fix to use the exit code of executor container always
Sep 27, 2024
f888d57
[SPARK-46525][SQL][TESTS][3.5] Fix `docker-integration-tests` on Appl…
yaooqinn Sep 27, 2024
a49d6f4
[SPARK-47949][SQL][DOCKER][TESTS] MsSQLServer: Bump up mssql docker i…
yaooqinn Apr 23, 2024
16b8153
[SPARK-49819] Disable CollapseProject for correlated subqueries in pr…
n-young-db Sep 30, 2024
75860a3
[SPARK-49743][SQL] OptimizeCsvJsonExpr should not change schema field…
nikhilsheoran-db Oct 1, 2024
d9467b5
[SPARK-49816][SQL][3.5] Should only update out-going-ref-count for re…
cloud-fan Oct 1, 2024
b8f8456
[SPARK-49841][PYTHON][TESTS][3.5] Skip PySpark `test_cast_to_udt_with…
dongjoon-hyun Oct 1, 2024
ec28154
[SPARK-47702][CORE] Remove Shuffle service endpoint from the location…
Oct 4, 2024
aa6784c
Revert "[SPARK-47702][CORE] Remove Shuffle service endpoint from the …
LuciferYang Oct 4, 2024
d4b34d2
[SPARK-49843][SQL][3.5] Fix change comment on char/varchar columns
stefankandic Oct 4, 2024
9a5ae45
[SPARK-49836][SQL][SS] Fix possibly broken query when window is provi…
HeartSaVioR Oct 4, 2024
ca8407b
[SPARK-47702][CORE][3.5] Remove Shuffle service endpoint from the loc…
Oct 5, 2024
4d93a30
[SPARK-49806][PYTHON][TESTS][FOLLOW-UP] Skip newline difference in Sp…
HyukjinKwon Oct 7, 2024
4472fb2
[SPARK-49909][SQL][3.5] Fix the pretty name of some expressions
panbingkun Oct 9, 2024
5996c48
[SPARK-49782][SQL] ResolveDataFrameDropColumns rule resolves Unresolv…
LantaoJin Oct 10, 2024
36299b3
[SPARK-49959][SQL] Fix ColumnarArray.copy() to read nulls from the co…
chenhao-db Oct 15, 2024
92ec503
[SPARK-49981][CORE][TESTS] Fix `AsyncRDDActionsSuite.FutureAction res…
dongjoon-hyun Oct 16, 2024
d37a8b9
[SPARK-49983][CORE][TESTS] Fix `BarrierTaskContextSuite.successively …
dongjoon-hyun Oct 16, 2024
f5bc928
[SPARK-49979][SQL] Fix AQE hanging issue when collecting twice on a f…
liuzqt Oct 17, 2024
3d2d1c1
[SPARK-48155][SQL][3.5] AQEPropagateEmptyRelation for join should che…
AngersZhuuuu Oct 17, 2024
1fae1d7
[SPARK-49829][SS] Fix the bug on the optimization on adding input to …
HeartSaVioR Oct 18, 2024
6b9b3c0
Revert "[SPARK-49909][SQL][3.5] Fix the pretty name of some expressions"
panbingkun Oct 18, 2024
3a4ebae
[SPARK-50021][CORE][UI][3.5] Fix `ApplicationPage` to hide App UI lin…
dongjoon-hyun Oct 18, 2024
d24393b
[SPARK-50022][CORE][UI][3.5] Fix `MasterPage` to hide App UI links wh…
dongjoon-hyun Oct 18, 2024
e3e79e5
[SPARK-50123][TESTS] Move BitmapExpressionUtilsSuite & ExpressionImpl…
yaooqinn Oct 25, 2024
2f5e0fa
[SPARK-50150][BUILD][3.5] Upgrade Jetty to 9.4.56.v20240826
dongjoon-hyun Oct 29, 2024
175b5e9
[MINOR][BUILD] Skip `deepspeed` in requirements on MacOS
zhengruifeng Aug 10, 2023
4205b79
[SPARK-50155][3.5] Move scala and java files to their default folders
panbingkun Oct 30, 2024
0229c0e
[SPARK-50176][CONNECT][3.5] Disallow reattaching after the session is…
Nov 1, 2024
6df1966
[SPARK-50195][CORE] Fix `StandaloneRestServer` to propagate `spark.ap…
dongjoon-hyun Nov 1, 2024
9d47266
[SPARK-50199][PYTHON][TESTS] Use Spark 3.4.4 instead of 3.0.1 in `tes…
dongjoon-hyun Nov 1, 2024
08023c0
[SPARK-50176][CONNECT][FOLLOWUP][3.5] Fix ReattachableExecuteSuite fa…
Nov 4, 2024
1ddf4a9
[MINOR][DOCS][3.5] Fix specified java versions in `install.rst`
dvorst Nov 4, 2024
9b172de
[SPARK-50210][CORE] Fix `SparkSubmit` to show REST API `kill` respons…
dongjoon-hyun Nov 3, 2024
acccf53
[SPARK-50212][BUILD][3.5] Fix the conditional check for executing the…
LuciferYang Nov 5, 2024
d39f5ab
[SPARK-50235][SQL] Clean up ColumnVector resource after processing al…
viirya Nov 6, 2024
8da6987
[SPARK-50300][BUILD] Use mirror host instead of `archive.apache.org`
dongjoon-hyun Nov 13, 2024
8502a22
[SPARK-50304][INFRA] Remove `(any|empty).proto` from RAT exclusion
dongjoon-hyun Nov 14, 2024
e0bdfee
[SPARK-50316][BUILD][3.5] Upgrade ORC to 1.9.5
dongjoon-hyun Nov 14, 2024
242d333
[SPARK-50312][SQL] SparkThriftServer createServer parameter passing e…
CuiYanxiang Nov 15, 2024
08b195c
[MINOR][DOCS] Fix a HTML/Markdown syntax error in sql-migration-guide.md
yaooqinn Nov 20, 2024
df9b9de
[SPARK-50258][SQL] Fix output column order changed issue after AQE op…
wangyum Nov 20, 2024
5ff129a
[SPARK-50430][CORE] Use the standard Properties.clone instead of manu…
HyukjinKwon Nov 27, 2024
5e51e2c
[SPARK-49294][UI] Add width attribute for shuffle-write-time checkbox
xunxunmimi5577 Dec 2, 2024
1d6f7ad
[SPARK-50433][DOCS][TESTS][3.5] Fix configuring log4j2 guide docs for…
pan3793 Dec 3, 2024
5dc927b
[SPARK-50487][DOCS] Update broken jira link
huangxiaopingRD Dec 4, 2024
acedb15
[SPARK-50498][PYTHON] Avoid unnecessary py4j call in `listFunctions`
zhengruifeng Dec 5, 2024
86e29e9
[SPARK-50505][DOCS] Fix `spark.storage.replication.proactive` default…
dongjoon-hyun Dec 5, 2024
d01f34f
[SPARK-50492][SS] Fix java.util.NoSuchElementException when event tim…
liviazhu Dec 6, 2024
153cd9e
[SPARK-50492][SS][FOLLOWUP][3.5] Change `def references` to `lazy val…
LuciferYang Dec 6, 2024
bf29ab9
[SPARK-50421][CORE][3.5] Fix executor related memory config incorrect…
zjuwangg Dec 6, 2024
d8f3afa
[SPARK-50514][DOCS] Add `IDENTIFIER clause` page to `menu-sql.yaml`
dongjoon-hyun Dec 7, 2024
305d2a0
[SPARK-49695][SQL][3.5] Postgres fix xor push-down
andrej-db Dec 7, 2024
a57f3c2
[SPARK-50483][CORE][SQL][3.5] BlockMissingException should be thrown …
wangyum Dec 8, 2024
929a19f
Preparing Spark release v3.5.4-rc1
LuciferYang Dec 9, 2024
8e6507a
Preparing development version 3.5.5-SNAPSHOT
LuciferYang Dec 9, 2024
deabe49
[SPARK-49134][INFRA][3.5] Support retry for deploying artifacts to Ne…
yaooqinn Dec 9, 2024
a3cf28e
[SPARK-50463][SQL][3.5] Fix `ConstantColumnVector` with Columnar to R…
richardc-db Dec 10, 2024
bb953f9
[SPARK-50510][CONNECT][3.5] Fix sporadic ReattachableExecuteSuite fai…
Dec 11, 2024
e97580a
[SPARK-50087][SQL][3.5] Robust handling of boolean expressions in CAS…
andrej-db Dec 12, 2024
92e650c
[SPARK-50545][CORE][SQL][3.5] `AccessControlException` should be thro…
pan3793 Dec 12, 2024
91af6f9
Preparing Spark release v3.5.4-rc2
LuciferYang Dec 16, 2024
a764524
Preparing development version 3.5.5-SNAPSHOT
LuciferYang Dec 16, 2024
8168ea8
[SPARK-50430][CORE][FOLLOW-UP] Keep the logic of manual putting key a…
HyukjinKwon Dec 3, 2024
0fbe292
Revert "[SPARK-50430][CORE][FOLLOW-UP] Keep the logic of manual putti…
HyukjinKwon Dec 16, 2024
f7c48fe
Revert "[SPARK-50430][CORE] Use the standard Properties.clone instead…
HyukjinKwon Dec 16, 2024
b0a7d4d
[SPARK-50587][INFRA][3.5] Remove unsupported `curl` option `--retry-a…
LuciferYang Dec 16, 2024
a6f220d
Preparing Spark release v3.5.4-rc3
LuciferYang Dec 17, 2024
bcaa5a9
Preparing development version 3.5.5-SNAPSHOT
LuciferYang Dec 17, 2024
45349b6
[SPARK-50510][CONNECT][TEST][3.5] Fix flaky ReattachableExecuteSuite
Dec 17, 2024
a3d23fd
[MINOR][SS] Minor update to watermark propagation comments
neilramaswamy Dec 18, 2024
5a91172
[SPARK-50483][SPARK-50545][DOC][FOLLOWUP][3.5] Mention behavior chang…
pan3793 Dec 23, 2024
19ad7f7
[SPARK-50769][SQL] Fix ClassCastException in HistogramNumeric
linhongliu-db Jan 9, 2025
5a8483b
[SPARK-50784][TESTS] Fix `lint-scala` not to ignore `scalastyle` errors
dongjoon-hyun Jan 10, 2025
00b3833
[SPARK-50797][SQL][TESTS][3.5] Move `HiveCharVarcharTestSuite` from `…
dongjoon-hyun Jan 13, 2025
51fb84a
[SPARK-50624][SQL] Add TimestampNTZType to ColumnarRow/MutableColumna…
nastra Jan 13, 2025
5de80b7
[SPARK-50886][BUILD][3.5] Upgrade Avro to 1.11.4
dongjoon-hyun Jan 19, 2025
bdfa6ef
[SPARK-50870][SQL] Add the timezone when casting to timestamp in V2Sc…
Jan 21, 2025
e4e41b2
[SPARK-45013][CORE][TEST][3.5] Flaky Test with NPE: track allocated r…
yaooqinn Jan 21, 2025
7177891
[SPARK-50890][PYTHON][TESTS][CONNECT] Skip test_take in Spark Connect…
HyukjinKwon Jan 20, 2025
bd2b8e5
[SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make…
LuciferYang Jan 22, 2025
5abaeac
Revert "[SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14…
dongjoon-hyun Jan 22, 2025
e608297
[SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to…
dongjoon-hyun Jan 22, 2025
3912368
Revert "[SPARK-50890][PYTHON][TESTS][CONNECT] Skip test_take in Spark…
HyukjinKwon Jan 23, 2025
d061aad
Revert "Revert "[SPARK-50890][PYTHON][TESTS][CONNECT] Skip test_take …
HyukjinKwon Jan 23, 2025
d2f9b4c
[SPARK-50984][SQL][TESTS] Make `ExpressionImplUtilsSuite` robust by m…
dongjoon-hyun Jan 25, 2025
7fd9ced
[SPARK-50853][CORE][3.5] Close temp shuffle file writable channel
Jan 25, 2025
3e01d41
[SPARK-50091][SQL][3.5] Handle case of aggregates in left-hand operan…
bersprockets Jan 25, 2025
22118f9
[SPARK-50987][DOCS] Make `spark-connect-overview.md`s version strings…
dongjoon-hyun Jan 25, 2025
09a255a
[SPARK-50998][K8S][DOCS] Fix `spark.kubernetes.configMap.maxSize` def…
dongjoon-hyun Jan 27, 2025
1f81475
[SPARK-51033][CORE][TESTS] Fix `CryptoStreamUtilsSuite` to use `sameE…
dongjoon-hyun Jan 29, 2025
c339e50
[SPARK-51027][SQL] Prevent `HiveClient.runSqlHive` invocation in non-…
dongjoon-hyun Jan 29, 2025
1966d57
[SPARK-51039][BUILD] Fix `hive-llap-common` dependency to use `hive.l…
dongjoon-hyun Jan 30, 2025
fd86566
[SPARK-51061][CORE] Hide `Jetty` info in REST Submission API
dongjoon-hyun Feb 3, 2025
105bed7
[SPARK-51071][INFRA][3.5] Upgrade `upload-artifact` to v4 to recover …
dongjoon-hyun Feb 3, 2025
31fbf01
[SPARK-51104][DOC] Self-host JavaScript and CSS in Spark website
gengliangwang Feb 6, 2025
3b892bd
[SPARK-51104][DOC][FOLLOWUP] Self-host docsearch.min.css in Spark web…
gengliangwang Feb 6, 2025
e395f33
[SPARK-51128][DOC] Self host docsearch.min.css.map in Spark website
gengliangwang Feb 7, 2025
03f7200
[SPARK-51129][DOC] Fix code tab switching in Spark Website
gengliangwang Feb 7, 2025
4e024f4
[SPARK-51093][SQL][TESTS][3.5] Fix minor endianness issues in tests
jonathan-albrecht-ibm Feb 10, 2025
012df44
[MINOR][DOCS] Fix incorrect description of constraint on spark.sql.ad…
JoshRosen Feb 11, 2025
af45325
[SPARK-51164][CORE][TESTS] Fix `CallerContext` test by enabling `hado…
dongjoon-hyun Feb 11, 2025
2ea2c0b
[SPARK-51164][CORE][TESTS][FOLLOWUP] Add hadoop.caller.context.enable…
cnauroth Feb 12, 2025
6084757
[SPARK-51157][SQL][3.5] Add missing @VarArgs Scala annotation for Sca…
yaooqinn Feb 12, 2025
4962453
[SPARK-51042][SQL][3.5] Read and write the month and days fields of i…
jonathan-albrecht-ibm Feb 12, 2025
030e691
[SPARK-51219][SQL] Fix `ShowTablesExec.isTempView` to work with non-`…
ostronaut Feb 18, 2025
4d15f64
[SPARK-51219][SQL][TESTS][FOLLOWUP] ShowTablesExec` remove `ArrayImpl…
ostronaut Feb 20, 2025
23637fe
[SPARK-51185][CORE][3.5] Revert simplifications to PartitionedFileUti…
LukasRupprecht Feb 21, 2025
c0bfae6
[SPARK-51244][INFRA][3.5] Upgrade left Github Action image from `ubun…
wayneguow Feb 21, 2025
3ba36ec
[SPARK-51187][SQL][SS][3.5] Implement the graceful deprecation of inc…
HeartSaVioR Feb 23, 2025
eedb1aa
[SPARK-48899][K8S] Fix `ENV` key value format in K8s Dockerfiles
dongjoon-hyun Jul 15, 2024
d65ecc0
[SPARK-51173][TESTS] Add `configName` Scalastyle rule
dongjoon-hyun Feb 12, 2025
7c29c66
Preparing Spark release v3.5.5-rc1
dongjoon-hyun Feb 23, 2025
195ce4f
Update version to ODP-3.2.3.4-2
shubhluck Sep 2, 2025
468e160
ODP-4994 Aligning projects to internal ODP versions
shubhluck Sep 2, 2025
b73f506
[SPARK-47738][BUILD] Upgrade Kafka to 3.7.0
panbingkun Apr 6, 2024
ee85f4a
ODP-4386 - [SPARK-45502][BUILD] Upgrade Kafka to 3.6.1
dengziming Dec 16, 2023
5b9399d
[SPARK-47018][BUILD][SQL] Bump built-in Hive to 2.3.10
pan3793 May 10, 2024
64fe54c
[SPARK-47932][SQL][TESTS] Avoid using legacy commons-lang
pan3793 Apr 22, 2024
d5c1984
ODP-1916 Partial revert SPARK-39653 to add populate fn for handling W…
shubhluck Aug 16, 2024
c84d01f
ODP-4083: Replace package with install argument on Spark project
prabhjyotsingh Apr 23, 2025
cfe55e4
ODP-3237: [SPARK-44537][BUILD] Upgrade kubernetes-client to 6.8.0
panbingkun Jul 26, 2023
3eb2d17
ODP-3237: [SPARK-44821][BUILD][K8S] Upgrade `kubernetes-client` to 6.8.1
dongjoon-hyun Aug 16, 2023
910dd2e
ODP-3237: [SPARK-45728][BUILD][K8S] Upgrade `kubernetes-client` to 6.9.1
dongjoon-hyun Oct 31, 2023
2eb747d
ODP-3237: [SPARK-46662][K8S][BUILD] Upgrade `kubernetes-client` to 6.…
bjornjorgensen Jan 11, 2024
b64552d
ODP-3237: [SPARK-47550][K8S][BUILD] Update `kubernetes-client` to 6.11.0
bjornjorgensen Mar 25, 2024
0b01f5f
ODP-3237: [SPARK-47860][BUILD][K8S] Upgrade `kubernetes-client` to 6.…
dongjoon-hyun Apr 15, 2024
92ac136
ODP-3237: [SPARK-47915][BUILD][K8S] Upgrade `kubernetes-client` to 6.…
bjornjorgensen Apr 19, 2024
fde81f2
ODP-3237: [SPARK-48514][BUILD][K8S] Upgrade `kubernetes-client` to 6.…
bjornjorgensen Jun 4, 2024
e959e18
ODP-3237: [SPARK-48801][BUILD][K8S] Upgrade `kubernetes-client` to 6.…
panbingkun Jul 9, 2024
3209f18
ODP-3237: [SPARK-47550][K8S][BUILD] Update `kubernetes-client` to 6.13.1
senthh May 26, 2025
acc681d
ODP-3237: [SPARK-49196][BUILD] Upgrade `kubernetes-client` to 6.13.2
panbingkun Aug 12, 2024
3b2dfba
ODP-3237: [SPARK-49255][BUILD][K8S] Upgrade `kubernetes-client` to 6.…
dongjoon-hyun Aug 16, 2024
f5e6d42
ODP-3237: [SPARK-49800][BUILD][K8S] Upgrade `kubernetes-client` to 6.…
bjornjorgensen Sep 26, 2024
9b17fca
ODP-3237: [SPARK-50493][SPARK-37687][BUILD] Migrate kubernetes-client…
panbingkun Jan 3, 2025
ede3e69
ODP-3251: [SPARK-49624][BUILD] Upgrade `aircompressor` to 2.0.2 (#68)
prabhjyotsingh Feb 7, 2025
f15b2cf
ODP-3256: [SPARK-44723][BUILD] Upgrade `gcs-connector` to 2.2.16
dongjoon-hyun Aug 9, 2023
6240a7a
ODP-3256: [SPARK-44898][BUILD] Upgrade `gcs-connector` to 2.2.17
dongjoon-hyun Aug 21, 2023
5be9b01
ODP-3256: [SPARK-51006][BUILD] Upgrade `gcs-connector` to `2.2.26`
dongjoon-hyun Jan 28, 2025
960b9e4
ODP-3256: [SPARK-45991][BUILD] Upgrade `protobuf-java` to 3.25.1 to m…
dongjoon-hyun Nov 19, 2023
f55440a
ODP-3256: [SPARK-48933][BUILD] Upgrade `protobuf-java` to `3.25.3`
panbingkun Jul 19, 2024
a1dbbf5
ODP-3256: [SPARK-49497][BUILD] Upgrade protobuf-java to 3.25.4
dev-lpq Sep 3, 2024
77a196b
ODP-3256: [SPARK-49721][BUILD] Upgrade `protobuf-java` to 3.25.5
dongjoon-hyun Sep 20, 2024
9cae2fe
ODP-3257: [SPARK-44811][BUILD] Upgrade Guava to 33.2.1-jre
pan3793 Sep 12, 2024
be445d8
ODP-3257: [SPARK-50468][BUILD] Upgrade Guava to 33.3.1-jre
LuciferYang Dec 2, 2024
7217188
ODP-3257: [SPARK-50972][BUILD] Upgrade Guava to 33.4.0
LuciferYang Jan 24, 2025
59523f8
ODP-3257: Fix CVE-2023-2976 com.google.guava:guava in spark (docker-i…
prabhjyotsingh Feb 11, 2025
4650536
ODP-3300: exclude logback jars (from zookeeper) from spark builds (#73)
prabhjyotsingh Feb 11, 2025
3d7ec61
ODP-2842 - Upgrading commons-compress version to 1.26 for fixing CVE-…
senthh Dec 18, 2024
ad83387
ODP-2842 - Upgrading commons-compress version to 1.26 for fixing CVE-…
senthh Dec 18, 2024
8bff172
ODP-2842 - Upgrading commons-compress version to 1.26 for fixing CVE-…
senthh Dec 18, 2024
24de608
ODP-2841 - Upgrading netty* version to 4.1.108.Final for fixing CVE-2…
senthh Dec 18, 2024
01e4cfb
ODP-2849 - Upgrading dnsjava version to 3.6.0 for fixing CVE-2024-25638
senthh Dec 19, 2024
0d83953
ODP-2851 Upgrading commons-io version to 2.18.0 for fixing CVE-2024-4…
senthh Dec 19, 2024
023f1a9
ODP-2837 - Upgrading avro version to 1.12.0 for fixing CVE-2023-39410…
senthh Dec 18, 2024
3eff16b
ODP-2866 Upgrading velocity version to 2.3
senthh Dec 24, 2024
a576588
ODP-2866 Upgrading gson version to 2.8.9
senthh Dec 24, 2024
4ebb54e
ODP-3239: Fix CVE-2024-4755 commons-io:commons-io in spark (#65)
prabhjyotsingh Feb 7, 2025
a7393a3
ODP-756 Included Open Table formats Hudi,Delta & Iceberg to spark jar…
shubhluck Apr 18, 2024
e94c8f8
ODP-2118: Hudi, DeltaLake, Iceberg version upgrade for open table cli…
manishsinghmowall Sep 18, 2024
a9e2e33
ODP-3668 Improvement - Include Hudi and iceberg in profiles
senthh Mar 13, 2025
755c4cd
ODP-3361: bump up netty version to 4.1.118.Final to fix CVEs CVE-2024…
senthh Feb 20, 2025
595f819
ODP-2583: bump up Curator to 5.2.0
basapuram-kumar Nov 13, 2024
9c4a140
ODP-3361: Upgrade dbcp version to 2.9.0 to fix CVE-2020-36186
senthh Sep 16, 2024
71b6cf1
ODP-2866 Upgrading gson version to 2.8.9
senthh Dec 24, 2024
6810fe0
ODP-3361: Upgrade iceberg version to 1.7.0 to fix CVE-2024-47561
senthh Feb 25, 2025
4689096
ODP-2269 - Fixing 'NoClassDefFoundError for InternalFutureFailureAcce…
senthh Sep 20, 2024
4e096b8
ODP-3411|ODP-3361: bump up hudi version to internal 0.15.2
prabhjyotsingh Mar 1, 2025
3bf544d
ODP-3411 - CVE: Upgrade commons-io versions to 2.18 in deps to fix CV…
senthh Sep 16, 2024
cebfdb5
ODP-3411 - CVE: Upgrade gcs-aws-java-sdk-bundle versions to 1.12.782 …
senthh Mar 5, 2025
c6de2d1
ODP-3772 [Spark3][CVE] - Increase datanucleus-rdbms and datanucleus-c…
senthh Mar 20, 2025
b102ddf
ODP-3791 [Spark3][CVE] - Increase okhttp version to 4.12.0 to fix CVE…
senthh Mar 21, 2025
a1190ad
ODP-3792 [Spark3][CVE] - Increase jackson-databind version to 2.17.2…
senthh Mar 21, 2025
02f98ff
ODP-1486|ODP-1303 [SPARK-45732][BUILD] Upgrade commons-text to 1.11.0
senthh Jul 1, 2024
40f73cc
Revert "ODP-1486|ODP-1303 [SPARK-45732][BUILD] Upgrade commons-text t…
senthh May 26, 2025
60625bd
Revert "ODP-3792 [Spark3][CVE] - Increase jackson-databind version t…
senthh May 27, 2025
5807411
Revert "ODP-3791 [Spark3][CVE] - Increase okhttp version to 4.12.0 to…
senthh May 27, 2025
05f36e8
Revert "ODP-3411|ODP-3361: bump up hudi version to internal 0.15.2"
senthh May 27, 2025
e5cb4c8
ODP-3411|ODP-3361: bump up hudi version to internal 0.15.2
prabhjyotsingh Mar 1, 2025
ede5a86
ODP-3411|ODP-3361: bump up hudi version to internal 0.15.2
senthh May 27, 2025
44ace27
ODP-1486|ODP-1303 [SPARK-45732][BUILD] Upgrade commons-text to 1.11.0
senthh Jul 1, 2024
30e3255
ODP-3792 [Spark3][CVE] - Increase jackson-databind version to 2.17.2…
senthh Mar 21, 2025
086a934
ODP-3791 [Spark3][CVE] - Increase okhttp version to 4.12.0 to fix CVE…
senthh Mar 21, 2025
a361ff3
ODP-2118: Hudi, DeltaLake, Iceberg version upgrade for open table cli…
manishsinghmowall Sep 18, 2024
cb0ad94
ODP-3913: Change distribution management to point to staging repo
deepakdamri Apr 2, 2025
a8af8d7
ODP-2118: Hudi, DeltaLake, Iceberg version upgrade for open table cli…
senthh May 27, 2025
a964bdf
ODP-4422 Add deltalake profile and update Open Table Format versions
shubhluck May 28, 2025
04b52a6
ODP-4444: Upgrade jetty-server to 9.4.57.v20241219 to fix CVE-2024-13009
senthh Jun 2, 2025
7509730
ODP-4446 - CVE - Upgrade derby to 10.14.3 to fix CVE-2022-46337
senthh Jun 2, 2025
5df6ec5
[SPARK-51950][BUILD] Upgrade `Parquet` to 1.15.2
dongjoon-hyun May 3, 2025
06edbc0
ODP-1603:Could not initialize class org.apache.commons.text.StringEsc…
kravii Jun 7, 2024
b4b7185
ODP-780 Update libthrift from 0.12.0 to 0.14.1 for Hive 3.1.4 connection
shubhluck Apr 2, 2024
d368c8b
ODP-4858 Fix Hudi/delta sql queries issue cuased by antlr version (#85)
senthh Aug 4, 2025
1f774c1
Revert "ODP-2837 - Upgrading avro version to 1.12.0 for fixing CVE-20…
shubhluck Sep 2, 2025
d348ff5
Revert "ODP-3791 [Spark3][CVE] - Increase okhttp version to 4.12.0 to…
shubhluck Sep 2, 2025
4e907f4
Revert "ODP-3237: [SPARK-50493][SPARK-37687][BUILD] Migrate kubernete…
shubhluck Sep 2, 2025
51a8f25
Revert "[SPARK-43880][BUILD] Organize `hadoop-cloud` in standard mave…
shubhluck Sep 3, 2025
b727972
Revert "[SPARK-40039][SS] Introducing a streaming checkpoint file man…
shubhluck Sep 3, 2025
16c6436
ODP-5029 Handle Iceberg version matching JDK8 and enforceByteCode error
shubhluck Sep 3, 2025
47a57f0
ODP-5105|ODP-5004 Revert "[SPARK-51185][CORE][3.5] Revert simplificat…
senthh Aug 27, 2025
542716c
ODP-5106 Include Gluten as profile in Spark 3.5.5.3.2.3.4*
senthh Sep 12, 2025
c0f072c
OSV-6034 - CVE - Increase netty version to 4.1.127.Final in fix CVE-…
senthh Sep 26, 2025
8f78988
OSV-6096 - CVE - Increase jdom2 version to 2.0.6.1 in fix CVE-2021-3…
senthh Sep 26, 2025
f1f09b9
Update build-version to ODP-3.2.3.4-2001
Sourabh-Dilraj Sep 30, 2025
03bbd36
Update build-version to ODP-3.2.3.4-2002
Sourabh-Dilraj Oct 13, 2025
2ce0536
OSV-7929 - Mention jettison version explicitly as 1.5.4 to fix CVE-20…
senthh Oct 17, 2025
94baecd
Update build-version to ODP-3.2.3.4-2003
Oct 27, 2025
50e06e2
Update build-version to ODP-3.2.3.4-2
Nov 1, 2025
a1c6ae1
Point repositories and distributionManagement to odp-central/release
Nov 5, 2025
39c4231
[Automated] Changed version from '3.2.3.4-2' to '3.2.3.4-SNAPSHOT'
JeffreySmith Nov 21, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
5 changes: 5 additions & 0 deletions .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -31,3 +31,8 @@ github:
merge: false
squash: true
rebase: true

notifications:
pullrequests: [email protected]
issues: [email protected]
commits: [email protected]
28 changes: 18 additions & 10 deletions .github/labeler.yml
Original file line number Diff line number Diff line change
Expand Up @@ -84,12 +84,12 @@ SPARK SHELL:
- "repl/**/*"
- "bin/spark-shell*"
SQL:
#- any: ["**/sql/**/*", "!python/pyspark/sql/avro/**/*", "!python/pyspark/sql/streaming.py", "!python/pyspark/sql/tests/test_streaming.py"]
#- any: ["**/sql/**/*", "!python/pyspark/sql/avro/**/*", "!python/pyspark/sql/streaming/**/*", "!python/pyspark/sql/tests/streaming/test_streaming.py"]
- "**/sql/**/*"
- "common/unsafe/**/*"
#- "!python/pyspark/sql/avro/**/*"
#- "!python/pyspark/sql/streaming.py"
#- "!python/pyspark/sql/tests/test_streaming.py"
#- "!python/pyspark/sql/streaming/**/*"
#- "!python/pyspark/sql/tests/streaming/test_streaming.py"
- "bin/spark-sql*"
- "bin/beeline*"
- "sbin/*thriftserver*.sh"
Expand All @@ -103,13 +103,13 @@ SQL:
- "**/*schema.R"
- "**/*types.R"
AVRO:
- "external/avro/**/*"
- "connector/avro/**/*"
- "python/pyspark/sql/avro/**/*"
DSTREAM:
- "streaming/**/*"
- "data/streaming/**/*"
- "external/kinesis*"
- "external/kafka*"
- "connector/kinesis*"
- "connector/kafka*"
- "python/pyspark/streaming/**/*"
GRAPHX:
- "graphx/**/*"
Expand All @@ -123,13 +123,15 @@ MLLIB:
- "python/pyspark/mllib/**/*"
STRUCTURED STREAMING:
- "**/sql/**/streaming/**/*"
- "external/kafka-0-10-sql/**/*"
- "python/pyspark/sql/streaming.py"
- "python/pyspark/sql/tests/test_streaming.py"
- "connector/kafka-0-10-sql/**/*"
- "python/pyspark/sql/streaming/**/*"
- "python/pyspark/sql/tests/streaming/test_streaming.py"
- "**/*streaming.R"
PYTHON:
- "bin/pyspark*"
- "**/python/**/*"
PANDAS API ON SPARK:
- "python/pyspark/pandas/**/*"
R:
- "**/r/**/*"
- "**/R/**/*"
Expand All @@ -149,4 +151,10 @@ WEB UI:
- "**/*UI.scala"
DEPLOY:
- "sbin/**/*"

CONNECT:
- "connector/connect/**/*"
- "**/sql/sparkconnect/**/*"
- "python/pyspark/sql/**/connect/**/*"
PROTOBUF:
- "connector/protobuf/**/*"
- "python/pyspark/sql/protobuf/**/*"
34 changes: 0 additions & 34 deletions .github/workflows/ansi_sql_mode_test.yml

This file was deleted.

105 changes: 89 additions & 16 deletions .github/workflows/benchmark.yml
Original file line number Diff line number Diff line change
Expand Up @@ -27,9 +27,13 @@ on:
required: true
default: '*'
jdk:
description: 'JDK version: 8, 11 or 17'
description: 'JDK version: 8, 11, 17 or 21-ea'
required: true
default: '8'
scala:
description: 'Scala version: 2.12 or 2.13'
required: true
default: '2.12'
failfast:
description: 'Failfast: true or false'
required: true
Expand All @@ -42,21 +46,78 @@ on:
jobs:
matrix-gen:
name: Generate matrix for job splits
runs-on: ubuntu-20.04
runs-on: ubuntu-22.04
outputs:
matrix: ${{ steps.set-matrix.outputs.matrix }}
env:
SPARK_BENCHMARK_NUM_SPLITS: ${{ github.event.inputs.num-splits }}
steps:
- name: Generate matrix
id: set-matrix
run: echo "::set-output name=matrix::["`seq -s, 1 $SPARK_BENCHMARK_NUM_SPLITS`"]"
run: echo "matrix=["`seq -s, 1 $SPARK_BENCHMARK_NUM_SPLITS`"]" >> $GITHUB_OUTPUT

# Any TPC-DS related updates on this job need to be applied to tpcds-1g job of build_and_test.yml as well
tpcds-1g-gen:
name: "Generate an input dataset for TPCDSQueryBenchmark with SF=1"
if: contains(github.event.inputs.class, 'TPCDSQueryBenchmark') || contains(github.event.inputs.class, '*')
runs-on: ubuntu-22.04
env:
SPARK_LOCAL_IP: localhost
steps:
- name: Checkout Spark repository
uses: actions/checkout@v3
# In order to get diff files
with:
fetch-depth: 0
- name: Cache Scala, SBT and Maven
uses: actions/cache@v3
with:
path: |
build/apache-maven-*
build/scala-*
build/*.jar
~/.sbt
key: build-${{ hashFiles('**/pom.xml', 'project/build.properties', 'build/mvn', 'build/sbt', 'build/sbt-launch-lib.bash', 'build/spark-build-info') }}
restore-keys: |
build-
- name: Cache Coursier local repository
uses: actions/cache@v3
with:
path: ~/.cache/coursier
key: benchmark-coursier-${{ github.event.inputs.jdk }}-${{ hashFiles('**/pom.xml', '**/plugins.sbt') }}
restore-keys: |
benchmark-coursier-${{ github.event.inputs.jdk }}
- name: Cache TPC-DS generated data
id: cache-tpcds-sf-1
uses: actions/cache@v3
with:
path: ./tpcds-sf-1
key: tpcds-${{ hashFiles('.github/workflows/benchmark.yml', 'sql/core/src/test/scala/org/apache/spark/sql/TPCDSSchema.scala') }}
- name: Checkout tpcds-kit repository
if: steps.cache-tpcds-sf-1.outputs.cache-hit != 'true'
uses: actions/checkout@v3
with:
repository: databricks/tpcds-kit
ref: 1b7fb7529edae091684201fab142d956d6afd881
path: ./tpcds-kit
- name: Build tpcds-kit
if: steps.cache-tpcds-sf-1.outputs.cache-hit != 'true'
run: cd tpcds-kit/tools && make OS=LINUX
- name: Install Java ${{ github.event.inputs.jdk }}
if: steps.cache-tpcds-sf-1.outputs.cache-hit != 'true'
uses: actions/setup-java@v3
with:
distribution: temurin
java-version: ${{ github.event.inputs.jdk }}
- name: Generate TPC-DS (SF=1) table data
if: steps.cache-tpcds-sf-1.outputs.cache-hit != 'true'
run: build/sbt "sql/Test/runMain org.apache.spark.sql.GenTPCDSData --dsdgenDir `pwd`/tpcds-kit/tools --location `pwd`/tpcds-sf-1 --scaleFactor 1 --numPartitions 1 --overwrite"

benchmark:
name: "Run benchmarks: ${{ github.event.inputs.class }} (JDK ${{ github.event.inputs.jdk }}, ${{ matrix.split }} out of ${{ github.event.inputs.num-splits }} splits)"
needs: matrix-gen
# Ubuntu 20.04 is the latest LTS. The next LTS is 22.04.
runs-on: ubuntu-20.04
name: "Run benchmarks: ${{ github.event.inputs.class }} (JDK ${{ github.event.inputs.jdk }}, Scala ${{ github.event.inputs.scala }}, ${{ matrix.split }} out of ${{ github.event.inputs.num-splits }} splits)"
if: always()
needs: [matrix-gen, tpcds-1g-gen]
runs-on: ubuntu-22.04
strategy:
fail-fast: false
matrix:
Expand All @@ -69,14 +130,15 @@ jobs:
SPARK_LOCAL_IP: localhost
# To prevent spark.test.home not being set. See more detail in SPARK-36007.
SPARK_HOME: ${{ github.workspace }}
SPARK_TPCDS_DATA: ${{ github.workspace }}/tpcds-sf-1
steps:
- name: Checkout Spark repository
uses: actions/checkout@v2
uses: actions/checkout@v3
# In order to get diff files
with:
fetch-depth: 0
- name: Cache Scala, SBT and Maven
uses: actions/cache@v2
uses: actions/cache@v3
with:
path: |
build/apache-maven-*
Expand All @@ -87,19 +149,28 @@ jobs:
restore-keys: |
build-
- name: Cache Coursier local repository
uses: actions/cache@v2
uses: actions/cache@v3
with:
path: ~/.cache/coursier
key: benchmark-coursier-${{ github.event.inputs.jdk }}-${{ hashFiles('**/pom.xml', '**/plugins.sbt') }}
restore-keys: |
benchmark-coursier-${{ github.event.inputs.jdk }}
- name: Install Java ${{ github.event.inputs.jdk }}
uses: actions/setup-java@v1
uses: actions/setup-java@v3
with:
distribution: temurin
java-version: ${{ github.event.inputs.jdk }}
- name: Cache TPC-DS generated data
if: contains(github.event.inputs.class, 'TPCDSQueryBenchmark') || contains(github.event.inputs.class, '*')
id: cache-tpcds-sf-1
uses: actions/cache@v3
with:
path: ./tpcds-sf-1
key: tpcds-${{ hashFiles('.github/workflows/benchmark.yml', 'sql/core/src/test/scala/org/apache/spark/sql/TPCDSSchema.scala') }}
- name: Run benchmarks
run: |
./build/sbt -Pyarn -Pmesos -Pkubernetes -Phive -Phive-thriftserver -Phadoop-cloud -Pkinesis-asl -Pspark-ganglia-lgpl test:package
dev/change-scala-version.sh ${{ github.event.inputs.scala }}
./build/sbt -Pscala-${{ github.event.inputs.scala }} -Pyarn -Pmesos -Pkubernetes -Phive -Phive-thriftserver -Phadoop-cloud -Pkinesis-asl -Pspark-ganglia-lgpl Test/package
# Make less noisy
cp conf/log4j2.properties.template conf/log4j2.properties
sed -i 's/rootLogger.level = info/rootLogger.level = warn/g' conf/log4j2.properties
Expand All @@ -109,13 +180,15 @@ jobs:
--jars "`find . -name '*-SNAPSHOT-tests.jar' -o -name '*avro*-SNAPSHOT.jar' | paste -sd ',' -`" \
"`find . -name 'spark-core*-SNAPSHOT-tests.jar'`" \
"${{ github.event.inputs.class }}"
# Revert to default Scala version to clean up unnecessary git diff
dev/change-scala-version.sh 2.12
# To keep the directory structure and file permissions, tar them
# See also https://github.com/actions/upload-artifact#maintaining-file-permissions-and-case-sensitive-files
echo "Preparing the benchmark results:"
tar -cvf benchmark-results-${{ github.event.inputs.jdk }}.tar `git diff --name-only` `git ls-files --others --exclude-standard`
tar -cvf benchmark-results-${{ github.event.inputs.jdk }}-${{ github.event.inputs.scala }}.tar `git diff --name-only` `git ls-files --others --exclude=tpcds-sf-1 --exclude-standard`
- name: Upload benchmark results
uses: actions/upload-artifact@v2
uses: actions/upload-artifact@v4
with:
name: benchmark-results-${{ github.event.inputs.jdk }}-${{ matrix.split }}
path: benchmark-results-${{ github.event.inputs.jdk }}.tar
name: benchmark-results-${{ github.event.inputs.jdk }}-${{ github.event.inputs.scala }}-${{ matrix.split }}
path: benchmark-results-${{ github.event.inputs.jdk }}-${{ github.event.inputs.scala }}.tar

Loading