Skip to content

Commit 8d59997

Browse files
committed
[SPARK-45257][CORE] Enable spark.eventLog.compress by default
### What changes were proposed in this pull request? This PR aims to enable `spark.eventLog.compress` by default for Apache Spark 4.0.0. ### Why are the changes needed? - To save the event log storage cost by compressing the logs with ZStandard codec by default ### Does this PR introduce _any_ user-facing change? Although we added a migration guide, the old Spark history servers are able to read the compressed logs. ### How was this patch tested? Pass the CIs. ### Was this patch authored or co-authored using generative AI tooling? No. Closes #43036 from dongjoon-hyun/SPARK-45257. Lead-authored-by: Dongjoon Hyun <dhyun@apple.com> Co-authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>
1 parent 7c3c7c5 commit 8d59997

3 files changed

Lines changed: 6 additions & 2 deletions

File tree

core/src/main/scala/org/apache/spark/internal/config/package.scala

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -165,7 +165,7 @@ package object config {
165165
ConfigBuilder("spark.eventLog.compress")
166166
.version("1.0.0")
167167
.booleanConf
168-
.createWithDefault(false)
168+
.createWithDefault(true)
169169

170170
private[spark] val EVENT_LOG_BLOCK_UPDATES =
171171
ConfigBuilder("spark.eventLog.logBlockUpdates.enabled")

docs/configuration.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1311,7 +1311,7 @@ Apart from these, the following properties are also available, and may be useful
13111311
</tr>
13121312
<tr>
13131313
<td><code>spark.eventLog.compress</code></td>
1314-
<td>false</td>
1314+
<td>true</td>
13151315
<td>
13161316
Whether to compress logged events, if <code>spark.eventLog.enabled</code> is true.
13171317
</td>

docs/core-migration-guide.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,10 @@ license: |
2222
* Table of contents
2323
{:toc}
2424

25+
## Upgrading from Core 3.4 to 4.0
26+
27+
- Since Spark 4.0, Spark will compress event logs. To restore the behavior before Spark 4.0, you can set `spark.eventLog.compress` to `false`.
28+
2529
## Upgrading from Core 3.3 to 3.4
2630

2731
- Since Spark 3.4, Spark driver will own `PersistentVolumnClaim`s and try to reuse if they are not assigned to live executors. To restore the behavior before Spark 3.4, you can set `spark.kubernetes.driver.ownPersistentVolumeClaim` to `false` and `spark.kubernetes.driver.reusePersistentVolumeClaim` to `false`.

0 commit comments

Comments
 (0)