@simoneves (Contributor) commented Nov 3, 2025

There are additional Hive config options that can be used in the Worker, but the Coordinator throws an error if they are also present in its config.

For now, we just add some Parquet read parameters at their default values, so this is simply a convenience for users who wish to tweak them for a specific machine.

IMPORTANT!
You will need to use --overwrite-config when first running with this PR; otherwise it will reuse the existing tree, and a hive.properties file left in the old location will cause a startup error via the Docker mappings. Be sure to copy any hand-edited files aside first, of course, or they will be lost.

- ./config/generated/java/etc_common:/opt/presto-server/etc
- ./config/generated/java/etc_coordinator/config_java.properties:/opt/presto-server/etc/config.properties
- ./config/generated/java/etc_coordinator/node.properties:/opt/presto-server/etc/node.properties
- ./config/generated/java/etc_coordinator/catalog/hive.properties:/opt/presto-server/etc/catalog/hive.properties
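A safe way to follow the backup advice above is to copy the generated tree aside before regenerating it. The launcher command is commented out and purely a placeholder — substitute whatever command you normally use to start the stack; only the --overwrite-config flag comes from this PR:

```shell
# Preserve any hand-edited generated configs before the tree is rebuilt.
mkdir -p config-backup
[ -d ./config/generated ] && cp -r ./config/generated config-backup/

# Then regenerate the config tree on first run with this PR:
# ./<your-launcher> --overwrite-config   # launcher name is a placeholder
```

After the new tree is generated, diff your backup against the fresh files and re-apply any local tweaks.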
Contributor

Should the updates be in docker-compose.common.yml?

Contributor Author

They can't be, because the configs are now per-variant.

Comment on lines 24 to 25
parquet.reader.chunk-read-limit=0
parquet.reader.pass-read-limit=0
Contributor

These configurations do not appear in the documentation. Can you please add comments that describe what these parameters do and why they are needed?

@simoneves (Contributor Author) commented Nov 4, 2025

They aren't strictly needed, other than to prove that the Coordinator and Worker configs can differ; however, @devavret apparently tweaks them on his local laptop, so he asked for them to be exposed.

The values are not documented in Velox itself, but they appear to be passed through to the cuDF Chunked Parquet Reader, whose documentation is here:

https://docs.rapids.ai/api/libcudf/stable/classcudf_1_1io_1_1chunked__parquet__reader#a49f5549b53257828d50f5fa65114e07a

The values in that API are in bytes, but the config parser appears to be smart enough to convert (say) 16M into 16 * 1024 * 1024.
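For illustration, the size-suffix normalization described above can be sketched like this. This is a hypothetical stand-in, not the actual Presto/Velox parser, whose accepted suffixes and semantics may differ:

```python
import re

# Binary (1024-based) multipliers, matching the 16M -> 16 * 1024 * 1024
# behaviour observed in the thread.
_SUFFIXES = {"": 1, "K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4}

def parse_size(value: str) -> int:
    """Convert a size-suffixed config value like '16M' to a byte count."""
    m = re.fullmatch(r"(\d+)\s*([KMGT]?)B?", value.strip(), re.IGNORECASE)
    if not m:
        raise ValueError(f"not a recognizable size: {value!r}")
    number, suffix = m.groups()
    return int(number) * _SUFFIXES[suffix.upper()]

print(parse_size("16M"))  # 16777216
print(parse_size("0"))    # 0
```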

I have added comments to the template file based on the parameter descriptions in that documentation.
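For reference, a commented template entry along these lines would match that documentation. The comment wording here is paraphrased from the cuDF chunked_parquet_reader API docs, not copied from the actual template file in this PR:

```properties
# Approximate upper bound, in bytes, on the size of each chunk returned by
# the cuDF chunked Parquet reader (0 = no limit). Size suffixes like 16M
# are accepted and converted to bytes.
parquet.reader.chunk-read-limit=0

# Approximate upper bound, in bytes, on temporary device memory used while
# decompressing and decoding a pass of the file (0 = no limit).
parquet.reader.pass-read-limit=0
```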
