
Conversation

@albertlockett (Member) commented Aug 28, 2025

part of #863

Because some OTAP fields are optional, in a stream of record batches we may receive subsequent batches with different schemas. Parquet doesn't support row groups with different sets of column chunks, which means we need to know the schema a priori when the writer is created.

This PR adds code to normalize the schema of the record batch before writing by:

  • putting all the fields in the same order
  • creating all-null (or default-value) columns for any missing columns (see the sketch below)
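A minimal sketch of the normalization idea in arrow-rs (the function name and signature are illustrative, not the exact code in this PR; default-value fill for non-nullable columns would be analogous):

```rust
use std::sync::Arc;

use arrow::array::{new_null_array, ArrayRef};
use arrow::datatypes::SchemaRef;
use arrow::error::ArrowError;
use arrow::record_batch::RecordBatch;

/// Reorder a batch's columns to match `target`, filling any missing column
/// with an all-null placeholder of the right length.
fn normalize_batch(batch: &RecordBatch, target: SchemaRef) -> Result<RecordBatch, ArrowError> {
    let num_rows = batch.num_rows();
    let src_schema = batch.schema();
    let columns: Vec<ArrayRef> = target
        .fields()
        .iter()
        .map(|field| match src_schema.index_of(field.name()) {
            // Column exists in the incoming batch: reuse it, in target order.
            Ok(idx) => Arc::clone(batch.column(idx)),
            // Column is missing: materialize an all-null placeholder.
            Err(_) => new_null_array(field.data_type(), num_rows),
        })
        .collect();
    RecordBatch::try_new(target, columns)
}
```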

The missing columns should add only a small overhead on disk: for an all-null column, parquet writes an essentially empty column chunk (just the null count, no data), and for default-value columns parquet uses dictionary and RLE encoding by default, producing a small column chunk with a single dictionary value and a single run for the keys.
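To illustrate that, here's a rough sketch of writing a batch containing a null placeholder column through arrow-rs's ArrowWriter; the schema, data, and file name are made up for the example, but dictionary encoding really is enabled by default in WriterProperties:

```rust
use std::fs::File;
use std::sync::Arc;

use arrow::array::{new_null_array, ArrayRef, Int32Array};
use arrow::datatypes::{DataType, Field, Schema};
use arrow::record_batch::RecordBatch;
use parquet::arrow::ArrowWriter;
use parquet::file::properties::WriterProperties;

fn write_with_placeholder() -> Result<(), Box<dyn std::error::Error>> {
    let schema = Arc::new(Schema::new(vec![
        Field::new("id", DataType::Int32, false),
        Field::new("optional_attr", DataType::Utf8, true),
    ]));

    // "optional_attr" was absent from the incoming batch, so it's filled with nulls.
    let id: ArrayRef = Arc::new(Int32Array::from(vec![1, 2, 3]));
    let placeholder = new_null_array(&DataType::Utf8, 3);
    let batch = RecordBatch::try_new(schema.clone(), vec![id, placeholder])?;

    // Default properties enable dictionary encoding, so a null or constant
    // placeholder column turns into a very small column chunk on disk.
    let props = WriterProperties::builder().build();
    let mut writer = ArrowWriter::try_new(File::create("example.parquet")?, schema, Some(props))?;
    writer.write(&batch)?;
    writer.close()?;
    Ok(())
}
```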

What's unfortunate is that we still materialize an all-null column (with the length of the record batch) before writing. This can be optimized once run-end encoded arrays are supported in parquet, because we could then create a run array with a single run of the null/default value. The Arrow community is currently working on adding support (see apache/arrow-rs#7713 & apache/arrow-rs#8069).

@albertlockett albertlockett requested a review from a team as a code owner August 28, 2025 16:09
@github-actions github-actions bot added the rust Pull requests that update Rust code label Aug 28, 2025

codecov bot commented Aug 28, 2025

Codecov Report

❌ Patch coverage is 95.85799% with 28 lines in your changes missing coverage. Please review.
✅ Project coverage is 81.18%. Comparing base (0d3422f) to head (0bd3217).
⚠️ Report is 1 commit behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1024      +/-   ##
==========================================
+ Coverage   81.08%   81.18%   +0.10%     
==========================================
  Files         363      364       +1     
  Lines       89424    90095     +671     
==========================================
+ Hits        72508    73148     +640     
- Misses      16388    16419      +31     
  Partials      528      528              
Components            Coverage Δ
otap-dataflow         81.21% <95.97%> (+0.33%) ⬆️
beaubourg             ∅ <ø> (∅)
otel-arrow-rust       87.07% <80.00%> (-0.01%) ⬇️
query_abstraction     80.61% <ø> (ø)
query_engine          91.05% <ø> (ø)
syslog_cef_receivers  ∅ <ø> (∅)
otel-arrow-go         52.82% <ø> (ø)

Comment on lines +11 to +14
//! This means we can't receive two consecutive OTAP batches for some payload type and write them
//! into the same writer. To handle this, we insert all null columns for missing columns (or all
//! default-value where the column is not nullable), and also arrange the columns so they're always
//! in the same order.
@lquerel (Contributor) commented:
We probably don't have any other simple option for resolving this limitation. However, it's important to keep an eye on the implications this will have, particularly on memory consumption across the parquet -> arrow -> query engine path. It might have no effect, but we'll need to verify that.

@albertlockett (Member, Author) commented Aug 28, 2025:

@lquerel I agree. On memory consumption, there's an optimization to swap the placeholders we create for a more efficient representation. We can do this once run-end encoding is supported in Parquet (work in progress).

Say we're adding a placeholder for a missing Int32 column: today we'd need an all-null Int32Array, which still carries a values buffer of four bytes per row plus the nulls buffer:

values buf = [0, 0, ... ] // len = num rows
nulls buf  = [false, false ... ] // len = num rows

Eventually we'll change to use a RunArray, which would contain an Int32Array of length 1, plus a single run specifying to use the same value for the entire length:

values buf = [0]
nulls buf  = [false]
run ends   = [num_rows]
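A rough sketch of constructing such a placeholder with arrow-rs (in-memory only; writing run-end encoded arrays to parquet is the piece still in progress upstream, and the helper name is hypothetical):

```rust
use arrow::array::{Array, Int32Array, RunArray};
use arrow::datatypes::Int32Type;

fn null_int32_placeholder(num_rows: i32) -> RunArray<Int32Type> {
    // One logical run covering the whole batch, whose single value is null.
    let run_ends = Int32Array::from(vec![num_rows]);          // run ends = [num_rows]
    let values = Int32Array::from(vec![Option::<i32>::None]); // values   = [null]
    let placeholder = RunArray::try_new(&run_ends, &values).expect("valid run array");
    assert_eq!(placeholder.len(), num_rows as usize);
    placeholder
}
```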

I think we could maybe even reuse the values & nulls buffers across multiple column replacements for the same data types.

I will create a follow-up issue to track this optimization.

@albertlockett (Member, Author) commented:

Regarding querying, I think the behaviour will depend on the query engine, but we can definitely verify. I imagine the overhead of the extra metadata from these additional columns would generally be small compared to the size of the data. Depending on the query, we'd still probably have empty arrays in memory for these columns if the user selected them.

@lquerel (Contributor) left a comment:

LGTM. See my comment.

@lquerel lquerel enabled auto-merge August 28, 2025 21:11
@lquerel lquerel added this pull request to the merge queue Aug 28, 2025
Merged via the queue into open-telemetry:main with commit fc324c5 Aug 28, 2025
35 checks passed
clhain pushed a commit to clhain/otel-arrow that referenced this pull request Oct 15, 2025