[VARIANT] Validate precision in VariantDecimalXX structs and add missing tests #7776

scovich · 2025-06-25T16:05:22Z

Which issue does this PR close?

Rationale for this change

As a follow-up to #7738, we should verify that the unscaled integer value fits in the max precision (scale factor was already validated).

What changes are included in this PR?

Add the missing checking, and add missing unit tests for both precision and scale.

Also move the VariantDecimalXX structs to their own mod.

Are these changes tested?

Yes, see above.

Are there any user-facing changes?

No. Public re-rexport of the moved structs.

scovich · 2025-06-25T16:05:45Z

Attn @Weijun-H @alamb

scovich · 2025-06-25T16:06:07Z

parquet-variant/src/variant.rs

@@ -1,5 +1,3 @@
-use std::ops::Deref;


not sure how this ended up here...

we can blame the IDE perhaps

scovich · 2025-06-25T16:06:25Z

parquet-variant/src/variant.rs

 #[derive(Debug, Clone, Copy, PartialEq)]
 pub struct ShortString<'a>(pub(crate) &'a str);

-/// Represents a 4-byte decimal value in the Variant format.


Moved to decimal.rs

scovich · 2025-06-25T16:06:58Z

parquet-variant/src/variant/decimal.rs

+    const MAX_PRECISION: u32 = 9;
+    const MAX_UNSCALED_VALUE: u32 = 10_u32.pow(Self::MAX_PRECISION) - 1;


Hoisted up and renamed the existing constant, and used it to define the other constant

scovich · 2025-06-25T16:07:23Z

parquet-variant/src/variant/decimal.rs

+                "Scale {} of a 4-byte decimal cannot exceed the max precision {}",
+                scale,
+                Self::MAX_PRECISION,


Updated the error message to reference the constant instead of a magic number

scovich · 2025-06-25T16:07:39Z

parquet-variant/src/variant/decimal.rs

+        // Validate that the integer value fits within the precision
+        if integer.unsigned_abs() > Self::MAX_UNSCALED_VALUE {
+            return Err(ArrowError::InvalidArgumentError(format!(
+                "{} is too large to store in a 4-byte decimal with max precision {}",
+                integer,
+                Self::MAX_PRECISION
+            )));
+        }


The newly added validation

alamb

Love it

alamb · 2025-06-25T19:20:27Z

Thank you @scovich

CC @Weijun-H in case you are interested as well

[VARIANT] Move VariantDecimalXX structs to their own mod and add tests

93af2a7

github-actions bot added the parquet Changes to the parquet crate label Jun 25, 2025

scovich commented Jun 25, 2025

View reviewed changes

alamb approved these changes Jun 25, 2025

View reviewed changes

Merge remote-tracking branch 'apache/main' into validate-decimal-value

e122538

alamb mentioned this pull request Jun 25, 2025

[Variant] Add negative tests for reading invalid primitive variant values #7779

Merged

alamb merged commit d6c421c into apache:main Jun 25, 2025
12 checks passed

scovich mentioned this pull request Jun 25, 2025

[VARIANT] Add support for the json_to_variant API #7783

Merged

alamb mentioned this pull request Jul 28, 2025

[Variant] Add input validation in VariantBuilder #7697

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[VARIANT] Validate precision in VariantDecimalXX structs and add missing tests #7776

[VARIANT] Validate precision in VariantDecimalXX structs and add missing tests #7776

Uh oh!

scovich commented Jun 25, 2025

Uh oh!

scovich commented Jun 25, 2025

Uh oh!

scovich Jun 25, 2025

Uh oh!

alamb Jun 25, 2025

Uh oh!

scovich Jun 25, 2025

Uh oh!

scovich Jun 25, 2025

Uh oh!

scovich Jun 25, 2025

Uh oh!

scovich Jun 25, 2025

Uh oh!

alamb left a comment

Uh oh!

alamb commented Jun 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		const MAX_PRECISION: u32 = 9;
		const MAX_UNSCALED_VALUE: u32 = 10_u32.pow(Self::MAX_PRECISION) - 1;

[VARIANT] Validate precision in VariantDecimalXX structs and add missing tests #7776

[VARIANT] Validate precision in VariantDecimalXX structs and add missing tests #7776

Uh oh!

Conversation

scovich commented Jun 25, 2025

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

scovich commented Jun 25, 2025

Uh oh!

scovich Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

alamb Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

alamb commented Jun 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants