@@ -155,40 +155,11 @@ documented in [LogicalTypes.md][logical-types].
155155[ logical-types ] : LogicalTypes.md
156156
157157### Sort Order
158-
159158Parquet stores min/max statistics at several levels (such as Column Chunk,
160- Column Index and Data Page). Comparison for values of a type obey the
161- following rules:
162-
163- 1 . Each logical type has a specified comparison order. If a column is
164- annotated with an unknown logical type, statistics may not be used
165- for pruning data. The sort order for logical types is documented in
166- the [ LogicalTypes.md] [ logical-types ] page.
167- 2 . For primitive types, the following rules apply:
168-
169- * BOOLEAN - false, true
170- * INT32, INT64 - Signed comparison.
171- * FLOAT, DOUBLE - Signed comparison with special handling of NaNs and
172- signed zeros. The details are documented in the
173- [ Thrift definition] ( src/main/thrift/parquet.thrift ) in the
174- ` ColumnOrder ` union. They are summarized here but the Thrift definition
175- is considered authoritative:
176- * NaNs should not be written to min or max statistics fields.
177- * If the computed max value is zero (whether negative or positive),
178- ` +0.0 ` should be written into the max statistics field.
179- * If the computed min value is zero (whether negative or positive),
180- ` -0.0 ` should be written into the min statistics field.
181-
182- For backwards compatibility when reading files:
183- * If the min is a NaN, it should be ignored.
184- * If the max is a NaN, it should be ignored.
185- * If the min is +0, the row group may contain -0 values as well.
186- * If the max is -0, the row group may contain +0 values as well.
187- * When looking for NaN values, min and max should be ignored.
188-
189- * BYTE_ARRAY and FIXED_LEN_BYTE_ARRAY - Lexicographic unsigned byte-wise
190- comparison.
191-
159+ Column Index, and Data Page). These statistics are according to a sort order,
160+ which is defined for each column in the file footer. Parquet supports common
161+ sort orders for logical and primitve types. The details are documented in the
162+ [ Thrift definition] ( src/main/thrift/parquet.thrift ) in the ` ColumnOrder ` union.
192163
193164## Nested Encoding
194165To encode nested columns, Parquet uses the Dremel encoding with definition and
0 commit comments