Jan Finis
This commit adds a new column order `IEEE754TotalOrder`, which can be used for floating point types (FLOAT, DOUBLE, FLOAT16). The advantage of the new order is a well-defined ordering between...
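As a rough illustration of what an IEEE 754 total order looks like for DOUBLE values, the sketch below maps each value to an integer key whose natural ordering matches the `totalOrder` predicate (negative NaN < -Infinity < ... < -0.0 < +0.0 < ... < +Infinity < positive NaN). The helper name `total_order_key` is hypothetical and this is not taken from the commit itself; it is only one common way to realize the ordering.

```python
import math
import struct


def total_order_key(x: float) -> int:
    """Return an integer whose ordering matches IEEE 754 totalOrder for doubles.

    Hypothetical helper for illustration; not part of any Parquet API.
    """
    # Reinterpret the 64 bits of the double as a signed 64-bit integer.
    bits = struct.unpack("<q", struct.pack("<d", x))[0]
    # Values with the sign bit set compare in reverse magnitude order,
    # so flip the lower 63 bits to restore ascending order.
    if bits < 0:
        bits ^= 0x7FFFFFFFFFFFFFFF
    return bits


# Usage: sort values, including signed zeros and NaNs, deterministically.
pos_nan = math.copysign(float("nan"), 1.0)
neg_nan = math.copysign(float("nan"), -1.0)
values = [1.5, pos_nan, 0.0, -0.0, float("-inf"), neg_nan, float("inf"), -2.0]
ordered = sorted(values, key=total_order_key)
```

Unlike the usual `<` comparison, this key distinguishes `-0.0` from `+0.0` and gives every NaN a fixed position, which is why such an order yields well-defined min/max bounds.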
This commit proposes an improvement for handling of NaN values in FLOAT and DOUBLE type columns. The goal is to allow reading engines, regardless of how they order NaN w.r.t....
### Apache Iceberg version

1.4.3 (latest release)

### Query engine

None; it's a Spec issue

### Please describe the bug 🐞

The spec does not clearly define how the truncate...
### Apache Iceberg version

1.4.3 (latest release)

### Query engine

Spark, but probably all of them.

### Please describe the bug 🐞

The truncate partition transform can underflow for all...
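To make the kind of underflow concrete, here is a minimal sketch for a 32-bit integer column only, assuming the truncate transform is computed as `v - (v mod W)` with a non-negative remainder. Python integers do not overflow, so the hypothetical helper `wrap_int32` simulates two's-complement wraparound; neither function is Iceberg code, and the snippet does not claim to cover every type the issue refers to.

```python
INT32_MIN = -2**31


def wrap_int32(n: int) -> int:
    """Simulate signed 32-bit two's-complement wraparound (hypothetical helper)."""
    return (n + 2**31) % 2**32 - 2**31


def truncate_int32(v: int, width: int) -> int:
    """Assumed truncate formula: v - (v mod width), remainder non-negative.

    Python's % already yields a non-negative remainder for a positive width.
    """
    return wrap_int32(v - (v % width))


# Ordinary values round down to a multiple of the width.
small_positive = truncate_int32(17, 10)
small_negative = truncate_int32(-1, 10)

# Near the minimum, rounding down leaves the int32 range and wraps around.
wrapped = truncate_int32(INT32_MIN, 10)
```

In this sketch the smallest input ends up with a large positive result, so values that sort next to each other would no longer map to neighbouring partition values.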
### Apache Iceberg version

1.4.3 (latest release)

### Query engine

None, it's a spec issue.

### Please describe the bug 🐞

I'm referring to the definition of `field_summary`, which is...
### Apache Iceberg version

Latest

### Query engine

None; It's a Spec issue.

### Please describe the bug 🐞

The spec is inconsistent with respect to the `partition_spec_id` column. Here...
This commit is a combination of the following PRs:

* Introduce IEEE 754 total order https://github.com/apache/parquet-format/pull/221
* Add nan_count to handle NaNs in statistics https://github.com/apache/parquet-format/pull/196

Both these PRs try to...