Extend the List of Stat Types to be Covered by the ParquetStatsConverterUtil class

Open sapienza88 opened this issue 4 months ago • 0 comments

Feature Request / Improvement

Converting a Parquet Binary type stat to the corresponding type in the Delta, Iceberg, or Hudi format currently throws exceptions.

To keep the table formats unaware of the stat conversions from the Parquet format, and to handle those conversions properly on the Parquet side, Binary (aka BYTE_ARRAY) logical types (e.g., ENUM, JSON, BSON) have to be handled on a case-by-case basis. We have to add tests for these types and, as needed, extend the cases in xtable-core/src/main/java/org/apache/xtable/parquet/ParquetStatsConverterUtil.java to cover a more comprehensive list of Binary and other types.
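A minimal sketch of what per-case handling could look like, to make the idea concrete. This is not the current ParquetStatsConverterUtil code: the class name `BinaryStatSketch`, the enum `BinaryLogicalType`, and the method `convertBinaryStat` are hypothetical, and the enum is only a stand-in for Parquet's logical type annotations so the example runs without the Parquet dependency. It assumes that STRING, ENUM, and JSON stats can be surfaced as UTF-8 strings (the Parquet format spec defines these as UTF-8 encoded), while BSON and un-annotated BYTE_ARRAY stats are kept as raw bytes.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical sketch, not the actual XTable implementation.
final class BinaryStatSketch {

  // Stand-in for the logical type annotation on a Parquet BYTE_ARRAY column (assumption).
  enum BinaryLogicalType { STRING, ENUM, JSON, BSON, NONE }

  // Converts the raw bytes of a min/max stat according to the column's logical type.
  static Object convertBinaryStat(byte[] raw, BinaryLogicalType logicalType) {
    if (raw == null) {
      return null; // no stat recorded for this column
    }
    switch (logicalType) {
      case STRING:
      case ENUM:
      case JSON:
        // UTF-8 encoded text: expose as a String so target formats can compare it.
        return new String(raw, StandardCharsets.UTF_8);
      case BSON:
      case NONE:
        // Opaque binary: keep the bytes, copied so callers cannot mutate the source array.
        return ByteBuffer.wrap(Arrays.copyOf(raw, raw.length));
      default:
        // Fail loudly on anything not explicitly covered instead of guessing.
        throw new UnsupportedOperationException("Unhandled logical type: " + logicalType);
    }
  }

  public static void main(String[] args) {
    byte[] enumStat = "RED".getBytes(StandardCharsets.UTF_8);
    System.out.println(convertBinaryStat(enumStat, BinaryLogicalType.ENUM));
    System.out.println(convertBinaryStat(new byte[] {5, 0, 0, 0, 0}, BinaryLogicalType.BSON));
  }
}
```

Each logical type added this way would get its own test case. Whether a target format can make use of a JSON or BSON min/max at all is a separate question, so dropping such stats instead of converting them may be the better choice for some cases.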

Are you willing to submit a PR?

  • [x] Yes I am willing to submit a PR!

Code of Conduct

sapienza88, Oct 06 '25 16:10