datafusion
datafusion copied to clipboard
Incorrect statistics read for struct array in parquet
Describe the bug
I found this while adding tests https://github.com/apache/datafusion/pull/10608. The statistics of struct array returns nothing
To Reproduce
See test_struct in https://github.com/apache/datafusion/pull/10608
Expected behavior
Return some values for the statistics
Additional context
No response
take
#8334 Related. The current statistics for structs returns null.
The problem here is how to effectively deal with nested struct, I don't actually know whether all the columns related to one struct are totally stored in one row group or they would separate in different row groups