Ilya Cherkasov comments

Results 23 comments of


                                            Ilya Cherkasov

[SUPPORT] MOR hudi 0.14, Bloom Filters are not being used on query time

>mor read_optimized can use it. can i set spark-sql to use read_optimized to test it out?

[SUPPORT] MOR hudi 0.14, Bloom Filters are not being used on query time

Okay so let's compare. For clean experiment, I created 2 separate sessions for queries below. ``` scala> spark.time({ | val df = spark.read | .format("org.apache.hudi") | .option("hoodie.datasource.query.type", "read_optimized") | .load("s3://path/table/")...

[SUPPORT] MOR hudi 0.14, Bloom Filters are not being used on query time

Sure, but anything specific you want to see?

[SUPPORT] MOR hudi 0.14, Bloom Filters are not being used on query time

for snapshot: 441,483,112, query time 28141ms for read-optimized: 22,887,045, query time 26054ms. ![read-optimized](https://github.com/apache/hudi/assets/892781/d61438ac-3792-4217-9b79-23783128def1) ![snapshot](https://github.com/apache/hudi/assets/892781/3d8d3326-8eb6-4a3a-88a7-0b46d27405e7) ``` scala> spark.time({ | val df = spark.read | .format("org.apache.hudi") | .option("hoodie.datasource.query.type", "read_optimized") | .load("s3://table/") |...