Sagar Sumit
Sagar Sumit
### Change Logs This PR updates the documentation for Hudi-Presto integration to its current state. ### Impact _Describe any public API or user-facing feature change or any performance impact._ **Risk...
JDK versions 16 or later enforce strong encapsulation and do now allow to invoke `setAccessible` on a field, especially when the `isAccessible` is false. More details in [JEP 403](https://openjdk.org/jeps/403). This...
## What is the purpose of the pull request Segregating all the config flips in #5643 . This one enables reconcile schema by default. ## Brief change log *(for example:)*...
## What is the purpose of the pull request hudi-spark-* are not needed in hudi-kafka-connect-bundle. This PR removes those dependencies. NOTE: hudi-aws is still needed because of dependency of `CloudWatchReporter`...
* Add hudi and hive catalog properties * Add core-site.xml and correct spark-defaults.conf configurations. * Add sample script to create hudi table and sync to metastore.
### Change Logs Build storage partition stats index and use it for data skipping. Main changes are as follows: - Index is saved as another partition in the metadata table....
### Change Logs Cleaner could not delete partitions but meta sync fails to drop partition in that case. This could cause query using engines that depend on catalog to fail....
### Change Logs Added basepath to task labels and logging wherever necessary for easier debugging. Also, logging some stats for upsert partitioner. ### Impact none ### Risk level (write none,...
### Change Logs Metadata table should retain at least as much history as data table. Follow similar policy as data table and set retention to 1.2x for metadata table. ###...
### Change Logs This PR introduces a new class hierarchy for handling merge keys in a more flexible and decoupled manner. It adds the `HoodieMergeKey` interface, along with two implementations:...