Add Hudi sink connector support
This PR adds Hudi CDC sink support to Flink CDC.
As of now the following features are supported:
- Simple bucket index
- Non-partitioned tables
- MOR tables
- Compaction plan generation (Compaction execution will require a separate process as of now)
Future improvements will be made to bring along more future support other native hudi features gradually/iteratively as we are trying to keep the PR small and manageable for reviews.
@danny0405 @cshuo FYI
Changes here will require Hudi 1.1.0 to be released first.
@lvyanquan @yuxiqian Bumped the version, but am not sure why the CI is failing, the E2E tests are passing locally:
Can you please advise and help to review? Thank you!
We've re-triggered the CI tests, and a checkstyle issue was reported. You can fix it and trigger a new test run once it's resolved.
@lvyanquan style issue is fixed, thks for helping.
@lvyanquan the ci is green. Could you help with the final review, thks~
The failed test is org.apache.flink.cdc.pipeline.tests.MaxComputeE2eITCase.testSingleSplitSingleTable, should be irrelevant to this pr.
@lvyanquan @yuxiqian gentle reminder for review, thks. cc @leonardBang
Thanks @voonhous and @cshuo for the great work, the changes looks good to me, but currently our test coverage is pretty low, cloud you add more necessary tests ?
@leonardBang thks for reviewing, will add more tests later.