Support 1.0 table format

Open xushiyan opened this issue 1 year ago • 4 comments

As of 0.4.0, the supported table versions are 5 and 6. Table version 8 (1.0) support involves:

  • reading LSM tree timeline
  • processing newer log format
  • APIs to read timeline and instants wrt different table versions
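One way to picture the version-dependent timeline APIs from the list above is a small dispatch on the table version. This is only a sketch: `TimelineLayout` and `timeline_layout_for` are hypothetical names, not part of hudi-rs, and the mapping assumes only what this issue states (versions 5 and 6 are supported today, version 8 uses the LSM tree timeline).

```rust
/// Hypothetical classification of how a table stores its timeline.
/// Not an existing hudi-rs type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TimelineLayout {
    /// Table versions 5 and 6, which are already supported.
    Legacy,
    /// Table version 8 (1.0), which reads the LSM tree timeline.
    LsmTree,
}

/// Pick a timeline layout for a table version, or `None` when the
/// version is not handled. The mapping is an assumption for illustration.
fn timeline_layout_for(table_version: u32) -> Option<TimelineLayout> {
    match table_version {
        5 | 6 => Some(TimelineLayout::Legacy),
        8 => Some(TimelineLayout::LsmTree),
        _ => None,
    }
}
```

Callers that read instants could match on the returned layout and choose the corresponding reader, so the version check lives in one place.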

Step 1 file group reader support

File group reader APIs can be made to work with ver 8 base/log files, while leaving Table APIs unchanged. This will allow engine integration to work first.

Test data (sample file slices with different log block types) should be prepared to verify merging support.

Step 2 table API support

LSM tree read support needs to be added and the Timeline APIs refactored. Finally, add table version 8 to the table config validation so that the table builder can be used.
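The last part of step 2 could look roughly like the check below. It is a sketch under assumptions: `SUPPORTED_TABLE_VERSIONS` and `validate_table_version` are hypothetical names rather than the actual hudi-rs validation code, and the input is assumed to be the raw string value of the table version property (for example `hoodie.table.version`).

```rust
/// Hypothetical list of accepted table versions once version 8 is added.
const SUPPORTED_TABLE_VERSIONS: [u32; 3] = [5, 6, 8];

/// Parse the raw table version string and check it against the supported
/// set. Returns the parsed version, or an error message describing why
/// the value was rejected.
fn validate_table_version(raw: &str) -> Result<u32, String> {
    let version: u32 = raw
        .trim()
        .parse()
        .map_err(|e| format!("invalid table version '{}': {}", raw, e))?;

    if SUPPORTED_TABLE_VERSIONS.contains(&version) {
        Ok(version)
    } else {
        Err(format!(
            "unsupported table version {}; supported versions are {:?}",
            version, SUPPORTED_TABLE_VERSIONS
        ))
    }
}
```

Keeping the supported versions in a single constant means that enabling version 8 for the table builder is a one-line change once the timeline work is done.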

xushiyan avatar May 04 '24 06:05 xushiyan

Hi! Is anyone working on this? I'm very interested, and if no one else plans to take it at the moment, I'd like to try contributing. Looking forward to your reply. @xushiyan

TheR1sing3un avatar Nov 22 '24 11:11 TheR1sing3un

@TheR1sing3un sure. but this task is not prioritized at the moment. if you're interested in contributing, i can plan some other high priority work for you. let me know how you want to collaborate.

xushiyan avatar Nov 24 '24 00:11 xushiyan

> @TheR1sing3un sure. but this task is not prioritized at the moment. if you're interested in contributing, i can plan some other high priority work for you. let me know how you want to collaborate.

I'm very interested in the writer, reader, and index work. You could assign me some high-priority issues, for example introducing MOR snapshot read. I'd be glad to contribute!

TheR1sing3un avatar Nov 25 '24 02:11 TheR1sing3un

@TheR1sing3un cool. i'll clean up the backlog a bit and then assign. thanks

xushiyan avatar Nov 27 '24 05:11 xushiyan