voonhous

Results 13 comments of voonhous

@yihua Thank you for the reply. > Is the INSERT_OVERWRITE the only write action Yes, INSERT_OVERWRITE is the only action being performed on the table. i.e. ensuring that an insert...

@yihua I still don't quite understand the: >This ensures that all base files in the metadata table are always in sync with the data table (w/o any holes) and only...

@rishabhbandi I don't quite understand the steps between: ```txt 1)created a dataframe(targetDf) and using the below statement to write it in GCS Bucket location (for ex - locA) targetDF.write.format(org.apache.hudi).options(hudiWriteConf).mode(SaveMode.Overwrite).save(locA) 2)then...

@waitingF Might need to update the RFC claim number as these **RFC-74** and **RFC-75** were claimed recently under this PR https://github.com/apache/hudi/pull/10275

Detailed explanation of this error can be found here: https://github.com/apache/hudi/issues/6849#issuecomment-1661734683

Adding some illustration for future reference: **Normal Flow** ![image](https://github.com/apache/hudi/assets/69956021/706d3f40-4c68-452f-a997-16234a9a7670) As can be seen in the normal flow (between the JM and TM), a cycle is depicted by the green box....

@zhangyue19921010 No worries, added the markdown formatting back. :) Yeap, root cause here is that checkpoint interval is still running and a long-running rollback/archive might cause the checkpoint to timeout....

@ganczarek https://hudi.apache.org/docs/schema_evolution This looks like a Hive-Sync error. Can you try dropping your hive table (without purging your local data) and recreating your table with the new schema? Hudi-Full-Schema-Evolution FWIU,...

Hello all, i can try picking this up. @ebyhr what are the pending items remaining here?