iceberg icon indicating copy to clipboard operation
iceberg copied to clipboard

Core: Use Bulk Delete when dropping table data and metadata

Open amogh-jahagirdar opened this issue 3 years ago • 5 comments

This change updates the implementation of dropping table data and metadata to use bulk delete if applicable.

amogh-jahagirdar avatar Aug 07 '22 21:08 amogh-jahagirdar

Currently it does not have any testing, I know we have some issue using s3mock, can we add an integ test in aws module for delete all files in a table?

jackye1995 avatar Aug 08 '22 00:08 jackye1995

Currently it does not have any testing, I know we have some issue using s3mock, can we add an integ test in aws module for delete all files in a table?

Yeah, I was just looking into it. If needed, I can add integration tests to the TestGlueCatalogTable. TestGlueCatalogTable already tests purging and the underlying file IO is S3, so if those succeed we should be good. I will run those and validate as well as add any other cases.

amogh-jahagirdar avatar Aug 08 '22 00:08 amogh-jahagirdar

TestGlueCatalogTable already tests purging and the underlying file IO is S3, so if those succeed we should be good. I will run those and validate as well as add any other cases.

I don't think we have enough files generated to trigger the full code path, let's make sure we do that

jackye1995 avatar Aug 08 '22 01:08 jackye1995

@szehon-ho @danielcweeks @aokolnychyi @rdblue Would be happy to get your feedback as well!

amogh-jahagirdar avatar Aug 10 '22 23:08 amogh-jahagirdar

I'd love to take a look later today.

aokolnychyi avatar Aug 11 '22 16:08 aokolnychyi

@amogh-jahagirdar, let me know what you think about the suggestions above. If you want to, we can do them in a follow-up PR as well to avoid blocking this feature. Looking forward to consuming this in actions!

aokolnychyi avatar Aug 19 '22 21:08 aokolnychyi

Thanks @aokolnychyi for the reviews, and sorry for the delay, I got busy with internal work. I've updated the PR with the suggested refactoring.

amogh-jahagirdar avatar Aug 20 '22 20:08 amogh-jahagirdar

Let me take a look. Sorry for the delay.

aokolnychyi avatar Aug 26 '22 16:08 aokolnychyi

Thanks @aokolnychyi @jackye1995 @singhpk234 for the reviews, I've addressed all the nits!

amogh-jahagirdar avatar Aug 27 '22 17:08 amogh-jahagirdar

Thanks for the contribution @amogh-jahagirdar , and thanks for reviewing @singhpk234 @aokolnychyi !

jackye1995 avatar Aug 27 '22 19:08 jackye1995