spark icon indicating copy to clipboard operation
spark copied to clipboard

[SPARK-49461][SS] Persistent Checkpoint ID to commit logs and read it back

Open WweiL opened this issue 1 year ago • 1 comments

What changes were proposed in this pull request?

In this change, we propose to add a new field to the commit log if STREAMING_STATE_STORE_COMMIT_LOG_VERSION is 2. The new field is a Map[String, Map[String, Map[String, Seq[String]]]] that maps OperatorId -> PartitionId -> StoreName -> Seq[uniqueId]. This is a necessary step to enable the v2 checkpoint.

Why are the changes needed?

New feature

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added UT

Was this patch authored or co-authored using generative AI tooling?

No

WweiL avatar Aug 29 '24 18:08 WweiL

cc @siying PTAL!

WweiL avatar Aug 29 '24 18:08 WweiL

Same here, I'm merging the PR on behalf of @brkyvz as he asked personally. Just to leave DISCLAIMER.

HeartSaVioR avatar Oct 22 '24 06:10 HeartSaVioR

Thanks! Merging to master.

HeartSaVioR avatar Oct 22 '24 06:10 HeartSaVioR