Skip to content

Conversation

@ericm-db
Copy link
Contributor

What changes were proposed in this pull request?

We are migrating to a new error framework in order to surface errors in a friendlier way to customers. This PR defines a new error class specifically for when there are concurrent updates to the log for the same batch ID

Why are the changes needed?

This gives more information to customers, and allows them to filter in a better way

Does this PR introduce any user-facing change?

How was this patch tested?

There is an existing test to check the error message upon this condition. Because we are only changing the error type, and not the error message, this test is sufficient.

@ericm-db ericm-db changed the title [WIP] [SC-130782] Define a new error class and apply for the case where streaming query fails due to concurrent run of streaming query with same checkpoint [WIP] [SPARK-43542] Define a new error class and apply for the case where streaming query fails due to concurrent run of streaming query with same checkpoint May 17, 2023
Copy link
Member

@MaxGekk MaxGekk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM except of a minor comment.

@MaxGekk
Copy link
Member

MaxGekk commented May 19, 2023

@ericm-db Could you allow GitHub actions in your fork and re-trigger GAs, please.

@ericm-db ericm-db changed the title [WIP] [SPARK-43542] Define a new error class and apply for the case where streaming query fails due to concurrent run of streaming query with same checkpoint [SPARK-43542] Define a new error class and apply for the case where streaming query fails due to concurrent run of streaming query with same checkpoint May 19, 2023
@ericm-db
Copy link
Contributor Author

Thanks for the review! I've made the changes, and I think it's ready to merge now @MaxGekk @HeartSaVioR

@MaxGekk
Copy link
Member

MaxGekk commented May 20, 2023

+1, LGTM. Merging to master.
Thank you, @ericm-db.

@MaxGekk MaxGekk changed the title [SPARK-43542] Define a new error class and apply for the case where streaming query fails due to concurrent run of streaming query with same checkpoint [SPARK-43542][SS] Define a new error class and apply for the case where streaming query fails due to concurrent run of streaming query with same checkpoint May 20, 2023
@MaxGekk MaxGekk closed this in 37b9c53 May 20, 2023
@MaxGekk
Copy link
Member

MaxGekk commented May 20, 2023

@ericm-db Congratulations with your first contribution to Apache Spark!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants