Skip to content

Conversation

@dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Apr 3, 2024

What changes were proposed in this pull request?

This PR aims to adjust the following in order to make Spark speculative execution behavior less aggressive from Apache Spark 4.0.0.

  • spark.speculation.multiplier: 1.5 -> 3
  • spark.speculation.quantile: 0.75 -> 0.9

Why are the changes needed?

Although spark.speculation is disabled by default, this has been used in many production use cases.

Does this PR introduce any user-facing change?

This will make a speculative execution less agressive.

How was this patch tested?

Pass the CIs.

Was this patch authored or co-authored using generative AI tooling?

No.

@dongjoon-hyun
Copy link
Member Author

WDYT, @cloud-fan and @HyukjinKwon ? I received many requests due to the following.

Copy link
Contributor

@mridulm mridulm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We actually use 4 and 0.9 internally :-)
We also have a minTaskRuntime of 30s ... though this might not work well for streaming apps.

@HyukjinKwon
Copy link
Member

cc @Ngone51 and @jiangxb1987 FYI

@dongjoon-hyun
Copy link
Member Author

Thank you, @mridulm and @HyukjinKwon .

@dongjoon-hyun
Copy link
Member Author

Merged to master for Apache Spark 4.0.0.

@dongjoon-hyun dongjoon-hyun deleted the SPARK-47720 branch April 4, 2024 02:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants