[SPARK-29036][SQL]SparkThriftServer cancel job after execute() thread interrupted #25743
Conversation
@juliuszsompolski I have opened a new PR for the problem we discussed.
juliuszsompolski
left a comment
LGTM. Thanks @AngersZhuuuu!
ok to test

Test build #110513 has finished for PR 25743 at commit
// Actually do need to catch Throwable as some failures don't inherit from Exception and
// HiveServer will silently swallow them.
case e: Throwable =>
  if (statementId != null) {
Could we add a comment explaining why we need this change?
Test build #110519 has finished for PR 25743 at commit

Test build #110644 has finished for PR 25743 at commit
juliuszsompolski
left a comment
LGTM from me. cc @wangyum
retest this please

Test build #111216 has finished for PR 25743 at commit

Merged to master.
What changes were proposed in this pull request?
Discussed in #25611.
If cancel() and close() are called very quickly after the query is started, they may both call cleanup() before any Spark jobs have started. In that case, sqlContext.sparkContext.cancelJobGroup(statementId) does nothing. The execute() thread can then start the jobs and only afterwards get interrupted, exiting through the catch block without anyone cancelling those jobs, so they keep running even though this execution has exited.
So when execute() is interrupted by cancel() and we reach the catch block, we should call cancelJobGroup() again to make sure the jobs are cancelled.
Why are the changes needed?
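The race can be simulated with plain JVM threads, without Spark. This is a minimal sketch, not the actual patch: `cancelJobGroup` below is a hypothetical stand-in for `sqlContext.sparkContext.cancelJobGroup(statementId)` that, like the real call, only takes effect once jobs have actually been started.

```scala
import java.util.concurrent.CountDownLatch
import java.util.concurrent.atomic.AtomicBoolean

object CancelRaceSketch {
  def runRace(): Boolean = {
    val jobsStarted   = new AtomicBoolean(false)
    val jobsCancelled = new AtomicBoolean(false)
    val started       = new CountDownLatch(1)

    // Stand-in for sqlContext.sparkContext.cancelJobGroup(statementId):
    // a no-op unless jobs have already been started.
    def cancelJobGroup(): Unit =
      if (jobsStarted.get()) jobsCancelled.set(true)

    val execute = new Thread(() => {
      try {
        jobsStarted.set(true) // execute() submits its Spark jobs here
        started.countDown()
        Thread.sleep(10000)   // long-running query
      } catch {
        case _: Throwable =>
          // The fix: cancel again in the catch block, because the first
          // cancelJobGroup() may have run before any jobs existed.
          cancelJobGroup()
      }
    })

    cancelJobGroup()    // cancel() races ahead of the jobs: a no-op
    execute.start()
    started.await()
    execute.interrupt() // cancel() interrupts the execute() thread
    execute.join()
    jobsCancelled.get() // true only because the catch block cancelled again
  }
}
```

Without the second `cancelJobGroup()` call in the catch block, `runRace()` returns false: the early cancellation missed the jobs, and the interrupt alone does not stop them.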
Does this PR introduce any user-facing change?
No
How was this patch tested?
MT