
@liupc liupc commented Dec 27, 2017

What changes were proposed in this pull request?

When fetching map statuses, an IOException thrown in MapOutputTracker.deserializeMapStatuses is not handled, so the old broadcast may not be cleaned up. If a retried task is then scheduled on the executor where the old broadcast is stored, a "Block is already present in the MemoryStore" exception is thrown.

This PR cleans up the stale broadcast when an IOException occurs in MetadataFetch, which avoids
repeated "Block is already present in the MemoryStore" errors caused by the leftover broadcast.

For details, see: https://issues.apache.org/jira/browse/SPARK-22907
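
To illustrate the failure mode described above, here is a minimal toy model in Python. It is not Spark code: `MemoryStore`, `fetch_statuses`, `run_with_retry`, and the block ID `broadcast_0` are all hypothetical stand-ins, and the failed read is simulated with a `None` payload. The sketch only shows why a block left behind after an IOException makes a retry on the same executor fail, and why removing it in the error path lets the retry succeed.

```python
class MemoryStore:
    """Toy stand-in for an executor-local block store (hypothetical, not Spark's class)."""

    def __init__(self):
        self.blocks = {}

    def put(self, block_id, data):
        # Like the error in the description: a duplicate block ID is rejected.
        if block_id in self.blocks:
            raise RuntimeError(
                f"Block {block_id} is already present in the MemoryStore"
            )
        self.blocks[block_id] = data

    def remove(self, block_id):
        self.blocks.pop(block_id, None)


def fetch_statuses(store, block_id, payload, cleanup_on_error):
    """Cache the broadcast block locally, then decode it (simplified assumption)."""
    store.put(block_id, payload)
    try:
        if payload is None:  # simulated I/O failure while deserializing
            raise IOError("failed to deserialize map statuses")
        return payload.split(",")
    except IOError:
        if cleanup_on_error:
            # The idea of the fix: drop the stale block before propagating the error.
            store.remove(block_id)
        raise


def run_with_retry(cleanup_on_error):
    """First attempt fails with an IOError; the retry lands on the same store."""
    store = MemoryStore()
    try:
        fetch_statuses(store, "broadcast_0", None, cleanup_on_error)
    except IOError:
        pass  # the task is retried
    try:
        return fetch_statuses(store, "broadcast_0", "a,b", cleanup_on_error)
    except RuntimeError as err:
        return str(err)
```

Calling `run_with_retry(False)` returns the "already present" error message from the retry, while `run_with_retry(True)` returns the decoded statuses, because the stale block was removed when the first attempt failed.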

(Please fill in changes proposed in this fix)

How was this patch tested?

Manual testing.
(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

Please review http://spark.apache.org/contributing.html before opening a pull request.

liupengcheng added 2 commits December 27, 2017 17:36
Summary: Ref T10399

Test Plan: n/a

Reviewers: liushaohui, peng.zhang, zhoukang, chenfan

Subscribers: cloud-computing

Maniphest Tasks: T10399

Differential Revision: https://phabricator.d.xiaomi.net/D79279
@squito
Contributor

squito commented Dec 29, 2017

Thanks for submitting this @liupc. The jira description makes sense, though I need to take a closer look at the code to confirm things. But at least two things need to be fixed before this can be merged:

  1. can you clean up the PR description?

  2. can you add a test case for this?

@jiangxb1987
Contributor

ping @liupc

@AmplabJenkins

Can one of the admins verify this patch?

@HyukjinKwon
Member

ping @liupc

@asfgit asfgit closed this in 1a4fda8 Jul 19, 2018
zifeif2 pushed a commit to zifeif2/spark that referenced this pull request Nov 22, 2025
Closes apache#17422
Closes apache#17619
Closes apache#18034
Closes apache#18229
Closes apache#18268
Closes apache#17973
Closes apache#18125
Closes apache#18918
Closes apache#19274
Closes apache#19456
Closes apache#19510
Closes apache#19420
Closes apache#20090
Closes apache#20177
Closes apache#20304
Closes apache#20319
Closes apache#20543
Closes apache#20437
Closes apache#21261
Closes apache#21726
Closes apache#14653
Closes apache#13143
Closes apache#17894
Closes apache#19758
Closes apache#12951
Closes apache#17092
Closes apache#21240
Closes apache#16910
Closes apache#12904
Closes apache#21731
Closes apache#21095

Added:
Closes apache#19233
Closes apache#20100
Closes apache#21453
Closes apache#21455
Closes apache#18477

Added:
Closes apache#21812
Closes apache#21787

Author: hyukjinkwon <[email protected]>

Closes apache#21781 from HyukjinKwon/closing-prs.