Closed
Refine docs
jinxing committed May 22, 2017
commit 66aa56fd4d23645ec1d2f0e253ea4888b4882f9d
@@ -127,8 +127,8 @@ private[spark] class CompressedMapStatus(

/**
- * A [[MapStatus]] implementation that stores the accurate size of huge blocks, which are larger
- * than both spark.shuffle.accurateBlockThreshold. It stores the average size of other non-empty
- * blocks, plus a bitmap for tracking which blocks are empty.
+ * A [[MapStatus]] implementation that stores the accurate size of huge blocks, which are larger
+ * than spark.shuffle.accurateBlockThreshold. It stores the average size of other non-empty blocks,
+ * plus a bitmap for tracking which blocks are empty.
*
* @param loc location where the task is being executed
* @param numNonEmptyBlocks the number of non-empty blocks
@@ -211,8 +211,8 @@ private[spark] object HighlyCompressedMapStatus {
val size = uncompressedSizes(i)
if (size > 0) {
numNonEmptyBlocks += 1
- // Remove the huge blocks from the calculation for average size and have accurate size for
- // smaller blocks.
+ // Huge blocks are not included in the calculation for average size, thus size for smaller
+ // blocks is more accurate.
if (size < threshold) {
totalSize += size
Contributor

this should be put in the else branch

} else {
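
For readers following the hunk above, here is a minimal, self-contained sketch of the idea the revised comment describes: blocks at or above the threshold are recorded with their exact size and left out of the average, so the average reflects only the smaller blocks. This is not the Spark implementation. `BlockSizeSummary`, `BlockSizeSketch`, and `summarize` are hypothetical names; a plain `Set[Int]` stands in for the bitmap of empty blocks; sizes are kept uncompressed; and dividing by the number of small blocks (rather than all non-empty blocks) is an assumption made for this illustration.

```scala
// Hypothetical stand-in for the per-map-task summary; not a Spark class.
final case class BlockSizeSummary(
    numNonEmptyBlocks: Int,
    emptyBlocks: Set[Int],          // assumption: plain Set instead of a bitmap
    avgSmallBlockSize: Long,        // average over blocks below the threshold only
    hugeBlockSizes: Map[Int, Long]) // exact sizes of blocks at or above the threshold

object BlockSizeSketch {
  def summarize(uncompressedSizes: Array[Long], threshold: Long): BlockSizeSummary = {
    var numNonEmptyBlocks = 0
    var numSmallBlocks = 0
    var totalSmallSize = 0L
    val emptyBlocks = Set.newBuilder[Int]
    val hugeBlockSizes = Map.newBuilder[Int, Long]

    var i = 0
    while (i < uncompressedSizes.length) {
      val size = uncompressedSizes(i)
      if (size > 0) {
        numNonEmptyBlocks += 1
        if (size < threshold) {
          // Small block: contributes to the average.
          totalSmallSize += size
          numSmallBlocks += 1
        } else {
          // Huge block: excluded from the average, exact size recorded.
          hugeBlockSizes += (i -> size)
        }
      } else {
        emptyBlocks += i
      }
      i += 1
    }

    // Assumption: average over small blocks only; 0 when there are none.
    val avg = if (numSmallBlocks > 0) totalSmallSize / numSmallBlocks else 0L
    BlockSizeSummary(numNonEmptyBlocks, emptyBlocks.result(), avg, hugeBlockSizes.result())
  }
}
```

With sizes `Array(0L, 10L, 20L, 1000L)` and a threshold of `100L`, the sketch reports an average of 15 for the two small blocks, whereas averaging all three non-empty blocks would give 343 and badly overestimate the small ones.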