Add repartition workload #608
Conversation
carsonwang left a comment
Thanks for adding this!
hibench.repartition.large.datasize     100000000
hibench.repartition.huge.datasize      1000000000
hibench.repartition.gigantic.datasize  10000000000
hibench.repartition.bigdata.datasize   60000000000
Can we change this to the same size defined in TeraSort to be consistent?
No problem. I made them bigger because this workload takes less time than TeraSort, but since we write an output by default, the durations should be on the same level.
private def reparition(previous: RDD[Array[Byte]], numReducers: Int): ShuffledRDD[Int, Array[Byte], Array[Byte]] = {
  /** Distributes elements evenly across output partitions, starting from a random partition. */
  val distributePartition = (index: Int, items: Iterator[Array[Byte]]) => {
    var position = (new Random(index)).nextInt(numReducers)
In the Spark code, I noticed hashing.byteswap32(index) is used for the seed. Was the hashing removed here on purpose because there is no difference?
Good catch! I copied that from Spark 2.0.0; the code you mentioned was introduced in apache/spark#18990, which fixes skewed repartitioning when numReducers is a power of 2.
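Not part of this PR: a small standalone sketch for comparing the two seedings discussed above. The object name and the mapper/reducer counts are made up for illustration; it only prints how the chosen start positions spread over the reducers.

import java.util.Random
import scala.util.hashing

object SeedSkewDemo {
  def main(args: Array[String]): Unit = {
    val numReducers = 64   // power of 2, the case addressed by apache/spark#18990
    val numMappers = 1024

    // Start position each map task would pick with a given seeding strategy.
    def startPositions(seed: Int => Int): Seq[Int] =
      (0 until numMappers).map(i => new Random(seed(i)).nextInt(numReducers))

    // With a uniform choice, every reducer should be a start position roughly
    // numMappers / numReducers times.
    def spread(positions: Seq[Int]): String = {
      val counts = positions.groupBy(identity).values.map(_.size)
      s"positions used: ${counts.size} of $numReducers, per-position min=${counts.min} max=${counts.max}"
    }

    println("raw index seed:  " + spread(startPositions(identity)))
    println("byteswap32 seed: " + spread(startPositions(hashing.byteswap32)))
  }
}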
README.md (Outdated)
4. Repartition (micro/repartition)
This workload benchmarks shuffle performance. Input data is generated by Hadoop TeraGen. By default the data is first cached in memory, then shuffle-written and shuffle-read in order to repartition it, so the last two stages reflect only shuffle performance, excluding I/O and other compute. Note: the parameter hibench.repartition.cacheinmemory (default is true) allows reading from storage in the first stage instead of caching.
What about setting hibench.repartition.cacheinmemory to false by default? HiBench measures the execution time of the entire workload and calculates the throughput. Caching in memory seems to be for our own need to measure the shuffle write and shuffle read, so we would need to look at the stage-level execution times ourselves.
Makes sense!
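For reference, a sketch of how the relevant entries in micro/repartition.conf could look with that default. Only the property names are taken from this PR; the values, comments, and file layout shown here are assumptions.

# micro/repartition.conf (illustrative values)
hibench.repartition.huge.datasize    1000000000
# when false, the first stage reads the TeraGen input from storage instead of caching it
hibench.repartition.cacheinmemory    false
# set to true to skip writing the repartitioned data back out
hibench.repartition.disableoutput    false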
Why is Travis not triggered?
  /** Distributes elements evenly across output partitions, starting from a random partition. */
  val distributePartition = (index: Int, items: Iterator[Array[Byte]]) => {
-   var position = (new Random(index)).nextInt(numReducers)
+   var position = new Random(hashing.byteswap32(index)).nextInt(numReducers)
(new Random(hashing.byteswap32(index))) ?
Merged this. Thanks!
We need a workload that benchmarks shuffle performance alone, excluding I/O operations and non-shuffle-related compute. Without repartition we often use TeraSort as a substitute, but I/O and sorting can take most of its time, making it hard to see the shuffle performance.
This workload is Spark-only and consists of the following steps (a sketch of the resulting code follows the list):
1. Read the TeraGen-generated input; caching it in memory first is controlled by hibench.repartition.cacheinmemory in micro/repartition.conf.
2. Shuffle write and shuffle read to repartition the data.
3. Write the repartitioned data back out; this can be turned off with hibench.repartition.disableoutput.
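Putting the quoted diff fragments together, here is a minimal sketch of the repartition step as it would look after the byteswap32 change. It follows the same round-robin pattern as Spark's own repartition/coalesce; the wrapping object, imports, and anything else not visible in the diff above are assumptions.

import java.util.Random
import scala.util.hashing
import org.apache.spark.HashPartitioner
import org.apache.spark.rdd.{RDD, ShuffledRDD}

object RepartitionSketch {
  // Round-robin each input partition's records across numReducers output
  // partitions, starting from a pseudo-random position derived from the
  // partition index (byteswap32 avoids the power-of-2 skew discussed above).
  def reparition(previous: RDD[Array[Byte]],
                 numReducers: Int): ShuffledRDD[Int, Array[Byte], Array[Byte]] = {
    val distributePartition = (index: Int, items: Iterator[Array[Byte]]) => {
      var position = new Random(hashing.byteswap32(index)).nextInt(numReducers)
      items.map { item =>
        position += 1
        (position, item)   // HashPartitioner mods the key into [0, numReducers)
      }
    }
    // The shuffle itself: write the (position, record) pairs and read them
    // back grouped by reducer. Its stages reflect only shuffle cost.
    new ShuffledRDD[Int, Array[Byte], Array[Byte]](
      previous.mapPartitionsWithIndex(distributePartition),
      new HashPartitioner(numReducers))
  }
}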