
[SPARK-2304] tera sort example program for shuffle benchmarks #1242

Closed
wants to merge 14 commits

Conversation

@rxin (Contributor) commented Jun 27, 2014

This pull request adds an example program for benchmarking Spark shuffle. It dynamically generates a set of 100-byte records according to the TeraSort spec and repartitions the data using an evenly spaced range partitioner. By design, it does NOT perform sorting after the range partitioning yet.

Some of the code is copied directly from Hadoop and simplified (the data-generator portion).

I've been using this utility to benchmark Spark at scale, including shuffling 100TB of data in 12 mins and 300TB in 36 mins.
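For readers who just want the shape of the benchmark, here is a minimal sketch of the idea, assuming random bytes stand in for the spec's key/value contents; the class and partitioner names are hypothetical, and this is not the PR's actual code:

```scala
import scala.util.Random
import org.apache.spark.{Partitioner, SparkConf, SparkContext}

// Hypothetical evenly spaced partitioner keyed on the first byte of the key.
class EvenRangePartitioner(parts: Int) extends Partitioner {
  override def numPartitions: Int = parts
  override def getPartition(key: Any): Int = {
    val firstByte = key.asInstanceOf[Array[Byte]](0) & 0xff
    firstByte * parts / 256
  }
}

object ShuffleBenchmarkSketch {
  def main(args: Array[String]): Unit = {
    val Array(numParts, recordsPerPart) = args.map(_.toInt)
    val sc = new SparkContext(new SparkConf().setAppName("ShuffleBenchmarkSketch"))

    // Generate (10-byte key, 90-byte value) pairs: 100 bytes per record.
    val records = sc.parallelize(0 until numParts, numParts).flatMap { part =>
      val rand = new Random(part)
      Iterator.fill(recordsPerPart) {
        val key = new Array[Byte](10)
        val value = new Array[Byte](90)
        rand.nextBytes(key)
        rand.nextBytes(value)
        (key, value)
      }
    }

    // Shuffle via the range partitioner; no sorting within partitions.
    records.partitionBy(new EvenRangePartitioner(numParts)).count()
    sc.stop()
  }
}
```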

@AmplabJenkins
Merged build triggered.

@AmplabJenkins
Merged build started.

@AmplabJenkins
Merged build finished.

@AmplabJenkins
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16195/

@AmplabJenkins
Merged build triggered.

@AmplabJenkins
Merged build started.

@AmplabJenkins
Merged build finished. All automated tests passed.

@AmplabJenkins
All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16196/

@AmplabJenkins
Merged build triggered.

@AmplabJenkins
Merged build started.

@AmplabJenkins
Merged build finished.

@AmplabJenkins
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16224/

@AmplabJenkins
Merged build triggered.

@AmplabJenkins
Merged build started.

@AmplabJenkins
Merged build finished. All automated tests passed.

@AmplabJenkins
All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16227/

@AmplabJenkins
Merged build triggered.

@AmplabJenkins
Merged build started.

@AmplabJenkins
Merged build finished. All automated tests passed.

@AmplabJenkins
All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16231/

@tgravescs (Contributor)

The Hadoop code for generating the data is out of date. It might not matter for your purposes, but if you want the up-to-date version, look at sortbenchmark.org. I had filed a JIRA to update the Hadoop one but haven't gotten to it.

@mridulm (Contributor) commented Jun 30, 2014

Nice addition, thanks Reynold!

@pwendell (Contributor) commented Sep 2, 2014

@rxin can you close this issue for now? It's been lingering a long time.

@rxin closed this Sep 2, 2014
@jerryshao (Contributor)

Hi @rxin, sorry to bring this up again. Are you planning to merge this TeraSort example into Spark? I think it would be a good standard benchmark for shuffle performance.

Besides, I think the generated records should be copied; otherwise, reusing record objects will lead to errors in sort-based shuffle, as in SPARK-2967.

Also, is it intentional not to do in-partition sorting, or will that be added later?

Thanks a lot.
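To make the record-copying point above concrete, here is a minimal sketch, assuming the generator reuses one mutable buffer per iterator (the helper name is hypothetical, not code from this PR): each record needs its own copy before it enters a sort-based shuffle, because the shuffle buffers records and reads them again during sorting.

```scala
// Hypothetical helper: deep-copy each (key, value) record so that buffered
// records in a sort-based shuffle don't all alias the generator's reused buffers.
def copiedRecords(raw: Iterator[(Array[Byte], Array[Byte])])
    : Iterator[(Array[Byte], Array[Byte])] =
  raw.map { case (key, value) => (key.clone(), value.clone()) }
```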

@rxin (Contributor, Author) commented Sep 12, 2014

I don't think we are going to merge this into Spark, unless there is huge demand from users...

@liuqiyun

@rxin I am confused about the input parameters of GenSort.scala.
It requires three parameters: [num-parts] [records-per-part] [output-path].
If I want to generate and sort 100 GB of data using 4 partitions, is it correct to set the parameters to '4, 268435456, /tmp/sort-output'?

It seems one record (row) equals 100 bytes, so I computed the number of records as follows:
100 GB = 107374182400 bytes = 1073741824 records × 100 bytes/record = 268435456 records × 4 partitions × 100 bytes/record
So each partition should generate 268435456 records, right?

However, if I save the output as a sequence file, the total size of the output files is only 20 GB. If I save the output as a text file instead of a sequence file, the total size is 309.2 GB (77.3 GB × 4 partitions), not 100 GB. Why?
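The parameter arithmetic itself checks out; a quick verification in plain Scala (no Spark needed):

```scala
// Verify the records-per-partition arithmetic for 100 GB across 4 partitions.
val totalBytes = 100L * 1024 * 1024 * 1024   // 100 GB
val recordSize = 100L                        // bytes per record, per the spec
val numParts   = 4L
println(totalBytes / recordSize / numParts)  // 268435456 records per partition
```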

@rxin (Contributor, Author) commented Dec 29, 2014

The size of the data is 100 GB in its uncompressed binary representation. You are probably compressing the data when you save it as a sequence file. When you save it as a text file, the text representation is much larger (i.e., a single byte is rendered as multiple characters of text).
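As a rough illustration (an assumption about the rendering; the exact text format used here isn't specified): printing each byte as decimal digits plus a separator turns a 100-byte binary record into a few hundred characters of text, which is the kind of 3-4x expansion that would turn 100 GB into ~309 GB.

```scala
// One 100-byte record, rendered as comma-separated decimal text.
val record = Array.fill[Byte](100)(200.toByte)
val asText = record.map(b => b & 0xff).mkString(",")
println(record.length) // 100 bytes in binary form
println(asText.length) // 399 characters: ~4x larger in this rendering
```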

@liuqiyun

So how do I save the uncompressed binary representation in the GenSort.scala program? I want to compare it with Hadoop MR, which also uses the uncompressed binary representation.
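One possible approach, as a sketch rather than a confirmed answer (the BytesWritable wiring is an assumption, not code from GenSort.scala): write the pair RDD through Hadoop's SequenceFileOutputFormat with output compression explicitly disabled.

```scala
import org.apache.hadoop.io.BytesWritable
import org.apache.hadoop.mapred.{JobConf, SequenceFileOutputFormat}
import org.apache.spark.SparkContext._
import org.apache.spark.rdd.RDD

// Save (key, value) byte-array records as an uncompressed SequenceFile.
def saveUncompressed(records: RDD[(Array[Byte], Array[Byte])], path: String): Unit = {
  val conf = new JobConf(records.sparkContext.hadoopConfiguration)
  conf.setBoolean("mapred.output.compress", false) // force raw, uncompressed output
  records
    .map { case (k, v) => (new BytesWritable(k), new BytesWritable(v)) }
    .saveAsHadoopFile(
      path,
      classOf[BytesWritable],
      classOf[BytesWritable],
      classOf[SequenceFileOutputFormat[BytesWritable, BytesWritable]],
      conf)
}
```

A SequenceFile still adds a small amount of per-record framing, so the on-disk size will come out slightly above the raw 100 GB.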
