Skip to content
Closed
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
2745330
Add implicit class DatasetToBenchmark
MaxGekk Jan 2, 2020
6d13c37
Use noop() in CSVBenchmark
MaxGekk Jan 2, 2020
fb8c2c4
Use noop() in AggregateBenchmark
MaxGekk Jan 2, 2020
c28743a
Use noop() in AvroReadBenchmark
MaxGekk Jan 2, 2020
e3724b9
Use noop() in BloomFilterBenchmark
MaxGekk Jan 2, 2020
714ecfb
Use noop() in DataSourceReadBenchmark
MaxGekk Jan 2, 2020
3e88a53
Use noop() in DateTimeBenchmark
MaxGekk Jan 2, 2020
052dd0e
Use noop() in ExtractBenchmark
MaxGekk Jan 2, 2020
bd0f1ea
Use noop() in FilterPushdownBenchmark
MaxGekk Jan 2, 2020
0d2babc
Use noop() in InExpressionBenchmark
MaxGekk Jan 2, 2020
3db70cc
Use noop() in IntervalBenchmark
MaxGekk Jan 2, 2020
6f89a87
Use noop() in JoinBenchmark
MaxGekk Jan 2, 2020
de6cc74
Use noop() in JsonBenchmark
MaxGekk Jan 2, 2020
c6f7527
Use noop() in MakeDateTimeBenchmark
MaxGekk Jan 2, 2020
7307ad4
Use noop() in MiscBenchmark
MaxGekk Jan 2, 2020
4cbbff7
Use noop() in NestedSchemaPruningBenchmark
MaxGekk Jan 2, 2020
4fafd43
Use noop() in ObjectHashAggregateExecBenchmark
MaxGekk Jan 2, 2020
4414856
Use noop() in OrcReadBenchmark
MaxGekk Jan 2, 2020
3d22d83
Use noop() in RangeBenchmark
MaxGekk Jan 2, 2020
b9bed56
Use noop() in TPCDSQueryBenchmark
MaxGekk Jan 2, 2020
4858f93
Use noop() in UDFBenchmark
MaxGekk Jan 2, 2020
eee2948
Use noop() in WideSchemaBenchmark
MaxGekk Jan 2, 2020
6615d5a
Use noop() in WideTableBenchmark
MaxGekk Jan 2, 2020
c26164a
Add run-benchmarks.py
MaxGekk Jan 4, 2020
d6e519a
Fix run-benchmarks.py
MaxGekk Jan 5, 2020
1957c20
revert "Fix run-benchmarks.py"
MaxGekk Jan 5, 2020
1413425
Revert "Add run-benchmarks.py"
MaxGekk Jan 5, 2020
a9b2dd4
Regen benchmark results for JDK 8 & 11 on Linux
MaxGekk Jan 5, 2020
8dd23b7
Regen TPCDSQueryBenchmark for JDK 8 & 11
MaxGekk Jan 6, 2020
7a287f6
Re-gen JsonBenchmark result on JDK 11
MaxGekk Jan 6, 2020
18173e4
Regen JsonBenchmark results on JDK 8
MaxGekk Jan 6, 2020
2a141fd
Merge remote-tracking branch 'origin/master' into noop-in-benchmarks
MaxGekk Jan 8, 2020
677d3e1
Regen WideSchemaBenchmark results on JDK 8 & 11
MaxGekk Jan 8, 2020
9becd93
init
dongjoon-hyun Jan 10, 2020
ec8977e
Add filterpushdown
dongjoon-hyun Jan 10, 2020
4f63c3c
All
dongjoon-hyun Jan 10, 2020
2795c24
a
dongjoon-hyun Jan 10, 2020
e1a46e7
a
dongjoon-hyun Jan 10, 2020
0cfe42a
Merge pull request #24 from dongjoon-hyun/PR-27078
MaxGekk Jan 10, 2020
24dd096
jdk8
dongjoon-hyun Jan 11, 2020
39f79b0
jdk11
dongjoon-hyun Jan 11, 2020
eccde05
Merge pull request #25 from dongjoon-hyun/PR-27078-2
MaxGekk Jan 12, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
a
  • Loading branch information
dongjoon-hyun committed Jan 10, 2020
commit e1a46e770382a32655b66a09d35b6a97ff594ff4
78 changes: 39 additions & 39 deletions sql/core/benchmarks/CSVBenchmark-results.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,58 +2,58 @@
Benchmark to measure CSV read/write performance
================================================================================================

OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1051-aws
Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1044-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parsing quoted values: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
One quoted string 38738 38801 84 0.0 774761.7 1.0X
One quoted string 51602 51659 59 0.0 1032039.4 1.0X

OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1051-aws
Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1044-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Wide rows with 1000 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Select 1000 columns 111186 113031 1649 0.0 111185.7 1.0X
Select 100 columns 34218 34613 389 0.0 34218.2 3.2X
Select one column 28630 28713 72 0.0 28629.8 3.9X
count() 8328 8365 32 0.1 8328.2 13.4X
Select 100 columns, one bad input field 39616 40140 474 0.0 39615.9 2.8X
Select 100 columns, corrupt record field 48388 48449 57 0.0 48388.3 2.3X
Select 1000 columns 191926 192879 1615 0.0 191925.6 1.0X
Select 100 columns 46766 46846 69 0.0 46766.1 4.1X
Select one column 35877 35930 83 0.0 35876.8 5.3X
count() 11186 11262 65 0.1 11186.0 17.2X
Select 100 columns, one bad input field 59943 60107 232 0.0 59943.0 3.2X
Select 100 columns, corrupt record field 73062 73406 479 0.0 73062.2 2.6X

OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1051-aws
Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1044-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Count a dataset with 10 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns + count() 15945 15956 15 0.6 1594.5 1.0X
Select 1 column + count() 11545 11578 30 0.9 1154.5 1.4X
count() 5311 5419 93 1.9 531.1 3.0X
Select 10 columns + count() 22389 22447 87 0.4 2238.9 1.0X
Select 1 column + count() 14844 14890 43 0.7 1484.4 1.5X
count() 5519 5538 18 1.8 551.9 4.1X

OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1051-aws
Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1044-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Write dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps 1056 1070 14 9.5 105.6 1.0X
to_csv(timestamp) 10256 10834 866 1.0 1025.6 0.1X
write timestamps to files 8610 8638 25 1.2 861.0 0.1X
Create a dataset of dates 1211 1244 38 8.3 121.1 0.9X
to_csv(date) 7318 7327 8 1.4 731.8 0.1X
write dates to files 5360 5368 13 1.9 536.0 0.2X
Create a dataset of timestamps 1949 1977 25 5.1 194.9 1.0X
to_csv(timestamp) 14944 15702 714 0.7 1494.4 0.1X
write timestamps to files 12983 12998 14 0.8 1298.3 0.2X
Create a dataset of dates 2156 2164 7 4.6 215.6 0.9X
to_csv(date) 9675 9709 41 1.0 967.5 0.2X
write dates to files 7880 7897 15 1.3 788.0 0.2X

OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1051-aws
Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux 4.15.0-1044-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Read dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
read timestamp text from files 1949 1966 27 5.1 194.9 1.0X
read timestamps from files 25611 25625 14 0.4 2561.1 0.1X
infer timestamps from files 47531 48295 678 0.2 4753.1 0.0X
read date text from files 1820 1832 11 5.5 182.0 1.1X
read date from files 20238 20258 19 0.5 2023.8 0.1X
infer date from files 19929 19958 26 0.5 1992.9 0.1X
timestamp strings 2472 2481 8 4.0 247.2 0.8X
parse timestamps from Dataset[String] 29423 29571 149 0.3 2942.3 0.1X
infer timestamps from Dataset[String] 53800 53903 158 0.2 5380.0 0.0X
date strings 2704 2706 2 3.7 270.4 0.7X
parse dates from Dataset[String] 23411 23435 35 0.4 2341.1 0.1X
from_csv(timestamp) 26991 27038 44 0.4 2699.1 0.1X
from_csv(date) 22072 22110 37 0.5 2207.2 0.1X
read timestamp text from files 2235 2245 10 4.5 223.5 1.0X
read timestamps from files 54490 54690 283 0.2 5449.0 0.0X
infer timestamps from files 104501 104737 236 0.1 10450.1 0.0X
read date text from files 2035 2040 6 4.9 203.5 1.1X
read date from files 39650 39707 52 0.3 3965.0 0.1X
infer date from files 29235 29363 164 0.3 2923.5 0.1X
timestamp strings 3412 3426 18 2.9 341.2 0.7X
parse timestamps from Dataset[String] 66864 67804 981 0.1 6686.4 0.0X
infer timestamps from Dataset[String] 118780 119284 837 0.1 11878.0 0.0X
date strings 3730 3734 4 2.7 373.0 0.6X
parse dates from Dataset[String] 48728 49071 309 0.2 4872.8 0.0X
from_csv(timestamp) 62294 62493 260 0.2 6229.4 0.0X
from_csv(date) 44581 44665 117 0.2 4458.1 0.1X


Loading