AMD Ryzen Threadripper 3960X GCC vs. LLVM Clang compiler benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1912235-PTS-THREADRI74 Threadripper EOY2019 Clang vs. GCC - Phoronix Test Suite Threadripper EOY2019 Clang vs. GCC AMD Ryzen Threadripper 3960X GCC vs. LLVM Clang compiler benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/1912235-PTS-THREADRI74&grs&sor .
Threadripper EOY2019 Clang vs. GCC Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 GCC 9.2.1 LLVM Clang 9.0.0 AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) AMD Starship/Matisse 32768MB 1000GB Sabrent Rocket 4.0 1TB Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz) AMD Baffin HDMI/DP ASUS VP28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723 Ubuntu 19.10 5.4.0-nvme-hwmon (x86_64) GNOME Shell 3.34.1 X Server 1.20.5 modesetting 1.20.5 4.5 Mesa 19.2.1 (LLVM 9.0.0) GCC 10.0.0 20191208 ext4 3840x2160 Clang 10.0.0 GCC 9.2.1 20191008 Clang 9.0.0 OpenBenchmarking.org Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" Compiler Details - GCC 10.0.0 20191208: --disable-multilib --enable-checking=release - LLVM Clang 10.0 20191222: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2 - GCC 9.2.1: --disable-multilib --enable-checking=release Disk Details - NONE / errors=remount-ro,relatime,rw Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 Python Details - Python 2.7.17rc1 + Python 3.7.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected
Threadripper EOY2019 Clang vs. GCC john-the-ripper: Blowfish ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping mt-dgemm: Sustained Floating-Point Rate build-php: Time To Compile c-ray: Total Time - 4K, 16 Rays Per Pixel build-imagemagick: Time To Compile openssl: RSA 4096-bit Performance dav1d: Chimera 1080p 10-bit mkl-dnn: Recurrent Neural Network Training - f32 parboil: OpenMP MRI Gridding cpp-perf-bench: Rand Numbers tungsten: Non-Exponential qmcpack: fftw: Float + SSE - 2D FFT Size 4096 encode-mp3: WAV To MP3 cpp-perf-bench: Stepanov Vector aobench: 2048 x 2048 - Total Time tscp: AI Chess Performance libgav1: Summer Nature 4K encode-flac: WAV To FLAC libgav1: Chimera 1080p libgav1: Summer Nature 1080p tungsten: Hair mkl-dnn: IP Batch 1D - f32 cpp-perf-bench: Ctype tungsten: Volumetric Caustic libgav1: Chimera 1080p 10-bit cpp-perf-bench: Stepanov Abstraction askap: tConvolve OpenMP - Degridding rocksdb: Rand Fill mrbayes: Primate Phylogeny Analysis himeno: Poisson Pressure Solver fftw: Stock - 2D FFT Size 4096 aom-av1: AV1 Video Encoding cpp-perf-bench: Function Objects sqlite-speedtest: Timed Time - Size 1,000 dav1d: Chimera 1080p tungsten: Water Caustic vpxenc: vpxenc VP9 1080p Video Encode stockfish: Total Time rocksdb: Seq Fill lczero: Rand rodinia: OpenMP CFD Solver mkl-dnn: Convolution Batch conv_alexnet - f32 askap: tConvolve OpenMP - Gridding cpp-perf-bench: Math Library x265: H.265 1080p Video Encoding minife: Small pgbench: Buffer Test - Normal Load - Read Only compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 dav1d: Summer Nature 1080p dav1d: Summer Nature 4K mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 rocksdb: Rand Read byte: Dhrystone 2 compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 nginx: Static Web Page Serving x264: H.264 Video Encoding cpp-perf-bench: Atol n-queens: Elapsed Time askap: tConvolve MT - Degridding crafty: Elapsed Time askap: tConvolve MT - Gridding apache: Static Web Page Serving rocksdb: Rand Fill Sync sqlite: 1 parboil: OpenMP Stencil rodinia: OpenMP LavaMD rocksdb: Read While Writing gromacs: Water Benchmark parboil: OpenMP CUTCP pgbench: Buffer Test - Normal Load - Read Write smallpt: Global Illumination Renderer; 128 Samples rodinia: OpenMP Streamcluster lczero: BLAS GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 GCC 9.2.1 LLVM Clang 9.0.0 41784 961.974 8.863183 50.242 19.134 15.064 7173.7 94.47 194.077 62.941657 1061.480 3.36927 1878.1 20287 6.701 76.263 27.389 1345991 23.89 8.042 51.70 77.92 9.44960 1.86131 32.311 3.96697 21.93 28.295 3716.33 930897 69.786 4898.170821 8135.2 0.15 13.838 57.116 612.94 18.2157 195.63 80909082 1024733 105805 9.185 124.282 5255.51 262.968 65.74 7767.98 673644.958285 10.052 676.37 289.29 52.4691 145368967 48002215.1 20.042 43035.46 199.22 58.278 4.378 3339.33 9027835 1943.58 34157.76 24277 14.235 7.551706 10.085 4945916 2.515 1.257410 15513.440423 4.775 19.069 29.3289 1932 78.178 30.839 20.905 5189.1 74.45 1282.033 4.05176 7.788 66.782 31.745 1528248 21.17 7.145 46.69 70.28 8.72625 29.795 3.72488 23.11 25.864 70.706 4866.903427 0.15 14.764 59.906 625.11 19.1580 207.68 258.404 67.11 668836.605453 10.107 685.40 291.76 19.687 43580.60 196.10 57.843 34327.90 14.248 14868.678849 41482 935.123 8.385972 44.939 19.155 13.505 7178.8 99.72 194.766 49.000965 1069.483 3.37419 1893.6 24184 6.729 77.501 28.620 1373089 23.74 8.073 51.84 77.11 9.70952 1.73710 32.220 4.07902 22.01 28.523 4096.25 1023335 70.330 4583.346212 8111.6 0.14 14.816 56.169 587.24 19.1975 197.37 76737462 1083649 105996 9.151 125.008 5509.3 269.484 64.69 7777.50 654500.676350 10.040 667.33 285.28 52.2820 142287766 48969411.2 19.753 43426.72 197.93 57.473 4.429 3376.27 8959937 1955.97 34228.50 24368 14.252 7.576396 10.101 4950890 2.516 1.257716 30910.991785 3.492 19.761 32.7598 60586 69.8522 0.791403 64.763 30.972 13.531 7191.6 73.36 145.850 1296.984 4.07043 2262.8 23559 7.951 67.108 31.585 1487001 23.94 7.192 52.47 78.57 8.70024 1.67315 29.085 3.68186 24.19 26.033 64.854 4983.281786 7531.2 0.15 14.698 58.332 623.05 19.3485 207.78 81336054 100236.8 9.633 118.989 260.638 66.27 8037.33 675821.095833 9.817 674.39 288.60 51.3284 19.685 43783.30 198.29 57.995 34144.39 14.200 31491.377190 18.818 33.3033 OpenBenchmarking.org
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 13K 26K 39K 52K 65K SE +/- 246.24, N = 3 SE +/- 170.74, N = 3 SE +/- 184.04, N = 3 SE +/- 3.28, N = 3 60586 41784 41482 1932 -fopenmp -fopenmp -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 9.0.0 200 400 600 800 1000 SE +/- 1.33, N = 3 SE +/- 2.99, N = 3 SE +/- 0.09, N = 3 961.97 935.12 69.85 1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 9.0.0 2 4 6 8 10 SE +/- 0.059593, N = 3 SE +/- 0.037368, N = 3 SE +/- 0.007958, N = 3 8.863183 8.385972 0.791403 1. (CC) gcc options: -O3 -march=native -fopenmp
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.1.9 Time To Compile GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.24, N = 3 SE +/- 0.12, N = 3 44.94 50.24 64.76 78.18 1. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 SE +/- 0.11, N = 3 19.13 19.16 30.84 30.97 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile GCC 9.2.1 LLVM Clang 9.0.0 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 13.51 13.53 15.06 20.91
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 1500 3000 4500 6000 7500 SE +/- 29.40, N = 3 SE +/- 21.05, N = 3 SE +/- 20.75, N = 3 SE +/- 21.60, N = 3 7191.6 7178.8 7173.7 5189.1 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Chimera 1080p 10-bit GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 20 40 60 80 100 SE +/- 0.24, N = 3 SE +/- 0.13, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 99.72 94.47 74.45 73.36 MIN: 60.51 / MAX: 199.48 MIN: 56.43 / MAX: 192.99 MIN: 45.96 / MAX: 154.32 MIN: 45.63 / MAX: 149.13 1. (CC) gcc options: -O3 -march=native -pthread
MKL-DNN DNNL Harness: Recurrent Neural Network Training - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Recurrent Neural Network Training - Data Type: f32 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 40 80 120 160 200 SE +/- 0.38, N = 3 SE +/- 0.39, N = 3 SE +/- 0.16, N = 3 145.85 194.08 194.77 -fopenmp=libomp - MIN: 143.95 -fopenmp - MIN: 192.29 -fopenmp - MIN: 192.97 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding GCC 9.2.1 GCC 10.0.0 20191208 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.24, N = 3 49.00 62.94 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
CppPerformanceBenchmarks Test: Random Numbers OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 300 600 900 1200 1500 SE +/- 10.04, N = 3 SE +/- 2.69, N = 3 SE +/- 12.43, N = 3 SE +/- 0.16, N = 3 1061.48 1069.48 1282.03 1296.98 1. (CXX) g++ options: -O3 -march=native -std=c++11
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 0.9158 1.8316 2.7474 3.6632 4.579 SE +/- 0.01371, N = 3 SE +/- 0.01280, N = 3 SE +/- 0.01369, N = 3 SE +/- 0.00501, N = 3 3.36927 3.37419 4.05176 4.07043 -fstrict-aliasing -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl
QMCPACK OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.8 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 9.0.0 500 1000 1500 2000 2500 1878.1 1893.6 2262.8 -finline-limit=1000 -funroll-all-loops -finline-limit=1000 -funroll-all-loops 1. (CXX) g++ options: -O3 -march=native -fopenmp -fomit-frame-pointer -fstrict-aliasing -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 GCC 9.2.1 LLVM Clang 9.0.0 GCC 10.0.0 20191208 5K 10K 15K 20K 25K SE +/- 309.06, N = 4 SE +/- 404.65, N = 3 SE +/- 239.62, N = 3 24184 23559 20287 1. (CC) gcc options: -pthread -O3 -march=native -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 2 4 6 8 10 SE +/- 0.007, N = 3 SE +/- 0.012, N = 3 SE +/- 0.008, N = 3 SE +/- 0.001, N = 6 6.701 6.729 7.788 7.951 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -lncurses 1. (CC) gcc options: -O3 -pipe -march=native -lm
CppPerformanceBenchmarks Test: Stepanov Vector OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 66.78 67.11 76.26 77.50 1. (CXX) g++ options: -O3 -march=native -std=c++11
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.30, N = 3 SE +/- 0.01, N = 3 SE +/- 0.45, N = 15 27.39 28.62 31.59 31.75 1. (CC) gcc options: -lm -O3 -march=native
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 300K 600K 900K 1200K 1500K SE +/- 1690.40, N = 5 SE +/- 2711.28, N = 5 SE +/- 9469.71, N = 5 SE +/- 1231.70, N = 5 1528248 1487001 1373089 1345991 1. (CC) gcc options: -O3 -march=native
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 4K LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 23.94 23.89 23.74 21.17 1. (CXX) g++ options: -O3 -march=native -lpthread
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 2 4 6 8 10 SE +/- 0.003, N = 5 SE +/- 0.005, N = 5 SE +/- 0.006, N = 5 SE +/- 0.007, N = 5 7.145 7.192 8.042 8.073 -fvisibility=hidden -fvisibility=hidden 1. (CXX) g++ options: -O3 -march=native -logg -lm
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Chimera 1080p LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 52.47 51.84 51.70 46.69 1. (CXX) g++ options: -O3 -march=native -lpthread
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 1080p LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 20 40 60 80 100 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 78.57 77.92 77.11 70.28 1. (CXX) g++ options: -O3 -march=native -lpthread
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 10.0.0 20191208 GCC 9.2.1 3 6 9 12 15 SE +/- 0.01343, N = 3 SE +/- 0.01435, N = 3 SE +/- 0.03984, N = 3 SE +/- 0.02093, N = 3 8.70024 8.72625 9.44960 9.70952 -fstrict-aliasing -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl
MKL-DNN DNNL Harness: IP Batch 1D - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: IP Batch 1D - Data Type: f32 LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 0.4188 0.8376 1.2564 1.6752 2.094 SE +/- 0.00428, N = 3 SE +/- 0.00269, N = 3 SE +/- 0.00636, N = 3 1.67315 1.73710 1.86131 -fopenmp=libomp - MIN: 1.62 -fopenmp - MIN: 1.67 -fopenmp - MIN: 1.81 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl
CppPerformanceBenchmarks Test: Ctype OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 9.2.1 GCC 10.0.0 20191208 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.30, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 29.09 29.80 32.22 32.31 1. (CXX) g++ options: -O3 -march=native -std=c++11
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 10.0.0 20191208 GCC 9.2.1 0.9178 1.8356 2.7534 3.6712 4.589 SE +/- 0.00668, N = 3 SE +/- 0.00780, N = 3 SE +/- 0.04581, N = 3 SE +/- 0.03613, N = 3 3.68186 3.72488 3.96697 4.07902 -fstrict-aliasing -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Chimera 1080p 10-bit LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 9.2.1 GCC 10.0.0 20191208 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 24.19 23.11 22.01 21.93 1. (CXX) g++ options: -O3 -march=native -lpthread
CppPerformanceBenchmarks Test: Stepanov Abstraction OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 25.86 26.03 28.30 28.52 1. (CXX) g++ options: -O3 -march=native -std=c++11
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Degridding GCC 9.2.1 GCC 10.0.0 20191208 900 1800 2700 3600 4500 SE +/- 0.00, N = 3 SE +/- 46.05, N = 3 4096.25 3716.33 1. (CXX) g++ options: -lpthread
Facebook RocksDB Test: Random Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill GCC 9.2.1 GCC 10.0.0 20191208 200K 400K 600K 800K 1000K SE +/- 4040.85, N = 3 SE +/- 13468.75, N = 3 1023335 930897 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 10.0 20191222 16 32 48 64 80 SE +/- 0.36, N = 3 SE +/- 0.32, N = 3 SE +/- 0.27, N = 3 SE +/- 0.32, N = 3 64.85 69.79 70.33 70.71 -mabm -mabm 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver LLVM Clang 9.0.0 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 GCC 9.2.1 1100 2200 3300 4400 5500 SE +/- 84.96, N = 3 SE +/- 51.56, N = 7 SE +/- 77.03, N = 3 SE +/- 61.37, N = 3 4983.28 4898.17 4866.90 4583.35 1. (CC) gcc options: -O3 -march=native -mavx2
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 GCC 10.0.0 20191208 GCC 9.2.1 LLVM Clang 9.0.0 2K 4K 6K 8K 10K SE +/- 76.26, N = 3 SE +/- 26.44, N = 3 SE +/- 24.98, N = 3 8135.2 8111.6 7531.2 1. (CC) gcc options: -pthread -O3 -march=native -lm
AOM AV1 AV1 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2019-09-16 AV1 Video Encoding LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 10.0.0 20191208 GCC 9.2.1 0.0338 0.0676 0.1014 0.1352 0.169 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.15 0.15 0.15 0.14 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
CppPerformanceBenchmarks Test: Function Objects OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects GCC 10.0.0 20191208 LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 9.2.1 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.15, N = 3 13.84 14.70 14.76 14.82 1. (CXX) g++ options: -O3 -march=native -std=c++11
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 13 26 39 52 65 SE +/- 0.46, N = 3 SE +/- 0.10, N = 3 SE +/- 0.44, N = 3 SE +/- 0.13, N = 3 56.17 57.12 58.33 59.91 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Chimera 1080p LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 130 260 390 520 650 SE +/- 1.62, N = 3 SE +/- 2.49, N = 3 SE +/- 4.43, N = 3 SE +/- 2.39, N = 3 625.11 623.05 612.94 587.24 MIN: 468.04 / MAX: 781 MIN: 475.37 / MAX: 782.95 MIN: 452.17 / MAX: 769.81 MIN: 439.81 / MAX: 722.49 1. (CC) gcc options: -O3 -march=native -pthread
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 GCC 9.2.1 LLVM Clang 9.0.0 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.22 19.16 19.20 19.35 -fstrict-aliasing -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl
VP9 libvpx Encoding vpxenc VP9 1080p Video Encode OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.1 vpxenc VP9 1080p Video Encode LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 9.2.1 GCC 10.0.0 20191208 50 100 150 200 250 SE +/- 1.20, N = 3 SE +/- 1.73, N = 15 SE +/- 1.49, N = 3 SE +/- 1.29, N = 3 207.78 207.68 197.37 195.63 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 20M 40M 60M 80M 100M SE +/- 194228.81, N = 3 SE +/- 1221575.84, N = 3 SE +/- 74252.03, N = 3 81336054 80909082 76737462 1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
Facebook RocksDB Test: Sequential Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Sequential Fill GCC 9.2.1 GCC 10.0.0 20191208 200K 400K 600K 800K 1000K SE +/- 10427.62, N = 3 SE +/- 4276.33, N = 3 1083649 1024733 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
LeelaChessZero Backend: Random OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.22.0 Backend: Random GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 9.0.0 20K 40K 60K 80K 100K SE +/- 354.64, N = 3 SE +/- 275.83, N = 3 SE +/- 553.66, N = 3 105996.0 105805.0 100236.8 1. (CXX) g++ options: -O3 -march=native -lpthread
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 9.0.0 3 6 9 12 15 SE +/- 0.052, N = 3 SE +/- 0.029, N = 3 SE +/- 0.133, N = 3 9.151 9.185 9.633 -O2 -lOpenCL -O2 -lOpenCL -O3 -fopenmp 1. (CXX) g++ options:
MKL-DNN DNNL Harness: Convolution Batch conv_alexnet - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Convolution Batch conv_alexnet - Data Type: f32 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 1.45, N = 3 SE +/- 1.55, N = 3 118.99 124.28 125.01 -fopenmp=libomp - MIN: 118.28 -fopenmp - MIN: 122.25 -fopenmp - MIN: 122.48 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Gridding GCC 9.2.1 GCC 10.0.0 20191208 1200 2400 3600 4800 6000 SE +/- 37.73, N = 3 SE +/- 34.80, N = 3 5509.30 5255.51 1. (CXX) g++ options: -lpthread
CppPerformanceBenchmarks Test: Math Library OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 60 120 180 240 300 SE +/- 2.46, N = 3 SE +/- 0.41, N = 3 SE +/- 0.29, N = 3 SE +/- 0.45, N = 3 258.40 260.64 262.97 269.48 1. (CXX) g++ options: -O3 -march=native -std=c++11
x265 H.265 1080p Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x265 3.1.2 H.265 1080p Video Encoding LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 15 30 45 60 75 SE +/- 0.39, N = 3 SE +/- 0.07, N = 3 SE +/- 0.29, N = 3 SE +/- 0.10, N = 3 67.11 66.27 65.74 64.69 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 2K 4K 6K 8K 10K SE +/- 3.20, N = 3 SE +/- 9.20, N = 3 SE +/- 4.74, N = 3 8037.33 7777.50 7767.98 1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only LLVM Clang 9.0.0 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 GCC 9.2.1 140K 280K 420K 560K 700K SE +/- 2145.94, N = 3 SE +/- 876.37, N = 3 SE +/- 1018.99, N = 3 SE +/- 1703.60, N = 3 675821.10 673644.96 668836.61 654500.68 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
Zstd Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 3 6 9 12 15 SE +/- 0.100, N = 3 SE +/- 0.125, N = 3 SE +/- 0.089, N = 3 SE +/- 0.033, N = 3 9.817 10.040 10.052 10.107 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Summer Nature 1080p LLVM Clang 10.0 20191222 GCC 10.0.0 20191208 LLVM Clang 9.0.0 GCC 9.2.1 150 300 450 600 750 SE +/- 2.78, N = 3 SE +/- 0.42, N = 3 SE +/- 1.73, N = 3 SE +/- 1.97, N = 3 685.40 676.37 674.39 667.33 MIN: 400.43 / MAX: 750.08 MIN: 396.97 / MAX: 738.73 MIN: 376.28 / MAX: 738.32 MIN: 387.89 / MAX: 728.98 1. (CC) gcc options: -O3 -march=native -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Summer Nature 4K LLVM Clang 10.0 20191222 GCC 10.0.0 20191208 LLVM Clang 9.0.0 GCC 9.2.1 60 120 180 240 300 SE +/- 0.34, N = 3 SE +/- 0.64, N = 3 SE +/- 0.37, N = 3 SE +/- 1.13, N = 3 291.76 289.29 288.60 285.28 MIN: 174.15 / MAX: 309.6 MIN: 172.47 / MAX: 306.73 MIN: 170.73 / MAX: 306.52 MIN: 168.88 / MAX: 304.29 1. (CC) gcc options: -O3 -march=native -pthread
MKL-DNN DNNL Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 SE +/- 0.24, N = 3 51.33 52.28 52.47 -fopenmp=libomp - MIN: 50.48 -fopenmp - MIN: 51.18 -fopenmp - MIN: 51.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Read GCC 10.0.0 20191208 GCC 9.2.1 30M 60M 90M 120M 150M SE +/- 467553.12, N = 3 SE +/- 234281.58, N = 3 145368967 142287766 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 GCC 9.2.1 GCC 10.0.0 20191208 10M 20M 30M 40M 50M SE +/- 70447.90, N = 3 SE +/- 638551.10, N = 4 48969411.2 48002215.1 1. (CC) gcc options: -O3 -march=native
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 9.2.1 GCC 10.0.0 20191208 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.16, N = 3 19.69 19.69 19.75 20.04 1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.9.9 Static Web Page Serving LLVM Clang 9.0.0 LLVM Clang 10.0 20191222 GCC 9.2.1 GCC 10.0.0 20191208 9K 18K 27K 36K 45K SE +/- 326.86, N = 3 SE +/- 238.62, N = 3 SE +/- 457.53, N = 3 SE +/- 490.20, N = 3 43783.30 43580.60 43426.72 43035.46 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding GCC 10.0.0 20191208 LLVM Clang 9.0.0 GCC 9.2.1 LLVM Clang 10.0 20191222 40 80 120 160 200 SE +/- 2.04, N = 8 SE +/- 1.58, N = 12 SE +/- 0.53, N = 3 SE +/- 0.76, N = 3 199.22 198.29 197.93 196.10 -mstack-alignment=64 -mstack-alignment=64 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
CppPerformanceBenchmarks Test: Atol OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Atol GCC 9.2.1 LLVM Clang 10.0 20191222 LLVM Clang 9.0.0 GCC 10.0.0 20191208 13 26 39 52 65 SE +/- 0.20, N = 3 SE +/- 0.26, N = 3 SE +/- 0.25, N = 3 SE +/- 0.27, N = 3 57.47 57.84 58.00 58.28 1. (CXX) g++ options: -O3 -march=native -std=c++11
N-Queens Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better N-Queens 1.0 Elapsed Time GCC 10.0.0 20191208 GCC 9.2.1 0.9965 1.993 2.9895 3.986 4.9825 SE +/- 0.004, N = 3 SE +/- 0.007, N = 3 4.378 4.429 1. (CC) gcc options: -static -fopenmp -O3 -march=native
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve MT - Degridding GCC 9.2.1 GCC 10.0.0 20191208 700 1400 2100 2800 3500 SE +/- 2.38, N = 3 SE +/- 13.81, N = 3 3376.27 3339.33 1. (CXX) g++ options: -lpthread
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 10.0.0 20191208 GCC 9.2.1 2M 4M 6M 8M 10M SE +/- 15301.93, N = 3 SE +/- 11113.29, N = 3 9027835 8959937 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve MT - Gridding GCC 9.2.1 GCC 10.0.0 20191208 400 800 1200 1600 2000 SE +/- 1.51, N = 3 SE +/- 10.21, N = 3 1955.97 1943.58 1. (CXX) g++ options: -lpthread
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving LLVM Clang 10.0 20191222 GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 9.0.0 7K 14K 21K 28K 35K SE +/- 71.82, N = 3 SE +/- 27.04, N = 3 SE +/- 54.98, N = 3 SE +/- 32.67, N = 3 34327.90 34228.50 34157.76 34144.39 1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native
Facebook RocksDB Test: Random Fill Sync OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill Sync GCC 9.2.1 GCC 10.0.0 20191208 5K 10K 15K 20K 25K SE +/- 29.29, N = 3 SE +/- 40.43, N = 3 24368 24277 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
SQLite Threads / Copies: 1 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.30.1 Threads / Copies: 1 LLVM Clang 9.0.0 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 GCC 9.2.1 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 14.20 14.24 14.25 14.25 1. (CC) gcc options: -O3 -march=native -lz -lm -ldl -lpthread
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil GCC 10.0.0 20191208 GCC 9.2.1 2 4 6 8 10 SE +/- 0.036635, N = 3 SE +/- 0.021046, N = 3 7.551706 7.576396 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD GCC 10.0.0 20191208 GCC 9.2.1 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 10.09 10.10 1. (CXX) g++ options: -O2 -lOpenCL
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Read While Writing GCC 9.2.1 GCC 10.0.0 20191208 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 73789.66, N = 4 SE +/- 38218.40, N = 15 4950890 4945916 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2019.4 Water Benchmark GCC 9.2.1 GCC 10.0.0 20191208 0.5661 1.1322 1.6983 2.2644 2.8305 SE +/- 0.001, N = 3 SE +/- 0.005, N = 3 2.516 2.515 1. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP GCC 10.0.0 20191208 GCC 9.2.1 0.283 0.566 0.849 1.132 1.415 SE +/- 0.007993, N = 3 SE +/- 0.002849, N = 3 1.257410 1.257716 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 LLVM Clang 10.0 20191222 7K 14K 21K 28K 35K SE +/- 127.72, N = 3 SE +/- 77.46, N = 3 SE +/- 2238.03, N = 12 SE +/- 1738.68, N = 15 31491.38 30910.99 15513.44 14868.68 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 9.2.1 GCC 10.0.0 20191208 1.0744 2.1488 3.2232 4.2976 5.372 SE +/- 0.006, N = 3 SE +/- 0.107, N = 15 3.492 4.775 1. (CXX) g++ options: -fopenmp -O3 -march=native
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster LLVM Clang 9.0.0 GCC 10.0.0 20191208 GCC 9.2.1 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.54, N = 15 18.82 19.07 19.76 -O3 -fopenmp -O2 -lOpenCL -O2 -lOpenCL 1. (CXX) g++ options:
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.22.0 Backend: BLAS LLVM Clang 9.0.0 GCC 9.2.1 GCC 10.0.0 20191208 8 16 24 32 40 SE +/- 0.60, N = 15 SE +/- 0.56, N = 12 SE +/- 0.41, N = 3 33.30 32.76 29.33 1. (CXX) g++ options: -O3 -march=native -lpthread
Phoronix Test Suite v10.8.4