Tests for a future article on Phoronix.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1801022-AL-CLANGCC8680 GCC 8.0 vs. Clang 6.0 AMD EPYC Tuning Comparison - Phoronix Test Suite GCC 8.0 vs. Clang 6.0 AMD EPYC Tuning Comparison Tests for a future article on Phoronix.
HTML result view exported from: https://openbenchmarking.org/result/1801022-AL-CLANGCC8680&grt&rdt .
Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Driver OpenCL Compiler File-System Screen Resolution System Layer Clang 6.0 GCC 8.0 -march=znver1 -march=x86-64 -march=znver1 -march=x86-64 AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores) TYAN B8026T70AE24HR AMD Device 1450 126976MB 280GB INTEL SSDPE21D280GA ASPEED ASPEED Family VE228 Broadcom Limited NetXtreme BCM5720 Gigabit PCIe Ubuntu 17.10 4.13.0-21-generic (x86_64) GNOME Shell 3.26.1 modesetting 1.19.5 OpenCL 1.2 pocl 1.0 LLVM 5.0.0 Clang 6.0.0 (SVN 321623) + LLVM 6.0.0svn ext4 1920x1080 vm-other Xen 4.9.0 Hypervisor GCC 8.0.0 20171231 + clang (GCC) 8.0.0 20171231 (experimental) + LLVM 5.0.0 OpenBenchmarking.org Compiler Details - Clang 6.0: -march=znver1: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver1 - Clang 6.0: -march=x86-64: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver1 - GCC 8.0: -march=znver1: --disable-multilib --enable-checking=release - GCC 8.0: -march=x86-64: --disable-multilib --enable-checking=release Disk Details - NONE / data=ordered,errors=remount-ro,relatime,rw
apache: Static Web Page Serving bullet: Raytests bullet: 3000 Fall bullet: 1000 Stack bullet: 1000 Convex bullet: 136 Ragdolls bullet: Prim Trimesh bullet: Convex Trimesh c-ray: Total Time ebizzy: fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 4096 encode-flac: WAV To FLAC graphics-magick: Blur graphics-magick: Sharpen graphics-magick: HWB Color Space graphics-magick: Local Adaptive Thresholding himeno: Poisson Pressure Solver encode-mp3: WAV To MP3 polybench-c: 3 Matrix Multiplications scimark2: Composite scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation sqlite: Default Test Directory hmmer: Pfam Database Search tscp: AI Chess Performance Clang 6.0 GCC 8.0 -march=znver1 -march=x86-64 -march=znver1 -march=x86-64 9663.93 3.18 5.34 6.08 5.31 3.23 1.09 1.32 4.48 1145405 5031.60 12481 6.63 104 136 155 98 1052.47 12.81 62.75 1699.32 552.19 226.68 2258.64 4034.89 1424.21 7.48 11.09 918269 9531.43 3.22 5.48 6.30 5.43 3.28 1.10 1.33 4.53 1076648 4660.83 13649 7.94 101 131 150 97 1032.71 11.33 62.98 1479.53 531.38 179.29 2190.10 3190.43 1110.65 7.53 12.85 917658 9791.23 3.06 5.27 5.93 5.28 3.19 1.10 1.30 3.37 1101176 5627.83 13630 7.45 123 165 186 95 935.64 10.81 65.45 1680.45 555.76 231.09 2259.95 3678.86 1676.62 7.16 12.40 875085 9841.30 3.12 5.34 6.18 5.44 3.26 1.10 1.34 3.93 1126032 4959.73 7.12 116 157 177 92 949.19 11.10 60.68 1579.48 561.03 233.89 2263.87 3513.11 1423.14 7.61 13.65 874251 OpenBenchmarking.org
Apache Benchmark Static Web Page Serving Clang 6.0 GCC 8.0 OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.7 Static Web Page Serving -march=znver1 -march=x86-64 2K 4K 6K 8K 10K SE +/- 121.20, N = 3 SE +/- 161.49, N = 3 SE +/- 35.54, N = 3 SE +/- 22.88, N = 3 9663.93 9531.43 9791.23 9841.30 1. (CC) gcc options: -shared -fPIC -pthread -O3
Bullet Physics Engine Test: Raytests Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Raytests -march=znver1 -march=x86-64 0.7245 1.449 2.1735 2.898 3.6225 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.18 3.22 3.06 3.12 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 3000 Fall Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall -march=znver1 -march=x86-64 1.233 2.466 3.699 4.932 6.165 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.34 5.48 5.27 5.34 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 1000 Stack Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack -march=znver1 -march=x86-64 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 6.08 6.30 5.93 6.18 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 1000 Convex Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex -march=znver1 -march=x86-64 1.224 2.448 3.672 4.896 6.12 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 5.31 5.43 5.28 5.44 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 136 Ragdolls Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls -march=znver1 -march=x86-64 0.738 1.476 2.214 2.952 3.69 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.23 3.28 3.19 3.26 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Prim Trimesh Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh -march=znver1 -march=x86-64 0.2475 0.495 0.7425 0.99 1.2375 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.09 1.10 1.10 1.10 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Convex Trimesh Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Convex Trimesh -march=znver1 -march=x86-64 0.3015 0.603 0.9045 1.206 1.5075 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.32 1.33 1.30 1.34 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
C-Ray Total Time Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time -march=znver1 -march=x86-64 1.0193 2.0386 3.0579 4.0772 5.0965 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 4.48 4.53 3.37 3.93 1. (CC) gcc options: -lm -lpthread -O3
ebizzy Clang 6.0 GCC 8.0 OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 -march=znver1 -march=x86-64 200K 400K 600K 800K 1000K SE +/- 17141.58, N = 6 SE +/- 12350.97, N = 3 SE +/- 19461.21, N = 3 SE +/- 20747.16, N = 6 1145405 1076648 1101176 1126032 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
FFTW Build: Stock - Size: 2D FFT Size 4096 Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -march=znver1 -march=x86-64 1200 2400 3600 4800 6000 SE +/- 17.93, N = 3 SE +/- 57.25, N = 3 SE +/- 10.37, N = 3 SE +/- 1.16, N = 3 5031.60 4660.83 5627.83 4959.73 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 -march=znver1 -march=x86-64 3K 6K 9K 12K 15K SE +/- 76.70, N = 3 SE +/- 100.95, N = 3 SE +/- 15.38, N = 3 12481 13649 13630 1. (CC) gcc options: -pthread -O3 -lm
FLAC Audio Encoding WAV To FLAC Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.1 WAV To FLAC -march=znver1 -march=x86-64 2 4 6 8 10 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 6.63 7.94 7.45 7.12 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
GraphicsMagick Operation: Blur Clang 6.0 GCC 8.0 OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Blur -march=znver1 -march=x86-64 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 104 101 123 116 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Sharpen Clang 6.0 GCC 8.0 OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Sharpen -march=znver1 -march=x86-64 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 136 131 165 157 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: HWB Color Space Clang 6.0 GCC 8.0 OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: HWB Color Space -march=znver1 -march=x86-64 40 80 120 160 200 SE +/- 0.88, N = 3 SE +/- 1.20, N = 3 155 150 186 177 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding Clang 6.0 GCC 8.0 OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Local Adaptive Thresholding -march=znver1 -march=x86-64 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 98 97 95 92 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
Himeno Benchmark Poisson Pressure Solver Clang 6.0 GCC 8.0 OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver -march=znver1 -march=x86-64 200 400 600 800 1000 SE +/- 1.50, N = 3 SE +/- 1.46, N = 3 SE +/- 0.79, N = 3 SE +/- 0.43, N = 3 1052.47 1032.71 935.64 949.19 1. (CC) gcc options: -O3 -mavx2
LAME MP3 Encoding WAV To MP3 Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.5 WAV To MP3 -march=znver1 -march=x86-64 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 12.81 11.33 10.81 11.10 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm
PolyBench-C Test: 3 Matrix Multiplications Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 3.2 Test: 3 Matrix Multiplications -march=znver1 -march=x86-64 15 30 45 60 75 SE +/- 0.26, N = 3 SE +/- 0.24, N = 3 SE +/- 0.21, N = 3 SE +/- 0.03, N = 3 62.75 62.98 65.45 60.68 1. (CC) gcc options: -O3 -march=native
SciMark Computational Test: Composite Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite -march=znver1 -march=x86-64 400 800 1200 1600 2000 SE +/- 25.95, N = 4 SE +/- 36.18, N = 8 SE +/- 20.36, N = 4 SE +/- 20.51, N = 7 1699.32 1479.53 1680.45 1579.48 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Monte Carlo Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo -march=znver1 -march=x86-64 120 240 360 480 600 SE +/- 0.05, N = 4 SE +/- 0.05, N = 4 SE +/- 0.01, N = 4 SE +/- 0.04, N = 4 552.19 531.38 555.76 561.03 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Fast Fourier Transform Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform -march=znver1 -march=x86-64 50 100 150 200 250 SE +/- 0.38, N = 4 SE +/- 45.12, N = 4 SE +/- 0.14, N = 4 SE +/- 0.35, N = 4 226.68 179.29 231.09 233.89 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Sparse Matrix Multiply Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply -march=znver1 -march=x86-64 500 1000 1500 2000 2500 SE +/- 29.07, N = 4 SE +/- 11.89, N = 4 SE +/- 9.55, N = 4 SE +/- 6.72, N = 4 2258.64 2190.10 2259.95 2263.87 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Dense LU Matrix Factorization Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization -march=znver1 -march=x86-64 900 1800 2700 3600 4500 SE +/- 135.63, N = 4 SE +/- 17.92, N = 4 SE +/- 110.38, N = 4 SE +/- 177.13, N = 4 4034.89 3190.43 3678.86 3513.11 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation Clang 6.0 GCC 8.0 OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation -march=znver1 -march=x86-64 400 800 1200 1600 2000 SE +/- 0.49, N = 4 SE +/- 310.16, N = 4 SE +/- 0.77, N = 4 SE +/- 0.28, N = 4 1424.21 1110.65 1676.62 1423.14 1. (CC) gcc options: -O3 -lm
SQLite Test Target: Default Test Directory Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.8.10.2 Test Target: Default Test Directory -march=znver1 -march=x86-64 2 4 6 8 10 SE +/- 0.15, N = 6 SE +/- 0.06, N = 3 SE +/- 0.12, N = 6 SE +/- 0.12, N = 3 7.48 7.53 7.16 7.61 1. (CC) gcc options: -O3 -ldl -lpthread
Timed HMMer Search Pfam Database Search Clang 6.0 GCC 8.0 OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search -march=znver1 -march=x86-64 4 8 12 16 20 SE +/- 0.12, N = 3 SE +/- 1.28, N = 6 SE +/- 0.04, N = 3 SE +/- 1.74, N = 6 11.09 12.85 12.40 13.65 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
TSCP AI Chess Performance Clang 6.0 GCC 8.0 OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance -march=znver1 -march=x86-64 200K 400K 600K 800K 1000K SE +/- 306.40, N = 5 SE +/- 572.37, N = 5 SE +/- 556.88, N = 5 SE +/- 438.77, N = 5 918269 917658 875085 874251 1. (CC) gcc options: -O3 -march=native
Phoronix Test Suite v10.8.4