AMD EPYC compiler benchmarks.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1807210-RA-AMDEPYCCO34 AMD EPYC Compiler Testing July 2018 - Phoronix Test Suite AMD EPYC Compiler Testing July 2018 AMD EPYC compiler benchmarks.
HTML result view exported from: https://openbenchmarking.org/result/1807210-RA-AMDEPYCCO34&export=pdf&gru&rdt .
AMD EPYC Compiler Testing July 2018 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Compiler File-System Screen Resolution GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads) TYAN B8026T70AE24HR (V1.02.B10 BIOS) AMD Family 17h 129024MB 280GB INTEL SSDPE21D280GA ASPEED ASPEED Family VE228 Broadcom Limited NetXtreme BCM5720 Gigabit PCIe Ubuntu 18.10 4.16.0-041600-generic (x86_64) GCC 8.1.1 20180720 ext4 1920x1080 GCC 9.0.0 20180720 OpenBenchmarking.org Environment Details - GCC 8.2 RC1: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - GCC 8.2 RC1: -O3 -march=native - PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - GCC 8.2 RC1: -O3: CXXFLAGS=-O3 CFLAGS=-O3 - GCC 9.0 SVN: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native Compiler Details - --disable-multilib --enable-checking=release Disk Details - NONE / data=ordered,errors=remount-ro,relatime,rw Processor Details - Scaling Governor: acpi-cpufreq ondemand Security Details - __user pointer sanitization + Full AMD retpoline IBPB Protection
AMD EPYC Compiler Testing July 2018 ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping vpxenc: vpxenc graphics-magick: HWB Color Space graphics-magick: Blur graphics-magick: Local Adaptive Thresholding graphics-magick: Resizing graphics-magick: Sharpen tinymembench: Standard Memcpy tjbench: Decompression Throughput scimark2: Composite scimark2: Fast Fourier Transform scimark2: Jacobi Successive Over-Relaxation scimark2: Monte Carlo scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization fftw: Stock - 1D FFT Size 1024 fftw: Stock - 2D FFT Size 1024 himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test crafty: Elapsed Time stockfish: Total Time tscp: AI Chess Performance ebizzy: redis: SET redis: GET redis: LPUSH redis: LPOP redis: SADD apache: Static Web Page Serving openssl: RSA 4096-bit Performance pgbench: Buffer Test - Single Thread - Read Write pgbench: Buffer Test - Single Thread - Read Only pgbench: Buffer Test - Normal Load - Read Write pgbench: Buffer Test - Normal Load - Read Only polybench-c: 3 Matrix Multiplications polybench-c: Correlation Computation polybench-c: Covariance Computation sqlite: Timed SQLite Insertions aobench: 2048 x 2048 - Total Time compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 m-queens: Time To Solve c-ray: Total Time smallpt: Global Illumination Renderer; 100 Samples encode-mp3: WAV To MP3 encode-flac: WAV To FLAC hmmer: Pfam Database Search bullet: 3000 Fall bullet: 1000 Stack bullet: 136 Ragdolls bullet: 1000 Convex bullet: Prim Trimesh bullet: Convex Trimesh bullet: Raytests sockperf: Latency Under Load sockperf: Latency Ping Pong GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 439.54 12.38 209 154 110 185 188 12497.20 142.18 1943.30 219.38 1687.64 556.37 2421.38 4831.75 8518.67 6557.83 982.67 100684 6014866 59730290 851019 1147009 1375161.96 1691908.00 1108304.54 1108268.08 1430638.45 16205.32 4610.63 2071.78 14374.13 36226.69 408873.25 3.58 5.34 5.34 2.89 50.80 155.64 32.02 2.52 4 10.51 12.63 7.65 5.02 5.79 3.09 5.32 1.13 1.33 3.11 19.75 6.42 3.04 46 18 23 29 6 12463.50 106.03 1766.38 214.57 1676.49 195.98 2442.87 4301.99 8289.10 6351.50 972.07 98726 5998759 1372510 729507 1131696 15210.47 4656.40 1427.38 9213.45 28411.49 242221.35 3.66 5.39 5.37 3.13 57.85 260.02 2694.47 566.22 533 15.40 12.87 192.26 31.29 6.88 12.29 12453.20 1877.54 219.09 1676.15 213.98 2413.24 4865.24 852.63 99762 6026655 61080869 919193 1112448 4617.23 3.35 5.33 5.42 49.05 32.44 2.51 3 441.58 12.36 199 147 108 175 177 12555.70 141.74 1874.09 221.30 1429.81 561.85 2473.98 4683.55 7176.77 5835.90 998.89 97917 5989207 59837512 851283 1113356 1332541.96 1813896.31 1187656.29 1143259.50 1517826.77 15891.72 4609.73 2044.30 13985.70 36248.65 408599.13 3.59 5.40 5.39 2.88 52.60 166.70 32.09 2.84 4 11.13 12.27 7.71 5.06 6.02 3.14 5.52 1.12 1.36 3.18 19.67 6.76 12.37 201 149 104 180 182 12377.60 145.66 1940.65 219.09 1688.09 556.37 2458.23 4781.45 8506.83 6531.37 1007.74 97385 5876297 59201545 856576 1111662 1284845.13 1586367.58 1133637.46 1152018.46 1423169.79 15856.03 4616.47 2015.18 14307.54 36115.94 401369.61 3.66 5.38 5.41 2.89 50.72 156.58 31.56 2.52 4 10.50 12.44 7.63 5.04 5.78 3.08 5.29 1.13 1.33 3.12 19.88 46 18 23 30 6 12474.20 106.90 1869.33 216.86 1678.45 190.06 2391.26 4869.99 8345.20 6448.40 1006.19 99362 5878800 756563 1142302 15244.02 4647.80 1348.77 8870.94 28285.32 238819.60 3.61 5.38 5.45 3.07 57.00 261.52 3263.94 561.50 560 15.49 12.87 192.14 18.06 6.79 OpenBenchmarking.org
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3a Phong Rendering With Soft-Shadow Mapping GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 100 200 300 400 500 SE +/- 1.94, N = 3 SE +/- 1.61, N = 3 439.54 3.04 441.58 -march=native 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
VP9 libvpx Encoding vpxenc GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.7.0 vpxenc -O3 -march=native PGO -O3 -march=native -O3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 12.38 12.29 12.36 12.37 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: HWB Color Space GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 50 100 150 200 250 SE +/- 0.58, N = 3 209 46 199 201 46 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Blur GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 30 60 90 120 150 154 18 147 149 18 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Local Adaptive Thresholding GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 20 40 60 80 100 SE +/- 0.33, N = 3 110 23 108 104 23 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Resizing GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 40 80 120 160 200 SE +/- 1.20, N = 3 SE +/- 1.00, N = 3 SE +/- 1.00, N = 3 185 29 175 180 30 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Sharpen GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 40 80 120 160 200 188 6 177 182 6 -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
Tinymembench Standard Memcpy OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memcpy GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3K 6K 9K 12K 15K 12497.20 12463.50 12453.20 12555.70 12377.60 12474.20 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O2 -O3 -lm
Tinymembench Standard Memset OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memset GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 2K 4K 6K 8K 10K 10063.10 10015.50 9975.20 10037.70 9982.80 10034.00 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O2 -O3 -lm
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 142.18 106.03 141.74 145.66 106.90 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 400 800 1200 1600 2000 SE +/- 5.96, N = 3 SE +/- 3.46, N = 3 SE +/- 5.53, N = 3 SE +/- 4.00, N = 3 1943.30 1766.38 1877.54 1874.09 1940.65 1869.33 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 50 100 150 200 250 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 219.38 214.57 219.09 221.30 219.09 216.86 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 400 800 1200 1600 2000 SE +/- 0.06, N = 3 SE +/- 0.26, N = 3 SE +/- 0.16, N = 3 SE +/- 0.72, N = 3 1687.64 1676.49 1676.15 1429.81 1688.09 1678.45 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 120 240 360 480 600 SE +/- 0.10, N = 3 SE +/- 0.25, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 556.37 195.98 213.98 561.85 556.37 190.06 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 500 1000 1500 2000 2500 SE +/- 11.03, N = 3 SE +/- 10.35, N = 3 SE +/- 1.01, N = 3 SE +/- 13.99, N = 3 2421.38 2442.87 2413.24 2473.98 2458.23 2391.26 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1000 2000 3000 4000 5000 SE +/- 20.82, N = 3 SE +/- 17.02, N = 3 SE +/- 27.07, N = 3 SE +/- 25.52, N = 3 4831.75 4301.99 4865.24 4683.55 4781.45 4869.99 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
FFTW Build: Stock - Size: 1D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 2K 4K 6K 8K 10K SE +/- 18.71, N = 3 SE +/- 8.90, N = 3 SE +/- 1.83, N = 3 8518.67 8289.10 7176.77 8506.83 8345.20 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Stock - Size: 2D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1400 2800 4200 5600 7000 SE +/- 73.40, N = 3 SE +/- 3.18, N = 3 SE +/- 16.29, N = 3 6557.83 6351.50 5835.90 6531.37 6448.40 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -pthread -O3 -lm
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 200 400 600 800 1000 SE +/- 0.78, N = 3 SE +/- 0.73, N = 3 SE +/- 2.17, N = 3 SE +/- 4.57, N = 3 982.67 972.07 852.63 998.89 1007.74 1006.19 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -mavx2
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 20K 40K 60K 80K 100K SE +/- 715.28, N = 3 SE +/- 555.15, N = 3 SE +/- 586.49, N = 3 SE +/- 465.88, N = 3 100684 98726 99762 97917 97385 99362 1. (CXX) g++ options: -pipe -lpthread
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 5146.22, N = 3 SE +/- 11021.16, N = 3 SE +/- 13110.39, N = 3 SE +/- 13470.58, N = 3 6014866 5998759 6026655 5989207 5876297 5878800 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native 13M 26M 39M 52M 65M SE +/- 576746.80, N = 3 SE +/- 512120.63, N = 3 SE +/- 551889.61, N = 3 SE +/- 756265.41, N = 3 59730290 1372510 61080869 59837512 59201545 -march=native -march=native -fprofile-correction -march=native 1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 200K 400K 600K 800K 1000K SE +/- 766.17, N = 5 SE +/- 1018.49, N = 5 SE +/- 833.02, N = 5 SE +/- 421.22, N = 5 851019 729507 919193 851283 856576 756563 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
ebizzy OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 200K 400K 600K 800K 1000K SE +/- 28582.82, N = 6 SE +/- 20234.15, N = 6 SE +/- 18183.13, N = 4 SE +/- 23788.74, N = 6 1147009 1131696 1112448 1113356 1111662 1142302 -fprofile-correction 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
Redis Test: SET GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET -O3 -march=native -O3 300K 600K 900K 1200K 1500K SE +/- 13759.71, N = 3 SE +/- 23587.86, N = 6 SE +/- 31380.33, N = 6 1375161.96 1332541.96 1284845.13 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: GET GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET -O3 -march=native -O3 400K 800K 1200K 1600K 2000K SE +/- 37527.46, N = 6 SE +/- 34086.99, N = 6 SE +/- 25099.79, N = 6 1691908.00 1813896.31 1586367.58 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: LPUSH GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH -O3 -march=native -O3 300K 600K 900K 1200K 1500K SE +/- 18347.41, N = 6 SE +/- 2157.90, N = 3 SE +/- 12639.17, N = 3 1108304.54 1187656.29 1133637.46 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: LPOP GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP -O3 -march=native -O3 200K 400K 600K 800K 1000K SE +/- 4079.04, N = 3 SE +/- 15264.52, N = 3 SE +/- 21693.88, N = 3 1108268.08 1143259.50 1152018.46 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: SADD GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD -O3 -march=native -O3 300K 600K 900K 1200K 1500K SE +/- 4081.67, N = 3 SE +/- 35661.81, N = 6 SE +/- 3767.82, N = 3 1430638.45 1517826.77 1423169.79 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3K 6K 9K 12K 15K SE +/- 193.23, N = 3 SE +/- 188.72, N = 3 SE +/- 89.48, N = 3 16205.32 15210.47 15891.72 15856.03 15244.02 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -shared -fPIC -pthread -O3
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.0f RSA 4096-bit Performance GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1000 2000 3000 4000 5000 SE +/- 21.56, N = 3 SE +/- 23.93, N = 3 SE +/- 22.71, N = 3 SE +/- 23.13, N = 3 4610.63 4656.40 4617.23 4609.73 4616.47 4647.80 -lssl -lssl -lssl 1. (CC) gcc options: -O3 -pthread -m64 -lcrypto -ldl
PostgreSQL pgbench Scaling: Buffer Test - Test: Single Thread - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Write GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 400 800 1200 1600 2000 SE +/- 12.52, N = 3 SE +/- 30.23, N = 5 SE +/- 37.36, N = 3 2071.78 1427.38 2044.30 2015.18 1348.77 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Single Thread - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Only GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3K 6K 9K 12K 15K SE +/- 113.33, N = 3 SE +/- 266.50, N = 3 SE +/- 152.49, N = 3 14374.13 9213.45 13985.70 14307.54 8870.94 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 8K 16K 24K 32K 40K SE +/- 133.09, N = 3 SE +/- 81.46, N = 3 SE +/- 66.03, N = 3 36226.69 28411.49 36248.65 36115.94 28285.32 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 90K 180K 270K 360K 450K SE +/- 1289.79, N = 3 SE +/- 1716.32, N = 3 SE +/- 3319.69, N = 3 408873.25 242221.35 408599.13 401369.61 238819.60 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PolyBench-C Test: 3 Matrix Multiplications OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: 3 Matrix Multiplications GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 0.8235 1.647 2.4705 3.294 4.1175 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 3.58 3.66 3.35 3.59 3.66 3.61 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
PolyBench-C Test: Correlation Computation OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: Correlation Computation GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1.215 2.43 3.645 4.86 6.075 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.34 5.39 5.33 5.40 5.38 5.38 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
PolyBench-C Test: Covariance Computation OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: Covariance Computation GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 5.34 5.37 5.42 5.39 5.41 5.45 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
SQLite Timed SQLite Insertions OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.22 Timed SQLite Insertions GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 0.7043 1.4086 2.1129 2.8172 3.5215 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.89 3.13 2.88 2.89 3.07 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lz -ldl -lpthread
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 13 26 39 52 65 SE +/- 0.01, N = 3 SE +/- 0.20, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 50.80 57.85 49.05 52.60 50.72 57.00 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -lm -O3
Zstd Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 60 120 180 240 300 SE +/- 0.54, N = 3 SE +/- 0.68, N = 3 SE +/- 1.43, N = 3 155.64 260.02 166.70 156.58 261.52 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lz -llzma
m-queens Time To Solve OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.1 Time To Solve GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 700 1400 2100 2800 3500 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 32.02 2694.47 32.44 32.09 31.56 3263.94 1. (CXX) g++ options: -fopenmp -O3 -march=native -O2
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 120 240 360 480 600 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.52 566.22 2.51 2.84 2.52 561.50 -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 120 240 360 480 600 SE +/- 0.21, N = 6 4 533 3 4 4 560 -march=native -march=native 1. (CXX) g++ options: -fopenmp -O3
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 10.51 15.40 11.13 10.50 15.49 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3 6 9 12 15 SE +/- 0.19, N = 5 SE +/- 0.12, N = 5 SE +/- 0.01, N = 5 12.63 12.87 12.27 12.44 12.87 -march=native -march=native -march=native -march=native 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 40 80 120 160 200 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 7.65 192.26 7.71 7.63 192.14 -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
Bullet Physics Engine Test: 3000 Fall GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall -O3 -march=native -O3 1.1385 2.277 3.4155 4.554 5.6925 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 5.02 5.06 5.04 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 1000 Stack GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack -O3 -march=native -O3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.79 6.02 5.78 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 136 Ragdolls GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls -O3 -march=native -O3 0.7065 1.413 2.1195 2.826 3.5325 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 3.09 3.14 3.08 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 1000 Convex GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex -O3 -march=native -O3 1.242 2.484 3.726 4.968 6.21 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.32 5.52 5.29 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Prim Trimesh GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh -O3 -march=native -O3 0.2543 0.5086 0.7629 1.0172 1.2715 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.13 1.12 1.13 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Convex Trimesh GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Convex Trimesh -O3 -march=native -O3 0.306 0.612 0.918 1.224 1.53 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.33 1.36 1.33 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Raytests GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Raytests -O3 -march=native -O3 0.7155 1.431 2.1465 2.862 3.5775 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.11 3.18 3.12 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Sockperf Test: Latency Under Load OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 7 14 21 28 35 SE +/- 0.30, N = 10 SE +/- 0.31, N = 10 SE +/- 0.24, N = 8 19.75 31.29 19.67 19.88 18.06 -march=native -march=native -march=native -march=native 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Ping Pong OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 2 4 6 8 10 SE +/- 0.11, N = 10 SE +/- 0.07, N = 5 6.42 6.88 6.76 6.79 -march=native -march=native -march=native 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Phoronix Test Suite v10.8.4