AMD EPYC compiler benchmarks.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1807210-RA-AMDEPYCCO34 AMD EPYC Compiler Testing July 2018 - Phoronix Test Suite AMD EPYC Compiler Testing July 2018 AMD EPYC compiler benchmarks.
HTML result view exported from: https://openbenchmarking.org/result/1807210-RA-AMDEPYCCO34&export=pdf&gru&sor&rro .
AMD EPYC Compiler Testing July 2018 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Compiler File-System Screen Resolution GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads) TYAN B8026T70AE24HR (V1.02.B10 BIOS) AMD Family 17h 129024MB 280GB INTEL SSDPE21D280GA ASPEED ASPEED Family VE228 Broadcom Limited NetXtreme BCM5720 Gigabit PCIe Ubuntu 18.10 4.16.0-041600-generic (x86_64) GCC 8.1.1 20180720 ext4 1920x1080 GCC 9.0.0 20180720 OpenBenchmarking.org Environment Details - GCC 8.2 RC1: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - GCC 8.2 RC1: -O3 -march=native - PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - GCC 8.2 RC1: -O3: CXXFLAGS=-O3 CFLAGS=-O3 - GCC 9.0 SVN: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native - GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native Compiler Details - --disable-multilib --enable-checking=release Disk Details - NONE / data=ordered,errors=remount-ro,relatime,rw Processor Details - Scaling Governor: acpi-cpufreq ondemand Security Details - __user pointer sanitization + Full AMD retpoline IBPB Protection
AMD EPYC Compiler Testing July 2018 ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping vpxenc: vpxenc graphics-magick: HWB Color Space graphics-magick: Blur graphics-magick: Local Adaptive Thresholding graphics-magick: Resizing graphics-magick: Sharpen tinymembench: Standard Memcpy tjbench: Decompression Throughput scimark2: Composite scimark2: Fast Fourier Transform scimark2: Jacobi Successive Over-Relaxation scimark2: Monte Carlo scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization fftw: Stock - 1D FFT Size 1024 fftw: Stock - 2D FFT Size 1024 himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test crafty: Elapsed Time stockfish: Total Time tscp: AI Chess Performance ebizzy: redis: SET redis: GET redis: LPUSH redis: LPOP redis: SADD apache: Static Web Page Serving openssl: RSA 4096-bit Performance pgbench: Buffer Test - Single Thread - Read Write pgbench: Buffer Test - Single Thread - Read Only pgbench: Buffer Test - Normal Load - Read Write pgbench: Buffer Test - Normal Load - Read Only polybench-c: 3 Matrix Multiplications polybench-c: Correlation Computation polybench-c: Covariance Computation sqlite: Timed SQLite Insertions aobench: 2048 x 2048 - Total Time compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 m-queens: Time To Solve c-ray: Total Time smallpt: Global Illumination Renderer; 100 Samples encode-mp3: WAV To MP3 encode-flac: WAV To FLAC hmmer: Pfam Database Search bullet: 3000 Fall bullet: 1000 Stack bullet: 136 Ragdolls bullet: 1000 Convex bullet: Prim Trimesh bullet: Convex Trimesh bullet: Raytests sockperf: Latency Under Load sockperf: Latency Ping Pong GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 439.54 12.38 209 154 110 185 188 12497.20 142.18 1943.30 219.38 1687.64 556.37 2421.38 4831.75 8518.67 6557.83 982.67 100684 6014866 59730290 851019 1147009 1375161.96 1691908.00 1108304.54 1108268.08 1430638.45 16205.32 4610.63 2071.78 14374.13 36226.69 408873.25 3.58 5.34 5.34 2.89 50.80 155.64 32.02 2.52 4 10.51 12.63 7.65 5.02 5.79 3.09 5.32 1.13 1.33 3.11 19.75 6.42 3.04 46 18 23 29 6 12463.50 106.03 1766.38 214.57 1676.49 195.98 2442.87 4301.99 8289.10 6351.50 972.07 98726 5998759 1372510 729507 1131696 15210.47 4656.40 1427.38 9213.45 28411.49 242221.35 3.66 5.39 5.37 3.13 57.85 260.02 2694.47 566.22 533 15.40 12.87 192.26 31.29 6.88 12.29 12453.20 1877.54 219.09 1676.15 213.98 2413.24 4865.24 852.63 99762 6026655 61080869 919193 1112448 4617.23 3.35 5.33 5.42 49.05 32.44 2.51 3 441.58 12.36 199 147 108 175 177 12555.70 141.74 1874.09 221.30 1429.81 561.85 2473.98 4683.55 7176.77 5835.90 998.89 97917 5989207 59837512 851283 1113356 1332541.96 1813896.31 1187656.29 1143259.50 1517826.77 15891.72 4609.73 2044.30 13985.70 36248.65 408599.13 3.59 5.40 5.39 2.88 52.60 166.70 32.09 2.84 4 11.13 12.27 7.71 5.06 6.02 3.14 5.52 1.12 1.36 3.18 19.67 6.76 12.37 201 149 104 180 182 12377.60 145.66 1940.65 219.09 1688.09 556.37 2458.23 4781.45 8506.83 6531.37 1007.74 97385 5876297 59201545 856576 1111662 1284845.13 1586367.58 1133637.46 1152018.46 1423169.79 15856.03 4616.47 2015.18 14307.54 36115.94 401369.61 3.66 5.38 5.41 2.89 50.72 156.58 31.56 2.52 4 10.50 12.44 7.63 5.04 5.78 3.08 5.29 1.13 1.33 3.12 19.88 46 18 23 30 6 12474.20 106.90 1869.33 216.86 1678.45 190.06 2391.26 4869.99 8345.20 6448.40 1006.19 99362 5878800 756563 1142302 15244.02 4647.80 1348.77 8870.94 28285.32 238819.60 3.61 5.38 5.45 3.07 57.00 261.52 3263.94 561.50 560 15.49 12.87 192.14 18.06 6.79 OpenBenchmarking.org
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3a Phong Rendering With Soft-Shadow Mapping AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 100 200 300 400 500 SE +/- 1.94, N = 3 SE +/- 1.61, N = 3 3.04 439.54 441.58 -march=native 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
VP9 libvpx Encoding vpxenc GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.7.0 vpxenc PGO -O3 -march=native -O3 -O3 -march=native 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 12.29 12.36 12.38 12.37 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: HWB Color Space AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 50 100 150 200 250 SE +/- 0.58, N = 3 46 46 199 201 209 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Blur AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 30 60 90 120 150 18 18 147 149 154 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Local Adaptive Thresholding AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 20 40 60 80 100 SE +/- 0.33, N = 3 23 23 104 108 110 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Resizing AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 40 80 120 160 200 SE +/- 1.00, N = 3 SE +/- 1.00, N = 3 SE +/- 1.20, N = 3 29 30 175 180 185 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Sharpen AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 40 80 120 160 200 6 6 177 182 188 -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
Tinymembench Standard Memcpy OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memcpy GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 3K 6K 9K 12K 15K 12377.60 12453.20 12463.50 12474.20 12497.20 12555.70 -march=native -march=native -fprofile-correction -march=native -march=native -march=native 1. (CC) gcc options: -O2 -O3 -lm
Tinymembench Standard Memset OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memset GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 2K 4K 6K 8K 10K 9975.20 9982.80 10015.50 10034.00 10037.70 10063.10 -march=native -fprofile-correction -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O2 -O3 -lm
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 106.03 106.90 141.74 142.18 145.66 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 400 800 1200 1600 2000 SE +/- 5.53, N = 3 SE +/- 3.46, N = 3 SE +/- 4.00, N = 3 SE +/- 5.96, N = 3 1766.38 1869.33 1874.09 1877.54 1940.65 1943.30 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 50 100 150 200 250 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 214.57 216.86 219.09 219.09 219.38 221.30 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native - PGO AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native 400 800 1200 1600 2000 SE +/- 0.16, N = 3 SE +/- 0.26, N = 3 SE +/- 0.06, N = 3 SE +/- 0.72, N = 3 1429.81 1676.15 1676.49 1678.45 1687.64 1688.09 -march=native -fprofile-correction -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 120 240 360 480 600 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 190.06 195.98 213.98 556.37 556.37 561.85 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 -march=native AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 500 1000 1500 2000 2500 SE +/- 10.35, N = 3 SE +/- 11.03, N = 3 SE +/- 13.99, N = 3 SE +/- 1.01, N = 3 2391.26 2413.24 2421.38 2442.87 2458.23 2473.98 -march=native -march=native -fprofile-correction -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1000 2000 3000 4000 5000 SE +/- 27.07, N = 3 SE +/- 25.52, N = 3 SE +/- 20.82, N = 3 SE +/- 17.02, N = 3 4301.99 4683.55 4781.45 4831.75 4865.24 4869.99 -march=native -march=native -march=native -march=native -fprofile-correction -march=native 1. (CC) gcc options: -O3 -lm
FFTW Build: Stock - Size: 1D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 2K 4K 6K 8K 10K SE +/- 8.90, N = 3 SE +/- 1.83, N = 3 SE +/- 18.71, N = 3 7176.77 8289.10 8345.20 8506.83 8518.67 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Stock - Size: 2D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 1400 2800 4200 5600 7000 SE +/- 3.18, N = 3 SE +/- 16.29, N = 3 SE +/- 73.40, N = 3 5835.90 6351.50 6448.40 6531.37 6557.83 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -pthread -O3 -lm
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 8.2 RC1: -O3 -march=native - PGO AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native 200 400 600 800 1000 SE +/- 0.73, N = 3 SE +/- 0.78, N = 3 SE +/- 2.17, N = 3 SE +/- 4.57, N = 3 852.63 972.07 982.67 998.89 1006.19 1007.74 -march=native -fprofile-correction -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -mavx2
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 -march=native 20K 40K 60K 80K 100K SE +/- 465.88, N = 3 SE +/- 586.49, N = 3 SE +/- 555.15, N = 3 SE +/- 715.28, N = 3 97385 97917 98726 99362 99762 100684 1. (CXX) g++ options: -pipe -lpthread
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 13470.58, N = 3 SE +/- 13110.39, N = 3 SE +/- 5146.22, N = 3 SE +/- 11021.16, N = 3 5876297 5878800 5989207 5998759 6014866 6026655 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native - PGO 13M 26M 39M 52M 65M SE +/- 756265.41, N = 3 SE +/- 576746.80, N = 3 SE +/- 551889.61, N = 3 SE +/- 512120.63, N = 3 1372510 59201545 59730290 59837512 61080869 -march=native -march=native -march=native -fprofile-correction 1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 200K 400K 600K 800K 1000K SE +/- 766.17, N = 5 SE +/- 833.02, N = 5 SE +/- 421.22, N = 5 SE +/- 1018.49, N = 5 729507 756563 851019 851283 856576 919193 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
ebizzy OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native 200K 400K 600K 800K 1000K SE +/- 23788.74, N = 6 SE +/- 20234.15, N = 6 SE +/- 18183.13, N = 4 SE +/- 28582.82, N = 6 1111662 1112448 1113356 1131696 1142302 1147009 -fprofile-correction 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
Redis Test: SET GCC 9.0 SVN GCC 8.2 RC1 OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET -O3 -march=native -O3 300K 600K 900K 1200K 1500K SE +/- 31380.33, N = 6 SE +/- 23587.86, N = 6 SE +/- 13759.71, N = 3 1284845.13 1332541.96 1375161.96 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: GET GCC 9.0 SVN GCC 8.2 RC1 OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET -O3 -march=native -O3 400K 800K 1200K 1600K 2000K SE +/- 25099.79, N = 6 SE +/- 37527.46, N = 6 SE +/- 34086.99, N = 6 1586367.58 1691908.00 1813896.31 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: LPUSH GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH -O3 -march=native -O3 300K 600K 900K 1200K 1500K SE +/- 18347.41, N = 6 SE +/- 2157.90, N = 3 SE +/- 12639.17, N = 3 1108304.54 1187656.29 1133637.46 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: LPOP GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP -O3 -march=native -O3 200K 400K 600K 800K 1000K SE +/- 4079.04, N = 3 SE +/- 15264.52, N = 3 SE +/- 21693.88, N = 3 1108268.08 1143259.50 1152018.46 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: SADD GCC 9.0 SVN GCC 8.2 RC1 OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD -O3 -march=native -O3 300K 600K 900K 1200K 1500K SE +/- 3767.82, N = 3 SE +/- 4081.67, N = 3 SE +/- 35661.81, N = 6 1423169.79 1430638.45 1517826.77 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 3K 6K 9K 12K 15K SE +/- 89.48, N = 3 SE +/- 188.72, N = 3 SE +/- 193.23, N = 3 15210.47 15244.02 15856.03 15891.72 16205.32 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -shared -fPIC -pthread -O3
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.0f RSA 4096-bit Performance GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core 1000 2000 3000 4000 5000 SE +/- 22.71, N = 3 SE +/- 21.56, N = 3 SE +/- 23.13, N = 3 SE +/- 23.93, N = 3 4609.73 4610.63 4616.47 4617.23 4647.80 4656.40 -lssl -lssl -lssl 1. (CC) gcc options: -O3 -pthread -m64 -lcrypto -ldl
PostgreSQL pgbench Scaling: Buffer Test - Test: Single Thread - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Write GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 400 800 1200 1600 2000 SE +/- 37.36, N = 3 SE +/- 30.23, N = 5 SE +/- 12.52, N = 3 1348.77 1427.38 2015.18 2044.30 2071.78 -march=native -march=native -march=native -lpq -lpq -march=native -lpq 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Single Thread - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Only GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 3K 6K 9K 12K 15K SE +/- 266.50, N = 3 SE +/- 152.49, N = 3 SE +/- 113.33, N = 3 8870.94 9213.45 13985.70 14307.54 14374.13 -march=native -march=native -lpq -march=native -lpq -march=native -lpq 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 8K 16K 24K 32K 40K SE +/- 66.03, N = 3 SE +/- 133.09, N = 3 SE +/- 81.46, N = 3 28285.32 28411.49 36115.94 36226.69 36248.65 -march=native -march=native -march=native -lpq -march=native -lpq -lpq 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 90K 180K 270K 360K 450K SE +/- 3319.69, N = 3 SE +/- 1716.32, N = 3 SE +/- 1289.79, N = 3 238819.60 242221.35 401369.61 408599.13 408873.25 -march=native -march=native -march=native -lpq -lpq -march=native -lpq 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PolyBench-C Test: 3 Matrix Multiplications OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: 3 Matrix Multiplications GCC 9.0 SVN: -O3 -march=native AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 0.8235 1.647 2.4705 3.294 4.1175 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 3.66 3.66 3.61 3.59 3.58 3.35 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
PolyBench-C Test: Correlation Computation OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: Correlation Computation GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 1.215 2.43 3.645 4.86 6.075 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 5.40 5.39 5.38 5.38 5.34 5.33 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
PolyBench-C Test: Covariance Computation OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: Covariance Computation GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 5.45 5.42 5.41 5.39 5.37 5.34 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
SQLite Timed SQLite Insertions OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.22 Timed SQLite Insertions AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 0.7043 1.4086 2.1129 2.8172 3.5215 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 3.13 3.07 2.89 2.89 2.88 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lz -ldl -lpthread
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 13 26 39 52 65 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.20, N = 3 57.85 57.00 52.60 50.80 50.72 49.05 -march=native -march=native -march=native -march=native -march=native -fprofile-correction 1. (CC) gcc options: -lm -O3
Zstd Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native 60 120 180 240 300 SE +/- 0.68, N = 3 SE +/- 1.43, N = 3 SE +/- 0.54, N = 3 261.52 260.02 166.70 156.58 155.64 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lz -llzma
m-queens Time To Solve OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.1 Time To Solve GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native 700 1400 2100 2800 3500 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 3263.94 2694.47 32.44 32.09 32.02 31.56 1. (CXX) g++ options: -fopenmp -O3 -march=native -O2
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 120 240 360 480 600 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 566.22 561.50 2.84 2.52 2.52 2.51 -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO 120 240 360 480 600 SE +/- 0.21, N = 6 560 533 4 4 4 3 -march=native -march=native 1. (CXX) g++ options: -fopenmp -O3
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 15.49 15.40 11.13 10.51 10.50 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 3 6 9 12 15 SE +/- 0.19, N = 5 SE +/- 0.01, N = 5 SE +/- 0.12, N = 5 12.87 12.87 12.63 12.44 12.27 -march=native -march=native -march=native -march=native 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 192.26 192.14 7.71 7.65 7.63 -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
Bullet Physics Engine Test: 3000 Fall GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall -O3 -O3 -march=native 1.1385 2.277 3.4155 4.554 5.6925 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.05, N = 3 5.06 5.02 5.04 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 1000 Stack GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack -O3 -O3 -march=native 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 6.02 5.79 5.78 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 136 Ragdolls GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls -O3 -O3 -march=native 0.7065 1.413 2.1195 2.826 3.5325 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 3.14 3.09 3.08 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: 1000 Convex GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex -O3 -O3 -march=native 1.242 2.484 3.726 4.968 6.21 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.52 5.32 5.29 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Prim Trimesh GCC 9.0 SVN GCC 8.2 RC1 OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh -O3 -march=native -O3 0.2543 0.5086 0.7629 1.0172 1.2715 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.13 1.13 1.12 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Convex Trimesh GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Convex Trimesh -O3 -O3 -march=native 0.306 0.612 0.918 1.224 1.53 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.36 1.33 1.33 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine Test: Raytests GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Raytests -O3 -O3 -march=native 0.7155 1.431 2.1465 2.862 3.5775 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.18 3.11 3.12 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Sockperf Test: Latency Under Load OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load AMD EPYC 7601 32-Core GCC 9.0 SVN: -O3 -march=native GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 7 14 21 28 35 SE +/- 0.24, N = 8 SE +/- 0.30, N = 10 SE +/- 0.31, N = 10 31.29 19.88 19.75 19.67 18.06 -march=native -march=native -march=native -march=native 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Ping Pong OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong AMD EPYC 7601 32-Core GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 2 4 6 8 10 SE +/- 0.07, N = 5 SE +/- 0.11, N = 10 6.88 6.79 6.76 6.42 -march=native -march=native -march=native 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Phoronix Test Suite v10.8.4