AMD EPYC compiler benchmarks.
GCC 8.2 RC1: -O3 -march=native Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 -march=native - PGO GCC 8.2 RC1: -O3 Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads), Motherboard: TYAN B8026T70AE24HR (V1.02.B10 BIOS), Chipset: AMD Family 17h, Memory: 129024MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe
OS: Ubuntu 18.10, Kernel: 4.16.0-041600-generic (x86_64), Compiler: GCC 8.1.1 20180720, File-System: ext4, Screen Resolution: 1920x1080
Environment Notes: CXXFLAGS=-O3 CFLAGS=-O3Compiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
GCC 9.0 SVN: -O3 -march=native Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core OS: Ubuntu 18.10, Kernel: 4.16.0-041600-generic (x86_64), Compiler: GCC 9.0.0 20180720, File-System: ext4, Screen Resolution: 1920x1080
GraphicsMagick OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Blur AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 30 60 90 120 150 18 147 154 149 18 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Resizing AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 40 80 120 160 200 SE +/- 1.00, N = 3 SE +/- 1.20, N = 3 SE +/- 1.00, N = 3 29 175 185 180 30 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native 13M 26M 39M 52M 65M SE +/- 551889.61, N = 3 SE +/- 576746.80, N = 3 SE +/- 512120.63, N = 3 SE +/- 756265.41, N = 3 1372510 59837512 59730290 61080869 59201545 -march=native -march=native -fprofile-correction -march=native 1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
GraphicsMagick OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Local Adaptive Thresholding AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 20 40 60 80 100 SE +/- 0.33, N = 3 23 108 110 104 23 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: HWB Color Space AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 50 100 150 200 250 SE +/- 0.58, N = 3 46 199 209 201 46 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.28 Operation: Sharpen AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 40 80 120 160 200 6 177 188 182 6 -march=native -march=native 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 192.26 7.71 7.65 7.63 192.14 -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
C-Ray OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 120 240 360 480 600 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 566.22 2.84 2.52 2.51 2.52 561.50 -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 120 240 360 480 600 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 SE +/- 0.25, N = 3 SE +/- 0.02, N = 3 195.98 561.85 556.37 213.98 556.37 190.06 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
Smallpt OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 120 240 360 480 600 SE +/- 0.21, N = 6 533 4 4 3 4 560 -march=native -march=native 1. (CXX) g++ options: -fopenmp -O3
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3a Phong Rendering With Soft-Shadow Mapping AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native 100 200 300 400 500 SE +/- 1.61, N = 3 SE +/- 1.94, N = 3 3.04 441.58 439.54 -march=native 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
m-queens A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.1 Time To Solve AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 700 1400 2100 2800 3500 SE +/- 0.14, N = 3 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.16, N = 3 2694.47 32.09 32.02 32.44 31.56 3263.94 1. (CXX) g++ options: -fopenmp -O3 -march=native -O2
Sockperf This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 7 14 21 28 35 SE +/- 0.31, N = 10 SE +/- 0.30, N = 10 SE +/- 0.24, N = 8 31.29 19.67 19.75 19.88 18.06 -march=native -march=native -march=native -march=native 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 90K 180K 270K 360K 450K SE +/- 1716.32, N = 3 SE +/- 1289.79, N = 3 SE +/- 3319.69, N = 3 242221.35 408599.13 408873.25 401369.61 238819.60 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 60 120 180 240 300 SE +/- 0.68, N = 3 SE +/- 0.54, N = 3 SE +/- 1.43, N = 3 260.02 166.70 155.64 156.58 261.52 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lz -llzma
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Only AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3K 6K 9K 12K 15K SE +/- 266.50, N = 3 SE +/- 113.33, N = 3 SE +/- 152.49, N = 3 9213.45 13985.70 14374.13 14307.54 8870.94 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Write AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 400 800 1200 1600 2000 SE +/- 30.23, N = 5 SE +/- 12.52, N = 3 SE +/- 37.36, N = 3 1427.38 2044.30 2071.78 2015.18 1348.77 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 15.40 11.13 10.51 10.50 15.49 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
libjpeg-turbo tjbench tjbench is a JPEG decompression/compression benchmark part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 106.03 141.74 142.18 145.66 106.90 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lm
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 8K 16K 24K 32K 40K SE +/- 81.46, N = 3 SE +/- 133.09, N = 3 SE +/- 66.03, N = 3 28411.49 36248.65 36226.69 36115.94 28285.32 -march=native -lpq -march=native -lpq -march=native -lpq -march=native 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 200K 400K 600K 800K 1000K SE +/- 833.02, N = 5 SE +/- 766.17, N = 5 SE +/- 1018.49, N = 5 SE +/- 421.22, N = 5 729507 851283 851019 919193 856576 756563 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 2K 4K 6K 8K 10K SE +/- 8.90, N = 3 SE +/- 18.71, N = 3 SE +/- 1.83, N = 3 8289.10 7176.77 8518.67 8506.83 8345.20 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -pthread -O3 -lm
Himeno Benchmark The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 200 400 600 800 1000 SE +/- 2.17, N = 3 SE +/- 0.78, N = 3 SE +/- 0.73, N = 3 SE +/- 4.57, N = 3 972.07 998.89 982.67 852.63 1007.74 1006.19 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -mavx2
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 400 800 1200 1600 2000 SE +/- 0.16, N = 3 SE +/- 0.06, N = 3 SE +/- 0.26, N = 3 SE +/- 0.72, N = 3 1676.49 1429.81 1687.64 1676.15 1688.09 1678.45 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 13 26 39 52 65 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.20, N = 3 SE +/- 0.02, N = 3 57.85 52.60 50.80 49.05 50.72 57.00 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -lm -O3
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1000 2000 3000 4000 5000 SE +/- 27.07, N = 3 SE +/- 20.82, N = 3 SE +/- 17.02, N = 3 SE +/- 25.52, N = 3 4301.99 4683.55 4831.75 4865.24 4781.45 4869.99 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1400 2800 4200 5600 7000 SE +/- 3.18, N = 3 SE +/- 73.40, N = 3 SE +/- 16.29, N = 3 6351.50 5835.90 6557.83 6531.37 6448.40 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -pthread -O3 -lm
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 400 800 1200 1600 2000 SE +/- 5.53, N = 3 SE +/- 5.96, N = 3 SE +/- 3.46, N = 3 SE +/- 4.00, N = 3 1766.38 1874.09 1943.30 1877.54 1940.65 1869.33 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
PolyBench-C PolyBench-C is a C-language polyhedral benchmark suite made at the Ohio State University. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: 3 Matrix Multiplications AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 0.8235 1.647 2.4705 3.294 4.1175 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 3.66 3.59 3.58 3.35 3.66 3.61 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
SQLite This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.22 Timed SQLite Insertions AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 0.7043 1.4086 2.1129 2.8172 3.5215 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 3.13 2.88 2.89 2.89 3.07 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -O3 -lz -ldl -lpthread
Sockperf This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 2 4 6 8 10 SE +/- 0.07, N = 5 SE +/- 0.11, N = 10 6.88 6.76 6.42 6.79 -march=native -march=native -march=native 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET -O3 -O3 -march=native 300K 600K 900K 1200K 1500K SE +/- 23587.86, N = 6 SE +/- 13759.71, N = 3 SE +/- 31380.33, N = 6 1332541.96 1375161.96 1284845.13 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD -O3 -O3 -march=native 300K 600K 900K 1200K 1500K SE +/- 35661.81, N = 6 SE +/- 4081.67, N = 3 SE +/- 3767.82, N = 3 1517826.77 1430638.45 1423169.79 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3K 6K 9K 12K 15K SE +/- 188.72, N = 3 SE +/- 193.23, N = 3 SE +/- 89.48, N = 3 15210.47 15891.72 16205.32 15856.03 15244.02 -march=native -march=native -march=native -march=native 1. (CC) gcc options: -shared -fPIC -pthread -O3
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3 6 9 12 15 SE +/- 0.12, N = 5 SE +/- 0.19, N = 5 SE +/- 0.01, N = 5 12.87 12.27 12.63 12.44 12.87 -march=native -march=native -march=native -march=native 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack -O3 -O3 -march=native 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 6.02 5.79 5.78 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 500 1000 1500 2000 2500 SE +/- 1.01, N = 3 SE +/- 11.03, N = 3 SE +/- 10.35, N = 3 SE +/- 13.99, N = 3 2442.87 2473.98 2421.38 2413.24 2458.23 2391.26 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
7-Zip Compression This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 20K 40K 60K 80K 100K SE +/- 586.49, N = 3 SE +/- 715.28, N = 3 SE +/- 555.15, N = 3 SE +/- 465.88, N = 3 98726 97917 100684 99762 97385 99362 1. (CXX) g++ options: -pipe -lpthread
ebizzy This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 200K 400K 600K 800K 1000K SE +/- 18183.13, N = 4 SE +/- 28582.82, N = 6 SE +/- 20234.15, N = 6 SE +/- 23788.74, N = 6 1131696 1113356 1147009 1112448 1111662 1142302 -fprofile-correction 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 50 100 150 200 250 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 214.57 221.30 219.38 219.09 219.09 216.86 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O3 -lm
Crafty This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 13110.39, N = 3 SE +/- 5146.22, N = 3 SE +/- 11021.16, N = 3 SE +/- 13470.58, N = 3 5998759 5989207 6014866 6026655 5876297 5878800 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Raytests -O3 -O3 -march=native 0.7155 1.431 2.1465 2.862 3.5775 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.18 3.11 3.12 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
PolyBench-C PolyBench-C is a C-language polyhedral benchmark suite made at the Ohio State University. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: Covariance Computation AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.37 5.39 5.34 5.42 5.41 5.45 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
Tinymembench This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memcpy AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 3K 6K 9K 12K 15K 12463.50 12555.70 12497.20 12453.20 12377.60 12474.20 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O2 -O3 -lm
PolyBench-C PolyBench-C is a C-language polyhedral benchmark suite made at the Ohio State University. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: Correlation Computation AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1.215 2.43 3.645 4.86 6.075 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 5.39 5.40 5.34 5.33 5.38 5.38 -fprofile-correction 1. (CC) gcc options: -O3 -march=native
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.0f RSA 4096-bit Performance AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 1000 2000 3000 4000 5000 SE +/- 22.71, N = 3 SE +/- 21.56, N = 3 SE +/- 23.93, N = 3 SE +/- 23.13, N = 3 4656.40 4609.73 4610.63 4617.23 4616.47 4647.80 -lssl -lssl -lssl 1. (CC) gcc options: -O3 -pthread -m64 -lcrypto -ldl
Tinymembench OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memset AMD EPYC 7601 32-Core GCC 8.2 RC1: -O3 GCC 8.2 RC1: -O3 -march=native GCC 8.2 RC1: -O3 -march=native - PGO GCC 9.0 SVN: -O3 -march=native GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core 2K 4K 6K 8K 10K 10015.50 10037.70 10063.10 9975.20 9982.80 10034.00 -march=native -march=native -march=native -fprofile-correction -march=native -march=native 1. (CC) gcc options: -O2 -O3 -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format. Learn more via the OpenBenchmarking.org test page.
GCC 8.2 RC1 GCC 9.0 SVN OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.7.0 vpxenc -O3 -O3 -march=native PGO -O3 -march=native 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 12.36 12.38 12.29 12.37 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE
GCC 8.2 RC1: -O3 -march=native Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
Testing initiated at 20 July 2018 11:08 by user phoronix.
AMD EPYC 7601 32-Core Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
Testing initiated at 20 July 2018 13:41 by user phoronix.
GCC 8.2 RC1: -O3 -march=native - PGO Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
Testing initiated at 20 July 2018 17:05 by user phoronix.
GCC 8.2 RC1: -O3 Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads), Motherboard: TYAN B8026T70AE24HR (V1.02.B10 BIOS), Chipset: AMD Family 17h, Memory: 129024MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe
OS: Ubuntu 18.10, Kernel: 4.16.0-041600-generic (x86_64), Compiler: GCC 8.1.1 20180720, File-System: ext4, Screen Resolution: 1920x1080
Environment Notes: CXXFLAGS=-O3 CFLAGS=-O3Compiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
Testing initiated at 20 July 2018 18:42 by user phoronix.
GCC 9.0 SVN: -O3 -march=native Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
Testing initiated at 20 July 2018 21:03 by user phoronix.
GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads), Motherboard: TYAN B8026T70AE24HR (V1.02.B10 BIOS), Chipset: AMD Family 17h, Memory: 129024MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe
OS: Ubuntu 18.10, Kernel: 4.16.0-041600-generic (x86_64), Compiler: GCC 9.0.0 20180720, File-System: ext4, Screen Resolution: 1920x1080
Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: acpi-cpufreq ondemandSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB Protection
Testing initiated at 20 July 2018 23:33 by user phoronix.