GCC 9 compiler tuning benchmarks by Michael Larabel for a future article on Phoronix.com.
-O0 Environment Notes: CXXFLAGS=-O0 CFLAGS=-O0Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-Og Environment Notes: CXXFLAGS=-Og CFLAGS=-OgCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O1 Environment Notes: CXXFLAGS=-O1 CFLAGS=-O1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O2 Environment Notes: CXXFLAGS=-O2 CFLAGS=-O2Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O2 -ftree-vectorize -ftree-slp-vectorize Environment Notes: CXXFLAGS=-O2-ftree-vectorize-ftree-slp-vectorize CFLAGS=-O2-ftree-vectorize-ftree-slp-vectorizeCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O2 -march=znver1 Environment Notes: CXXFLAGS=-O2-march=znver1 CFLAGS=-O2-march=znver1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O2 -flto Environment Notes: CXXFLAGS=-O2-flto CFLAGS=-O2-fltoCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O3 Environment Notes: CXXFLAGS=-O3 CFLAGS=-O3Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O3 -march=znver1 Environment Notes: CXXFLAGS=-O3-march=znver1 CFLAGS=-O3-march=znver1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-O3 -march=znver1 -flto Environment Notes: CXXFLAGS=-O3 march=znver1-flto CFLAGS=-O3-march=znver1-fltoCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-Ofast -march=znver1 Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox G200eW3, Monitor: VE228, Network: 2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 18.04, Kernel: 5.0.0-050000rc6-generic (x86_64) 20190210, Desktop: GNOME Shell 3.28.3, Display Server: X Server, Compiler: GCC 9.0.1 20190210, File-System: ext4, Screen Resolution: 1600x1200
Environment Notes: CXXFLAGS=-Ofast-march=znver1 CFLAGS=-Ofast-march=znver1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write -O3 -march=znver1 -O0 -O3 -march=znver1 -flto -O1 -O2 -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -Ofast -march=znver1 -O2 -flto -Og 1100 2200 3300 4400 5500 SE +/- 50.49, N = 3 SE +/- 29.67, N = 3 SE +/- 66.20, N = 5 SE +/- 71.59, N = 9 SE +/- 12.79, N = 3 SE +/- 29.58, N = 3 SE +/- 55.61, N = 6 SE +/- 49.50, N = 9 SE +/- 66.04, N = 9 SE +/- 26.99, N = 3 SE +/- 47.02, N = 3 5068 4585 4319 4301 4272 4262 4239 4167 4102 4095 3767 -O3 -march=znver1 -O0 -O3 -march=znver1 -flto -O1 -O2 -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -Ofast -march=znver1 -O2 -flto -Og 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Write -O3 -march=znver1 -O2 -flto -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -Og -O3 -O3 -march=znver1 -flto -O1 -O2 -march=znver1 -O2 -O0 200 400 600 800 1000 SE +/- 19.20, N = 3 SE +/- 3.41, N = 3 SE +/- 19.14, N = 3 SE +/- 1.22, N = 3 SE +/- 4.15, N = 3 SE +/- 12.77, N = 8 SE +/- 9.80, N = 3 SE +/- 8.54, N = 3 SE +/- 12.77, N = 9 SE +/- 17.89, N = 3 SE +/- 6.16, N = 3 1145 1127 1125 1125 1080 1079 1074 1065 1060 1037 886 -O3 -march=znver1 -O2 -flto -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -Og -O3 -O3 -march=znver1 -flto -O1 -O2 -march=znver1 -O2 -O0 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -O2 -O1 -O2 -march=znver1 -Ofast -march=znver1 -Og -O3 -march=znver1 -O3 -O3 -march=znver1 -flto -O0 110K 220K 330K 440K 550K SE +/- 3875.50, N = 3 SE +/- 4765.38, N = 3 SE +/- 2768.55, N = 3 SE +/- 6061.06, N = 3 SE +/- 5952.34, N = 3 SE +/- 1629.04, N = 3 SE +/- 3395.99, N = 3 SE +/- 3819.62, N = 3 SE +/- 8068.95, N = 9 SE +/- 7546.41, N = 4 SE +/- 4794.85, N = 3 529699 520570 515340 515102 510425 508384 507203 505031 490551 454256 419700 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -O2 -O1 -O2 -march=znver1 -Ofast -march=znver1 -Og -O3 -march=znver1 -O3 -O3 -march=znver1 -flto -O0 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Single Thread - Mode: Read Only -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O3 -march=znver1 -O2 -march=znver1 -O3 -O2 -O2 -flto -Og -O1 -O0 3K 6K 9K 12K 15K SE +/- 224.15, N = 6 SE +/- 295.94, N = 3 SE +/- 125.79, N = 3 SE +/- 101.16, N = 3 SE +/- 122.77, N = 3 SE +/- 32.93, N = 3 SE +/- 48.94, N = 3 SE +/- 172.87, N = 3 SE +/- 149.12, N = 3 SE +/- 119.98, N = 3 SE +/- 69.77, N = 3 16012 15353 15352 15188 15111 15099 14931 14851 13333 13303 9063 -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O3 -march=znver1 -O2 -march=znver1 -O3 -O2 -O2 -flto -Og -O1 -O0 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
Hierarchical INTegration This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: DOUBLE -O2 -flto -O3 -march=znver1 -flto -O2 -march=znver1 -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O0 -Og -O3 -O3 -march=znver1 -O1 130M 260M 390M 520M 650M SE +/- 10585229.89, N = 3 SE +/- 7234705.24, N = 9 SE +/- 9814749.06, N = 4 SE +/- 7419115.18, N = 3 SE +/- 2338894.48, N = 3 SE +/- 6546514.72, N = 3 SE +/- 8177784.85, N = 3 SE +/- 9099115.17, N = 9 SE +/- 1832535.45, N = 3 SE +/- 7042504.19, N = 3 SE +/- 1617493.63, N = 3 626640400 618644101 617516626 605331833 602535297 599481605 598545342 597234266 595428047 589289926 585060029 -O2 -flto -march=znver1 -flto -O2 -march=znver1 -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O0 -Og -march=znver1 -O1 1. (CC) gcc options: -O3 -march=native -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 -O3 -O1 -O2 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -Og -O0 3K 6K 9K 12K 15K SE +/- 115.66, N = 3 SE +/- 70.29, N = 3 SE +/- 160.95, N = 3 SE +/- 15.71, N = 3 SE +/- 72.78, N = 3 SE +/- 134.39, N = 3 SE +/- 160.71, N = 8 SE +/- 49.65, N = 3 SE +/- 78.62, N = 3 SE +/- 165.58, N = 3 SE +/- 1.00, N = 3 13555 13468 13391 13346 13285 13214 13166 13110 12752 12642 2193 -O3 -O1 -O2 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -Og -O0 1. (CC) gcc options: -pthread -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.0 vpxenc VP9 1080p Video Encode -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O1 -Og -O0 -O2 -march=znver1 -O3 -march=znver1 -Ofast -march=znver1 -O3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.18, N = 5 12.75 12.56 12.54 12.53 12.52 12.50 12.42 12.41 12.37 12.31
Hierarchical INTegration This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT -O3 -march=znver1 -O1 -O2 -flto -O0 -Og -O3 -O2 -O2 -march=znver1 -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 60M 120M 180M 240M 300M SE +/- 1208988.13, N = 3 SE +/- 1109543.66, N = 3 SE +/- 1057235.60, N = 3 SE +/- 321625.93, N = 3 SE +/- 144731.32, N = 3 SE +/- 219028.41, N = 3 SE +/- 232284.24, N = 3 SE +/- 211192.91, N = 3 SE +/- 67545.97, N = 3 SE +/- 54052.05, N = 3 SE +/- 193963.83, N = 3 268506472 268455578 268173400 267404445 267368671 267315647 267311970 267268023 267239405 267172145 267055407 -march=znver1 -O1 -O2 -flto -O0 -Og -O2 -O2 -march=znver1 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 1. (CC) gcc options: -O3 -march=native -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.0 vpxenc VP9 1080p Video Encode -O3 -march=znver1 -flto -Og -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O2 -march=znver1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 20.86 20.39 20.34 20.13 20.05
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O3 -march=znver1 -flto -O2 -flto -O2 -march=znver1 -O3 -march=znver1 -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O1 -O2 -Og -O0 1200 2400 3600 4800 6000 SE +/- 26.34, N = 3 SE +/- 5.87, N = 3 SE +/- 13.33, N = 3 SE +/- 47.88, N = 3 SE +/- 10.24, N = 3 SE +/- 26.82, N = 3 SE +/- 41.07, N = 3 SE +/- 6.37, N = 3 SE +/- 2.28, N = 3 SE +/- 11.16, N = 3 SE +/- 5.17, N = 3 5571 5091 5074 5006 4885 4805 4751 4632 4625 4366 1708 -O3 -march=znver1 -flto -O2 -flto -O2 -march=znver1 -O3 -march=znver1 -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O1 -O2 -Og -O0 1. (CC) gcc options: -pthread -lm
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2019-02-03 1080p 8-bit YUV To AV1 Video Encode -O3 -Og -O3 -march=znver1 -flto -Ofast -march=znver1 -O2 -march=znver1 -O1 -O2 -flto -O2 -O0 -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize 0.3893 0.7786 1.1679 1.5572 1.9465 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 6 SE +/- 0.02, N = 9 SE +/- 0.03, N = 3 SE +/- 0.02, N = 8 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 1.73 1.73 1.71 1.70 1.70 1.70 1.69 1.69 1.69 1.68 1.67 1. (CC) gcc options: -mavx2 -fPIE -fPIC -O2 -pie -lpthread -lm
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing -O2 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O2 -march=znver1 -O1 -O3 -march=znver1 -flto -Ofast -march=znver1 -Og -O3 -O0 30 60 90 120 150 SE +/- 2.52, N = 3 SE +/- 1.50, N = 12 SE +/- 1.53, N = 3 SE +/- 1.94, N = 5 SE +/- 1.43, N = 12 SE +/- 1.50, N = 8 SE +/- 1.32, N = 10 SE +/- 1.40, N = 8 SE +/- 1.20, N = 3 131 128 128 127 127 126 125 124 120 118 74 -O2 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O2 -march=znver1 -O1 -O3 -march=znver1 -flto -Ofast -march=znver1 -Og -O3 -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time -Ofast -march=znver1 -O3 -march=znver1 -O2 -march=znver1 -O0 -Og -O1 -O2 -flto -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 20M 40M 60M 80M 100M SE +/- 460638.54, N = 3 SE +/- 402849.09, N = 3 SE +/- 324013.38, N = 3 SE +/- 1595135.10, N = 3 SE +/- 823190.91, N = 3 SE +/- 549524.55, N = 3 SE +/- 579693.09, N = 3 SE +/- 1016511.53, N = 3 SE +/- 468403.16, N = 3 SE +/- 673773.08, N = 3 106507244 106497994 106084276 105868175 105709690 105698092 104536605 104480422 104197865 104121840 -Ofast -march=znver1 -march=znver1 -O2 -march=znver1 -O0 -Og -O1 -O2 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
John The Ripper This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.8.0-jumbo-1 Test: Blowfish -O3 -march=znver1 -O1 -O3 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O2 -O2 -march=znver1 -O3 -march=znver1 -flto -Og -O0 14K 28K 42K 56K 70K SE +/- 1082.96, N = 12 SE +/- 1098.57, N = 4 SE +/- 1049.43, N = 3 SE +/- 1395.50, N = 12 SE +/- 1953.49, N = 12 SE +/- 1454.31, N = 11 SE +/- 1387.83, N = 12 SE +/- 1967.27, N = 12 SE +/- 1598.30, N = 12 SE +/- 1339.16, N = 9 SE +/- 215.16, N = 12 66823 65995 65806 65117 63586 62841 62718 61309 58764 56453 15179 1. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian -Ofast -march=znver1 -O3 -march=znver1 -O2 -flto -O2 -march=znver1 -O2 -O1 -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -Og -O0 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.88, N = 3 SE +/- 1.20, N = 3 SE +/- 0.58, N = 3 SE +/- 0.88, N = 3 SE +/- 0.58, N = 3 187 180 180 180 180 179 178 178 172 168 92 -Ofast -march=znver1 -O3 -march=znver1 -O2 -flto -O2 -march=znver1 -O2 -O1 -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -Og -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced -Ofast -march=znver1 -O3 -march=znver1 -O2 -march=znver1 -O2 -flto -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O3 -march=znver1 -flto -O3 -Og -O0 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 3.18, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 193 191 191 190 189 188 187 186 181 173 90 -Ofast -march=znver1 -O3 -march=znver1 -O2 -march=znver1 -O2 -flto -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O3 -march=znver1 -flto -O3 -Og -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen -O3 -march=znver1 -flto -O3 -march=znver1 -O2 -flto -O2 -march=znver1 -Ofast -march=znver1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O3 -Og -O0 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 183 183 183 183 182 181 180 180 174 156 82 -O3 -march=znver1 -flto -O3 -march=znver1 -O2 -flto -O2 -march=znver1 -Ofast -march=znver1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O3 -Og -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl -Ofast -march=znver1 -O2 -flto -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O2 -O3 -march=znver1 -flto -O1 -O3 -Og -O0 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 196 196 196 196 195 195 194 194 189 181 96 -Ofast -march=znver1 -O2 -flto -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O2 -O3 -march=znver1 -flto -O1 -O3 -Og -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O2 -O3 -march=znver1 -O1 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -Og -O0 50 100 150 200 250 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 214 212 211 211 210 210 209 209 203 195 102 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O2 -O3 -march=znver1 -O1 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -Og -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate -O2 -flto -O2 -march=znver1 -O2 -O1 -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -Og -O0 40 80 120 160 200 SE +/- 0.33, N = 3 191 191 191 191 190 190 189 188 183 181 98 -O2 -flto -O2 -march=znver1 -O2 -O1 -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -Og -O0 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
John The Ripper This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.8.0-jumbo-1 Test: Traditional DES -O2 -flto -O3 -march=znver1 -Ofast -march=znver1 -O2 -O1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O3 -march=znver1 -flto -O3 -Og -O0 60M 120M 180M 240M 300M SE +/- 642920.77, N = 3 SE +/- 2346357.84, N = 3 SE +/- 1656338.97, N = 3 SE +/- 2178423.16, N = 3 SE +/- 2774527.11, N = 10 SE +/- 2839112.24, N = 3 SE +/- 2041895.52, N = 3 SE +/- 1374420.40, N = 3 SE +/- 3859011.69, N = 12 SE +/- 2445677.03, N = 3 SE +/- 2756947.41, N = 3 260736667 260019667 258770667 257407667 257067200 257058000 255957000 254777333 253868583 239289333 218232000 1. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
Himeno Benchmark The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver -Ofast -march=znver1 -O2 -flto -O2 -O3 -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O3 -march=znver1 -flto -O1 -Og -O0 200 400 600 800 1000 SE +/- 8.54, N = 3 SE +/- 2.58, N = 3 SE +/- 5.25, N = 3 SE +/- 0.08, N = 3 SE +/- 6.27, N = 3 SE +/- 7.21, N = 3 SE +/- 2.90, N = 3 SE +/- 2.81, N = 3 SE +/- 5.78, N = 3 SE +/- 4.02, N = 3 SE +/- 0.11, N = 3 1022 1022 1017 1011 1008 1007 1001 1000 785 772 383 -Ofast -march=znver1 -O2 -flto -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -march=znver1 -flto -O1 -Og -O0 1. (CC) gcc options: -O3 -mavx2
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O3 -march=znver1 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -O2 -march=znver1 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O1 -Og -O0 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 51.49 51.73 52.08 53.53 54.35 55.52 55.53 55.54 56.61 77.41 92.50 -march=znver1 -Ofast -march=znver1 -march=znver1 -flto -O2 -march=znver1 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O1 -Og -O0 1. (CC) gcc options: -lm -O3
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 -O3 -march=znver1 -flto -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O2 -flto -O1 -O3 -march=znver1 -Og -O2 -O2 -march=znver1 -O0 6 12 18 24 30 SE +/- 0.24, N = 11 SE +/- 0.38, N = 12 SE +/- 0.33, N = 12 SE +/- 0.21, N = 12 SE +/- 0.31, N = 12 SE +/- 0.25, N = 12 SE +/- 0.44, N = 12 SE +/- 0.29, N = 12 SE +/- 0.49, N = 9 SE +/- 0.35, N = 12 SE +/- 0.38, N = 4 13.16 13.66 13.67 13.77 14.08 14.11 14.37 14.39 14.48 14.71 23.12 -O3 -march=znver1 -flto -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -Ofast -march=znver1 -O2 -flto -O1 -O3 -march=znver1 -Og -O2 -O2 -march=znver1 -O0 1. (CC) gcc options: -pthread -lz -llzma
Timed PHP Compilation This test times how long it takes to build PHP 5 with the Zend engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.1.9 Time To Compile -O0 -Og -O1 -O2 -march=znver1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O3 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 SE +/- 0.25, N = 3 SE +/- 0.18, N = 3 SE +/- 0.26, N = 3 SE +/- 0.33, N = 3 SE +/- 0.22, N = 3 15.19 21.42 29.05 51.96 52.17 52.58 78.13 78.19 -Og -O1 -O2 -march=znver1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O3 1. (CC) gcc options: -pedantic -ldl -lz -lm
Timed ImageMagick Compilation This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile -O0 -Og -O1 -O2 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O3 -Ofast -march=znver1 -O2 -flto -O3 -march=znver1 -flto 30 60 90 120 150 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 SE +/- 0.21, N = 8 SE +/- 0.10, N = 3 SE +/- 0.45, N = 3 SE +/- 0.20, N = 3 SE +/- 0.30, N = 3 SE +/- 0.44, N = 3 SE +/- 0.34, N = 3 SE +/- 0.98, N = 3 SE +/- 0.45, N = 3 5.23 7.89 18.42 23.63 23.78 23.91 24.88 25.06 25.21 98.67 118.48
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2019-02-15 1080p 8-bit YUV To AV1 Video Encode -Ofast -march=znver1 -O3 -O2 -flto -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O0 -O1 -Og -O3 -march=znver1 -flto -O2 -march=znver1 -O2 1.3298 2.6596 3.9894 5.3192 6.649 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 5.91 5.90 5.90 5.89 5.89 5.88 5.87 5.86 5.84 5.84 5.81 1. (CC) gcc options: -mavx -fPIE -fPIC -O2 -pie -lpthread -lm
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite -O3 -march=znver1 -Ofast -march=znver1 -O3 -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O2 -march=znver1 -O2 -O2 -flto -Og -O0 400 800 1200 1600 2000 SE +/- 11.89, N = 3 SE +/- 20.59, N = 3 SE +/- 7.96, N = 3 SE +/- 35.09, N = 3 SE +/- 8.94, N = 3 SE +/- 6.18, N = 3 SE +/- 23.45, N = 5 SE +/- 12.17, N = 3 SE +/- 24.31, N = 3 SE +/- 18.65, N = 5 SE +/- 5.30, N = 3 1961 1825 1800 1747 1724 1519 1501 1369 1307 1205 434 -O3 -march=znver1 -Ofast -march=znver1 -O3 -O3 -march=znver1 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O2 -march=znver1 -O2 -O2 -flto -Og -O0 1. (CC) gcc options: -lm
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC -O3 -O2 -flto -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O2 -march=znver1 -Ofast -march=znver1 -O3 -march=znver1 -flto -O1 -Og -O0 20 40 60 80 100 SE +/- 0.12, N = 5 SE +/- 0.11, N = 5 SE +/- 0.08, N = 5 SE +/- 0.14, N = 5 SE +/- 0.10, N = 5 SE +/- 0.09, N = 5 SE +/- 0.08, N = 5 SE +/- 0.10, N = 5 SE +/- 0.09, N = 5 SE +/- 0.11, N = 5 SE +/- 0.12, N = 5 13.61 13.64 13.65 13.70 13.85 13.89 13.95 14.21 15.01 15.58 96.77 -O0 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O3 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O2 -flto -Og -O1 -O0 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 10.40 11.31 11.35 12.60 21.58 25.77 25.84 25.96 28.64 28.74 44.92 -Ofast -march=znver1 -march=znver1 -flto -march=znver1 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O2 -flto -Og -O1 -O0 1. (CC) gcc options: -lm -lpthread -O3
Timed Apache Compilation This test times how long it takes to build the Apache HTTP Server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.7 Time To Compile -O0 -Og -O1 -O2 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O3 -Ofast -march=znver1 -O2 -flto -O3 -march=znver1 -flto 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 11.43 14.59 17.51 23.82 23.82 24.03 25.94 26.08 26.11 26.50 28.62
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding -O1 -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O2 -flto -O0 -Ofast -march=znver1 -O2 -march=znver1 -Og -O2 8 16 24 32 40 SE +/- 0.37, N = 11 SE +/- 0.18, N = 3 SE +/- 0.46, N = 3 SE +/- 0.41, N = 3 SE +/- 0.38, N = 3 SE +/- 0.67, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.58, N = 3 SE +/- 0.24, N = 3 35.62 35.57 35.41 35.21 35.07 35.00 34.91 34.80 34.76 34.55 -O1 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -O0 -Ofast -march=znver1 -O2 -march=znver1 -Og -O2 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O2 -O2 -flto -O1 -Og -O0 10 20 30 40 50 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 9.80 10.38 10.57 10.84 10.96 14.00 14.07 14.14 14.32 16.78 41.79 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O2 -O2 -flto -O1 -Og -O0 1. (CC) gcc options: -lm
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 2019-02-17 1080p 8-bit YUV To VP9 Video Encode -Ofast -march=znver1 -O3 -march=znver1 -flto -O2 -march=znver1 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -Og 20 40 60 80 100 SE +/- 0.30, N = 3 SE +/- 0.50, N = 3 SE +/- 0.18, N = 3 SE +/- 0.75, N = 3 SE +/- 1.09, N = 3 SE +/- 1.17, N = 3 97.80 97.26 95.91 95.79 94.82 92.68 -Ofast -march=znver1 -O3 -march=znver1 -march=native -ftree-vectorize -ftree-slp-vectorize -Og 1. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O2 -march=znver1 -O2 -flto -O3 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -Og -O0 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 6.00 6.16 6.29 6.54 6.56 6.57 6.62 6.82 6.93 7.39 9.02 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O2 -march=znver1 -O2 -flto -O3 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -Og -O0 1. (CC) gcc options: -pthread -lhmmer -lsquid -lm
libjpeg-turbo tjbench tjbench is a JPEG decompression/compression benchmark part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O2 -march=znver1 -O3 -Og -O2 -flto -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O0 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 SE +/- 0.91, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.67, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.76, N = 3 144 144 144 142 141 141 140 140 139 139 111 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -march=znver1 -O2 -march=znver1 -O3 -Og -O2 -flto -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O0 1. (CC) gcc options: -lm
Bullet Physics Engine This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Raytests -O2 -flto -O3 -march=znver1 -Ofast -march=znver1 -O2 -march=znver1 -O0 -Og -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O1 -O2 0.702 1.404 2.106 2.808 3.51 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.05 3.09 3.09 3.10 3.11 3.11 3.11 3.11 3.12 3.12 -O2 -flto -march=znver1 -Ofast -march=znver1 -O2 -march=znver1 -O0 -Og -O2 -ftree-vectorize -ftree-slp-vectorize -O1 -O2 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding -O3 -O1 -Ofast -march=znver1 -O3 -march=znver1 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -Og -O0 30 60 90 120 150 SE +/- 0.81, N = 3 SE +/- 1.49, N = 3 SE +/- 0.52, N = 3 SE +/- 1.78, N = 3 SE +/- 1.40, N = 3 SE +/- 0.47, N = 3 SE +/- 0.97, N = 3 SE +/- 1.09, N = 3 SE +/- 0.12, N = 3 147 145 144 144 144 144 144 142 102 -O1 -Ofast -march=znver1 -march=znver1 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -Og -O0 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance -O3 -march=znver1 -O0 -Og -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O2 -march=znver1 -Ofast -march=znver1 -O2 -O1 -O2 -flto -O3 -march=znver1 -flto 200K 400K 600K 800K 1000K SE +/- 667.00, N = 5 SE +/- 333.13, N = 5 SE +/- 333.13, N = 5 SE +/- 508.06, N = 5 SE +/- 272.00, N = 5 SE +/- 272.00, N = 5 SE +/- 507.80, N = 5 SE +/- 507.80, N = 5 SE +/- 542.88, N = 5 SE +/- 331.91, N = 5 SE +/- 270.20, N = 5 865732 865459 865187 864916 864915 864915 864373 864373 864102 864101 863018 -march=znver1 -O0 -Og -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -Ofast -march=znver1 -O2 -O1 -O2 -flto -march=znver1 -flto 1. (CC) gcc options: -O3 -march=native
ctx_clock Ctx_clock is a simple test program to measure the context switch time in clock cycles. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Clocks, Fewer Is Better ctx_clock Context Switch Time -Og -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -O3 -march=znver1 -flto 30 60 90 120 150 132 132 132 132 -Og -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto -O3 -march=znver1 -flto 1. (CC) gcc options:
Bullet Physics Engine This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Convex Trimesh -O2 -march=znver1 -O3 -march=znver1 -Ofast -march=znver1 -O2 -flto -O0 -Og -O1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 0.3038 0.6076 0.9114 1.2152 1.519 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.32 1.32 1.32 1.33 1.35 1.35 1.35 1.35 1.35 1.35 -O2 -march=znver1 -march=znver1 -Ofast -march=znver1 -O2 -flto -O0 -Og -O1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh -O2 -flto -O0 -Og -O1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -O3 -O3 -march=znver1 -Ofast -march=znver1 0.2498 0.4996 0.7494 0.9992 1.249 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.09 1.11 1.11 1.11 1.11 1.11 1.11 1.11 1.11 1.11 -O2 -flto -O0 -Og -O1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -march=znver1 -march=znver1 -Ofast -march=znver1 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls -O2 -march=znver1 -O3 -march=znver1 -Ofast -march=znver1 -O0 -Og -O2 -O1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O2 -flto 0.729 1.458 2.187 2.916 3.645 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.05 3.06 3.06 3.14 3.14 3.14 3.15 3.15 3.15 3.24 -O2 -march=znver1 -march=znver1 -Ofast -march=znver1 -O0 -Og -O2 -O1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex -O3 -march=znver1 -Ofast -march=znver1 -O2 -march=znver1 -O0 -Og -O1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -O2 -flto 1.215 2.43 3.645 4.86 6.075 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.18 5.18 5.19 5.36 5.37 5.37 5.37 5.37 5.38 5.40 -march=znver1 -Ofast -march=znver1 -O2 -march=znver1 -O0 -Og -O1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -flto 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack -O2 -march=znver1 -O3 -march=znver1 -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O0 -O1 -O2 -Og -O3 -O2 -flto 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 5.80 5.80 5.80 5.98 6.00 6.00 6.01 6.02 6.05 6.32 -O2 -march=znver1 -march=znver1 -Ofast -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O0 -O1 -O2 -Og -O2 -flto 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall -O3 -march=znver1 -O2 -march=znver1 -Ofast -march=znver1 -O0 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -Og -O1 -O3 -O2 -flto 1.1723 2.3446 3.5169 4.6892 5.8615 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 5.07 5.08 5.09 5.14 5.14 5.14 5.15 5.16 5.16 5.21 -march=znver1 -O2 -march=znver1 -Ofast -march=znver1 -O0 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -Og -O1 -O2 -flto 1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation -O3 -march=znver1 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O1 -Og -O2 -flto -O0 400 800 1200 1600 2000 SE +/- 0.21, N = 3 SE +/- 0.12, N = 3 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.31, N = 3 SE +/- 0.08, N = 3 SE +/- 0.35, N = 3 1689 1676 1675 1427 1016 919 919 919 919 918 832 -O3 -march=znver1 -Ofast -march=znver1 -O3 -march=znver1 -flto -O3 -O2 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O1 -Og -O2 -flto -O0 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -Ofast -march=znver1 -O1 -O3 -march=znver1 -flto -O2 -march=znver1 -O2 -Og -O2 -flto -O0 1000 2000 3000 4000 5000 SE +/- 65.35, N = 3 SE +/- 57.07, N = 3 SE +/- 42.05, N = 3 SE +/- 107.53, N = 3 SE +/- 32.13, N = 3 SE +/- 178.79, N = 3 SE +/- 173.68, N = 3 SE +/- 54.41, N = 3 SE +/- 139.26, N = 3 SE +/- 129.41, N = 3 SE +/- 30.37, N = 3 4851 4396 4307 4089 3466 3300 3231 2609 2539 2515 512 -O3 -march=znver1 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -Ofast -march=znver1 -O1 -O3 -march=znver1 -flto -O2 -march=znver1 -O2 -Og -O2 -flto -O0 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply -O2 -march=znver1 -Ofast -march=znver1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O3 -O1 -O2 -flto -Og -O3 -march=znver1 -flto -O0 600 1200 1800 2400 3000 SE +/- 14.53, N = 3 SE +/- 12.59, N = 3 SE +/- 11.61, N = 3 SE +/- 12.19, N = 3 SE +/- 10.26, N = 3 SE +/- 3.37, N = 3 SE +/- 59.22, N = 3 SE +/- 5.41, N = 3 SE +/- 3.66, N = 3 SE +/- 2.13, N = 3 SE +/- 5.25, N = 3 2584 2579 2527 2515 2482 2475 2411 2299 2188 2052 516 -O2 -march=znver1 -Ofast -march=znver1 -O2 -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -O3 -O1 -O2 -flto -Og -O3 -march=znver1 -flto -O0 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform -Og -O3 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -flto -O2 -O2 -march=znver1 -O3 -march=znver1 -O1 -Ofast -march=znver1 -O0 60 120 180 240 300 SE +/- 2.03, N = 3 SE +/- 0.51, N = 3 SE +/- 2.57, N = 3 SE +/- 0.71, N = 3 SE +/- 1.03, N = 3 SE +/- 0.61, N = 3 SE +/- 1.88, N = 3 SE +/- 0.03, N = 3 SE +/- 0.44, N = 3 SE +/- 1.04, N = 3 SE +/- 1.00, N = 3 257 232 232 231 230 230 229 227 226 221 201 -Og -O3 -O2 -flto -O2 -ftree-vectorize -ftree-slp-vectorize -O3 -march=znver1 -flto -O2 -O2 -march=znver1 -O3 -march=znver1 -O1 -Ofast -march=znver1 -O0 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo -O3 -march=znver1 -flto -O1 -O2 -flto -Ofast -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O3 -march=znver1 -O2 -march=znver1 -Og -O0 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.27, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.22, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.28, N = 3 SE +/- 0.03, N = 3 1480 576 568 561 560 560 560 557 557 210 108 -O3 -march=znver1 -flto -O1 -O2 -flto -Ofast -march=znver1 -O3 -O2 -ftree-vectorize -ftree-slp-vectorize -O2 -O3 -march=znver1 -O2 -march=znver1 -Og 1. (CC) gcc options: -lm
-O0 Environment Notes: CXXFLAGS=-O0 CFLAGS=-O0Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 15 February 2019 20:24 by user root.
-Og Environment Notes: CXXFLAGS=-Og CFLAGS=-OgCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 19 February 2019 05:54 by user root.
-O1 Environment Notes: CXXFLAGS=-O1 CFLAGS=-O1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 16 February 2019 07:27 by user root.
-O2 Environment Notes: CXXFLAGS=-O2 CFLAGS=-O2Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 16 February 2019 15:25 by user root.
-O2 -ftree-vectorize -ftree-slp-vectorize Environment Notes: CXXFLAGS=-O2-ftree-vectorize-ftree-slp-vectorize CFLAGS=-O2-ftree-vectorize-ftree-slp-vectorizeCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 18 February 2019 05:50 by user root.
-O2 -march=znver1 Environment Notes: CXXFLAGS=-O2-march=znver1 CFLAGS=-O2-march=znver1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 17 February 2019 07:16 by user root.
-O2 -flto Environment Notes: CXXFLAGS=-O2-flto CFLAGS=-O2-fltoCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 18 February 2019 20:32 by user root.
-O3 Environment Notes: CXXFLAGS=-O3 CFLAGS=-O3Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 16 February 2019 21:58 by user root.
-O3 -march=znver1 Environment Notes: CXXFLAGS=-O3-march=znver1 CFLAGS=-O3-march=znver1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 15 February 2019 13:07 by user root.
-O3 -march=znver1 -flto Environment Notes: CXXFLAGS=-O3 march=znver1-flto CFLAGS=-O3-march=znver1-fltoCompiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 18 February 2019 12:40 by user root.
-Ofast -march=znver1 Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox G200eW3, Monitor: VE228, Network: 2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 18.04, Kernel: 5.0.0-050000rc6-generic (x86_64) 20190210, Desktop: GNOME Shell 3.28.3, Display Server: X Server, Compiler: GCC 9.0.1 20190210, File-System: ext4, Screen Resolution: 1600x1200
Environment Notes: CXXFLAGS=-Ofast-march=znver1 CFLAGS=-Ofast-march=znver1Compiler Notes: --disable-multilib --enable-checking=releaseSecurity Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 17 February 2019 14:18 by user root.