GCC 9 compiler tuning benchmarks by Michael Larabel for a future article on Phoronix.com.
-O3 -march=znver1
Environment Notes: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O0
Environment Notes: CXXFLAGS=-O0 CFLAGS=-O0
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O1
Environment Notes: CXXFLAGS=-O1 CFLAGS=-O1
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O2
Environment Notes: CXXFLAGS=-O2 CFLAGS=-O2
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O3
Environment Notes: CXXFLAGS=-O3 CFLAGS=-O3
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O2 -march=znver1
Environment Notes: CXXFLAGS="-O2 -march=znver1" CFLAGS="-O2 -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-Ofast -march=znver1
Environment Notes: CXXFLAGS="-Ofast -march=znver1" CFLAGS="-Ofast -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O2 -ftree-vectorize -ftree-slp-vectorize
Environment Notes: CXXFLAGS="-O2 -ftree-vectorize -ftree-slp-vectorize" CFLAGS="-O2 -ftree-vectorize -ftree-slp-vectorize"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O3 -march=znver1 -flto
Environment Notes: CXXFLAGS="-O3 -march=znver1 -flto" CFLAGS="-O3 -march=znver1 -flto"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

-O2 -flto
Environment Notes: CXXFLAGS="-O2 -flto" CFLAGS="-O2 -flto"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
-Og
Environment Notes: CXXFLAGS=-Og CFLAGS=-Og
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox G200eW3, Monitor: VE228, Network: 2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 18.04, Kernel: 5.0.0-050000rc6-generic (x86_64) 20190210, Desktop: GNOME Shell 3.28.3, Display Server: X Server, Compiler: GCC 9.0.1 20190210, File-System: ext4, Screen Resolution: 1600x1200
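The configurations above differ only in the CFLAGS and CXXFLAGS values exported before each benchmark is built. As a rough sketch of how such per-configuration environments could be assembled (the names CONFIGS and build_env are hypothetical and are not part of the Phoronix Test Suite):

```python
import os

# Flag sets copied from the configuration notes above.
CONFIGS = [
    "-O0",
    "-O1",
    "-O2",
    "-O3",
    "-Og",
    "-O2 -march=znver1",
    "-O3 -march=znver1",
    "-Ofast -march=znver1",
    "-O2 -ftree-vectorize -ftree-slp-vectorize",
    "-O2 -flto",
    "-O3 -march=znver1 -flto",
]


def build_env(flags, base=None):
    """Return a copy of an environment with CFLAGS and CXXFLAGS set to `flags`.

    `base` defaults to the current process environment; pass a dict to
    start from something else (hypothetical helper, for illustration only).
    """
    env = dict(os.environ if base is None else base)
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    for flags in CONFIGS:
        env = build_env(flags, base={})
        print(f'CFLAGS="{env["CFLAGS"]}" CXXFLAGS="{env["CXXFLAGS"]}"')
```

The returned dictionary could be handed to whatever launches the build, for example as the `env` argument of `subprocess.run`.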
PostgreSQL pgbench
This is a simple benchmark of PostgreSQL using pgbench.

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)
-O3 -march=znver1: 5068 (SE +/- 50.49, N = 3)
-O0: 4585 (SE +/- 29.67, N = 3)
-O1: 4301 (SE +/- 71.59, N = 9)
-O2: 4167 (SE +/- 49.50, N = 9)
-O3: 4262 (SE +/- 29.58, N = 3)
-O2 -march=znver1: 4272 (SE +/- 12.79, N = 3)
-Ofast -march=znver1: 4102 (SE +/- 66.04, N = 9)
-O2 -ftree-vectorize -ftree-slp-vectorize: 4239 (SE +/- 55.61, N = 6)
-O3 -march=znver1 -flto: 4319 (SE +/- 66.20, N = 5)
-O2 -flto: 4095 (SE +/- 26.99, N = 3)
-Og: 3767 (SE +/- 47.02, N = 3)

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Single Thread - Mode: Read Write (TPS, More Is Better)
-O3 -march=znver1: 1145 (SE +/- 19.20, N = 3)
-O0: 886 (SE +/- 6.16, N = 3)
-O1: 1065 (SE +/- 8.54, N = 3)
-O2: 1037 (SE +/- 17.89, N = 3)
-O3: 1079 (SE +/- 12.77, N = 8)
-O2 -march=znver1: 1060 (SE +/- 12.77, N = 9)
-Ofast -march=znver1: 1125 (SE +/- 19.14, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1125 (SE +/- 1.22, N = 3)
-O3 -march=znver1 -flto: 1074 (SE +/- 9.80, N = 3)
-O2 -flto: 1127 (SE +/- 3.41, N = 3)
-Og: 1080 (SE +/- 4.15, N = 3)

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better)
-O3 -march=znver1: 505031 (SE +/- 3819.62, N = 3)
-O0: 419700 (SE +/- 4794.85, N = 3)
-O1: 515102 (SE +/- 6061.06, N = 3)
-O2: 515340 (SE +/- 2768.55, N = 3)
-O3: 490551 (SE +/- 8068.95, N = 9)
-O2 -march=znver1: 510425 (SE +/- 5952.34, N = 3)
-Ofast -march=znver1: 508384 (SE +/- 1629.04, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 529699 (SE +/- 3875.50, N = 3)
-O3 -march=znver1 -flto: 454256 (SE +/- 7546.41, N = 4)
-O2 -flto: 520570 (SE +/- 4765.38, N = 3)
-Og: 507203 (SE +/- 3395.99, N = 3)

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Single Thread - Mode: Read Only (TPS, More Is Better)
-O3 -march=znver1: 15188 (SE +/- 101.16, N = 3)
-O0: 9063 (SE +/- 69.77, N = 3)
-O1: 13303 (SE +/- 119.98, N = 3)
-O2: 14931 (SE +/- 48.94, N = 3)
-O3: 15099 (SE +/- 32.93, N = 3)
-O2 -march=znver1: 15111 (SE +/- 122.77, N = 3)
-Ofast -march=znver1: 15352 (SE +/- 125.79, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 15353 (SE +/- 295.94, N = 3)
-O3 -march=znver1 -flto: 16012 (SE +/- 224.15, N = 6)
-O2 -flto: 14851 (SE +/- 172.87, N = 3)
-Og: 13333 (SE +/- 149.12, N = 3)

All four pgbench charts: (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
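Each result above is a mean over N runs with its standard error (SE). The sketch below shows one way to read such figures, assuming the usual definition of the standard error of the mean (sample standard deviation divided by the square root of N); how the Phoronix Test Suite computes it internally is not stated here. The TPS values come from the Normal Load, Read Write chart above. The function names and the sample list in the usage note are hypothetical.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


def percent_change(value, baseline):
    """Relative difference of `value` against `baseline`, in percent."""
    return (value - baseline) / baseline * 100.0


# Mean TPS taken from the Normal Load / Read Write chart.
tps = {
    "-O3 -march=znver1": 5068,
    "-O2": 4167,
    "-Og": 3767,
}

if __name__ == "__main__":
    baseline = tps["-O2"]
    for config, value in tps.items():
        print(f"{config}: {percent_change(value, baseline):+.1f}% vs -O2")
```

With per-run samples in hand (for example a made-up list such as `[1.0, 2.0, 3.0]`), `standard_error` returns the kind of SE figure shown next to each result.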
Hierarchical INTegration
This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark.

Hierarchical INTegration 1.0 - Test: DOUBLE (QUIPs, More Is Better)
-O3 -march=znver1: 589289926 (SE +/- 7042504.19, N = 3)
-O0: 598545342 (SE +/- 8177784.85, N = 3)
-O1: 585060029 (SE +/- 1617493.63, N = 3)
-O2: 599481605 (SE +/- 6546514.72, N = 3)
-O3: 595428047 (SE +/- 1832535.45, N = 3)
-O2 -march=znver1: 617516626 (SE +/- 9814749.06, N = 4)
-Ofast -march=znver1: 605331833 (SE +/- 7419115.18, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 602535297 (SE +/- 2338894.48, N = 3)
-O3 -march=znver1 -flto: 618644101 (SE +/- 7234705.24, N = 9)
-O2 -flto: 626640400 (SE +/- 10585229.89, N = 3)
-Og: 597234266 (SE +/- 9099115.17, N = 9)
(CC) gcc options: -O3 -march=native -lm
FFTW
FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)
-O3 -march=znver1: 12752 (SE +/- 78.62, N = 3)
-O0: 2193 (SE +/- 1.00, N = 3)
-O1: 13468 (SE +/- 70.29, N = 3)
-O2: 13391 (SE +/- 160.95, N = 3)
-O3: 13555 (SE +/- 115.66, N = 3)
-O2 -march=znver1: 13346 (SE +/- 15.71, N = 3)
-Ofast -march=znver1: 13166 (SE +/- 160.71, N = 8)
-O2 -ftree-vectorize -ftree-slp-vectorize: 13285 (SE +/- 72.78, N = 3)
-O3 -march=znver1 -flto: 13110 (SE +/- 49.65, N = 3)
-O2 -flto: 13214 (SE +/- 134.39, N = 3)
-Og: 12642 (SE +/- 165.58, N = 3)
(CC) gcc options: -pthread -lm
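For readers unfamiliar with the transform FFTW computes, the following is a naive one-dimensional DFT written straight from the definition. It is only an illustration of the mathematical operation: FFTW itself uses far faster algorithms, and the function name dft is a placeholder.

```python
import cmath


def dft(x):
    """Naive O(n^2) discrete Fourier transform.

    X[k] = sum over t of x[t] * exp(-2*pi*i*k*t / n)
    """
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


if __name__ == "__main__":
    # A unit impulse has a flat spectrum; a constant signal has only a DC term.
    print(dft([1, 0, 0, 0]))
    print(dft([1, 1, 1, 1]))
```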
VP9 libvpx Encoding
This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video.

VP9 libvpx Encoding 1.8.0 - vpxenc VP9 1080p Video Encode (Frames Per Second, More Is Better)
-O3 -march=znver1: 12.41 (SE +/- 0.00, N = 3)
-O0: 12.50 (SE +/- 0.01, N = 3)
-O1: 12.53 (SE +/- 0.01, N = 3)
-O2: 12.54 (SE +/- 0.01, N = 3)
-O3: 12.31 (SE +/- 0.18, N = 5)
-O2 -march=znver1: 12.42 (SE +/- 0.02, N = 3)
-Ofast -march=znver1: 12.37 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 12.56 (SE +/- 0.02, N = 3)
-O3 -march=znver1 -flto: 12.75 (SE +/- 0.00, N = 3)
-Og: 12.52 (SE +/- 0.01, N = 3)
(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE
Hierarchical INTegration 1.0 - Test: FLOAT (QUIPs, More Is Better)
-O3 -march=znver1: 268506472 (SE +/- 1208988.13, N = 3)
-O0: 267404445 (SE +/- 321625.93, N = 3)
-O1: 268455578 (SE +/- 1109543.66, N = 3)
-O2: 267311970 (SE +/- 232284.24, N = 3)
-O3: 267315647 (SE +/- 219028.41, N = 3)
-O2 -march=znver1: 267268023 (SE +/- 211192.91, N = 3)
-Ofast -march=znver1: 267055407 (SE +/- 193963.83, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 267172145 (SE +/- 54052.05, N = 3)
-O3 -march=znver1 -flto: 267239405 (SE +/- 67545.97, N = 3)
-O2 -flto: 268173400 (SE +/- 1057235.60, N = 3)
-Og: 267368671 (SE +/- 144731.32, N = 3)
(CC) gcc options: -O3 -march=native -lm
VP9 libvpx Encoding 1.8.0 - vpxenc VP9 1080p Video Encode (Frames Per Second, More Is Better)
-O2 -march=znver1: 20.05 (SE +/- 0.07, N = 3)
-Ofast -march=znver1: 20.13 (SE +/- 0.04, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 20.34 (SE +/- 0.05, N = 3)
-O3 -march=znver1 -flto: 20.86 (SE +/- 0.02, N = 3)
-Og: 20.39 (SE +/- 0.01, N = 3)
(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
-O3 -march=znver1: 5006 (SE +/- 47.88, N = 3)
-O0: 1708 (SE +/- 5.17, N = 3)
-O1: 4632 (SE +/- 6.37, N = 3)
-O2: 4625 (SE +/- 2.28, N = 3)
-O3: 4751 (SE +/- 41.07, N = 3)
-O2 -march=znver1: 5074 (SE +/- 13.33, N = 3)
-Ofast -march=znver1: 4885 (SE +/- 10.24, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 4805 (SE +/- 26.82, N = 3)
-O3 -march=znver1 -flto: 5571 (SE +/- 26.34, N = 3)
-O2 -flto: 5091 (SE +/- 5.87, N = 3)
-Og: 4366 (SE +/- 11.16, N = 3)
(CC) gcc options: -pthread -lm
SVT-AV1
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file.

SVT-AV1 2019-02-03 - 1080p 8-bit YUV To AV1 Video Encode (Frames Per Second, More Is Better)
-O3 -march=znver1: 1.68 (SE +/- 0.02, N = 3)
-O0: 1.69 (SE +/- 0.00, N = 3)
-O1: 1.70 (SE +/- 0.02, N = 9)
-O2: 1.69 (SE +/- 0.02, N = 8)
-O3: 1.73 (SE +/- 0.02, N = 3)
-O2 -march=znver1: 1.70 (SE +/- 0.02, N = 6)
-Ofast -march=znver1: 1.70 (SE +/- 0.03, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1.67 (SE +/- 0.01, N = 3)
-O3 -march=znver1 -flto: 1.71 (SE +/- 0.01, N = 3)
-O2 -flto: 1.69 (SE +/- 0.03, N = 3)
-Og: 1.73 (SE +/- 0.00, N = 3)
(CC) gcc options: -mavx2 -fPIE -fPIC -O2 -pie -lpthread -lm
GraphicsMagick
This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU.

GraphicsMagick 1.3.30 - Operation: Resizing (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 127
-O0: 74
-O1: 126
-O2: 131
-O3: 118
-O2 -march=znver1: 127
-Ofast -march=znver1: 124
-O2 -ftree-vectorize -ftree-slp-vectorize: 128
-O3 -march=znver1 -flto: 125
-O2 -flto: 128
-Og: 120
Standard errors as listed (9 entries for 11 bars, so they cannot be assigned to specific configurations): SE +/- 1.94, N = 5; SE +/- 1.43, N = 12; SE +/- 2.52, N = 3; SE +/- 1.20, N = 3; SE +/- 1.32, N = 10; SE +/- 1.53, N = 3; SE +/- 1.50, N = 8; SE +/- 1.50, N = 12; SE +/- 1.40, N = 8
(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
Stockfish
This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores.

Stockfish 9 - Total Time (Nodes Per Second, More Is Better)
-O3 -march=znver1: 106497994 (SE +/- 402849.09, N = 3)
-O0: 105868175 (SE +/- 1595135.10, N = 3)
-O1: 105698092 (SE +/- 549524.55, N = 3)
-O2: 104480422 (SE +/- 1016511.53, N = 3)
-O3: 104121840 (SE +/- 673773.08, N = 3)
-O2 -march=znver1: 106084276 (SE +/- 324013.38, N = 3)
-Ofast -march=znver1: 106507244 (SE +/- 460638.54, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 104197865 (SE +/- 468403.16, N = 3)
-O2 -flto: 104536605 (SE +/- 579693.09, N = 3)
-Og: 105709690 (SE +/- 823190.91, N = 3)
(CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
John The Ripper
This is a benchmark of John The Ripper, which is a password cracker.

John The Ripper 1.8.0-jumbo-1 - Test: Blowfish (Real C/S, More Is Better)
-O3 -march=znver1: 66823 (SE +/- 1082.96, N = 12)
-O0: 15179 (SE +/- 215.16, N = 12)
-O1: 65995 (SE +/- 1098.57, N = 4)
-O2: 62718 (SE +/- 1387.83, N = 12)
-O3: 65806 (SE +/- 1049.43, N = 3)
-O2 -march=znver1: 61309 (SE +/- 1967.27, N = 12)
-Ofast -march=znver1: 62841 (SE +/- 1454.31, N = 11)
-O2 -ftree-vectorize -ftree-slp-vectorize: 63586 (SE +/- 1953.49, N = 12)
-O3 -march=znver1 -flto: 58764 (SE +/- 1598.30, N = 12)
-O2 -flto: 65117 (SE +/- 1395.50, N = 12)
-Og: 56453 (SE +/- 1339.16, N = 9)
(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
GraphicsMagick 1.3.30 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 180
-O0: 92
-O1: 179
-O2: 180
-O3: 172
-O2 -march=znver1: 180
-Ofast -march=znver1: 187
-O2 -ftree-vectorize -ftree-slp-vectorize: 178
-O3 -march=znver1 -flto: 178
-O2 -flto: 180
-Og: 168
Standard errors as listed (6 entries for 11 bars, all N = 3): SE +/- 0.58, 1.20, 0.58, 0.88, 0.58, 0.88

GraphicsMagick 1.3.30 - Operation: Enhanced (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 191
-O0: 90
-O1: 187
-O2: 189
-O3: 181
-O2 -march=znver1: 191
-Ofast -march=znver1: 193
-O2 -ftree-vectorize -ftree-slp-vectorize: 188
-O3 -march=znver1 -flto: 186
-O2 -flto: 190
-Og: 173
Standard errors as listed (5 entries for 11 bars, all N = 3): SE +/- 0.33, 0.33, 0.33, 3.18, 0.33

GraphicsMagick 1.3.30 - Operation: Sharpen (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 183
-O0: 82
-O1: 180
-O2: 181
-O3: 174
-O2 -march=znver1: 183
-Ofast -march=znver1: 182
-O2 -ftree-vectorize -ftree-slp-vectorize: 180
-O3 -march=znver1 -flto: 183
-O2 -flto: 183
-Og: 156
Standard errors as listed (4 entries for 11 bars, all N = 3): SE +/- 0.33, 0.33, 0.58, 0.58

GraphicsMagick 1.3.30 - Operation: Swirl (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 195
-O0: 96
-O1: 194
-O2: 195
-O3: 189
-O2 -march=znver1: 196
-Ofast -march=znver1: 196
-O2 -ftree-vectorize -ftree-slp-vectorize: 196
-O3 -march=znver1 -flto: 194
-O2 -flto: 196
-Og: 181
Standard errors as listed (5 entries for 11 bars, all N = 3): SE +/- 0.33, 0.33, 0.33, 0.67, 0.33

GraphicsMagick 1.3.30 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 210
-O0: 102
-O1: 210
-O2: 211
-O3: 203
-O2 -march=znver1: 211
-Ofast -march=znver1: 209
-O2 -ftree-vectorize -ftree-slp-vectorize: 212
-O3 -march=znver1 -flto: 209
-O2 -flto: 214
-Og: 195
Standard errors as listed (8 entries for 11 bars, all N = 3): SE +/- 0.33, 0.33, 0.33, 0.58, 0.88, 0.33, 0.33, 0.58

GraphicsMagick 1.3.30 - Operation: Rotate (Iterations Per Minute, More Is Better)
-O3 -march=znver1: 190
-O0: 98
-O1: 191
-O2: 191
-O3: 183
-O2 -march=znver1: 191
-Ofast -march=znver1: 189
-O2 -ftree-vectorize -ftree-slp-vectorize: 190
-O3 -march=znver1 -flto: 188
-O2 -flto: 191
-Og: 181
Standard errors as listed (1 entry for 11 bars): SE +/- 0.33, N = 3

All six GraphicsMagick charts above: (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
John The Ripper 1.8.0-jumbo-1 - Test: Traditional DES (Real C/S, More Is Better)
-O3 -march=znver1: 260019667 (SE +/- 2346357.84, N = 3)
-O0: 218232000 (SE +/- 2756947.41, N = 3)
-O1: 257067200 (SE +/- 2774527.11, N = 10)
-O2: 257407667 (SE +/- 2178423.16, N = 3)
-O3: 253868583 (SE +/- 3859011.69, N = 12)
-O2 -march=znver1: 255957000 (SE +/- 2041895.52, N = 3)
-Ofast -march=znver1: 258770667 (SE +/- 1656338.97, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 257058000 (SE +/- 2839112.24, N = 3)
-O3 -march=znver1 -flto: 254777333 (SE +/- 1374420.40, N = 3)
-O2 -flto: 260736667 (SE +/- 642920.77, N = 3)
-Og: 239289333 (SE +/- 2445677.03, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
Himeno Benchmark
The Himeno benchmark is a linear solver for the pressure Poisson equation using a point-Jacobi method.

Himeno Benchmark 3.0 - Poisson Pressure Solver (MFLOPS, More Is Better)
-O3 -march=znver1: 1011 (SE +/- 0.08, N = 3)
-O0: 383 (SE +/- 0.11, N = 3)
-O1: 785 (SE +/- 5.78, N = 3)
-O2: 1017 (SE +/- 5.25, N = 3)
-O3: 1008 (SE +/- 6.27, N = 3)
-O2 -march=znver1: 1001 (SE +/- 2.90, N = 3)
-Ofast -march=znver1: 1022 (SE +/- 8.54, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1007 (SE +/- 7.21, N = 3)
-O3 -march=znver1 -flto: 1000 (SE +/- 2.81, N = 3)
-O2 -flto: 1022 (SE +/- 2.58, N = 3)
-Og: 772 (SE +/- 4.02, N = 3)
(CC) gcc options: -O3 -mavx2
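To give a feel for what a point-Jacobi Poisson solver does, here is a deliberately small two-dimensional sketch. It is not the Himeno code (which works on a three-dimensional pressure grid with its own coefficients); the function name, grid size, and boundary condition are assumptions chosen for illustration.

```python
def jacobi_poisson(f, h, iterations):
    """Point-Jacobi sweeps for -laplace(u) = f on a square grid.

    `f` is an n x n list of lists, `h` the grid spacing, and u is held at
    zero on the boundary. Each sweep updates every interior point from the
    previous sweep's neighbours only, which is what makes it Jacobi rather
    than Gauss-Seidel.
    """
    n = len(f)
    u = [[0.0] * n for _ in range(n)]
    for _ in range(iterations):
        new = [row[:] for row in u]
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                new[i][j] = 0.25 * (
                    u[i - 1][j] + u[i + 1][j] + u[i][j - 1] + u[i][j + 1]
                    + h * h * f[i][j]
                )
        u = new
    return u


if __name__ == "__main__":
    n = 9
    f = [[1.0] * n for _ in range(n)]
    u = jacobi_poisson(f, h=1.0 / (n - 1), iterations=500)
    print(f"centre value after 500 sweeps: {u[n // 2][n // 2]:.6f}")
```

The inner double loop is the kind of stencil code whose speed depends heavily on how well the compiler optimizes and vectorizes it.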
AOBench
AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048.

AOBench - Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
-O3 -march=znver1: 51.49 (SE +/- 0.02, N = 3)
-O0: 92.50 (SE +/- 0.06, N = 3)
-O1: 56.61 (SE +/- 0.02, N = 3)
-O2: 55.54 (SE +/- 0.03, N = 3)
-O3: 53.53 (SE +/- 0.01, N = 3)
-O2 -march=znver1: 54.35 (SE +/- 0.01, N = 3)
-Ofast -march=znver1: 51.73 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 55.53 (SE +/- 0.01, N = 3)
-O3 -march=znver1 -flto: 52.08 (SE +/- 0.02, N = 3)
-O2 -flto: 55.52 (SE +/- 0.02, N = 3)
-Og: 77.41 (SE +/- 0.01, N = 3)
(CC) gcc options: -lm -O3
Zstd Compression
This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression.

Zstd Compression 1.3.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better)
-O3 -march=znver1: 14.37 (SE +/- 0.44, N = 12)
-O0: 23.12 (SE +/- 0.38, N = 4)
-O1: 14.11 (SE +/- 0.25, N = 12)
-O2: 14.48 (SE +/- 0.49, N = 9)
-O3: 13.66 (SE +/- 0.38, N = 12)
-O2 -march=znver1: 14.71 (SE +/- 0.35, N = 12)
-Ofast -march=znver1: 13.77 (SE +/- 0.21, N = 12)
-O2 -ftree-vectorize -ftree-slp-vectorize: 13.67 (SE +/- 0.33, N = 12)
-O3 -march=znver1 -flto: 13.16 (SE +/- 0.24, N = 11)
-O2 -flto: 14.08 (SE +/- 0.31, N = 12)
-Og: 14.39 (SE +/- 0.29, N = 12)
(CC) gcc options: -pthread -lz -llzma
Timed PHP Compilation
This test times how long it takes to build PHP with the Zend engine (PHP 7.1.9 in this run, per the result title).

Timed PHP Compilation 7.1.9 - Time To Compile (Seconds, Fewer Is Better)
-O3 -march=znver1: 78.13 (SE +/- 0.33, N = 3)
-O0: 15.19 (SE +/- 0.02, N = 3)
-O1: 29.05 (SE +/- 0.06, N = 3)
-O2: 52.17 (SE +/- 0.18, N = 3)
-O3: 78.19 (SE +/- 0.22, N = 3)
-O2 -march=znver1: 51.96 (SE +/- 0.25, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 52.58 (SE +/- 0.26, N = 3)
-Og: 21.42 (SE +/- 0.10, N = 3)
(CC) gcc options: -pedantic -ldl -lz -lm
Timed ImageMagick Compilation
This test times how long it takes to build ImageMagick.

Timed ImageMagick Compilation 6.9.0 - Time To Compile (Seconds, Fewer Is Better)
-O3 -march=znver1: 24.88 (SE +/- 0.30, N = 3)
-O0: 5.23 (SE +/- 0.10, N = 3)
-O1: 18.42 (SE +/- 0.21, N = 8)
-O2: 23.63 (SE +/- 0.10, N = 3)
-O3: 25.06 (SE +/- 0.44, N = 3)
-O2 -march=znver1: 23.78 (SE +/- 0.45, N = 3)
-Ofast -march=znver1: 25.21 (SE +/- 0.34, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 23.91 (SE +/- 0.20, N = 3)
-O3 -march=znver1 -flto: 118.48 (SE +/- 0.45, N = 3)
-O2 -flto: 98.67 (SE +/- 0.98, N = 3)
-Og: 7.89 (SE +/- 0.04, N = 3)
SVT-AV1 2019-02-15 - 1080p 8-bit YUV To AV1 Video Encode (Frames Per Second, More Is Better)
-O3 -march=znver1: 5.89 (SE +/- 0.04, N = 3)
-O0: 5.88 (SE +/- 0.02, N = 3)
-O1: 5.87 (SE +/- 0.04, N = 3)
-O2: 5.81 (SE +/- 0.03, N = 3)
-O3: 5.90 (SE +/- 0.04, N = 3)
-O2 -march=znver1: 5.84 (SE +/- 0.05, N = 3)
-Ofast -march=znver1: 5.91 (SE +/- 0.05, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 5.89 (SE +/- 0.02, N = 3)
-O3 -march=znver1 -flto: 5.84 (SE +/- 0.02, N = 3)
-O2 -flto: 5.90 (SE +/- 0.05, N = 3)
-Og: 5.86 (SE +/- 0.04, N = 3)
(CC) gcc options: -mavx -fPIE -fPIC -O2 -pie -lpthread -lm
SciMark
This test runs the ANSI C version of SciMark 2.0, a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. It is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks.

SciMark 2.0 - Computational Test: Composite (Mflops, More Is Better)
-O3 -march=znver1: 1961 (SE +/- 11.89, N = 3)
-O0: 434 (SE +/- 5.30, N = 3)
-O1: 1519 (SE +/- 6.18, N = 3)
-O2: 1369 (SE +/- 12.17, N = 3)
-O3: 1800 (SE +/- 7.96, N = 3)
-O2 -march=znver1: 1501 (SE +/- 23.45, N = 5)
-Ofast -march=znver1: 1825 (SE +/- 20.59, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1724 (SE +/- 8.94, N = 3)
-O3 -march=znver1 -flto: 1747 (SE +/- 35.09, N = 3)
-O2 -flto: 1307 (SE +/- 24.31, N = 3)
-Og: 1205 (SE +/- 18.65, N = 5)
(CC) gcc options: -lm
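As an illustration of one of the kernel types named in the SciMark description, the sketch below estimates pi by Monte Carlo integration of a quarter circle. It assumes the common textbook formulation and is not SciMark's own source; the function name and seed handling are placeholders.

```python
import random


def monte_carlo_pi(samples, seed=0):
    """Estimate pi by sampling points in the unit square.

    The fraction of points falling inside the quarter circle of radius 1
    approximates pi / 4.
    """
    rng = random.Random(seed)
    inside = 0
    for _ in range(samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            inside += 1
    return 4.0 * inside / samples


if __name__ == "__main__":
    print(monte_carlo_pi(100_000))
```

A loop like this is dominated by random number generation and a few floating-point operations per sample, so it responds differently to compiler flags than the memory-bound kernels in the same suite.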
FLAC Audio Encoding
This test times how long it takes to encode a sample WAV file to FLAC format five times.

FLAC Audio Encoding 1.3.2 - WAV To FLAC (Seconds, Fewer Is Better)
-O3 -march=znver1: 13.85 (SE +/- 0.10, N = 5)
-O0: 96.77 (SE +/- 0.12, N = 5)
-O1: 15.01 (SE +/- 0.09, N = 5)
-O2: 13.65 (SE +/- 0.08, N = 5)
-O3: 13.61 (SE +/- 0.12, N = 5)
-O2 -march=znver1: 13.89 (SE +/- 0.09, N = 5)
-Ofast -march=znver1: 13.95 (SE +/- 0.08, N = 5)
-O2 -ftree-vectorize -ftree-slp-vectorize: 13.70 (SE +/- 0.14, N = 5)
-O3 -march=znver1 -flto: 14.21 (SE +/- 0.10, N = 5)
-O2 -flto: 13.64 (SE +/- 0.11, N = 5)
-Og: 15.58 (SE +/- 0.11, N = 5)
(CXX) g++ options: -fvisibility=hidden -logg -lm
C-Ray
This is a test of C-Ray, a simple raytracer designed to test floating-point CPU performance. This test is multi-threaded (16 threads per core); per the result title below, this run renders a 4K image with 16 rays per pixel for anti-aliasing.

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
-O3 -march=znver1: 11.35 (SE +/- 0.04, N = 3)
-O0: 44.92 (SE +/- 0.09, N = 3)
-O1: 28.74 (SE +/- 0.02, N = 3)
-O2: 25.84 (SE +/- 0.00, N = 3)
-O3: 12.60 (SE +/- 0.04, N = 3)
-O2 -march=znver1: 21.58 (SE +/- 0.04, N = 3)
-Ofast -march=znver1: 10.40 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 25.77 (SE +/- 0.06, N = 3)
-O3 -march=znver1 -flto: 11.31 (SE +/- 0.03, N = 3)
-O2 -flto: 25.96 (SE +/- 0.02, N = 3)
-Og: 28.64 (SE +/- 0.04, N = 3)
(CC) gcc options: -lm -lpthread -O3
Timed Apache Compilation
This test times how long it takes to build the Apache HTTP Server.

Timed Apache Compilation 2.4.7 - Time To Compile (Seconds, Fewer Is Better)
-O3 -march=znver1: 25.94 (SE +/- 0.08, N = 3)
-O0: 11.43 (SE +/- 0.02, N = 3)
-O1: 17.51 (SE +/- 0.06, N = 3)
-O2: 23.82 (SE +/- 0.09, N = 3)
-O3: 26.08 (SE +/- 0.04, N = 3)
-O2 -march=znver1: 23.82 (SE +/- 0.04, N = 3)
-Ofast -march=znver1: 26.11 (SE +/- 0.08, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 24.03 (SE +/- 0.01, N = 3)
-O3 -march=znver1 -flto: 28.62 (SE +/- 0.03, N = 3)
-O2 -flto: 26.50 (SE +/- 0.03, N = 3)
-Og: 14.59 (SE +/- 0.03, N = 3)
x265
This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file.

x265 3.0 - H.265 1080p Video Encoding (Frames Per Second, More Is Better)
-O3 -march=znver1: 35.57 (SE +/- 0.18, N = 3)
-O0: 35.00 (SE +/- 0.67, N = 3)
-O1: 35.62 (SE +/- 0.37, N = 11)
-O2: 34.55 (SE +/- 0.24, N = 3)
-O3: 35.21 (SE +/- 0.41, N = 3)
-O2 -march=znver1: 34.80 (SE +/- 0.09, N = 3)
-Ofast -march=znver1: 34.91 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 35.41 (SE +/- 0.46, N = 3)
-O2 -flto: 35.07 (SE +/- 0.38, N = 3)
-Og: 34.76 (SE +/- 0.58, N = 3)
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
LAME MP3 Encoding
LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format.

LAME MP3 Encoding 3.100 - WAV To MP3 (Seconds, Fewer Is Better)
-O3 -march=znver1: 10.57 (SE +/- 0.00, N = 3)
-O0: 41.79 (SE +/- 0.00, N = 3)
-O1: 14.32 (SE +/- 0.00, N = 3)
-O2: 14.07 (SE +/- 0.01, N = 3)
-O3: 10.84 (SE +/- 0.00, N = 3)
-O2 -march=znver1: 14.00 (SE +/- 0.01, N = 3)
-Ofast -march=znver1: 9.80 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 10.96 (SE +/- 0.01, N = 3)
-O3 -march=znver1 -flto: 10.38 (SE +/- 0.00, N = 3)
-O2 -flto: 14.14 (SE +/- 0.01, N = 3)
-Og: 16.78 (SE +/- 0.01, N = 3)
(CC) gcc options: -lm
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, SVT-VP9 2019-02-17, 1080p 8-bit YUV To VP9 Video Encode (Frames Per Second, More Is Better)
-O2 -march=znver1: 95.91 (SE +/- 0.18, N = 3)
-Ofast -march=znver1: 97.80 (SE +/- 0.30, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 94.82 (SE +/- 1.09, N = 3)
-O3 -march=znver1 -flto: 97.26 (SE +/- 0.50, N = 3)
-O2 -flto: 95.79 (SE +/- 0.75, N = 3)
-Og: 92.68 (SE +/- 1.17, N = 3)
1. (CC) gcc options: -O2 -fPIE -fPIC -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
Timed HMMer Search
This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, Timed HMMer Search 2.3.2, Pfam Database Search (Seconds, Fewer Is Better)
-O3 -march=znver1: 6.29 (SE +/- 0.07, N = 3)
-O0: 9.02 (SE +/- 0.14, N = 3)
-O1: 6.93 (SE +/- 0.06, N = 3)
-O2: 6.62 (SE +/- 0.01, N = 3)
-O3: 6.57 (SE +/- 0.02, N = 3)
-O2 -march=znver1: 6.54 (SE +/- 0.05, N = 3)
-Ofast -march=znver1: 6.00 (SE +/- 0.04, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 6.82 (SE +/- 0.04, N = 3)
-O3 -march=znver1 -flto: 6.16 (SE +/- 0.04, N = 3)
-O2 -flto: 6.56 (SE +/- 0.05, N = 3)
-Og: 7.39 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -pthread -lhmmer -lsquid -lm
libjpeg-turbo tjbench
tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, libjpeg-turbo tjbench 1.5.3, Test: Decompression Throughput (Megapixels/sec, More Is Better)
-O3 -march=znver1: 144 (SE +/- 0.06, N = 3)
-O0: 111 (SE +/- 0.76, N = 3)
-O1: 139 (SE +/- 0.03, N = 3)
-O2: 140 (SE +/- 0.67, N = 3)
-O3: 141 (SE +/- 0.03, N = 3)
-O2 -march=znver1: 142 (SE +/- 0.91, N = 3)
-Ofast -march=znver1: 144 (SE +/- 0.07, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 139 (SE +/- 0.06, N = 3)
-O3 -march=znver1 -flto: 144 (SE +/- 0.10, N = 3)
-O2 -flto: 140 (SE +/- 0.03, N = 3)
-Og: 141 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -lm
Bullet Physics Engine
This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: Raytests (Seconds, Fewer Is Better)
-O3 -march=znver1: 3.09 (SE +/- 0.00, N = 3)
-O0: 3.11 (SE +/- 0.00, N = 3)
-O1: 3.12 (SE +/- 0.00, N = 3)
-O2: 3.12 (SE +/- 0.00, N = 3)
-O3: 3.11 (SE +/- 0.00, N = 3)
-O2 -march=znver1: 3.10 (SE +/- 0.00, N = 3)
-Ofast -march=znver1: 3.09 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 3.11 (SE +/- 0.00, N = 3)
-O2 -flto: 3.05 (SE +/- 0.00, N = 3)
-Og: 3.11 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
x264
This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
-O3 -march=znver1: 144 (SE +/- 1.78, N = 3)
-O0: 102 (SE +/- 0.12, N = 3)
-O1: 145 (SE +/- 1.49, N = 3)
-O2: 144 (SE +/- 0.97, N = 3)
-O3: 147 (SE +/- 0.81, N = 3)
-O2 -march=znver1: 144 (SE +/- 1.40, N = 3)
-Ofast -march=znver1: 144 (SE +/- 0.52, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 144 (SE +/- 0.47, N = 3)
-Og: 142 (SE +/- 1.09, N = 3)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
TSCP
This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
-O3 -march=znver1: 865732 (SE +/- 667.00, N = 5)
-O0: 865459 (SE +/- 333.13, N = 5)
-O1: 864102 (SE +/- 542.88, N = 5)
-O2: 864373 (SE +/- 507.80, N = 5)
-O3: 864915 (SE +/- 272.00, N = 5)
-O2 -march=znver1: 864915 (SE +/- 272.00, N = 5)
-Ofast -march=znver1: 864373 (SE +/- 507.80, N = 5)
-O2 -ftree-vectorize -ftree-slp-vectorize: 864916 (SE +/- 508.06, N = 5)
-O3 -march=znver1 -flto: 863018 (SE +/- 270.20, N = 5)
-O2 -flto: 864101 (SE +/- 331.91, N = 5)
-Og: 865187 (SE +/- 333.13, N = 5)
1. (CC) gcc options: -O3 -march=native
ctx_clock
Ctx_clock is a simple test program to measure the context switch time in clock cycles. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, ctx_clock, Context Switch Time (Clocks, Fewer Is Better)
-O2 -ftree-vectorize -ftree-slp-vectorize: 132
-O3 -march=znver1 -flto: 132
-O2 -flto: 132
-Og: 132
1. (CC) gcc options:
Bullet Physics Engine
This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: Convex Trimesh (Seconds, Fewer Is Better)
-O3 -march=znver1: 1.32 (SE +/- 0.00, N = 3)
-O0: 1.35 (SE +/- 0.00, N = 3)
-O1: 1.35 (SE +/- 0.00, N = 3)
-O2: 1.35 (SE +/- 0.00, N = 3)
-O3: 1.35 (SE +/- 0.00, N = 3)
-O2 -march=znver1: 1.32 (SE +/- 0.00, N = 3)
-Ofast -march=znver1: 1.32 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1.35 (SE +/- 0.00, N = 3)
-O2 -flto: 1.33 (SE +/- 0.00, N = 3)
-Og: 1.35 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
-O3 -march=znver1: 1.11 (SE +/- 0.00, N = 3)
-O0: 1.11 (SE +/- 0.00, N = 3)
-O1: 1.11 (SE +/- 0.00, N = 3)
-O2: 1.11 (SE +/- 0.00, N = 3)
-O3: 1.11 (SE +/- 0.00, N = 3)
-O2 -march=znver1: 1.11 (SE +/- 0.00, N = 3)
-Ofast -march=znver1: 1.11 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1.11 (SE +/- 0.00, N = 3)
-O2 -flto: 1.09 (SE +/- 0.00, N = 3)
-Og: 1.11 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
-O3 -march=znver1: 3.06 (SE +/- 0.00, N = 3)
-O0: 3.14 (SE +/- 0.00, N = 3)
-O1: 3.15 (SE +/- 0.01, N = 3)
-O2: 3.14 (SE +/- 0.00, N = 3)
-O3: 3.15 (SE +/- 0.01, N = 3)
-O2 -march=znver1: 3.05 (SE +/- 0.00, N = 3)
-Ofast -march=znver1: 3.06 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 3.15 (SE +/- 0.00, N = 3)
-O2 -flto: 3.24 (SE +/- 0.00, N = 3)
-Og: 3.14 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
-O3 -march=znver1: 5.18 (SE +/- 0.00, N = 3)
-O0: 5.36 (SE +/- 0.00, N = 3)
-O1: 5.37 (SE +/- 0.00, N = 3)
-O2: 5.37 (SE +/- 0.00, N = 3)
-O3: 5.38 (SE +/- 0.01, N = 3)
-O2 -march=znver1: 5.19 (SE +/- 0.01, N = 3)
-Ofast -march=znver1: 5.18 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 5.37 (SE +/- 0.00, N = 3)
-O2 -flto: 5.40 (SE +/- 0.01, N = 3)
-Og: 5.37 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
-O3 -march=znver1: 5.80 (SE +/- 0.01, N = 3)
-O0: 6.00 (SE +/- 0.01, N = 3)
-O1: 6.00 (SE +/- 0.01, N = 3)
-O2: 6.01 (SE +/- 0.00, N = 3)
-O3: 6.05 (SE +/- 0.04, N = 3)
-O2 -march=znver1: 5.80 (SE +/- 0.01, N = 3)
-Ofast -march=znver1: 5.80 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 5.98 (SE +/- 0.01, N = 3)
-O2 -flto: 6.32 (SE +/- 0.02, N = 3)
-Og: 6.02 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org, Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
-O3 -march=znver1: 5.07 (SE +/- 0.01, N = 3)
-O0: 5.14 (SE +/- 0.00, N = 3)
-O1: 5.16 (SE +/- 0.01, N = 3)
-O2: 5.14 (SE +/- 0.00, N = 3)
-O3: 5.16 (SE +/- 0.02, N = 3)
-O2 -march=znver1: 5.08 (SE +/- 0.00, N = 3)
-Ofast -march=znver1: 5.09 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 5.14 (SE +/- 0.00, N = 3)
-O2 -flto: 5.21 (SE +/- 0.01, N = 3)
-Og: 5.15 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
SciMark
This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
-O3 -march=znver1: 1689 (SE +/- 0.21, N = 3)
-O0: 832 (SE +/- 0.35, N = 3)
-O1: 919 (SE +/- 0.03, N = 3)
-O2: 919 (SE +/- 0.06, N = 3)
-O3: 1427 (SE +/- 0.15, N = 3)
-O2 -march=znver1: 1016 (SE +/- 0.06, N = 3)
-Ofast -march=znver1: 1676 (SE +/- 0.12, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 919 (SE +/- 0.06, N = 3)
-O3 -march=znver1 -flto: 1675 (SE +/- 0.10, N = 3)
-O2 -flto: 918 (SE +/- 0.08, N = 3)
-Og: 919 (SE +/- 0.31, N = 3)
1. (CC) gcc options: -lm
OpenBenchmarking.org, SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
-O3 -march=znver1: 4851 (SE +/- 65.35, N = 3)
-O0: 512 (SE +/- 30.37, N = 3)
-O1: 3466 (SE +/- 32.13, N = 3)
-O2: 2609 (SE +/- 54.41, N = 3)
-O3: 4307 (SE +/- 42.05, N = 3)
-O2 -march=znver1: 3231 (SE +/- 173.68, N = 3)
-Ofast -march=znver1: 4089 (SE +/- 107.53, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 4396 (SE +/- 57.07, N = 3)
-O3 -march=znver1 -flto: 3300 (SE +/- 178.79, N = 3)
-O2 -flto: 2515 (SE +/- 129.41, N = 3)
-Og: 2539 (SE +/- 139.26, N = 3)
1. (CC) gcc options: -lm
OpenBenchmarking.org, SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
-O3 -march=znver1: 2482 (SE +/- 10.26, N = 3)
-O0: 516 (SE +/- 5.25, N = 3)
-O1: 2411 (SE +/- 59.22, N = 3)
-O2: 2527 (SE +/- 11.61, N = 3)
-O3: 2475 (SE +/- 3.37, N = 3)
-O2 -march=znver1: 2584 (SE +/- 14.53, N = 3)
-Ofast -march=znver1: 2579 (SE +/- 12.59, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 2515 (SE +/- 12.19, N = 3)
-O3 -march=znver1 -flto: 2052 (SE +/- 2.13, N = 3)
-O2 -flto: 2299 (SE +/- 5.41, N = 3)
-Og: 2188 (SE +/- 3.66, N = 3)
1. (CC) gcc options: -lm
OpenBenchmarking.org, SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
-O3 -march=znver1: 227 (SE +/- 0.03, N = 3)
-O0: 201 (SE +/- 1.00, N = 3)
-O1: 226 (SE +/- 0.44, N = 3)
-O2: 230 (SE +/- 0.61, N = 3)
-O3: 232 (SE +/- 0.51, N = 3)
-O2 -march=znver1: 229 (SE +/- 1.88, N = 3)
-Ofast -march=znver1: 221 (SE +/- 1.04, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 231 (SE +/- 0.71, N = 3)
-O3 -march=znver1 -flto: 230 (SE +/- 1.03, N = 3)
-O2 -flto: 232 (SE +/- 2.57, N = 3)
-Og: 257 (SE +/- 2.03, N = 3)
1. (CC) gcc options: -lm
OpenBenchmarking.org, SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
-O3 -march=znver1: 557 (SE +/- 0.03, N = 3)
-O0: 108 (SE +/- 0.03, N = 3)
-O1: 576 (SE +/- 0.27, N = 3)
-O2: 560 (SE +/- 0.03, N = 3)
-O3: 560 (SE +/- 0.11, N = 3)
-O2 -march=znver1: 557 (SE +/- 0.02, N = 3)
-Ofast -march=znver1: 561 (SE +/- 0.06, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 560 (SE +/- 0.22, N = 3)
-O3 -march=znver1 -flto: 1480 (SE +/- 0.33, N = 3)
-O2 -flto: 568 (SE +/- 0.09, N = 3)
-Og: 210 (SE +/- 0.28, N = 3)
1. (CC) gcc options: -lm
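The five SciMark kernels can be folded into one figure per configuration. The sketch below assumes the composite is the unweighted arithmetic mean of the five kernel scores and recomputes it from the rounded Mflops values listed above, so it may differ slightly from any composite the benchmark itself reports. The function name `composite_scores` is hypothetical.

```python
# Configurations in the order used by the SciMark results above.
configs = [
    "-O3 -march=znver1", "-O0", "-O1", "-O2", "-O3",
    "-O2 -march=znver1", "-Ofast -march=znver1",
    "-O2 -ftree-vectorize -ftree-slp-vectorize",
    "-O3 -march=znver1 -flto", "-O2 -flto", "-Og",
]

# Mflops per kernel, one value per configuration, copied from the results above.
kernels = {
    "Jacobi SOR": [1689, 832, 919, 919, 1427, 1016, 1676, 919, 1675, 918, 919],
    "Dense LU": [4851, 512, 3466, 2609, 4307, 3231, 4089, 4396, 3300, 2515, 2539],
    "Sparse MatMult": [2482, 516, 2411, 2527, 2475, 2584, 2579, 2515, 2052, 2299, 2188],
    "FFT": [227, 201, 226, 230, 232, 229, 221, 231, 230, 232, 257],
    "Monte Carlo": [557, 108, 576, 560, 560, 557, 561, 560, 1480, 568, 210],
}


def composite_scores(configs, kernels):
    """Return the unweighted mean of all kernel scores for each configuration."""
    scores = {}
    for i, config in enumerate(configs):
        scores[config] = sum(values[i] for values in kernels.values()) / len(kernels)
    return scores


composite = composite_scores(configs, kernels)
# Print configurations from highest to lowest mean Mflops.
for config, score in sorted(composite.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{config:45s} {score:8.1f} Mflops")
```

Because Dense LU and Sparse Matrix Multiply produce the largest absolute numbers, an unweighted mean is dominated by those two kernels; a geometric mean would weight the kernels more evenly.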
-O3 -march=znver1
Environment Notes: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 15 February 2019 13:07 by user root.
-O0
Environment Notes: CXXFLAGS=-O0 CFLAGS=-O0
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 15 February 2019 20:24 by user root.
-O1
Environment Notes: CXXFLAGS=-O1 CFLAGS=-O1
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 16 February 2019 07:27 by user root.
-O2
Environment Notes: CXXFLAGS=-O2 CFLAGS=-O2
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 16 February 2019 15:25 by user root.
-O3
Environment Notes: CXXFLAGS=-O3 CFLAGS=-O3
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 16 February 2019 21:58 by user root.
-O2 -march=znver1
Environment Notes: CXXFLAGS="-O2 -march=znver1" CFLAGS="-O2 -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 17 February 2019 07:16 by user root.
-Ofast -march=znver1
Environment Notes: CXXFLAGS="-Ofast -march=znver1" CFLAGS="-Ofast -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 17 February 2019 14:18 by user root.
-O2 -ftree-vectorize -ftree-slp-vectorize
Environment Notes: CXXFLAGS="-O2 -ftree-vectorize -ftree-slp-vectorize" CFLAGS="-O2 -ftree-vectorize -ftree-slp-vectorize"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 18 February 2019 05:50 by user root.
-O3 -march=znver1 -flto
Environment Notes: CXXFLAGS="-O3 march=znver1 -flto" CFLAGS="-O3 -march=znver1 -flto"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 18 February 2019 12:40 by user root.
-O2 -flto
Environment Notes: CXXFLAGS="-O2 -flto" CFLAGS="-O2 -flto"
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 18 February 2019 20:32 by user root.
-Og
Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox G200eW3, Monitor: VE228, Network: 2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 18.04, Kernel: 5.0.0-050000rc6-generic (x86_64) 20190210, Desktop: GNOME Shell 3.28.3, Display Server: X Server, Compiler: GCC 9.0.1 20190210, File-System: ext4, Screen Resolution: 1600x1200
Environment Notes: CXXFLAGS=-Og CFLAGS=-Og
Compiler Notes: --disable-multilib --enable-checking=release
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
Testing initiated at 19 February 2019 05:54 by user root.