AMD Zen 2 GCC compiler benchmarks on Ubuntu Linux. Tests by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2003230-VE-1907117HV93
HTML result view exported from: https://openbenchmarking.org/result/2003230-VE-1907117HV93&rdt&grs .
Znver2 GCC9 Compiler Tests

System: -O3 -march=znver2, -O3 -march=znver1, -O3 -march=x86-64
  Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)
  Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS)
  Chipset: AMD Device 1480
  Memory: 16384MB
  Disk: 2000GB Force MP600
  Graphics: Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz)
  Audio: AMD Device aae0
  Monitor: ASUS VP28U
  Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
  OS: Ubuntu 18.04
  Kernel: 5.2.0-999-generic (x86_64) 20190703
  Desktop: GNOME Shell 3.28.3
  Display Server: X Server 1.20.1
  Display Driver: modesetting 1.20.1
  OpenGL: 4.5 Mesa 18.2.2 (LLVM 7.0.0)
  Compiler: GCC 9.1.0
  File-System: ext4
  Screen Resolution: 3840x2160

System: ryzen3800x4200
  Processor: AMD Ryzen 7 3800X 8-Core @ 4.20GHz (8 Cores / 16 Threads)
  Motherboard: ASUS TUF GAMING X570-PLUS_BR (1405 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 16GB
  Disk: 480GB Force MP510 + 320GB SAMSUNG HD322HJ
  Graphics: NVIDIA GeForce GTX 960 2GB (1126/3505MHz)
  Audio: NVIDIA GM206 HD Audio
  Network: Realtek RTL8111/8168/8411 + Broadcom BCM4360 802.11ac
  OS: Ubuntu 20.04
  Kernel: 5.4.0-18-generic (x86_64)
  Desktop: MATE 1.24.0
  Display Server: X Server 1.20.7
  Display Driver: NVIDIA 435.21
  OpenGL: 4.6.0
  Compiler: GCC 9.3.0
  Screen Resolution: 1920x1080

Environment Details
  - -O3 -march=znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"
  - -O3 -march=znver1: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"
  - -O3 -march=x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"

Compiler Details
  - -O3 -march=znver2: --disable-multilib --enable-checking=release
  - -O3 -march=znver1: --disable-multilib --enable-checking=release
  - -O3 -march=x86-64: --disable-multilib --enable-checking=release
  - ryzen3800x4200: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - -O3 -march=znver2: Scaling Governor: acpi-cpufreq ondemand
  - -O3 -march=znver1: Scaling Governor: acpi-cpufreq ondemand
  - -O3 -march=x86-64: Scaling Governor: acpi-cpufreq ondemand
  - ryzen3800x4200: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013

Python Details
  - -O3 -march=znver2: Python 2.7.15+ + Python 3.6.8
  - -O3 -march=znver1: Python 2.7.15+ + Python 3.6.8
  - -O3 -march=x86-64: Python 2.7.15+ + Python 3.6.8
  - ryzen3800x4200: Python 3.8.2

Security Details
  - -O3 -march=znver2: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled
  - -O3 -march=znver1: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled
  - -O3 -march=x86-64: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled
  - ryzen3800x4200: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected
Znver2 GCC9 Compiler Tests: result overview

-O3 -march=znver2  -O3 -march=znver1  -O3 -march=x86-64  ryzen3800x4200  Test
2524.93            2562.89            2512.04            331.468         mkl-dnn: Convolution Batch conv_alexnet - f32
216.70             219.29             221.00             31.0087         mkl-dnn: Deconvolution Batch deconv_1d - f32
3700.64            3128.65            2786.33            618.82          scimark2: Composite
799.07             757.56             766.81             153.28          scimark2: Monte Carlo
3575.96            3702.03            3762.43            750.46          scimark2: Sparse Matrix Multiply
7.04               6.98               7.16               26.537          encode-mp3: WAV To MP3
20232              28221              28401              13993           john-the-ripper: Blowfish
11431.93           8631.93            6959.69            607.72          scimark2: Dense LU Matrix Factorization
2422.10            2291.62            2145.57            1259.58         scimark2: Jacobi Successive Over-Relaxation
30044.32           29380.50           30143.80           16722.385240    pgbench: Buffer Test - Normal Load - Read Write
39.46              39.42              43.20              60.317          c-ray: Total Time - 4K, 16 Rays Per Pixel
286.56             284.24             281.90             415.041         build-llvm: Time To Compile
382751.07          383329.30          385510.95          264125.566092   pgbench: Buffer Test - Normal Load - Read Only
39471726           39908751           39537930           27465891        stockfish: Total Time
78562              78412              78655              54535           compress-7zip: Compress Speed Test
140.96             141.82             143.27             101.97          x264: H.264 Video Encoding
11231              11448              9534.70            8510.6          fftw: Stock - 1D FFT Size 4096
8001.67            7660.90            7039.43            5968.3          fftw: Stock - 2D FFT Size 4096
25.21              25.09              25.22              32.865          compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
194                193                179                156             graphics-magick: Sharpen
274.11             260.12             297.13             323.04          scimark2: Fast Fourier Transform
110755.89          112447.77          107814.85          94377.5         mcperf: Get
52.91              53.15              53.33              45.82           x265: H.265 1080p Video Encoding
175.34             174.40             175.99             202.41          vpxenc: vpxenc VP9 1080p Video Encode
34.64              35.14              36.49              32.233          aobench: 2048 x 2048 - Total Time
309.02             311.77             312.01             281.879         cpp-perf-bench: Math Library
3.79               3.98               4.08               4.189241        bullet: 1000 Stack
2089609.47         2080013.12         2074827.24         1893390.85      redis: SET
56652              51757              -                  54956           fftw: Float + SSE - 1D FFT Size 4096
19960              19405              -                  18240           fftw: Float + SSE - 2D FFT Size 4096
2.05               2.13               2.17               2.229157        bullet: 136 Ragdolls
3.23               3.36               3.37               3.509403        bullet: 3000 Fall
1321681            1372148            1333926            1271423         tscp: AI Chess Performance
3.59               3.73               3.70               3.862570        bullet: 1000 Convex
14.56              14.99              14.90              15.629          cpp-perf-bench: Function Objects
276                260                261                261             graphics-magick: Rotate
1347.96            1345.95            1336.77            1274.463139     himeno: Poisson Pressure Solver
285                279                270                276             graphics-magick: Resizing
8.12               8.15               7.75               7.922           encode-flac: WAV To FLAC
2.06               2.13               2.09               2.154750        bullet: Raytests
53.41              53.44              52.89              54.847          build-php: Time To Compile
69121.64           60810.38           60770.80           63869.3         mcperf: Set
3090850.69         3126726.23         3026826.32         2695667.50      redis: GET
158.46             159.95             152.36             22.6361         mkl-dnn: IP Batch 1D - f32
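The overview mixes "More Is Better" and "Fewer Is Better" metrics, so raw differences between columns are not directly comparable. The following is a minimal Python sketch of one way to express a tuned build's gain over the -O3 -march=x86-64 baseline as a percentage. The function name `relative_gain` and the `SAMPLES` dictionary are hypothetical helpers for illustration and are not part of the Phoronix Test Suite; the numbers are copied from this result file, and the direction of each metric is taken from its per-test section.

```python
# Hypothetical helper, not part of the Phoronix Test Suite.
# Each entry: (direction, -O3 -march=znver2 result, -O3 -march=x86-64 result),
# with values copied from this result file.
SAMPLES = {
    "scimark2: Composite": ("more", 3700.64, 2786.33),
    "scimark2: Dense LU Matrix Factorization": ("more", 11431.93, 6959.69),
    "c-ray: Total Time - 4K, 16 Rays Per Pixel": ("fewer", 39.46, 43.20),
    "john-the-ripper: Blowfish": ("more", 20232.0, 28401.0),
}


def relative_gain(direction, tuned, baseline):
    """Return the tuned build's gain over the baseline in percent.

    A positive value means the tuned build is better, whichever way
    the metric points.
    """
    if direction == "more":
        ratio = tuned / baseline
    elif direction == "fewer":
        ratio = baseline / tuned
    else:
        raise ValueError("direction must be 'more' or 'fewer'")
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    for name, (direction, tuned, baseline) in SAMPLES.items():
        print(f"{name}: {relative_gain(direction, tuned, baseline):+.1f}%")
```

Normalizing the ratio by direction keeps the sign convention uniform, so a negative figure (as for the John The Ripper result here) always means the tuned build was slower.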
MKL-DNN 2019-04-16 - Harness: Convolution Batch conv_alexnet - Data Type: f32
ms, Fewer Is Better
  -O3 -march=znver2: 2524.93 (SE +/- 12.02, N = 3; MIN: 2478.96; flags: -march=znver2)
  -O3 -march=znver1: 2562.89 (SE +/- 19.59, N = 3; MIN: 2476.4; flags: -march=znver1)
  -O3 -march=x86-64: 2512.04 (SE +/- 11.23, N = 3; MIN: 2466.12; flags: -march=x86-64)
  ryzen3800x4200: 331.47 (SE +/- 0.83, N = 3; MIN: 327.27)
  1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN 2019-04-16 - Harness: Deconvolution Batch deconv_1d - Data Type: f32
ms, Fewer Is Better
  -O3 -march=znver2: 216.70 (SE +/- 2.63, N = 6; MIN: 202.77; flags: -march=znver2)
  -O3 -march=znver1: 219.29 (SE +/- 1.89, N = 11; MIN: 203.58; flags: -march=znver1)
  -O3 -march=x86-64: 221.00 (SE +/- 2.51, N = 3; MIN: 202.64; flags: -march=x86-64)
  ryzen3800x4200: 31.01 (SE +/- 0.18, N = 3; MIN: 29.38)
  1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
SciMark 2.0 - Computational Test: Composite
Mflops, More Is Better
  -O3 -march=znver2: 3700.64 (SE +/- 7.82, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 3128.65 (SE +/- 7.70, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 2786.33 (SE +/- 27.49, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 618.82 (SE +/- 1.56, N = 3)
  1. (CC) gcc options: -lm
SciMark 2.0 - Computational Test: Monte Carlo
Mflops, More Is Better
  -O3 -march=znver2: 799.07 (SE +/- 1.53, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 757.56 (SE +/- 0.16, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 766.81 (SE +/- 7.83, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 153.28 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -lm
SciMark 2.0 - Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
  -O3 -march=znver2: 3575.96 (SE +/- 48.59, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 3702.03 (SE +/- 48.59, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 3762.43 (SE +/- 48.28, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 750.46 (SE +/- 10.72, N = 3)
  1. (CC) gcc options: -lm
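Each result in this file is reported as a mean with a standard error (SE) over N runs. As a rough way to judge whether two configurations really differ, the gap between their means can be expressed in units of the combined standard error. The sketch below is a hypothetical helper (the name `gap_in_se_units` is an assumption, not a Phoronix Test Suite function), and with N = 3 it is only a coarse heuristic, not a formal significance test.

```python
import math


def gap_in_se_units(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| divided by the combined standard error.

    The combined SE assumes the two sets of runs are independent:
    sqrt(se_a^2 + se_b^2).
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0:
        raise ValueError("combined standard error is zero")
    return abs(mean_a - mean_b) / combined


if __name__ == "__main__":
    # SciMark Composite, znver2 vs znver1 (values from this result file).
    print(gap_in_se_units(3700.64, 7.82, 3128.65, 7.70))
    # SciMark Sparse Matrix Multiply, znver2 vs znver1.
    print(gap_in_se_units(3575.96, 48.59, 3702.03, 48.59))
```

For the Composite score the gap is dozens of combined standard errors, while for Sparse Matrix Multiply it is under two, so the latter difference is much less clearly separated from run-to-run noise.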
LAME MP3 Encoding 3.100 - WAV To MP3
Seconds, Fewer Is Better
  -O3 -march=znver2: 7.040 (SE +/- 0.121, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 6.980 (SE +/- 0.088, N = 4; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 7.160 (SE +/- 0.070, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 26.537 (SE +/- 0.005, N = 3)
  1. (CC) gcc options: -lncurses -lm
John The Ripper 1.9.0-jumbo-1 - Test: Blowfish
Real C/S, More Is Better
  -O3 -march=znver2: 20232 (SE +/- 50.35, N = 3)
  -O3 -march=znver1: 28221 (SE +/- 51.72, N = 3)
  -O3 -march=x86-64: 28401 (SE +/- 61.75, N = 3)
  ryzen3800x4200: 13993 (SE +/- 8.95, N = 3)
  Per-result flags as exported: -lbz2 -lbz2 -lbz2
  1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
SciMark 2.0 - Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better
  -O3 -march=znver2: 11431.93 (SE +/- 10.78, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 8631.93 (SE +/- 25.99, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 6959.69 (SE +/- 73.09, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 607.72 (SE +/- 4.26, N = 3)
  1. (CC) gcc options: -lm
SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
  -O3 -march=znver2: 2422.10 (SE +/- 0.28, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 2291.62 (SE +/- 0.13, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 2145.57 (SE +/- 20.09, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 1259.58 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -lm
PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write
TPS, More Is Better
  -O3 -march=znver2: 30044.32 (SE +/- 197.60, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 29380.50 (SE +/- 361.55, N = 15; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 30143.80 (SE +/- 169.86, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 16722.39 (SE +/- 14.38, N = 3; flags: -O2)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better
  -O3 -march=znver2: 39.46 (SE +/- 0.04, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 39.42 (SE +/- 0.02, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 43.20 (SE +/- 0.03, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 60.32 (SE +/- 0.02, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3
Timed LLVM Compilation 6.0.1 - Time To Compile
Seconds, Fewer Is Better
  -O3 -march=znver2: 286.56
  -O3 -march=znver1: 284.24
  -O3 -march=x86-64: 281.90
  ryzen3800x4200: 415.04
PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Only
TPS, More Is Better
  -O3 -march=znver2: 382751.07 (SE +/- 738.26, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 383329.30 (SE +/- 539.80, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 385510.95 (SE +/- 837.27, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 264125.57 (SE +/- 844.67, N = 3; flags: -O2)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
Stockfish 9 - Total Time
Nodes Per Second, More Is Better
  -O3 -march=znver2: 39471726 (SE +/- 265193.75, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 39908751 (SE +/- 208989.61, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 39537930 (SE +/- 75524.60, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 27465891 (SE +/- 251488.84, N = 3)
  1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
7-Zip Compression 16.02 - Compress Speed Test
MIPS, More Is Better
  -O3 -march=znver2: 78562 (SE +/- 205.86, N = 3)
  -O3 -march=znver1: 78412 (SE +/- 193.52, N = 3)
  -O3 -march=x86-64: 78655 (SE +/- 419.64, N = 3)
  ryzen3800x4200: 54535 (SE +/- 103.41, N = 3)
  1. (CXX) g++ options: -pipe -lpthread
x264 2018-09-25 - H.264 Video Encoding
Frames Per Second, More Is Better
  -O3 -march=znver2: 140.96 (SE +/- 0.93, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 141.82 (SE +/- 0.82, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 143.27 (SE +/- 0.64, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 101.97 (SE +/- 0.09, N = 3)
  1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096
Mflops, More Is Better
  -O3 -march=znver2: 11231.00 (SE +/- 123.02, N = 3)
  -O3 -march=znver1: 11448.00 (SE +/- 53.59, N = 3)
  -O3 -march=x86-64: 9534.70 (SE +/- 11.52, N = 3)
  ryzen3800x4200: 8510.60 (SE +/- 2.07, N = 3)
  Per-result flags as exported: -march=znver2 -march=znver1 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  1. (CC) gcc options: -pthread -O3 -lm
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096
Mflops, More Is Better
  -O3 -march=znver2: 8001.67 (SE +/- 28.05, N = 3)
  -O3 -march=znver1: 7660.90 (SE +/- 34.96, N = 3)
  -O3 -march=x86-64: 7039.43 (SE +/- 7.75, N = 3)
  ryzen3800x4200: 5968.30 (SE +/- 9.60, N = 3)
  Per-result flags as exported: -march=znver2 -march=znver1 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  1. (CC) gcc options: -pthread -O3 -lm
XZ Compression 5.2.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
Seconds, Fewer Is Better
  -O3 -march=znver2: 25.21 (SE +/- 0.10, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 25.09 (SE +/- 0.12, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 25.22 (SE +/- 0.05, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 32.87 (SE +/- 0.03, N = 3; flags: -O2)
  1. (CC) gcc options: -pthread -fvisibility=hidden
GraphicsMagick 1.3.30 - Operation: Sharpen
Iterations Per Minute, More Is Better
  -O3 -march=znver2: 194 (flags: -O3 -march=znver2 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  -O3 -march=znver1: 193 (flags: -O3 -march=znver1 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  -O3 -march=x86-64: 179 (flags: -O3 -march=x86-64 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  ryzen3800x4200: 156 (flags: -O2 -lgomp)
  Standard errors as exported (not attributed to a configuration): SE +/- 0.58, N = 3
  1. (CC) gcc options: -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread
SciMark 2.0 - Computational Test: Fast Fourier Transform
Mflops, More Is Better
  -O3 -march=znver2: 274.11 (SE +/- 0.15, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 260.12 (SE +/- 0.04, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 297.13 (SE +/- 3.16, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 323.04 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -lm
Memcached mcperf 1.5.10 - Method: Get
Operations Per Second, More Is Better
  -O3 -march=znver2: 110755.89 (SE +/- 1084.23, N = 15; flags: -O3 -march=znver2)
  -O3 -march=znver1: 112447.77 (SE +/- 1283.75, N = 15; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 107814.85 (SE +/- 879.11, N = 15; flags: -O3 -march=x86-64)
  ryzen3800x4200: 94377.50 (SE +/- 703.67, N = 14; flags: -O2)
  1. (CC) gcc options: -lm -rdynamic
x265 3.0 - H.265 1080p Video Encoding
Frames Per Second, More Is Better
  -O3 -march=znver2: 52.91 (SE +/- 0.11, N = 3; flags: -march=znver2 -lnuma)
  -O3 -march=znver1: 53.15 (SE +/- 0.12, N = 3; flags: -march=znver1 -lnuma)
  -O3 -march=x86-64: 53.33 (SE +/- 0.08, N = 3; flags: -march=x86-64 -lnuma)
  ryzen3800x4200: 45.82 (SE +/- 0.28, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
VP9 libvpx Encoding 1.8.0 - vpxenc VP9 1080p Video Encode
Frames Per Second, More Is Better
  -O3 -march=znver2: 175.34 (SE +/- 0.55, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 174.40 (SE +/- 0.98, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 175.99 (SE +/- 0.86, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 202.41 (SE +/- 1.23, N = 3)
  1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
AOBench - Size: 2048 x 2048 - Total Time
Seconds, Fewer Is Better
  -O3 -march=znver2: 34.64 (SE +/- 0.12, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 35.14 (SE +/- 0.07, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 36.49 (SE +/- 0.32, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 32.23 (SE +/- 0.01, N = 3)
  1. (CC) gcc options: -lm -O3
CppPerformanceBenchmarks 9 - Test: Math Library
Seconds, Fewer Is Better
  -O3 -march=znver2: 309.02 (SE +/- 1.69, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 311.77 (SE +/- 5.00, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 312.01 (SE +/- 4.58, N = 4; flags: -march=x86-64)
  ryzen3800x4200: 281.88 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11
Bullet Physics Engine 2.81 - Test: 1000 Stack
Seconds, Fewer Is Better
  -O3 -march=znver2: 3.790000 (SE +/- 0.004049, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 3.980000 (SE +/- 0.033405, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 4.080000 (SE +/- 0.041553, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 4.189241 (SE +/- 0.002805, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Redis 4.0.8 - Test: SET
Requests Per Second, More Is Better
  -O3 -march=znver2: 2089609.47 (SE +/- 26820.33, N = 15)
  -O3 -march=znver1: 2080013.12 (SE +/- 26581.97, N = 15)
  -O3 -march=x86-64: 2074827.24 (SE +/- 28039.49, N = 15)
  ryzen3800x4200: 1893390.85 (SE +/- 24405.05, N = 15)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 4096
Mflops, More Is Better
  -O3 -march=znver2: 56652 (SE +/- 647.09, N = 3)
  -O3 -march=znver1: 51757 (SE +/- 455.36, N = 3)
  ryzen3800x4200: 54956 (SE +/- 114.46, N = 3)
  Per-result flags as exported: -march=znver2 -march=znver1 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  1. (CC) gcc options: -pthread -O3 -lm
FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096
Mflops, More Is Better
  -O3 -march=znver2: 19960 (SE +/- 71.31, N = 3)
  -O3 -march=znver1: 19405 (SE +/- 83.01, N = 3)
  ryzen3800x4200: 18240 (SE +/- 314.37, N = 3)
  Per-result flags as exported: -march=znver2 -march=znver1 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  1. (CC) gcc options: -pthread -O3 -lm
Bullet Physics Engine 2.81 - Test: 136 Ragdolls
Seconds, Fewer Is Better
  -O3 -march=znver2: 2.050000 (SE +/- 0.003680, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 2.130000 (SE +/- 0.015833, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 2.170000 (SE +/- 0.029906, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 2.229157 (SE +/- 0.000665, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81 - Test: 3000 Fall
Seconds, Fewer Is Better
  -O3 -march=znver2: 3.230000 (SE +/- 0.007900, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 3.360000 (SE +/- 0.026494, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 3.370000 (SE +/- 0.035000, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 3.509403 (SE +/- 0.003247, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
TSCP 1.81 - AI Chess Performance
Nodes Per Second, More Is Better
  -O3 -march=znver2: 1321681 (SE +/- 8868.66, N = 5; flags: -march=znver2)
  -O3 -march=znver1: 1372148 (SE +/- 1369.78, N = 5; flags: -march=znver1)
  -O3 -march=x86-64: 1333926 (SE +/- 10274.76, N = 5; flags: -march=x86-64)
  ryzen3800x4200: 1271423 (SE +/- 1439.32, N = 5)
  1. (CC) gcc options: -O3 -march=native
Bullet Physics Engine 2.81 - Test: 1000 Convex
Seconds, Fewer Is Better
  -O3 -march=znver2: 3.590000 (SE +/- 0.007030, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 3.730000 (SE +/- 0.032731, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 3.700000 (SE +/- 0.037882, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 3.862570 (SE +/- 0.000795, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
CppPerformanceBenchmarks 9 - Test: Function Objects
Seconds, Fewer Is Better
  -O3 -march=znver2: 14.56 (SE +/- 0.16, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 14.99 (SE +/- 0.01, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 14.90 (SE +/- 0.16, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 15.63 (SE +/- 0.02, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11
GraphicsMagick 1.3.30 - Operation: Rotate
Iterations Per Minute, More Is Better
  -O3 -march=znver2: 276 (flags: -O3 -march=znver2 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  -O3 -march=znver1: 260 (flags: -O3 -march=znver1 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  -O3 -march=x86-64: 261 (flags: -O3 -march=x86-64 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  ryzen3800x4200: 261 (flags: -O2 -lgomp)
  Standard errors as exported (not attributed to a configuration): SE +/- 1.33, N = 3; SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
  1. (CC) gcc options: -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread
Himeno Benchmark 3.0 - Poisson Pressure Solver
MFLOPS, More Is Better
  -O3 -march=znver2: 1347.96 (SE +/- 12.91, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 1345.95 (SE +/- 22.06, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 1336.77 (SE +/- 13.46, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 1274.46 (SE +/- 4.19, N = 3)
  1. (CC) gcc options: -O3 -mavx2
GraphicsMagick 1.3.30 - Operation: Resizing
Iterations Per Minute, More Is Better
  -O3 -march=znver2: 285 (flags: -O3 -march=znver2 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  -O3 -march=znver1: 279 (flags: -O3 -march=znver1 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  -O3 -march=x86-64: 270 (flags: -O3 -march=x86-64 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl)
  ryzen3800x4200: 276 (flags: -O2 -lgomp)
  Standard errors as exported (not attributed to a configuration): SE +/- 1.53, N = 3
  1. (CC) gcc options: -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread
FLAC Audio Encoding 1.3.2 - WAV To FLAC
Seconds, Fewer Is Better
  -O3 -march=znver2: 8.120 (SE +/- 0.060, N = 5; flags: -O3 -march=znver2 -logg)
  -O3 -march=znver1: 8.150 (SE +/- 0.038, N = 5; flags: -O3 -march=znver1 -logg)
  -O3 -march=x86-64: 7.750 (SE +/- 0.040, N = 5; flags: -O3 -march=x86-64 -logg)
  ryzen3800x4200: 7.922 (SE +/- 0.009, N = 5; flags: -O2)
  1. (CXX) g++ options: -fvisibility=hidden -lm
Bullet Physics Engine 2.81 - Test: Raytests
Seconds, Fewer Is Better
  -O3 -march=znver2: 2.060000 (SE +/- 0.002581, N = 3; flags: -march=znver2)
  -O3 -march=znver1: 2.130000 (SE +/- 0.018779, N = 3; flags: -march=znver1)
  -O3 -march=x86-64: 2.090000 (SE +/- 0.020748, N = 3; flags: -march=x86-64)
  ryzen3800x4200: 2.154750 (SE +/- 0.000756, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
Timed PHP Compilation 7.1.9 - Time To Compile
Seconds, Fewer Is Better
  -O3 -march=znver2: 53.41 (SE +/- 0.20, N = 3; flags: -O3 -march=znver2)
  -O3 -march=znver1: 53.44 (SE +/- 0.10, N = 3; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 52.89 (SE +/- 0.34, N = 3; flags: -O3 -march=x86-64)
  ryzen3800x4200: 54.85 (SE +/- 0.23, N = 3; flags: -O2)
  1. (CC) gcc options: -pedantic -ldl -lz -lm
Memcached mcperf 1.5.10 - Method: Set
Operations Per Second, More Is Better
  -O3 -march=znver2: 69121.64 (SE +/- 4868.10, N = 15; flags: -O3 -march=znver2)
  -O3 -march=znver1: 60810.38 (SE +/- 1488.06, N = 15; flags: -O3 -march=znver1)
  -O3 -march=x86-64: 60770.80 (SE +/- 807.63, N = 4; flags: -O3 -march=x86-64)
  ryzen3800x4200: 63869.30 (SE +/- 77.26, N = 3; flags: -O2)
  1. (CC) gcc options: -lm -rdynamic
Redis 4.0.8 - Test: GET
Requests Per Second, More Is Better
  -O3 -march=znver2: 3090850.69 (SE +/- 61045.92, N = 12)
  -O3 -march=znver1: 3126726.23 (SE +/- 62689.04, N = 13)
  -O3 -march=x86-64: 3026826.32 (SE +/- 58033.83, N = 15)
  ryzen3800x4200: 2695667.50 (SE +/- 18403.22, N = 3)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
MKL-DNN 2019-04-16 - Harness: IP Batch 1D - Data Type: f32
ms, Fewer Is Better
  -O3 -march=znver2: 158.46 (SE +/- 2.44, N = 15; MIN: 112.01; flags: -march=znver2)
  -O3 -march=znver1: 159.95 (SE +/- 2.27, N = 15; MIN: 108.37; flags: -march=znver1)
  -O3 -march=x86-64: 152.36 (SE +/- 3.65, N = 15; MIN: 109.43; flags: -march=x86-64)
  ryzen3800x4200: 22.64 (SE +/- 0.23, N = 3; MIN: 13.13)
  1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
Phoronix Test Suite v10.8.4