Tests for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012152-HA-GCCZNVER322 GCC Znver3 Compiler Optimization Levels - Phoronix Test Suite GCC Znver3 Compiler Optimization Levels Tests for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2012152-HA-GCCZNVER322&rdt&gru .
GCC Znver3 Compiler Optimization Levels Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz) AMD Navi 10 HDMI Audio ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.10.0-051000rc6daily20201205-generic (x86_64) 20201204 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0) 1.2.145 GCC 11.0.0 20201213 ext4 3840x2160 OpenBenchmarking.org Environment Details - -O3 -march=znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3" - -O3 -march=znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2" - -O3 -march=x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64" - -O3 -maech=znver3 -flto: CXXFLAGS="-O3 -march=znver3 -flto" CFLAGS="-O3 -march=znver3 -flto" - -Ofast -march=znver3 -flto: CXXFLAGS="-Ofast -march=znver3 -flto" CFLAGS="-Ofast -march=znver3 -flto" - -O2 -march=znver3 -flto: CXXFLAGS="-O2 -march=znver3 -flto" CFLAGS="-O2 -march=znver3 -flto" Compiler Details - --disable-multilib --enable-checking=release Disk Details - -O3 -march=znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 Python Details - -O3 -march=znver3: Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
GCC Znver3 Compiler Optimization Levels dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit aom-av1: Speed 4 Realtime aom-av1: Speed 5 Two-Pass aom-av1: Speed 8 Realtime kvazaar: Bosphorus 4K - Slow kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Slow kvazaar: Bosphorus 1080p - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast kvazaar: Bosphorus 1080p - Ultra Fast svt-av1: Enc Mode 4 - 1080p svt-av1: Enc Mode 8 - 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-vp9: Visual Quality Optimized - Bosphorus 1080p vpxenc: Speed 0 vpxenc: Speed 5 x264: H.264 Video Encoding x265: Bosphorus 4K x265: Bosphorus 1080p mt-dgemm: Sustained Floating-Point Rate graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space coremark: CoreMark Size 666 - Iterations Per Second aircrack-ng: compress-zstd: 3 compress-zstd: 19 fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 4096 scimark2: Composite scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test libraw: Post-Processing Benchmark crafty: Elapsed Time tscp: AI Chess Performance hint: FLOAT nginx: Static Web Page Serving apache: Static Web Page Serving openssl: RSA 4096-bit Performance daphne: OpenMP - NDT Mapping daphne: OpenMP - Points2Image webp: Quality 100 mrbayes: Primate Phylogeny Analysis hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA build-apache: Time To Compile build-ffmpeg: Time To Compile build-imagemagick: Time To Compile build-mplayer: Time To Compile build-php: Time To Compile c-ray: Total Time - 4K, 16 Rays Per Pixel smallpt: Global Illumination Renderer; 128 Samples aobench: 2048 x 2048 - Total Time bullet: Raytests bullet: 3000 Fall bullet: 1000 Stack bullet: 1000 Convex bullet: 136 Ragdolls bullet: Prim Trimesh bullet: Convex Trimesh compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 encode-flac: WAV To FLAC encode-mp3: WAV To MP3 rnnoise: tachyon: Total Time cpp-perf-bench: Atol cpp-perf-bench: Ctype cpp-perf-bench: Math Library cpp-perf-bench: Stepanov Vector cpp-perf-bench: Function Objects cpp-perf-bench: Stepanov Abstraction sqlite-speedtest: Timed Time - Size 1,000 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 879.20 240.24 914.92 261.39 2.64 4.69 62.33 11.58 11.74 44.91 46.42 30.31 54.73 111.50 200.12 7.014 60.383 239.32 230.15 10.03 39.78 199.02 27.54 89.13 7.307093 1196 1064 361 432 2135 441 1115 716026.620271 75202.344 5066.0 44.2 13702 9314.1 69819 32774 3958.17 883.30 467.20 4779.43 10128.76 3043.46 4918.506823 95844 73.83 11396837 2055431 530415720.66861 51446.92 46267.92 4849.5 971.31 30333.660297739 1.648 76.317 85.020 7.115 15.138 30.871 18.859 19.798 36.676 25.254 4.673 25.065 1.663293 2.651887 3.265452 2.672805 1.710504 0.656643 0.731618 23.603 6.534 5.445 16.563 49.4827 35.813 40.081 222.245 42.380 11.198 18.864 40.643 893.72 241.87 932.67 260.36 2.53 4.79 62.28 11.59 11.72 44.97 46.38 30.34 54.75 111.54 199.35 6.956 59.944 234.58 223.97 10.75 39.66 200.66 27.81 88.80 7.512189 1139 919 377 444 1907 434 1064 721620.067510 75330.448 5028.1 44.2 14096 8927.1 71917 30461 4407.32 867.28 531.10 5128.66 12537.23 2972.33 5341.802444 95715 75.26 11460428 2010350 535659290.50880 48905.01 46212.92 4857.9 964.72 30133.174304110 1.684 75.697 84.791 7.324 15.189 30.811 19.370 19.598 36.753 25.251 4.677 25.016 1.785458 3.164171 3.890664 3.197418 2.041453 0.786183 0.876382 23.415 6.616 5.673 14.070 49.2891 35.332 40.712 220.550 41.717 11.464 20.635 41.912 903.91 243.36 921.07 187.65 2.62 4.84 60.07 11.69 11.86 45.75 46.74 30.48 54.43 111.77 198.84 6.960 59.840 235.02 224.11 9.44 33.71 202.78 27.63 88.92 6.484557 1094 987 225 382 1721 426 1067 717480.237613 75091.643 5042.5 44.3 12436 8169.2 3660.74 897.30 550.92 4634.54 9747.10 2473.84 4972.634733 95316 63.90 11161599 2035599 536140751.38618 51463.87 45996.80 4847.3 931.03 29933.960506534 1.726 77.074 85.596 7.399 15.177 30.711 19.000 19.817 35.348 31.364 5.068 29.991 1.792863 3.331015 4.077866 3.627370 2.184048 0.815518 0.930682 23.173 6.441 5.787 14.523 49.5027 36.377 41.935 223.874 42.383 11.328 19.848 41.948 2.63 4.90 62.52 11.88 12.03 46.30 47.32 31.44 56.12 116.07 203.89 6.972 60.224 235.83 224.25 27.75 89.08 7.696089 1156 948 370 445 1906 430 1110 736850.433639 75160.065 5043.1 44.2 15584 10125 64468 31709 3855.30 1625.41 460.20 4016.10 8601.61 2552.11 5235.075565 95172 73.73 12020364 2082271 525844746.56770 51640.99 4836.6 971.45 29966.049994027 1.636 73.919 84.119 7.286 63.476 25.480 4.656 25.848 1.632244 2.69281 3.319278 2.625405 1.771787 0.640668 0.708918 24.093 6.606 5.789 13.860 49.4275 35.329 39.867 219.480 41.579 11.473 18.884 41.305 2.66 4.81 63.76 12.12 12.27 47.25 48.54 32.40 58.69 121.74 220.99 6.934 59.761 237.74 228.43 27.32 88.21 7.758952 1201 961 328 449 1901 504 1119 728279.792211 75105.172 5030.2 44.2 12149.0 8755.9 63882 31754 4101.89 1893.64 539.06 5235.17 9843.10 2998.47 4974.883078 95872 75.23 12258866 2187546 483531659.13924 49006.08 4839.5 969.91 29964.692392405 1.660 45.274 76.943 7.341 66.843 27.342 4.602 24.505 1.627688 2.692827 3.316208 2.629167 1.769900 0.640757 0.708037 23.710 6.233 5.282 49.5398 36.784 40.206 217.509 41.827 11.259 18.550 41.127 2.67 4.87 62.08 11.68 11.84 45.68 47.04 30.76 54.91 113.41 201.85 6.957 59.679 236.33 225.36 27.48 87.71 7.345638 1153 888 377 476 1854 445 1111 681281.706242 75257.164 5045.8 44.1 15232 9718.2 72763 31717 2924.23 891.29 540.63 6037.39 5574.17 1577.67 5357.653937 95588 49.32 11650472 2153845 533589946.00456 51831.76 4852.6 970.46 29486.004018584 1.648 79.513 84.929 7.365 50.407 46.419 4.607 27.363 1.594243 2.632108 3.255847 2.573982 1.737217 0.627687 0.69347 23.336 6.186 6.473 15.654 49.5202 35.385 48.606 283.694 42.033 13.449 25.795 45.485 OpenBenchmarking.org
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Chimera 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 200 400 600 800 1000 SE +/- 2.44, N = 3 SE +/- 2.12, N = 3 SE +/- 1.99, N = 3 879.20 893.72 903.91 -march=znver3 - MIN: 689.9 / MAX: 1074.02 -march=znver2 - MIN: 699.85 / MAX: 1086.55 -march=x86-64 - MIN: 707.97 / MAX: 1105.39 1. (CC) gcc options: -O3 -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Summer Nature 4K -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 50 100 150 200 250 SE +/- 2.87, N = 3 SE +/- 0.26, N = 3 SE +/- 0.92, N = 3 240.24 241.87 243.36 -march=znver3 - MIN: 152.32 / MAX: 251.91 -march=znver2 - MIN: 177.93 / MAX: 250.43 -march=x86-64 - MIN: 175.57 / MAX: 254.29 1. (CC) gcc options: -O3 -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Summer Nature 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 200 400 600 800 1000 SE +/- 6.51, N = 3 SE +/- 1.62, N = 3 SE +/- 10.42, N = 3 914.92 932.67 921.07 -march=znver3 - MIN: 671.69 / MAX: 1004.13 -march=znver2 - MIN: 678.72 / MAX: 1016.6 -march=x86-64 - MIN: 615.39 / MAX: 1021.01 1. (CC) gcc options: -O3 -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Chimera 1080p 10-bit -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 60 120 180 240 300 SE +/- 0.57, N = 3 SE +/- 1.53, N = 3 SE +/- 0.69, N = 3 261.39 260.36 187.65 -march=znver3 - MIN: 173.03 / MAX: 490.65 -march=znver2 - MIN: 173.47 / MAX: 476.91 -march=x86-64 - MIN: 122.85 / MAX: 373.39 1. (CC) gcc options: -O3 -pthread
AOM AV1 Encoder Mode: Speed 4 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 4 Realtime -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.6008 1.2016 1.8024 2.4032 3.004 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 7 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 2.64 2.53 2.62 2.63 2.66 2.67 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 5 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 5 Two-Pass -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1.1025 2.205 3.3075 4.41 5.5125 SE +/- 0.07, N = 15 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 12 SE +/- 0.05, N = 3 4.69 4.79 4.84 4.90 4.81 4.87 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 14 28 42 56 70 SE +/- 0.65, N = 3 SE +/- 0.23, N = 3 SE +/- 0.99, N = 15 SE +/- 0.59, N = 9 SE +/- 0.25, N = 3 SE +/- 0.79, N = 3 62.33 62.28 60.07 62.52 63.76 62.08 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 11.58 11.59 11.69 11.88 12.12 11.68 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 11.74 11.72 11.86 12.03 12.27 11.84 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 44.91 44.97 45.75 46.30 47.25 45.68 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 11 22 33 44 55 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 46.42 46.38 46.74 47.32 48.54 47.04 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 30.31 30.34 30.48 31.44 32.40 30.76 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 13 26 39 52 65 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.11, N = 3 SE +/- 0.13, N = 3 SE +/- 0.10, N = 3 SE +/- 0.31, N = 3 54.73 54.75 54.43 56.12 58.69 54.91 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 30 60 90 120 150 SE +/- 0.17, N = 3 SE +/- 0.15, N = 3 SE +/- 0.30, N = 3 SE +/- 0.16, N = 3 SE +/- 0.24, N = 3 SE +/- 0.14, N = 3 111.50 111.54 111.77 116.07 121.74 113.41 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 50 100 150 200 250 SE +/- 0.25, N = 3 SE +/- 0.35, N = 3 SE +/- 0.89, N = 3 SE +/- 0.63, N = 3 SE +/- 0.20, N = 3 SE +/- 0.55, N = 3 200.12 199.35 198.84 203.89 220.99 201.85 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
SVT-AV1 Encoder Mode: Enc Mode 4 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 2 4 6 8 10 SE +/- 0.033, N = 3 SE +/- 0.038, N = 3 SE +/- 0.018, N = 3 SE +/- 0.010, N = 3 SE +/- 0.057, N = 3 SE +/- 0.055, N = 3 7.014 6.956 6.960 6.972 6.934 6.957 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-AV1 Encoder Mode: Enc Mode 8 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 14 28 42 56 70 SE +/- 0.64, N = 3 SE +/- 0.11, N = 3 SE +/- 0.14, N = 3 SE +/- 0.22, N = 3 SE +/- 0.04, N = 3 SE +/- 0.46, N = 3 60.38 59.94 59.84 60.22 59.76 59.68 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 50 100 150 200 250 SE +/- 3.14, N = 4 SE +/- 1.99, N = 12 SE +/- 2.28, N = 12 SE +/- 2.07, N = 12 SE +/- 2.03, N = 13 SE +/- 2.16, N = 12 239.32 234.58 235.02 235.83 237.74 236.33 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 50 100 150 200 250 SE +/- 0.72, N = 3 SE +/- 0.63, N = 3 SE +/- 0.95, N = 3 SE +/- 0.12, N = 3 SE +/- 0.74, N = 3 SE +/- 1.47, N = 3 230.15 223.97 224.11 224.25 228.43 225.36 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding Speed: Speed 0 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 3 6 9 12 15 SE +/- 0.29, N = 14 SE +/- 0.09, N = 3 SE +/- 0.41, N = 15 10.03 10.75 9.44 -march=znver3 -march=znver2 -march=x86-64 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
VP9 libvpx Encoding Speed: Speed 5 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.29, N = 3 SE +/- 1.60, N = 14 39.78 39.66 33.71 -march=znver3 -march=znver2 -march=x86-64 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 40 80 120 160 200 SE +/- 2.81, N = 4 SE +/- 1.66, N = 15 SE +/- 1.58, N = 15 199.02 200.66 202.78 -march=znver3 -march=znver2 -march=x86-64 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 7 14 21 28 35 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 27.54 27.81 27.63 27.75 27.32 27.48 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.30, N = 3 SE +/- 0.36, N = 3 SE +/- 0.18, N = 3 SE +/- 0.25, N = 3 SE +/- 0.33, N = 3 89.13 88.80 88.92 89.08 88.21 87.71 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 2 4 6 8 10 SE +/- 0.086169, N = 3 SE +/- 0.069362, N = 3 SE +/- 0.050043, N = 15 SE +/- 0.089342, N = 6 SE +/- 0.087025, N = 3 SE +/- 0.071987, N = 3 7.307093 7.512189 6.484557 7.696089 7.758952 7.345638 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -O3 -march=native -fopenmp
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 300 600 900 1200 1500 SE +/- 4.36, N = 3 SE +/- 4.18, N = 3 SE +/- 2.85, N = 3 SE +/- 2.96, N = 3 SE +/- 2.40, N = 3 SE +/- 2.60, N = 3 1196 1139 1094 1156 1201 1153 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 200 400 600 800 1000 SE +/- 5.70, N = 3 SE +/- 27.55, N = 15 SE +/- 8.62, N = 3 SE +/- 2.65, N = 3 SE +/- 7.88, N = 3 SE +/- 37.25, N = 15 1064 919 987 948 961 888 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 80 160 240 320 400 SE +/- 2.33, N = 3 SE +/- 1.20, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 SE +/- 1.20, N = 3 361 377 225 370 328 377 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 100 200 300 400 500 SE +/- 5.29, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 432 444 382 445 449 476 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 500 1000 1500 2000 2500 SE +/- 8.37, N = 3 SE +/- 1.20, N = 3 SE +/- 14.34, N = 3 SE +/- 1.76, N = 3 SE +/- 1.15, N = 3 2135 1907 1721 1906 1901 1854 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 110 220 330 440 550 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 4.91, N = 12 SE +/- 1.53, N = 3 SE +/- 1.53, N = 3 SE +/- 1.45, N = 3 441 434 426 430 504 445 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 200 400 600 800 1000 SE +/- 4.41, N = 3 SE +/- 1.53, N = 3 SE +/- 3.00, N = 3 SE +/- 1.45, N = 3 SE +/- 8.84, N = 3 SE +/- 2.65, N = 3 1115 1064 1067 1110 1119 1111 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 160K 320K 480K 640K 800K SE +/- 449.85, N = 3 SE +/- 1896.73, N = 3 SE +/- 371.91, N = 3 SE +/- 1257.49, N = 3 SE +/- 1316.80, N = 3 SE +/- 5589.70, N = 3 716026.62 721620.07 717480.24 736850.43 728279.79 681281.71 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto 1. (CC) gcc options: -O2 -lrt" -lrt
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 16K 32K 48K 64K 80K SE +/- 57.69, N = 3 SE +/- 19.16, N = 3 SE +/- 83.30, N = 3 SE +/- 20.34, N = 3 SE +/- 44.68, N = 3 SE +/- 96.05, N = 3 75202.34 75330.45 75091.64 75160.07 75105.17 75257.16 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Zstd Compression Compression Level: 3 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 3 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1100 2200 3300 4400 5500 SE +/- 12.00, N = 3 SE +/- 7.47, N = 3 SE +/- 40.40, N = 3 SE +/- 11.65, N = 3 SE +/- 7.82, N = 3 SE +/- 31.10, N = 3 5066.0 5028.1 5042.5 5043.1 5030.2 5045.8 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lz -llzma
Zstd Compression Compression Level: 19 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 19 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 10 20 30 40 50 SE +/- 0.10, N = 3 SE +/- 0.00, N = 3 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 44.2 44.2 44.3 44.2 44.2 44.1 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lz -llzma
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 3K 6K 9K 12K 15K SE +/- 54.77, N = 3 SE +/- 75.72, N = 3 SE +/- 80.35, N = 3 SE +/- 105.64, N = 3 SE +/- 564.61, N = 15 SE +/- 16.50, N = 3 13702.0 14096.0 12436.0 15584.0 12149.0 15232.0 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lm
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 2K 4K 6K 8K 10K SE +/- 18.57, N = 3 SE +/- 15.97, N = 3 SE +/- 312.57, N = 12 SE +/- 25.21, N = 3 SE +/- 86.92, N = 3 SE +/- 96.35, N = 3 9314.1 8927.1 8169.2 10125.0 8755.9 9718.2 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 -O3 -march=znver3 -O3 -march=znver2 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 16K 32K 48K 64K 80K SE +/- 842.00, N = 3 SE +/- 675.82, N = 3 SE +/- 3728.41, N = 12 SE +/- 3515.17, N = 15 SE +/- 1067.12, N = 3 69819 71917 64468 63882 72763 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 -O3 -march=znver3 -O3 -march=znver2 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 7K 14K 21K 28K 35K SE +/- 428.13, N = 3 SE +/- 945.33, N = 9 SE +/- 261.90, N = 3 SE +/- 225.04, N = 3 SE +/- 342.00, N = 7 32774 30461 31709 31754 31717 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lm
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 900 1800 2700 3600 4500 SE +/- 35.50, N = 15 SE +/- 11.01, N = 3 SE +/- 12.53, N = 3 SE +/- 130.68, N = 12 SE +/- 33.94, N = 3 SE +/- 25.20, N = 3 3958.17 4407.32 3660.74 3855.30 4101.89 2924.23 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 400 800 1200 1600 2000 SE +/- 7.40, N = 3 SE +/- 2.47, N = 3 SE +/- 1.74, N = 3 SE +/- 249.49, N = 3 SE +/- 19.53, N = 3 SE +/- 7.74, N = 3 883.30 867.28 897.30 1625.41 1893.64 891.29 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 120 240 360 480 600 SE +/- 4.02, N = 3 SE +/- 2.93, N = 3 SE +/- 1.10, N = 3 SE +/- 71.30, N = 3 SE +/- 4.57, N = 3 SE +/- 5.61, N = 3 467.20 531.10 550.92 460.20 539.06 540.63 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1300 2600 3900 5200 6500 SE +/- 18.73, N = 3 SE +/- 19.91, N = 3 SE +/- 14.37, N = 3 SE +/- 594.21, N = 3 SE +/- 33.83, N = 3 SE +/- 52.45, N = 3 4779.43 5128.66 4634.54 4016.10 5235.17 6037.39 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 3K 6K 9K 12K 15K SE +/- 581.61, N = 3 SE +/- 14.88, N = 3 SE +/- 36.04, N = 3 SE +/- 1258.08, N = 3 SE +/- 85.21, N = 3 SE +/- 41.97, N = 3 10128.76 12537.23 9747.10 8601.61 9843.10 5574.17 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 700 1400 2100 2800 3500 SE +/- 34.33, N = 3 SE +/- 14.93, N = 3 SE +/- 10.49, N = 3 SE +/- 398.20, N = 3 SE +/- 27.35, N = 3 SE +/- 18.54, N = 3 3043.46 2972.33 2473.84 2552.11 2998.47 1577.67 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1100 2200 3300 4400 5500 SE +/- 247.48, N = 12 SE +/- 26.98, N = 3 SE +/- 199.91, N = 15 SE +/- 179.16, N = 12 SE +/- 307.73, N = 12 SE +/- 203.31, N = 12 4918.51 5341.80 4972.63 5235.08 4974.88 5357.65 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -O3 -mavx2
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 20K 40K 60K 80K 100K SE +/- 286.46, N = 3 SE +/- 115.21, N = 3 SE +/- 378.23, N = 3 SE +/- 384.90, N = 3 SE +/- 132.15, N = 3 SE +/- 249.42, N = 3 95844 95715 95316 95172 95872 95588 1. (CXX) g++ options: -pipe -lpthread
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 20 40 60 80 100 SE +/- 1.48, N = 15 SE +/- 0.59, N = 3 SE +/- 0.24, N = 3 SE +/- 0.89, N = 3 SE +/- 0.56, N = 3 SE +/- 0.36, N = 3 73.83 75.26 63.90 73.73 75.23 49.32 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -fopenmp -ljpeg -lz -lm
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 3M 6M 9M 12M 15M SE +/- 376923.05, N = 12 SE +/- 425288.16, N = 15 SE +/- 505442.39, N = 15 SE +/- 25650.81, N = 3 SE +/- 110552.33, N = 3 SE +/- 388230.44, N = 12 11396837 11460428 11161599 12020364 12258866 11650472 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 500K 1000K 1500K 2000K 2500K SE +/- 12293.02, N = 5 SE +/- 47716.25, N = 25 SE +/- 10113.82, N = 5 SE +/- 49215.47, N = 25 SE +/- 9982.18, N = 5 SE +/- 19185.63, N = 5 2055431 2010350 2035599 2082271 2187546 2153845 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -O3 -march=native
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 110M 220M 330M 440M 550M SE +/- 5853530.73, N = 3 SE +/- 5929051.95, N = 3 SE +/- 4969986.40, N = 3 SE +/- 794515.24, N = 3 SE +/- 31530973.95, N = 9 SE +/- 4842392.51, N = 3 530415720.67 535659290.51 536140751.39 525844746.57 483531659.14 533589946.00 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -O3 -march=native -lm
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.9.9 Static Web Page Serving -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 11K 22K 33K 44K 55K SE +/- 101.68, N = 3 SE +/- 1677.08, N = 12 SE +/- 186.11, N = 3 SE +/- 95.73, N = 3 SE +/- 1017.37, N = 15 SE +/- 77.42, N = 3 51446.92 48905.01 51463.87 51640.99 49006.08 51831.76 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 10K 20K 30K 40K 50K SE +/- 521.28, N = 3 SE +/- 362.23, N = 3 SE +/- 201.19, N = 3 46267.92 46212.92 45996.80 -march=znver3 -march=znver2 -march=x86-64 1. (CC) gcc options: -shared -fPIC -pthread -O3
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1000 2000 3000 4000 5000 SE +/- 3.15, N = 3 SE +/- 7.55, N = 3 SE +/- 7.21, N = 3 SE +/- 8.42, N = 3 SE +/- 2.72, N = 3 SE +/- 3.01, N = 3 4849.5 4857.9 4847.3 4836.6 4839.5 4852.6 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -m64 -lssl -lcrypto -ldl
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 200 400 600 800 1000 SE +/- 2.43, N = 3 SE +/- 0.72, N = 3 SE +/- 3.82, N = 3 SE +/- 2.69, N = 3 SE +/- 3.18, N = 3 SE +/- 2.71, N = 3 971.31 964.72 931.03 971.45 969.91 970.46 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 6K 12K 18K 24K 30K SE +/- 132.80, N = 3 SE +/- 334.66, N = 15 SE +/- 382.84, N = 3 SE +/- 432.33, N = 14 SE +/- 241.17, N = 15 SE +/- 474.52, N = 3 30333.66 30133.17 29933.96 29966.05 29964.69 29486.00 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.3884 0.7768 1.1652 1.5536 1.942 SE +/- 0.017, N = 3 SE +/- 0.002, N = 3 SE +/- 0.006, N = 3 SE +/- 0.013, N = 3 SE +/- 0.004, N = 3 SE +/- 0.014, N = 3 1.648 1.684 1.726 1.636 1.660 1.648 -O3 -march=znver3 -ltiff -O3 -march=znver2 -ltiff -O3 -march=x86-64 -ltiff -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg -lpng16
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 20 40 60 80 100 SE +/- 0.72, N = 9 SE +/- 0.64, N = 15 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.29, N = 3 SE +/- 0.19, N = 3 76.32 75.70 77.07 73.92 45.27 79.51 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.11, N = 3 SE +/- 0.15, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 85.02 84.79 85.60 84.12 76.94 84.93 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 2 4 6 8 10 SE +/- 0.068, N = 3 SE +/- 0.117, N = 3 SE +/- 0.101, N = 3 SE +/- 0.086, N = 3 SE +/- 0.096, N = 4 SE +/- 0.062, N = 15 7.115 7.324 7.399 7.286 7.341 7.365 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 4 8 12 16 20 SE +/- 0.20, N = 3 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 15.14 15.19 15.18
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 7 14 21 28 35 SE +/- 0.13, N = 3 SE +/- 0.28, N = 13 SE +/- 0.10, N = 3 30.87 30.81 30.71
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.18, N = 15 SE +/- 0.03, N = 3 SE +/- 0.32, N = 3 SE +/- 0.30, N = 3 SE +/- 0.34, N = 3 18.86 19.37 19.00 63.48 66.84 50.41
Timed MPlayer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed MPlayer Compilation 1.4 Time To Compile -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 5 10 15 20 25 SE +/- 0.22, N = 6 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 19.80 19.60 19.82
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.4.2 Time To Compile -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 8 16 24 32 40 SE +/- 0.50, N = 4 SE +/- 0.49, N = 5 SE +/- 0.07, N = 3 36.68 36.75 35.35
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 25.25 25.25 31.36 25.48 27.34 46.42 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm -lpthread -O3
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1.1403 2.2806 3.4209 4.5612 5.7015 SE +/- 0.025, N = 3 SE +/- 0.016, N = 3 SE +/- 0.018, N = 3 SE +/- 0.012, N = 3 SE +/- 0.005, N = 3 SE +/- 0.009, N = 3 4.673 4.677 5.068 4.656 4.602 4.607 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -fopenmp -O3
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 7 14 21 28 35 SE +/- 0.12, N = 3 SE +/- 0.04, N = 3 SE +/- 1.25, N = 15 SE +/- 0.08, N = 3 SE +/- 0.13, N = 3 SE +/- 0.23, N = 3 25.07 25.02 29.99 25.85 24.51 27.36 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -lm -O3
Bullet Physics Engine Test: Raytests OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Raytests -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.4034 0.8068 1.2102 1.6136 2.017 SE +/- 0.002327, N = 3 SE +/- 0.101438, N = 15 SE +/- 0.075833, N = 15 SE +/- 0.002589, N = 3 SE +/- 0.004567, N = 3 SE +/- 0.014472, N = 3 1.663293 1.785458 1.792863 1.632244 1.627688 1.594243 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
Bullet Physics Engine Test: 3000 Fall OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.7495 1.499 2.2485 2.998 3.7475 SE +/- 0.004533, N = 3 SE +/- 0.599476, N = 3 SE +/- 0.590113, N = 3 SE +/- 0.005845, N = 3 SE +/- 0.009048, N = 3 SE +/- 0.027956, N = 3 2.651887 3.164171 3.331015 2.692810 2.692827 2.632108 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
Bullet Physics Engine Test: 1000 Stack OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.9175 1.835 2.7525 3.67 4.5875 SE +/- 0.005866, N = 3 SE +/- 0.728617, N = 3 SE +/- 0.721313, N = 3 SE +/- 0.006493, N = 3 SE +/- 0.007476, N = 3 SE +/- 0.032238, N = 3 3.265452 3.890664 4.077866 3.319278 3.316208 3.255847 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
Bullet Physics Engine Test: 1000 Convex OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.8162 1.6324 2.4486 3.2648 4.081 SE +/- 0.004286, N = 3 SE +/- 0.604893, N = 3 SE +/- 0.648498, N = 3 SE +/- 0.006441, N = 3 SE +/- 0.006861, N = 3 SE +/- 0.026939, N = 3 2.672805 3.197418 3.627370 2.625405 2.629167 2.573982 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
Bullet Physics Engine Test: 136 Ragdolls OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.4914 0.9828 1.4742 1.9656 2.457 SE +/- 0.003623, N = 3 SE +/- 0.384993, N = 3 SE +/- 0.385000, N = 3 SE +/- 0.002552, N = 3 SE +/- 0.004039, N = 3 SE +/- 0.024499, N = 3 1.710504 2.041453 2.184048 1.771787 1.769900 1.737217 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
Bullet Physics Engine Test: Prim Trimesh OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.1835 0.367 0.5505 0.734 0.9175 SE +/- 0.001134, N = 3 SE +/- 0.149603, N = 3 SE +/- 0.146828, N = 3 SE +/- 0.001284, N = 3 SE +/- 0.002855, N = 3 SE +/- 0.006380, N = 3 0.656643 0.786183 0.815518 0.640668 0.640757 0.627687 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
Bullet Physics Engine Test: Convex Trimesh OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Convex Trimesh -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 0.2094 0.4188 0.6282 0.8376 1.047 SE +/- 0.000878, N = 3 SE +/- 0.167457, N = 3 SE +/- 0.168559, N = 3 SE +/- 0.001768, N = 3 SE +/- 0.002714, N = 3 SE +/- 0.007502, N = 3 0.731618 0.876382 0.930682 0.708918 0.708037 0.693470 -march=znver3 -lglut -lGL -lGLU -march=znver2 -lglut -lGL -lGLU -march=x86-64 -lglut -lGL -lGLU -march=znver3 -flto -Ofast -march=znver3 -flto -lglut -lGL -lGLU -O2 -march=znver3 -flto -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -rdynamic
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.28, N = 3 SE +/- 0.17, N = 3 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.24, N = 3 23.60 23.42 23.17 24.09 23.71 23.34 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pthread -fvisibility=hidden
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 2 4 6 8 10 SE +/- 0.131, N = 25 SE +/- 0.175, N = 25 SE +/- 0.166, N = 25 SE +/- 0.232, N = 25 SE +/- 0.047, N = 5 SE +/- 0.043, N = 5 6.534 6.616 6.441 6.606 6.233 6.186 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 2 4 6 8 10 SE +/- 0.016, N = 3 SE +/- 0.239, N = 15 SE +/- 0.023, N = 3 SE +/- 0.319, N = 15 SE +/- 0.022, N = 3 SE +/- 0.033, N = 3 5.445 5.673 5.787 5.789 5.282 6.473 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -O2 -march=znver3 -flto 4 8 12 16 20 SE +/- 1.10, N = 15 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.16, N = 3 SE +/- 0.65, N = 15 16.56 14.07 14.52 13.86 15.65 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -pedantic -fvisibility=hidden
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 11 22 33 44 55 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.16, N = 3 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 49.48 49.29 49.50 49.43 49.54 49.52 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
CppPerformanceBenchmarks Test: Atol OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Atol -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 8 16 24 32 40 SE +/- 0.38, N = 3 SE +/- 0.49, N = 3 SE +/- 0.37, N = 3 SE +/- 0.35, N = 3 SE +/- 1.55, N = 15 SE +/- 0.39, N = 3 35.81 35.33 36.38 35.33 36.78 35.39 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Ctype OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 11 22 33 44 55 SE +/- 0.18, N = 3 SE +/- 1.66, N = 15 SE +/- 2.39, N = 15 SE +/- 0.32, N = 15 SE +/- 0.37, N = 3 SE +/- 3.45, N = 15 40.08 40.71 41.94 39.87 40.21 48.61 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Math Library OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 60 120 180 240 300 SE +/- 0.77, N = 3 SE +/- 1.88, N = 3 SE +/- 2.74, N = 3 SE +/- 1.97, N = 3 SE +/- 2.13, N = 3 SE +/- 26.60, N = 9 222.25 220.55 223.87 219.48 217.51 283.69 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Stepanov Vector OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 10 20 30 40 50 SE +/- 0.15, N = 3 SE +/- 0.46, N = 3 SE +/- 0.45, N = 3 SE +/- 0.25, N = 3 SE +/- 0.45, N = 3 SE +/- 0.40, N = 3 42.38 41.72 42.38 41.58 41.83 42.03 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Function Objects OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 3 6 9 12 15 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 SE +/- 0.15, N = 3 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 SE +/- 0.93, N = 15 11.20 11.46 11.33 11.47 11.26 13.45 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Stepanov Abstraction OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 6 12 18 24 30 SE +/- 0.20, N = 3 SE +/- 1.21, N = 15 SE +/- 0.88, N = 15 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 1.80, N = 15 18.86 20.64 19.85 18.88 18.55 25.80 -march=znver3 -march=znver2 -march=x86-64 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 10 20 30 40 50 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 SE +/- 0.35, N = 3 SE +/- 0.51, N = 3 SE +/- 2.71, N = 15 40.64 41.91 41.95 41.31 41.13 45.49 -O3 -march=znver3 -O3 -march=znver2 -O3 -march=x86-64 -O3 -march=znver3 -flto -Ofast -march=znver3 -flto -O2 -march=znver3 -flto 1. (CC) gcc options: -ldl -lz -lpthread
Phoronix Test Suite v10.8.4