Benchmarks for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
    phoronix-test-suite benchmark 2012160-PTS-GCCZNVER86

GCC AMD Znver3 Compiler Optimization Levels
HTML result view exported from: https://openbenchmarking.org/result/2012160-PTS-GCCZNVER86&export=pdf&grr&sro
System under test (identical for all configurations)

  Processor:         AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
  Motherboard:       ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS)
  Chipset:           AMD Starship/Matisse
  Memory:            16GB
  Disk:              2000GB Corsair Force MP600
  Graphics:          AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)
  Audio:             AMD Navi 10 HDMI Audio
  Monitor:           ASUS MG28U
  Network:           Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
  OS:                Ubuntu 20.04
  Kernel:            5.10.0-051000rc6daily20201205-generic (x86_64) 20201204
  Desktop:           GNOME Shell 3.36.4
  Display Server:    X Server 1.20.8
  Display Driver:    modesetting 1.20.8
  OpenGL:            4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0)
  Vulkan:            1.2.145
  Compiler:          GCC 11.0.0 20201213
  File-System:       ext4
  Screen Resolution: 3840x2160

Environment Details
  - -O2 -march=x86-64: CXXFLAGS="-O2 -march=x86-64" CFLAGS="-O2 -march=x86-64"
  - -O3 -march=x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"
  - -O3 -march=znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"
  - -O2 -march=znver3: CXXFLAGS="-O2 -march=znver3" CFLAGS="-O2 -march=znver3"
  - -O3 -march=znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"
  - -O2 -march=znver3 -flto: CXXFLAGS="-O2 -march=znver3 -flto" CFLAGS="-O2 -march=znver3 -flto"
  - -O3 -march=znver3 -flto: CXXFLAGS="-O3 -march=znver3 -flto" CFLAGS="-O3 -march=znver3 -flto"
  - -Ofast -march=znver3 -flto: CXXFLAGS="-Ofast -march=znver3 -flto" CFLAGS="-Ofast -march=znver3 -flto"

Compiler Details
  - --disable-multilib --enable-checking=release

Processor Details
  - Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
  - CPU Microcode: 0xa201009

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
  - srbds: Not affected
  - tsx_async_abort: Not affected

Disk Details
  - -O3 -march=znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096

Python Details
  - -O3 -march=znver3: Python 2.7.18 + Python 3.8.5
Results overview

Columns, left to right:
  (1) -O2 -march=x86-64
  (2) -O3 -march=x86-64
  (3) -O3 -march=znver2
  (4) -O2 -march=znver3
  (5) -O3 -march=znver3
  (6) -O2 -march=znver3 -flto
  (7) -O3 -march=znver3 -flto
  (8) -Ofast -march=znver3 -flto

Test | (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8)
fftw: Stock - 2D FFT Size 4096 | 8352.8 | 8169.2 | 8927.1 | 8823.7 | 9314.1 | 9718.2 | 10125 | 8755.9
mrbayes: Primate Phylogeny Analysis | 79.341 | 77.074 | 75.697 | 79.543 | 76.317 | 79.513 | 73.919 | 45.274
hmmer: Pfam Database Search | 85.788 | 85.596 | 84.791 | 85.126 | 85.020 | 84.929 | 84.119 | 76.943
graphics-magick: Noise-Gaussian | 429 | 426 | 434 | 452 | 441 | 445 | 430 | 504
aobench: 2048 x 2048 - Total Time | 35.035 | 29.991 | 25.016 | 30.711 | 25.065 | 27.363 | 25.848 | 24.505
graphics-magick: Enhanced | 419 | 382 | 444 | 430 | 432 | 476 | 445 | 449
graphics-magick: Sharpen | 224 | 225 | 377 | 359 | 361 | 377 | 370 | 328
graphics-magick: Resizing | 1605 | 1721 | 1907 | 2150 | 2135 | 1854 | 1906 | 1901
scimark2: Composite | 2883.78 | 3660.74 | 4407.32 | 2625.59 | 3958.17 | 2924.23 | 3855.30 | 4101.89
mt-dgemm: Sustained Floating-Point Rate | 7.526567 | 6.484557 | 7.512189 | 7.391742 | 7.307093 | 7.345638 | 7.696089 | 7.758952
build-imagemagick: Time To Compile | 16.398 | 19.000 | 19.370 | 15.804 | 18.859 | 50.407 | 63.476 | 66.843
kvazaar: Bosphorus 4K - Slow | 11.48 | 11.69 | 11.59 | 11.52 | 11.58 | 11.68 | 11.88 | 12.12
kvazaar: Bosphorus 4K - Medium | 11.68 | 11.86 | 11.72 | 11.73 | 11.74 | 11.84 | 12.03 | 12.27
c-ray: Total Time - 4K, 16 Rays Per Pixel | 60.896 | 31.364 | 25.251 | 46.720 | 25.254 | 46.419 | 25.480 | 27.342
coremark: CoreMark Size 666 - Iterations Per Second | 700520.673201 | 717480.237613 | 721620.067510 | 669924.921861 | 716026.620271 | 681281.706242 | 736850.433639 | 728279.792211
libraw: Post-Processing Benchmark | 51.45 | 63.90 | 75.26 | 50.95 | 73.83 | 49.32 | 73.73 | 75.23
daphne: OpenMP - NDT Mapping | 921.71 | 931.03 | 964.72 | 966.99 | 971.31 | 970.46 | 971.45 | 969.91
kvazaar: Bosphorus 4K - Very Fast | 29.55 | 30.48 | 30.34 | 29.87 | 30.31 | 30.76 | 31.44 | 32.40
aom-av1: Speed 8 Realtime | 60.88 | 60.07 | 62.28 | 62.10 | 62.33 | 62.08 | 62.52 | 63.76
kvazaar: Bosphorus 1080p - Slow | 45.09 | 45.75 | 44.97 | 44.89 | 44.91 | 45.68 | 46.30 | 47.25
kvazaar: Bosphorus 1080p - Medium | 46.32 | 46.74 | 46.38 | 46.31 | 46.42 | 47.04 | 47.32 | 48.54
kvazaar: Bosphorus 4K - Ultra Fast | 53.13 | 54.43 | 54.75 | 53.83 | 54.73 | 54.91 | 56.12 | 58.69
fftw: Stock - 1D FFT Size 4096 | 12892 | 12436 | 14096 | 13771 | 13702 | 15232 | 15584 | 12149.0
kvazaar: Bosphorus 1080p - Very Fast | 108.72 | 111.77 | 111.54 | 109.42 | 111.50 | 113.41 | 116.07 | 121.74
smallpt: Global Illumination Renderer; 128 Samples | 4.829 | 5.068 | 4.677 | 4.567 | 4.673 | 4.607 | 4.656 | 4.602
kvazaar: Bosphorus 1080p - Ultra Fast | 194.52 | 198.84 | 199.35 | 195.13 | 200.12 | 201.85 | 203.89 | 220.99
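The overview mixes "more is better" and "fewer is better" units, so raw numbers cannot be compared across tests directly. The following is a minimal sketch, not part of the Phoronix Test Suite, of one way to normalize a few rows against the -O2 -march=x86-64 baseline and summarize them with a geometric mean. The selection of four tests, the helper names (`speedup`, `geometric_mean`, `summarize`) and the choice of baseline are assumptions made for illustration; the numbers are copied from the overview above.

```python
from math import prod

BASELINE = "-O2 -march=x86-64"

# Hypothetical subset of the overview: (direction, {configuration: result}).
# "lower" means fewer is better (seconds); "higher" means more is better.
RESULTS = {
    "mrbayes (seconds)": ("lower", {
        "-O2 -march=x86-64": 79.341,
        "-O3 -march=znver3": 76.317,
        "-Ofast -march=znver3 -flto": 45.274,
    }),
    "c-ray (seconds)": ("lower", {
        "-O2 -march=x86-64": 60.896,
        "-O3 -march=znver3": 25.254,
        "-Ofast -march=znver3 -flto": 27.342,
    }),
    "libraw (Mpix/sec)": ("higher", {
        "-O2 -march=x86-64": 51.45,
        "-O3 -march=znver3": 73.83,
        "-Ofast -march=znver3 -flto": 75.23,
    }),
    "coremark (iterations/sec)": ("higher", {
        "-O2 -march=x86-64": 700520.67,
        "-O3 -march=znver3": 716026.62,
        "-Ofast -march=znver3 -flto": 728279.79,
    }),
}


def speedup(direction, baseline, value):
    """Return the speedup over the baseline; values above 1.0 are better."""
    if direction == "lower":
        return baseline / value
    return value / baseline


def geometric_mean(values):
    """Geometric mean of a non-empty sequence of positive numbers."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


def summarize(results, baseline_name=BASELINE):
    """Map each configuration to the geometric mean of its speedups."""
    per_config = {}
    for direction, by_config in results.values():
        base = by_config[baseline_name]
        for name, value in by_config.items():
            per_config.setdefault(name, []).append(
                speedup(direction, base, value)
            )
    return {name: geometric_mean(vals) for name, vals in per_config.items()}


if __name__ == "__main__":
    for name, gm in summarize(RESULTS).items():
        print(f"{name}: {gm:.3f}x vs {BASELINE}")
```

A geometric mean is used here because it treats a 2x gain in one test and a 2x loss in another as cancelling out; with only four tests the summary is illustrative rather than representative of the full result file.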
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096
Mflops, More Is Better
  -O2 -march=x86-64: 8352.8 (SE +/- 88.61, N = 3)
  -O2 -march=znver3: 8823.7 (SE +/- 39.49, N = 3)
  -O2 -march=znver3 -flto: 9718.2 (SE +/- 96.35, N = 3)
  -O3 -march=znver3 -flto: 10125.0 (SE +/- 25.21, N = 3)
  -O3 -march=x86-64: 8169.2 (SE +/- 312.57, N = 12)
  -O3 -march=znver2: 8927.1 (SE +/- 15.97, N = 3)
  -O3 -march=znver3: 9314.1 (SE +/- 18.57, N = 3)
  -Ofast -march=znver3 -flto: 8755.9 (SE +/- 86.92, N = 3)
1. (CC) gcc options: -pthread -lm
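Each result is reported with a standard error (SE) and a run count (N), which helps judge whether two configurations really differ. The sketch below is a rough illustration under stated assumptions: it uses the FFTW 2D figures above, assumes the SE entries are listed in the same order as the results, and treats two results as distinguishable only when their gap is large compared with the combined standard error. The function names are hypothetical and this is not a formal significance test.

```python
from math import sqrt


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Gap between two means divided by their combined standard error."""
    combined = sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined


if __name__ == "__main__":
    # FFTW 2D FFT Size 4096, Mflops (mean, SE) taken from the results above.
    o2_generic = (8352.8, 88.61)    # -O2 -march=x86-64
    o3_generic = (8169.2, 312.57)   # -O3 -march=x86-64, N = 12
    o3_zen3_lto = (10125.0, 25.21)  # -O3 -march=znver3 -flto

    print(f"-O3 x86-64 relative SE: {relative_se_percent(*o3_generic):.2f}%")
    print("-O2 vs -O3 (x86-64):",
          round(difference_in_se_units(*o2_generic, *o3_generic), 2))
    print("-O3 znver3 -flto vs -O2 x86-64:",
          round(difference_in_se_units(*o3_zen3_lto, *o2_generic), 2))
```

Under these assumptions the gap between the two generic x86-64 builds is smaller than the combined standard error, while the tuned LTO build stands well clear of the baseline.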
Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis
Seconds, Fewer Is Better
  -O2 -march=x86-64: 79.34 (SE +/- 0.78, N = 9)
  -O2 -march=znver3: 79.54 (SE +/- 0.10, N = 3)
  -O2 -march=znver3 -flto: 79.51 (SE +/- 0.19, N = 3)
  -O3 -march=znver3 -flto: 73.92 (SE +/- 0.06, N = 3)
  -O3 -march=x86-64: 77.07 (SE +/- 0.07, N = 3)
  -O3 -march=znver2: 75.70 (SE +/- 0.64, N = 15)
  -O3 -march=znver3: 76.32 (SE +/- 0.72, N = 9)
  -Ofast -march=znver3 -flto: 45.27 (SE +/- 0.29, N = 3)
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
Timed HMMer Search 3.3.1 - Pfam Database Search
Seconds, Fewer Is Better
  -O2 -march=x86-64: 85.79 (SE +/- 0.15, N = 3)
  -O2 -march=znver3: 85.13 (SE +/- 0.13, N = 3)
  -O2 -march=znver3 -flto: 84.93 (SE +/- 0.09, N = 3)
  -O3 -march=znver3 -flto: 84.12 (SE +/- 0.09, N = 3)
  -O3 -march=x86-64: 85.60 (SE +/- 0.15, N = 3)
  -O3 -march=znver2: 84.79 (SE +/- 0.11, N = 3)
  -O3 -march=znver3: 85.02 (SE +/- 0.18, N = 3)
  -Ofast -march=znver3 -flto: 76.94 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -pthread -lhmmer -leasel -lm
GraphicsMagick 1.3.33 - Operation: Noise-Gaussian
Iterations Per Minute, More Is Better
  -O2 -march=x86-64: 429
  -O2 -march=znver3: 452
  -O2 -march=znver3 -flto: 445
  -O3 -march=znver3 -flto: 430
  -O3 -march=x86-64: 426
  -O3 -march=znver2: 434
  -O3 -march=znver3: 441
  -Ofast -march=znver3 -flto: 504
SE +/- (7 of 8 reported, in export order): 3.51 (N = 3), 1.45 (N = 3), 1.53 (N = 3), 4.91 (N = 12), 0.33 (N = 3), 0.58 (N = 3), 1.53 (N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
AOBench - Size: 2048 x 2048 - Total Time
Seconds, Fewer Is Better
  -O2 -march=x86-64: 35.04 (SE +/- 2.22, N = 15)
  -O2 -march=znver3: 30.71 (SE +/- 1.92, N = 15)
  -O2 -march=znver3 -flto: 27.36 (SE +/- 0.23, N = 3)
  -O3 -march=znver3 -flto: 25.85 (SE +/- 0.08, N = 3)
  -O3 -march=x86-64: 29.99 (SE +/- 1.25, N = 15)
  -O3 -march=znver2: 25.02 (SE +/- 0.04, N = 3)
  -O3 -march=znver3: 25.07 (SE +/- 0.12, N = 3)
  -Ofast -march=znver3 -flto: 24.51 (SE +/- 0.13, N = 3)
1. (CC) gcc options: -lm -O3
GraphicsMagick 1.3.33 - Operation: Enhanced
Iterations Per Minute, More Is Better
  -O2 -march=x86-64: 419
  -O2 -march=znver3: 430
  -O2 -march=znver3 -flto: 476
  -O3 -march=znver3 -flto: 445
  -O3 -march=x86-64: 382
  -O3 -march=znver2: 444
  -O3 -march=znver3: 432
  -Ofast -march=znver3 -flto: 449
SE +/- (6 of 8 reported, in export order, N = 3 each): 1.53, 0.33, 0.67, 0.33, 5.29, 0.88
1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Sharpen
Iterations Per Minute, More Is Better
  -O2 -march=x86-64: 224 (SE +/- 0.58, N = 3)
  -O2 -march=znver3: 359 (SE +/- 4.33, N = 3)
  -O2 -march=znver3 -flto: 377 (SE +/- 1.20, N = 3)
  -O3 -march=znver3 -flto: 370 (SE +/- 0.67, N = 3)
  -O3 -march=x86-64: 225 (SE +/- 0.67, N = 3)
  -O3 -march=znver2: 377 (SE +/- 1.20, N = 3)
  -O3 -march=znver3: 361 (SE +/- 2.33, N = 3)
  -Ofast -march=znver3 -flto: 328 (SE +/- 0.88, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Resizing
Iterations Per Minute, More Is Better
  -O2 -march=x86-64: 1605
  -O2 -march=znver3: 2150
  -O2 -march=znver3 -flto: 1854
  -O3 -march=znver3 -flto: 1906
  -O3 -march=x86-64: 1721
  -O3 -march=znver2: 1907
  -O3 -march=znver3: 2135
  -Ofast -march=znver3 -flto: 1901
SE +/- (7 of 8 reported, in export order, N = 3 each): 1.53, 4.91, 1.76, 14.34, 1.20, 8.37, 1.15
1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
SciMark 2.0 - Computational Test: Composite
Mflops, More Is Better
  -O2 -march=x86-64: 2883.78 (SE +/- 9.37, N = 3)
  -O2 -march=znver3: 2625.59 (SE +/- 34.75, N = 3)
  -O2 -march=znver3 -flto: 2924.23 (SE +/- 25.20, N = 3)
  -O3 -march=znver3 -flto: 3855.30 (SE +/- 130.68, N = 12)
  -O3 -march=x86-64: 3660.74 (SE +/- 12.53, N = 3)
  -O3 -march=znver2: 4407.32 (SE +/- 11.01, N = 3)
  -O3 -march=znver3: 3958.17 (SE +/- 35.50, N = 15)
  -Ofast -march=znver3 -flto: 4101.89 (SE +/- 33.94, N = 3)
1. (CC) gcc options: -lm
ACES DGEMM 1.0 - Sustained Floating-Point Rate
GFLOP/s, More Is Better
  -O2 -march=x86-64: 7.526567 (SE +/- 0.110286, N = 3)
  -O2 -march=znver3: 7.391742 (SE +/- 0.055721, N = 3)
  -O2 -march=znver3 -flto: 7.345638 (SE +/- 0.071987, N = 3)
  -O3 -march=znver3 -flto: 7.696089 (SE +/- 0.089342, N = 6)
  -O3 -march=x86-64: 6.484557 (SE +/- 0.050043, N = 15)
  -O3 -march=znver2: 7.512189 (SE +/- 0.069362, N = 3)
  -O3 -march=znver3: 7.307093 (SE +/- 0.086169, N = 3)
  -Ofast -march=znver3 -flto: 7.758952 (SE +/- 0.087025, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp
Timed ImageMagick Compilation 6.9.0 - Time To Compile
Seconds, Fewer Is Better
  -O2 -march=x86-64: 16.40 (SE +/- 0.22, N = 15)
  -O2 -march=znver3: 15.80 (SE +/- 0.02, N = 3)
  -O2 -march=znver3 -flto: 50.41 (SE +/- 0.34, N = 3)
  -O3 -march=znver3 -flto: 63.48 (SE +/- 0.32, N = 3)
  -O3 -march=x86-64: 19.00 (SE +/- 0.03, N = 3)
  -O3 -march=znver2: 19.37 (SE +/- 0.18, N = 15)
  -O3 -march=znver3: 18.86 (SE +/- 0.07, N = 3)
  -Ofast -march=znver3 -flto: 66.84 (SE +/- 0.30, N = 3)
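The ImageMagick build times above suggest that adding -flto roughly triples the compile time on this system. As a small worked example, the sketch below computes that ratio for the two -march=znver3 pairs. It assumes the exported CFLAGS/CXXFLAGS were applied to the timed build itself, and the dictionary and function names are hypothetical.

```python
# Time To Compile in seconds, copied from the results above.
BUILD_SECONDS = {
    "-O2 -march=znver3": 15.80,
    "-O2 -march=znver3 -flto": 50.41,
    "-O3 -march=znver3": 18.86,
    "-O3 -march=znver3 -flto": 63.48,
}


def lto_slowdown(times, base_flags):
    """Build time with -flto divided by build time without it."""
    return times[base_flags + " -flto"] / times[base_flags]


if __name__ == "__main__":
    for flags in ("-O2 -march=znver3", "-O3 -march=znver3"):
        ratio = lto_slowdown(BUILD_SECONDS, flags)
        print(f"{flags}: {ratio:.2f}x longer build with -flto")
```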
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Slow
Frames Per Second, More Is Better
  -O2 -march=x86-64: 11.48 (SE +/- 0.02, N = 3)
  -O2 -march=znver3: 11.52 (SE +/- 0.01, N = 3)
  -O2 -march=znver3 -flto: 11.68 (SE +/- 0.03, N = 3)
  -O3 -march=znver3 -flto: 11.88 (SE +/- 0.03, N = 3)
  -O3 -march=x86-64: 11.69 (SE +/- 0.02, N = 3)
  -O3 -march=znver2: 11.59 (SE +/- 0.03, N = 3)
  -O3 -march=znver3: 11.58 (SE +/- 0.04, N = 3)
  -Ofast -march=znver3 -flto: 12.12 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Medium
Frames Per Second, More Is Better
  -O2 -march=x86-64: 11.68 (SE +/- 0.01, N = 3)
  -O2 -march=znver3: 11.73 (SE +/- 0.01, N = 3)
  -O2 -march=znver3 -flto: 11.84 (SE +/- 0.03, N = 3)
  -O3 -march=znver3 -flto: 12.03 (SE +/- 0.01, N = 3)
  -O3 -march=x86-64: 11.86 (SE +/- 0.01, N = 3)
  -O3 -march=znver2: 11.72 (SE +/- 0.01, N = 3)
  -O3 -march=znver3: 11.74 (SE +/- 0.00, N = 3)
  -Ofast -march=znver3 -flto: 12.27 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better
  -O2 -march=x86-64: 60.90 (SE +/- 0.16, N = 3)
  -O2 -march=znver3: 46.72 (SE +/- 0.11, N = 3)
  -O2 -march=znver3 -flto: 46.42 (SE +/- 0.11, N = 3)
  -O3 -march=znver3 -flto: 25.48 (SE +/- 0.06, N = 3)
  -O3 -march=x86-64: 31.36 (SE +/- 0.07, N = 3)
  -O3 -march=znver2: 25.25 (SE +/- 0.09, N = 3)
  -O3 -march=znver3: 25.25 (SE +/- 0.11, N = 3)
  -Ofast -march=znver3 -flto: 27.34 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -lm -lpthread -O3
Coremark 1.0 - CoreMark Size 666 - Iterations Per Second
Iterations/Sec, More Is Better
  -O2 -march=x86-64: 700520.67 (SE +/- 2080.13, N = 3)
  -O2 -march=znver3: 669924.92 (SE +/- 4015.82, N = 3)
  -O2 -march=znver3 -flto: 681281.71 (SE +/- 5589.70, N = 3)
  -O3 -march=znver3 -flto: 736850.43 (SE +/- 1257.49, N = 3)
  -O3 -march=x86-64: 717480.24 (SE +/- 371.91, N = 3)
  -O3 -march=znver2: 721620.07 (SE +/- 1896.73, N = 3)
  -O3 -march=znver3: 716026.62 (SE +/- 449.85, N = 3)
  -Ofast -march=znver3 -flto: 728279.79 (SE +/- 1316.80, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt
LibRaw 0.20 - Post-Processing Benchmark
Mpix/sec, More Is Better
  -O2 -march=x86-64: 51.45 (SE +/- 0.41, N = 3)
  -O2 -march=znver3: 50.95 (SE +/- 0.46, N = 3)
  -O2 -march=znver3 -flto: 49.32 (SE +/- 0.36, N = 3)
  -O3 -march=znver3 -flto: 73.73 (SE +/- 0.89, N = 3)
  -O3 -march=x86-64: 63.90 (SE +/- 0.24, N = 3)
  -O3 -march=znver2: 75.26 (SE +/- 0.59, N = 3)
  -O3 -march=znver3: 73.83 (SE +/- 1.48, N = 15)
  -Ofast -march=znver3 -flto: 75.23 (SE +/- 0.56, N = 3)
1. (CXX) g++ options: -fopenmp -ljpeg -lz -lm
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: NDT Mapping
Test Cases Per Minute, More Is Better
  -O2 -march=x86-64: 921.71 (SE +/- 13.06, N = 3)
  -O2 -march=znver3: 966.99 (SE +/- 3.99, N = 3)
  -O2 -march=znver3 -flto: 970.46 (SE +/- 2.71, N = 3)
  -O3 -march=znver3 -flto: 971.45 (SE +/- 2.69, N = 3)
  -O3 -march=x86-64: 931.03 (SE +/- 3.82, N = 3)
  -O3 -march=znver2: 964.72 (SE +/- 0.72, N = 3)
  -O3 -march=znver3: 971.31 (SE +/- 2.43, N = 3)
  -Ofast -march=znver3 -flto: 969.91 (SE +/- 3.18, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Very Fast
Frames Per Second, More Is Better
  -O2 -march=x86-64: 29.55 (SE +/- 0.09, N = 3)
  -O2 -march=znver3: 29.87 (SE +/- 0.04, N = 3)
  -O2 -march=znver3 -flto: 30.76 (SE +/- 0.05, N = 3)
  -O3 -march=znver3 -flto: 31.44 (SE +/- 0.02, N = 3)
  -O3 -march=x86-64: 30.48 (SE +/- 0.02, N = 3)
  -O3 -march=znver2: 30.34 (SE +/- 0.03, N = 3)
  -O3 -march=znver3: 30.31 (SE +/- 0.08, N = 3)
  -Ofast -march=znver3 -flto: 32.40 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
AOM AV1 2.0 - Encoder Mode: Speed 8 Realtime
Frames Per Second, More Is Better
  -O2 -march=x86-64: 60.88 (SE +/- 0.73, N = 6)
  -O2 -march=znver3: 62.10 (SE +/- 0.37, N = 3)
  -O2 -march=znver3 -flto: 62.08 (SE +/- 0.79, N = 3)
  -O3 -march=znver3 -flto: 62.52 (SE +/- 0.59, N = 9)
  -O3 -march=x86-64: 60.07 (SE +/- 0.99, N = 15)
  -O3 -march=znver2: 62.28 (SE +/- 0.23, N = 3)
  -O3 -march=znver3: 62.33 (SE +/- 0.65, N = 3)
  -Ofast -march=znver3 -flto: 63.76 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Slow
Frames Per Second, More Is Better
  -O2 -march=x86-64: 45.09 (SE +/- 0.12, N = 3)
  -O2 -march=znver3: 44.89 (SE +/- 0.12, N = 3)
  -O2 -march=znver3 -flto: 45.68 (SE +/- 0.05, N = 3)
  -O3 -march=znver3 -flto: 46.30 (SE +/- 0.06, N = 3)
  -O3 -march=x86-64: 45.75 (SE +/- 0.08, N = 3)
  -O3 -march=znver2: 44.97 (SE +/- 0.06, N = 3)
  -O3 -march=znver3: 44.91 (SE +/- 0.11, N = 3)
  -Ofast -march=znver3 -flto: 47.25 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Medium
Frames Per Second, More Is Better
  -O2 -march=x86-64: 46.32 (SE +/- 0.08, N = 3)
  -O2 -march=znver3: 46.31 (SE +/- 0.11, N = 3)
  -O2 -march=znver3 -flto: 47.04 (SE +/- 0.03, N = 3)
  -O3 -march=znver3 -flto: 47.32 (SE +/- 0.06, N = 3)
  -O3 -march=x86-64: 46.74 (SE +/- 0.07, N = 3)
  -O3 -march=znver2: 46.38 (SE +/- 0.03, N = 3)
  -O3 -march=znver3: 46.42 (SE +/- 0.09, N = 3)
  -Ofast -march=znver3 -flto: 48.54 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast
Frames Per Second, More Is Better
  -O2 -march=x86-64: 53.13 (SE +/- 0.06, N = 3)
  -O2 -march=znver3: 53.83 (SE +/- 0.03, N = 3)
  -O2 -march=znver3 -flto: 54.91 (SE +/- 0.31, N = 3)
  -O3 -march=znver3 -flto: 56.12 (SE +/- 0.13, N = 3)
  -O3 -march=x86-64: 54.43 (SE +/- 0.11, N = 3)
  -O3 -march=znver2: 54.75 (SE +/- 0.04, N = 3)
  -O3 -march=znver3: 54.73 (SE +/- 0.05, N = 3)
  -Ofast -march=znver3 -flto: 58.69 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096
Mflops, More Is Better
  -O2 -march=x86-64: 12892.0 (SE +/- 135.67, N = 3)
  -O2 -march=znver3: 13771.0 (SE +/- 161.07, N = 3)
  -O2 -march=znver3 -flto: 15232.0 (SE +/- 16.50, N = 3)
  -O3 -march=znver3 -flto: 15584.0 (SE +/- 105.64, N = 3)
  -O3 -march=x86-64: 12436.0 (SE +/- 80.35, N = 3)
  -O3 -march=znver2: 14096.0 (SE +/- 75.72, N = 3)
  -O3 -march=znver3: 13702.0 (SE +/- 54.77, N = 3)
  -Ofast -march=znver3 -flto: 12149.0 (SE +/- 564.61, N = 15)
1. (CC) gcc options: -pthread -lm
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Very Fast
Frames Per Second, More Is Better
  -O2 -march=x86-64: 108.72 (SE +/- 0.51, N = 3)
  -O2 -march=znver3: 109.42 (SE +/- 0.20, N = 3)
  -O2 -march=znver3 -flto: 113.41 (SE +/- 0.14, N = 3)
  -O3 -march=znver3 -flto: 116.07 (SE +/- 0.16, N = 3)
  -O3 -march=x86-64: 111.77 (SE +/- 0.30, N = 3)
  -O3 -march=znver2: 111.54 (SE +/- 0.15, N = 3)
  -O3 -march=znver3: 111.50 (SE +/- 0.17, N = 3)
  -Ofast -march=znver3 -flto: 121.74 (SE +/- 0.24, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Smallpt 1.0 - Global Illumination Renderer; 128 Samples
Seconds, Fewer Is Better
  -O2 -march=x86-64: 4.829 (SE +/- 0.004, N = 3)
  -O2 -march=znver3: 4.567 (SE +/- 0.003, N = 3)
  -O2 -march=znver3 -flto: 4.607 (SE +/- 0.009, N = 3)
  -O3 -march=znver3 -flto: 4.656 (SE +/- 0.012, N = 3)
  -O3 -march=x86-64: 5.068 (SE +/- 0.018, N = 3)
  -O3 -march=znver2: 4.677 (SE +/- 0.016, N = 3)
  -O3 -march=znver3: 4.673 (SE +/- 0.025, N = 3)
  -Ofast -march=znver3 -flto: 4.602 (SE +/- 0.005, N = 3)
1. (CXX) g++ options: -fopenmp -O3
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
Frames Per Second, More Is Better
  -O2 -march=x86-64: 194.52 (SE +/- 0.75, N = 3)
  -O2 -march=znver3: 195.13 (SE +/- 0.09, N = 3)
  -O2 -march=znver3 -flto: 201.85 (SE +/- 0.55, N = 3)
  -O3 -march=znver3 -flto: 203.89 (SE +/- 0.63, N = 3)
  -O3 -march=x86-64: 198.84 (SE +/- 0.89, N = 3)
  -O3 -march=znver2: 199.35 (SE +/- 0.35, N = 3)
  -O3 -march=znver3: 200.12 (SE +/- 0.25, N = 3)
  -Ofast -march=znver3 -flto: 220.99 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Phoronix Test Suite v10.8.4