Benchmarks for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012160-PTS-GCCZNVER86 GCC AMD Znver3 Compiler Optimization Levels - Phoronix Test Suite GCC AMD Znver3 Compiler Optimization Levels Benchmarks for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2012160-PTS-GCCZNVER86&export=pdf&sro&grs .
GCC AMD Znver3 Compiler Optimization Levels Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution -O2 -march=x86-64 -O3 -march=x86-64 -O3 -march=znver2 -O2 -march=znver3 -O3 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz) AMD Navi 10 HDMI Audio ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.10.0-051000rc6daily20201205-generic (x86_64) 20201204 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0) 1.2.145 GCC 11.0.0 20201213 ext4 3840x2160 OpenBenchmarking.org Environment Details - -O2 -march=x86-64: CXXFLAGS="-O2 -march=x86-64" CFLAGS="-O2 -march=x86-64" - -O3 -march=x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64" - -O3 -march=znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2" - -O2 -march=znver3: CXXFLAGS="-O2 -march=znver3" CFLAGS="-O2 -march=znver3" - -O3 -march=znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3" - -O2 -march=znver3 -flto: CXXFLAGS="-O2 -march=znver3 -flto" CFLAGS="-O2 -march=znver3 -flto" - -O3 -maech=znver3 -flto: CXXFLAGS="-O3 -march=znver3 -flto" CFLAGS="-O3 -march=znver3 -flto" - -Ofast -march=znver3 -flto: CXXFLAGS="-Ofast -march=znver3 -flto" CFLAGS="-Ofast -march=znver3 -flto" Compiler Details - --disable-multilib --enable-checking=release Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected Disk Details - -O3 -march=znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Python Details - -O3 -march=znver3: Python 2.7.18 + Python 3.8.5
GCC AMD Znver3 Compiler Optimization Levels build-imagemagick: Time To Compile c-ray: Total Time - 4K, 16 Rays Per Pixel mrbayes: Primate Phylogeny Analysis graphics-magick: Sharpen scimark2: Composite libraw: Post-Processing Benchmark graphics-magick: Resizing fftw: Stock - 1D FFT Size 4096 graphics-magick: Enhanced fftw: Stock - 2D FFT Size 4096 mt-dgemm: Sustained Floating-Point Rate graphics-magick: Noise-Gaussian kvazaar: Bosphorus 1080p - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast hmmer: Pfam Database Search smallpt: Global Illumination Renderer; 128 Samples kvazaar: Bosphorus 4K - Ultra Fast coremark: CoreMark Size 666 - Iterations Per Second kvazaar: Bosphorus 4K - Very Fast aom-av1: Speed 8 Realtime kvazaar: Bosphorus 4K - Slow daphne: OpenMP - NDT Mapping kvazaar: Bosphorus 1080p - Slow kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Medium aobench: 2048 x 2048 - Total Time -O2 -march=x86-64 -O3 -march=x86-64 -O3 -march=znver2 -O2 -march=znver3 -O3 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -Ofast -march=znver3 -flto 16.398 60.896 79.341 224 2883.78 51.45 1605 12892 419 8352.8 7.526567 429 194.52 108.72 85.788 4.829 53.13 700520.673201 29.55 60.88 11.48 921.71 45.09 11.68 46.32 35.035 19.000 31.364 77.074 225 3660.74 63.90 1721 12436 382 8169.2 6.484557 426 198.84 111.77 85.596 5.068 54.43 717480.237613 30.48 60.07 11.69 931.03 45.75 11.86 46.74 29.991 19.370 25.251 75.697 377 4407.32 75.26 1907 14096 444 8927.1 7.512189 434 199.35 111.54 84.791 4.677 54.75 721620.067510 30.34 62.28 11.59 964.72 44.97 11.72 46.38 25.016 15.804 46.720 79.543 359 2625.59 50.95 2150 13771 430 8823.7 7.391742 452 195.13 109.42 85.126 4.567 53.83 669924.921861 29.87 62.10 11.52 966.99 44.89 11.73 46.31 30.711 18.859 25.254 76.317 361 3958.17 73.83 2135 13702 432 9314.1 7.307093 441 200.12 111.50 85.020 4.673 54.73 716026.620271 30.31 62.33 11.58 971.31 44.91 11.74 46.42 25.065 50.407 46.419 79.513 377 2924.23 49.32 1854 15232 476 9718.2 7.345638 445 201.85 113.41 84.929 4.607 54.91 681281.706242 30.76 62.08 11.68 970.46 45.68 11.84 47.04 27.363 63.476 25.480 73.919 370 3855.30 73.73 1906 15584 445 10125 7.696089 430 203.89 116.07 84.119 4.656 56.12 736850.433639 31.44 62.52 11.88 971.45 46.30 12.03 47.32 25.848 66.843 27.342 45.274 328 4101.89 75.23 1901 12149.0 449 8755.9 7.758952 504 220.99 121.74 76.943 4.602 58.69 728279.792211 32.40 63.76 12.12 969.91 47.25 12.27 48.54 24.505 OpenBenchmarking.org
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 15 30 45 60 75 SE +/- 0.22, N = 15 SE +/- 0.02, N = 3 SE +/- 0.34, N = 3 SE +/- 0.32, N = 3 SE +/- 0.03, N = 3 SE +/- 0.18, N = 15 SE +/- 0.07, N = 3 SE +/- 0.30, N = 3 16.40 15.80 50.41 63.48 19.00 19.37 18.86 66.84
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 14 28 42 56 70 SE +/- 0.16, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 60.90 46.72 46.42 25.48 31.36 25.25 25.25 27.34 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -lm -lpthread -O3
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 20 40 60 80 100 SE +/- 0.78, N = 9 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.64, N = 15 SE +/- 0.72, N = 9 SE +/- 0.29, N = 3 79.34 79.54 79.51 73.92 77.07 75.70 76.32 45.27 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 80 160 240 320 400 SE +/- 0.58, N = 3 SE +/- 4.33, N = 3 SE +/- 1.20, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 1.20, N = 3 SE +/- 2.33, N = 3 SE +/- 0.88, N = 3 224 359 377 370 225 377 361 328 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 900 1800 2700 3600 4500 SE +/- 9.37, N = 3 SE +/- 34.75, N = 3 SE +/- 25.20, N = 3 SE +/- 130.68, N = 12 SE +/- 12.53, N = 3 SE +/- 11.01, N = 3 SE +/- 35.50, N = 15 SE +/- 33.94, N = 3 2883.78 2625.59 2924.23 3855.30 3660.74 4407.32 3958.17 4101.89 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -lm
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 20 40 60 80 100 SE +/- 0.41, N = 3 SE +/- 0.46, N = 3 SE +/- 0.36, N = 3 SE +/- 0.89, N = 3 SE +/- 0.24, N = 3 SE +/- 0.59, N = 3 SE +/- 1.48, N = 15 SE +/- 0.56, N = 3 51.45 50.95 49.32 73.73 63.90 75.26 73.83 75.23 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CXX) g++ options: -fopenmp -ljpeg -lz -lm
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 500 1000 1500 2000 2500 SE +/- 1.53, N = 3 SE +/- 4.91, N = 3 SE +/- 1.76, N = 3 SE +/- 14.34, N = 3 SE +/- 1.20, N = 3 SE +/- 8.37, N = 3 SE +/- 1.15, N = 3 1605 2150 1854 1906 1721 1907 2135 1901 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 3K 6K 9K 12K 15K SE +/- 135.67, N = 3 SE +/- 161.07, N = 3 SE +/- 16.50, N = 3 SE +/- 105.64, N = 3 SE +/- 80.35, N = 3 SE +/- 75.72, N = 3 SE +/- 54.77, N = 3 SE +/- 564.61, N = 15 12892.0 13771.0 15232.0 15584.0 12436.0 14096.0 13702.0 12149.0 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -lm
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 100 200 300 400 500 SE +/- 1.53, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 5.29, N = 3 SE +/- 0.88, N = 3 419 430 476 445 382 444 432 449 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 2K 4K 6K 8K 10K SE +/- 88.61, N = 3 SE +/- 39.49, N = 3 SE +/- 96.35, N = 3 SE +/- 25.21, N = 3 SE +/- 312.57, N = 12 SE +/- 15.97, N = 3 SE +/- 18.57, N = 3 SE +/- 86.92, N = 3 8352.8 8823.7 9718.2 10125.0 8169.2 8927.1 9314.1 8755.9 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -lm
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 2 4 6 8 10 SE +/- 0.110286, N = 3 SE +/- 0.055721, N = 3 SE +/- 0.071987, N = 3 SE +/- 0.089342, N = 6 SE +/- 0.050043, N = 15 SE +/- 0.069362, N = 3 SE +/- 0.086169, N = 3 SE +/- 0.087025, N = 3 7.526567 7.391742 7.345638 7.696089 6.484557 7.512189 7.307093 7.758952 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -O3 -march=native -fopenmp
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 110 220 330 440 550 SE +/- 3.51, N = 3 SE +/- 1.45, N = 3 SE +/- 1.53, N = 3 SE +/- 4.91, N = 12 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 1.53, N = 3 429 452 445 430 426 434 441 504 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 50 100 150 200 250 SE +/- 0.75, N = 3 SE +/- 0.09, N = 3 SE +/- 0.55, N = 3 SE +/- 0.63, N = 3 SE +/- 0.89, N = 3 SE +/- 0.35, N = 3 SE +/- 0.25, N = 3 SE +/- 0.20, N = 3 194.52 195.13 201.85 203.89 198.84 199.35 200.12 220.99 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 30 60 90 120 150 SE +/- 0.51, N = 3 SE +/- 0.20, N = 3 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 SE +/- 0.30, N = 3 SE +/- 0.15, N = 3 SE +/- 0.17, N = 3 SE +/- 0.24, N = 3 108.72 109.42 113.41 116.07 111.77 111.54 111.50 121.74 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 SE +/- 0.07, N = 3 85.79 85.13 84.93 84.12 85.60 84.79 85.02 76.94 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -lhmmer -leasel -lm
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1.1403 2.2806 3.4209 4.5612 5.7015 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.009, N = 3 SE +/- 0.012, N = 3 SE +/- 0.018, N = 3 SE +/- 0.016, N = 3 SE +/- 0.025, N = 3 SE +/- 0.005, N = 3 4.829 4.567 4.607 4.656 5.068 4.677 4.673 4.602 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 -Ofast -march=znver3 -flto 1. (CXX) g++ options: -fopenmp -O3
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 13 26 39 52 65 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.31, N = 3 SE +/- 0.13, N = 3 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 53.13 53.83 54.91 56.12 54.43 54.75 54.73 58.69 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 160K 320K 480K 640K 800K SE +/- 2080.13, N = 3 SE +/- 4015.82, N = 3 SE +/- 5589.70, N = 3 SE +/- 1257.49, N = 3 SE +/- 371.91, N = 3 SE +/- 1896.73, N = 3 SE +/- 449.85, N = 3 SE +/- 1316.80, N = 3 700520.67 669924.92 681281.71 736850.43 717480.24 721620.07 716026.62 728279.79 -march=x86-64 -march=znver3 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -O2 -lrt" -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 29.55 29.87 30.76 31.44 30.48 30.34 30.31 32.40 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 14 28 42 56 70 SE +/- 0.73, N = 6 SE +/- 0.37, N = 3 SE +/- 0.79, N = 3 SE +/- 0.59, N = 9 SE +/- 0.99, N = 15 SE +/- 0.23, N = 3 SE +/- 0.65, N = 3 SE +/- 0.25, N = 3 60.88 62.10 62.08 62.52 60.07 62.28 62.33 63.76 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 -Ofast -march=znver3 -flto 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 11.48 11.52 11.68 11.88 11.69 11.59 11.58 12.12 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 200 400 600 800 1000 SE +/- 13.06, N = 3 SE +/- 3.99, N = 3 SE +/- 2.71, N = 3 SE +/- 2.69, N = 3 SE +/- 3.82, N = 3 SE +/- 0.72, N = 3 SE +/- 2.43, N = 3 SE +/- 3.18, N = 3 921.71 966.99 970.46 971.45 931.03 964.72 971.31 969.91 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 11 22 33 44 55 SE +/- 0.12, N = 3 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 45.09 44.89 45.68 46.30 45.75 44.97 44.91 47.25 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 11.68 11.73 11.84 12.03 11.86 11.72 11.74 12.27 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 11 22 33 44 55 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 46.32 46.31 47.04 47.32 46.74 46.38 46.42 48.54 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 -Ofast -march=znver3 -flto 8 16 24 32 40 SE +/- 2.22, N = 15 SE +/- 1.92, N = 15 SE +/- 0.23, N = 3 SE +/- 0.08, N = 3 SE +/- 1.25, N = 15 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 35.04 30.71 27.36 25.85 29.99 25.02 25.07 24.51 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 -Ofast -march=znver3 -flto 1. (CC) gcc options: -lm -O3
Phoronix Test Suite v10.8.4