Benchmarks for a future article.
-O3 -march=znver3 Environment Notes: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"Compiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Python Notes: Python 2.7.18 + Python 3.8.5Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-O3 -march=znver2 Environment Notes: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-O3 -march=x86-64 Environment Notes: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-O3 -maech=znver3 -flto Environment Notes: CXXFLAGS="-O3 -march=znver3 -flto" CFLAGS="-O3 -march=znver3 -flto"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-Ofast -march=znver3 -flto Environment Notes: CXXFLAGS="-Ofast -march=znver3 -flto" CFLAGS="-Ofast -march=znver3 -flto"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-O2 -march=znver3 -flto Environment Notes: CXXFLAGS="-O2 -march=znver3 -flto" CFLAGS="-O2 -march=znver3 -flto"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-O2 -march=znver3 Environment Notes: CXXFLAGS="-O2 -march=znver3" CFLAGS="-O2 -march=znver3"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
-O2 -march=x86-64 Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz), Audio: AMD Navi 10 HDMI Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04, Kernel: 5.10.0-051000rc6daily20201205-generic (x86_64) 20201204, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0), Vulkan: 1.2.145, Compiler: GCC 11.0.0 20201213, File-System: ext4, Screen Resolution: 3840x2160
Environment Notes: CXXFLAGS="-O2 -march=x86-64" CFLAGS="-O2 -march=x86-64"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Timed ImageMagick Compilation This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 15 30 45 60 75 SE +/- 0.22, N = 15 SE +/- 0.02, N = 3 SE +/- 0.34, N = 3 SE +/- 0.30, N = 3 SE +/- 0.32, N = 3 SE +/- 0.03, N = 3 SE +/- 0.18, N = 15 SE +/- 0.07, N = 3 16.40 15.80 50.41 66.84 63.48 19.00 19.37 18.86
Result Confidence
OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 13 26 39 52 65 Min: 15.78 / Avg: 16.4 / Max: 18.47 Min: 15.77 / Avg: 15.8 / Max: 15.84 Min: 49.8 / Avg: 50.41 / Max: 50.98 Min: 66.37 / Avg: 66.84 / Max: 67.41 Min: 62.92 / Avg: 63.48 / Max: 64.04 Min: 18.96 / Avg: 19 / Max: 19.06 Min: 18.85 / Avg: 19.37 / Max: 21.43 Min: 18.75 / Avg: 18.86 / Max: 19
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 14 28 42 56 70 SE +/- 0.16, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 SE +/- 0.11, N = 3 60.90 46.72 46.42 27.34 25.48 31.36 25.25 25.25 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 1. (CC) gcc options: -lm -lpthread -O3
Result Confidence
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 12 24 36 48 60 Min: 60.64 / Avg: 60.9 / Max: 61.18 Min: 46.5 / Avg: 46.72 / Max: 46.86 Min: 46.21 / Avg: 46.42 / Max: 46.53 Min: 27.26 / Avg: 27.34 / Max: 27.43 Min: 25.37 / Avg: 25.48 / Max: 25.59 Min: 31.27 / Avg: 31.36 / Max: 31.51 Min: 25.1 / Avg: 25.25 / Max: 25.41 Min: 25.06 / Avg: 25.25 / Max: 25.42 1. (CC) gcc options: -lm -lpthread -O3
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 20 40 60 80 100 SE +/- 0.78, N = 9 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 SE +/- 0.29, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.64, N = 15 SE +/- 0.72, N = 9 79.34 79.54 79.51 45.27 73.92 77.07 75.70 76.32 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
Result Confidence
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 15 30 45 60 75 Min: 73.19 / Avg: 79.34 / Max: 80.61 Min: 79.41 / Avg: 79.54 / Max: 79.73 Min: 79.21 / Avg: 79.51 / Max: 79.87 Min: 44.8 / Avg: 45.27 / Max: 45.81 Min: 73.82 / Avg: 73.92 / Max: 74.03 Min: 76.94 / Avg: 77.07 / Max: 77.18 Min: 70.66 / Avg: 75.7 / Max: 77.17 Min: 70.6 / Avg: 76.32 / Max: 77.67 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 80 160 240 320 400 SE +/- 0.58, N = 3 SE +/- 4.33, N = 3 SE +/- 1.20, N = 3 SE +/- 0.88, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 1.20, N = 3 SE +/- 2.33, N = 3 224 359 377 328 370 225 377 361 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Result Confidence
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 70 140 210 280 350 Min: 223 / Avg: 224 / Max: 225 Min: 352 / Avg: 359.33 / Max: 367 Min: 375 / Avg: 376.67 / Max: 379 Min: 327 / Avg: 328.33 / Max: 330 Min: 369 / Avg: 369.67 / Max: 371 Min: 224 / Avg: 224.67 / Max: 226 Min: 375 / Avg: 376.67 / Max: 379 Min: 359 / Avg: 361.33 / Max: 366 1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 900 1800 2700 3600 4500 SE +/- 9.37, N = 3 SE +/- 34.75, N = 3 SE +/- 25.20, N = 3 SE +/- 33.94, N = 3 SE +/- 130.68, N = 12 SE +/- 12.53, N = 3 SE +/- 11.01, N = 3 SE +/- 35.50, N = 15 2883.78 2625.59 2924.23 4101.89 3855.30 3660.74 4407.32 3958.17 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -lm
Result Confidence
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 800 1600 2400 3200 4000 Min: 2865.07 / Avg: 2883.78 / Max: 2893.89 Min: 2590.39 / Avg: 2625.59 / Max: 2695.1 Min: 2884.82 / Avg: 2924.23 / Max: 2971.14 Min: 4045.4 / Avg: 4101.89 / Max: 4162.74 Min: 2422.61 / Avg: 3855.3 / Max: 4083.86 Min: 3644.12 / Avg: 3660.74 / Max: 3685.3 Min: 4396.06 / Avg: 4407.32 / Max: 4429.33 Min: 3691.72 / Avg: 3958.17 / Max: 4143.02 1. (CC) gcc options: -lm
LibRaw LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 20 40 60 80 100 SE +/- 0.41, N = 3 SE +/- 0.46, N = 3 SE +/- 0.36, N = 3 SE +/- 0.56, N = 3 SE +/- 0.89, N = 3 SE +/- 0.24, N = 3 SE +/- 0.59, N = 3 SE +/- 1.48, N = 15 51.45 50.95 49.32 75.23 73.73 63.90 75.26 73.83 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CXX) g++ options: -fopenmp -ljpeg -lz -lm
Result Confidence
OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 14 28 42 56 70 Min: 50.68 / Avg: 51.45 / Max: 52.09 Min: 50.45 / Avg: 50.95 / Max: 51.88 Min: 48.92 / Avg: 49.32 / Max: 50.03 Min: 74.26 / Avg: 75.23 / Max: 76.2 Min: 72.36 / Avg: 73.73 / Max: 75.39 Min: 63.53 / Avg: 63.9 / Max: 64.35 Min: 74.09 / Avg: 75.26 / Max: 75.93 Min: 59.91 / Avg: 73.83 / Max: 77.16 1. (CXX) g++ options: -fopenmp -ljpeg -lz -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 3K 6K 9K 12K 15K SE +/- 135.67, N = 3 SE +/- 161.07, N = 3 SE +/- 16.50, N = 3 SE +/- 564.61, N = 15 SE +/- 105.64, N = 3 SE +/- 80.35, N = 3 SE +/- 75.72, N = 3 SE +/- 54.77, N = 3 12892.0 13771.0 15232.0 12149.0 15584.0 12436.0 14096.0 13702.0 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -lm
Result Confidence
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 3K 6K 9K 12K 15K Min: 12754 / Avg: 12891.67 / Max: 13163 Min: 13451 / Avg: 13771.33 / Max: 13961 Min: 15200 / Avg: 15232 / Max: 15255 Min: 7928.2 / Avg: 12148.95 / Max: 13539 Min: 15380 / Avg: 15584.33 / Max: 15733 Min: 12353 / Avg: 12436.33 / Max: 12597 Min: 13956 / Avg: 14096 / Max: 14216 Min: 13594 / Avg: 13701.67 / Max: 13773 1. (CC) gcc options: -pthread -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 2K 4K 6K 8K 10K SE +/- 88.61, N = 3 SE +/- 39.49, N = 3 SE +/- 96.35, N = 3 SE +/- 86.92, N = 3 SE +/- 25.21, N = 3 SE +/- 312.57, N = 12 SE +/- 15.97, N = 3 SE +/- 18.57, N = 3 8352.8 8823.7 9718.2 8755.9 10125.0 8169.2 8927.1 9314.1 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -lm
Result Confidence
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 2K 4K 6K 8K 10K Min: 8256.5 / Avg: 8352.8 / Max: 8529.8 Min: 8748.7 / Avg: 8823.73 / Max: 8882.6 Min: 9534.3 / Avg: 9718.2 / Max: 9860 Min: 8582.4 / Avg: 8755.93 / Max: 8851.5 Min: 10093 / Avg: 10125.33 / Max: 10175 Min: 5835.3 / Avg: 8169.18 / Max: 8803.1 Min: 8903.9 / Avg: 8927.1 / Max: 8957.7 Min: 9277 / Avg: 9314.07 / Max: 9334.7 1. (CC) gcc options: -pthread -lm
ACES DGEMM This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 2 4 6 8 10 SE +/- 0.110286, N = 3 SE +/- 0.055721, N = 3 SE +/- 0.071987, N = 3 SE +/- 0.087025, N = 3 SE +/- 0.089342, N = 6 SE +/- 0.050043, N = 15 SE +/- 0.069362, N = 3 SE +/- 0.086169, N = 3 7.526567 7.391742 7.345638 7.758952 7.696089 6.484557 7.512189 7.307093 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -march=native -fopenmp
Result Confidence
OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 3 6 9 12 15 Min: 7.34 / Avg: 7.53 / Max: 7.72 Min: 7.33 / Avg: 7.39 / Max: 7.5 Min: 7.2 / Avg: 7.35 / Max: 7.42 Min: 7.65 / Avg: 7.76 / Max: 7.93 Min: 7.42 / Avg: 7.7 / Max: 7.98 Min: 6.2 / Avg: 6.48 / Max: 6.91 Min: 7.37 / Avg: 7.51 / Max: 7.59 Min: 7.22 / Avg: 7.31 / Max: 7.48 1. (CC) gcc options: -O3 -march=native -fopenmp
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 50 100 150 200 250 SE +/- 0.75, N = 3 SE +/- 0.09, N = 3 SE +/- 0.55, N = 3 SE +/- 0.20, N = 3 SE +/- 0.63, N = 3 SE +/- 0.89, N = 3 SE +/- 0.35, N = 3 SE +/- 0.25, N = 3 194.52 195.13 201.85 220.99 203.89 198.84 199.35 200.12 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 40 80 120 160 200 Min: 193.53 / Avg: 194.52 / Max: 196 Min: 195.03 / Avg: 195.13 / Max: 195.3 Min: 200.96 / Avg: 201.85 / Max: 202.85 Min: 220.65 / Avg: 220.99 / Max: 221.33 Min: 202.63 / Avg: 203.89 / Max: 204.56 Min: 197.06 / Avg: 198.84 / Max: 199.88 Min: 198.65 / Avg: 199.35 / Max: 199.71 Min: 199.63 / Avg: 200.12 / Max: 200.43 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 30 60 90 120 150 SE +/- 0.51, N = 3 SE +/- 0.20, N = 3 SE +/- 0.14, N = 3 SE +/- 0.24, N = 3 SE +/- 0.16, N = 3 SE +/- 0.30, N = 3 SE +/- 0.15, N = 3 SE +/- 0.17, N = 3 108.72 109.42 113.41 121.74 116.07 111.77 111.54 111.50 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 20 40 60 80 100 Min: 108.06 / Avg: 108.72 / Max: 109.71 Min: 109.18 / Avg: 109.42 / Max: 109.81 Min: 113.14 / Avg: 113.41 / Max: 113.62 Min: 121.34 / Avg: 121.74 / Max: 122.18 Min: 115.83 / Avg: 116.07 / Max: 116.37 Min: 111.25 / Avg: 111.77 / Max: 112.3 Min: 111.34 / Avg: 111.54 / Max: 111.84 Min: 111.22 / Avg: 111.5 / Max: 111.82 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 85.79 85.13 84.93 76.94 84.12 85.60 84.79 85.02 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -lhmmer -leasel -lm
Result Confidence
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 16 32 48 64 80 Min: 85.57 / Avg: 85.79 / Max: 86.08 Min: 84.92 / Avg: 85.13 / Max: 85.36 Min: 84.8 / Avg: 84.93 / Max: 85.11 Min: 76.82 / Avg: 76.94 / Max: 77.06 Min: 83.94 / Avg: 84.12 / Max: 84.25 Min: 85.39 / Avg: 85.6 / Max: 85.89 Min: 84.57 / Avg: 84.79 / Max: 84.93 Min: 84.67 / Avg: 85.02 / Max: 85.21 1. (CC) gcc options: -pthread -lhmmer -leasel -lm
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1.1403 2.2806 3.4209 4.5612 5.7015 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.009, N = 3 SE +/- 0.005, N = 3 SE +/- 0.012, N = 3 SE +/- 0.018, N = 3 SE +/- 0.016, N = 3 SE +/- 0.025, N = 3 4.829 4.567 4.607 4.602 4.656 5.068 4.677 4.673 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 1. (CXX) g++ options: -fopenmp -O3
Result Confidence
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 2 4 6 8 10 Min: 4.82 / Avg: 4.83 / Max: 4.84 Min: 4.56 / Avg: 4.57 / Max: 4.57 Min: 4.59 / Avg: 4.61 / Max: 4.62 Min: 4.59 / Avg: 4.6 / Max: 4.61 Min: 4.64 / Avg: 4.66 / Max: 4.68 Min: 5.04 / Avg: 5.07 / Max: 5.1 Min: 4.65 / Avg: 4.68 / Max: 4.71 Min: 4.64 / Avg: 4.67 / Max: 4.72 1. (CXX) g++ options: -fopenmp -O3
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 13 26 39 52 65 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.31, N = 3 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 53.13 53.83 54.91 58.69 56.12 54.43 54.75 54.73 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 12 24 36 48 60 Min: 53 / Avg: 53.13 / Max: 53.21 Min: 53.77 / Avg: 53.83 / Max: 53.88 Min: 54.29 / Avg: 54.91 / Max: 55.24 Min: 58.53 / Avg: 58.69 / Max: 58.86 Min: 55.91 / Avg: 56.12 / Max: 56.36 Min: 54.21 / Avg: 54.43 / Max: 54.55 Min: 54.68 / Avg: 54.75 / Max: 54.8 Min: 54.64 / Avg: 54.73 / Max: 54.81 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 160K 320K 480K 640K 800K SE +/- 2080.13, N = 3 SE +/- 4015.82, N = 3 SE +/- 5589.70, N = 3 SE +/- 1316.80, N = 3 SE +/- 1257.49, N = 3 SE +/- 371.91, N = 3 SE +/- 1896.73, N = 3 SE +/- 449.85, N = 3 700520.67 669924.92 681281.71 728279.79 736850.43 717480.24 721620.07 716026.62 -march=x86-64 -march=znver3 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -O2 -lrt" -lrt
Result Confidence
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 130K 260K 390K 520K 650K Min: 698131.05 / Avg: 700520.67 / Max: 704664.73 Min: 661909.19 / Avg: 669924.92 / Max: 674370.41 Min: 670133.68 / Avg: 681281.71 / Max: 687580.58 Min: 725760.73 / Avg: 728279.79 / Max: 730204.61 Min: 734900.1 / Avg: 736850.43 / Max: 739200.74 Min: 716819.12 / Avg: 717480.24 / Max: 718106 Min: 718240.31 / Avg: 721620.07 / Max: 724801.81 Min: 715563.51 / Avg: 716026.62 / Max: 716926.18 1. (CC) gcc options: -O2 -lrt" -lrt
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 29.55 29.87 30.76 32.40 31.44 30.48 30.34 30.31 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 7 14 21 28 35 Min: 29.38 / Avg: 29.55 / Max: 29.67 Min: 29.82 / Avg: 29.87 / Max: 29.94 Min: 30.71 / Avg: 30.76 / Max: 30.85 Min: 32.3 / Avg: 32.4 / Max: 32.47 Min: 31.41 / Avg: 31.44 / Max: 31.47 Min: 30.44 / Avg: 30.48 / Max: 30.51 Min: 30.3 / Avg: 30.34 / Max: 30.39 Min: 30.18 / Avg: 30.31 / Max: 30.45 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
AOM AV1 This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 14 28 42 56 70 SE +/- 0.73, N = 6 SE +/- 0.37, N = 3 SE +/- 0.79, N = 3 SE +/- 0.25, N = 3 SE +/- 0.59, N = 9 SE +/- 0.99, N = 15 SE +/- 0.23, N = 3 SE +/- 0.65, N = 3 60.88 62.10 62.08 63.76 62.52 60.07 62.28 62.33 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 12 24 36 48 60 Min: 57.56 / Avg: 60.88 / Max: 62.73 Min: 61.67 / Avg: 62.1 / Max: 62.85 Min: 60.52 / Avg: 62.08 / Max: 63.05 Min: 63.31 / Avg: 63.76 / Max: 64.19 Min: 58.96 / Avg: 62.52 / Max: 64.75 Min: 49.39 / Avg: 60.07 / Max: 62.97 Min: 61.86 / Avg: 62.28 / Max: 62.65 Min: 61.67 / Avg: 62.33 / Max: 63.62 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 11.48 11.52 11.68 12.12 11.88 11.69 11.59 11.58 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 4 8 12 16 20 Min: 11.46 / Avg: 11.48 / Max: 11.51 Min: 11.5 / Avg: 11.52 / Max: 11.54 Min: 11.64 / Avg: 11.68 / Max: 11.73 Min: 12.06 / Avg: 12.12 / Max: 12.17 Min: 11.82 / Avg: 11.88 / Max: 11.94 Min: 11.66 / Avg: 11.69 / Max: 11.72 Min: 11.55 / Avg: 11.59 / Max: 11.65 Min: 11.53 / Avg: 11.58 / Max: 11.66 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Darmstadt Automotive Parallel Heterogeneous Suite DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 200 400 600 800 1000 SE +/- 13.06, N = 3 SE +/- 3.99, N = 3 SE +/- 2.71, N = 3 SE +/- 3.18, N = 3 SE +/- 2.69, N = 3 SE +/- 3.82, N = 3 SE +/- 0.72, N = 3 SE +/- 2.43, N = 3 921.71 966.99 970.46 969.91 971.45 931.03 964.72 971.31 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Result Confidence
OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 200 400 600 800 1000 Min: 895.62 / Avg: 921.71 / Max: 935.87 Min: 961.55 / Avg: 966.99 / Max: 974.77 Min: 967.42 / Avg: 970.46 / Max: 975.87 Min: 965.72 / Avg: 969.91 / Max: 976.15 Min: 967.41 / Avg: 971.45 / Max: 976.55 Min: 925.19 / Avg: 931.03 / Max: 938.22 Min: 963.37 / Avg: 964.72 / Max: 965.8 Min: 967.18 / Avg: 971.31 / Max: 975.58 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 11 22 33 44 55 SE +/- 0.12, N = 3 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 45.09 44.89 45.68 47.25 46.30 45.75 44.97 44.91 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 10 20 30 40 50 Min: 44.86 / Avg: 45.09 / Max: 45.22 Min: 44.67 / Avg: 44.89 / Max: 45.09 Min: 45.62 / Avg: 45.68 / Max: 45.78 Min: 47.2 / Avg: 47.25 / Max: 47.28 Min: 46.18 / Avg: 46.3 / Max: 46.38 Min: 45.66 / Avg: 45.75 / Max: 45.9 Min: 44.86 / Avg: 44.97 / Max: 45.05 Min: 44.7 / Avg: 44.91 / Max: 45.06 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 11.68 11.73 11.84 12.27 12.03 11.86 11.72 11.74 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 4 8 12 16 20 Min: 11.67 / Avg: 11.68 / Max: 11.69 Min: 11.71 / Avg: 11.73 / Max: 11.75 Min: 11.79 / Avg: 11.84 / Max: 11.89 Min: 12.26 / Avg: 12.27 / Max: 12.29 Min: 12.02 / Avg: 12.03 / Max: 12.04 Min: 11.85 / Avg: 11.86 / Max: 11.89 Min: 11.7 / Avg: 11.72 / Max: 11.74 Min: 11.73 / Avg: 11.74 / Max: 11.74 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 11 22 33 44 55 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 46.32 46.31 47.04 48.54 47.32 46.74 46.38 46.42 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -march=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
Result Confidence
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 10 20 30 40 50 Min: 46.21 / Avg: 46.32 / Max: 46.48 Min: 46.09 / Avg: 46.31 / Max: 46.43 Min: 46.97 / Avg: 47.04 / Max: 47.07 Min: 48.46 / Avg: 48.54 / Max: 48.66 Min: 47.22 / Avg: 47.32 / Max: 47.42 Min: 46.62 / Avg: 46.74 / Max: 46.86 Min: 46.32 / Avg: 46.38 / Max: 46.42 Min: 46.24 / Avg: 46.42 / Max: 46.54 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 8 16 24 32 40 SE +/- 2.22, N = 15 SE +/- 1.92, N = 15 SE +/- 0.23, N = 3 SE +/- 0.13, N = 3 SE +/- 0.08, N = 3 SE +/- 1.25, N = 15 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 35.04 30.71 27.36 24.51 25.85 29.99 25.02 25.07 -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -march=znver3 -flto -march=x86-64 -march=znver2 -march=znver3 1. (CC) gcc options: -lm -O3
Result Confidence
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O2 -march=x86-64 -O2 -march=znver3 -O2 -march=znver3 -flto -Ofast -march=znver3 -flto -O3 -maech=znver3 -flto -O3 -march=x86-64 -O3 -march=znver2 -O3 -march=znver3 7 14 21 28 35 Min: 30.18 / Avg: 35.04 / Max: 51.67 Min: 26.48 / Avg: 30.71 / Max: 45.15 Min: 26.91 / Avg: 27.36 / Max: 27.61 Min: 24.24 / Avg: 24.5 / Max: 24.66 Min: 25.76 / Avg: 25.85 / Max: 26 Min: 27.86 / Avg: 29.99 / Max: 47.43 Min: 24.95 / Avg: 25.02 / Max: 25.08 Min: 24.92 / Avg: 25.06 / Max: 25.29 1. (CC) gcc options: -lm -O3
-O3 -march=znver3 Environment Notes: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"Compiler Notes: --disable-multilib --enable-checking=releaseDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Python Notes: Python 2.7.18 + Python 3.8.5Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 13 December 2020 11:18 by user phoronix.
-O3 -march=znver2 Environment Notes: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 13 December 2020 20:13 by user phoronix.
-O3 -march=x86-64 Environment Notes: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 14 December 2020 06:00 by user phoronix.
-O3 -maech=znver3 -flto Environment Notes: CXXFLAGS="-O3 -march=znver3 -flto" CFLAGS="-O3 -march=znver3 -flto"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 14 December 2020 17:26 by user phoronix.
-Ofast -march=znver3 -flto Environment Notes: CXXFLAGS="-Ofast -march=znver3 -flto" CFLAGS="-Ofast -march=znver3 -flto"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 15 December 2020 05:44 by user phoronix.
-O2 -march=znver3 -flto Environment Notes: CXXFLAGS="-O2 -march=znver3 -flto" CFLAGS="-O2 -march=znver3 -flto"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 15 December 2020 11:57 by user phoronix.
-O2 -march=znver3 Environment Notes: CXXFLAGS="-O2 -march=znver3" CFLAGS="-O2 -march=znver3"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 15 December 2020 18:24 by user phoronix.
-O2 -march=x86-64 Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz), Audio: AMD Navi 10 HDMI Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04, Kernel: 5.10.0-051000rc6daily20201205-generic (x86_64) 20201204, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0), Vulkan: 1.2.145, Compiler: GCC 11.0.0 20201213, File-System: ext4, Screen Resolution: 3840x2160
Environment Notes: CXXFLAGS="-O2 -march=x86-64" CFLAGS="-O2 -march=x86-64"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 15 December 2020 20:25 by user phoronix.