GCC AMD Znver3 Compiler Optimization Levels

Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2012160-PTS-GCCZNVER86&export=pdf&grs&sro&rro.

System Details

Processor:          AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
Motherboard:        ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS)
Chipset:            AMD Starship/Matisse
Memory:             16GB
Disk:               2000GB Corsair Force MP600
Graphics:           AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)
Audio:              AMD Navi 10 HDMI Audio
Monitor:            ASUS MG28U
Network:            Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS:                 Ubuntu 20.04
Kernel:             5.10.0-051000rc6daily20201205-generic (x86_64) 20201204
Desktop:            GNOME Shell 3.36.4
Display Server:     X Server 1.20.8
Display Driver:     modesetting 1.20.8
OpenGL:             4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0)
Vulkan:             1.2.145
Compiler:           GCC 11.0.0 20201213
File-System:        ext4
Screen Resolution:  3840x2160

Environment Details

- -O2 -march=x86-64: CXXFLAGS="-O2 -march=x86-64" CFLAGS="-O2 -march=x86-64"
- -O3 -march=x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"
- -O3 -march=znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"
- -O2 -march=znver3: CXXFLAGS="-O2 -march=znver3" CFLAGS="-O2 -march=znver3"
- -O3 -march=znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"
- -O2 -march=znver3 -flto: CXXFLAGS="-O2 -march=znver3 -flto" CFLAGS="-O2 -march=znver3 -flto"
- -O3 -march=znver3 -flto: CXXFLAGS="-O3 -march=znver3 -flto" CFLAGS="-O3 -march=znver3 -flto"
- -Ofast -march=znver3 -flto: CXXFLAGS="-Ofast -march=znver3 -flto" CFLAGS="-Ofast -march=znver3 -flto"

Compiler Details

- --disable-multilib --enable-checking=release

Processor Details

- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa201009

Security Details

- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Disk Details

- -O3 -march=znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096

Python Details

- -O3 -march=znver3: Python 2.7.18 + Python 3.8.5
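The Environment Details above amount to setting CFLAGS and CXXFLAGS to the same flag string for each configuration. A minimal Python sketch of that idea follows. The flag strings are copied from the list above; the names `FLAG_SETS` and `build_env` are hypothetical helpers for illustration and are not part of the Phoronix Test Suite.

```python
import os

# Flag strings copied from the Environment Details list.
# FLAG_SETS and build_env are hypothetical names used only for this sketch.
FLAG_SETS = [
    "-O2 -march=x86-64",
    "-O3 -march=x86-64",
    "-O3 -march=znver2",
    "-O2 -march=znver3",
    "-O3 -march=znver3",
    "-O2 -march=znver3 -flto",
    "-O3 -march=znver3 -flto",
    "-Ofast -march=znver3 -flto",
]


def build_env(flags, base=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set to flags."""
    env = dict(os.environ if base is None else base)
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    # Print the variable assignments for each configuration.
    for flags in FLAG_SETS:
        env = build_env(flags, base={})
        print(f'CFLAGS="{env["CFLAGS"]}" CXXFLAGS="{env["CXXFLAGS"]}"')
```

A dictionary built this way could be passed as the `env` argument of `subprocess.run` when launching a build, so each configuration gets its own flags without changing the parent shell.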

Result Overview

Columns:
[1] -O2 -march=x86-64
[2] -O3 -march=x86-64
[3] -O3 -march=znver2
[4] -O2 -march=znver3
[5] -O3 -march=znver3
[6] -O2 -march=znver3 -flto
[7] -O3 -march=znver3 -flto
[8] -Ofast -march=znver3 -flto

Test                                                 [1]            [2]            [3]            [4]            [5]            [6]            [7]            [8]
build-imagemagick: Time To Compile                   16.398         19.000         19.370         15.804         18.859         50.407         63.476         66.843
c-ray: Total Time - 4K, 16 Rays Per Pixel            60.896         31.364         25.251         46.720         25.254         46.419         25.480         27.342
mrbayes: Primate Phylogeny Analysis                  79.341         77.074         75.697         79.543         76.317         79.513         73.919         45.274
graphics-magick: Sharpen                             224            225            377            359            361            377            370            328
scimark2: Composite                                  2883.78        3660.74        4407.32        2625.59        3958.17        2924.23        3855.30        4101.89
libraw: Post-Processing Benchmark                    51.45          63.90          75.26          50.95          73.83          49.32          73.73          75.23
graphics-magick: Resizing                            1605           1721           1907           2150           2135           1854           1906           1901
fftw: Stock - 1D FFT Size 4096                       12892          12436          14096          13771          13702          15232          15584          12149.0
graphics-magick: Enhanced                            419            382            444            430            432            476            445            449
fftw: Stock - 2D FFT Size 4096                       8352.8         8169.2         8927.1         8823.7         9314.1         9718.2         10125          8755.9
mt-dgemm: Sustained Floating-Point Rate              7.526567       6.484557       7.512189       7.391742       7.307093       7.345638       7.696089       7.758952
graphics-magick: Noise-Gaussian                      429            426            434            452            441            445            430            504
kvazaar: Bosphorus 1080p - Ultra Fast                194.52         198.84         199.35         195.13         200.12         201.85         203.89         220.99
kvazaar: Bosphorus 1080p - Very Fast                 108.72         111.77         111.54         109.42         111.50         113.41         116.07         121.74
hmmer: Pfam Database Search                          85.788         85.596         84.791         85.126         85.020         84.929         84.119         76.943
smallpt: Global Illumination Renderer; 128 Samples   4.829          5.068          4.677          4.567          4.673          4.607          4.656          4.602
kvazaar: Bosphorus 4K - Ultra Fast                   53.13          54.43          54.75          53.83          54.73          54.91          56.12          58.69
coremark: CoreMark Size 666 - Iterations Per Second  700520.673201  717480.237613  721620.067510  669924.921861  716026.620271  681281.706242  736850.433639  728279.792211
kvazaar: Bosphorus 4K - Very Fast                    29.55          30.48          30.34          29.87          30.31          30.76          31.44          32.40
aom-av1: Speed 8 Realtime                            60.88          60.07          62.28          62.10          62.33          62.08          62.52          63.76
kvazaar: Bosphorus 4K - Slow                         11.48          11.69          11.59          11.52          11.58          11.68          11.88          12.12
daphne: OpenMP - NDT Mapping                         921.71         931.03         964.72         966.99         971.31         970.46         971.45         969.91
kvazaar: Bosphorus 1080p - Slow                      45.09          45.75          44.97          44.89          44.91          45.68          46.30          47.25
kvazaar: Bosphorus 4K - Medium                       11.68          11.86          11.72          11.73          11.74          11.84          12.03          12.27
kvazaar: Bosphorus 1080p - Medium                    46.32          46.74          46.38          46.31          46.42          47.04          47.32          48.54
aobench: 2048 x 2048 - Total Time                    35.035         29.991         25.016         30.711         25.065         27.363         25.848         24.505

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0, Time To Compile (Seconds, Fewer Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  66.84       0.30      3
-O3 -march=znver3           18.86       0.07      3
-O3 -march=znver2           19.37       0.18      15
-O3 -march=x86-64           19.00       0.03      3
-O3 -march=znver3 -flto     63.48       0.32      3
-O2 -march=znver3 -flto     50.41       0.34      3
-O2 -march=znver3           15.80       0.02      3
-O2 -march=x86-64           16.40       0.22      15
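Each result in this export is reported with a standard error, such as "SE +/- 0.30, N = 3". The sketch below shows the usual definition, the sample standard deviation divided by the square root of the number of runs. Whether the Phoronix Test Suite uses exactly this variant is an assumption, and the run times in `samples` are hypothetical values chosen for illustration, not data from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical compile times in seconds for three runs (not from the export).
    samples = [66.3, 66.9, 67.3]
    mean = statistics.mean(samples)
    print(f"mean = {mean:.2f}, SE +/- {standard_error(samples):.2f}, N = {len(samples)}")
```

Under this definition, a larger N narrows the standard error for the same run-to-run spread.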

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  27.34       0.05      3
-O3 -march=znver3           25.25       0.11      3
-O3 -march=znver2           25.25       0.09      3
-O3 -march=x86-64           31.36       0.07      3
-O3 -march=znver3 -flto     25.48       0.06      3
-O2 -march=znver3 -flto     46.42       0.11      3
-O2 -march=znver3           46.72       0.11      3
-O2 -march=x86-64           60.90       0.16      3

1. (CC) gcc options: -lm -lpthread -O3
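The results mix "Fewer Is Better" and "More Is Better" metrics, so comparing configurations needs a ratio that respects the direction. The following is a minimal sketch, not part of the export: the `speedup` helper is a hypothetical name, and the inputs are the C-Ray times (60.90 and 25.25 seconds) and GraphicsMagick Sharpen rates (224 and 377 iterations per minute) reported in this file.

```python
def speedup(baseline, value, lower_is_better):
    """Return how many times faster `value` is than `baseline`.

    For time-like metrics (lower is better) the ratio is baseline / value;
    for rate-like metrics (higher is better) it is value / baseline.
    """
    if baseline <= 0 or value <= 0:
        raise ValueError("results must be positive")
    return baseline / value if lower_is_better else value / baseline


if __name__ == "__main__":
    # C-Ray, seconds: -O2 -march=x86-64 versus -O3 -march=znver3.
    print(f"C-Ray: {speedup(60.90, 25.25, lower_is_better=True):.2f}x")
    # GraphicsMagick Sharpen, iterations per minute: -O2 -march=x86-64 versus -O3 -march=znver2.
    print(f"Sharpen: {speedup(224, 377, lower_is_better=False):.2f}x")
```

A value above 1 means the second configuration is faster than the baseline, regardless of the metric's direction.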

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  45.27       0.29      3
-O3 -march=znver3           76.32       0.72      9
-O3 -march=znver2           75.70       0.64      15
-O3 -march=x86-64           77.07       0.07      3
-O3 -march=znver3 -flto     73.92       0.06      3
-O2 -march=znver3 -flto     79.51       0.19      3
-O2 -march=znver3           79.54       0.10      3
-O2 -march=x86-64           79.34       0.78      9

1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33, Operation: Sharpen (Iterations Per Minute, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  328         0.88      3
-O3 -march=znver3           361         2.33      3
-O3 -march=znver2           377         1.20      3
-O3 -march=x86-64           225         0.67      3
-O3 -march=znver3 -flto     370         0.67      3
-O2 -march=znver3 -flto     377         1.20      3
-O2 -march=znver3           359         4.33      3
-O2 -march=x86-64           224         0.58      3

1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  4101.89     33.94     3
-O3 -march=znver3           3958.17     35.50     15
-O3 -march=znver2           4407.32     11.01     3
-O3 -march=x86-64           3660.74     12.53     3
-O3 -march=znver3 -flto     3855.30     130.68    12
-O2 -march=znver3 -flto     2924.23     25.20     3
-O2 -march=znver3           2625.59     34.75     3
-O2 -march=x86-64           2883.78     9.37      3

1. (CC) gcc options: -lm

LibRaw

Post-Processing Benchmark

LibRaw 0.20, Post-Processing Benchmark (Mpix/sec, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  75.23       0.56      3
-O3 -march=znver3           73.83       1.48      15
-O3 -march=znver2           75.26       0.59      3
-O3 -march=x86-64           63.90       0.24      3
-O3 -march=znver3 -flto     73.73       0.89      3
-O2 -march=znver3 -flto     49.32       0.36      3
-O2 -march=znver3           50.95       0.46      3
-O2 -march=x86-64           51.45       0.41      3

1. (CXX) g++ options: -fopenmp -ljpeg -lz -lm

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33, Operation: Resizing (Iterations Per Minute, More Is Better)

Configuration               Result
-Ofast -march=znver3 -flto  1901
-O3 -march=znver3           2135
-O3 -march=znver2           1907
-O3 -march=x86-64           1721
-O3 -march=znver3 -flto     1906
-O2 -march=znver3 -flto     1854
-O2 -march=znver3           2150
-O2 -march=x86-64           1605

SE +/- values as exported (N = 3 each; only seven are listed for the eight runs, so they cannot be matched to configurations): 1.15, 8.37, 1.20, 14.34, 1.76, 4.91, 1.53

1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 4096 (Mflops, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  12149.0     564.61    15
-O3 -march=znver3           13702.0     54.77     3
-O3 -march=znver2           14096.0     75.72     3
-O3 -march=x86-64           12436.0     80.35     3
-O3 -march=znver3 -flto     15584.0     105.64    3
-O2 -march=znver3 -flto     15232.0     16.50     3
-O2 -march=znver3           13771.0     161.07    3
-O2 -march=x86-64           12892.0     135.67    3

1. (CC) gcc options: -pthread -lm

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33, Operation: Enhanced (Iterations Per Minute, More Is Better)

Configuration               Result
-Ofast -march=znver3 -flto  449
-O3 -march=znver3           432
-O3 -march=znver2           444
-O3 -march=x86-64           382
-O3 -march=znver3 -flto     445
-O2 -march=znver3 -flto     476
-O2 -march=znver3           430
-O2 -march=x86-64           419

SE +/- values as exported (N = 3 each; only six are listed for the eight runs, so they cannot be matched to configurations): 0.88, 5.29, 0.33, 0.67, 0.33, 1.53

1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  8755.9      86.92     3
-O3 -march=znver3           9314.1      18.57     3
-O3 -march=znver2           8927.1      15.97     3
-O3 -march=x86-64           8169.2      312.57    12
-O3 -march=znver3 -flto     10125.0     25.21     3
-O2 -march=znver3 -flto     9718.2      96.35     3
-O2 -march=znver3           8823.7      39.49     3
-O2 -march=x86-64           8352.8      88.61     3

1. (CC) gcc options: -pthread -lm

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  7.758952    0.087025  3
-O3 -march=znver3           7.307093    0.086169  3
-O3 -march=znver2           7.512189    0.069362  3
-O3 -march=x86-64           6.484557    0.050043  15
-O3 -march=znver3 -flto     7.696089    0.089342  6
-O2 -march=znver3 -flto     7.345638    0.071987  3
-O2 -march=znver3           7.391742    0.055721  3
-O2 -march=x86-64           7.526567    0.110286  3

1. (CC) gcc options: -O3 -march=native -fopenmp

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)

Configuration               Result
-Ofast -march=znver3 -flto  504
-O3 -march=znver3           441
-O3 -march=znver2           434
-O3 -march=x86-64           426
-O3 -march=znver3 -flto     430
-O2 -march=znver3 -flto     445
-O2 -march=znver3           452
-O2 -march=x86-64           429

SE +/- values as exported (only seven are listed for the eight runs, so they cannot be matched to configurations): 1.53 (N = 3), 0.58 (N = 3), 0.33 (N = 3), 4.91 (N = 12), 1.53 (N = 3), 1.45 (N = 3), 3.51 (N = 3)

1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  220.99      0.20      3
-O3 -march=znver3           200.12      0.25      3
-O3 -march=znver2           199.35      0.35      3
-O3 -march=x86-64           198.84      0.89      3
-O3 -march=znver3 -flto     203.89      0.63      3
-O2 -march=znver3 -flto     201.85      0.55      3
-O2 -march=znver3           195.13      0.09      3
-O2 -march=x86-64           194.52      0.75      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  121.74      0.24      3
-O3 -march=znver3           111.50      0.17      3
-O3 -march=znver2           111.54      0.15      3
-O3 -march=x86-64           111.77      0.30      3
-O3 -march=znver3 -flto     116.07      0.16      3
-O2 -march=znver3 -flto     113.41      0.14      3
-O2 -march=znver3           109.42      0.20      3
-O2 -march=x86-64           108.72      0.51      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.1, Pfam Database Search (Seconds, Fewer Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  76.94       0.07      3
-O3 -march=znver3           85.02       0.18      3
-O3 -march=znver2           84.79       0.11      3
-O3 -march=x86-64           85.60       0.15      3
-O3 -march=znver3 -flto     84.12       0.09      3
-O2 -march=znver3 -flto     84.93       0.09      3
-O2 -march=znver3           85.13       0.13      3
-O2 -march=x86-64           85.79       0.15      3

1. (CC) gcc options: -pthread -lhmmer -leasel -lm

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0, Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  4.602       0.005     3
-O3 -march=znver3           4.673       0.025     3
-O3 -march=znver2           4.677       0.016     3
-O3 -march=x86-64           5.068       0.018     3
-O3 -march=znver3 -flto     4.656       0.012     3
-O2 -march=znver3 -flto     4.607       0.009     3
-O2 -march=znver3           4.567       0.003     3
-O2 -march=x86-64           4.829       0.004     3

1. (CXX) g++ options: -fopenmp -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  58.69       0.10      3
-O3 -march=znver3           54.73       0.05      3
-O3 -march=znver2           54.75       0.04      3
-O3 -march=x86-64           54.43       0.11      3
-O3 -march=znver3 -flto     56.12       0.13      3
-O2 -march=znver3 -flto     54.91       0.31      3
-O2 -march=znver3           53.83       0.03      3
-O2 -march=x86-64           53.13       0.06      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  728279.79   1316.80   3
-O3 -march=znver3           716026.62   449.85    3
-O3 -march=znver2           721620.07   1896.73   3
-O3 -march=x86-64           717480.24   371.91    3
-O3 -march=znver3 -flto     736850.43   1257.49   3
-O2 -march=znver3 -flto     681281.71   5589.70   3
-O2 -march=znver3           669924.92   4015.82   3
-O2 -march=x86-64           700520.67   2080.13   3

1. (CC) gcc options: -O2 -lrt" -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  32.40       0.05      3
-O3 -march=znver3           30.31       0.08      3
-O3 -march=znver2           30.34       0.03      3
-O3 -march=x86-64           30.48       0.02      3
-O3 -march=znver3 -flto     31.44       0.02      3
-O2 -march=znver3 -flto     30.76       0.05      3
-O2 -march=znver3           29.87       0.04      3
-O2 -march=x86-64           29.55       0.09      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

AOM AV1

Encoder Mode: Speed 8 Realtime

AOM AV1 2.0, Encoder Mode: Speed 8 Realtime (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  63.76       0.25      3
-O3 -march=znver3           62.33       0.65      3
-O3 -march=znver2           62.28       0.23      3
-O3 -march=x86-64           60.07       0.99      15
-O3 -march=znver3 -flto     62.52       0.59      9
-O2 -march=znver3 -flto     62.08       0.79      3
-O2 -march=znver3           62.10       0.37      3
-O2 -march=x86-64           60.88       0.73      6

1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  12.12       0.03      3
-O3 -march=znver3           11.58       0.04      3
-O3 -march=znver2           11.59       0.03      3
-O3 -march=x86-64           11.69       0.02      3
-O3 -march=znver3 -flto     11.88       0.03      3
-O2 -march=znver3 -flto     11.68       0.03      3
-O2 -march=znver3           11.52       0.01      3
-O2 -march=x86-64           11.48       0.02      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

Darmstadt Automotive Parallel Heterogeneous Suite, Backend: OpenMP - Kernel: NDT Mapping (Test Cases Per Minute, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  969.91      3.18      3
-O3 -march=znver3           971.31      2.43      3
-O3 -march=znver2           964.72      0.72      3
-O3 -march=x86-64           931.03      3.82      3
-O3 -march=znver3 -flto     971.45      2.69      3
-O2 -march=znver3 -flto     970.46      2.71      3
-O2 -march=znver3           966.99      3.99      3
-O2 -march=x86-64           921.71      13.06     3

1. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  47.25       0.02      3
-O3 -march=znver3           44.91       0.11      3
-O3 -march=znver2           44.97       0.06      3
-O3 -march=x86-64           45.75       0.08      3
-O3 -march=znver3 -flto     46.30       0.06      3
-O2 -march=znver3 -flto     45.68       0.05      3
-O2 -march=znver3           44.89       0.12      3
-O2 -march=x86-64           45.09       0.12      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  12.27       0.01      3
-O3 -march=znver3           11.74       0.00      3
-O3 -march=znver2           11.72       0.01      3
-O3 -march=x86-64           11.86       0.01      3
-O3 -march=znver3 -flto     12.03       0.01      3
-O2 -march=znver3 -flto     11.84       0.03      3
-O2 -march=znver3           11.73       0.01      3
-O2 -march=x86-64           11.68       0.01      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  48.54       0.06      3
-O3 -march=znver3           46.42       0.09      3
-O3 -march=znver2           46.38       0.03      3
-O3 -march=x86-64           46.74       0.07      3
-O3 -march=znver3 -flto     47.32       0.06      3
-O2 -march=znver3 -flto     47.04       0.03      3
-O2 -march=znver3           46.31       0.11      3
-O2 -march=x86-64           46.32       0.08      3

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

AOBench

Size: 2048 x 2048 - Total Time

AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)

Configuration               Result      SE +/-    N
-Ofast -march=znver3 -flto  24.51       0.13      3
-O3 -march=znver3           25.07       0.12      3
-O3 -march=znver2           25.02       0.04      3
-O3 -march=x86-64           29.99       1.25      15
-O3 -march=znver3 -flto     25.85       0.08      3
-O2 -march=znver3 -flto     27.36       0.23      3
-O2 -march=znver3           30.71       1.92      15
-O2 -march=x86-64           35.04       2.22      15

1. (CC) gcc options: -lm -O3


Phoronix Test Suite v10.8.5