11900K Compiler

Intel Core i9-11900K testing with a ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2105179-IB-11900KCOM00&sor&grr.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionGCC 11.1 -O3 -march=native -O3 -march=native -flto -O2Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)ASUS ROG MAXIMUS XIII HERO (0707 BIOS)Intel Tiger Lake-H32GB500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0AMD Radeon VII 16GB (1801/1000MHz)Intel Tiger Lake-H HD AudioASUS MG28U2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Fedora 345.11.20-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.0.3 (LLVM 12.0.0)GCC 11.1.1 20210428btrfs3840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- GCC 11.1: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.1: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 11.1: -O2: CXXFLAGS=-O2 CFLAGS=-O2Compiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.1Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

qe: AUSURF112gmpbench: Total Timencnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetx265: Bosphorus 4Khmmer: Pfam Database Searchastcenc: Exhaustivesysbench: CPUmrbayes: Primate Phylogeny Analysisonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUdav1d: Chimera 1080p 10-bitcryptopp: Unkeyed Algorithmsc-ray: Total Time - 4K, 16 Rays Per Pixelpjsip: INVITEpjsip: OPTIONS, Statefulgraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Rotategraphics-magick: Resizinghimeno: Poisson Pressure Solvercompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedsqlite-speedtest: Timed Time - Size 1,000compress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedstockfish: Total Timeespeak: Text-To-Speech Synthesiswebp: Quality 100, Lossless, Highest Compressiononednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUtjbench: Decompression Throughputcoremark: CoreMark Size 666 - Iterations Per Secondaobench: 2048 x 2048 - Total Timeonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUliquid-dsp: 16 - 256 - 57liquid-dsp: 8 - 256 - 57dav1d: Summer Nature 4Kencode-wavpack: WAV To WavPacktnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1astcenc: Thoroughdav1d: Chimera 1080ponednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUlammps: Rhodopsin Proteinredis: SETwebp: Quality 100, Losslessredis: GETonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUsvt-vp9: VMAF Optimized - Bosphorus 1080ppjsip: OPTIONS, Statelessencode-flac: WAV To FLACencode-opus: WAV To Opus Encodeonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUsmallpt: Global Illumination Renderer; 128 Samplessvt-hevc: 7 - Bosphorus 1080ponednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUencode-mp3: WAV To MP3webp: Quality 100, Highest Compressionastcenc: Mediumdav1d: Summer Nature 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ponednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUsvt-hevc: 10 - Bosphorus 1080pGCC 11.1 -O3 -march=native -O3 -march=native -flto -O22576.976172.98.6215.5320.2318.239.6311.0854.5010.201.194.382.303.242.553.2411.8315.81100.73785.415734776.0886.6963173.473171.463172.191890.591887.611891.71223.02489.75955147.34549599389195270114111986878.5076864582.333.04514.835.444.0855546.0285.32993244121.70527.26416.5026273.100046432583.96435221.5444.865220.829637722893333686530000190.3111.084230.019227.66311.3846763.054.066178.575480.7224308.0672980192.0012.9014036791.923.527911.322713.53708195.872414395.9315.58711.23955.287263.170268.405139.1314.288912.514216.18285.4795.1275.1820717.31164.77201.7017.05764.307981.45788278.722540.196171.68.9115.9223.4518.439.7011.3954.1310.271.694.322.275.602.523.2513.3415.4099.97185.420734751.0184.9293152.893154.693148.671874.701876.251877.51488.6278047.61350589395195269107212297079.8838704579.832.84503.134.843.7775477.9281.12908639422.60427.07417.4188272.600758435901.44395921.5774.746110.83169972239333368435666711.099247.889242.55011.39524.044818.572480.7204828.3282990164.9212.7064060369.083.523811.323113.53500195.072398925.9365.57511.22695.400803.139418.454141.8314.252312.523216.18645.3765.1035.1705166.05201.1017.12894.251761.47524278.592538.259.6116.1521.0722.079.6311.3054.8011.111.195.233.113.483.184.2015.1515.64103.29191.379934799.7087.2973124.563123.953123.641845.741842.141841.63148.40491.644932106.52250019381164219106610916305.4818504777.332.74718.134.543.6165760.9296.02909481921.32527.84116.6931261.034785430127.49818924.4584.876010.829564711343333635506667186.7511.077243.416236.05012.0949773.934.044778.576230.7178828.0232936296.0813.7634051463.173.533151.321003.54019191.832397926.0866.46710.74565.011993.135328.771136.3114.166112.365916.17427.3045.3605.2481727.60160.65198.0117.05684.270771.46726273.60OpenBenchmarking.org

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112-O2-O3 -march=native -flto-O3 -march=native6001200180024003000SE +/- 18.09, N = 3SE +/- 24.60, N = 3SE +/- 21.65, N = 32538.252540.192576.971. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time-O3 -march=native-O3 -march=native -flto130026003900520065006172.96171.6-flto1. (CC) gcc options: -O3 -march=native -lm

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m-O3 -march=native-O3 -march=native -flto-O23691215SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 128.628.919.61-O3 -march=native - MIN: 8.51 / MAX: 12.11-O3 -march=native -flto - MIN: 8.72 / MAX: 12.48MIN: 9.44 / MAX: 13.781. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd-O3 -march=native-O3 -march=native -flto-O248121620SE +/- 0.12, N = 3SE +/- 0.28, N = 3SE +/- 0.01, N = 1515.5315.9216.15-O3 -march=native - MIN: 15.19 / MAX: 20.95-O3 -march=native -flto - MIN: 15.55 / MAX: 21.06MIN: 15.95 / MAX: 21.541. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny-O3 -march=native-O2-O3 -march=native -flto612182430SE +/- 0.04, N = 3SE +/- 0.09, N = 15SE +/- 0.08, N = 320.2321.0723.45-O3 -march=native - MIN: 20.02 / MAX: 23.8MIN: 20.27 / MAX: 26.62-O3 -march=native -flto - MIN: 23.14 / MAX: 26.981. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50-O3 -march=native-O3 -march=native -flto-O2510152025SE +/- 0.16, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 1518.2318.4322.07-O3 -march=native - MIN: 17.76 / MAX: 22.08-O3 -march=native -flto - MIN: 18.19 / MAX: 22.12MIN: 21.33 / MAX: 27.941. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet-O3 -march=native-O2-O3 -march=native -flto3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 15SE +/- 0.02, N = 39.639.639.70-O3 -march=native - MIN: 9.56 / MAX: 13.14MIN: 9.47 / MAX: 14.51-O3 -march=native -flto - MIN: 9.56 / MAX: 13.191. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18-O3 -march=native-O2-O3 -march=native -flto3691215SE +/- 0.17, N = 3SE +/- 0.06, N = 14SE +/- 0.02, N = 311.0811.3011.39-O3 -march=native - MIN: 10.66 / MAX: 16.66MIN: 10.84 / MAX: 14.99-O3 -march=native -flto - MIN: 11.27 / MAX: 15.151. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16-O3 -march=native -flto-O3 -march=native-O21224364860SE +/- 0.13, N = 3SE +/- 0.14, N = 3SE +/- 0.05, N = 1554.1354.5054.80-O3 -march=native -flto - MIN: 53.54 / MAX: 59.11-O3 -march=native - MIN: 53.96 / MAX: 58.57MIN: 54.15 / MAX: 641. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet-O3 -march=native-O3 -march=native -flto-O23691215SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 1510.2010.2711.11-O3 -march=native - MIN: 9.72 / MAX: 13.93-O3 -march=native -flto - MIN: 9.93 / MAX: 13.87MIN: 10.75 / MAX: 16.771. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface-O3 -march=native-O2-O3 -march=native -flto0.38030.76061.14091.52121.9015SE +/- 0.06, N = 3SE +/- 0.01, N = 15SE +/- 0.01, N = 31.191.191.69-O3 -march=native - MIN: 1.09 / MAX: 2.02MIN: 1.14 / MAX: 5.67-O3 -march=native -flto - MIN: 1.64 / MAX: 2.461. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0-O3 -march=native -flto-O3 -march=native-O21.17682.35363.53044.70725.884SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 154.324.385.23-O3 -march=native -flto - MIN: 4.25 / MAX: 8.71-O3 -march=native - MIN: 4.18 / MAX: 7.9MIN: 5.12 / MAX: 8.961. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet-O3 -march=native -flto-O3 -march=native-O20.69981.39962.09942.79923.499SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 142.272.303.11-O3 -march=native -flto - MIN: 2.21 / MAX: 5.8-O3 -march=native - MIN: 2.18 / MAX: 3.19MIN: 3.05 / MAX: 9.871. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2-O3 -march=native-O2-O3 -march=native -flto1.262.523.785.046.3SE +/- 0.02, N = 3SE +/- 0.00, N = 15SE +/- 0.06, N = 33.243.485.60-O3 -march=native - MIN: 3.17 / MAX: 6.75MIN: 3.4 / MAX: 7.05-O3 -march=native -flto - MIN: 5.41 / MAX: 9.211. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3-O3 -march=native -flto-O3 -march=native-O20.71551.4312.14652.8623.5775SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 152.522.553.18-O3 -march=native -flto - MIN: 2.47 / MAX: 6.06-O3 -march=native - MIN: 2.43 / MAX: 6.16MIN: 3.1 / MAX: 6.781. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2-O3 -march=native-O3 -march=native -flto-O20.9451.892.8353.784.725SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 153.243.254.20-O3 -march=native - MIN: 3.1 / MAX: 6.67-O3 -march=native -flto - MIN: 3.14 / MAX: 6.68MIN: 4.04 / MAX: 7.771. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet-O3 -march=native-O3 -march=native -flto-O248121620SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 1511.8313.3415.15-O3 -march=native - MIN: 11.62 / MAX: 15.38-O3 -march=native -flto - MIN: 13.01 / MAX: 16.82MIN: 14.73 / MAX: 342.421. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K-O3 -march=native-O2-O3 -march=native -flto48121620SE +/- 0.13, N = 15SE +/- 0.21, N = 3SE +/- 0.15, N = 615.8115.6415.40-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database Search-O3 -march=native -flto-O3 -march=native-O220406080100SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 399.97100.74103.29-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive-O3 -march=native-O3 -march=native -flto-O220406080100SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 385.4285.4291.38-O3 -march=native-O3 -march=native1. (CXX) g++ options: -O2 -flto -pthread

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU-O2-O3 -march=native-O3 -march=native -flto7K14K21K28K35KSE +/- 0.65, N = 3SE +/- 0.97, N = 3SE +/- 1.11, N = 334799.7034776.0834751.01-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O3 -march=native -flto-O3 -march=native-O220406080100SE +/- 0.32, N = 3SE +/- 0.06, N = 3SE +/- 0.53, N = 384.9386.7087.30-march=native -flto-march=native-O21. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native7001400210028003500SE +/- 0.76, N = 3SE +/- 3.52, N = 3SE +/- 2.46, N = 33124.563152.893173.47MIN: 3112.25-flto - MIN: 3137.49MIN: 3161.041. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native7001400210028003500SE +/- 2.61, N = 3SE +/- 3.24, N = 3SE +/- 0.26, N = 33123.953154.693171.46MIN: 3109.77-flto - MIN: 3138.34MIN: 3160.111. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native7001400210028003500SE +/- 5.44, N = 3SE +/- 0.32, N = 3SE +/- 1.30, N = 33123.643148.673172.19MIN: 3105.42-flto - MIN: 3137.59MIN: 3159.81. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native400800120016002000SE +/- 1.34, N = 3SE +/- 1.23, N = 3SE +/- 2.13, N = 31845.741874.701890.59MIN: 1834.84-flto - MIN: 1865.22MIN: 1879.821. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native400800120016002000SE +/- 1.67, N = 3SE +/- 1.27, N = 3SE +/- 0.80, N = 31842.141876.251887.61MIN: 1831.93-flto - MIN: 1866.41MIN: 1877.871. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native400800120016002000SE +/- 0.68, N = 3SE +/- 1.29, N = 3SE +/- 1.66, N = 31841.631877.511891.71MIN: 1831.74-flto - MIN: 1866.09MIN: 1880.741. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bit-O3 -march=native-O250100150200250SE +/- 0.03, N = 3SE +/- 0.09, N = 3223.02148.40-O3 -march=native - MIN: 153.51 / MAX: 490.73-O2 -lm - MIN: 95.23 / MAX: 345.291. (CC) gcc options: -pthread

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed Algorithms-O2-O3 -march=native-O3 -march=native -flto110220330440550SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 3491.64489.76488.63-O2-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -fPIC -pthread -pipe

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=native-O3 -march=native -flto-O220406080100SE +/- 0.15, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 347.3547.61106.52-march=native-march=native -flto-O21. (CC) gcc options: -lm -lpthread -O3

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITE-O3 -march=native -flto-O2-O3 -march=native11002200330044005500SE +/- 3.18, N = 3SE +/- 32.83, N = 3SE +/- 41.25, N = 3505850014959-O3 -march=native -flto-O2-O3 -march=native1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateful-O3 -march=native -flto-O3 -march=native-O22K4K6K8K10KSE +/- 4.58, N = 3SE +/- 6.96, N = 3SE +/- 1.67, N = 3939593899381-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-O3 -march=native -flto-O3 -march=native-O24080120160200SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3195195164-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-O3 -march=native-O3 -march=native -flto-O260120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3270269219-O3 -march=native-O3 -march=native -flto-O21. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-O3 -march=native-O3 -march=native -flto-O22004006008001000SE +/- 1.53, N = 3SE +/- 0.67, N = 3114110721066-O3 -march=native-O3 -march=native -flto-O21. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O3 -march=native -flto-O3 -march=native-O230060090012001500SE +/- 1.20, N = 3SE +/- 6.89, N = 3122911981091-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=native -flto-O3 -march=native-O215003000450060007500SE +/- 3.24, N = 3SE +/- 6.62, N = 3SE +/- 0.74, N = 37079.886878.516305.48-march=native -flto-march=native-O21. (CC) gcc options: -O3 -mavx2

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speed-O2-O3 -march=native-O3 -march=native -flto10002000300040005000SE +/- 4.91, N = 3SE +/- 11.11, N = 3SE +/- 14.15, N = 34777.34582.34579.8-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speed-O3 -march=native-O3 -march=native -flto-O2816243240SE +/- 0.23, N = 3SE +/- 0.12, N = 3SE +/- 0.22, N = 333.032.832.7-O3 -march=native-O3 -march=native -flto-O21. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speed-O2-O3 -march=native-O3 -march=native -flto10002000300040005000SE +/- 5.61, N = 3SE +/- 8.15, N = 3SE +/- 17.62, N = 34718.14514.84503.1-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speed-O3 -march=native-O3 -march=native -flto-O2816243240SE +/- 0.44, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 335.434.834.5-O3 -march=native-O3 -march=native -flto-O21. (CC) gcc options: -pthread -lz

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O2-O3 -march=native -flto-O3 -march=native1020304050SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.30, N = 343.6243.7844.09-O2-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -ldl -lz -lpthread

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression Speed-O2-O3 -march=native-O3 -march=native -flto12002400360048006000SE +/- 5.74, N = 3SE +/- 15.18, N = 3SE +/- 6.81, N = 45760.95546.05477.9-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression Speed-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 1.68, N = 3SE +/- 2.26, N = 3SE +/- 3.12, N = 4296.0285.3281.1-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time-O3 -march=native-O2-O3 -march=native -flto6M12M18M24M30MSE +/- 279559.22, N = 3SE +/- 96950.30, N = 3SE +/- 94171.94, N = 3299324412909481929086394-march=native-O2-march=native1. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis-O2-O3 -march=native-O3 -march=native -flto510152025SE +/- 0.05, N = 4SE +/- 0.06, N = 4SE +/- 0.05, N = 421.3321.7122.60-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -std=c99 -lpthread -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression-O3 -march=native -flto-O3 -march=native-O2714212835SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 327.0727.2627.84-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU-O3 -march=native-O2-O3 -march=native -flto48121620SE +/- 0.01, N = 3SE +/- 0.17, N = 5SE +/- 0.00, N = 316.5016.6917.42MIN: 16.39MIN: 16.38-flto - MIN: 17.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughput-O3 -march=native-O3 -march=native -flto-O260120180240300SE +/- 0.20, N = 3SE +/- 0.41, N = 3SE +/- 0.16, N = 3273.10272.60261.03-march=native -lm-march=native -flto -lm-O21. (CC) gcc options: -O3 -rdynamic

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-O3 -march=native -flto-O3 -march=native-O290K180K270K360K450KSE +/- 166.46, N = 3SE +/- 1364.82, N = 3SE +/- 1236.61, N = 3435901.44432583.96430127.50-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -O2 -lrt" -lrt

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=native-O3 -march=native -flto-O2612182430SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 321.5421.5824.46-march=native-march=native -flto-O21. (CC) gcc options: -lm -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU-O3 -march=native -flto-O3 -march=native-O21.09712.19423.29134.38845.4855SE +/- 0.02041, N = 3SE +/- 0.02065, N = 3SE +/- 0.02350, N = 34.746114.865224.87601-flto - MIN: 3.7MIN: 3.82MIN: 3.821. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.18710.37420.56130.74840.9355SE +/- 0.003232, N = 3SE +/- 0.003135, N = 3SE +/- 0.003541, N = 30.8295640.8296370.831699MIN: 0.81MIN: 0.81-flto - MIN: 0.811. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O3 -march=native -flto-O2150M300M450M600M750MSE +/- 209549.78, N = 3SE +/- 322714.18, N = 3SE +/- 189414.30, N = 3722893333722393333711343333-march=native-march=native -flto-O21. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O3 -march=native -flto-O2150M300M450M600M750MSE +/- 2160717.47, N = 3SE +/- 2050604.25, N = 3SE +/- 766753.62, N = 3686530000684356667635506667-march=native-march=native -flto-O21. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4K-O3 -march=native-O24080120160200SE +/- 0.09, N = 3SE +/- 0.05, N = 3190.31186.75-O3 -march=native - MIN: 174.59 / MAX: 201.24-O2 -lm - MIN: 170.98 / MAX: 196.551. (CC) gcc options: -pthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 511.0811.0811.10-O2-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -rdynamic

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2-O3 -march=native-O2-O3 -march=native -flto50100150200250SE +/- 0.15, N = 3SE +/- 0.21, N = 3SE +/- 0.12, N = 3230.02243.42247.89-O3 -march=native - MIN: 229.3 / MAX: 233.4MIN: 241.9 / MAX: 246.46-O3 -march=native -flto - MIN: 247.03 / MAX: 249.921. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1-O3 -march=native-O2-O3 -march=native -flto50100150200250SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 3227.66236.05242.55-O3 -march=native - MIN: 226.71 / MAX: 229.36MIN: 234.65 / MAX: 236.77-O3 -march=native -flto - MIN: 241.93 / MAX: 243.451. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough-O3 -march=native-O3 -march=native -flto-O23691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.3811.4012.09-O3 -march=native-O3 -march=native1. (CXX) g++ options: -O2 -flto -pthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p-O2-O3 -march=native170340510680850SE +/- 1.36, N = 3SE +/- 0.33, N = 3773.93763.05-O2 -lm - MIN: 589.24 / MAX: 1160.82-O3 -march=native - MIN: 584.4 / MAX: 1127.781. (CC) gcc options: -pthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native0.91491.82982.74473.65964.5745SE +/- 0.00867, N = 3SE +/- 0.00379, N = 3SE +/- 0.00741, N = 34.044774.044814.06617MIN: 3.93-flto - MIN: 3.91MIN: 3.931. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU-O3 -march=native -flto-O3 -march=native-O2246810SE +/- 0.00390, N = 3SE +/- 0.00184, N = 3SE +/- 0.00352, N = 38.572488.575488.57623-flto - MIN: 8.44MIN: 8.41MIN: 8.421. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native0.16250.3250.48750.650.8125SE +/- 0.001308, N = 3SE +/- 0.001704, N = 3SE +/- 0.002639, N = 30.7178820.7204820.722430MIN: 0.66-flto - MIN: 0.67MIN: 0.671. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-O3 -march=native -flto-O3 -march=native-O2246810SE +/- 0.055, N = 15SE +/- 0.063, N = 15SE +/- 0.106, N = 38.3288.0678.023-O3 -march=native -flto-O3 -march=native1. (CXX) g++ options: -O2 -pthread -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET-O3 -march=native -flto-O3 -march=native-O2600K1200K1800K2400K3000KSE +/- 3890.24, N = 3SE +/- 15075.35, N = 3SE +/- 20903.58, N = 32990164.922980192.002936296.08-march=native -flto-march=native-O21. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless-O3 -march=native -flto-O3 -march=native-O248121620SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.7112.9013.76-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET-O3 -march=native -flto-O2-O3 -march=native900K1800K2700K3600K4500KSE +/- 23615.46, N = 3SE +/- 8839.00, N = 3SE +/- 16885.42, N = 34060369.084051463.174036791.92-march=native -flto-O2-march=native1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU-O3 -march=native -flto-O3 -march=native-O20.7951.592.3853.183.975SE +/- 0.00147, N = 3SE +/- 0.00143, N = 3SE +/- 0.00099, N = 33.523813.527913.53315-flto - MIN: 3.46MIN: 3.45MIN: 3.471. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.29770.59540.89311.19081.4885SE +/- 0.00212, N = 3SE +/- 0.00166, N = 3SE +/- 0.00175, N = 31.321001.322711.32311MIN: 1.25MIN: 1.26-flto - MIN: 1.261. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU-O3 -march=native -flto-O3 -march=native-O20.79651.5932.38953.1863.9825SE +/- 0.00020, N = 3SE +/- 0.00232, N = 3SE +/- 0.00185, N = 33.535003.537083.54019-flto - MIN: 3.44MIN: 3.41MIN: 3.461. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O24080120160200SE +/- 1.48, N = 10SE +/- 1.49, N = 10SE +/- 1.51, N = 10195.87195.07191.83-march=native-march=native -flto1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateless-O3 -march=native-O3 -march=native -flto-O250K100K150K200K250KSE +/- 1015.58, N = 3SE +/- 101.47, N = 3SE +/- 504.43, N = 3241439239892239792-O3 -march=native-O3 -march=native -flto-O21. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=native-O3 -march=native -flto-O2246810SE +/- 0.004, N = 5SE +/- 0.003, N = 5SE +/- 0.003, N = 55.9315.9366.086-O3 -march=native-O3 -march=native -flto-O21. (CXX) g++ options: -fvisibility=hidden -logg -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-O3 -march=native -flto-O3 -march=native-O2246810SE +/- 0.033, N = 5SE +/- 0.007, N = 5SE +/- 0.030, N = 55.5755.5876.467-O3 -march=native -flto-O3 -march=native-O21. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 310.7511.2311.24MIN: 10.65-flto - MIN: 11.14MIN: 11.151. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto1.21522.43043.64564.86086.076SE +/- 0.03422, N = 3SE +/- 0.02936, N = 3SE +/- 0.01659, N = 35.011995.287265.40080MIN: 4.47MIN: 4.8-flto - MIN: 4.781. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native0.71331.42662.13992.85323.5665SE +/- 0.00129, N = 3SE +/- 0.00623, N = 3SE +/- 0.00399, N = 33.135323.139413.17026MIN: 3.07-flto - MIN: 3.07MIN: 3.11. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-O3 -march=native-O3 -march=native -flto-O2246810SE +/- 0.012, N = 3SE +/- 0.020, N = 3SE +/- 0.014, N = 38.4058.4548.771-march=native-march=native -flto-O21. (CXX) g++ options: -fopenmp -O3

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p-O3 -march=native -flto-O3 -march=native-O2306090120150SE +/- 1.44, N = 5SE +/- 1.58, N = 4SE +/- 1.53, N = 4141.83139.13136.31-march=native -flto-march=native1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU-O2-O3 -march=native -flto-O3 -march=native48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 314.1714.2514.29MIN: 14.04-flto - MIN: 14.14MIN: 14.181. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.3712.5112.52MIN: 12.28MIN: 12.43-flto - MIN: 12.411. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 316.1716.1816.19MIN: 16.09MIN: 16.09-flto - MIN: 16.091. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=native -flto-O3 -march=native-O2246810SE +/- 0.003, N = 3SE +/- 0.010, N = 3SE +/- 0.048, N = 35.3765.4797.304-march=native -flto-march=native-O21. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression-O3 -march=native -flto-O3 -march=native-O21.2062.4123.6184.8246.03SE +/- 0.008, N = 3SE +/- 0.014, N = 3SE +/- 0.005, N = 35.1035.1275.360-O3 -march=native -flto-O3 -march=native-O21. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium-O3 -march=native -flto-O3 -march=native-O21.18082.36163.54244.72325.904SE +/- 0.0065, N = 3SE +/- 0.0013, N = 3SE +/- 0.0027, N = 35.17055.18205.2481-O3 -march=native-O3 -march=native1. (CXX) g++ options: -flto -O2 -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080p-O2-O3 -march=native160320480640800SE +/- 2.55, N = 3SE +/- 1.03, N = 3727.60717.31-O2 -lm - MIN: 643.78 / MAX: 798.32-O3 -march=native - MIN: 641.13 / MAX: 782.171. (CC) gcc options: -pthread

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O3 -march=native -flto-O3 -march=native-O24080120160200SE +/- 0.31, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 3166.05164.77160.65-march=native -flto-march=native1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O24080120160200SE +/- 0.28, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3201.70201.10198.01-march=native-march=native -flto1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 317.0617.0617.13MIN: 16.67MIN: 16.72-flto - MIN: 16.731. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU-O3 -march=native -flto-O2-O3 -march=native0.96931.93862.90793.87724.8465SE +/- 0.00501, N = 3SE +/- 0.01250, N = 3SE +/- 0.01947, N = 34.251764.270774.30798-flto - MIN: 4.15MIN: 4.16MIN: 4.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU-O3 -march=native-O2-O3 -march=native -flto0.33190.66380.99571.32761.6595SE +/- 0.00602, N = 3SE +/- 0.01597, N = 3SE +/- 0.00575, N = 31.457881.467261.47524MIN: 1.36MIN: 1.37-flto - MIN: 1.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O260120180240300SE +/- 0.09, N = 3SE +/- 0.22, N = 3SE +/- 0.52, N = 3278.72278.59273.60-march=native-march=native -flto1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.5