11900K Compiler

Intel Core i9-11900K testing with an ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2105179-IB-11900KCOM00&grs.

System configuration (identical for all three results: GCC 11.1 -O3 -march=native, -O3 -march=native -flto, -O2)

Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)
Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS)
Chipset: Intel Tiger Lake-H
Memory: 32GB
Disk: 500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0
Graphics: AMD Radeon VII 16GB (1801/1000MHz)
Audio: Intel Tiger Lake-H HD Audio
Monitor: ASUS MG28U
Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Fedora 34
Kernel: 5.11.20-300.fc34.x86_64 (x86_64)
Desktop: GNOME Shell 40.1
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 21.0.3 (LLVM 12.0.0)
Compiler: GCC 11.1.1 20210428
File-System: btrfs
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- GCC 11.1: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- GCC 11.1: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
- GCC 11.1: -O2: CXXFLAGS=-O2 CFLAGS=-O2

Compiler Details
- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0x3c
- Thermald 2.4.1

Security Details
- SELinux
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Result overview: one value per test for each configuration (-O3 -march=native, -O3 -march=native -flto, -O2). The individual results are listed per test in the sections below.

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
-O3 -march=native: 47.35 (SE +/- 0.15, N = 3)
-O3 -march=native -flto: 47.61 (SE +/- 0.16, N = 3)
-O2: 106.52 (SE +/- 0.05, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -lm -lpthread -O3
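
To read the C-Ray numbers as relative speedups, a minimal Python sketch follows. It takes the three times reported above (seconds, fewer is better) and divides the -O2 time by each configuration's time. The function name speedup_over is purely illustrative and is not part of the Phoronix Test Suite.

```python
# C-Ray total times in seconds, copied from the result above (fewer is better).
times = {
    "-O3 -march=native": 47.35,
    "-O3 -march=native -flto": 47.61,
    "-O2": 106.52,
}


def speedup_over(baseline, results):
    """Return baseline_time / time per configuration; values above 1 are faster."""
    base = results[baseline]
    return {name: base / seconds for name, seconds in results.items()}


speedups = speedup_over("-O2", times)
for name, factor in speedups.items():
    print(f"{name}: {factor:.2f}x relative to -O2")
```

For "More Is Better" metrics such as FPS, the ratio would be inverted (configuration value divided by baseline value).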

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 3.24 (SE +/- 0.02, N = 3; MIN: 3.17 / MAX: 6.75)
-O3 -march=native -flto: 5.60 (SE +/- 0.06, N = 3; MIN: 5.41 / MAX: 9.21)
-O2: 3.48 (SE +/- 0.00, N = 15; MIN: 3.4 / MAX: 7.05)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread
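
Each result is reported with a standard error (SE) over N runs. The export does not include the per-run samples, so the sketch below uses hypothetical run times and assumes the common definition: sample standard deviation divided by the square root of N. Whether the Phoronix Test Suite uses exactly this estimator is an assumption.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run times in ms (NOT taken from the export).
hypothetical_runs = [3.21, 3.24, 3.27]
mean = statistics.mean(hypothetical_runs)
se = standard_error(hypothetical_runs)
print(f"{mean:.2f} ms (SE +/- {se:.2f}, N = {len(hypothetical_runs)})")
```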

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.2 (FPS, More Is Better)
-O3 -march=native: 223.02 (SE +/- 0.03, N = 3; MIN: 153.51 / MAX: 490.73)
-O2: 148.40 (SE +/- 0.09, N = 3; MIN: 95.23 / MAX: 345.29)
Per-result flags as exported: -O3 -march=native; -O2 -lm
1. (CC) gcc options: -pthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 2.30 (SE +/- 0.06, N = 3; MIN: 2.18 / MAX: 3.19)
-O3 -march=native -flto: 2.27 (SE +/- 0.01, N = 3; MIN: 2.21 / MAX: 5.8)
-O2: 3.11 (SE +/- 0.01, N = 14; MIN: 3.05 / MAX: 9.87)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 (Seconds, Fewer Is Better)
-O3 -march=native: 5.479 (SE +/- 0.010, N = 3)
-O3 -march=native -flto: 5.376 (SE +/- 0.003, N = 3)
-O2: 7.304 (SE +/- 0.048, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 3.24 (SE +/- 0.04, N = 3; MIN: 3.1 / MAX: 6.67)
-O3 -march=native -flto: 3.25 (SE +/- 0.01, N = 3; MIN: 3.14 / MAX: 6.68)
-O2: 4.20 (SE +/- 0.02, N = 15; MIN: 4.04 / MAX: 7.77)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 11.83 (SE +/- 0.06, N = 3; MIN: 11.62 / MAX: 15.38)
-O3 -march=native -flto: 13.34 (SE +/- 0.01, N = 3; MIN: 13.01 / MAX: 16.82)
-O2: 15.15 (SE +/- 0.14, N = 15; MIN: 14.73 / MAX: 342.42)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 2.55 (SE +/- 0.05, N = 3; MIN: 2.43 / MAX: 6.16)
-O3 -march=native -flto: 2.52 (SE +/- 0.01, N = 3; MIN: 2.47 / MAX: 6.06)
-O2: 3.18 (SE +/- 0.01, N = 15; MIN: 3.1 / MAX: 6.78)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 270
-O3 -march=native -flto: 269
-O2: 219
Standard errors as exported (two reported for three results): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 4.38 (SE +/- 0.08, N = 3; MIN: 4.18 / MAX: 7.9)
-O3 -march=native -flto: 4.32 (SE +/- 0.01, N = 3; MIN: 4.25 / MAX: 8.71)
-O2: 5.23 (SE +/- 0.02, N = 15; MIN: 5.12 / MAX: 8.96)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 18.23 (SE +/- 0.16, N = 3; MIN: 17.76 / MAX: 22.08)
-O3 -march=native -flto: 18.43 (SE +/- 0.06, N = 3; MIN: 18.19 / MAX: 22.12)
-O2: 22.07 (SE +/- 0.08, N = 15; MIN: 21.33 / MAX: 27.94)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 195 (SE +/- 0.88, N = 3)
-O3 -march=native -flto: 195 (SE +/- 0.67, N = 3)
-O2: 164 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1 (Seconds, Fewer Is Better)
-O3 -march=native: 5.587 (SE +/- 0.007, N = 5)
-O3 -march=native -flto: 5.575 (SE +/- 0.033, N = 5)
-O2: 6.467 (SE +/- 0.030, N = 5)
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 20.23 (SE +/- 0.04, N = 3; MIN: 20.02 / MAX: 23.8)
-O3 -march=native -flto: 23.45 (SE +/- 0.08, N = 3; MIN: 23.14 / MAX: 26.98)
-O2: 21.07 (SE +/- 0.09, N = 15; MIN: 20.27 / MAX: 26.62)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

AOBench

Size: 2048 x 2048 - Total Time

AOBench (Seconds, Fewer Is Better)
-O3 -march=native: 21.54 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 21.58 (SE +/- 0.05, N = 3)
-O2: 24.46 (SE +/- 0.03, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -lm -O3

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 1198
-O3 -march=native -flto: 1229
-O2: 1091
Standard errors as exported (two reported for three results): SE +/- 6.89, N = 3; SE +/- 1.20, N = 3
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 (MFLOPS, More Is Better)
-O3 -march=native: 6878.51 (SE +/- 6.62, N = 3)
-O3 -march=native -flto: 7079.88 (SE +/- 3.24, N = 3)
-O2: 6305.48 (SE +/- 0.74, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -O3 -mavx2

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 8.62 (SE +/- 0.03, N = 3; MIN: 8.51 / MAX: 12.11)
-O3 -march=native -flto: 8.91 (SE +/- 0.06, N = 3; MIN: 8.72 / MAX: 12.48)
-O2: 9.61 (SE +/- 0.02, N = 12; MIN: 9.44 / MAX: 13.78)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 10.20 (SE +/- 0.21, N = 3; MIN: 9.72 / MAX: 13.93)
-O3 -march=native -flto: 10.27 (SE +/- 0.13, N = 3; MIN: 9.93 / MAX: 13.87)
-O2: 11.11 (SE +/- 0.08, N = 15; MIN: 10.75 / MAX: 16.77)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

WebP Image Encode

Encode Settings: Quality 100, Lossless

WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
-O3 -march=native: 12.90 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 12.71 (SE +/- 0.02, N = 3)
-O2: 13.76 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 686530000 (SE +/- 2160717.47, N = 3)
-O3 -march=native -flto: 684356667 (SE +/- 2050604.25, N = 3)
-O2: 635506667 (SE +/- 766753.62, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 (ms, Fewer Is Better)
-O3 -march=native: 230.02 (SE +/- 0.15, N = 3; MIN: 229.3 / MAX: 233.4)
-O3 -march=native -flto: 247.89 (SE +/- 0.12, N = 3; MIN: 247.03 / MAX: 249.92)
-O2: 243.42 (SE +/- 0.21, N = 3; MIN: 241.9 / MAX: 246.46)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 5.28726 (SE +/- 0.02936, N = 3; MIN: 4.8)
-O3 -march=native -flto: 5.40080 (SE +/- 0.01659, N = 3; MIN: 4.78)
-O2: 5.01199 (SE +/- 0.03422, N = 3; MIN: 4.47)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 1141
-O3 -march=native -flto: 1072
-O2: 1066
Standard errors as exported (two reported for three results): SE +/- 1.53, N = 3; SE +/- 0.67, N = 3
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
-O3 -march=native: 85.42 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 85.42 (SE +/- 0.01, N = 3)
-O2: 91.38 (SE +/- 0.01, N = 3)
Per-result flags as exported: -O3 -march=native; -O3 -march=native
1. (CXX) g++ options: -O2 -flto -pthread

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 (ms, Fewer Is Better)
-O3 -march=native: 227.66 (SE +/- 0.17, N = 3; MIN: 226.71 / MAX: 229.36)
-O3 -march=native -flto: 242.55 (SE +/- 0.12, N = 3; MIN: 241.93 / MAX: 243.45)
-O2: 236.05 (SE +/- 0.09, N = 3; MIN: 234.65 / MAX: 236.77)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
-O3 -march=native: 11.38 (SE +/- 0.02, N = 3)
-O3 -march=native -flto: 11.40 (SE +/- 0.01, N = 3)
-O2: 12.09 (SE +/- 0.01, N = 3)
Per-result flags as exported: -O3 -march=native; -O3 -march=native
1. (CXX) g++ options: -O2 -flto -pthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

eSpeak-NG Speech Engine 20200907 (Seconds, Fewer Is Better)
-O3 -march=native: 21.71 (SE +/- 0.06, N = 4)
-O3 -march=native -flto: 22.60 (SE +/- 0.05, N = 4)
-O2: 21.33 (SE +/- 0.05, N = 4)
1. (CC) gcc options: -std=c99 -lpthread -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 16.50 (SE +/- 0.01, N = 3; MIN: 16.39)
-O3 -march=native -flto: 17.42 (SE +/- 0.00, N = 3; MIN: 17.27)
-O2: 16.69 (SE +/- 0.17, N = 5; MIN: 16.38)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 285.3 (SE +/- 2.26, N = 3)
-O3 -march=native -flto: 281.1 (SE +/- 3.12, N = 4)
-O2: 296.0 (SE +/- 1.68, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 5546.0 (SE +/- 15.18, N = 3)
-O3 -march=native -flto: 5477.9 (SE +/- 6.81, N = 4)
-O2: 5760.9 (SE +/- 5.74, N = 3)
1. (CC) gcc options: -pthread -lz

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
-O3 -march=native: 5.127 (SE +/- 0.014, N = 3)
-O3 -march=native -flto: 5.103 (SE +/- 0.008, N = 3)
-O2: 5.360 (SE +/- 0.005, N = 3)
1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 4514.8 (SE +/- 8.15, N = 3)
-O3 -march=native -flto: 4503.1 (SE +/- 17.62, N = 3)
-O2: 4718.1 (SE +/- 5.61, N = 3)
1. (CC) gcc options: -pthread -lz

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 2.1.0 (Megapixels/sec, More Is Better)
-O3 -march=native: 273.10 (SE +/- 0.20, N = 3)
-O3 -march=native -flto: 272.60 (SE +/- 0.41, N = 3)
-O2: 261.03 (SE +/- 0.16, N = 3)
Per-result flags as exported: -march=native -lm; -march=native -flto -lm; -O2
1. (CC) gcc options: -O3 -rdynamic

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 11.24 (SE +/- 0.00, N = 3; MIN: 11.15)
-O3 -march=native -flto: 11.23 (SE +/- 0.01, N = 3; MIN: 11.14)
-O2: 10.75 (SE +/- 0.01, N = 3; MIN: 10.65)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
-O3 -march=native: 8.405 (SE +/- 0.012, N = 3)
-O3 -march=native -flto: 8.454 (SE +/- 0.020, N = 3)
-O2: 8.771 (SE +/- 0.014, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CXX) g++ options: -fopenmp -O3

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 4582.3 (SE +/- 11.11, N = 3)
-O3 -march=native -flto: 4579.8 (SE +/- 14.15, N = 3)
-O2: 4777.3 (SE +/- 4.91, N = 3)
1. (CC) gcc options: -pthread -lz

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O3 -march=native: 139.13 (SE +/- 1.58, N = 4)
-O3 -march=native -flto: 141.83 (SE +/- 1.44, N = 5)
-O2: 136.31 (SE +/- 1.53, N = 4)
Per-result flags as exported: -march=native; -march=native -flto
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 15.53 (SE +/- 0.12, N = 3; MIN: 15.19 / MAX: 20.95)
-O3 -march=native -flto: 15.92 (SE +/- 0.28, N = 3; MIN: 15.55 / MAX: 21.06)
-O2: 16.15 (SE +/- 0.01, N = 15; MIN: 15.95 / MAX: 21.54)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
-O3 -march=native: 8.067 (SE +/- 0.063, N = 15)
-O3 -march=native -flto: 8.328 (SE +/- 0.055, N = 15)
-O2: 8.023 (SE +/- 0.106, N = 3)
Per-result flags as exported: -O3 -march=native; -O3 -march=native -flto
1. (CXX) g++ options: -O2 -pthread -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O3 -march=native: 164.77 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 166.05 (SE +/- 0.31, N = 3)
-O2: 160.65 (SE +/- 0.13, N = 3)
Per-result flags as exported: -march=native; -march=native -flto
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.2 (Seconds, Fewer Is Better)
-O3 -march=native: 100.74 (SE +/- 0.04, N = 3)
-O3 -march=native -flto: 99.97 (SE +/- 0.08, N = 3)
-O2: 103.29 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
-O3 -march=native: 29932441 (SE +/- 279559.22, N = 3)
-O3 -march=native -flto: 29086394 (SE +/- 94171.94, N = 3)
-O2: 29094819 (SE +/- 96950.30, N = 3)
Per-result flags as exported: -march=native; -march=native; -O2
1. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
-O3 -march=native: 27.26 (SE +/- 0.05, N = 3)
-O3 -march=native -flto: 27.07 (SE +/- 0.04, N = 3)
-O2: 27.84 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

NCNN

Target: CPU - Model: resnet18

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 11.08 (SE +/- 0.17, N = 3; MIN: 10.66 / MAX: 16.66)
-O3 -march=native -flto: 11.39 (SE +/- 0.02, N = 3; MIN: 11.27 / MAX: 15.15)
-O2: 11.30 (SE +/- 0.06, N = 14; MIN: 10.84 / MAX: 14.99)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 (Seconds, Fewer Is Better)
-O3 -march=native: 86.70 (SE +/- 0.06, N = 3)
-O3 -march=native -flto: 84.93 (SE +/- 0.32, N = 3)
-O2: 87.30 (SE +/- 0.53, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 4.86522 (SE +/- 0.02065, N = 3; MIN: 3.82)
-O3 -march=native -flto: 4.74611 (SE +/- 0.02041, N = 3; MIN: 3.7)
-O2: 4.87601 (SE +/- 0.02350, N = 3; MIN: 3.82)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 1891.71 (SE +/- 1.66, N = 3; MIN: 1880.74)
-O3 -march=native -flto: 1877.51 (SE +/- 1.29, N = 3; MIN: 1866.09)
-O2: 1841.63 (SE +/- 0.68, N = 3; MIN: 1831.74)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
-O3 -march=native: 15.81 (SE +/- 0.13, N = 15)
-O3 -march=native -flto: 15.40 (SE +/- 0.15, N = 6)
-O2: 15.64 (SE +/- 0.21, N = 3)
Per-result flags as exported: -O3 -march=native; -O3 -march=native -flto
1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2 (Seconds, Fewer Is Better)
-O3 -march=native: 5.931 (SE +/- 0.004, N = 5)
-O3 -march=native -flto: 5.936 (SE +/- 0.003, N = 5)
-O2: 6.086 (SE +/- 0.003, N = 5)
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 35.4 (SE +/- 0.44, N = 3)
-O3 -march=native -flto: 34.8 (SE +/- 0.03, N = 3)
-O2: 34.5 (SE +/- 0.15, N = 3)
1. (CC) gcc options: -pthread -lz

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 1887.61 (SE +/- 0.80, N = 3; MIN: 1877.87)
-O3 -march=native -flto: 1876.25 (SE +/- 1.27, N = 3; MIN: 1866.41)
-O2: 1842.14 (SE +/- 1.67, N = 3; MIN: 1831.93)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 1890.59 (SE +/- 2.13, N = 3; MIN: 1879.82)
-O3 -march=native -flto: 1874.70 (SE +/- 1.23, N = 3; MIN: 1865.22)
-O2: 1845.74 (SE +/- 1.34, N = 3; MIN: 1834.84)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O3 -march=native: 195.87 (SE +/- 1.48, N = 10)
-O3 -march=native -flto: 195.07 (SE +/- 1.49, N = 10)
-O2: 191.83 (SE +/- 1.51, N = 10)
Per-result flags as exported: -march=native; -march=native -flto
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

PJSIP

Method: INVITE

PJSIP 2.11 (Responses Per Second, More Is Better)
-O3 -march=native: 4959 (SE +/- 41.25, N = 3)
-O3 -march=native -flto: 5058 (SE +/- 3.18, N = 3)
-O2: 5001 (SE +/- 32.83, N = 3)
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.2 (FPS, More Is Better)
-O3 -march=native: 190.31 (SE +/- 0.09, N = 3; MIN: 174.59 / MAX: 201.24)
-O2: 186.75 (SE +/- 0.05, N = 3; MIN: 170.98 / MAX: 196.55)
Per-result flags as exported: -O3 -march=native; -O2 -lm
1. (CC) gcc options: -pthread

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O3 -march=native: 278.72 (SE +/- 0.09, N = 3)
-O3 -march=native -flto: 278.59 (SE +/- 0.22, N = 3)
-O2: 273.60 (SE +/- 0.52, N = 3)
Per-result flags as exported: -march=native; -march=native -flto
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O3 -march=native: 201.70 (SE +/- 0.28, N = 3)
-O3 -march=native -flto: 201.10 (SE +/- 0.29, N = 3)
-O2: 198.01 (SE +/- 0.06, N = 3)
Per-result flags as exported: -march=native; -march=native -flto
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

Redis

Test: SET

Redis 6.0.9 (Requests Per Second, More Is Better)
-O3 -march=native: 2980192.00 (SE +/- 15075.35, N = 3)
-O3 -march=native -flto: 2990164.92 (SE +/- 3890.24, N = 3)
-O2: 2936296.08 (SE +/- 20903.58, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 722893333 (SE +/- 209549.78, N = 3)
-O3 -march=native -flto: 722393333 (SE +/- 322714.18, N = 3)
-O2: 711343333 (SE +/- 189414.30, N = 3)
Per-result flags as exported: -march=native; -march=native -flto; -O2
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 3173.47 (SE +/- 2.46, N = 3; MIN: 3161.04)
-O3 -march=native -flto: 3152.89 (SE +/- 3.52, N = 3; MIN: 3137.49)
-O2: 3124.56 (SE +/- 0.76, N = 3; MIN: 3112.25)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 3172.19 (SE +/- 1.30, N = 3; MIN: 3159.8)
-O3 -march=native -flto: 3148.67 (SE +/- 0.32, N = 3; MIN: 3137.59)
-O2: 3123.64 (SE +/- 5.44, N = 3; MIN: 3105.42)
Per-result flags as exported: -flto (on the -O3 -march=native -flto result)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Quantum ESPRESSO

Input: AUSURF112

Quantum ESPRESSO 6.7 - Input: AUSURF112
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 2576.97 (SE +/- 21.65, N = 3)
-O3 -march=native -flto: 2540.19 (SE +/- 24.60, N = 3)
-O2: 2538.25 (SE +/- 18.09, N = 3)
1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 3171.46 (SE +/- 0.26, N = 3) [MIN: 3160.11]
-O3 -march=native -flto: 3154.69 (SE +/- 3.24, N = 3) [-flto - MIN: 3138.34]
-O2: 3123.95 (SE +/- 2.61, N = 3) [MIN: 3109.77]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

ASTC Encoder

Preset: Medium

ASTC Encoder 2.4 - Preset: Medium
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 5.1820 (SE +/- 0.0013, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 5.1705 (SE +/- 0.0065, N = 3) [-O3 -march=native]
-O2: 5.2481 (SE +/- 0.0027, N = 3)
1. (CXX) g++ options: -O2 -flto -pthread

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.2 - Video Input: Summer Nature 1080p
FPS, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 717.31 (SE +/- 1.03, N = 3) [-O3 -march=native - MIN: 641.13 / MAX: 782.17]
-O2: 727.60 (SE +/- 2.55, N = 3) [-O2 -lm - MIN: 643.78 / MAX: 798.32]
1. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p

dav1d 0.8.2 - Video Input: Chimera 1080p
FPS, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 763.05 (SE +/- 0.33, N = 3) [-O3 -march=native - MIN: 584.4 / MAX: 1127.78]
-O2: 773.93 (SE +/- 1.36, N = 3) [-O2 -lm - MIN: 589.24 / MAX: 1160.82]
1. (CC) gcc options: -pthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second
Iterations/Sec, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 432583.96 (SE +/- 1364.82, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 435901.44 (SE +/- 166.46, N = 3) [-O3 -march=native -flto]
-O2: 430127.50 (SE +/- 1236.61, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 4.30798 (SE +/- 0.01947, N = 3) [MIN: 4.19]
-O3 -march=native -flto: 4.25176 (SE +/- 0.00501, N = 3) [-flto - MIN: 4.15]
-O2: 4.27077 (SE +/- 0.01250, N = 3) [MIN: 4.16]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 12.51 (SE +/- 0.01, N = 3) [MIN: 12.43]
-O3 -march=native -flto: 12.52 (SE +/- 0.00, N = 3) [-flto - MIN: 12.41]
-O2: 12.37 (SE +/- 0.00, N = 3) [MIN: 12.28]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

NCNN

Target: CPU - Model: vgg16

NCNN 20201218 - Target: CPU - Model: vgg16
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 54.50 (SE +/- 0.14, N = 3) [-O3 -march=native - MIN: 53.96 / MAX: 58.57]
-O3 -march=native -flto: 54.13 (SE +/- 0.13, N = 3) [-O3 -march=native -flto - MIN: 53.54 / MAX: 59.11]
-O2: 54.80 (SE +/- 0.05, N = 15) [MIN: 54.15 / MAX: 64]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 1.45788 (SE +/- 0.00602, N = 3) [MIN: 1.36]
-O3 -march=native -flto: 1.47524 (SE +/- 0.00575, N = 3) [-flto - MIN: 1.37]
-O2: 1.46726 (SE +/- 0.01597, N = 3) [MIN: 1.37]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 3.17026 (SE +/- 0.00399, N = 3) [MIN: 3.1]
-O3 -march=native -flto: 3.13941 (SE +/- 0.00623, N = 3) [-flto - MIN: 3.07]
-O2: 3.13532 (SE +/- 0.00129, N = 3) [MIN: 3.07]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 - Timed Time - Size 1,000
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 44.09 (SE +/- 0.30, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 43.78 (SE +/- 0.13, N = 3) [-O3 -march=native -flto]
-O2: 43.62 (SE +/- 0.15, N = 3) [-O2]
1. (CC) gcc options: -ldl -lz -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Compression Speed
MB/s, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 33.0 (SE +/- 0.23, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 32.8 (SE +/- 0.12, N = 3) [-O3 -march=native -flto]
-O2: 32.7 (SE +/- 0.22, N = 3) [-O2]
1. (CC) gcc options: -pthread -lz

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 14.29 (SE +/- 0.01, N = 3) [MIN: 14.18]
-O3 -march=native -flto: 14.25 (SE +/- 0.01, N = 3) [-flto - MIN: 14.14]
-O2: 14.17 (SE +/- 0.01, N = 3) [MIN: 14.04]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

NCNN

Target: CPU - Model: alexnet

NCNN 20201218 - Target: CPU - Model: alexnet
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 9.63 (SE +/- 0.01, N = 3) [-O3 -march=native - MIN: 9.56 / MAX: 13.14]
-O3 -march=native -flto: 9.70 (SE +/- 0.02, N = 3) [-O3 -march=native -flto - MIN: 9.56 / MAX: 13.19]
-O2: 9.63 (SE +/- 0.01, N = 15) [MIN: 9.47 / MAX: 14.51]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

PJSIP

Method: OPTIONS, Stateless

PJSIP 2.11 - Method: OPTIONS, Stateless
Responses Per Second, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 241439 (SE +/- 1015.58, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 239892 (SE +/- 101.47, N = 3) [-O3 -march=native -flto]
-O2: 239792 (SE +/- 504.43, N = 3) [-O2]
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 0.722430 (SE +/- 0.002639, N = 3) [MIN: 0.67]
-O3 -march=native -flto: 0.720482 (SE +/- 0.001704, N = 3) [-flto - MIN: 0.67]
-O2: 0.717882 (SE +/- 0.001308, N = 3) [MIN: 0.66]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.2 - Test: Unkeyed Algorithms
MiB/second, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 489.76 (SE +/- 0.14, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 488.63 (SE +/- 0.29, N = 3) [-O3 -march=native -flto]
-O2: 491.64 (SE +/- 0.06, N = 3) [-O2]
1. (CXX) g++ options: -fPIC -pthread -pipe

Redis

Test: GET

Redis 6.0.9 - Test: GET
Requests Per Second, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 4036791.92 (SE +/- 16885.42, N = 3) [-march=native]
-O3 -march=native -flto: 4060369.08 (SE +/- 23615.46, N = 3) [-march=native -flto]
-O2: 4051463.17 (SE +/- 8839.00, N = 3) [-O2]
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 4.06617 (SE +/- 0.00741, N = 3) [MIN: 3.93]
-O3 -march=native -flto: 4.04481 (SE +/- 0.00379, N = 3) [-flto - MIN: 3.91]
-O2: 4.04477 (SE +/- 0.00867, N = 3) [MIN: 3.93]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 17.06 (SE +/- 0.00, N = 3) [MIN: 16.72]
-O3 -march=native -flto: 17.13 (SE +/- 0.01, N = 3) [-flto - MIN: 16.73]
-O2: 17.06 (SE +/- 0.00, N = 3) [MIN: 16.67]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 3.52791 (SE +/- 0.00143, N = 3) [MIN: 3.45]
-O3 -march=native -flto: 3.52381 (SE +/- 0.00147, N = 3) [-flto - MIN: 3.46]
-O2: 3.53315 (SE +/- 0.00099, N = 3) [MIN: 3.47]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 0.829637 (SE +/- 0.003135, N = 3) [MIN: 0.81]
-O3 -march=native -flto: 0.831699 (SE +/- 0.003541, N = 3) [-flto - MIN: 0.81]
-O2: 0.829564 (SE +/- 0.003232, N = 3) [MIN: 0.81]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3 - WAV To WavPack
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 11.08 (SE +/- 0.00, N = 5) [-O3 -march=native]
-O3 -march=native -flto: 11.10 (SE +/- 0.00, N = 5) [-O3 -march=native -flto]
-O2: 11.08 (SE +/- 0.00, N = 5) [-O2]
1. (CXX) g++ options: -rdynamic

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 1.32271 (SE +/- 0.00166, N = 3) [MIN: 1.26]
-O3 -march=native -flto: 1.32311 (SE +/- 0.00175, N = 3) [-flto - MIN: 1.26]
-O2: 1.32100 (SE +/- 0.00212, N = 3) [MIN: 1.25]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

PJSIP

Method: OPTIONS, Stateful

PJSIP 2.11 - Method: OPTIONS, Stateful
Responses Per Second, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 9389 (SE +/- 6.96, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 9395 (SE +/- 4.58, N = 3) [-O3 -march=native -flto]
-O2: 9381 (SE +/- 1.67, N = 3) [-O2]
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 3.53708 (SE +/- 0.00232, N = 3) [MIN: 3.41]
-O3 -march=native -flto: 3.53500 (SE +/- 0.00020, N = 3) [-flto - MIN: 3.44]
-O2: 3.54019 (SE +/- 0.00185, N = 3) [MIN: 3.46]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Sysbench

Test: CPU

Sysbench 1.0.20 - Test: CPU
Events Per Second, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 34776.08 (SE +/- 0.97, N = 3) [-O3 -march=native]
-O3 -march=native -flto: 34751.01 (SE +/- 1.11, N = 3) [-O3 -march=native -flto]
-O2: 34799.70 (SE +/- 0.65, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 16.18 (SE +/- 0.00, N = 3) [MIN: 16.09]
-O3 -march=native -flto: 16.19 (SE +/- 0.00, N = 3) [-flto - MIN: 16.09]
-O2: 16.17 (SE +/- 0.00, N = 3) [MIN: 16.09]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 8.57548 (SE +/- 0.00184, N = 3) [MIN: 8.41]
-O3 -march=native -flto: 8.57248 (SE +/- 0.00390, N = 3) [-flto - MIN: 8.44]
-O2: 8.57623 (SE +/- 0.00352, N = 3) [MIN: 8.42]
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 - Total Time
GMPbench Score, More Is Better (OpenBenchmarking.org)
-O3 -march=native: 6172.9
-O3 -march=native -flto: 6171.6 [-flto]
1. (CC) gcc options: -O3 -march=native -lm

NCNN

Target: CPU - Model: blazeface

NCNN 20201218 - Target: CPU - Model: blazeface
ms, Fewer Is Better (OpenBenchmarking.org)
-O3 -march=native: 1.19 (SE +/- 0.06, N = 3) [-O3 -march=native - MIN: 1.09 / MAX: 2.02]
-O3 -march=native -flto: 1.69 (SE +/- 0.01, N = 3) [-O3 -march=native -flto - MIN: 1.64 / MAX: 2.46]
-O2: 1.19 (SE +/- 0.01, N = 15) [MIN: 1.14 / MAX: 5.67]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread
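One rough way to judge whether a gap such as the blazeface -flto result stands out from run-to-run noise is to divide the difference of two means by their combined standard error. The sketch below is only a heuristic under stated assumptions: it treats the reported SE values as independent, ignores the small sample sizes (N = 3 or 15), and uses a hypothetical helper named `gap_in_se`; it is not part of the Phoronix Test Suite.

```python
import math

# Mean time (ms, fewer is better) and reported standard error, copied from
# the NCNN blazeface chart.
runs = {
    "-O3 -march=native": (1.19, 0.06),
    "-O3 -march=native -flto": (1.69, 0.01),
    "-O2": (1.19, 0.01),
}


def gap_in_se(a, b):
    """Difference of two means divided by their combined standard error.

    Hypothetical helper; assumes the two results are independent.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    return (mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


z = gap_in_se(runs["-O3 -march=native -flto"], runs["-O3 -march=native"])
print(f"-flto vs non-LTO gap: {z:.1f} combined standard errors")
```

With these inputs the -flto gap comes out at roughly eight combined standard errors, whereas most charts in this result file show gaps of the same order as the reported SE.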


Phoronix Test Suite v10.8.5