11900K Compiler

Intel Core i9-11900K testing with a ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2105179-IB-11900KCOM00&sro&grs.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionGCC 11.1 -O3 -march=native -O3 -march=native -flto -O2Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)ASUS ROG MAXIMUS XIII HERO (0707 BIOS)Intel Tiger Lake-H32GB500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0AMD Radeon VII 16GB (1801/1000MHz)Intel Tiger Lake-H HD AudioASUS MG28U2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Fedora 345.11.20-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.0.3 (LLVM 12.0.0)GCC 11.1.1 20210428btrfs3840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- GCC 11.1: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.1: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 11.1: -O2: CXXFLAGS=-O2 CFLAGS=-O2Compiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.1Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

c-ray: Total Time - 4K, 16 Rays Per Pixelncnn: CPU - shufflenet-v2dav1d: Chimera 1080p 10-bitncnn: CPU - mnasnetencode-mp3: WAV To MP3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: CPU-v3-v3 - mobilenet-v3graphics-magick: Enhancedncnn: CPU - efficientnet-b0ncnn: CPU - resnet50graphics-magick: Sharpenencode-opus: WAV To Opus Encodencnn: CPU - yolov4-tinyaobench: 2048 x 2048 - Total Timegraphics-magick: Resizinghimeno: Poisson Pressure Solverncnn: CPU - regnety_400mncnn: CPU - googlenetwebp: Quality 100, Losslessliquid-dsp: 8 - 256 - 57tnn: CPU - MobileNet v2onednn: IP Shapes 3D - bf16bf16bf16 - CPUgraphics-magick: Rotateastcenc: Exhaustivetnn: CPU - SqueezeNet v1.1astcenc: Thoroughespeak: Text-To-Speech Synthesisonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedwebp: Quality 100, Highest Compressioncompress-zstd: 19 - Decompression Speedtjbench: Decompression Throughputonednn: IP Shapes 3D - f32 - CPUsmallpt: Global Illumination Renderer; 128 Samplescompress-zstd: 19, Long Mode - Decompression Speedsvt-hevc: 7 - Bosphorus 1080pncnn: CPU - squeezenet_ssdlammps: Rhodopsin Proteinsvt-vp9: Visual Quality Optimized - Bosphorus 1080phmmer: Pfam Database Searchstockfish: Total Timewebp: Quality 100, Lossless, Highest Compressionncnn: CPU - resnet18mrbayes: Primate Phylogeny Analysisonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUx265: Bosphorus 4Kencode-flac: WAV To FLACcompress-zstd: 19 - Compression Speedonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUsvt-vp9: VMAF Optimized - Bosphorus 1080ppjsip: INVITEdav1d: Summer Nature 4Ksvt-hevc: 10 - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080predis: SETliquid-dsp: 16 - 256 - 57onednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUqe: AUSURF112onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUastcenc: Mediumdav1d: Summer Nature 1080pdav1d: Chimera 1080pcoremark: CoreMark Size 666 - Iterations Per Secondonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUncnn: CPU - vgg16onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUsqlite-speedtest: Timed Time - Size 1,000compress-zstd: 19, Long Mode - Compression Speedonednn: Convolution Batch Shapes Auto - f32 - CPUncnn: CPU - alexnetpjsip: OPTIONS, Statelessonednn: IP Shapes 1D - u8s8f32 - CPUcryptopp: Unkeyed Algorithmsredis: GETonednn: IP Shapes 1D - f32 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUencode-wavpack: WAV To WavPackonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUpjsip: OPTIONS, Statefulonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUsysbench: CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUgmpbench: Total Timencnn: CPU - blazefaceGCC 11.1 -O3 -march=native -O3 -march=native -flto -O247.3453.24223.022.305.4793.2411.832.552704.3818.231955.58720.2321.54411986878.5076868.6210.2012.901686530000230.0195.28726114185.4157227.66311.384621.70516.5026285.35546.05.1274514.8273.10004611.23958.4054582.3139.1315.538.067164.77100.7372993244127.26411.0886.6964.865221891.7115.815.93135.41887.611890.59195.874959190.31278.72201.702980192.007228933333173.473172.192576.973171.465.1820717.31763.05432583.9643524.3079812.514254.501.457883.1702644.08533.014.28899.632414390.722430489.7595514036791.924.0661717.05763.527910.82963711.0841.3227193893.5370834776.0816.18288.575486172.91.1947.6135.602.275.3763.2513.342.522694.3218.431955.57523.4521.57712297079.8838708.9110.2712.706684356667247.8895.40080107285.4207242.55011.395222.60417.4188281.15477.95.1034503.1272.60075811.22698.4544579.8141.8315.928.328166.0599.9712908639427.07411.3984.9294.746111877.5115.405.93634.81876.251874.70195.075058278.59201.102990164.927223933333152.893148.672540.193154.695.1705435901.4439594.2517612.523254.131.475243.1394143.77732.814.25239.702398920.720482488.627804060369.084.0448117.12893.523810.83169911.0991.3231193953.5350034751.0116.18648.572486171.61.69106.5223.48148.403.117.3044.2015.153.182195.2322.071646.46721.0724.45810916305.4818509.6111.1113.763635506667243.4165.01199106691.3799236.05012.094921.32516.6931296.05760.95.3604718.1261.03478510.74568.7714777.3136.3116.158.023160.65103.2912909481927.84111.3087.2974.876011841.6315.646.08634.51842.141845.74191.835001186.75273.60198.012936296.087113433333124.563123.642538.253123.955.2481727.60773.93430127.4981894.2707712.365954.801.467263.1353243.61632.714.16619.632397920.717882491.6449324051463.174.0447717.05683.533150.82956411.0771.3210093813.5401934799.7016.17428.576231.19OpenBenchmarking.org

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 3106.5247.3547.61-O2-march=native-march=native -flto1. (CC) gcc options: -lm -lpthread -O3

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2-O2-O3 -march=native-O3 -march=native -flto1.262.523.785.046.3SE +/- 0.00, N = 15SE +/- 0.02, N = 3SE +/- 0.06, N = 33.483.245.60MIN: 3.4 / MAX: 7.05-O3 -march=native - MIN: 3.17 / MAX: 6.75-O3 -march=native -flto - MIN: 5.41 / MAX: 9.211. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bit-O2-O3 -march=native50100150200250SE +/- 0.09, N = 3SE +/- 0.03, N = 3148.40223.02-O2 -lm - MIN: 95.23 / MAX: 345.29-O3 -march=native - MIN: 153.51 / MAX: 490.731. (CC) gcc options: -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet-O2-O3 -march=native-O3 -march=native -flto0.69981.39962.09942.79923.499SE +/- 0.01, N = 14SE +/- 0.06, N = 3SE +/- 0.01, N = 33.112.302.27MIN: 3.05 / MAX: 9.87-O3 -march=native - MIN: 2.18 / MAX: 3.19-O3 -march=native -flto - MIN: 2.21 / MAX: 5.81. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.048, N = 3SE +/- 0.010, N = 3SE +/- 0.003, N = 37.3045.4795.376-O2-march=native-march=native -flto1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2-O2-O3 -march=native-O3 -march=native -flto0.9451.892.8353.784.725SE +/- 0.02, N = 15SE +/- 0.04, N = 3SE +/- 0.01, N = 34.203.243.25MIN: 4.04 / MAX: 7.77-O3 -march=native - MIN: 3.1 / MAX: 6.67-O3 -march=native -flto - MIN: 3.14 / MAX: 6.681. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.14, N = 15SE +/- 0.06, N = 3SE +/- 0.01, N = 315.1511.8313.34MIN: 14.73 / MAX: 342.42-O3 -march=native - MIN: 11.62 / MAX: 15.38-O3 -march=native -flto - MIN: 13.01 / MAX: 16.821. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3-O2-O3 -march=native-O3 -march=native -flto0.71551.4312.14652.8623.5775SE +/- 0.01, N = 15SE +/- 0.05, N = 3SE +/- 0.01, N = 33.182.552.52MIN: 3.1 / MAX: 6.78-O3 -march=native - MIN: 2.43 / MAX: 6.16-O3 -march=native -flto - MIN: 2.47 / MAX: 6.061. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3219270269-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0-O2-O3 -march=native-O3 -march=native -flto1.17682.35363.53044.70725.884SE +/- 0.02, N = 15SE +/- 0.08, N = 3SE +/- 0.01, N = 35.234.384.32MIN: 5.12 / MAX: 8.96-O3 -march=native - MIN: 4.18 / MAX: 7.9-O3 -march=native -flto - MIN: 4.25 / MAX: 8.711. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50-O2-O3 -march=native-O3 -march=native -flto510152025SE +/- 0.08, N = 15SE +/- 0.16, N = 3SE +/- 0.06, N = 322.0718.2318.43MIN: 21.33 / MAX: 27.94-O3 -march=native - MIN: 17.76 / MAX: 22.08-O3 -march=native -flto - MIN: 18.19 / MAX: 22.121. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 3164195195-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.030, N = 5SE +/- 0.007, N = 5SE +/- 0.033, N = 56.4675.5875.575-O2-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -fvisibility=hidden -logg -lm

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny-O2-O3 -march=native-O3 -march=native -flto612182430SE +/- 0.09, N = 15SE +/- 0.04, N = 3SE +/- 0.08, N = 321.0720.2323.45MIN: 20.27 / MAX: 26.62-O3 -march=native - MIN: 20.02 / MAX: 23.8-O3 -march=native -flto - MIN: 23.14 / MAX: 26.981. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O2-O3 -march=native-O3 -march=native -flto612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 324.4621.5421.58-O2-march=native-march=native -flto1. (CC) gcc options: -lm -O3

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O2-O3 -march=native-O3 -march=native -flto30060090012001500SE +/- 6.89, N = 3SE +/- 1.20, N = 3109111981229-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O2-O3 -march=native-O3 -march=native -flto15003000450060007500SE +/- 0.74, N = 3SE +/- 6.62, N = 3SE +/- 3.24, N = 36305.486878.517079.88-O2-march=native-march=native -flto1. (CC) gcc options: -O3 -mavx2

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.02, N = 12SE +/- 0.03, N = 3SE +/- 0.06, N = 39.618.628.91MIN: 9.44 / MAX: 13.78-O3 -march=native - MIN: 8.51 / MAX: 12.11-O3 -march=native -flto - MIN: 8.72 / MAX: 12.481. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.08, N = 15SE +/- 0.21, N = 3SE +/- 0.13, N = 311.1110.2010.27MIN: 10.75 / MAX: 16.77-O3 -march=native - MIN: 9.72 / MAX: 13.93-O3 -march=native -flto - MIN: 9.93 / MAX: 13.871. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 313.7612.9012.71-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-O2-O3 -march=native-O3 -march=native -flto150M300M450M600M750MSE +/- 766753.62, N = 3SE +/- 2160717.47, N = 3SE +/- 2050604.25, N = 3635506667686530000684356667-O2-march=native-march=native -flto1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 3243.42230.02247.89MIN: 241.9 / MAX: 246.46-O3 -march=native - MIN: 229.3 / MAX: 233.4-O3 -march=native -flto - MIN: 247.03 / MAX: 249.921. (CXX) g++ options: -O2 -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto1.21522.43043.64564.86086.076SE +/- 0.03422, N = 3SE +/- 0.02936, N = 3SE +/- 0.01659, N = 35.011995.287265.40080MIN: 4.47MIN: 4.8-flto - MIN: 4.781. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-O2-O3 -march=native-O3 -march=native -flto2004006008001000SE +/- 0.67, N = 3SE +/- 1.53, N = 3106611411072-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 391.3885.4285.42-O3 -march=native-O3 -march=native1. (CXX) g++ options: -O2 -flto -pthread

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 3236.05227.66242.55MIN: 234.65 / MAX: 236.77-O3 -march=native - MIN: 226.71 / MAX: 229.36-O3 -march=native -flto - MIN: 241.93 / MAX: 243.451. (CXX) g++ options: -O2 -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 312.0911.3811.40-O3 -march=native-O3 -march=native1. (CXX) g++ options: -O2 -flto -pthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis-O2-O3 -march=native-O3 -march=native -flto510152025SE +/- 0.05, N = 4SE +/- 0.06, N = 4SE +/- 0.05, N = 421.3321.7122.60-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -std=c99 -lpthread -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.17, N = 5SE +/- 0.01, N = 3SE +/- 0.00, N = 316.6916.5017.42MIN: 16.38MIN: 16.39-flto - MIN: 17.271. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression Speed-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 1.68, N = 3SE +/- 2.26, N = 3SE +/- 3.12, N = 4296.0285.3281.1-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression Speed-O2-O3 -march=native-O3 -march=native -flto12002400360048006000SE +/- 5.74, N = 3SE +/- 15.18, N = 3SE +/- 6.81, N = 45760.95546.05477.9-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression-O2-O3 -march=native-O3 -march=native -flto1.2062.4123.6184.8246.03SE +/- 0.005, N = 3SE +/- 0.014, N = 3SE +/- 0.008, N = 35.3605.1275.103-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speed-O2-O3 -march=native-O3 -march=native -flto10002000300040005000SE +/- 5.61, N = 3SE +/- 8.15, N = 3SE +/- 17.62, N = 34718.14514.84503.1-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughput-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.16, N = 3SE +/- 0.20, N = 3SE +/- 0.41, N = 3261.03273.10272.60-O2-march=native -lm-march=native -flto -lm1. (CC) gcc options: -O3 -rdynamic

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.7511.2411.23MIN: 10.65MIN: 11.15-flto - MIN: 11.141. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.014, N = 3SE +/- 0.012, N = 3SE +/- 0.020, N = 38.7718.4058.454-O2-march=native-march=native -flto1. (CXX) g++ options: -fopenmp -O3

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speed-O2-O3 -march=native-O3 -march=native -flto10002000300040005000SE +/- 4.91, N = 3SE +/- 11.11, N = 3SE +/- 14.15, N = 34777.34582.34579.8-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto306090120150SE +/- 1.53, N = 4SE +/- 1.58, N = 4SE +/- 1.44, N = 5136.31139.13141.83-march=native-march=native -flto1. (CC) gcc options: -O2 -fPIE -fPIC -O3 -pie -rdynamic -lpthread -lrt

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.01, N = 15SE +/- 0.12, N = 3SE +/- 0.28, N = 316.1515.5315.92MIN: 15.95 / MAX: 21.54-O3 -march=native - MIN: 15.19 / MAX: 20.95-O3 -march=native -flto - MIN: 15.55 / MAX: 21.061. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.106, N = 3SE +/- 0.063, N = 15SE +/- 0.055, N = 158.0238.0678.328-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -O2 -pthread -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.31, N = 3160.65164.77166.05-march=native-march=native -flto1. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database Search-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3103.29100.7499.97-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time-O2-O3 -march=native-O3 -march=native -flto6M12M18M24M30MSE +/- 96950.30, N = 3SE +/- 279559.22, N = 3SE +/- 94171.94, N = 3290948192993244129086394-O2-march=native-march=native1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression-O2-O3 -march=native-O3 -march=native -flto714212835SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 327.8427.2627.07-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.06, N = 14SE +/- 0.17, N = 3SE +/- 0.02, N = 311.3011.0811.39MIN: 10.84 / MAX: 14.99-O3 -march=native - MIN: 10.66 / MAX: 16.66-O3 -march=native -flto - MIN: 11.27 / MAX: 15.151. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.53, N = 3SE +/- 0.06, N = 3SE +/- 0.32, N = 387.3086.7084.93-O2-march=native-march=native -flto1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto1.09712.19423.29134.38845.4855SE +/- 0.02350, N = 3SE +/- 0.02065, N = 3SE +/- 0.02041, N = 34.876014.865224.74611MIN: 3.82MIN: 3.82-flto - MIN: 3.71. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto400800120016002000SE +/- 0.68, N = 3SE +/- 1.66, N = 3SE +/- 1.29, N = 31841.631891.711877.51MIN: 1831.74MIN: 1880.74-flto - MIN: 1866.091. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.21, N = 3SE +/- 0.13, N = 15SE +/- 0.15, N = 615.6415.8115.40-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.003, N = 5SE +/- 0.004, N = 5SE +/- 0.003, N = 56.0865.9315.936-O2-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -fvisibility=hidden -logg -lm

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speed-O2-O3 -march=native-O3 -march=native -flto816243240SE +/- 0.15, N = 3SE +/- 0.44, N = 3SE +/- 0.03, N = 334.535.434.8-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto400800120016002000SE +/- 1.67, N = 3SE +/- 0.80, N = 3SE +/- 1.27, N = 31842.141887.611876.25MIN: 1831.93MIN: 1877.87-flto - MIN: 1866.411. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto400800120016002000SE +/- 1.34, N = 3SE +/- 2.13, N = 3SE +/- 1.23, N = 31845.741890.591874.70MIN: 1834.84MIN: 1879.82-flto - MIN: 1865.221. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 1.51, N = 10SE +/- 1.48, N = 10SE +/- 1.49, N = 10191.83195.87195.07-march=native-march=native -flto1. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITE-O2-O3 -march=native-O3 -march=native -flto11002200330044005500SE +/- 32.83, N = 3SE +/- 41.25, N = 3SE +/- 3.18, N = 3500149595058-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4K-O2-O3 -march=native4080120160200SE +/- 0.05, N = 3SE +/- 0.09, N = 3186.75190.31-O2 -lm - MIN: 170.98 / MAX: 196.55-O3 -march=native - MIN: 174.59 / MAX: 201.241. (CC) gcc options: -pthread

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.52, N = 3SE +/- 0.09, N = 3SE +/- 0.22, N = 3273.60278.72278.59-march=native-march=native -flto1. (CC) gcc options: -O2 -fPIE -fPIC -O3 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 0.29, N = 3198.01201.70201.10-march=native-march=native -flto1. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET-O2-O3 -march=native-O3 -march=native -flto600K1200K1800K2400K3000KSE +/- 20903.58, N = 3SE +/- 15075.35, N = 3SE +/- 3890.24, N = 32936296.082980192.002990164.92-O2-march=native-march=native -flto1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-O2-O3 -march=native-O3 -march=native -flto150M300M450M600M750MSE +/- 189414.30, N = 3SE +/- 209549.78, N = 3SE +/- 322714.18, N = 3711343333722893333722393333-O2-march=native-march=native -flto1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto7001400210028003500SE +/- 0.76, N = 3SE +/- 2.46, N = 3SE +/- 3.52, N = 33124.563173.473152.89MIN: 3112.25MIN: 3161.04-flto - MIN: 3137.491. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto7001400210028003500SE +/- 5.44, N = 3SE +/- 1.30, N = 3SE +/- 0.32, N = 33123.643172.193148.67MIN: 3105.42MIN: 3159.8-flto - MIN: 3137.591. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112-O2-O3 -march=native-O3 -march=native -flto6001200180024003000SE +/- 18.09, N = 3SE +/- 21.65, N = 3SE +/- 24.60, N = 32538.252576.972540.191. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto7001400210028003500SE +/- 2.61, N = 3SE +/- 0.26, N = 3SE +/- 3.24, N = 33123.953171.463154.69MIN: 3109.77MIN: 3160.11-flto - MIN: 3138.341. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium-O2-O3 -march=native-O3 -march=native -flto1.18082.36163.54244.72325.904SE +/- 0.0027, N = 3SE +/- 0.0013, N = 3SE +/- 0.0065, N = 35.24815.18205.1705-O3 -march=native-O3 -march=native1. (CXX) g++ options: -O2 -flto -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080p-O2-O3 -march=native160320480640800SE +/- 2.55, N = 3SE +/- 1.03, N = 3727.60717.31-O2 -lm - MIN: 643.78 / MAX: 798.32-O3 -march=native - MIN: 641.13 / MAX: 782.171. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p-O2-O3 -march=native170340510680850SE +/- 1.36, N = 3SE +/- 0.33, N = 3773.93763.05-O2 -lm - MIN: 589.24 / MAX: 1160.82-O3 -march=native - MIN: 584.4 / MAX: 1127.781. (CC) gcc options: -pthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-O2-O3 -march=native-O3 -march=native -flto90K180K270K360K450KSE +/- 1236.61, N = 3SE +/- 1364.82, N = 3SE +/- 166.46, N = 3430127.50432583.96435901.44-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.96931.93862.90793.87724.8465SE +/- 0.01250, N = 3SE +/- 0.01947, N = 3SE +/- 0.00501, N = 34.270774.307984.25176MIN: 4.16MIN: 4.19-flto - MIN: 4.151. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.3712.5112.52MIN: 12.28MIN: 12.43-flto - MIN: 12.411. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16-O2-O3 -march=native-O3 -march=native -flto1224364860SE +/- 0.05, N = 15SE +/- 0.14, N = 3SE +/- 0.13, N = 354.8054.5054.13MIN: 54.15 / MAX: 64-O3 -march=native - MIN: 53.96 / MAX: 58.57-O3 -march=native -flto - MIN: 53.54 / MAX: 59.111. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.33190.66380.99571.32761.6595SE +/- 0.01597, N = 3SE +/- 0.00602, N = 3SE +/- 0.00575, N = 31.467261.457881.47524MIN: 1.37MIN: 1.36-flto - MIN: 1.371. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.71331.42662.13992.85323.5665SE +/- 0.00129, N = 3SE +/- 0.00399, N = 3SE +/- 0.00623, N = 33.135323.170263.13941MIN: 3.07MIN: 3.1-flto - MIN: 3.071. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O2-O3 -march=native-O3 -march=native -flto1020304050SE +/- 0.15, N = 3SE +/- 0.30, N = 3SE +/- 0.13, N = 343.6244.0943.78-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -ldl -lz -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speed-O2-O3 -march=native-O3 -march=native -flto816243240SE +/- 0.22, N = 3SE +/- 0.23, N = 3SE +/- 0.12, N = 332.733.032.8-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -lz

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 314.1714.2914.25MIN: 14.04MIN: 14.18-flto - MIN: 14.141. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.01, N = 15SE +/- 0.01, N = 3SE +/- 0.02, N = 39.639.639.70MIN: 9.47 / MAX: 14.51-O3 -march=native - MIN: 9.56 / MAX: 13.14-O3 -march=native -flto - MIN: 9.56 / MAX: 13.191. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateless-O2-O3 -march=native-O3 -march=native -flto50K100K150K200K250KSE +/- 504.43, N = 3SE +/- 1015.58, N = 3SE +/- 101.47, N = 3239792241439239892-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.16250.3250.48750.650.8125SE +/- 0.001308, N = 3SE +/- 0.002639, N = 3SE +/- 0.001704, N = 30.7178820.7224300.720482MIN: 0.66MIN: 0.67-flto - MIN: 0.671. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed Algorithms-O2-O3 -march=native-O3 -march=native -flto110220330440550SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 3491.64489.76488.63-O2-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -fPIC -pthread -pipe

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET-O2-O3 -march=native-O3 -march=native -flto900K1800K2700K3600K4500KSE +/- 8839.00, N = 3SE +/- 16885.42, N = 3SE +/- 23615.46, N = 34051463.174036791.924060369.08-O2-march=native-march=native -flto1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.91491.82982.74473.65964.5745SE +/- 0.00867, N = 3SE +/- 0.00741, N = 3SE +/- 0.00379, N = 34.044774.066174.04481MIN: 3.93MIN: 3.93-flto - MIN: 3.911. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 317.0617.0617.13MIN: 16.67MIN: 16.72-flto - MIN: 16.731. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.7951.592.3853.183.975SE +/- 0.00099, N = 3SE +/- 0.00143, N = 3SE +/- 0.00147, N = 33.533153.527913.52381MIN: 3.47MIN: 3.45-flto - MIN: 3.461. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.18710.37420.56130.74840.9355SE +/- 0.003232, N = 3SE +/- 0.003135, N = 3SE +/- 0.003541, N = 30.8295640.8296370.831699MIN: 0.81MIN: 0.81-flto - MIN: 0.811. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 511.0811.0811.10-O2-O3 -march=native-O3 -march=native -flto1. (CXX) g++ options: -rdynamic

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.29770.59540.89311.19081.4885SE +/- 0.00212, N = 3SE +/- 0.00166, N = 3SE +/- 0.00175, N = 31.321001.322711.32311MIN: 1.25MIN: 1.26-flto - MIN: 1.261. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateful-O2-O3 -march=native-O3 -march=native -flto2K4K6K8K10KSE +/- 1.67, N = 3SE +/- 6.96, N = 3SE +/- 4.58, N = 3938193899395-O2-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto0.79651.5932.38953.1863.9825SE +/- 0.00185, N = 3SE +/- 0.00232, N = 3SE +/- 0.00020, N = 33.540193.537083.53500MIN: 3.46MIN: 3.41-flto - MIN: 3.441. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU-O2-O3 -march=native-O3 -march=native -flto7K14K21K28K35KSE +/- 0.65, N = 3SE +/- 0.97, N = 3SE +/- 1.11, N = 334799.7034776.0834751.01-O3 -march=native-O3 -march=native -flto1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 316.1716.1816.19MIN: 16.09MIN: 16.09-flto - MIN: 16.091. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.00352, N = 3SE +/- 0.00184, N = 3SE +/- 0.00390, N = 38.576238.575488.57248MIN: 8.42MIN: 8.41-flto - MIN: 8.441. (CXX) g++ options: -O3 -march=native -O2 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time-O3 -march=native-O3 -march=native -flto130026003900520065006172.96171.6-flto1. (CC) gcc options: -O3 -march=native -lm

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface-O2-O3 -march=native-O3 -march=native -flto0.38030.76061.14091.52121.9015SE +/- 0.01, N = 15SE +/- 0.06, N = 3SE +/- 0.01, N = 31.191.191.69MIN: 1.14 / MAX: 5.67-O3 -march=native - MIN: 1.09 / MAX: 2.02-O3 -march=native -flto - MIN: 1.64 / MAX: 2.461. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.4