Intel 10980XE  GCC Compiler Benchmarks

Intel Core i9-10980XE GCC compiler benchmarking by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2107032-IB-10980XECO53&grr.

Intel 10980XE  GCC Compiler BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701Intel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads)ASRock X299 Steel Legend (P1.30 BIOS)Intel Sky Lake-E DMI3 Registers32GBSamsung SSD 970 PRO 512GBNVIDIA NV132 11GBRealtek ALC1220ASUS VP28UIntel I219-V + Intel I211Ubuntu 21.045.11.0-22-generic (x86_64)GNOME Shell 3.38.4X Server + Waylandnouveau4.3 Mesa 21.0.11.0.2GCC 8.5.0ext42560x1600GCC 9.4.0GCC 10.3.0GCC 11.1.0GCC 12.0.0 20210701OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- --disable-multilib --enable-checking=release --enable-languages=c,c++Processor Details- Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Python Details- Python 3.9.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Intel 10980XE  GCC Compiler Benchmarkslibgav1: Chimera 1080p 10-bitcryptopp: Keyed Algorithmssecuremark: SecureMark-TLScryptopp: Integer + Elliptic Curve Public Key Algorithmstnn: CPU - DenseNetaom-av1: Speed 6 Two-Pass - Bosphorus 4Kmrbayes: Primate Phylogeny Analysisaom-av1: Speed 6 Realtime - Bosphorus 4Kgcrypt: mnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: squeezenetv1.1mnn: mobilenetV3pjsip: INVITEgraphics-magick: Rotatengspice: C2670ngspice: C7552libgav1: Summer Nature 4Kvpxenc: Speed 0 - Bosphorus 4Khmmer: Pfam Database Searchcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedsvt-av1: Preset 4 - Bosphorus 4Kstockfish: Total Timeonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUvpxenc: Speed 5 - Bosphorus 4Kfinancebench: Bonds OpenMPcryptopp: Unkeyed Algorithmspjsip: OPTIONS, Statefulgnupg: 2.7GB Sample File Encryptiongraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: Sharpengraphics-magick: Swirlgraphics-magick: HWB Color Spacedav1d: Chimera 1080p 10-bitsqlite-speedtest: Timed Time - Size 1,000compress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedsvt-av1: Preset 8 - Bosphorus 4Kblosc: blosclzncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenettachyon: Total Timesvt-hevc: 1 - Bosphorus 1080pfinancebench: Repo OpenMPhimeno: Poisson Pressure Solvercompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-7zip: Compress Speed Testespeak: Text-To-Speech Synthesiscompress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedcoremark: CoreMark Size 666 - Iterations Per Secondwebp: Quality 100, Lossless, Highest Compressionaom-av1: Speed 8 Realtime - Bosphorus 4Kbotan: AES-256 - Decryptbotan: AES-256quantlib: botan: ChaCha20Poly1305 - Decryptbotan: ChaCha20Poly1305c-ray: Total Time - 4K, 16 Rays Per Pixelbotan: Blowfish - Decryptbotan: Blowfishbotan: Twofish - Decryptbotan: Twofishbotan: CAST-256 - Decryptbotan: CAST-256botan: KASUMI - Decryptbotan: KASUMIkvazaar: Bosphorus 4K - Very Fastx265: Bosphorus 4Ktjbench: Decompression Throughputetcpak: ETC2aom-av1: Speed 9 Realtime - Bosphorus 4Kdav1d: Summer Nature 4Kviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYtnn: CPU - MobileNet v2encode-wavpack: WAV To WavPackonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUvosk: tnn: CPU - SqueezeNet v1.1liquid-dsp: 32 - 256 - 57liquid-dsp: 36 - 256 - 57pjsip: OPTIONS, Statelesswebp: Quality 100, Losslessonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUkvazaar: Bosphorus 4K - Ultra Fastetcpak: ETC1 + Ditheringencode-flac: WAV To FLACencode-opus: WAV To Opus Encodeonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUencode-mp3: WAV To MP3svt-vp9: VMAF Optimized - Bosphorus 1080pwebp: Quality 100, Highest Compressiononednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUsmallpt: Global Illumination Renderer; 128 Samplestnn: CPU - SqueezeNet v2svt-hevc: 7 - Bosphorus 1080petcpak: DXT1onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUsvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701714.4876692598695519.1442123505.9314.46146.0967.51194.12130.1542.4333.7045.59427.2974.5642.4033307809134.346126.2004.75126.2043547.4335.51.340511400131573.181573.03937.607939.9238.2274820.187500377.780902580164.2394241444390197792903215.1657.6062676.860.311.92511800.713.7715.6320.8418.339.0510.5436.2012.732.566.694.825.054.725.3614.1648.161112.8842354.8984374522.7311702819.243.39971126.8363361.7425.7618050.83000937.69519.403991.2013998.6272586.3977.165984.01929.818482.780491.138420.366416.184152.429152.23597.64499.71621.3521.53218.813581198.10927.18197.4655.256.054.755.579.969.563.657.138.177.068.845.9321.38713.3559.828580.46145920.721289.96392463000091679000013530217.2025.540620.52841640.13329.1288.4368.2830.4415001.763087.890322.949031.232398.716306.086.6420.6794829.347665.28670.124190.031445.23511.0180247.45310.79374.62719.7624842634575538.5470393527.6824.44157.3327.43193.97332.4122.4553.5945.54031.4524.4202.2973252776134.222127.6884.87126.5063628.4375.61.355506225521639.781636.96960.128961.3838.6176452.778646375.231492572964.2284241617387265752864219.6957.0342802.760.212.02111802.613.8715.0821.3517.599.1510.5836.0112.442.546.574.734.974.575.1013.9447.884912.9142795.0533854609.3985633017.643.59829828.1073436.8429.4650499.47488836.86919.343993.8903987.0802568.8945.449951.15230.035475.408486.315411.573414.445150.879150.58598.557100.41121.2021.64220.617787199.09727.24199.8954.956.254.156.179.771.863.757.438.178.071.247.1347.43613.3759.791850.45945620.993296.14293023666792167000013590316.8235.595330.52679840.16336.1068.5008.2410.4675941.791508.100382.860081.279918.525295.076.5600.8288669.554556.04573.938190.621450.56010.9596243.48302.25376.89717.1655592634725503.0786063508.2904.48154.6297.38193.41130.8042.4673.6695.61228.2554.2712.3413281765133.912126.8774.84126.6873479.0370.11.359499422761566.751565.41935.962938.2208.5949799.115885372.583041574464.4244271585403317761864222.2757.3032701.960.112.06211713.413.6315.2621.2617.879.1811.1636.3513.062.616.624.805.024.645.1213.8147.904513.0435458.6627614538.6619612876.943.99742632.6723285.5424.0630485.58851037.86919.283998.4793999.2482529.8780.123788.57330.430474.804486.288411.522404.157151.248151.18496.87998.20021.0921.84219.542799197.62927.86195.1557.158.956.458.779.671.654.757.338.377.470.346.4311.60413.3409.390620.46048220.753286.21593971666794066000013822217.2515.533340.52601440.64329.4868.3698.2830.4269021.755207.901632.939781.227038.730299.976.8660.6805229.350956.13069.297189.881484.23010.9199246.75306.89372.69692.9863992595655593.1304393508.3904.33142.9347.51208.34331.2752.4773.7545.72328.4454.6092.4123240852134.711129.3964.87126.6163553.2469.41.347522069631566.251563.05938.043936.8808.6348802.755208360.414192576964.2044321571403319924916223.0657.4112642.261.012.09011926.813.9315.2321.3717.779.0810.9736.5612.842.556.564.725.054.665.0613.7947.865912.9634558.2239584592.9474012775.744.39814935.0863351.9419.5597455.16081236.47119.643995.2323985.2662749.1774.645779.37629.960439.555442.670373.562367.677140.850141.03197.001100.69721.0121.16218.618162194.75928.07192.9550.551.949.851.079.371.763.757.438.377.670.746.7314.56113.3539.413460.45995620.894288.91095117000095453666713644416.8035.545440.52813340.31327.3298.4118.4560.4306761.762647.904772.934281.230128.732297.326.7800.6970809.355506.20169.802191.021468.96810.9199244.05305.72375.6521.32714.2052692645145532.4679403524.7464.40145.1747.49196.22230.7972.4773.7295.50628.2724.2832.3723304794135.428128.32028.244.92129.4543531.4385.51.348507345711564.711567.41936.491935.3788.6548317.579427374.640795576364.6134291607403319903884221.9456.7152773.960.811.96611889.413.4815.2822.8317.748.9910.9336.7012.212.546.274.584.984.544.8913.8149.323712.9434223.3033864580.1880732782.743.99849327.2813332.7432.7601830.26329237.16119.673993.1733972.0202773.6775.488781.38329.973442.410442.265374.707366.648140.480140.30996.610100.60921.1521.08217.425816197.12828.10194.0651.551.949.851.279.571.463.757.238.277.170.146.4318.05613.3319.594520.45966020.574287.49094443333393755333313796616.8925.536720.52793940.20325.2788.3798.1860.4256751.744717.912582.946701.224968.599293.716.5480.6786269.342065.99169.231191.331419.36510.9330244.56303.54375.32OpenBenchmarking.org

libgav1

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.16.3Video Input: Chimera 1080p 10-bitGCC 12.0.0 20210701510152025SE +/- 0.01, N = 321.321. (CXX) g++ options: -O3 -march=native -lpthread -lrt

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701160320480640800SE +/- 0.10, N = 3SE +/- 0.35, N = 3SE +/- 0.48, N = 3SE +/- 0.20, N = 3SE +/- 0.24, N = 3714.49719.76717.17692.99714.211. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070160K120K180K240K300KSE +/- 106.27, N = 3SE +/- 247.29, N = 3SE +/- 95.66, N = 3SE +/- 68.80, N = 3SE +/- 101.43, N = 32598692634572634722595652645141. (CC) gcc options: -pedantic -O3

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key AlgorithmsGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070112002400360048006000SE +/- 1.33, N = 3SE +/- 6.16, N = 3SE +/- 1.85, N = 3SE +/- 1.99, N = 3SE +/- 5.19, N = 35519.145538.555503.085593.135532.471. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107018001600240032004000SE +/- 0.16, N = 3SE +/- 2.55, N = 3SE +/- 0.59, N = 3SE +/- 0.77, N = 3SE +/- 0.21, N = 33505.933527.683508.293508.393524.75MIN: 3487.54 / MAX: 3535.34MIN: 3508.67 / MAX: 3981.67MIN: 3489.27 / MAX: 3603.98MIN: 3486.98 / MAX: 3606.8MIN: 3509.67 / MAX: 3548.511. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.0082.0163.0244.0325.04SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 104.464.444.484.334.401. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.45, N = 3SE +/- 1.65, N = 12SE +/- 0.58, N = 3SE +/- 0.32, N = 3SE +/- 1.02, N = 3146.10157.33154.63142.93145.171. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -march=native -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.05, N = 3SE +/- 0.08, N = 15SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 127.517.437.387.517.491. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070150100150200250SE +/- 0.27, N = 3SE +/- 0.21, N = 3SE +/- 0.44, N = 3SE +/- 0.18, N = 3SE +/- 0.33, N = 3194.12193.97193.41208.34196.221. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: inception-v3GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701816243240SE +/- 0.10, N = 3SE +/- 0.28, N = 15SE +/- 0.42, N = 3SE +/- 0.42, N = 3SE +/- 0.43, N = 330.1532.4130.8031.2830.80MIN: 29.77 / MAX: 30.53MIN: 29.17 / MAX: 33.89MIN: 30.12 / MAX: 31.88MIN: 30.28 / MAX: 31.97MIN: 30.14 / MAX: 31.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenet-v1-1.0GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.55731.11461.67192.22922.7865SE +/- 0.025, N = 3SE +/- 0.012, N = 15SE +/- 0.025, N = 3SE +/- 0.024, N = 3SE +/- 0.029, N = 32.4332.4552.4672.4772.477MIN: 2.32 / MAX: 2.62MIN: 2.23 / MAX: 3.16MIN: 2.32 / MAX: 2.68MIN: 2.3 / MAX: 2.65MIN: 2.3 / MAX: 2.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: MobileNetV2_224GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.84471.68942.53413.37884.2235SE +/- 0.069, N = 3SE +/- 0.040, N = 15SE +/- 0.090, N = 3SE +/- 0.018, N = 3SE +/- 0.058, N = 33.7043.5943.6693.7543.729MIN: 3.31 / MAX: 3.95MIN: 3.07 / MAX: 4.08MIN: 3.42 / MAX: 4.19MIN: 3.47 / MAX: 3.95MIN: 3.41 / MAX: 3.941. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: SqueezeNetV1.0GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.28772.57543.86315.15086.4385SE +/- 0.056, N = 3SE +/- 0.039, N = 15SE +/- 0.083, N = 3SE +/- 0.020, N = 3SE +/- 0.079, N = 35.5945.5405.6125.7235.506MIN: 5.4 / MAX: 5.85MIN: 5.06 / MAX: 6.72MIN: 5.24 / MAX: 6.01MIN: 5.47 / MAX: 6.72MIN: 5.22 / MAX: 5.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: resnet-v2-50GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701714212835SE +/- 0.10, N = 3SE +/- 0.47, N = 15SE +/- 0.24, N = 3SE +/- 0.06, N = 3SE +/- 0.19, N = 327.3031.4528.2628.4528.27MIN: 26.86 / MAX: 27.93MIN: 24.41 / MAX: 36.24MIN: 27.6 / MAX: 28.76MIN: 27.77 / MAX: 28.83MIN: 27.7 / MAX: 28.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: squeezenetv1.1GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.0372.0743.1114.1485.185SE +/- 0.036, N = 3SE +/- 0.061, N = 15SE +/- 0.162, N = 3SE +/- 0.007, N = 3SE +/- 0.149, N = 34.5644.4204.2714.6094.283MIN: 4.42 / MAX: 4.75MIN: 3.98 / MAX: 4.76MIN: 3.97 / MAX: 4.72MIN: 4.51 / MAX: 4.78MIN: 3.97 / MAX: 4.711. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenetV3GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.54271.08541.62812.17082.7135SE +/- 0.012, N = 3SE +/- 0.015, N = 15SE +/- 0.011, N = 3SE +/- 0.032, N = 3SE +/- 0.011, N = 32.4032.2972.3412.4122.372MIN: 2.28 / MAX: 2.54MIN: 1.96 / MAX: 2.53MIN: 2.16 / MAX: 2.53MIN: 2.23 / MAX: 2.61MIN: 2.25 / MAX: 2.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITEGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107017001400210028003500SE +/- 25.16, N = 15SE +/- 27.02, N = 3SE +/- 6.36, N = 3SE +/- 7.22, N = 3SE +/- 25.40, N = 15330732523281324033041. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 7.07, N = 15SE +/- 3.51, N = 3SE +/- 5.24, N = 3SE +/- 2.52, N = 3SE +/- 7.25, N = 158097767658527941. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.81, N = 3SE +/- 0.71, N = 3SE +/- 1.13, N = 3SE +/- 0.92, N = 3SE +/- 0.83, N = 3134.35134.22133.91134.71135.431. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 1.35, N = 3SE +/- 1.54, N = 3SE +/- 1.51, N = 3SE +/- 0.26, N = 3SE +/- 1.05, N = 3126.20127.69126.88129.40128.321. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

libgav1

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.16.3Video Input: Summer Nature 4KGCC 12.0.0 20210701714212835SE +/- 0.01, N = 328.241. (CXX) g++ options: -O3 -march=native -lpthread -lrt

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.1072.2143.3214.4285.535SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 34.754.874.844.874.921. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database SearchGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.32, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3126.20126.51126.69126.62129.451. (CC) gcc options: -O3 -march=native -pthread -lhmmer -leasel -lm -lmpi

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107018001600240032004000SE +/- 5.55, N = 3SE +/- 2.52, N = 13SE +/- 3.49, N = 15SE +/- 3.35, N = 3SE +/- 3.04, N = 153547.43628.43479.03553.23531.41. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701100200300400500SE +/- 3.63, N = 3SE +/- 2.56, N = 13SE +/- 3.28, N = 15SE +/- 3.45, N = 3SE +/- 3.88, N = 15335.5375.6370.1469.4385.51. (CC) gcc options: -O3 -march=native -pthread -lz

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.30580.61160.91741.22321.529SE +/- 0.005, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 31.3401.3551.3591.3471.3481. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070111M22M33M44M55MSE +/- 562135.06, N = 15SE +/- 508293.85, N = 15SE +/- 623930.99, N = 3SE +/- 211973.29, N = 3SE +/- 432778.20, N = 851140013506225524994227652206963507345711. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701400800120016002000SE +/- 0.81, N = 3SE +/- 1.53, N = 3SE +/- 1.06, N = 3SE +/- 3.72, N = 3SE +/- 1.54, N = 31573.181639.781566.751566.251564.71MIN: 1566.72MIN: 1630.86MIN: 1558.77MIN: 1553.36MIN: 1557.511. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701400800120016002000SE +/- 1.09, N = 3SE +/- 2.09, N = 3SE +/- 0.93, N = 3SE +/- 2.29, N = 3SE +/- 0.75, N = 31573.031636.961565.411563.051567.41MIN: 1566.77MIN: 1629.18MIN: 1559.12MIN: 1555.64MIN: 1561.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 0.50, N = 3SE +/- 0.63, N = 3SE +/- 0.69, N = 3SE +/- 0.60, N = 3SE +/- 0.46, N = 3937.61960.13935.96938.04936.49MIN: 932.91MIN: 955.07MIN: 930.6MIN: 933.04MIN: 932.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 1.32, N = 3SE +/- 1.46, N = 3SE +/- 0.16, N = 3SE +/- 0.70, N = 3SE +/- 0.15, N = 3939.92961.38938.22936.88935.38MIN: 933.27MIN: 955.17MIN: 930.52MIN: 931.54MIN: 931.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.228.618.598.638.651. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070116K32K48K64K80KSE +/- 5.53, N = 3SE +/- 1061.39, N = 3SE +/- 11.94, N = 3SE +/- 36.89, N = 3SE +/- 48.36, N = 374820.1976452.7849799.1248802.7648317.581. (CXX) g++ options: -O3 -march=native -fopenmp

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070180160240320400SE +/- 0.02, N = 3SE +/- 0.65, N = 3SE +/- 0.17, N = 3SE +/- 0.04, N = 3SE +/- 0.68, N = 3377.78375.23372.58360.41374.641. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatefulGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070112002400360048006000SE +/- 16.26, N = 3SE +/- 24.67, N = 3SE +/- 8.67, N = 3SE +/- 53.69, N = 3SE +/- 23.13, N = 3580157295744576957631. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File EncryptionGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011428425670SE +/- 0.17, N = 3SE +/- 0.19, N = 3SE +/- 0.36, N = 3SE +/- 0.23, N = 3SE +/- 0.56, N = 364.2464.2364.4264.2064.611. (CC) gcc options: -O3 -march=native

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701901802703604504244244274324291. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070130060090012001500SE +/- 2.91, N = 3SE +/- 4.93, N = 3SE +/- 8.67, N = 3SE +/- 3.51, N = 3SE +/- 7.21, N = 3144416171585157116071. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070190180270360450SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 33903874034034031. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070170140210280350SE +/- 0.33, N = 31972653173193191. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 37927527619249031. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 1.15, N = 3SE +/- 1.33, N = 3SE +/- 0.88, N = 39038648649168841. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.0Video Input: Chimera 1080p 10-bitGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070150100150200250SE +/- 0.39, N = 3SE +/- 0.34, N = 3SE +/- 0.50, N = 3SE +/- 1.18, N = 3SE +/- 0.48, N = 3215.16219.69222.27223.06221.94-lm - MIN: 151.62 / MAX: 411.26-lm - MIN: 156.35 / MAX: 406.23MIN: 157.09 / MAX: 436.51MIN: 157.45 / MAX: 397.96MIN: 157.38 / MAX: 404.981. (CC) gcc options: -O3 -march=native -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011326395265SE +/- 0.22, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 357.6157.0357.3057.4156.721. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107016001200180024003000SE +/- 3.24, N = 3SE +/- 14.28, N = 3SE +/- 9.76, N = 3SE +/- 7.69, N = 9SE +/- 16.86, N = 32676.82802.72701.92642.22773.91. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011428425670SE +/- 0.19, N = 3SE +/- 0.50, N = 3SE +/- 0.56, N = 3SE +/- 0.50, N = 9SE +/- 0.47, N = 360.360.260.161.060.81. (CC) gcc options: -O3 -march=native -pthread -lz

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 311.9312.0212.0612.0911.971. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

C-Blosc

Compressor: blosclz

OpenBenchmarking.orgMB/s, More Is BetterC-Blosc 2.0Compressor: blosclzGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013K6K9K12K15KSE +/- 11.42, N = 3SE +/- 18.15, N = 3SE +/- 21.15, N = 3SE +/- 69.56, N = 3SE +/- 37.20, N = 311800.711802.611713.411926.811889.41. (CC) gcc options: -std=gnu99 -O3 -pthread -lrt -lm

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: regnety_400mGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070148121620SE +/- 0.14, N = 3SE +/- 0.21, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 313.7713.8713.6313.9313.48MIN: 12.93 / MAX: 14.62MIN: 13.18 / MAX: 15.01MIN: 13.01 / MAX: 14.57MIN: 13.17 / MAX: 14.48MIN: 13.11 / MAX: 14.041. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: squeezenet_ssdGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070148121620SE +/- 0.36, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 315.6315.0815.2615.2315.28MIN: 15.02 / MAX: 16.85MIN: 14.88 / MAX: 21.62MIN: 14.97 / MAX: 18.92MIN: 14.88 / MAX: 17.07MIN: 14.89 / MAX: 16.11. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: yolov4-tinyGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.25, N = 3SE +/- 0.19, N = 3SE +/- 0.17, N = 3SE +/- 0.10, N = 3SE +/- 1.86, N = 320.8421.3521.2621.3722.83MIN: 19.92 / MAX: 24.91MIN: 20.42 / MAX: 33.9MIN: 20 / MAX: 24.4MIN: 20.44 / MAX: 22.72MIN: 20.18 / MAX: 937.41. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: resnet50GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.30, N = 3SE +/- 0.29, N = 3SE +/- 0.29, N = 3SE +/- 0.24, N = 3SE +/- 0.24, N = 318.3317.5917.8717.7717.74MIN: 17.58 / MAX: 24.57MIN: 17.07 / MAX: 18.69MIN: 17.16 / MAX: 18.62MIN: 17.07 / MAX: 28.68MIN: 17.09 / MAX: 18.961. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: alexnetGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 39.059.159.189.088.99MIN: 8.96 / MAX: 19.48MIN: 9.08 / MAX: 9.59MIN: 9.11 / MAX: 9.74MIN: 9 / MAX: 11.81MIN: 8.73 / MAX: 9.391. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: resnet18GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 310.5410.5811.1610.9710.93MIN: 10.19 / MAX: 17.98MIN: 10.2 / MAX: 11.57MIN: 11.03 / MAX: 11.45MIN: 10.84 / MAX: 20.49MIN: 10.84 / MAX: 11.271. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: vgg16GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701816243240SE +/- 0.37, N = 3SE +/- 0.49, N = 3SE +/- 0.47, N = 3SE +/- 0.52, N = 3SE +/- 0.53, N = 336.2036.0136.3536.5636.70MIN: 35.36 / MAX: 47.25MIN: 35.37 / MAX: 37.68MIN: 35.3 / MAX: 37.7MIN: 35.42 / MAX: 58.41MIN: 35.5 / MAX: 41.991. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: googlenetGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.31, N = 312.7312.4413.0612.8412.21MIN: 12.1 / MAX: 19.86MIN: 12 / MAX: 13.22MIN: 12.84 / MAX: 14.29MIN: 12.68 / MAX: 16.73MIN: 11.77 / MAX: 13.021. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: blazefaceGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.58731.17461.76192.34922.9365SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 32.562.542.612.552.54MIN: 2.5 / MAX: 3.31MIN: 2.45 / MAX: 3.32MIN: 2.47 / MAX: 3.3MIN: 2.47 / MAX: 3.17MIN: 2.46 / MAX: 3.121. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: efficientnet-b0GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 36.696.576.626.566.27MIN: 6.33 / MAX: 10.85MIN: 6.28 / MAX: 14.64MIN: 6.24 / MAX: 24.41MIN: 6.27 / MAX: 11.76MIN: 6.05 / MAX: 14.371. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: mnasnetGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.08452.1693.25354.3385.4225SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 34.824.734.804.724.58MIN: 4.55 / MAX: 12.4MIN: 4.44 / MAX: 11.54MIN: 4.42 / MAX: 16.22MIN: 4.46 / MAX: 10.72MIN: 4.39 / MAX: 10.871. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: shufflenet-v2GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.13632.27263.40894.54525.6815SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 35.054.975.025.054.98MIN: 4.83 / MAX: 14.14MIN: 4.8 / MAX: 8.9MIN: 4.83 / MAX: 15.94MIN: 4.88 / MAX: 9.37MIN: 4.88 / MAX: 8.61. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU-v3-v3 - Model: mobilenet-v3GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.0622.1243.1864.2485.31SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 34.724.574.644.664.54MIN: 4.49 / MAX: 7.39MIN: 4.37 / MAX: 9.05MIN: 4.36 / MAX: 10.01MIN: 4.46 / MAX: 12.92MIN: 4.37 / MAX: 11.471. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU-v2-v2 - Model: mobilenet-v2GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.2062.4123.6184.8246.03SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 35.365.105.125.064.89MIN: 4.96 / MAX: 8.4MIN: 4.73 / MAX: 10.4MIN: 4.76 / MAX: 8.89MIN: 4.72 / MAX: 10.01MIN: 4.72 / MAX: 10.071. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: mobilenetGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070148121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 314.1613.9413.8113.7913.81MIN: 13.93 / MAX: 14.76MIN: 13.68 / MAX: 22.27MIN: 13.51 / MAX: 20.36MIN: 13.61 / MAX: 14.23MIN: 13.64 / MAX: 14.541. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011122334455SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.26, N = 3SE +/- 0.18, N = 348.1647.8847.9047.8749.321. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.8812.9113.0412.9612.941. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107019K18K27K36K45KSE +/- 34.41, N = 3SE +/- 10.60, N = 3SE +/- 120.35, N = 3SE +/- 43.71, N = 3SE +/- 22.03, N = 342354.9042795.0535458.6634558.2234223.301. (CXX) g++ options: -O3 -march=native -fopenmp

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070110002000300040005000SE +/- 0.61, N = 3SE +/- 5.70, N = 3SE +/- 13.08, N = 3SE +/- 0.64, N = 3SE +/- 2.82, N = 34522.734609.404538.664592.954580.191. (CC) gcc options: -O3 -march=native -mavx2

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107016001200180024003000SE +/- 10.70, N = 3SE +/- 4.29, N = 3SE +/- 2.75, N = 3SE +/- 14.84, N = 3SE +/- 2.25, N = 32819.23017.62876.92775.72782.71. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011020304050SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.15, N = 343.343.543.944.343.91. (CC) gcc options: -O3 -march=native -pthread -lz

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070120K40K60K80K100KSE +/- 69.57, N = 3SE +/- 343.09, N = 3SE +/- 46.23, N = 3SE +/- 304.44, N = 3SE +/- 228.39, N = 399711982989742698149984931. (CXX) g++ options: -pipe -lpthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701816243240SE +/- 0.19, N = 4SE +/- 0.23, N = 4SE +/- 0.09, N = 4SE +/- 0.16, N = 4SE +/- 0.18, N = 426.8428.1132.6735.0927.281. (CC) gcc options: -O3 -march=native -std=c99

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Decompression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107017001400210028003500SE +/- 2.28, N = 3SE +/- 2.27, N = 5SE +/- 3.06, N = 3SE +/- 5.37, N = 3SE +/- 2.91, N = 33361.73436.83285.53351.93332.71. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression SpeedGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070190180270360450SE +/- 4.99, N = 3SE +/- 4.23, N = 5SE +/- 4.87, N = 3SE +/- 5.64, N = 3SE +/- 5.56, N = 3425.7429.4424.0419.5432.71. (CC) gcc options: -O3 -march=native -pthread -lz

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701140K280K420K560K700KSE +/- 3624.80, N = 3SE +/- 1621.16, N = 3SE +/- 2003.80, N = 3SE +/- 1267.23, N = 3SE +/- 2406.19, N = 3618050.83650499.47630485.59597455.16601830.261. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701918273645SE +/- 0.11, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 337.7036.8737.8736.4737.161. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 319.4019.3419.2819.6419.671. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107019001800270036004500SE +/- 1.28, N = 3SE +/- 0.39, N = 3SE +/- 0.87, N = 3SE +/- 0.79, N = 3SE +/- 4.17, N = 33991.203993.893998.483995.233993.171. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107019001800270036004500SE +/- 0.87, N = 3SE +/- 0.82, N = 3SE +/- 4.08, N = 3SE +/- 2.67, N = 3SE +/- 7.30, N = 33998.633987.083999.253985.273972.021. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107016001200180024003000SE +/- 19.67, N = 3SE +/- 35.29, N = 3SE +/- 18.68, N = 3SE +/- 33.86, N = 4SE +/- 0.85, N = 32586.32568.82529.82749.12773.61. (CXX) g++ options: -O3 -march=native -rdynamic

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 2.38, N = 3SE +/- 0.08, N = 3SE +/- 0.79, N = 3SE +/- 0.89, N = 3SE +/- 0.47, N = 3977.17945.45780.12774.65775.491. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107012004006008001000SE +/- 2.79, N = 3SE +/- 1.36, N = 3SE +/- 0.15, N = 3SE +/- 1.05, N = 3SE +/- 0.73, N = 3984.02951.15788.57779.38781.381. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701714212835SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 329.8230.0430.4329.9629.971. (CC) gcc options: -lm -lpthread -O3 -march=native

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701100200300400500SE +/- 0.04, N = 3SE +/- 0.21, N = 3SE +/- 0.30, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 3482.78475.41474.80439.56442.411. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701110220330440550SE +/- 0.01, N = 3SE +/- 0.29, N = 3SE +/- 0.22, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3491.14486.32486.29442.67442.271. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070190180270360450SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 0.56, N = 3SE +/- 0.17, N = 3420.37411.57411.52373.56374.711. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070190180270360450SE +/- 0.33, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 1.13, N = 3SE +/- 0.30, N = 3416.18414.45404.16367.68366.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.50, N = 3SE +/- 0.28, N = 3152.43150.88151.25140.85140.481. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.30, N = 3152.24150.59151.18141.03140.311. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070120406080100SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 397.6498.5696.8897.0096.611. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070120406080100SE +/- 0.01, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.25, N = 399.72100.4198.20100.70100.611. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 321.3521.2021.0921.0121.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 321.5321.6421.8421.1621.081. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070150100150200250SE +/- 0.03, N = 3SE +/- 0.84, N = 3SE +/- 0.30, N = 3SE +/- 0.47, N = 3SE +/- 0.26, N = 3218.81220.62219.54218.62217.431. (CC) gcc options: -O3 -march=native -rdynamic

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107014080120160200SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3198.11199.10197.63194.76197.131. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701714212835SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 327.1827.2427.8628.0728.101. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.0Video Input: Summer Nature 4KGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107014080120160200SE +/- 0.89, N = 3SE +/- 1.65, N = 3SE +/- 2.16, N = 3SE +/- 1.13, N = 3SE +/- 1.97, N = 6197.46199.89195.15192.95194.06-lm - MIN: 150.4 / MAX: 226.05-lm - MIN: 143.79 / MAX: 228.44MIN: 149.2 / MAX: 222.59MIN: 132.83 / MAX: 217.9MIN: 131.48 / MAX: 225.931. (CC) gcc options: -O3 -march=native -pthread

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011326395265SE +/- 0.32, N = 3SE +/- 0.07, N = 3SE +/- 0.70, N = 2SE +/- 0.20, N = 3SE +/- 0.09, N = 355.254.957.150.551.51. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011326395265SE +/- 1.55, N = 2SE +/- 0.36, N = 3SE +/- 1.17, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 356.056.258.951.951.91. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011326395265SE +/- 0.25, N = 3SE +/- 0.27, N = 3SE +/- 0.48, N = 3SE +/- 0.15, N = 3SE +/- 0.37, N = 354.754.156.449.849.81. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011326395265SE +/- 0.75, N = 3SE +/- 0.13, N = 3SE +/- 0.63, N = 3SE +/- 0.15, N = 3SE +/- 0.44, N = 355.556.158.751.051.21. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070120406080100SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.38, N = 3SE +/- 0.09, N = 379.979.779.679.379.51. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011632486480SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.31, N = 369.571.871.671.771.41. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011428425670SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 9.10, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 363.663.754.763.763.71. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011326395265SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 357.157.457.357.457.21. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701918273645SE +/- 0.00, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 338.138.138.338.338.21. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070120406080100SE +/- 0.42, N = 3SE +/- 0.26, N = 3SE +/- 0.33, N = 3SE +/- 0.43, N = 3SE +/- 0.49, N = 377.078.077.477.677.11. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011632486480SE +/- 1.48, N = 3SE +/- 0.12, N = 3SE +/- 0.30, N = 3SE +/- 0.31, N = 3SE +/- 0.35, N = 368.871.270.370.770.11. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011122334455SE +/- 0.27, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 345.947.146.446.746.41. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070180160240320400SE +/- 0.29, N = 3SE +/- 0.34, N = 3SE +/- 0.31, N = 3SE +/- 0.19, N = 3SE +/- 0.17, N = 3321.39347.44311.60314.56318.06MIN: 319.29 / MAX: 341.28MIN: 345.68 / MAX: 356.59MIN: 309.73 / MAX: 322.67MIN: 312.66 / MAX: 328.44MIN: 316.44 / MAX: 326.161. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 513.3613.3813.3413.3513.331. (CXX) g++ options: -O3 -march=native -rdynamic

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.02091, N = 3SE +/- 0.00795, N = 3SE +/- 0.02137, N = 3SE +/- 0.01742, N = 3SE +/- 0.02038, N = 39.828589.791859.390629.413469.59452MIN: 9.53MIN: 9.62MIN: 9.29MIN: 9.27MIN: 9.41. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.10380.20760.31140.41520.519SE +/- 0.000631, N = 3SE +/- 0.001062, N = 3SE +/- 0.001585, N = 3SE +/- 0.000687, N = 3SE +/- 0.001427, N = 30.4614590.4594560.4604820.4599560.459660MIN: 0.45MIN: 0.45MIN: 0.45MIN: 0.45MIN: 0.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

VOSK Speech Recognition Toolkit

OpenBenchmarking.orgSeconds, Fewer Is BetterVOSK Speech Recognition Toolkit 0.3.21GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.26, N = 3SE +/- 0.07, N = 320.7220.9920.7520.8920.57

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070160120180240300SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.63, N = 3SE +/- 0.19, N = 3289.96296.14286.22288.91287.49MIN: 288.43 / MAX: 291.61MIN: 294.66 / MAX: 298.56MIN: 285.02 / MAX: 287.82MIN: 286.05 / MAX: 294.45MIN: 285.88 / MAX: 299.621. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701200M400M600M800M1000MSE +/- 353836.12, N = 3SE +/- 539269.05, N = 3SE +/- 2904171.33, N = 3SE +/- 4781007.56, N = 3SE +/- 4623189.13, N = 39246300009302366679397166679511700009444333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 36 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 36 - Buffer Length: 256 - Filter Length: 57GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701200M400M600M800M1000MSE +/- 120554.28, N = 3SE +/- 588132.64, N = 3SE +/- 272213.15, N = 3SE +/- 1013283.99, N = 3SE +/- 1056729.76, N = 39167900009216700009406600009545366679375533331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatelessGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070130K60K90K120K150KSE +/- 578.10, N = 3SE +/- 1635.40, N = 4SE +/- 1006.09, N = 3SE +/- 946.90, N = 3SE +/- 734.59, N = 31353021359031382221364441379661. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070148121620SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 317.2016.8217.2516.8016.891. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011.25892.51783.77675.03566.2945SE +/- 0.02371, N = 3SE +/- 0.02285, N = 3SE +/- 0.01959, N = 3SE +/- 0.02335, N = 3SE +/- 0.02089, N = 35.540625.595335.533345.545445.53672MIN: 5.4MIN: 5.45MIN: 5.38MIN: 5.4MIN: 5.381. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.11890.23780.35670.47560.5945SE +/- 0.003001, N = 3SE +/- 0.003361, N = 3SE +/- 0.003324, N = 3SE +/- 0.003036, N = 3SE +/- 0.003225, N = 30.5284160.5267980.5260140.5281330.527939MIN: 0.5MIN: 0.5MIN: 0.5MIN: 0.5MIN: 0.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701918273645SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.15, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 340.1340.1640.6440.3140.201. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070170140210280350SE +/- 2.72, N = 3SE +/- 0.09, N = 3SE +/- 0.38, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 3329.13336.11329.49327.33325.281. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.012, N = 5SE +/- 0.006, N = 5SE +/- 0.004, N = 5SE +/- 0.009, N = 5SE +/- 0.014, N = 58.4368.5008.3698.4118.3791. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.011, N = 5SE +/- 0.021, N = 5SE +/- 0.031, N = 5SE +/- 0.013, N = 5SE +/- 0.011, N = 58.2838.2418.2838.4568.1861. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.10520.21040.31560.42080.526SE +/- 0.003487, N = 3SE +/- 0.002588, N = 3SE +/- 0.004576, N = 4SE +/- 0.004665, N = 3SE +/- 0.000391, N = 30.4415000.4675940.4269020.4306760.425675MIN: 0.41MIN: 0.44MIN: 0.4MIN: 0.41MIN: 0.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.40310.80621.20931.61242.0155SE +/- 0.00719, N = 3SE +/- 0.00782, N = 3SE +/- 0.00494, N = 3SE +/- 0.00508, N = 3SE +/- 0.00613, N = 31.763081.791501.755201.762641.74471MIN: 1.7MIN: 1.74MIN: 1.69MIN: 1.69MIN: 1.661. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.02602, N = 3SE +/- 0.17046, N = 14SE +/- 0.03159, N = 3SE +/- 0.03513, N = 3SE +/- 0.03413, N = 37.890328.100387.901637.904777.91258MIN: 7.58MIN: 7.58MIN: 7.61MIN: 7.56MIN: 7.631. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.66351.3271.99052.6543.3175SE +/- 0.01360, N = 3SE +/- 0.01601, N = 3SE +/- 0.01261, N = 3SE +/- 0.01584, N = 3SE +/- 0.01797, N = 32.949032.860082.939782.934282.94670MIN: 2.85MIN: 2.77MIN: 2.83MIN: 2.83MIN: 2.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.2880.5760.8641.1521.44SE +/- 0.00683, N = 3SE +/- 0.00348, N = 3SE +/- 0.00554, N = 3SE +/- 0.00400, N = 3SE +/- 0.00255, N = 31.232391.279911.227031.230121.22496MIN: 1.18MIN: 1.23MIN: 1.18MIN: 1.19MIN: 1.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.002, N = 3SE +/- 0.011, N = 3SE +/- 0.004, N = 3SE +/- 0.005, N = 3SE +/- 0.003, N = 38.7168.5258.7308.7328.5991. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070170140210280350SE +/- 2.56, N = 13SE +/- 4.16, N = 3SE +/- 1.76, N = 14SE +/- 2.83, N = 6SE +/- 3.01, N = 5306.08295.07299.97297.32293.711. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.043, N = 36.6426.5606.8666.7806.5481. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.18650.3730.55950.7460.9325SE +/- 0.003298, N = 3SE +/- 0.008442, N = 15SE +/- 0.005954, N = 8SE +/- 0.005568, N = 3SE +/- 0.008934, N = 30.6794820.8288660.6805220.6970800.678626MIN: 0.66MIN: 0.74MIN: 0.63MIN: 0.67MIN: 0.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.01311, N = 3SE +/- 0.01517, N = 3SE +/- 0.01276, N = 3SE +/- 0.01164, N = 3SE +/- 0.00808, N = 39.347669.554559.350959.355509.34206MIN: 9.29MIN: 9.5MIN: 9.29MIN: 9.29MIN: 9.281. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701246810SE +/- 0.007, N = 3SE +/- 0.011, N = 3SE +/- 0.033, N = 3SE +/- 0.004, N = 3SE +/- 0.017, N = 35.2866.0456.1306.2015.9911. (CXX) g++ options: -fopenmp -O3 -march=native

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107011632486480SE +/- 0.03, N = 3SE +/- 1.05, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 370.1273.9469.3069.8069.23MIN: 69.44 / MAX: 71.63MIN: 72.33 / MAX: 77.08MIN: 68.65 / MAX: 70.61MIN: 69.16 / MAX: 70.99MIN: 68.59 / MAX: 70.341. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107014080120160200SE +/- 0.58, N = 3SE +/- 0.18, N = 3SE +/- 0.44, N = 3SE +/- 0.31, N = 3SE +/- 0.32, N = 3190.03190.62189.88191.02191.331. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070130060090012001500SE +/- 1.51, N = 3SE +/- 0.45, N = 3SE +/- 0.37, N = 3SE +/- 0.47, N = 3SE +/- 1.30, N = 31445.241450.561484.231468.971419.371. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107013691215SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 311.0210.9610.9210.9210.93MIN: 10.79MIN: 10.74MIN: 10.74MIN: 10.74MIN: 10.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070150100150200250SE +/- 1.34, N = 3SE +/- 1.22, N = 3SE +/- 3.33, N = 3SE +/- 3.10, N = 3SE +/- 1.58, N = 3247.45243.48246.75244.05244.561. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070170140210280350SE +/- 0.21, N = 3SE +/- 1.42, N = 3SE +/- 1.79, N = 3SE +/- 2.35, N = 3SE +/- 0.68, N = 3310.79302.25306.89305.72303.541. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070180160240320400SE +/- 0.95, N = 3SE +/- 0.96, N = 3SE +/- 1.74, N = 3SE +/- 1.99, N = 3SE +/- 0.75, N = 3374.62376.89372.69375.65375.321. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.4