Intel 10980XE  GCC Compiler Benchmarks

Intel Core i9-10980XE GCC compiler benchmarking by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2107032-IB-10980XECO53&grs&sor.

Intel 10980XE  GCC Compiler BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701Intel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads)ASRock X299 Steel Legend (P1.30 BIOS)Intel Sky Lake-E DMI3 Registers32GBSamsung SSD 970 PRO 512GBNVIDIA NV132 11GBRealtek ALC1220ASUS VP28UIntel I219-V + Intel I211Ubuntu 21.045.11.0-22-generic (x86_64)GNOME Shell 3.38.4X Server + Waylandnouveau4.3 Mesa 21.0.11.0.2GCC 8.5.0ext42560x1600GCC 9.4.0GCC 10.3.0GCC 11.1.0GCC 12.0.0 20210701OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- --disable-multilib --enable-checking=release --enable-languages=c,c++Processor Details- Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Python Details- Python 3.9.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Intel 10980XE  GCC Compiler Benchmarksgraphics-magick: Sharpenfinancebench: Bonds OpenMPcompress-zstd: 8, Long Mode - Compression Speedespeak: Text-To-Speech Synthesisbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptfinancebench: Repo OpenMPgraphics-magick: Swirlonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUsmallpt: Global Illumination Renderer; 128 Samplesmnn: resnet-v2-50viennacl: CPU BLAS - dGEMM-NNbotan: Twofishviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TTbotan: Twofish - Decryptgraphics-magick: Resizingtnn: CPU - MobileNet v2graphics-magick: Rotatebotan: Blowfishmrbayes: Primate Phylogeny Analysisonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUbotan: Blowfish - Decryptquantlib: ncnn: CPU-v2-v2 - mobilenet-v2coremark: CoreMark Size 666 - Iterations Per Secondcompress-zstd: 19, Long Mode - Decompression Speedbotan: CAST-256 - Decryptbotan: CAST-256gcrypt: mnn: inception-v3ncnn: CPU - googlenettnn: CPU - SqueezeNet v2ncnn: CPU - efficientnet-b0compress-zstd: 19 - Decompression Speedgraphics-magick: HWB Color Spacencnn: CPU - resnet18ncnn: CPU - mnasnetvpxenc: Speed 5 - Bosphorus 4Kmnn: mobilenetV3webp: Quality 100, Highest Compressioncryptopp: Unkeyed Algorithmsonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUcompress-zstd: 8 - Decompression Speedetcpak: DXT1stockfish: Total Timeonednn: IP Shapes 3D - u8s8f32 - CPUmnn: MobileNetV2_224compress-zstd: 8, Long Mode - Decompression Speedsvt-vp9: VMAF Optimized - Bosphorus 1080pncnn: CPU - resnet50graphics-magick: Noise-Gaussianliquid-dsp: 36 - 256 - 57ncnn: CPU-v3-v3 - mobilenet-v3mnn: SqueezeNetV1.0cryptopp: Keyed Algorithmswebp: Quality 100, Lossless, Highest Compressiondav1d: Chimera 1080p 10-bitncnn: CPU - squeezenet_ssdx265: Bosphorus 4Kdav1d: Summer Nature 4Kvpxenc: Speed 0 - Bosphorus 4Kviennacl: CPU BLAS - sAXPYtnn: CPU - SqueezeNet v1.1aom-av1: Speed 6 Two-Pass - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Kncnn: CPU - regnety_400metcpak: ETC1 + Ditheringviennacl: CPU BLAS - dGEMV-Nencode-opus: WAV To Opus Encodecompress-zstd: 8 - Compression Speedonednn: IP Shapes 3D - bf16bf16bf16 - CPUtachyon: Total Timeliquid-dsp: 32 - 256 - 57svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ponednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUncnn: CPU - blazefacencnn: CPU - mobilenetonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUwebp: Quality 100, Losslessviennacl: CPU BLAS - sCOPYonednn: Recurrent Neural Network Inference - u8s8f32 - CPUhmmer: Pfam Database Searchbotan: KASUMIngspice: C7552encode-mp3: WAV To MP3compress-7zip: Compress Speed Testcompress-zstd: 19, Long Mode - Compression Speedonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUetcpak: ETC2pjsip: OPTIONS, Statelessncnn: CPU - alexnetpjsip: INVITEc-ray: Total Time - 4K, 16 Rays Per Pixelvosk: aom-av1: Speed 8 Realtime - Bosphorus 4Kbotan: KASUMI - Decrypthimeno: Poisson Pressure Solverncnn: CPU - vgg16securemark: SecureMark-TLSgraphics-magick: Enhancedblosc: blosclzmnn: mobilenet-v1-1.0aom-av1: Speed 6 Realtime - Bosphorus 4Kcryptopp: Integer + Elliptic Curve Public Key Algorithmssvt-vp9: Visual Quality Optimized - Bosphorus 1080pkvazaar: Bosphorus 4K - Very Fastncnn: CPU - shufflenet-v2sqlite-speedtest: Timed Time - Size 1,000encode-flac: WAV To FLACcompress-zstd: 19 - Compression Speedtjbench: Decompression Throughputsvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Kviennacl: CPU BLAS - sDOTkvazaar: Bosphorus 4K - Ultra Fastpjsip: OPTIONS, Statefulsvt-hevc: 1 - Bosphorus 1080pngspice: C2670svt-hevc: 10 - Bosphorus 1080ponednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUsvt-hevc: 7 - Bosphorus 1080pviennacl: CPU BLAS - dGEMV-Tbotan: AES-256gnupg: 2.7GB Sample File Encryptiontnn: CPU - DenseNetviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUencode-wavpack: WAV To WavPackbotan: AES-256 - Decryptlibgav1: Chimera 1080p 10-bitlibgav1: Summer Nature 4Kncnn: CPU - yolov4-tinymnn: squeezenetv1.1viennacl: CPU BLAS - dDOTonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070119774820.187500335.526.836984.019977.16542354.8984377920.6794825.28627.29755.5416.18456.054.755.2420.3661444321.387809491.138146.0960.441500482.7802586.35.36618050.8300092819.2152.429152.235194.12130.15412.7370.1246.692676.890310.544.828.222.4036.642377.7809021573.181573.039.828583361.71445.235511400131.232393.7043547.4306.0818.333909167900004.725.594714.48766937.695215.1615.6321.53197.464.7568.8289.9634.4627.1813.77329.12869.58.283425.72.9490348.1611924630000310.79939.9232.5614.161.7630817.20245.9937.607126.20499.716126.2008.7169971143.39.34766198.1091353029.05330729.81820.72119.4097.6444522.73117036.2025986942411800.72.4337.515519.144212247.4521.355.0557.6068.43660.3218.8135811.34011.92577.040.13580112.88134.346374.625.5406211.0180190.0379.93998.62764.2393505.93157.138.10.5284160.46145913.3553991.20120.844.56463.67.8903226576452.778646375.628.107951.152945.44942795.0533857520.8288666.04531.45256.1414.44556.254.154.9411.5731617347.436776486.315157.3320.467594475.4082568.85.10650499.4748883017.6150.879150.585193.97332.41212.4473.9386.572802.786410.584.738.612.2976.560375.2314921639.781636.969.791853436.81450.560506225521.279913.5943628.4295.0717.593879216700004.575.540719.76248436.869219.6915.0821.64199.894.8771.2296.1424.4427.2413.87336.10671.88.241429.42.8600847.8849930236667302.25961.3832.5413.941.7915016.82347.1960.128126.506100.411127.6888.5259829843.59.55455199.0971359039.15325230.03520.99319.3498.5574609.39856336.0126345742411802.62.4557.435538.547039243.4821.204.9757.0348.50060.2220.6177871.35512.02178.040.16572912.91134.222376.895.5953310.9596190.6279.73987.08064.2283527.68257.438.10.5267980.45945613.3753993.89021.354.42063.78.1003831749799.115885370.132.672788.573780.12335458.6627617610.6805226.13028.25558.7404.15758.956.457.1411.5221585311.604765486.288154.6290.426902474.8042529.85.12630485.5885102876.9151.248151.184193.41130.80413.0669.2976.622701.986411.164.808.592.3416.866372.5830411566.751565.419.390623285.51484.230499422761.227033.6693479.0299.9717.874039406600004.645.612717.16555937.869222.2715.2621.84195.154.8470.3286.2154.4827.8613.63329.48671.68.283424.02.9397847.9045939716667306.89938.2202.6113.811.7552017.25146.4935.962126.68798.200126.8778.7309742643.99.35095197.6291382229.18328130.43020.75319.2896.8794538.66196136.3526347242711713.42.4677.385503.078606246.7521.095.0257.3038.36960.1219.5427991.35912.06277.440.64574413.04133.912372.695.5333410.9199189.8879.63999.24864.4243508.29057.338.30.5260140.46048213.3403998.47921.264.27154.77.9016331948802.755208469.435.086779.376774.64534558.2239589240.6970806.20128.44551.0367.67751.949.850.5373.5621571314.561852442.670142.9340.430676439.5552749.15.06597455.1608122775.7140.850141.031208.34331.27512.8469.8026.562642.291610.974.728.632.4126.780360.4141921566.251563.059.413463351.91468.968522069631.230123.7543553.2297.3217.774039545366674.665.723692.98639936.471223.0615.2321.16192.954.8770.7288.9104.3328.0713.93327.32971.78.456419.52.9342847.8659951170000305.72936.8802.5513.791.7626416.80346.7938.043126.616100.697129.3968.7329814944.39.35550194.7591364449.08324029.96020.89419.6497.0014592.94740136.5625956543211926.82.4777.515593.130439244.0521.015.0557.4118.41161.0218.6181621.34712.09077.640.31576912.96134.711375.655.5454410.9199191.0279.33985.26664.2043508.39057.438.30.5281330.45995613.3533995.23221.374.60963.77.9047731948317.579427385.527.281781.383775.48834223.3033869030.6786265.99128.27251.2366.64851.949.851.5374.7071607318.056794442.265145.1740.425675442.4102773.64.89601830.2632922782.7140.480140.309196.22230.79712.2169.2316.272773.988410.934.588.652.3726.548374.6407951564.711567.419.594523332.71419.365507345711.224963.7293531.4293.7117.744039375533334.545.506714.20526937.161221.9415.2821.08194.064.9270.1287.4904.4028.1013.48325.27871.48.186432.72.9467049.3237944433333303.54935.3782.5413.811.7447116.89246.4936.491129.454100.609128.3208.5999849343.99.34206197.1281379668.99330429.97320.57419.6796.6104580.18807336.7026451442911889.42.4777.495532.467940244.5621.154.9856.7158.37960.8217.4258161.34811.96677.140.20576312.94135.428375.325.5367210.9330191.3379.53972.02064.6133524.74657.238.20.5279390.45966013.3313993.17321.3228.2422.834.28363.77.91258OpenBenchmarking.org

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 9.4GCC 8.570140210280350SE +/- 0.33, N = 33193193172651971. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.5GCC 9.416K32K48K64K80KSE +/- 48.36, N = 3SE +/- 36.89, N = 3SE +/- 11.94, N = 3SE +/- 5.53, N = 3SE +/- 1061.39, N = 348317.5848802.7649799.1274820.1976452.781. (CXX) g++ options: -O3 -march=native -fopenmp

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression SpeedGCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 10.3GCC 8.5100200300400500SE +/- 3.45, N = 3SE +/- 3.88, N = 15SE +/- 2.56, N = 13SE +/- 3.28, N = 15SE +/- 3.63, N = 3469.4385.5375.6370.1335.51. (CC) gcc options: -O3 -march=native -pthread -lz

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisGCC 8.5GCC 12.0.0 20210701GCC 9.4GCC 10.3GCC 11.1816243240SE +/- 0.19, N = 4SE +/- 0.18, N = 4SE +/- 0.23, N = 4SE +/- 0.09, N = 4SE +/- 0.16, N = 426.8427.2828.1132.6735.091. (CC) gcc options: -O3 -march=native -std=c99

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305GCC 8.5GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 11.12004006008001000SE +/- 2.79, N = 3SE +/- 1.36, N = 3SE +/- 0.15, N = 3SE +/- 0.73, N = 3SE +/- 1.05, N = 3984.02951.15788.57781.38779.381. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 11.12004006008001000SE +/- 2.38, N = 3SE +/- 0.08, N = 3SE +/- 0.79, N = 3SE +/- 0.47, N = 3SE +/- 0.89, N = 3977.17945.45780.12775.49774.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.5GCC 9.49K18K27K36K45KSE +/- 22.03, N = 3SE +/- 43.71, N = 3SE +/- 120.35, N = 3SE +/- 34.41, N = 3SE +/- 10.60, N = 334223.3034558.2235458.6642354.9042795.051. (CXX) g++ options: -O3 -march=native -fopenmp

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlGCC 11.1GCC 12.0.0 20210701GCC 8.5GCC 10.3GCC 9.42004006008001000SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 39249037927617521. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUGCC 12.0.0 20210701GCC 8.5GCC 10.3GCC 11.1GCC 9.40.18650.3730.55950.7460.9325SE +/- 0.008934, N = 3SE +/- 0.003298, N = 3SE +/- 0.005954, N = 8SE +/- 0.005568, N = 3SE +/- 0.008442, N = 150.6786260.6794820.6805220.6970800.828866MIN: 0.65MIN: 0.66MIN: 0.63MIN: 0.67MIN: 0.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 8.5GCC 12.0.0 20210701GCC 9.4GCC 10.3GCC 11.1246810SE +/- 0.007, N = 3SE +/- 0.017, N = 3SE +/- 0.011, N = 3SE +/- 0.033, N = 3SE +/- 0.004, N = 35.2865.9916.0456.1306.2011. (CXX) g++ options: -fopenmp -O3 -march=native

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: resnet-v2-50GCC 8.5GCC 10.3GCC 12.0.0 20210701GCC 11.1GCC 9.4714212835SE +/- 0.10, N = 3SE +/- 0.24, N = 3SE +/- 0.19, N = 3SE +/- 0.06, N = 3SE +/- 0.47, N = 1527.3028.2628.2728.4531.45MIN: 26.86 / MAX: 27.93MIN: 27.6 / MAX: 28.76MIN: 27.7 / MAX: 28.91MIN: 27.77 / MAX: 28.83MIN: 24.41 / MAX: 36.241. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNGCC 10.3GCC 9.4GCC 8.5GCC 12.0.0 20210701GCC 11.11326395265SE +/- 0.63, N = 3SE +/- 0.13, N = 3SE +/- 0.75, N = 3SE +/- 0.44, N = 3SE +/- 0.15, N = 358.756.155.551.251.01. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 2021070190180270360450SE +/- 0.33, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 1.13, N = 3SE +/- 0.30, N = 3416.18414.45404.16367.68366.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNGCC 10.3GCC 9.4GCC 8.5GCC 12.0.0 20210701GCC 11.11326395265SE +/- 1.17, N = 3SE +/- 0.36, N = 3SE +/- 1.55, N = 2SE +/- 0.10, N = 3SE +/- 0.10, N = 358.956.256.051.951.91. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTGCC 10.3GCC 8.5GCC 9.4GCC 12.0.0 20210701GCC 11.11326395265SE +/- 0.48, N = 3SE +/- 0.25, N = 3SE +/- 0.27, N = 3SE +/- 0.37, N = 3SE +/- 0.15, N = 356.454.754.149.849.81. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTGCC 10.3GCC 8.5GCC 9.4GCC 12.0.0 20210701GCC 11.11326395265SE +/- 0.70, N = 2SE +/- 0.32, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.20, N = 357.155.254.951.550.51. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 11.190180270360450SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 0.17, N = 3SE +/- 0.56, N = 3420.37411.57411.52374.71373.561. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingGCC 9.4GCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 8.530060090012001500SE +/- 4.93, N = 3SE +/- 7.21, N = 3SE +/- 8.67, N = 3SE +/- 3.51, N = 3SE +/- 2.91, N = 3161716071585157114441. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2GCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 8.5GCC 9.480160240320400SE +/- 0.31, N = 3SE +/- 0.19, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 3SE +/- 0.34, N = 3311.60314.56318.06321.39347.44MIN: 309.73 / MAX: 322.67MIN: 312.66 / MAX: 328.44MIN: 316.44 / MAX: 326.16MIN: 319.29 / MAX: 341.28MIN: 345.68 / MAX: 356.591. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateGCC 11.1GCC 8.5GCC 12.0.0 20210701GCC 9.4GCC 10.32004006008001000SE +/- 2.52, N = 3SE +/- 7.07, N = 15SE +/- 7.25, N = 15SE +/- 3.51, N = 3SE +/- 5.24, N = 38528097947767651. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishGCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 20210701110220330440550SE +/- 0.01, N = 3SE +/- 0.29, N = 3SE +/- 0.22, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3491.14486.32486.29442.67442.271. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 11.1GCC 12.0.0 20210701GCC 8.5GCC 10.3GCC 9.4306090120150SE +/- 0.32, N = 3SE +/- 1.02, N = 3SE +/- 0.45, N = 3SE +/- 0.58, N = 3SE +/- 1.65, N = 12142.93145.17146.10154.63157.331. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -march=native -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUGCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 8.5GCC 9.40.10520.21040.31560.42080.526SE +/- 0.000391, N = 3SE +/- 0.004576, N = 4SE +/- 0.004665, N = 3SE +/- 0.003487, N = 3SE +/- 0.002588, N = 30.4256750.4269020.4306760.4415000.467594MIN: 0.41MIN: 0.4MIN: 0.41MIN: 0.41MIN: 0.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptGCC 8.5GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 11.1100200300400500SE +/- 0.04, N = 3SE +/- 0.21, N = 3SE +/- 0.30, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3482.78475.41474.80442.41439.561. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21GCC 12.0.0 20210701GCC 11.1GCC 8.5GCC 9.4GCC 10.36001200180024003000SE +/- 0.85, N = 3SE +/- 33.86, N = 4SE +/- 19.67, N = 3SE +/- 35.29, N = 3SE +/- 18.68, N = 32773.62749.12586.32568.82529.81. (CXX) g++ options: -O3 -march=native -rdynamic

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU-v2-v2 - Model: mobilenet-v2GCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 10.3GCC 8.51.2062.4123.6184.8246.03SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 34.895.065.105.125.36MIN: 4.72 / MAX: 10.07MIN: 4.72 / MAX: 10.01MIN: 4.73 / MAX: 10.4MIN: 4.76 / MAX: 8.89MIN: 4.96 / MAX: 8.41. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 9.4GCC 10.3GCC 8.5GCC 12.0.0 20210701GCC 11.1140K280K420K560K700KSE +/- 1621.16, N = 3SE +/- 2003.80, N = 3SE +/- 3624.80, N = 3SE +/- 2406.19, N = 3SE +/- 1267.23, N = 3650499.47630485.59618050.83601830.26597455.161. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedGCC 9.4GCC 10.3GCC 8.5GCC 12.0.0 20210701GCC 11.16001200180024003000SE +/- 4.29, N = 3SE +/- 2.75, N = 3SE +/- 10.70, N = 3SE +/- 2.25, N = 3SE +/- 14.84, N = 33017.62876.92819.22782.72775.71. (CC) gcc options: -O3 -march=native -pthread -lz

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptGCC 8.5GCC 10.3GCC 9.4GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.50, N = 3SE +/- 0.28, N = 3152.43151.25150.88140.85140.481. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256GCC 8.5GCC 10.3GCC 9.4GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.30, N = 3152.24151.18150.59141.03140.311. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9GCC 10.3GCC 9.4GCC 8.5GCC 12.0.0 20210701GCC 11.150100150200250SE +/- 0.44, N = 3SE +/- 0.21, N = 3SE +/- 0.27, N = 3SE +/- 0.33, N = 3SE +/- 0.18, N = 3193.41193.97194.12196.22208.341. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: inception-v3GCC 8.5GCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 9.4816243240SE +/- 0.10, N = 3SE +/- 0.43, N = 3SE +/- 0.42, N = 3SE +/- 0.42, N = 3SE +/- 0.28, N = 1530.1530.8030.8031.2832.41MIN: 29.77 / MAX: 30.53MIN: 30.14 / MAX: 31.87MIN: 30.12 / MAX: 31.88MIN: 30.28 / MAX: 31.97MIN: 29.17 / MAX: 33.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: googlenetGCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 11.1GCC 10.33691215SE +/- 0.31, N = 3SE +/- 0.28, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 312.2112.4412.7312.8413.06MIN: 11.77 / MAX: 13.02MIN: 12 / MAX: 13.22MIN: 12.1 / MAX: 19.86MIN: 12.68 / MAX: 16.73MIN: 12.84 / MAX: 14.291. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2GCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 8.5GCC 9.41632486480SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 1.05, N = 369.2369.3069.8070.1273.94MIN: 68.59 / MAX: 70.34MIN: 68.65 / MAX: 70.61MIN: 69.16 / MAX: 70.99MIN: 69.44 / MAX: 71.63MIN: 72.33 / MAX: 77.081. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: efficientnet-b0GCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 10.3GCC 8.5246810SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 36.276.566.576.626.69MIN: 6.05 / MAX: 14.37MIN: 6.27 / MAX: 11.76MIN: 6.28 / MAX: 14.64MIN: 6.24 / MAX: 24.41MIN: 6.33 / MAX: 10.851. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedGCC 9.4GCC 12.0.0 20210701GCC 10.3GCC 8.5GCC 11.16001200180024003000SE +/- 14.28, N = 3SE +/- 16.86, N = 3SE +/- 9.76, N = 3SE +/- 3.24, N = 3SE +/- 7.69, N = 92802.72773.92701.92676.82642.21. (CC) gcc options: -O3 -march=native -pthread -lz

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceGCC 11.1GCC 8.5GCC 12.0.0 20210701GCC 10.3GCC 9.42004006008001000SE +/- 1.15, N = 3SE +/- 0.88, N = 3SE +/- 1.33, N = 39169038848648641. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: resnet18GCC 8.5GCC 9.4GCC 12.0.0 20210701GCC 11.1GCC 10.33691215SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 310.5410.5810.9310.9711.16MIN: 10.19 / MAX: 17.98MIN: 10.2 / MAX: 11.57MIN: 10.84 / MAX: 11.27MIN: 10.84 / MAX: 20.49MIN: 11.03 / MAX: 11.451. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: mnasnetGCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 10.3GCC 8.51.08452.1693.25354.3385.4225SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 34.584.724.734.804.82MIN: 4.39 / MAX: 10.87MIN: 4.46 / MAX: 10.72MIN: 4.44 / MAX: 11.54MIN: 4.42 / MAX: 16.22MIN: 4.55 / MAX: 12.41. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4KGCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 10.3GCC 8.5246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 38.658.638.618.598.221. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenetV3GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 8.5GCC 11.10.54271.08541.62812.17082.7135SE +/- 0.015, N = 15SE +/- 0.011, N = 3SE +/- 0.011, N = 3SE +/- 0.012, N = 3SE +/- 0.032, N = 32.2972.3412.3722.4032.412MIN: 1.96 / MAX: 2.53MIN: 2.16 / MAX: 2.53MIN: 2.25 / MAX: 2.5MIN: 2.28 / MAX: 2.54MIN: 2.23 / MAX: 2.611. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionGCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 11.1GCC 10.3246810SE +/- 0.043, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 36.5486.5606.6426.7806.8661. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsGCC 8.5GCC 9.4GCC 12.0.0 20210701GCC 10.3GCC 11.180160240320400SE +/- 0.02, N = 3SE +/- 0.65, N = 3SE +/- 0.68, N = 3SE +/- 0.17, N = 3SE +/- 0.04, N = 3377.78375.23374.64372.58360.411. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.5GCC 9.4400800120016002000SE +/- 1.54, N = 3SE +/- 3.72, N = 3SE +/- 1.06, N = 3SE +/- 0.81, N = 3SE +/- 1.53, N = 31564.711566.251566.751573.181639.78MIN: 1557.51MIN: 1553.36MIN: 1558.77MIN: 1566.72MIN: 1630.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUGCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 8.5GCC 9.4400800120016002000SE +/- 2.29, N = 3SE +/- 0.93, N = 3SE +/- 0.75, N = 3SE +/- 1.09, N = 3SE +/- 2.09, N = 31563.051565.411567.411573.031636.96MIN: 1555.64MIN: 1559.12MIN: 1561.17MIN: 1566.77MIN: 1629.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUGCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 8.53691215SE +/- 0.02137, N = 3SE +/- 0.01742, N = 3SE +/- 0.02038, N = 3SE +/- 0.00795, N = 3SE +/- 0.02091, N = 39.390629.413469.594529.791859.82858MIN: 9.29MIN: 9.27MIN: 9.4MIN: 9.62MIN: 9.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Decompression SpeedGCC 9.4GCC 8.5GCC 11.1GCC 12.0.0 20210701GCC 10.37001400210028003500SE +/- 2.27, N = 5SE +/- 2.28, N = 3SE +/- 5.37, N = 3SE +/- 2.91, N = 3SE +/- 3.06, N = 33436.83361.73351.93332.73285.51. (CC) gcc options: -O3 -march=native -pthread -lz

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1GCC 10.3GCC 11.1GCC 9.4GCC 8.5GCC 12.0.0 2021070130060090012001500SE +/- 0.37, N = 3SE +/- 0.47, N = 3SE +/- 0.45, N = 3SE +/- 1.51, N = 3SE +/- 1.30, N = 31484.231468.971450.561445.241419.371. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeGCC 11.1GCC 8.5GCC 12.0.0 20210701GCC 9.4GCC 10.311M22M33M44M55MSE +/- 211973.29, N = 3SE +/- 562135.06, N = 15SE +/- 432778.20, N = 8SE +/- 508293.85, N = 15SE +/- 623930.99, N = 352206963511400135073457150622552499422761. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUGCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 8.5GCC 9.40.2880.5760.8641.1521.44SE +/- 0.00255, N = 3SE +/- 0.00554, N = 3SE +/- 0.00400, N = 3SE +/- 0.00683, N = 3SE +/- 0.00348, N = 31.224961.227031.230121.232391.27991MIN: 1.18MIN: 1.18MIN: 1.19MIN: 1.18MIN: 1.231. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: MobileNetV2_224GCC 9.4GCC 10.3GCC 8.5GCC 12.0.0 20210701GCC 11.10.84471.68942.53413.37884.2235SE +/- 0.040, N = 15SE +/- 0.090, N = 3SE +/- 0.069, N = 3SE +/- 0.058, N = 3SE +/- 0.018, N = 33.5943.6693.7043.7293.754MIN: 3.07 / MAX: 4.08MIN: 3.42 / MAX: 4.19MIN: 3.31 / MAX: 3.95MIN: 3.41 / MAX: 3.94MIN: 3.47 / MAX: 3.951. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression SpeedGCC 9.4GCC 11.1GCC 8.5GCC 12.0.0 20210701GCC 10.38001600240032004000SE +/- 2.52, N = 13SE +/- 3.35, N = 3SE +/- 5.55, N = 3SE +/- 3.04, N = 15SE +/- 3.49, N = 153628.43553.23547.43531.43479.01. (CC) gcc options: -O3 -march=native -pthread -lz

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pGCC 8.5GCC 10.3GCC 11.1GCC 9.4GCC 12.0.0 2021070170140210280350SE +/- 2.56, N = 13SE +/- 1.76, N = 14SE +/- 2.83, N = 6SE +/- 4.16, N = 3SE +/- 3.01, N = 5306.08299.97297.32295.07293.711. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: resnet50GCC 9.4GCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.5510152025SE +/- 0.29, N = 3SE +/- 0.24, N = 3SE +/- 0.24, N = 3SE +/- 0.29, N = 3SE +/- 0.30, N = 317.5917.7417.7717.8718.33MIN: 17.07 / MAX: 18.69MIN: 17.09 / MAX: 18.96MIN: 17.07 / MAX: 28.68MIN: 17.16 / MAX: 18.62MIN: 17.58 / MAX: 24.571. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.5GCC 9.490180270360450SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 34034034033903871. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

Liquid-DSP

Threads: 36 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 36 - Buffer Length: 256 - Filter Length: 57GCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 9.4GCC 8.5200M400M600M800M1000MSE +/- 1013283.99, N = 3SE +/- 272213.15, N = 3SE +/- 1056729.76, N = 3SE +/- 588132.64, N = 3SE +/- 120554.28, N = 39545366679406600009375533339216700009167900001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU-v3-v3 - Model: mobilenet-v3GCC 12.0.0 20210701GCC 9.4GCC 10.3GCC 11.1GCC 8.51.0622.1243.1864.2485.31SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 34.544.574.644.664.72MIN: 4.37 / MAX: 11.47MIN: 4.37 / MAX: 9.05MIN: 4.36 / MAX: 10.01MIN: 4.46 / MAX: 12.92MIN: 4.49 / MAX: 7.391. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: SqueezeNetV1.0GCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 10.3GCC 11.11.28772.57543.86315.15086.4385SE +/- 0.079, N = 3SE +/- 0.039, N = 15SE +/- 0.056, N = 3SE +/- 0.083, N = 3SE +/- 0.020, N = 35.5065.5405.5945.6125.723MIN: 5.22 / MAX: 5.91MIN: 5.06 / MAX: 6.72MIN: 5.4 / MAX: 5.85MIN: 5.24 / MAX: 6.01MIN: 5.47 / MAX: 6.721. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsGCC 9.4GCC 10.3GCC 8.5GCC 12.0.0 20210701GCC 11.1160320480640800SE +/- 0.35, N = 3SE +/- 0.48, N = 3SE +/- 0.10, N = 3SE +/- 0.24, N = 3SE +/- 0.20, N = 3719.76717.17714.49714.21692.991. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionGCC 11.1GCC 9.4GCC 12.0.0 20210701GCC 8.5GCC 10.3918273645SE +/- 0.00, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.01, N = 336.4736.8737.1637.7037.871. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.0Video Input: Chimera 1080p 10-bitGCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 9.4GCC 8.550100150200250SE +/- 1.18, N = 3SE +/- 0.50, N = 3SE +/- 0.48, N = 3SE +/- 0.34, N = 3SE +/- 0.39, N = 3223.06222.27221.94219.69215.16MIN: 157.45 / MAX: 397.96MIN: 157.09 / MAX: 436.51MIN: 157.38 / MAX: 404.98-lm - MIN: 156.35 / MAX: 406.23-lm - MIN: 151.62 / MAX: 411.261. (CC) gcc options: -O3 -march=native -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: squeezenet_ssdGCC 9.4GCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 8.548121620SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 3SE +/- 0.36, N = 315.0815.2315.2615.2815.63MIN: 14.88 / MAX: 21.62MIN: 14.88 / MAX: 17.07MIN: 14.97 / MAX: 18.92MIN: 14.89 / MAX: 16.1MIN: 15.02 / MAX: 16.851. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KGCC 10.3GCC 9.4GCC 8.5GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 321.8421.6421.5321.1621.081. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.0Video Input: Summer Nature 4KGCC 9.4GCC 8.5GCC 10.3GCC 12.0.0 20210701GCC 11.14080120160200SE +/- 1.65, N = 3SE +/- 0.89, N = 3SE +/- 2.16, N = 3SE +/- 1.97, N = 6SE +/- 1.13, N = 3199.89197.46195.15194.06192.95-lm - MIN: 143.79 / MAX: 228.44-lm - MIN: 150.4 / MAX: 226.05MIN: 149.2 / MAX: 222.59MIN: 131.48 / MAX: 225.93MIN: 132.83 / MAX: 217.91. (CC) gcc options: -O3 -march=native -pthread

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 4KGCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 10.3GCC 8.51.1072.2143.3214.4285.535SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 34.924.874.874.844.751. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYGCC 9.4GCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 8.51632486480SE +/- 0.12, N = 3SE +/- 0.31, N = 3SE +/- 0.30, N = 3SE +/- 0.35, N = 3SE +/- 1.48, N = 371.270.770.370.168.81. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1GCC 10.3GCC 12.0.0 20210701GCC 11.1GCC 8.5GCC 9.460120180240300SE +/- 0.10, N = 3SE +/- 0.19, N = 3SE +/- 0.63, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3286.22287.49288.91289.96296.14MIN: 285.02 / MAX: 287.82MIN: 285.88 / MAX: 299.62MIN: 286.05 / MAX: 294.45MIN: 288.43 / MAX: 291.61MIN: 294.66 / MAX: 298.561. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KGCC 10.3GCC 8.5GCC 9.4GCC 12.0.0 20210701GCC 11.11.0082.0163.0244.0325.04SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 10SE +/- 0.04, N = 34.484.464.444.404.331. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 9.4GCC 8.5714212835SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 328.1028.0727.8627.2427.181. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: regnety_400mGCC 12.0.0 20210701GCC 10.3GCC 8.5GCC 9.4GCC 11.148121620SE +/- 0.05, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.21, N = 3SE +/- 0.11, N = 313.4813.6313.7713.8713.93MIN: 13.11 / MAX: 14.04MIN: 13.01 / MAX: 14.57MIN: 12.93 / MAX: 14.62MIN: 13.18 / MAX: 15.01MIN: 13.17 / MAX: 14.481. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringGCC 9.4GCC 10.3GCC 8.5GCC 11.1GCC 12.0.0 2021070170140210280350SE +/- 0.09, N = 3SE +/- 0.38, N = 3SE +/- 2.72, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 3336.11329.49329.13327.33325.281. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NGCC 9.4GCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 8.51632486480SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.31, N = 3SE +/- 0.12, N = 371.871.771.671.469.51. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeGCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 10.3GCC 11.1246810SE +/- 0.011, N = 5SE +/- 0.021, N = 5SE +/- 0.011, N = 5SE +/- 0.031, N = 5SE +/- 0.013, N = 58.1868.2418.2838.2838.4561. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression SpeedGCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 10.3GCC 11.190180270360450SE +/- 5.56, N = 3SE +/- 4.23, N = 5SE +/- 4.99, N = 3SE +/- 4.87, N = 3SE +/- 5.64, N = 3432.7429.4425.7424.0419.51. (CC) gcc options: -O3 -march=native -pthread -lz

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUGCC 9.4GCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 8.50.66351.3271.99052.6543.3175SE +/- 0.01601, N = 3SE +/- 0.01584, N = 3SE +/- 0.01261, N = 3SE +/- 0.01797, N = 3SE +/- 0.01360, N = 32.860082.934282.939782.946702.94903MIN: 2.77MIN: 2.83MIN: 2.83MIN: 2.84MIN: 2.851. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeGCC 11.1GCC 9.4GCC 10.3GCC 8.5GCC 12.0.0 202107011122334455SE +/- 0.26, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.18, N = 347.8747.8847.9048.1649.321. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57GCC 11.1GCC 12.0.0 20210701GCC 10.3GCC 9.4GCC 8.5200M400M600M800M1000MSE +/- 4781007.56, N = 3SE +/- 4623189.13, N = 3SE +/- 2904171.33, N = 3SE +/- 539269.05, N = 3SE +/- 353836.12, N = 39511700009444333339397166679302366679246300001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pGCC 8.5GCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.470140210280350SE +/- 0.21, N = 3SE +/- 1.79, N = 3SE +/- 2.35, N = 3SE +/- 0.68, N = 3SE +/- 1.42, N = 3310.79306.89305.72303.54302.251. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUGCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.5GCC 9.42004006008001000SE +/- 0.15, N = 3SE +/- 0.70, N = 3SE +/- 0.16, N = 3SE +/- 1.32, N = 3SE +/- 1.46, N = 3935.38936.88938.22939.92961.38MIN: 931.25MIN: 931.54MIN: 930.52MIN: 933.27MIN: 955.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: blazefaceGCC 9.4GCC 12.0.0 20210701GCC 11.1GCC 8.5GCC 10.30.58731.17461.76192.34922.9365SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 32.542.542.552.562.61MIN: 2.45 / MAX: 3.32MIN: 2.46 / MAX: 3.12MIN: 2.47 / MAX: 3.17MIN: 2.5 / MAX: 3.31MIN: 2.47 / MAX: 3.31. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: mobilenetGCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 9.4GCC 8.548121620SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 313.7913.8113.8113.9414.16MIN: 13.61 / MAX: 14.23MIN: 13.51 / MAX: 20.36MIN: 13.64 / MAX: 14.54MIN: 13.68 / MAX: 22.27MIN: 13.93 / MAX: 14.761. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUGCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 8.5GCC 9.40.40310.80621.20931.61242.0155SE +/- 0.00613, N = 3SE +/- 0.00494, N = 3SE +/- 0.00508, N = 3SE +/- 0.00719, N = 3SE +/- 0.00782, N = 31.744711.755201.762641.763081.79150MIN: 1.66MIN: 1.69MIN: 1.69MIN: 1.7MIN: 1.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessGCC 11.1GCC 9.4GCC 12.0.0 20210701GCC 8.5GCC 10.348121620SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 316.8016.8216.8917.2017.251. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYGCC 9.4GCC 11.1GCC 12.0.0 20210701GCC 10.3GCC 8.51122334455SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.27, N = 347.146.746.446.445.91. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUGCC 10.3GCC 12.0.0 20210701GCC 8.5GCC 11.1GCC 9.42004006008001000SE +/- 0.69, N = 3SE +/- 0.46, N = 3SE +/- 0.50, N = 3SE +/- 0.60, N = 3SE +/- 0.63, N = 3935.96936.49937.61938.04960.13MIN: 930.6MIN: 932.18MIN: 932.91MIN: 933.04MIN: 955.071. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database SearchGCC 8.5GCC 9.4GCC 11.1GCC 10.3GCC 12.0.0 20210701306090120150SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.32, N = 3SE +/- 0.10, N = 3126.20126.51126.62126.69129.451. (CC) gcc options: -O3 -march=native -pthread -lhmmer -leasel -lm -lmpi

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIGCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 10.320406080100SE +/- 0.07, N = 3SE +/- 0.25, N = 3SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3100.70100.61100.4199.7298.201. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552GCC 8.5GCC 10.3GCC 9.4GCC 12.0.0 20210701GCC 11.1306090120150SE +/- 1.35, N = 3SE +/- 1.51, N = 3SE +/- 1.54, N = 3SE +/- 1.05, N = 3SE +/- 0.26, N = 3126.20126.88127.69128.32129.401. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.4GCC 12.0.0 20210701GCC 8.5GCC 10.3GCC 11.1246810SE +/- 0.011, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.005, N = 38.5258.5998.7168.7308.7321. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 8.5GCC 12.0.0 20210701GCC 9.4GCC 11.1GCC 10.320K40K60K80K100KSE +/- 69.57, N = 3SE +/- 228.39, N = 3SE +/- 343.09, N = 3SE +/- 304.44, N = 3SE +/- 46.23, N = 399711984939829898149974261. (CXX) g++ options: -pipe -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedGCC 11.1GCC 12.0.0 20210701GCC 10.3GCC 9.4GCC 8.51020304050SE +/- 0.07, N = 3SE +/- 0.15, N = 3SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 344.343.943.943.543.31. (CC) gcc options: -O3 -march=native -pthread -lz

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUGCC 12.0.0 20210701GCC 8.5GCC 10.3GCC 11.1GCC 9.43691215SE +/- 0.00808, N = 3SE +/- 0.01311, N = 3SE +/- 0.01276, N = 3SE +/- 0.01164, N = 3SE +/- 0.01517, N = 39.342069.347669.350959.355509.55455MIN: 9.28MIN: 9.29MIN: 9.29MIN: 9.29MIN: 9.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2GCC 9.4GCC 8.5GCC 10.3GCC 12.0.0 20210701GCC 11.14080120160200SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3199.10198.11197.63197.13194.761. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatelessGCC 10.3GCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 8.530K60K90K120K150KSE +/- 1006.09, N = 3SE +/- 734.59, N = 3SE +/- 946.90, N = 3SE +/- 1635.40, N = 4SE +/- 578.10, N = 31382221379661364441359031353021. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: alexnetGCC 12.0.0 20210701GCC 8.5GCC 11.1GCC 9.4GCC 10.33691215SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 38.999.059.089.159.18MIN: 8.73 / MAX: 9.39MIN: 8.96 / MAX: 19.48MIN: 9 / MAX: 11.81MIN: 9.08 / MAX: 9.59MIN: 9.11 / MAX: 9.741. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITEGCC 8.5GCC 12.0.0 20210701GCC 10.3GCC 9.4GCC 11.17001400210028003500SE +/- 25.16, N = 15SE +/- 25.40, N = 15SE +/- 6.36, N = 3SE +/- 27.02, N = 3SE +/- 7.22, N = 3330733043281325232401. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 8.5GCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 10.3714212835SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 329.8229.9629.9730.0430.431. (CC) gcc options: -lm -lpthread -O3 -march=native

VOSK Speech Recognition Toolkit

OpenBenchmarking.orgSeconds, Fewer Is BetterVOSK Speech Recognition Toolkit 0.3.21GCC 12.0.0 20210701GCC 8.5GCC 10.3GCC 11.1GCC 9.4510152025SE +/- 0.07, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 3SE +/- 0.26, N = 3SE +/- 0.13, N = 320.5720.7220.7520.8920.99

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KGCC 12.0.0 20210701GCC 11.1GCC 8.5GCC 9.4GCC 10.3510152025SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 319.6719.6419.4019.3419.281. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptGCC 9.4GCC 8.5GCC 11.1GCC 10.3GCC 12.0.0 2021070120406080100SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 398.5697.6497.0096.8896.611. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 9.4GCC 11.1GCC 12.0.0 20210701GCC 10.3GCC 8.510002000300040005000SE +/- 5.70, N = 3SE +/- 0.64, N = 3SE +/- 2.82, N = 3SE +/- 13.08, N = 3SE +/- 0.61, N = 34609.404592.954580.194538.664522.731. (CC) gcc options: -O3 -march=native -mavx2

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: vgg16GCC 9.4GCC 8.5GCC 10.3GCC 11.1GCC 12.0.0 20210701816243240SE +/- 0.49, N = 3SE +/- 0.37, N = 3SE +/- 0.47, N = 3SE +/- 0.52, N = 3SE +/- 0.53, N = 336.0136.2036.3536.5636.70MIN: 35.37 / MAX: 37.68MIN: 35.36 / MAX: 47.25MIN: 35.3 / MAX: 37.7MIN: 35.42 / MAX: 58.41MIN: 35.5 / MAX: 41.991. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSGCC 12.0.0 20210701GCC 10.3GCC 9.4GCC 8.5GCC 11.160K120K180K240K300KSE +/- 101.43, N = 3SE +/- 95.66, N = 3SE +/- 247.29, N = 3SE +/- 106.27, N = 3SE +/- 68.80, N = 32645142634722634572598692595651. (CC) gcc options: -pedantic -O3

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedGCC 11.1GCC 12.0.0 20210701GCC 10.3GCC 9.4GCC 8.5901802703604504324294274244241. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread

C-Blosc

Compressor: blosclz

OpenBenchmarking.orgMB/s, More Is BetterC-Blosc 2.0Compressor: blosclzGCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 10.33K6K9K12K15KSE +/- 69.56, N = 3SE +/- 37.20, N = 3SE +/- 18.15, N = 3SE +/- 11.42, N = 3SE +/- 21.15, N = 311926.811889.411802.611800.711713.41. (CC) gcc options: -std=gnu99 -O3 -pthread -lrt -lm

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenet-v1-1.0GCC 8.5GCC 9.4GCC 10.3GCC 11.1GCC 12.0.0 202107010.55731.11461.67192.22922.7865SE +/- 0.025, N = 3SE +/- 0.012, N = 15SE +/- 0.025, N = 3SE +/- 0.024, N = 3SE +/- 0.029, N = 32.4332.4552.4672.4772.477MIN: 2.32 / MAX: 2.62MIN: 2.23 / MAX: 3.16MIN: 2.32 / MAX: 2.68MIN: 2.3 / MAX: 2.65MIN: 2.3 / MAX: 2.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KGCC 11.1GCC 8.5GCC 12.0.0 20210701GCC 9.4GCC 10.3246810SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 12SE +/- 0.08, N = 15SE +/- 0.04, N = 37.517.517.497.437.381. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key AlgorithmsGCC 11.1GCC 9.4GCC 12.0.0 20210701GCC 8.5GCC 10.312002400360048006000SE +/- 1.99, N = 3SE +/- 6.16, N = 3SE +/- 5.19, N = 3SE +/- 1.33, N = 3SE +/- 1.85, N = 35593.135538.555532.475519.145503.081. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pGCC 8.5GCC 10.3GCC 12.0.0 20210701GCC 11.1GCC 9.450100150200250SE +/- 1.34, N = 3SE +/- 3.33, N = 3SE +/- 1.58, N = 3SE +/- 3.10, N = 3SE +/- 1.22, N = 3247.45246.75244.56244.05243.481. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastGCC 8.5GCC 9.4GCC 12.0.0 20210701GCC 10.3GCC 11.1510152025SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 321.3521.2021.1521.0921.011. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: shufflenet-v2GCC 9.4GCC 12.0.0 20210701GCC 10.3GCC 8.5GCC 11.11.13632.27263.40894.54525.6815SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 34.974.985.025.055.05MIN: 4.8 / MAX: 8.9MIN: 4.88 / MAX: 8.6MIN: 4.83 / MAX: 15.94MIN: 4.83 / MAX: 14.14MIN: 4.88 / MAX: 9.371. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000GCC 12.0.0 20210701GCC 9.4GCC 10.3GCC 11.1GCC 8.51326395265SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.10, N = 3SE +/- 0.22, N = 356.7257.0357.3057.4157.611. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 10.3GCC 12.0.0 20210701GCC 11.1GCC 8.5GCC 9.4246810SE +/- 0.004, N = 5SE +/- 0.014, N = 5SE +/- 0.009, N = 5SE +/- 0.012, N = 5SE +/- 0.006, N = 58.3698.3798.4118.4368.5001. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedGCC 11.1GCC 12.0.0 20210701GCC 8.5GCC 9.4GCC 10.31428425670SE +/- 0.50, N = 9SE +/- 0.47, N = 3SE +/- 0.19, N = 3SE +/- 0.50, N = 3SE +/- 0.56, N = 361.060.860.360.260.11. (CC) gcc options: -O3 -march=native -pthread -lz

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 9.4GCC 10.3GCC 8.5GCC 11.1GCC 12.0.0 2021070150100150200250SE +/- 0.84, N = 3SE +/- 0.30, N = 3SE +/- 0.03, N = 3SE +/- 0.47, N = 3SE +/- 0.26, N = 3220.62219.54218.81218.62217.431. (CC) gcc options: -O3 -march=native -rdynamic

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KGCC 10.3GCC 9.4GCC 12.0.0 20210701GCC 11.1GCC 8.50.30580.61160.91741.22321.529SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 31.3591.3551.3481.3471.3401. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KGCC 11.1GCC 10.3GCC 9.4GCC 12.0.0 20210701GCC 8.53691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 312.0912.0612.0211.9711.931. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTGCC 9.4GCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 8.520406080100SE +/- 0.26, N = 3SE +/- 0.43, N = 3SE +/- 0.33, N = 3SE +/- 0.49, N = 3SE +/- 0.42, N = 378.077.677.477.177.01. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastGCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 8.5918273645SE +/- 0.15, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 340.6440.3140.2040.1640.131. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatefulGCC 8.5GCC 11.1GCC 12.0.0 20210701GCC 10.3GCC 9.412002400360048006000SE +/- 16.26, N = 3SE +/- 53.69, N = 3SE +/- 23.13, N = 3SE +/- 8.67, N = 3SE +/- 24.67, N = 3580157695763574457291. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pGCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 8.53691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 313.0412.9612.9412.9112.881. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670GCC 10.3GCC 9.4GCC 8.5GCC 11.1GCC 12.0.0 20210701306090120150SE +/- 1.13, N = 3SE +/- 0.71, N = 3SE +/- 0.81, N = 3SE +/- 0.92, N = 3SE +/- 0.83, N = 3133.91134.22134.35134.71135.431. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pGCC 9.4GCC 11.1GCC 12.0.0 20210701GCC 8.5GCC 10.380160240320400SE +/- 0.96, N = 3SE +/- 1.99, N = 3SE +/- 0.75, N = 3SE +/- 0.95, N = 3SE +/- 1.74, N = 3376.89375.65375.32374.62372.691. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUGCC 10.3GCC 12.0.0 20210701GCC 8.5GCC 11.1GCC 9.41.25892.51783.77675.03566.2945SE +/- 0.01959, N = 3SE +/- 0.02089, N = 3SE +/- 0.02371, N = 3SE +/- 0.02335, N = 3SE +/- 0.02285, N = 35.533345.536725.540625.545445.59533MIN: 5.38MIN: 5.38MIN: 5.4MIN: 5.4MIN: 5.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUGCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.4GCC 8.53691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 310.9210.9210.9310.9611.02MIN: 10.74MIN: 10.74MIN: 10.77MIN: 10.74MIN: 10.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pGCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 8.5GCC 10.34080120160200SE +/- 0.32, N = 3SE +/- 0.31, N = 3SE +/- 0.18, N = 3SE +/- 0.58, N = 3SE +/- 0.44, N = 3191.33191.02190.62190.03189.881. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TGCC 8.5GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 11.120406080100SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.38, N = 379.979.779.679.579.31. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256GCC 10.3GCC 8.5GCC 9.4GCC 11.1GCC 12.0.0 202107019001800270036004500SE +/- 4.08, N = 3SE +/- 0.87, N = 3SE +/- 0.82, N = 3SE +/- 2.67, N = 3SE +/- 7.30, N = 33999.253998.633987.083985.273972.021. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File EncryptionGCC 11.1GCC 9.4GCC 8.5GCC 10.3GCC 12.0.0 202107011428425670SE +/- 0.23, N = 3SE +/- 0.19, N = 3SE +/- 0.17, N = 3SE +/- 0.36, N = 3SE +/- 0.56, N = 364.2064.2364.2464.4264.611. (CC) gcc options: -O3 -march=native

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetGCC 8.5GCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.48001600240032004000SE +/- 0.16, N = 3SE +/- 0.59, N = 3SE +/- 0.77, N = 3SE +/- 0.21, N = 3SE +/- 2.55, N = 33505.933508.293508.393524.753527.68MIN: 3487.54 / MAX: 3535.34MIN: 3489.27 / MAX: 3603.98MIN: 3486.98 / MAX: 3606.8MIN: 3509.67 / MAX: 3548.51MIN: 3508.67 / MAX: 3981.671. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYGCC 11.1GCC 9.4GCC 10.3GCC 12.0.0 20210701GCC 8.51326395265SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.00, N = 357.457.457.357.257.11. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYGCC 11.1GCC 10.3GCC 12.0.0 20210701GCC 9.4GCC 8.5918273645SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 0.00, N = 338.338.338.238.138.11. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUGCC 10.3GCC 9.4GCC 12.0.0 20210701GCC 11.1GCC 8.50.11890.23780.35670.47560.5945SE +/- 0.003324, N = 3SE +/- 0.003361, N = 3SE +/- 0.003225, N = 3SE +/- 0.003036, N = 3SE +/- 0.003001, N = 30.5260140.5267980.5279390.5281330.528416MIN: 0.5MIN: 0.5MIN: 0.5MIN: 0.5MIN: 0.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUGCC 9.4GCC 12.0.0 20210701GCC 11.1GCC 10.3GCC 8.50.10380.20760.31140.41520.519SE +/- 0.001062, N = 3SE +/- 0.001427, N = 3SE +/- 0.000687, N = 3SE +/- 0.001585, N = 3SE +/- 0.000631, N = 30.4594560.4596600.4599560.4604820.461459MIN: 0.45MIN: 0.45MIN: 0.45MIN: 0.45MIN: 0.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackGCC 12.0.0 20210701GCC 10.3GCC 11.1GCC 8.5GCC 9.43691215SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 513.3313.3413.3513.3613.381. (CXX) g++ options: -O3 -march=native -rdynamic

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptGCC 10.3GCC 11.1GCC 9.4GCC 12.0.0 20210701GCC 8.59001800270036004500SE +/- 0.87, N = 3SE +/- 0.79, N = 3SE +/- 0.39, N = 3SE +/- 4.17, N = 3SE +/- 1.28, N = 33998.483995.233993.893993.173991.201. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

libgav1

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.16.3Video Input: Chimera 1080p 10-bitGCC 12.0.0 20210701510152025SE +/- 0.01, N = 321.321. (CXX) g++ options: -O3 -march=native -lpthread -lrt

libgav1

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.16.3Video Input: Summer Nature 4KGCC 12.0.0 20210701714212835SE +/- 0.01, N = 328.241. (CXX) g++ options: -O3 -march=native -lpthread -lrt

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: yolov4-tinyGCC 8.5GCC 10.3GCC 9.4GCC 11.1GCC 12.0.0 20210701510152025SE +/- 0.25, N = 3SE +/- 0.17, N = 3SE +/- 0.19, N = 3SE +/- 0.10, N = 3SE +/- 1.86, N = 320.8421.2621.3521.3722.83MIN: 19.92 / MAX: 24.91MIN: 20 / MAX: 24.4MIN: 20.42 / MAX: 33.9MIN: 20.44 / MAX: 22.72MIN: 20.18 / MAX: 937.41. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: squeezenetv1.1GCC 10.3GCC 12.0.0 20210701GCC 9.4GCC 8.5GCC 11.11.0372.0743.1114.1485.185SE +/- 0.162, N = 3SE +/- 0.149, N = 3SE +/- 0.061, N = 15SE +/- 0.036, N = 3SE +/- 0.007, N = 34.2714.2834.4204.5644.609MIN: 3.97 / MAX: 4.72MIN: 3.97 / MAX: 4.71MIN: 3.98 / MAX: 4.76MIN: 4.42 / MAX: 4.75MIN: 4.51 / MAX: 4.781. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTGCC 12.0.0 20210701GCC 11.1GCC 9.4GCC 8.5GCC 10.31428425670SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 9.10, N = 363.763.763.763.654.71. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUGCC 8.5GCC 10.3GCC 11.1GCC 12.0.0 20210701GCC 9.4246810SE +/- 0.02602, N = 3SE +/- 0.03159, N = 3SE +/- 0.03513, N = 3SE +/- 0.03413, N = 3SE +/- 0.17046, N = 147.890327.901637.904777.912588.10038MIN: 7.58MIN: 7.61MIN: 7.56MIN: 7.63MIN: 7.581. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl


Phoronix Test Suite v10.8.4