Xeon Gold 6226R December

Intel Xeon Gold 6226R testing with a Supermicro X11SPL-F v1.02 (3.1 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012200-HA-XEONGOLD615.

Xeon Gold 6226R DecemberProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon Gold 6226R @ 3.90GHz (16 Cores / 32 Threads)Supermicro X11SPL-F v1.02 (3.1 BIOS)Intel Sky Lake-E DMI3 Registers188GB3841GB Micron_9300_MTFDHAL3T8TDPllvmpipeVE2282 x Intel I210Ubuntu 20.045.9.0-050900rc6daily20200921-generic (x86_64) 20200920GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.8 (LLVM 10.0.0 256 bits)GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x5002f01Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Xeon Gold 6226R Decemberclomp: Static OMP Speeduphmmer: Pfam Database Searchmafft: Multiple Sequence Alignment - LSU RNAsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondbuild-ffmpeg: Time To Compilebuild2: Time To Compilebuild-eigen: Time To Compileencode-ape: WAV To APEnode-web-tooling: sqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mencode-wavpack: WAV To WavPackbrl-cad: VGR Performance Metric12326.8174.22710.6370.560.390.570.582.353583.148160.4962451.244915.613462.596104.343362.754973.211264.166560.5266920.8670401643.22922.9921643.599.4217411.291212.5274922.8740.9746241645.61921.6740.4864402.05419537952.81083743.49795.58085.52717.52810.7865.36317.776.075.215.945.567.702.9314.5629.6510.588.0720.2724.7216.7327.1916.73116435426.2174.16910.5900.560.390.570.582.350383.137070.4914591.247815.607042.588064.340822.762103.208254.162370.5272520.8606631645.08923.5361645.649.4230111.289112.5418923.3690.9738741643.67923.4380.4779512.05772535255.62588843.23695.68685.55117.53510.6965.85216.965.965.145.975.377.262.8812.9928.049.416.7318.9324.4016.4527.1316.75316371326.5174.44710.5490.560.390.570.582.349903.158350.4957331.243805.604772.613504.350802.756193.212784.174450.5264370.8628561647.12922.2701645.919.4227711.316212.5335923.3360.9770731644.83920.8630.4792162.05675534883.15508543.45495.52885.78417.54310.6865.59417.735.895.076.015.347.202.9013.0929.249.496.9920.3825.2216.9427.4116.779OpenBenchmarking.org

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup123612182430SE +/- 0.32, N = 3SE +/- 0.42, N = 3SE +/- 0.12, N = 326.826.226.51. (CC) gcc options: -fopenmp -O3 -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search1234080120160200SE +/- 0.36, N = 3SE +/- 0.41, N = 3SE +/- 0.20, N = 3174.23174.17174.451. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 310.6410.5910.551. (CC) gcc options: -std=c99 -O3 -lm -lpthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1260.2520.3780.5040.63SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.560.561. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.391. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.570.570.571. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.52961.05921.58882.11842.648SE +/- 0.00656, N = 3SE +/- 0.00411, N = 3SE +/- 0.00268, N = 32.353582.350382.34990MIN: 2.26MIN: 2.22MIN: 2.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1230.71061.42122.13182.84243.553SE +/- 0.01117, N = 3SE +/- 0.00567, N = 3SE +/- 0.00910, N = 33.148163.137073.15835MIN: 3.1MIN: 3.09MIN: 3.111. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.11170.22340.33510.44680.5585SE +/- 0.000595, N = 3SE +/- 0.000604, N = 3SE +/- 0.000320, N = 30.4962450.4914590.495733MIN: 0.47MIN: 0.47MIN: 0.471. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.28080.56160.84241.12321.404SE +/- 0.00268, N = 3SE +/- 0.00242, N = 3SE +/- 0.00126, N = 31.244911.247811.24380MIN: 1.2MIN: 1.2MIN: 1.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU1231.2632.5263.7895.0526.315SE +/- 0.01299, N = 3SE +/- 0.00948, N = 3SE +/- 0.00888, N = 35.613465.607045.60477MIN: 5.5MIN: 5.44MIN: 5.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU1230.5881.1761.7642.3522.94SE +/- 0.00587, N = 3SE +/- 0.00862, N = 3SE +/- 0.00522, N = 32.596102.588062.61350MIN: 2.51MIN: 2.53MIN: 2.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1230.97891.95782.93673.91564.8945SE +/- 0.01713, N = 3SE +/- 0.01378, N = 3SE +/- 0.01473, N = 34.343364.340824.35080MIN: 4.27MIN: 4.27MIN: 4.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.62151.2431.86452.4863.1075SE +/- 0.00248, N = 3SE +/- 0.00638, N = 3SE +/- 0.00159, N = 32.754972.762102.75619MIN: 2.69MIN: 2.69MIN: 2.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1230.72291.44582.16872.89163.6145SE +/- 0.00363, N = 3SE +/- 0.00574, N = 3SE +/- 0.01283, N = 33.211263.208253.21278MIN: 3.17MIN: 3.16MIN: 3.161. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1230.93931.87862.81793.75724.6965SE +/- 0.01549, N = 3SE +/- 0.01546, N = 3SE +/- 0.01564, N = 34.166564.162374.17445MIN: 4.09MIN: 4.1MIN: 4.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.11860.23720.35580.47440.593SE +/- 0.000759, N = 3SE +/- 0.000712, N = 3SE +/- 0.000172, N = 30.5266920.5272520.526437MIN: 0.51MIN: 0.51MIN: 0.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.19510.39020.58530.78040.9755SE +/- 0.007447, N = 3SE +/- 0.011319, N = 5SE +/- 0.008326, N = 30.8670400.8606630.862856MIN: 0.83MIN: 0.82MIN: 0.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU123400800120016002000SE +/- 0.66, N = 3SE +/- 0.79, N = 3SE +/- 4.18, N = 31643.221645.081647.12MIN: 1637.46MIN: 1639.15MIN: 1640.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1232004006008001000SE +/- 0.36, N = 3SE +/- 1.28, N = 3SE +/- 1.52, N = 3922.99923.54922.27MIN: 920.22MIN: 919.76MIN: 917.551. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU123400800120016002000SE +/- 2.60, N = 3SE +/- 4.86, N = 3SE +/- 1.22, N = 31643.591645.641645.91MIN: 1637.17MIN: 1635.99MIN: 1639.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU1233691215SE +/- 0.01674, N = 3SE +/- 0.00522, N = 3SE +/- 0.00825, N = 39.421749.423019.42277MIN: 9.07MIN: 9.08MIN: 9.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU1233691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.2911.2911.32MIN: 11.16MIN: 11.17MIN: 11.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.5312.5412.53MIN: 12.41MIN: 12.41MIN: 12.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1232004006008001000SE +/- 0.33, N = 3SE +/- 0.50, N = 3SE +/- 0.27, N = 3922.87923.37923.34MIN: 919.98MIN: 919.79MIN: 920.261. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.21980.43960.65940.87921.099SE +/- 0.001208, N = 3SE +/- 0.001190, N = 3SE +/- 0.001996, N = 30.9746240.9738740.977073MIN: 0.94MIN: 0.94MIN: 0.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 0.69, N = 3SE +/- 1.59, N = 3SE +/- 0.87, N = 31645.611643.671644.83MIN: 1641.21MIN: 1637.61MIN: 1640.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1232004006008001000SE +/- 0.31, N = 3SE +/- 1.61, N = 3SE +/- 0.78, N = 3921.67923.44920.86MIN: 918.58MIN: 918.55MIN: 916.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.10940.21880.32820.43760.547SE +/- 0.001972, N = 3SE +/- 0.005102, N = 3SE +/- 0.004438, N = 30.4864400.4779510.479216MIN: 0.47MIN: 0.46MIN: 0.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU1230.4630.9261.3891.8522.315SE +/- 0.00388, N = 3SE +/- 0.00591, N = 3SE +/- 0.00357, N = 32.054192.057722.05675MIN: 1.97MIN: 1.97MIN: 1.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second123120K240K360K480K600KSE +/- 746.08, N = 3SE +/- 1073.23, N = 3SE +/- 2681.32, N = 3537952.81535255.63534883.161. (CC) gcc options: -O2 -lrt" -lrt

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1231020304050SE +/- 0.15, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 343.5043.2443.45

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12320406080100SE +/- 0.24, N = 3SE +/- 0.25, N = 3SE +/- 0.15, N = 395.5895.6995.53

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.23, N = 385.5385.5585.78

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE12348121620SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 517.5317.5417.541. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 310.7810.6910.681. Nodejs v10.19.0

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.20, N = 365.3665.8565.591. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet12348121620SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.27, N = 317.7716.9617.73MIN: 17.45 / MAX: 18.07MIN: 16.84 / MAX: 18.68MIN: 16.96 / MAX: 19.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2123246810SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.075.965.89MIN: 5.65 / MAX: 23.59MIN: 5.64 / MAX: 9.74MIN: 5.64 / MAX: 9.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31231.17232.34463.51694.68925.8615SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 35.215.145.07MIN: 4.94 / MAX: 9.16MIN: 4.94 / MAX: 8.76MIN: 4.93 / MAX: 8.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2123246810SE +/- 0.01, N = 3SE +/- 0.04, N = 2SE +/- 0.03, N = 35.945.976.01MIN: 5.83 / MAX: 9.49MIN: 5.86 / MAX: 9.64MIN: 5.83 / MAX: 9.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1231.2512.5023.7535.0046.255SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 35.565.375.34MIN: 5.21 / MAX: 9.44MIN: 5.18 / MAX: 9.36MIN: 5.16 / MAX: 9.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0123246810SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.707.267.20MIN: 7.35 / MAX: 8.6MIN: 7.01 / MAX: 8.01MIN: 7 / MAX: 91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.65931.31861.97792.63723.2965SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32.932.882.90MIN: 2.85 / MAX: 3.18MIN: 2.82 / MAX: 3.69MIN: 2.83 / MAX: 4.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet12348121620SE +/- 0.46, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 314.5612.9913.09MIN: 13.58 / MAX: 15.21MIN: 12.91 / MAX: 13.36MIN: 12.88 / MAX: 15.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16123714212835SE +/- 0.50, N = 3SE +/- 0.03, N = 3SE +/- 0.36, N = 329.6528.0429.24MIN: 28.54 / MAX: 34.02MIN: 27.76 / MAX: 46.41MIN: 27.78 / MAX: 48.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet181233691215SE +/- 0.42, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 310.589.419.49MIN: 9.67 / MAX: 12.86MIN: 9.33 / MAX: 10.24MIN: 9.34 / MAX: 11.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.22, N = 38.076.736.99MIN: 8.01 / MAX: 9.86MIN: 6.67 / MAX: 8.33MIN: 6.66 / MAX: 23.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50123510152025SE +/- 0.51, N = 3SE +/- 0.08, N = 3SE +/- 0.52, N = 320.2718.9320.38MIN: 19.1 / MAX: 21.77MIN: 18.61 / MAX: 38.46MIN: 19.05 / MAX: 23.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123612182430SE +/- 0.51, N = 3SE +/- 0.38, N = 3SE +/- 0.35, N = 324.7224.4025.22MIN: 23.36 / MAX: 28.82MIN: 23.06 / MAX: 27.46MIN: 23.44 / MAX: 28.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd12348121620SE +/- 0.11, N = 3SE +/- 0.00, N = 3SE +/- 0.15, N = 316.7316.4516.94MIN: 16.43 / MAX: 17.04MIN: 16.33 / MAX: 19.11MIN: 16.44 / MAX: 191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m123612182430SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 327.1927.1327.41MIN: 26.8 / MAX: 29.86MIN: 26.7 / MAX: 28.12MIN: 26.75 / MAX: 31.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 516.7316.7516.781. (CXX) g++ options: -rdynamic

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1240K80K120K160K200K1643541637131. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm


Phoronix Test Suite v10.8.4