5220R 2P Ubuntu EO 2020

2 x Intel Xeon Gold 5220R testing with a TYAN S7106 (V2.01.B40 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012214-HA-5220R2PUB13&grs.

5220R 2P Ubuntu EO 2020ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1232 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads)TYAN S7106 (V2.01.B40 BIOS)Intel Sky Lake-E DMI3 Registers94GB500GB Samsung SSD 860llvmpipeVE2282 x Intel I210 + 2 x QLogic cLOM8214 1/10GbEUbuntu 20.045.9.0-050900rc6-generic (x86_64) 20200920GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.4 (LLVM 9.0.1 256 bits)GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

5220R 2P Ubuntu EO 2020apache-siege: 500ncnn: CPU - resnet50ncnn: CPU - efficientnet-b0ncnn: CPU - shufflenet-v2apache-siege: 100ncnn: CPU - regnety_400mapache-siege: 250ncnn: CPU - mnasnetncnn: CPU - resnet18ncnn: CPU - googlenetncnn: CPU - vgg16sqlite-speedtest: Timed Time - Size 1,000ncnn: CPU-v3-v3 - mobilenet-v3onednn: IP Shapes 3D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUsimdjson: PartialTweetsncnn: CPU-v2-v2 - mobilenet-v2clomp: Static OMP Speeduponednn: IP Shapes 1D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUnode-web-tooling: mafft: Multiple Sequence Alignment - LSU RNAncnn: CPU - mobilenetonednn: Recurrent Neural Network Inference - f32 - CPUncnn: CPU - blazefaceonednn: Recurrent Neural Network Inference - u8s8f32 - CPUapache-siege: 50onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUencode-ape: WAV To APEonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondbuild-clash: Time To Compilehmmer: Pfam Database Searchonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUbuild-ffmpeg: Time To Compilebuild-eigen: Time To Compileapache-siege: 10onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUbuild2: Time To Compileencode-opus: WAV To Opus Encodeonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUencode-ogg: WAV To Oggencode-wavpack: WAV To WavPackonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUbrl-cad: VGR Performance Metricsimdjson: DistinctUserIDsimdjson: LargeRandsimdjson: Kostyaapache-siege: 200ncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - alexnetonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPU12351574.6526.9612.149.2035458.4366.1845610.349.3013.8620.5840.2866.0839.261.255370.3481593.924791457.16818.4080.5610.12321.345037.0591810.2511.84322.91819.4574.80818.43033348.710.55285113.0821.466451096383.890921482.027224.3252.284879.5081930.58985.76021723.416.4113879.00310.1830.56251923.21616.7857.658601.813625.692332.735947.470370.6948782531040.580.390.5651905.5923.4332.2610.041941.011443.5914.3586948209.4628.4012.568.8236786.9868.4746010.139.4613.6720.4839.9765.2979.081.272840.3414763.853281430.86817.9200.5710.1931.81.362787.0627810.2811.72823.15828.0634.85826.78533392.840.55381313.0091.458541088921.697688485.102223.7612.281419.5390130.59786.01621708.106.4163678.85810.1780.56224823.20216.7797.666161.810945.687732.739447.470630.6952540.580.390.5644343.8424.2534.069.801431.411951.913.6986446008.3826.7712.689.0835464.3567.5344492.569.6114.1221.1040.8164.7929.261.248180.3471513.892961446.43832.7590.5610.2931.51.343856.9760110.3711.85923.16822.7424.83821.64333656.660.55724213.1121.455961093129.508145484.600225.0862.275249.5350830.50585.77721771.536.4298978.80210.2010.56140823.17116.8087.653641.813335.685002.739417.476170.6947240.580.390.5641854.1823.9432.4310.831458.061434.193.66837OpenBenchmarking.org

Apache Siege

Concurrent Users: 500

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 50012311K22K33K44K55KSE +/- 846.89, N = 12SE +/- 667.57, N = 4SE +/- 317.59, N = 351574.6548209.4646008.381. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50123714212835SE +/- 0.16, N = 5SE +/- 0.59, N = 4SE +/- 0.30, N = 326.9628.4026.77MIN: 25.86 / MAX: 79.69MIN: 26.69 / MAX: 109.07MIN: 24.83 / MAX: 63.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01233691215SE +/- 0.27, N = 5SE +/- 0.11, N = 4SE +/- 0.23, N = 312.1412.5612.68MIN: 11.27 / MAX: 34.46MIN: 11.5 / MAX: 42.38MIN: 11.81 / MAX: 54.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21233691215SE +/- 0.20, N = 5SE +/- 0.12, N = 4SE +/- 0.04, N = 39.208.829.08MIN: 8.56 / MAX: 10.92MIN: 8.54 / MAX: 10.49MIN: 8.94 / MAX: 10.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache Siege

Concurrent Users: 100

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 1001238K16K24K32K40KSE +/- 423.42, N = 3SE +/- 279.56, N = 3SE +/- 243.60, N = 335458.4336786.9835464.351. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m1231530456075SE +/- 1.00, N = 5SE +/- 0.15, N = 4SE +/- 1.92, N = 366.1868.4767.53MIN: 62.2 / MAX: 92.33MIN: 66.73 / MAX: 90.49MIN: 63.08 / MAX: 89.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 25012310K20K30K40K50KSE +/- 582.76, N = 3SE +/- 456.56, N = 3SE +/- 270.33, N = 345610.3446010.1344492.561. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1233691215SE +/- 0.12, N = 5SE +/- 0.17, N = 4SE +/- 0.10, N = 39.309.469.61MIN: 8.78 / MAX: 10.96MIN: 8.8 / MAX: 33.29MIN: 9.15 / MAX: 55.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1812348121620SE +/- 0.31, N = 5SE +/- 0.08, N = 4SE +/- 0.20, N = 313.8613.6714.12MIN: 13.25 / MAX: 25.23MIN: 13.41 / MAX: 15.51MIN: 13.65 / MAX: 14.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123510152025SE +/- 0.45, N = 5SE +/- 0.18, N = 4SE +/- 0.20, N = 320.5820.4821.10MIN: 19.51 / MAX: 58.18MIN: 19.94 / MAX: 21.74MIN: 20.64 / MAX: 39.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16123918273645SE +/- 0.64, N = 5SE +/- 0.52, N = 4SE +/- 0.82, N = 340.2839.9740.81MIN: 38.75 / MAX: 86.53MIN: 38.61 / MAX: 110.54MIN: 38.92 / MAX: 82.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 366.0865.3064.791. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31233691215SE +/- 0.21, N = 5SE +/- 0.18, N = 4SE +/- 0.07, N = 39.269.089.26MIN: 8.45 / MAX: 39.85MIN: 8.43 / MAX: 40.21MIN: 8.89 / MAX: 12.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.28640.57280.85921.14561.432SE +/- 0.01721, N = 3SE +/- 0.00361, N = 3SE +/- 0.01312, N = 31.255371.272841.24818MIN: 1.13MIN: 1.14MIN: 1.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.07830.15660.23490.31320.3915SE +/- 0.004840, N = 4SE +/- 0.002266, N = 3SE +/- 0.003928, N = 30.3481590.3414760.347151MIN: 0.31MIN: 0.31MIN: 0.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1230.88311.76622.64933.53244.4155SE +/- 0.01612, N = 3SE +/- 0.00730, N = 3SE +/- 0.00503, N = 33.924793.853283.89296MIN: 3.84MIN: 3.79MIN: 3.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12330060090012001500SE +/- 13.34, N = 3SE +/- 5.02, N = 3SE +/- 8.18, N = 31457.161430.861446.43MIN: 1424.17MIN: 1416.66MIN: 1408.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1232004006008001000SE +/- 0.70, N = 3SE +/- 1.69, N = 3SE +/- 8.52, N = 3818.41817.92832.76MIN: 811.75MIN: 807.14MIN: 800.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.570.561. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.21, N = 5SE +/- 0.19, N = 4SE +/- 0.15, N = 310.1210.1910.29MIN: 9.37 / MAX: 25.56MIN: 9.48 / MAX: 31.67MIN: 9.67 / MAX: 32.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup123714212835SE +/- 0.09, N = 3SE +/- 0.09, N = 332.031.831.51. (CC) gcc options: -fopenmp -O3 -lm

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.30660.61320.91981.22641.533SE +/- 0.01667, N = 5SE +/- 0.01651, N = 6SE +/- 0.01635, N = 51.345031.362781.34385MIN: 1.21MIN: 1.23MIN: 1.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01330, N = 3SE +/- 0.01028, N = 3SE +/- 0.09282, N = 37.059187.062786.97601MIN: 6.97MIN: 6.97MIN: 2.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 310.2510.2810.371. Nodejs v10.19.0

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 311.8411.7311.861. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123612182430SE +/- 0.29, N = 5SE +/- 0.30, N = 4SE +/- 0.31, N = 322.9123.1523.16MIN: 22.04 / MAX: 24.82MIN: 22.12 / MAX: 25.31MIN: 22.45 / MAX: 26.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1232004006008001000SE +/- 4.25, N = 3SE +/- 8.38, N = 8SE +/- 5.98, N = 3819.46828.06822.74MIN: 809.53MIN: 800.64MIN: 803.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1231.09132.18263.27394.36525.4565SE +/- 0.06, N = 5SE +/- 0.01, N = 4SE +/- 0.06, N = 34.804.854.83MIN: 4.56 / MAX: 6.42MIN: 4.76 / MAX: 5.32MIN: 4.65 / MAX: 5.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1232004006008001000SE +/- 1.82, N = 3SE +/- 9.62, N = 3SE +/- 3.85, N = 3818.43826.79821.64MIN: 805.26MIN: 799.56MIN: 810.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 50

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 501237K14K21K28K35KSE +/- 96.11, N = 3SE +/- 48.81, N = 3SE +/- 171.37, N = 333348.7133392.8433656.661. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.12540.25080.37620.50160.627SE +/- 0.001999, N = 3SE +/- 0.001779, N = 3SE +/- 0.001173, N = 30.5528510.5538130.557242MIN: 0.53MIN: 0.53MIN: 0.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1233691215SE +/- 0.04, N = 5SE +/- 0.02, N = 5SE +/- 0.04, N = 513.0813.0113.111. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU1230.330.660.991.321.65SE +/- 0.00352, N = 3SE +/- 0.00103, N = 3SE +/- 0.00357, N = 31.466451.458541.45596MIN: 1.41MIN: 1.42MIN: 1.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second123200K400K600K800K1000KSE +/- 8504.83, N = 3SE +/- 5763.94, N = 3SE +/- 6644.33, N = 31096383.891088921.701093129.511. (CC) gcc options: -O2 -lrt" -lrt

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To Compile123100200300400500SE +/- 0.24, N = 3SE +/- 2.07, N = 3SE +/- 0.20, N = 3482.03485.10484.60

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12350100150200250SE +/- 0.76, N = 3SE +/- 0.25, N = 3SE +/- 0.59, N = 3224.33223.76225.091. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.51411.02821.54232.05642.5705SE +/- 0.00407, N = 3SE +/- 0.00102, N = 3SE +/- 0.00603, N = 32.284872.281412.27524MIN: 2.21MIN: 2.21MIN: 2.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU1233691215SE +/- 0.00202, N = 3SE +/- 0.02942, N = 3SE +/- 0.02255, N = 39.508199.539019.53508MIN: 9.41MIN: 9.4MIN: 9.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123714212835SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 330.5930.6030.51

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 385.7686.0285.78

Apache Siege

Concurrent Users: 10

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 101235K10K15K20K25KSE +/- 15.72, N = 3SE +/- 68.54, N = 3SE +/- 96.00, N = 321723.4121708.1021771.531. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU123246810SE +/- 0.01793, N = 3SE +/- 0.00690, N = 3SE +/- 0.01957, N = 36.411386.416366.42989MIN: 6.3MIN: 6.31MIN: 6.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12320406080100SE +/- 0.52, N = 3SE +/- 0.49, N = 3SE +/- 0.25, N = 379.0078.8678.80

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.04, N = 510.1810.1810.201. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.12660.25320.37980.50640.633SE +/- 0.001025, N = 3SE +/- 0.000242, N = 3SE +/- 0.000912, N = 30.5625190.5622480.561408MIN: 0.54MIN: 0.54MIN: 0.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg123612182430SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 323.2223.2023.171. (CC) gcc options: -O2 -ffast-math -fsigned-char

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 516.7916.7816.811. (CXX) g++ options: -rdynamic

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU123246810SE +/- 0.00487, N = 3SE +/- 0.01236, N = 3SE +/- 0.00837, N = 37.658607.666167.65364MIN: 7.54MIN: 7.54MIN: 7.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.40810.81621.22431.63242.0405SE +/- 0.00667, N = 3SE +/- 0.00137, N = 3SE +/- 0.00350, N = 31.813621.810941.81333MIN: 1.72MIN: 1.7MIN: 1.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU1231.28082.56163.84245.12326.404SE +/- 0.00373, N = 3SE +/- 0.00725, N = 3SE +/- 0.01035, N = 35.692335.687735.68500MIN: 5.51MIN: 5.53MIN: 5.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1230.61641.23281.84922.46563.082SE +/- 0.00436, N = 3SE +/- 0.00953, N = 3SE +/- 0.00193, N = 32.735942.739442.73941MIN: 2.69MIN: 2.69MIN: 2.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123246810SE +/- 0.01532, N = 3SE +/- 0.01810, N = 3SE +/- 0.01439, N = 37.470377.470637.47617MIN: 7.37MIN: 7.38MIN: 3.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.15640.31280.46920.62560.782SE +/- 0.001664, N = 3SE +/- 0.000340, N = 3SE +/- 0.001534, N = 30.6948780.6952540.694724MIN: 0.68MIN: 0.68MIN: 0.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric150K100K150K200K250K2531041. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.391. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1260.2520.3780.5040.63SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.560.561. (CXX) g++ options: -O3 -pthread

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 20012311K22K33K44K55KSE +/- 1707.01, N = 12SE +/- 560.13, N = 3SE +/- 391.74, N = 351905.5944343.8441854.181. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123612182430SE +/- 0.31, N = 5SE +/- 0.83, N = 4SE +/- 0.30, N = 323.4324.2523.94MIN: 22.42 / MAX: 104.2MIN: 22.74 / MAX: 27.15MIN: 23.29 / MAX: 59.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123816243240SE +/- 0.89, N = 5SE +/- 1.23, N = 4SE +/- 0.98, N = 332.2634.0632.43MIN: 29.84 / MAX: 67.93MIN: 30.11 / MAX: 67.85MIN: 30.29 / MAX: 76.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1233691215SE +/- 0.34, N = 5SE +/- 0.21, N = 4SE +/- 0.06, N = 310.049.8010.83MIN: 9.32 / MAX: 35.37MIN: 8.83 / MAX: 49.27MIN: 10.67 / MAX: 13.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 387.00, N = 15SE +/- 11.16, N = 3SE +/- 21.58, N = 31941.011431.411458.06MIN: 1403.8MIN: 1405.72MIN: 14121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU123400800120016002000SE +/- 10.07, N = 3SE +/- 463.22, N = 15SE +/- 2.21, N = 31443.591951.911434.19MIN: 1421.98MIN: 1412.46MIN: 1424.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU12348121620SE +/- 5.73224, N = 12SE +/- 0.08058, N = 15SE +/- 0.08350, N = 1514.358693.698643.66837MIN: 2.72MIN: 2.71MIN: 2.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.5