5220R 2P Ubuntu EO 2020

2 x Intel Xeon Gold 5220R testing with a TYAN S7106 (V2.01.B40 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012214-HA-5220R2PUB13&grr&sor.

5220R 2P Ubuntu EO 2020ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1232 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads)TYAN S7106 (V2.01.B40 BIOS)Intel Sky Lake-E DMI3 Registers94GB500GB Samsung SSD 860llvmpipeVE2282 x Intel I210 + 2 x QLogic cLOM8214 1/10GbEUbuntu 20.045.9.0-050900rc6-generic (x86_64) 20200920GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.4 (LLVM 9.0.1 256 bits)GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

5220R 2P Ubuntu EO 2020build-clash: Time To Compilebrl-cad: VGR Performance Metrichmmer: Pfam Database Searchonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUapache-siege: 500onednn: Recurrent Neural Network Inference - f32 - CPUncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetbuild-eigen: Time To Compileapache-siege: 200node-web-tooling: build2: Time To Compileonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUsimdjson: Kostyasqlite-speedtest: Timed Time - Size 1,000simdjson: LargeRandapache-siege: 250simdjson: PartialTweetssimdjson: DistinctUserIDonednn: IP Shapes 3D - bf16bf16bf16 - CPUbuild-ffmpeg: Time To Compilecoremark: CoreMark Size 666 - Iterations Per Secondclomp: Static OMP Speedupencode-wavpack: WAV To WavPackapache-siege: 100onednn: IP Shapes 1D - u8s8f32 - CPUencode-ogg: WAV To Oggencode-ape: WAV To APEonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-opus: WAV To Opus Encodeonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 1D - f32 - CPUapache-siege: 50onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUmafft: Multiple Sequence Alignment - LSU RNAonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUapache-siege: 10onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPU123482.027253104224.3251941.011443.5951574.65819.45766.1823.4332.2626.9610.0413.8640.2820.584.8012.149.309.209.2610.1222.9185.76051905.5910.2579.0031457.16818.408818.4300.5666.0830.3945610.340.560.5814.3586930.5891096383.8909213216.78535458.431.3450323.21613.0827.658600.5625192.2848710.1835.692331.8136233348.710.3481590.5528511.4664511.8431.255373.924796.411387.059187.4703721723.419.508190.6948782.73594485.102223.7611431.411951.9148209.46828.06368.4724.2534.0628.409.8013.6739.9720.484.8512.569.468.829.0810.1923.1586.01644343.8410.2878.8581430.86817.920826.7850.5665.2970.3946010.130.570.583.6986430.5971088921.69768831.816.77936786.981.3627823.20213.0097.666160.5622482.2814110.1785.687731.8109433392.840.3414760.5538131.4585411.7281.272843.853286.416367.062787.4706321708.109.539010.6952542.73944484.600225.0861458.061434.1946008.38822.74267.5323.9432.4326.7710.8314.1240.8121.104.8312.689.619.089.2610.2923.1685.77741854.1810.3778.8021446.43832.759821.6430.5664.7920.3944492.560.560.583.6683730.5051093129.50814531.516.80835464.351.3438523.17113.1127.653640.5614082.2752410.2015.685001.8133333656.660.3471510.5572421.4559611.8591.248183.892966.429896.976017.4761721771.539.535080.6947242.73941OpenBenchmarking.org

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To Compile132100200300400500SE +/- 0.24, N = 3SE +/- 0.20, N = 3SE +/- 2.07, N = 3482.03484.60485.10

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric150K100K150K200K250K2531041. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search21350100150200250SE +/- 0.25, N = 3SE +/- 0.76, N = 3SE +/- 0.59, N = 3223.76224.33225.091. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU231400800120016002000SE +/- 11.16, N = 3SE +/- 21.58, N = 3SE +/- 387.00, N = 151431.411458.061941.01MIN: 1405.72MIN: 1412MIN: 1403.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU312400800120016002000SE +/- 2.21, N = 3SE +/- 10.07, N = 3SE +/- 463.22, N = 151434.191443.591951.91MIN: 1424.29MIN: 1421.98MIN: 1412.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 500

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 50012311K22K33K44K55KSE +/- 846.89, N = 12SE +/- 667.57, N = 4SE +/- 317.59, N = 351574.6548209.4646008.381. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1322004006008001000SE +/- 4.25, N = 3SE +/- 5.98, N = 3SE +/- 8.38, N = 8819.46822.74828.06MIN: 809.53MIN: 803.83MIN: 800.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m1321530456075SE +/- 1.00, N = 5SE +/- 1.92, N = 3SE +/- 0.15, N = 466.1867.5368.47MIN: 62.2 / MAX: 92.33MIN: 63.08 / MAX: 89.94MIN: 66.73 / MAX: 90.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd132612182430SE +/- 0.31, N = 5SE +/- 0.30, N = 3SE +/- 0.83, N = 423.4323.9424.25MIN: 22.42 / MAX: 104.2MIN: 23.29 / MAX: 59.23MIN: 22.74 / MAX: 27.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny132816243240SE +/- 0.89, N = 5SE +/- 0.98, N = 3SE +/- 1.23, N = 432.2632.4334.06MIN: 29.84 / MAX: 67.93MIN: 30.29 / MAX: 76.66MIN: 30.11 / MAX: 67.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50312714212835SE +/- 0.30, N = 3SE +/- 0.16, N = 5SE +/- 0.59, N = 426.7726.9628.40MIN: 24.83 / MAX: 63.26MIN: 25.86 / MAX: 79.69MIN: 26.69 / MAX: 109.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet2133691215SE +/- 0.21, N = 4SE +/- 0.34, N = 5SE +/- 0.06, N = 39.8010.0410.83MIN: 8.83 / MAX: 49.27MIN: 9.32 / MAX: 35.37MIN: 10.67 / MAX: 13.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1821348121620SE +/- 0.08, N = 4SE +/- 0.31, N = 5SE +/- 0.20, N = 313.6713.8614.12MIN: 13.41 / MAX: 15.51MIN: 13.25 / MAX: 25.23MIN: 13.65 / MAX: 14.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16213918273645SE +/- 0.52, N = 4SE +/- 0.64, N = 5SE +/- 0.82, N = 339.9740.2840.81MIN: 38.61 / MAX: 110.54MIN: 38.75 / MAX: 86.53MIN: 38.92 / MAX: 82.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet213510152025SE +/- 0.18, N = 4SE +/- 0.45, N = 5SE +/- 0.20, N = 320.4820.5821.10MIN: 19.94 / MAX: 21.74MIN: 19.51 / MAX: 58.18MIN: 20.64 / MAX: 39.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1321.09132.18263.27394.36525.4565SE +/- 0.06, N = 5SE +/- 0.06, N = 3SE +/- 0.01, N = 44.804.834.85MIN: 4.56 / MAX: 6.42MIN: 4.65 / MAX: 5.89MIN: 4.76 / MAX: 5.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01233691215SE +/- 0.27, N = 5SE +/- 0.11, N = 4SE +/- 0.23, N = 312.1412.5612.68MIN: 11.27 / MAX: 34.46MIN: 11.5 / MAX: 42.38MIN: 11.81 / MAX: 54.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1233691215SE +/- 0.12, N = 5SE +/- 0.17, N = 4SE +/- 0.10, N = 39.309.469.61MIN: 8.78 / MAX: 10.96MIN: 8.8 / MAX: 33.29MIN: 9.15 / MAX: 55.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v22313691215SE +/- 0.12, N = 4SE +/- 0.04, N = 3SE +/- 0.20, N = 58.829.089.20MIN: 8.54 / MAX: 10.49MIN: 8.94 / MAX: 10.72MIN: 8.56 / MAX: 10.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v32133691215SE +/- 0.18, N = 4SE +/- 0.21, N = 5SE +/- 0.07, N = 39.089.269.26MIN: 8.43 / MAX: 40.21MIN: 8.45 / MAX: 39.85MIN: 8.89 / MAX: 12.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.21, N = 5SE +/- 0.19, N = 4SE +/- 0.15, N = 310.1210.1910.29MIN: 9.37 / MAX: 25.56MIN: 9.48 / MAX: 31.67MIN: 9.67 / MAX: 32.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123612182430SE +/- 0.29, N = 5SE +/- 0.30, N = 4SE +/- 0.31, N = 322.9123.1523.16MIN: 22.04 / MAX: 24.82MIN: 22.12 / MAX: 25.31MIN: 22.45 / MAX: 26.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile13220406080100SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.09, N = 385.7685.7886.02

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 20012311K22K33K44K55KSE +/- 1707.01, N = 12SE +/- 560.13, N = 3SE +/- 391.74, N = 351905.5944343.8441854.181. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark3213691215SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 310.3710.2810.251. Nodejs v10.19.0

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile32120406080100SE +/- 0.25, N = 3SE +/- 0.49, N = 3SE +/- 0.52, N = 378.8078.8679.00

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU23130060090012001500SE +/- 5.02, N = 3SE +/- 8.18, N = 3SE +/- 13.34, N = 31430.861446.431457.16MIN: 1416.66MIN: 1408.27MIN: 1424.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU2132004006008001000SE +/- 1.69, N = 3SE +/- 0.70, N = 3SE +/- 8.52, N = 3817.92818.41832.76MIN: 807.14MIN: 811.75MIN: 800.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1322004006008001000SE +/- 1.82, N = 3SE +/- 3.85, N = 3SE +/- 9.62, N = 3818.43821.64826.79MIN: 805.26MIN: 810.14MIN: 799.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.1260.2520.3780.5040.63SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.560.561. (CXX) g++ options: -O3 -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0003211530456075SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.09, N = 364.7965.3066.081. (CC) gcc options: -O2 -ldl -lz -lpthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.391. (CXX) g++ options: -O3 -pthread

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 25021310K20K30K40K50KSE +/- 456.56, N = 3SE +/- 582.76, N = 3SE +/- 270.33, N = 346010.1345610.3444492.561. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets2310.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.570.560.561. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU32148121620SE +/- 0.08350, N = 15SE +/- 0.08058, N = 15SE +/- 5.73224, N = 123.668373.6986414.35869MIN: 2.7MIN: 2.71MIN: 2.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile312714212835SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 330.5130.5930.60

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second132200K400K600K800K1000KSE +/- 8504.83, N = 3SE +/- 6644.33, N = 3SE +/- 5763.94, N = 31096383.891093129.511088921.701. (CC) gcc options: -O2 -lrt" -lrt

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup123714212835SE +/- 0.09, N = 3SE +/- 0.09, N = 332.031.831.51. (CC) gcc options: -fopenmp -O3 -lm

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack21348121620SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 516.7816.7916.811. (CXX) g++ options: -rdynamic

Apache Siege

Concurrent Users: 100

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 1002318K16K24K32K40KSE +/- 279.56, N = 3SE +/- 243.60, N = 3SE +/- 423.42, N = 336786.9835464.3535458.431. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3120.30660.61320.91981.22641.533SE +/- 0.01635, N = 5SE +/- 0.01667, N = 5SE +/- 0.01651, N = 61.343851.345031.36278MIN: 1.21MIN: 1.21MIN: 1.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg321612182430SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 323.1723.2023.221. (CC) gcc options: -O2 -ffast-math -fsigned-char

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE2133691215SE +/- 0.02, N = 5SE +/- 0.04, N = 5SE +/- 0.04, N = 513.0113.0813.111. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU312246810SE +/- 0.00837, N = 3SE +/- 0.00487, N = 3SE +/- 0.01236, N = 37.653647.658607.66616MIN: 7.53MIN: 7.54MIN: 7.541. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU3210.12660.25320.37980.50640.633SE +/- 0.000912, N = 3SE +/- 0.000242, N = 3SE +/- 0.001025, N = 30.5614080.5622480.562519MIN: 0.53MIN: 0.54MIN: 0.541. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU3210.51411.02821.54232.05642.5705SE +/- 0.00603, N = 3SE +/- 0.00102, N = 3SE +/- 0.00407, N = 32.275242.281412.28487MIN: 2.21MIN: 2.21MIN: 2.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode2133691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.04, N = 510.1810.1810.201. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU3211.28082.56163.84245.12326.404SE +/- 0.01035, N = 3SE +/- 0.00725, N = 3SE +/- 0.00373, N = 35.685005.687735.69233MIN: 5.52MIN: 5.53MIN: 5.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2310.40810.81621.22431.63242.0405SE +/- 0.00137, N = 3SE +/- 0.00350, N = 3SE +/- 0.00667, N = 31.810941.813331.81362MIN: 1.7MIN: 1.72MIN: 1.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 50

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 503217K14K21K28K35KSE +/- 171.37, N = 3SE +/- 48.81, N = 3SE +/- 96.11, N = 333656.6633392.8433348.711. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2310.07830.15660.23490.31320.3915SE +/- 0.002266, N = 3SE +/- 0.003928, N = 3SE +/- 0.004840, N = 40.3414760.3471510.348159MIN: 0.31MIN: 0.31MIN: 0.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.12540.25080.37620.50160.627SE +/- 0.001999, N = 3SE +/- 0.001779, N = 3SE +/- 0.001173, N = 30.5528510.5538130.557242MIN: 0.53MIN: 0.53MIN: 0.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU3210.330.660.991.321.65SE +/- 0.00357, N = 3SE +/- 0.00103, N = 3SE +/- 0.00352, N = 31.455961.458541.46645MIN: 1.42MIN: 1.42MIN: 1.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA2133691215SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 311.7311.8411.861. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3120.28640.57280.85921.14561.432SE +/- 0.01312, N = 3SE +/- 0.01721, N = 3SE +/- 0.00361, N = 31.248181.255371.27284MIN: 1.15MIN: 1.13MIN: 1.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU2310.88311.76622.64933.53244.4155SE +/- 0.00730, N = 3SE +/- 0.00503, N = 3SE +/- 0.01612, N = 33.853283.892963.92479MIN: 3.79MIN: 3.82MIN: 3.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU123246810SE +/- 0.01793, N = 3SE +/- 0.00690, N = 3SE +/- 0.01957, N = 36.411386.416366.42989MIN: 6.3MIN: 6.31MIN: 6.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU312246810SE +/- 0.09282, N = 3SE +/- 0.01330, N = 3SE +/- 0.01028, N = 36.976017.059187.06278MIN: 2.5MIN: 6.97MIN: 6.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123246810SE +/- 0.01532, N = 3SE +/- 0.01810, N = 3SE +/- 0.01439, N = 37.470377.470637.47617MIN: 7.37MIN: 7.38MIN: 3.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 10

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 103125K10K15K20K25KSE +/- 96.00, N = 3SE +/- 15.72, N = 3SE +/- 68.54, N = 321771.5321723.4121708.101. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU1323691215SE +/- 0.00202, N = 3SE +/- 0.02255, N = 3SE +/- 0.02942, N = 39.508199.535089.53901MIN: 9.41MIN: 9.41MIN: 9.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3120.15640.31280.46920.62560.782SE +/- 0.001534, N = 3SE +/- 0.001664, N = 3SE +/- 0.000340, N = 30.6947240.6948780.695254MIN: 0.68MIN: 0.68MIN: 0.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1320.61641.23281.84922.46563.082SE +/- 0.00436, N = 3SE +/- 0.00193, N = 3SE +/- 0.00953, N = 32.735942.739412.73944MIN: 2.69MIN: 2.69MIN: 2.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.5