5220R 2P Ubuntu EO 2020

2 x Intel Xeon Gold 5220R testing with a TYAN S7106 (V2.01.B40 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012214-HA-5220R2PUB13&grr.

5220R 2P Ubuntu EO 2020ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1232 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads)TYAN S7106 (V2.01.B40 BIOS)Intel Sky Lake-E DMI3 Registers94GB500GB Samsung SSD 860llvmpipeVE2282 x Intel I210 + 2 x QLogic cLOM8214 1/10GbEUbuntu 20.045.9.0-050900rc6-generic (x86_64) 20200920GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.4 (LLVM 9.0.1 256 bits)GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

5220R 2P Ubuntu EO 2020build-clash: Time To Compilebrl-cad: VGR Performance Metrichmmer: Pfam Database Searchonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUapache-siege: 500onednn: Recurrent Neural Network Inference - f32 - CPUncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetbuild-eigen: Time To Compileapache-siege: 200node-web-tooling: build2: Time To Compileonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUsimdjson: Kostyasqlite-speedtest: Timed Time - Size 1,000simdjson: LargeRandapache-siege: 250simdjson: PartialTweetssimdjson: DistinctUserIDonednn: IP Shapes 3D - bf16bf16bf16 - CPUbuild-ffmpeg: Time To Compilecoremark: CoreMark Size 666 - Iterations Per Secondclomp: Static OMP Speedupencode-wavpack: WAV To WavPackapache-siege: 100onednn: IP Shapes 1D - u8s8f32 - CPUencode-ogg: WAV To Oggencode-ape: WAV To APEonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-opus: WAV To Opus Encodeonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 1D - f32 - CPUapache-siege: 50onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUmafft: Multiple Sequence Alignment - LSU RNAonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUapache-siege: 10onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPU123482.027253104224.3251941.011443.5951574.65819.45766.1823.4332.2626.9610.0413.8640.2820.584.8012.149.309.209.2610.1222.9185.76051905.5910.2579.0031457.16818.408818.4300.5666.0830.3945610.340.560.5814.3586930.5891096383.8909213216.78535458.431.3450323.21613.0827.658600.5625192.2848710.1835.692331.8136233348.710.3481590.5528511.4664511.8431.255373.924796.411387.059187.4703721723.419.508190.6948782.73594485.102223.7611431.411951.9148209.46828.06368.4724.2534.0628.409.8013.6739.9720.484.8512.569.468.829.0810.1923.1586.01644343.8410.2878.8581430.86817.920826.7850.5665.2970.3946010.130.570.583.6986430.5971088921.69768831.816.77936786.981.3627823.20213.0097.666160.5622482.2814110.1785.687731.8109433392.840.3414760.5538131.4585411.7281.272843.853286.416367.062787.4706321708.109.539010.6952542.73944484.600225.0861458.061434.1946008.38822.74267.5323.9432.4326.7710.8314.1240.8121.104.8312.689.619.089.2610.2923.1685.77741854.1810.3778.8021446.43832.759821.6430.5664.7920.3944492.560.560.583.6683730.5051093129.50814531.516.80835464.351.3438523.17113.1127.653640.5614082.2752410.2015.685001.8133333656.660.3471510.5572421.4559611.8591.248183.892966.429896.976017.4761721771.539.535080.6947242.73941OpenBenchmarking.org

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To Compile123100200300400500SE +/- 0.24, N = 3SE +/- 2.07, N = 3SE +/- 0.20, N = 3482.03485.10484.60

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric150K100K150K200K250K2531041. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12350100150200250SE +/- 0.76, N = 3SE +/- 0.25, N = 3SE +/- 0.59, N = 3224.33223.76225.091. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 387.00, N = 15SE +/- 11.16, N = 3SE +/- 21.58, N = 31941.011431.411458.06MIN: 1403.8MIN: 1405.72MIN: 14121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU123400800120016002000SE +/- 10.07, N = 3SE +/- 463.22, N = 15SE +/- 2.21, N = 31443.591951.911434.19MIN: 1421.98MIN: 1412.46MIN: 1424.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 500

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 50012311K22K33K44K55KSE +/- 846.89, N = 12SE +/- 667.57, N = 4SE +/- 317.59, N = 351574.6548209.4646008.381. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1232004006008001000SE +/- 4.25, N = 3SE +/- 8.38, N = 8SE +/- 5.98, N = 3819.46828.06822.74MIN: 809.53MIN: 800.64MIN: 803.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m1231530456075SE +/- 1.00, N = 5SE +/- 0.15, N = 4SE +/- 1.92, N = 366.1868.4767.53MIN: 62.2 / MAX: 92.33MIN: 66.73 / MAX: 90.49MIN: 63.08 / MAX: 89.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123612182430SE +/- 0.31, N = 5SE +/- 0.83, N = 4SE +/- 0.30, N = 323.4324.2523.94MIN: 22.42 / MAX: 104.2MIN: 22.74 / MAX: 27.15MIN: 23.29 / MAX: 59.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123816243240SE +/- 0.89, N = 5SE +/- 1.23, N = 4SE +/- 0.98, N = 332.2634.0632.43MIN: 29.84 / MAX: 67.93MIN: 30.11 / MAX: 67.85MIN: 30.29 / MAX: 76.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50123714212835SE +/- 0.16, N = 5SE +/- 0.59, N = 4SE +/- 0.30, N = 326.9628.4026.77MIN: 25.86 / MAX: 79.69MIN: 26.69 / MAX: 109.07MIN: 24.83 / MAX: 63.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1233691215SE +/- 0.34, N = 5SE +/- 0.21, N = 4SE +/- 0.06, N = 310.049.8010.83MIN: 9.32 / MAX: 35.37MIN: 8.83 / MAX: 49.27MIN: 10.67 / MAX: 13.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1812348121620SE +/- 0.31, N = 5SE +/- 0.08, N = 4SE +/- 0.20, N = 313.8613.6714.12MIN: 13.25 / MAX: 25.23MIN: 13.41 / MAX: 15.51MIN: 13.65 / MAX: 14.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16123918273645SE +/- 0.64, N = 5SE +/- 0.52, N = 4SE +/- 0.82, N = 340.2839.9740.81MIN: 38.75 / MAX: 86.53MIN: 38.61 / MAX: 110.54MIN: 38.92 / MAX: 82.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123510152025SE +/- 0.45, N = 5SE +/- 0.18, N = 4SE +/- 0.20, N = 320.5820.4821.10MIN: 19.51 / MAX: 58.18MIN: 19.94 / MAX: 21.74MIN: 20.64 / MAX: 39.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1231.09132.18263.27394.36525.4565SE +/- 0.06, N = 5SE +/- 0.01, N = 4SE +/- 0.06, N = 34.804.854.83MIN: 4.56 / MAX: 6.42MIN: 4.76 / MAX: 5.32MIN: 4.65 / MAX: 5.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01233691215SE +/- 0.27, N = 5SE +/- 0.11, N = 4SE +/- 0.23, N = 312.1412.5612.68MIN: 11.27 / MAX: 34.46MIN: 11.5 / MAX: 42.38MIN: 11.81 / MAX: 54.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1233691215SE +/- 0.12, N = 5SE +/- 0.17, N = 4SE +/- 0.10, N = 39.309.469.61MIN: 8.78 / MAX: 10.96MIN: 8.8 / MAX: 33.29MIN: 9.15 / MAX: 55.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21233691215SE +/- 0.20, N = 5SE +/- 0.12, N = 4SE +/- 0.04, N = 39.208.829.08MIN: 8.56 / MAX: 10.92MIN: 8.54 / MAX: 10.49MIN: 8.94 / MAX: 10.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31233691215SE +/- 0.21, N = 5SE +/- 0.18, N = 4SE +/- 0.07, N = 39.269.089.26MIN: 8.45 / MAX: 39.85MIN: 8.43 / MAX: 40.21MIN: 8.89 / MAX: 12.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.21, N = 5SE +/- 0.19, N = 4SE +/- 0.15, N = 310.1210.1910.29MIN: 9.37 / MAX: 25.56MIN: 9.48 / MAX: 31.67MIN: 9.67 / MAX: 32.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123612182430SE +/- 0.29, N = 5SE +/- 0.30, N = 4SE +/- 0.31, N = 322.9123.1523.16MIN: 22.04 / MAX: 24.82MIN: 22.12 / MAX: 25.31MIN: 22.45 / MAX: 26.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 385.7686.0285.78

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 20012311K22K33K44K55KSE +/- 1707.01, N = 12SE +/- 560.13, N = 3SE +/- 391.74, N = 351905.5944343.8441854.181. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 310.2510.2810.371. Nodejs v10.19.0

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12320406080100SE +/- 0.52, N = 3SE +/- 0.49, N = 3SE +/- 0.25, N = 379.0078.8678.80

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12330060090012001500SE +/- 13.34, N = 3SE +/- 5.02, N = 3SE +/- 8.18, N = 31457.161430.861446.43MIN: 1424.17MIN: 1416.66MIN: 1408.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1232004006008001000SE +/- 0.70, N = 3SE +/- 1.69, N = 3SE +/- 8.52, N = 3818.41817.92832.76MIN: 811.75MIN: 807.14MIN: 800.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1232004006008001000SE +/- 1.82, N = 3SE +/- 9.62, N = 3SE +/- 3.85, N = 3818.43826.79821.64MIN: 805.26MIN: 799.56MIN: 810.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1260.2520.3780.5040.63SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.560.561. (CXX) g++ options: -O3 -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 366.0865.3064.791. (CC) gcc options: -O2 -ldl -lz -lpthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.391. (CXX) g++ options: -O3 -pthread

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 25012310K20K30K40K50KSE +/- 582.76, N = 3SE +/- 456.56, N = 3SE +/- 270.33, N = 345610.3446010.1344492.561. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.570.561. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU12348121620SE +/- 5.73224, N = 12SE +/- 0.08058, N = 15SE +/- 0.08350, N = 1514.358693.698643.66837MIN: 2.72MIN: 2.71MIN: 2.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123714212835SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 330.5930.6030.51

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second123200K400K600K800K1000KSE +/- 8504.83, N = 3SE +/- 5763.94, N = 3SE +/- 6644.33, N = 31096383.891088921.701093129.511. (CC) gcc options: -O2 -lrt" -lrt

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup123714212835SE +/- 0.09, N = 3SE +/- 0.09, N = 332.031.831.51. (CC) gcc options: -fopenmp -O3 -lm

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 516.7916.7816.811. (CXX) g++ options: -rdynamic

Apache Siege

Concurrent Users: 100

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 1001238K16K24K32K40KSE +/- 423.42, N = 3SE +/- 279.56, N = 3SE +/- 243.60, N = 335458.4336786.9835464.351. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.30660.61320.91981.22641.533SE +/- 0.01667, N = 5SE +/- 0.01651, N = 6SE +/- 0.01635, N = 51.345031.362781.34385MIN: 1.21MIN: 1.23MIN: 1.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg123612182430SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 323.2223.2023.171. (CC) gcc options: -O2 -ffast-math -fsigned-char

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1233691215SE +/- 0.04, N = 5SE +/- 0.02, N = 5SE +/- 0.04, N = 513.0813.0113.111. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU123246810SE +/- 0.00487, N = 3SE +/- 0.01236, N = 3SE +/- 0.00837, N = 37.658607.666167.65364MIN: 7.54MIN: 7.54MIN: 7.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.12660.25320.37980.50640.633SE +/- 0.001025, N = 3SE +/- 0.000242, N = 3SE +/- 0.000912, N = 30.5625190.5622480.561408MIN: 0.54MIN: 0.54MIN: 0.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.51411.02821.54232.05642.5705SE +/- 0.00407, N = 3SE +/- 0.00102, N = 3SE +/- 0.00603, N = 32.284872.281412.27524MIN: 2.21MIN: 2.21MIN: 2.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.04, N = 510.1810.1810.201. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU1231.28082.56163.84245.12326.404SE +/- 0.00373, N = 3SE +/- 0.00725, N = 3SE +/- 0.01035, N = 35.692335.687735.68500MIN: 5.51MIN: 5.53MIN: 5.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.40810.81621.22431.63242.0405SE +/- 0.00667, N = 3SE +/- 0.00137, N = 3SE +/- 0.00350, N = 31.813621.810941.81333MIN: 1.72MIN: 1.7MIN: 1.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 50

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 501237K14K21K28K35KSE +/- 96.11, N = 3SE +/- 48.81, N = 3SE +/- 171.37, N = 333348.7133392.8433656.661. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.07830.15660.23490.31320.3915SE +/- 0.004840, N = 4SE +/- 0.002266, N = 3SE +/- 0.003928, N = 30.3481590.3414760.347151MIN: 0.31MIN: 0.31MIN: 0.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.12540.25080.37620.50160.627SE +/- 0.001999, N = 3SE +/- 0.001779, N = 3SE +/- 0.001173, N = 30.5528510.5538130.557242MIN: 0.53MIN: 0.53MIN: 0.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU1230.330.660.991.321.65SE +/- 0.00352, N = 3SE +/- 0.00103, N = 3SE +/- 0.00357, N = 31.466451.458541.45596MIN: 1.41MIN: 1.42MIN: 1.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 311.8411.7311.861. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.28640.57280.85921.14561.432SE +/- 0.01721, N = 3SE +/- 0.00361, N = 3SE +/- 0.01312, N = 31.255371.272841.24818MIN: 1.13MIN: 1.14MIN: 1.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1230.88311.76622.64933.53244.4155SE +/- 0.01612, N = 3SE +/- 0.00730, N = 3SE +/- 0.00503, N = 33.924793.853283.89296MIN: 3.84MIN: 3.79MIN: 3.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU123246810SE +/- 0.01793, N = 3SE +/- 0.00690, N = 3SE +/- 0.01957, N = 36.411386.416366.42989MIN: 6.3MIN: 6.31MIN: 6.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01330, N = 3SE +/- 0.01028, N = 3SE +/- 0.09282, N = 37.059187.062786.97601MIN: 6.97MIN: 6.97MIN: 2.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123246810SE +/- 0.01532, N = 3SE +/- 0.01810, N = 3SE +/- 0.01439, N = 37.470377.470637.47617MIN: 7.37MIN: 7.38MIN: 3.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Apache Siege

Concurrent Users: 10

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 101235K10K15K20K25KSE +/- 15.72, N = 3SE +/- 68.54, N = 3SE +/- 96.00, N = 321723.4121708.1021771.531. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU1233691215SE +/- 0.00202, N = 3SE +/- 0.02942, N = 3SE +/- 0.02255, N = 39.508199.539019.53508MIN: 9.41MIN: 9.4MIN: 9.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.15640.31280.46920.62560.782SE +/- 0.001664, N = 3SE +/- 0.000340, N = 3SE +/- 0.001534, N = 30.6948780.6952540.694724MIN: 0.68MIN: 0.68MIN: 0.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1230.61641.23281.84922.46563.082SE +/- 0.00436, N = 3SE +/- 0.00953, N = 3SE +/- 0.00193, N = 32.735942.739442.73941MIN: 2.69MIN: 2.69MIN: 2.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.5