Xeon Gold 6226R December

Intel Xeon Gold 6226R testing with a Supermicro X11SPL-F v1.02 (3.1 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012200-HA-XEONGOLD615&grr&sro&rro.

Xeon Gold 6226R December: system details (runs 1, 2, 3)

Processor: Intel Xeon Gold 6226R @ 3.90GHz (16 Cores / 32 Threads)
Motherboard: Supermicro X11SPL-F v1.02 (3.1 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 188GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: llvmpipe
Monitor: VE228
Network: 2 x Intel I210
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc6daily20200921-generic (x86_64) 20200920
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 3.3 Mesa 20.0.8 (LLVM 10.0.0 256 bits)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x5002f01

Security Details: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Xeon Gold 6226R December: results overview

Test | 1 | 2 | 3
brl-cad: VGR Performance Metric | 164354 | 163713 | -
hmmer: Pfam Database Search | 174.227 | 174.169 | 174.447
build2: Time To Compile | 95.580 | 95.686 | 95.528
build-eigen: Time To Compile | 85.527 | 85.551 | 85.784
node-web-tooling: | 10.78 | 10.69 | 10.68
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 1645.61 | 1643.67 | 1644.83
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 1643.59 | 1645.64 | 1645.91
onednn: Recurrent Neural Network Training - f32 - CPU | 1643.22 | 1645.08 | 1647.12
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 922.874 | 923.369 | 923.336
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 921.674 | 923.438 | 920.863
onednn: Recurrent Neural Network Inference - f32 - CPU | 922.992 | 923.536 | 922.270
simdjson: Kostya | 0.56 | 0.56 | 0.56
sqlite-speedtest: Timed Time - Size 1,000 | 65.363 | 65.852 | 65.594
simdjson: LargeRand | 0.39 | 0.39 | 0.39
simdjson: PartialTweets | 0.57 | 0.57 | 0.57
simdjson: DistinctUserID | 0.58 | 0.58 | 0.58
ncnn: CPU - regnety_400m | 27.19 | 27.13 | 27.41
ncnn: CPU - squeezenet_ssd | 16.73 | 16.45 | 16.94
ncnn: CPU - yolov4-tiny | 24.72 | 24.40 | 25.22
ncnn: CPU - resnet50 | 20.27 | 18.93 | 20.38
ncnn: CPU - alexnet | 8.07 | 6.73 | 6.99
ncnn: CPU - resnet18 | 10.58 | 9.41 | 9.49
ncnn: CPU - vgg16 | 29.65 | 28.04 | 29.24
ncnn: CPU - googlenet | 14.56 | 12.99 | 13.09
ncnn: CPU - blazeface | 2.93 | 2.88 | 2.90
ncnn: CPU - efficientnet-b0 | 7.70 | 7.26 | 7.20
ncnn: CPU - mnasnet | 5.56 | 5.37 | 5.34
ncnn: CPU - shufflenet-v2 | 5.94 | 5.97 | 6.01
ncnn: CPU-v3-v3 - mobilenet-v3 | 5.21 | 5.14 | 5.07
ncnn: CPU-v2-v2 - mobilenet-v2 | 6.07 | 5.96 | 5.89
ncnn: CPU - mobilenet | 17.77 | 16.96 | 17.73
build-ffmpeg: Time To Compile | 43.497 | 43.236 | 43.454
encode-ape: WAV To APE | 17.528 | 17.535 | 17.543
encode-wavpack: WAV To WavPack | 16.731 | 16.753 | 16.779
coremark: CoreMark Size 666 - Iterations Per Second | 537952.810837 | 535255.625888 | 534883.155085
clomp: Static OMP Speedup | 26.8 | 26.2 | 26.5
onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU | 11.2912 | 11.2891 | 11.3162
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 0.526692 | 0.527252 | 0.526437
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 2.75497 | 2.76210 | 2.75619
onednn: IP Shapes 1D - bf16bf16bf16 - CPU | 5.61346 | 5.60704 | 5.60477
onednn: IP Shapes 1D - f32 - CPU | 2.35358 | 2.35038 | 2.34990
onednn: IP Shapes 1D - u8s8f32 - CPU | 0.496245 | 0.491459 | 0.495733
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.974624 | 0.973874 | 0.977073
onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU | 2.05419 | 2.05772 | 2.05675
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.486440 | 0.477951 | 0.479216
mafft: Multiple Sequence Alignment - LSU RNA | 10.637 | 10.590 | 10.549
onednn: IP Shapes 3D - bf16bf16bf16 - CPU | 2.59610 | 2.58806 | 2.61350
onednn: IP Shapes 3D - u8s8f32 - CPU | 1.24491 | 1.24781 | 1.24380
onednn: IP Shapes 3D - f32 - CPU | 3.14816 | 3.13707 | 3.15835
onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU | 9.42174 | 9.42301 | 9.42277
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 4.16656 | 4.16237 | 4.17445
onednn: Convolution Batch Shapes Auto - f32 - CPU | 4.34336 | 4.34082 | 4.35080
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 0.867040 | 0.860663 | 0.862856
onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU | 12.5274 | 12.5418 | 12.5335
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 3.21126 | 3.20825 | 3.21278

BRL-CAD

VGR Performance Metric

VGR Performance Metric, More Is Better. BRL-CAD 7.30.8.
Run 2: 163713
Run 1: 164354
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Timed HMMer Search

Pfam Database Search

Seconds, Fewer Is Better. Timed HMMer Search 3.3.1.
Run 3: 174.45 (SE +/- 0.20, N = 3)
Run 2: 174.17 (SE +/- 0.41, N = 3)
Run 1: 174.23 (SE +/- 0.36, N = 3)
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
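
The SE +/- figures reported with each result give a rough way to judge whether two runs differ by more than run-to-run noise. The following is a minimal Python sketch, assuming the SE values are standard errors of the mean over the N trials and using a two-standard-error rule of thumb. The helper name `differs_beyond_noise` and the threshold `k` are illustrative assumptions, not part of the Phoronix Test Suite, and with N = 3 the check is only indicative.

```python
import math

def differs_beyond_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """Return True if two means differ by more than k combined standard errors.

    Hypothetical helper: k=2.0 is a rule-of-thumb threshold, not a formal test.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) > k * combined_se

# Timed HMMer Search (seconds): run 3 vs run 2, values taken from the result above.
hmmer_run3 = (174.45, 0.20)
hmmer_run2 = (174.17, 0.41)
print(differs_beyond_noise(*hmmer_run3, *hmmer_run2))
```

The same helper can be applied to any pair of runs in this result file by substituting the reported value and SE for each run.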

Build2

Time To Compile

Seconds, Fewer Is Better. Build2 0.13.
Run 3: 95.53 (SE +/- 0.15, N = 3)
Run 2: 95.69 (SE +/- 0.25, N = 3)
Run 1: 95.58 (SE +/- 0.24, N = 3)

Timed Eigen Compilation

Time To Compile

Seconds, Fewer Is Better. Timed Eigen Compilation 3.3.9.
Run 3: 85.78 (SE +/- 0.23, N = 3)
Run 2: 85.55 (SE +/- 0.22, N = 3)
Run 1: 85.53 (SE +/- 0.12, N = 3)

Node.js V8 Web Tooling Benchmark

runs/s, More Is Better. Node.js V8 Web Tooling Benchmark.
Run 3: 10.68 (SE +/- 0.05, N = 3)
Run 2: 10.69 (SE +/- 0.02, N = 3)
Run 1: 10.78 (SE +/- 0.04, N = 3)
1. Nodejs v10.19.0

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 1644.83 (SE +/- 0.87, N = 3; MIN: 1640.5)
Run 2: 1643.67 (SE +/- 1.59, N = 3; MIN: 1637.61)
Run 1: 1645.61 (SE +/- 0.69, N = 3; MIN: 1641.21)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 1645.91 (SE +/- 1.22, N = 3; MIN: 1639.71)
Run 2: 1645.64 (SE +/- 4.86, N = 3; MIN: 1635.99)
Run 1: 1643.59 (SE +/- 2.60, N = 3; MIN: 1637.17)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 1647.12 (SE +/- 4.18, N = 3; MIN: 1640.3)
Run 2: 1645.08 (SE +/- 0.79, N = 3; MIN: 1639.15)
Run 1: 1643.22 (SE +/- 0.66, N = 3; MIN: 1637.46)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 923.34 (SE +/- 0.27, N = 3; MIN: 920.26)
Run 2: 923.37 (SE +/- 0.50, N = 3; MIN: 919.79)
Run 1: 922.87 (SE +/- 0.33, N = 3; MIN: 919.98)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 920.86 (SE +/- 0.78, N = 3; MIN: 916.71)
Run 2: 923.44 (SE +/- 1.61, N = 3; MIN: 918.55)
Run 1: 921.67 (SE +/- 0.31, N = 3; MIN: 918.58)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 922.27 (SE +/- 1.52, N = 3; MIN: 917.55)
Run 2: 923.54 (SE +/- 1.28, N = 3; MIN: 919.76)
Run 1: 922.99 (SE +/- 0.36, N = 3; MIN: 920.22)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

simdjson

Throughput Test: Kostya

GB/s, More Is Better. simdjson 0.7.1.
Run 3: 0.56 (SE +/- 0.00, N = 3)
Run 2: 0.56 (SE +/- 0.00, N = 3)
Run 1: 0.56 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

SQLite Speedtest

Timed Time - Size 1,000

Seconds, Fewer Is Better. SQLite Speedtest 3.30.
Run 3: 65.59 (SE +/- 0.20, N = 3)
Run 2: 65.85 (SE +/- 0.12, N = 3)
Run 1: 65.36 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

simdjson

Throughput Test: LargeRandom

GB/s, More Is Better. simdjson 0.7.1.
Run 3: 0.39 (SE +/- 0.00, N = 3)
Run 2: 0.39 (SE +/- 0.00, N = 3)
Run 1: 0.39 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

GB/s, More Is Better. simdjson 0.7.1.
Run 3: 0.57 (SE +/- 0.00, N = 3)
Run 2: 0.57 (SE +/- 0.00, N = 3)
Run 1: 0.57 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

GB/s, More Is Better. simdjson 0.7.1.
Run 3: 0.58 (SE +/- 0.00, N = 3)
Run 2: 0.58 (SE +/- 0.00, N = 3)
Run 1: 0.58 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU - Model: regnety_400m

ms, Fewer Is Better. NCNN 20201218.
Run 3: 27.41 (SE +/- 0.11, N = 3; MIN: 26.75 / MAX: 31.13)
Run 2: 27.13 (SE +/- 0.11, N = 3; MIN: 26.7 / MAX: 28.12)
Run 1: 27.19 (SE +/- 0.06, N = 3; MIN: 26.8 / MAX: 29.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

ms, Fewer Is Better. NCNN 20201218.
Run 3: 16.94 (SE +/- 0.15, N = 3; MIN: 16.44 / MAX: 19)
Run 2: 16.45 (SE +/- 0.00, N = 3; MIN: 16.33 / MAX: 19.11)
Run 1: 16.73 (SE +/- 0.11, N = 3; MIN: 16.43 / MAX: 17.04)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

ms, Fewer Is Better. NCNN 20201218.
Run 3: 25.22 (SE +/- 0.35, N = 3; MIN: 23.44 / MAX: 28.59)
Run 2: 24.40 (SE +/- 0.38, N = 3; MIN: 23.06 / MAX: 27.46)
Run 1: 24.72 (SE +/- 0.51, N = 3; MIN: 23.36 / MAX: 28.82)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

ms, Fewer Is Better. NCNN 20201218.
Run 3: 20.38 (SE +/- 0.52, N = 3; MIN: 19.05 / MAX: 23.49)
Run 2: 18.93 (SE +/- 0.08, N = 3; MIN: 18.61 / MAX: 38.46)
Run 1: 20.27 (SE +/- 0.51, N = 3; MIN: 19.1 / MAX: 21.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

ms, Fewer Is Better. NCNN 20201218.
Run 3: 6.99 (SE +/- 0.22, N = 3; MIN: 6.66 / MAX: 23.91)
Run 2: 6.73 (SE +/- 0.01, N = 3; MIN: 6.67 / MAX: 8.33)
Run 1: 8.07 (SE +/- 0.01, N = 3; MIN: 8.01 / MAX: 9.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

ms, Fewer Is Better. NCNN 20201218.
Run 3: 9.49 (SE +/- 0.00, N = 3; MIN: 9.34 / MAX: 11.41)
Run 2: 9.41 (SE +/- 0.01, N = 3; MIN: 9.33 / MAX: 10.24)
Run 1: 10.58 (SE +/- 0.42, N = 3; MIN: 9.67 / MAX: 12.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

ms, Fewer Is Better. NCNN 20201218.
Run 3: 29.24 (SE +/- 0.36, N = 3; MIN: 27.78 / MAX: 48.69)
Run 2: 28.04 (SE +/- 0.03, N = 3; MIN: 27.76 / MAX: 46.41)
Run 1: 29.65 (SE +/- 0.50, N = 3; MIN: 28.54 / MAX: 34.02)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

ms, Fewer Is Better. NCNN 20201218.
Run 3: 13.09 (SE +/- 0.01, N = 3; MIN: 12.88 / MAX: 15.13)
Run 2: 12.99 (SE +/- 0.01, N = 3; MIN: 12.91 / MAX: 13.36)
Run 1: 14.56 (SE +/- 0.46, N = 3; MIN: 13.58 / MAX: 15.21)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

ms, Fewer Is Better. NCNN 20201218.
Run 3: 2.90 (SE +/- 0.01, N = 3; MIN: 2.83 / MAX: 4.6)
Run 2: 2.88 (SE +/- 0.01, N = 3; MIN: 2.82 / MAX: 3.69)
Run 1: 2.93 (SE +/- 0.05, N = 3; MIN: 2.85 / MAX: 3.18)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

ms, Fewer Is Better. NCNN 20201218.
Run 3: 7.20 (SE +/- 0.01, N = 3; MIN: 7 / MAX: 9)
Run 2: 7.26 (SE +/- 0.01, N = 3; MIN: 7.01 / MAX: 8.01)
Run 1: 7.70 (SE +/- 0.07, N = 3; MIN: 7.35 / MAX: 8.6)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

ms, Fewer Is Better. NCNN 20201218.
Run 3: 5.34 (SE +/- 0.02, N = 3; MIN: 5.16 / MAX: 9.17)
Run 2: 5.37 (SE +/- 0.02, N = 3; MIN: 5.18 / MAX: 9.36)
Run 1: 5.56 (SE +/- 0.05, N = 3; MIN: 5.21 / MAX: 9.44)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

ms, Fewer Is Better. NCNN 20201218.
Run 3: 6.01 (SE +/- 0.03, N = 3; MIN: 5.83 / MAX: 9.91)
Run 2: 5.97 (SE +/- 0.04, N = 2; MIN: 5.86 / MAX: 9.64)
Run 1: 5.94 (SE +/- 0.01, N = 3; MIN: 5.83 / MAX: 9.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

ms, Fewer Is Better. NCNN 20201218.
Run 3: 5.07 (SE +/- 0.01, N = 3; MIN: 4.93 / MAX: 8.45)
Run 2: 5.14 (SE +/- 0.05, N = 3; MIN: 4.94 / MAX: 8.76)
Run 1: 5.21 (SE +/- 0.06, N = 3; MIN: 4.94 / MAX: 9.16)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

ms, Fewer Is Better. NCNN 20201218.
Run 3: 5.89 (SE +/- 0.02, N = 3; MIN: 5.64 / MAX: 9.39)
Run 2: 5.96 (SE +/- 0.01, N = 3; MIN: 5.64 / MAX: 9.74)
Run 1: 6.07 (SE +/- 0.08, N = 3; MIN: 5.65 / MAX: 23.59)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

ms, Fewer Is Better. NCNN 20201218.
Run 3: 17.73 (SE +/- 0.27, N = 3; MIN: 16.96 / MAX: 19.65)
Run 2: 16.96 (SE +/- 0.02, N = 3; MIN: 16.84 / MAX: 18.68)
Run 1: 17.77 (SE +/- 0.12, N = 3; MIN: 17.45 / MAX: 18.07)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed FFmpeg Compilation

Time To Compile

Seconds, Fewer Is Better. Timed FFmpeg Compilation 4.2.2.
Run 3: 43.45 (SE +/- 0.10, N = 3)
Run 2: 43.24 (SE +/- 0.09, N = 3)
Run 1: 43.50 (SE +/- 0.15, N = 3)

Monkey Audio Encoding

WAV To APE

Seconds, Fewer Is Better. Monkey Audio Encoding 3.99.6.
Run 3: 17.54 (SE +/- 0.00, N = 5)
Run 2: 17.54 (SE +/- 0.02, N = 5)
Run 1: 17.53 (SE +/- 0.01, N = 5)
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

WavPack Audio Encoding

WAV To WavPack

Seconds, Fewer Is Better. WavPack Audio Encoding 5.3.
Run 3: 16.78 (SE +/- 0.01, N = 5)
Run 2: 16.75 (SE +/- 0.01, N = 5)
Run 1: 16.73 (SE +/- 0.01, N = 5)
1. (CXX) g++ options: -rdynamic

Coremark

CoreMark Size 666 - Iterations Per Second

Iterations/Sec, More Is Better. Coremark 1.0.
Run 3: 534883.16 (SE +/- 2681.32, N = 3)
Run 2: 535255.63 (SE +/- 1073.23, N = 3)
Run 1: 537952.81 (SE +/- 746.08, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

CLOMP

Static OMP Speedup

Speedup, More Is Better. CLOMP 1.2.
Run 3: 26.5 (SE +/- 0.12, N = 3)
Run 2: 26.2 (SE +/- 0.42, N = 3)
Run 1: 26.8 (SE +/- 0.32, N = 3)
1. (CC) gcc options: -fopenmp -O3 -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 11.32 (SE +/- 0.01, N = 3; MIN: 11.17)
Run 2: 11.29 (SE +/- 0.01, N = 3; MIN: 11.17)
Run 1: 11.29 (SE +/- 0.00, N = 3; MIN: 11.16)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 0.526437 (SE +/- 0.000172, N = 3; MIN: 0.51)
Run 2: 0.527252 (SE +/- 0.000712, N = 3; MIN: 0.51)
Run 1: 0.526692 (SE +/- 0.000759, N = 3; MIN: 0.51)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 2.75619 (SE +/- 0.00159, N = 3; MIN: 2.68)
Run 2: 2.76210 (SE +/- 0.00638, N = 3; MIN: 2.69)
Run 1: 2.75497 (SE +/- 0.00248, N = 3; MIN: 2.69)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 5.60477 (SE +/- 0.00888, N = 3; MIN: 5.48)
Run 2: 5.60704 (SE +/- 0.00948, N = 3; MIN: 5.44)
Run 1: 5.61346 (SE +/- 0.01299, N = 3; MIN: 5.5)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 2.34990 (SE +/- 0.00268, N = 3; MIN: 2.25)
Run 2: 2.35038 (SE +/- 0.00411, N = 3; MIN: 2.22)
Run 1: 2.35358 (SE +/- 0.00656, N = 3; MIN: 2.26)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 0.495733 (SE +/- 0.000320, N = 3; MIN: 0.47)
Run 2: 0.491459 (SE +/- 0.000604, N = 3; MIN: 0.47)
Run 1: 0.496245 (SE +/- 0.000595, N = 3; MIN: 0.47)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 0.977073 (SE +/- 0.001996, N = 3; MIN: 0.94)
Run 2: 0.973874 (SE +/- 0.001190, N = 3; MIN: 0.94)
Run 1: 0.974624 (SE +/- 0.001208, N = 3; MIN: 0.94)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 2.05675 (SE +/- 0.00357, N = 3; MIN: 1.97)
Run 2: 2.05772 (SE +/- 0.00591, N = 3; MIN: 1.97)
Run 1: 2.05419 (SE +/- 0.00388, N = 3; MIN: 1.97)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 0.479216 (SE +/- 0.004438, N = 3; MIN: 0.46)
Run 2: 0.477951 (SE +/- 0.005102, N = 3; MIN: 0.46)
Run 1: 0.486440 (SE +/- 0.001972, N = 3; MIN: 0.47)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

Seconds, Fewer Is Better. Timed MAFFT Alignment 7.471.
Run 3: 10.55 (SE +/- 0.03, N = 3)
Run 2: 10.59 (SE +/- 0.08, N = 3)
Run 1: 10.64 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 2.61350 (SE +/- 0.00522, N = 3; MIN: 2.56)
Run 2: 2.58806 (SE +/- 0.00862, N = 3; MIN: 2.53)
Run 1: 2.59610 (SE +/- 0.00587, N = 3; MIN: 2.51)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 1.24380 (SE +/- 0.00126, N = 3; MIN: 1.2)
Run 2: 1.24781 (SE +/- 0.00242, N = 3; MIN: 1.2)
Run 1: 1.24491 (SE +/- 0.00268, N = 3; MIN: 1.2)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 3.15835 (SE +/- 0.00910, N = 3; MIN: 3.11)
Run 2: 3.13707 (SE +/- 0.00567, N = 3; MIN: 3.09)
Run 1: 3.14816 (SE +/- 0.01117, N = 3; MIN: 3.1)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 9.42277 (SE +/- 0.00825, N = 3; MIN: 9.06)
Run 2: 9.42301 (SE +/- 0.00522, N = 3; MIN: 9.08)
Run 1: 9.42174 (SE +/- 0.01674, N = 3; MIN: 9.07)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 4.17445 (SE +/- 0.01564, N = 3; MIN: 4.1)
Run 2: 4.16237 (SE +/- 0.01546, N = 3; MIN: 4.1)
Run 1: 4.16656 (SE +/- 0.01549, N = 3; MIN: 4.09)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 4.35080 (SE +/- 0.01473, N = 3; MIN: 4.28)
Run 2: 4.34082 (SE +/- 0.01378, N = 3; MIN: 4.27)
Run 1: 4.34336 (SE +/- 0.01713, N = 3; MIN: 4.27)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 0.862856 (SE +/- 0.008326, N = 3; MIN: 0.83)
Run 2: 0.860663 (SE +/- 0.011319, N = 5; MIN: 0.82)
Run 1: 0.867040 (SE +/- 0.007447, N = 3; MIN: 0.83)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 12.53 (SE +/- 0.01, N = 3; MIN: 12.41)
Run 2: 12.54 (SE +/- 0.01, N = 3; MIN: 12.41)
Run 1: 12.53 (SE +/- 0.01, N = 3; MIN: 12.41)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

ms, Fewer Is Better. oneDNN 2.0.
Run 3: 3.21278 (SE +/- 0.01283, N = 3; MIN: 3.16)
Run 2: 3.20825 (SE +/- 0.00574, N = 3; MIN: 3.16)
Run 1: 3.21126 (SE +/- 0.00363, N = 3; MIN: 3.17)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.5