ss4

Intel Core i9-7980XE testing with a ASRock X299E-ITX/ac (P1.60 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2107061-IB-SS491180652.

ss4ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolutionsysbench-i9Ph10graphics-magick-i9Ph10ipc-benchmark-i9Ph10amg-i9Ph10ramspeed-i9Ph10ramspeed-i9Ph10-2npb-i9Ph10onednn-i9Ph10scimark-i9Ph10cachebench-19Ph10apache-i9Ph10ctx-clock-i9Ph10hackbench-i9Ph10stress-ng-19Ph10perf-bench-i9Ph10openssl-i9Ph10mbw-i9Ph10schbench-i9Ph10Intel Core i9-7980XE @ 4.40GHz (18 Cores / 36 Threads)ASRock X299E-ITX/ac (P1.60 BIOS)Intel Sky Lake-E DMI3 Registers32GB512GB Western Digital CL SN520 SDAPNUW-512G-1022llvmpipeRealtek ALC1220Intel I219-V + Intel I211 + 2 x Intel 10-Gigabit X540-AT2 + Intel 8265 / 8275Ubuntu 20.045.8.0-55-generic (x86_64)GNOME Shell 3.36.7X Server 1.20.9NVIDIA4.5 Mesa 20.2.6 (LLVM 11.0.0 256 bits)GCC 9.3.0ext43840x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x2006b06Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

ss4sysbench: RAM / Memorysysbench: CPUgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spaceipc-benchmark: TCP Socket - 128ipc-benchmark: TCP Socket - 1024ipc-benchmark: Unnamed Pipe - 128ipc-benchmark: Unnamed Pipe - 1024ipc-benchmark: FIFO Named Pipe - 128ipc-benchmark: FIFO Named Pipe - 1024ipc-benchmark: Unnamed Unix Domain Socket - 128ipc-benchmark: Unnamed Unix Domain Socket - 1024amg: ramspeed: Add - Integerramspeed: Scale - Integerramspeed: Average - Integerramspeed: Add - Floating Pointramspeed: Scale - Floating Pointramspeed: Average - Floating Pointnpb: EP.Cnpb: EP.Donednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationcachebench: Readcachebench: Writecachebench: Read / Modify / Writeapache: Static Web Page Servingctx-clock: Context Switch Timehackbench: 16 - Threadhackbench: 16 - Processstress-ng: MMAPstress-ng: NUMAstress-ng: MEMFDstress-ng: Atomicstress-ng: Cryptostress-ng: Mallocstress-ng: RdRandstress-ng: Forkingstress-ng: SENDFILEstress-ng: CPU Cachestress-ng: CPU Stressstress-ng: Semaphoresstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Memory Copyingstress-ng: Socket Activitystress-ng: Context Switchingstress-ng: Glibc C String Functionsstress-ng: Glibc Qsort Data Sortingstress-ng: System V Message Passingperf-bench: Epoll Waitperf-bench: Futex Hashperf-bench: Memcpy 1MBperf-bench: Memset 1MBperf-bench: Sched Pipeperf-bench: Futex Lock-Piperf-bench: Syscall Basicopenssl: RSA 4096-bit Performancembw: Memory Copy - 1024 MiBmbw: Memory Copy, Fixed Block Size - 1024 MiBschbench: 8 - 16sysbench-i9Ph10graphics-magick-i9Ph10ipc-benchmark-i9Ph10amg-i9Ph10ramspeed-i9Ph10ramspeed-i9Ph10-2npb-i9Ph10onednn-i9Ph10scimark-i9Ph10cachebench-19Ph10apache-i9Ph10ctx-clock-i9Ph10hackbench-i9Ph10stress-ng-19Ph10perf-bench-i9Ph10openssl-i9Ph10mbw-i9Ph10schbench-i9Ph1012623.7412520.7024475877125581155711161872012314281736829152904618001081547106110114298201342166376729880.8428949.5129409.0829791.0728949.6028998.3529206.7328492.5928905.771506.402072.0429.894360.18063.1982229.512415.335722.07399.3850410.26986.224218.692303.327603.756184016.912528.574016.0722.183231.225526.74282530.552.542614015.842529.351.858195.43817667.44145.14306.37759.82938.771187.123549.69723129727.11584648323.24402923584.1579687.01380.303284.60159.591021.16203239.381271.7679499483.77184685.5648739.6983392.8239.243365.152677344.8232081.3545165.785958.334775.322176721.58769203.7394.712904903.0312689134433619.55094063.51883212931120932250201780.49180.9398732.134133888OpenBenchmarking.org

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memorysysbench-i9Ph103K6K9K12K15KSE +/- 43.24, N = 312623.741. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUsysbench-i9Ph103K6K9K12K15KSE +/- 3.16, N = 312520.701. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirlgraphics-magick-i9Ph10501001502002502441. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotategraphics-magick-i9Ph10160320480640800SE +/- 8.25, N = 57581. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpengraphics-magick-i9Ph1020406080100SE +/- 0.33, N = 3771. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhancedgraphics-magick-i9Ph103060901201501251. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizinggraphics-magick-i9Ph10130260390520650SE +/- 0.58, N = 35811. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussiangraphics-magick-i9Ph103060901201501551. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spacegraphics-magick-i9Ph10150300450600750SE +/- 1.20, N = 37111. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

IPC_benchmark

Type: TCP Socket - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: TCP Socket - Message Bytes: 128ipc-benchmark-i9Ph10300K600K900K1200K1500KSE +/- 1126.58, N = 31618720

IPC_benchmark

Type: TCP Socket - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: TCP Socket - Message Bytes: 1024ipc-benchmark-i9Ph10300K600K900K1200K1500KSE +/- 7215.07, N = 31231428

IPC_benchmark

Type: Unnamed Pipe - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Pipe - Message Bytes: 128ipc-benchmark-i9Ph10400K800K1200K1600K2000KSE +/- 29706.06, N = 151736829

IPC_benchmark

Type: Unnamed Pipe - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Pipe - Message Bytes: 1024ipc-benchmark-i9Ph10300K600K900K1200K1500KSE +/- 21356.75, N = 31529046

IPC_benchmark

Type: FIFO Named Pipe - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: FIFO Named Pipe - Message Bytes: 128ipc-benchmark-i9Ph10400K800K1200K1600K2000KSE +/- 25575.15, N = 31800108

IPC_benchmark

Type: FIFO Named Pipe - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: FIFO Named Pipe - Message Bytes: 1024ipc-benchmark-i9Ph10300K600K900K1200K1500KSE +/- 5801.12, N = 31547106

IPC_benchmark

Type: Unnamed Unix Domain Socket - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Unix Domain Socket - Message Bytes: 128ipc-benchmark-i9Ph10200K400K600K800K1000KSE +/- 1288.51, N = 31101142

IPC_benchmark

Type: Unnamed Unix Domain Socket - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Unix Domain Socket - Message Bytes: 1024ipc-benchmark-i9Ph10200K400K600K800K1000KSE +/- 660.09, N = 3982013

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2amg-i9Ph1090M180M270M360M450MSE +/- 226558.75, N = 34216637671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integerramspeed-i9Ph10ramspeed-i9Ph10-26K12K18K24K30KSE +/- 10.07, N = 3SE +/- 33.36, N = 329880.8429791.071. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integerramspeed-i9Ph10ramspeed-i9Ph10-26K12K18K24K30KSE +/- 19.97, N = 3SE +/- 11.65, N = 328949.5128949.601. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integerramspeed-i9Ph10ramspeed-i9Ph10-26K12K18K24K30KSE +/- 3.06, N = 3SE +/- 14.58, N = 329409.0828998.351. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating Pointramspeed-i9Ph10-26K12K18K24K30KSE +/- 156.20, N = 329206.731. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating Pointramspeed-i9Ph10-26K12K18K24K30KSE +/- 42.13, N = 328492.591. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating Pointramspeed-i9Ph10-26K12K18K24K30KSE +/- 20.30, N = 328905.771. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cnpb-i9Ph1030060090012001500SE +/- 14.95, N = 151506.401. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dnpb-i9Ph10400800120016002000SE +/- 11.23, N = 32072.041. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUonednn-i9Ph10714212835SE +/- 0.31, N = 329.89MIN: 17.051. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUonednn-i9Ph101326395265SE +/- 0.57, N = 360.18MIN: 27.051. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph100.71961.43922.15882.87843.598SE +/- 0.00253, N = 33.19822MIN: 2.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph10714212835SE +/- 0.24, N = 1529.51MIN: 4.141. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph1048121620SE +/- 0.01, N = 315.34MIN: 14.731. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph10510152025SE +/- 0.20, N = 1522.07MIN: 6.981. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUonednn-i9Ph103691215SE +/- 0.00136, N = 39.38504MIN: 9.241. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUonednn-i9Ph103691215SE +/- 0.04, N = 310.27MIN: 7.431. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUonednn-i9Ph10246810SE +/- 0.00670, N = 36.22421MIN: 6.11. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph10246810SE +/- 0.00069, N = 38.69230MIN: 8.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph100.74871.49742.24612.99483.7435SE +/- 0.00103, N = 33.32760MIN: 2.811. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph100.84511.69022.53533.38044.2255SE +/- 0.02427, N = 33.75618MIN: 3.641. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUonednn-i9Ph109001800270036004500SE +/- 0.94, N = 34016.91MIN: 3981.581. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUonednn-i9Ph105001000150020002500SE +/- 1.65, N = 32528.57MIN: 2493.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph109001800270036004500SE +/- 1.46, N = 34016.07MIN: 3984.541. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph10510152025SE +/- 0.01, N = 322.18MIN: 21.951. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph10714212835SE +/- 0.01, N = 331.23MIN: 30.351. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph10612182430SE +/- 0.03, N = 326.74MIN: 26.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph105001000150020002500SE +/- 0.81, N = 32530.55MIN: 2497.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUonednn-i9Ph100.57211.14421.71632.28842.8605SE +/- 0.00332, N = 32.54261MIN: 2.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph109001800270036004500SE +/- 1.16, N = 34015.84MIN: 3980.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph105001000150020002500SE +/- 1.04, N = 32529.35MIN: 2494.551. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUonednn-i9Ph100.41810.83621.25431.67242.0905SE +/- 0.00144, N = 31.85819MIN: 1.491. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUonednn-i9Ph101.22362.44723.67084.89446.118SE +/- 0.00321, N = 35.43817MIN: 4.91. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositescimark-i9Ph10140280420560700SE +/- 0.38, N = 3667.441. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carloscimark-i9Ph10306090120150SE +/- 0.02, N = 3145.141. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transformscimark-i9Ph1070140210280350SE +/- 0.93, N = 3306.371. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiplyscimark-i9Ph10160320480640800SE +/- 0.38, N = 3759.821. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorizationscimark-i9Ph102004006008001000SE +/- 0.62, N = 3938.771. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxationscimark-i9Ph1030060090012001500SE +/- 0.14, N = 31187.121. (CC) gcc options: -lm

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Readcachebench-19Ph108001600240032004000SE +/- 0.10, N = 33549.70MIN: 3539.96 / MAX: 3554.411. (CC) gcc options: -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Writecachebench-19Ph106K12K18K24K30KSE +/- 4.33, N = 329727.12MIN: 26749.71 / MAX: 31785.281. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / Writecachebench-19Ph1010K20K30K40K50KSE +/- 5.12, N = 348323.24MIN: 43906.38 / MAX: 50885.741. (CC) gcc options: -lrt

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page Servingapache-i9Ph105K10K15K20K25KSE +/- 18.01, N = 323584.151. (CC) gcc options: -shared -fPIC -O2 -pthread

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch Timectx-clock-i9Ph102004006008001000796

Hackbench

Count: 16 - Type: Thread

OpenBenchmarking.orgSeconds, Fewer Is BetterHackbenchCount: 16 - Type: Threadhackbench-i9Ph1020406080100SE +/- 1.31, N = 1587.011. (CC) gcc options: -lpthread

Hackbench

Count: 16 - Type: Process

OpenBenchmarking.orgSeconds, Fewer Is BetterHackbenchCount: 16 - Type: Processhackbench-i9Ph1020406080100SE +/- 1.34, N = 1580.301. (CC) gcc options: -lpthread

Stress-NG

Test: MMAP

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MMAPstress-ng-19Ph1060120180240300SE +/- 0.12, N = 3284.601. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: NUMA

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: NUMAstress-ng-19Ph104080120160200SE +/- 0.46, N = 3159.591. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: MEMFD

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MEMFDstress-ng-19Ph102004006008001000SE +/- 1.49, N = 31021.161. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Atomic

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Atomicstress-ng-19Ph1040K80K120K160K200KSE +/- 800.47, N = 3203239.381. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Cryptostress-ng-19Ph1030060090012001500SE +/- 0.79, N = 31271.761. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Malloc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Mallocstress-ng-19Ph1020M40M60M80M100MSE +/- 240190.46, N = 379499483.771. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: RdRand

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: RdRandstress-ng-19Ph1040K80K120K160K200KSE +/- 92.19, N = 3184685.561. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Forking

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Forkingstress-ng-19Ph1010K20K30K40K50KSE +/- 147.65, N = 348739.691. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: SENDFILE

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: SENDFILEstress-ng-19Ph1020K40K60K80K100KSE +/- 14.13, N = 383392.821. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU Cachestress-ng-19Ph10918273645SE +/- 0.50, N = 1539.241. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU Stressstress-ng-19Ph107001400210028003500SE +/- 10.45, N = 33365.151. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Semaphores

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Semaphoresstress-ng-19Ph10600K1200K1800K2400K3000KSE +/- 582.79, N = 32677344.821. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Matrix Mathstress-ng-19Ph107K14K21K28K35KSE +/- 27.61, N = 332081.351. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Vector Mathstress-ng-19Ph1010K20K30K40K50KSE +/- 6.52, N = 345165.781. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Memory Copyingstress-ng-19Ph1013002600390052006500SE +/- 1.53, N = 35958.331. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Socket Activity

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Socket Activitystress-ng-19Ph1010002000300040005000SE +/- 146.28, N = 154775.321. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Context Switching

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Context Switchingstress-ng-19Ph10500K1000K1500K2000K2500KSE +/- 21578.82, N = 62176721.581. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc C String Functions

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc C String Functionsstress-ng-19Ph10160K320K480K640K800KSE +/- 189.69, N = 3769203.731. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc Qsort Data Sorting

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc Qsort Data Sortingstress-ng-19Ph1020406080100SE +/- 0.19, N = 394.711. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: System V Message Passing

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: System V Message Passingstress-ng-19Ph10600K1200K1800K2400K3000KSE +/- 23026.16, N = 152904903.031. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

perf-bench

Benchmark: Epoll Wait

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll Waitperf-bench-i9Ph103K6K9K12K15KSE +/- 82.90, N = 3126891. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Futex Hash

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Hashperf-bench-i9Ph10300K600K900K1200K1500KSE +/- 603.79, N = 313443361. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Memcpy 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBperf-bench-i9Ph10510152025SE +/- 0.26, N = 319.551. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Memset 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBperf-bench-i9Ph101428425670SE +/- 0.59, N = 663.521. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched Pipeperf-bench-i9Ph1030K60K90K120K150KSE +/- 856.00, N = 31293111. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Futex Lock-Pi

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-Piperf-bench-i9Ph1050100150200250SE +/- 0.67, N = 32091. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Syscall Basic

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall Basicperf-bench-i9Ph10700K1400K2100K2800K3500KSE +/- 1302.59, N = 332250201. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performanceopenssl-i9Ph10400800120016002000SE +/- 3.15, N = 31780.41. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

MBW

Test: Memory Copy - Array Size: 1024 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 1024 MiBmbw-i9Ph102K4K6K8K10KSE +/- 96.24, N = 59180.941. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiBmbw-i9Ph102K4K6K8K10KSE +/- 7.28, N = 38732.131. (CC) gcc options: -O3 -march=native

Schbench

Message Threads: 8 - Workers Per Message Thread: 16

OpenBenchmarking.orgusec, 99.9th Latency Percentile, Fewer Is BetterSchbenchMessage Threads: 8 - Workers Per Message Thread: 16schbench-i9Ph1030K60K90K120K150KSE +/- 1182.41, N = 31338881. (CC) gcc options: -O2 -lpthread


Phoronix Test Suite v10.8.4