ss2

AMD EPYC 3255 8-Core Temp testing with a congatec conga-B7E3 (5.13 BIOS) and MSI NVIDIA GeForce GTX 1050 2GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2107011-IB-SS205239152.

ss2 system details

Runs: sysbench2004Ph10, graphics-magick2004Ph10, ipc_benchmark2004Ph10, amg2004Ph10, ramspeed2004Ph10, npb2004Ph10, scimark2004Ph10, cachebench2004Ph10, onednn2004Ph10, onednn2-2004Ph10, apache2004Ph10, ctx_clock2004Ph10, hackbench2004Ph10, mbw2004Ph10, openssl2004Ph10, perf-bench2004Ph10, stress-ng2004Ph10

Processor: AMD EPYC 3255 8-Core Temp @ 2.50GHz (8 Cores / 16 Threads)
Motherboard: congatec conga-B7E3 (5.13 BIOS)
Chipset: AMD 17h
Memory: 32GB
Disk: 2000GB Samsung SSD 970 EVO 2TB + 2000GB Portable SSD T5
Graphics: MSI NVIDIA GeForce GTX 1050 2GB
Audio: NVIDIA GP107GL HD Audio
Network: Intel I211 + Intel I210 + 2 x AMD Device 1458 + 2 x AMD Device 1459
OS: Ubuntu 20.04
Kernel: 5.4.0-77-generic (x86_64)
Display Driver: nouveau
OpenGL: 4.5 Mesa 20.2.6 (LLVM 11.0.0 256 bits)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
- CPU Microcode: 0x8001250

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

ss2 results overview (grouped by run identifier)

sysbench2004Ph10
  sysbench: RAM / Memory: 6742.41
  sysbench: CPU: 11968.45

graphics-magick2004Ph10
  graphics-magick: Swirl: 303
  graphics-magick: Rotate: 467
  graphics-magick: Sharpen: 86
  graphics-magick: Enhanced: 119
  graphics-magick: Resizing: 588
  graphics-magick: Noise-Gaussian: 154
  graphics-magick: HWB Color Space: 679

ipc_benchmark2004Ph10
  ipc-benchmark: TCP Socket - 128: 1834833
  ipc-benchmark: TCP Socket - 1024: 1240240
  ipc-benchmark: Unnamed Pipe - 128: 1905581
  ipc-benchmark: Unnamed Pipe - 1024: 1535206
  ipc-benchmark: FIFO Named Pipe - 128: 1871487
  ipc-benchmark: FIFO Named Pipe - 1024: 1434448
  ipc-benchmark: Unnamed Unix Domain Socket - 128: 1203839
  ipc-benchmark: Unnamed Unix Domain Socket - 1024: 887809

amg2004Ph10
  amg: 100256967

ramspeed2004Ph10
  ramspeed: Add - Integer: 11233.26
  ramspeed: Scale - Integer: 9839.16
  ramspeed: Average - Integer: 10455.04
  ramspeed: Add - Floating Point: 11223.54
  ramspeed: Scale - Floating Point: 9816.48
  ramspeed: Average - Floating Point: 10508.78

npb2004Ph10
  npb: EP.C: 381.58
  npb: EP.D: 379.35
  npb: FT.C: 5785.55

scimark2004Ph10
  scimark2: Composite: 388.38
  scimark2: Monte Carlo: 97.06
  scimark2: Fast Fourier Transform: 127.68
  scimark2: Sparse Matrix Multiply: 435.10
  scimark2: Dense LU Matrix Factorization: 399.27
  scimark2: Jacobi Successive Over-Relaxation: 846.84

cachebench2004Ph10
  cachebench: Read: 2080.920431
  cachebench: Write: 16912.989006
  cachebench: Read / Modify / Write: 29344.249300

onednn2004Ph10
  onednn: IP Shapes 1D - f32 - CPU: 17.8303
  onednn: IP Shapes 3D - f32 - CPU: 18.4546
  onednn: IP Shapes 1D - u8s8f32 - CPU: 9.87869
  onednn: IP Shapes 3D - u8s8f32 - CPU: 4.86169
  onednn: Convolution Batch Shapes Auto - f32 - CPU: 38.2856
  onednn: Deconvolution Batch shapes_1d - f32 - CPU: 14.5030
  onednn: Deconvolution Batch shapes_3d - f32 - CPU: 21.5017
  onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU: 44.1133
  onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU: 10.2345
  onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU: 13.7952
  onednn: Recurrent Neural Network Training - f32 - CPU: 12202.2
  onednn: Recurrent Neural Network Inference - f32 - CPU: 8534.43
  onednn: Recurrent Neural Network Training - u8s8f32 - CPU: 12244.9
  onednn: Recurrent Neural Network Inference - u8s8f32 - CPU: 8530.48
  onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU: 10.7451
  onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU: 12245.1

onednn2-2004Ph10
  onednn: IP Shapes 1D - f32 - CPU: 17.8205
  onednn: IP Shapes 3D - f32 - CPU: 18.4170
  onednn: IP Shapes 1D - u8s8f32 - CPU: 9.78478
  onednn: IP Shapes 3D - u8s8f32 - CPU: 4.85667
  onednn: Convolution Batch Shapes Auto - f32 - CPU: 38.3004
  onednn: Deconvolution Batch shapes_1d - f32 - CPU: 14.5557
  onednn: Deconvolution Batch shapes_3d - f32 - CPU: 21.5469
  onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU: 44.0765
  onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU: 10.2389
  onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU: 13.8031
  onednn: Recurrent Neural Network Training - f32 - CPU: 12229.2
  onednn: Recurrent Neural Network Inference - f32 - CPU: 8528.44
  onednn: Recurrent Neural Network Training - u8s8f32 - CPU: 12238.1
  onednn: Recurrent Neural Network Inference - u8s8f32 - CPU: 8527.37
  onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU: 10.7323
  onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU: 12229.3
  onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU: 8532.56
  onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU: 6.91035

apache2004Ph10
  apache: Static Web Page Serving: 20070.57

ctx_clock2004Ph10
  ctx-clock: Context Switch Time: 182

hackbench2004Ph10
  hackbench: 16 - Thread: 80.393
  hackbench: 16 - Process: 74.576

mbw2004Ph10
  mbw: Memory Copy - 1024 MiB: 8116.186
  mbw: Memory Copy, Fixed Block Size - 1024 MiB: 4799.613

openssl2004Ph10
  openssl: RSA 4096-bit Performance: 1170.9

perf-bench2004Ph10
  perf-bench: Epoll Wait: 59159
  perf-bench: Futex Hash: 3596834
  perf-bench: Memcpy 1MB: 13.090956
  perf-bench: Memset 1MB: 29.136207
  perf-bench: Sched Pipe: 35252
  perf-bench: Futex Lock-Pi: 808
  perf-bench: Syscall Basic: 11202227

stress-ng2004Ph10
  stress-ng: MMAP: 91.28
  stress-ng: NUMA: 103.65
  stress-ng: MEMFD: 312.28
  stress-ng: Atomic: 228226.92
  stress-ng: Crypto: 1338.10
  stress-ng: Malloc: 28599372.98
  stress-ng: Forking: 40159.70
  stress-ng: SENDFILE: 94315.69
  stress-ng: CPU Cache: 14.65
  stress-ng: CPU Stress: 1882.75
  stress-ng: Semaphores: 1358793.01
  stress-ng: Matrix Math: 28206.47
  stress-ng: Vector Math: 48824.90
  stress-ng: Memory Copying: 604.26
  stress-ng: Socket Activity: 3878.66
  stress-ng: Context Switching: 2343297.63
  stress-ng: Glibc C String Functions: 646433.61
  stress-ng: Glibc Qsort Data Sorting: 93.21
  stress-ng: System V Message Passing: 10328168.09

Sysbench

Test: RAM / Memory

MiB/sec, More Is Better (Sysbench 1.0.20)
sysbench2004Ph10: 6742.41 (SE +/- 6.11, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

Events Per Second, More Is Better (Sysbench 1.0.20)
sysbench2004Ph10: 11968.45 (SE +/- 43.11, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
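Each result in this report carries a standard error (SE) and a run count (N). To judge how noisy a result is, it can help to express the SE as a percentage of the mean. The sketch below does that for the two Sysbench results above; the helper name `relative_se` is hypothetical and not part of the Phoronix Test Suite.

```python
def relative_se(mean: float, se: float) -> float:
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


# (mean, SE) pairs copied from the two Sysbench results above.
sysbench_results = {
    "RAM / Memory (MiB/sec)": (6742.41, 6.11),
    "CPU (Events Per Second)": (11968.45, 43.11),
}

for name, (mean, se) in sysbench_results.items():
    print(f"{name}: {relative_se(mean, se):.2f}% relative SE")
```

Both values come out well under 1%, which suggests the three runs of each test agreed closely.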

GraphicsMagick

Operation: Swirl

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 303 (SE +/- 1.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 467 (SE +/- 0.67, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 86
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 119 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 588 (SE +/- 0.88, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 154
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

Iterations Per Minute, More Is Better (GraphicsMagick 1.3.33)
graphics-magick2004Ph10: 679 (SE +/- 1.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

IPC_benchmark

Type: TCP Socket - Message Bytes: 128

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1834833 (SE +/- 6651.88, N = 3)

IPC_benchmark

Type: TCP Socket - Message Bytes: 1024

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1240240 (SE +/- 1928.65, N = 3)

IPC_benchmark

Type: Unnamed Pipe - Message Bytes: 128

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1905581 (SE +/- 13255.52, N = 3)

IPC_benchmark

Type: Unnamed Pipe - Message Bytes: 1024

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1535206 (SE +/- 15158.00, N = 3)

IPC_benchmark

Type: FIFO Named Pipe - Message Bytes: 128

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1871487 (SE +/- 22048.60, N = 3)

IPC_benchmark

Type: FIFO Named Pipe - Message Bytes: 1024

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1434448 (SE +/- 10432.31, N = 3)

IPC_benchmark

Type: Unnamed Unix Domain Socket - Message Bytes: 128

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 1203839 (SE +/- 13076.65, N = 3)

IPC_benchmark

Type: Unnamed Unix Domain Socket - Message Bytes: 1024

Messages Per Second, More Is Better (IPC_benchmark)
ipc_benchmark2004Ph10: 887809 (SE +/- 5635.32, N = 3)

Algebraic Multi-Grid Benchmark

Figure Of Merit, More Is Better (Algebraic Multi-Grid Benchmark 1.2)
amg2004Ph10: 100256967 (SE +/- 28448.34, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

RAMspeed SMP

Type: Add - Benchmark: Integer

MB/s, More Is Better (RAMspeed SMP 3.5.0)
ramspeed2004Ph10: 11233.26 (SE +/- 10.55, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

MB/s, More Is Better (RAMspeed SMP 3.5.0)
ramspeed2004Ph10: 9839.16 (SE +/- 8.14, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

MB/s, More Is Better (RAMspeed SMP 3.5.0)
ramspeed2004Ph10: 10455.04 (SE +/- 14.90, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

MB/s, More Is Better (RAMspeed SMP 3.5.0)
ramspeed2004Ph10: 11223.54 (SE +/- 8.00, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

MB/s, More Is Better (RAMspeed SMP 3.5.0)
ramspeed2004Ph10: 9816.48 (SE +/- 11.87, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

MB/s, More Is Better (RAMspeed SMP 3.5.0)
ramspeed2004Ph10: 10508.78 (SE +/- 6.82, N = 3)
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
npb2004Ph10: 381.58 (SE +/- 0.85, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
npb2004Ph10: 379.35 (SE +/- 4.09, N = 4)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
npb2004Ph10: 5785.55 (SE +/- 11.02, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

SciMark

Computational Test: Composite

Mflops, More Is Better (SciMark 2.0)
scimark2004Ph10: 388.38 (SE +/- 4.40, N = 15)
1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

Mflops, More Is Better (SciMark 2.0)
scimark2004Ph10: 97.06 (SE +/- 0.17, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

Mflops, More Is Better (SciMark 2.0)
scimark2004Ph10: 127.68 (SE +/- 0.41, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

Mflops, More Is Better (SciMark 2.0)
scimark2004Ph10: 435.10 (SE +/- 0.30, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

Mflops, More Is Better (SciMark 2.0)
scimark2004Ph10: 399.27 (SE +/- 45.04, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

Mflops, More Is Better (SciMark 2.0)
scimark2004Ph10: 846.84 (SE +/- 0.16, N = 3)
1. (CC) gcc options: -lm
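SciMark 2.0 defines its composite score as the arithmetic mean of the five kernel scores. As a rough cross-check, the sketch below averages the five kernel results reported above. It is not expected to reproduce the reported composite of 388.38 exactly: the composite line was averaged over N = 15 runs while each kernel line shows N = 3, and the Dense LU result has a large SE of 45.04.

```python
# Kernel results (Mflops) copied from the SciMark sections above.
kernels = {
    "Monte Carlo": 97.06,
    "Fast Fourier Transform": 127.68,
    "Sparse Matrix Multiply": 435.10,
    "Dense LU Matrix Factorization": 399.27,
    "Jacobi Successive Over-Relaxation": 846.84,
}

# Composite as reported in this result file.
reported_composite = 388.38

# Arithmetic mean of the five kernel scores.
composite_estimate = sum(kernels.values()) / len(kernels)

gap_pct = 100.0 * (reported_composite - composite_estimate) / reported_composite
print(f"mean of kernels: {composite_estimate:.2f} Mflops")
print(f"reported composite: {reported_composite:.2f} Mflops ({gap_pct:.1f}% higher)")
```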

CacheBench

Test: Read

MB/s, More Is Better (CacheBench)
cachebench2004Ph10: 2080.92 (SE +/- 3.24, N = 3; MIN: 2073.92 / MAX: 2084.44)
1. (CC) gcc options: -lrt

CacheBench

Test: Write

MB/s, More Is Better (CacheBench)
cachebench2004Ph10: 16912.99 (SE +/- 178.90, N = 5; MIN: 12147.97 / MAX: 19456.5)
1. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

MB/s, More Is Better (CacheBench)
cachebench2004Ph10: 29344.25 (SE +/- 109.87, N = 3; MIN: 25528.91 / MAX: 32707.41)
1. (CC) gcc options: -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 17.83 (SE +/- 0.08, N = 3; MIN: 16.86)
onednn2-2004Ph10: 17.82 (SE +/- 0.05, N = 3; MIN: 16.86)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 18.45 (SE +/- 0.01, N = 3; MIN: 18.14)
onednn2-2004Ph10: 18.42 (SE +/- 0.01, N = 3; MIN: 18.09)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 9.87869 (SE +/- 0.03098, N = 3; MIN: 9.45)
onednn2-2004Ph10: 9.78478 (SE +/- 0.01313, N = 3; MIN: 9.28)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 4.86169 (SE +/- 0.00363, N = 3; MIN: 4.59)
onednn2-2004Ph10: 4.85667 (SE +/- 0.00513, N = 3; MIN: 4.6)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 38.29 (SE +/- 0.01, N = 3; MIN: 37.55)
onednn2-2004Ph10: 38.30 (SE +/- 0.00, N = 3; MIN: 37.58)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 14.50 (SE +/- 0.01, N = 3; MIN: 13.16)
onednn2-2004Ph10: 14.56 (SE +/- 0.03, N = 3; MIN: 13.09)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 21.50 (SE +/- 0.14, N = 3; MIN: 19.89)
onednn2-2004Ph10: 21.55 (SE +/- 0.06, N = 3; MIN: 19.95)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 44.11 (SE +/- 0.02, N = 3; MIN: 43.63)
onednn2-2004Ph10: 44.08 (SE +/- 0.01, N = 3; MIN: 43.61)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 10.23 (SE +/- 0.00, N = 3; MIN: 9.59)
onednn2-2004Ph10: 10.24 (SE +/- 0.01, N = 3; MIN: 9.62)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 13.80 (SE +/- 0.01, N = 3; MIN: 13.39)
onednn2-2004Ph10: 13.80 (SE +/- 0.01, N = 3; MIN: 13.39)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 12202.2 (SE +/- 17.24, N = 3; MIN: 12171)
onednn2-2004Ph10: 12229.2 (SE +/- 34.23, N = 3; MIN: 12157)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 8534.43 (SE +/- 10.03, N = 3; MIN: 8497.26)
onednn2-2004Ph10: 8528.44 (SE +/- 15.42, N = 3; MIN: 8487.98)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 12244.9 (SE +/- 16.36, N = 3; MIN: 12194.5)
onednn2-2004Ph10: 12238.1 (SE +/- 3.61, N = 3; MIN: 12215.6)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 8530.48 (SE +/- 8.98, N = 3; MIN: 8496.86)
onednn2-2004Ph10: 8527.37 (SE +/- 6.76, N = 3; MIN: 8492.67)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 10.75 (SE +/- 0.00, N = 3; MIN: 10.37)
onednn2-2004Ph10: 10.73 (SE +/- 0.01, N = 3; MIN: 10.36)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2004Ph10: 12245.1 (SE +/- 5.40, N = 3; MIN: 12224.4)
onednn2-2004Ph10: 12229.3 (SE +/- 19.75, N = 3; MIN: 12179)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2-2004Ph10: 8532.56 (SE +/- 4.17, N = 3; MIN: 8501.91)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

ms, Fewer Is Better (oneDNN 2.1.2)
onednn2-2004Ph10: 6.91035 (SE +/- 0.00437, N = 3; MIN: 6.42)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
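The oneDNN tests are the only ones in this file with two runs (onednn2004Ph10 and onednn2-2004Ph10), so they are the only place where run-to-run variation can be checked. The sketch below takes two of the harnesses above and reports the percent change between runs, plus the gap measured in units of the combined standard error. The function name `compare_runs` is hypothetical, and the combined-SE ratio is only a rough heuristic that assumes the runs are independent; with N = 3 per run it is not a formal significance test.

```python
import math


def compare_runs(mean_a: float, se_a: float, mean_b: float, se_b: float):
    """Return (percent change from run A to run B, |difference| / combined SE)."""
    diff = mean_b - mean_a
    # Standard errors of independent runs combine in quadrature.
    combined_se = math.hypot(se_a, se_b)
    return 100.0 * diff / mean_a, abs(diff) / combined_se


# (mean A, SE A, mean B, SE B) in ms, copied from the oneDNN results above.
# Run A is onednn2004Ph10, run B is onednn2-2004Ph10.
pairs = {
    "IP Shapes 1D - u8s8f32": (9.87869, 0.03098, 9.78478, 0.01313),
    "Recurrent Neural Network Training - f32": (12202.2, 17.24, 12229.2, 34.23),
}

for name, values in pairs.items():
    pct, ratio = compare_runs(*values)
    print(f"{name}: {pct:+.2f}% change, {ratio:.1f}x combined SE")
```

Under these assumptions, the IP Shapes 1D u8s8f32 gap is a few times larger than the combined SE, while the RNN Training f32 gap stays within it. Either way the two runs differ by less than 1%.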

Apache Benchmark

Static Web Page Serving

Requests Per Second, More Is Better (Apache Benchmark 2.4.29)
apache2004Ph10: 20070.57 (SE +/- 34.80, N = 3)
1. (CC) gcc options: -shared -fPIC -O2 -pthread

ctx_clock

Context Switch Time

Clocks, Fewer Is Better (ctx_clock)
ctx_clock2004Ph10: 182 (SE +/- 2.95, N = 15)

Hackbench

Count: 16 - Type: Thread

Seconds, Fewer Is Better (Hackbench)
hackbench2004Ph10: 80.39 (SE +/- 1.07, N = 3)
1. (CC) gcc options: -lpthread

Hackbench

Count: 16 - Type: Process

Seconds, Fewer Is Better (Hackbench)
hackbench2004Ph10: 74.58 (SE +/- 0.70, N = 3)
1. (CC) gcc options: -lpthread

MBW

Test: Memory Copy - Array Size: 1024 MiB

MiB/s, More Is Better (MBW 2018-09-08)
mbw2004Ph10: 8116.19 (SE +/- 6.56, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB

MiB/s, More Is Better (MBW 2018-09-08)
mbw2004Ph10: 4799.61 (SE +/- 2.54, N = 3)
1. (CC) gcc options: -O3 -march=native

OpenSSL

RSA 4096-bit Performance

Signs Per Second, More Is Better (OpenSSL 1.1.1)
openssl2004Ph10: 1170.9 (SE +/- 2.40, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

perf-bench

Benchmark: Epoll Wait

ops/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 59159 (SE +/- 160.94, N = 3)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Futex Hash

ops/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 3596834 (SE +/- 10145.28, N = 3)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Memcpy 1MB

GB/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 13.09 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Memset 1MB

GB/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 29.14 (SE +/- 0.11, N = 3)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Sched Pipe

ops/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 35252 (SE +/- 374.38, N = 5)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Futex Lock-Pi

ops/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 808 (SE +/- 1.73, N = 3)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Syscall Basic

ops/sec, More Is Better (perf-bench)
perf-bench2004Ph10: 11202227 (SE +/- 107215.32, N = 3)
1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

Stress-NG

Test: MMAP

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 91.28 (SE +/- 2.24, N = 12)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: NUMA

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 103.65 (SE +/- 0.97, N = 15)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: MEMFD

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 312.28 (SE +/- 0.78, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Atomic

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 228226.92 (SE +/- 206.55, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Crypto

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 1338.10 (SE +/- 3.44, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Malloc

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 28599372.98 (SE +/- 26135.85, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Forking

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 40159.70 (SE +/- 221.43, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: SENDFILE

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 94315.69 (SE +/- 91.24, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Cache

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 14.65 (SE +/- 0.13, N = 15)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Stress

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 1882.75 (SE +/- 2.97, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Semaphores

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 1358793.01 (SE +/- 831.38, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Matrix Math

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 28206.47 (SE +/- 29.01, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Vector Math

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 48824.90 (SE +/- 41.81, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Memory Copying

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 604.26 (SE +/- 0.38, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Socket Activity

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 3878.66 (SE +/- 11.92, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Context Switching

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 2343297.63 (SE +/- 28452.37, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc C String Functions

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 646433.61 (SE +/- 4652.30, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc Qsort Data Sorting

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 93.21 (SE +/- 0.23, N = 3)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: System V Message Passing

Bogo Ops/s, More Is Better (Stress-NG 0.11.07)
stress-ng2004Ph10: 10328168.09 (SE +/- 158615.33, N = 12)
1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc


Phoronix Test Suite v10.8.4