ss6

AMD EPYC 3255 8-Core Temp testing with a congatec conga-B7E3 (5.13 BIOS) and NVIDIA Quadro P1000 4GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2108043-IB-SS635889082.

ss6ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolutionsysbench2004Ph10graphics-magick2004Ph10ipc-benchmark2004Ph10amg2004Ph10ramspeed2004PH1-npb2004Ph10scimark2004Ph10cachebench2004Ph10onednn2004Ph10apache2004Ph10ctx-clock2004Ph10hackbench1004Ph10mbw2004Ph10openssl2004Ph10perf-bench2004Ph10stress-ng2004Ph10schbench2004Ph10t-test1-2004Ph10AMD EPYC 3255 8-Core Temp @ 2.50GHz (8 Cores / 16 Threads)congatec conga-B7E3 (5.13 BIOS)AMD 17h32GB1920GB ATP NVMe M.2 2280 SED SSD + 2000GB Portable SSD T5NVIDIA Quadro P1000 4GBNVIDIA GP107GL HD AudioHP Z24n G2Intel I210 + Intel I211 + 2 x AMD Device 1458 + 2 x AMD Device 1459Ubuntu 20.045.4.0-65-generic (x86_64)nouveau4.5 Mesa 21.0.3 (LLVM 12.0.0 256 bits)1.0.2GCC 9.3.0ext41920x1200OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800126cSecurity Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

ss6sysbench: RAM / Memorysysbench: CPUgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spaceipc-benchmark: TCP Socket - 128ipc-benchmark: TCP Socket - 1024ipc-benchmark: Unnamed Pipe - 128ipc-benchmark: Unnamed Pipe - 1024ipc-benchmark: FIFO Named Pipe - 128ipc-benchmark: FIFO Named Pipe - 1024ipc-benchmark: Unnamed Unix Domain Socket - 128ipc-benchmark: Unnamed Unix Domain Socket - 1024amg: ramspeed: Add - Integerramspeed: Scale - Integerramspeed: Average - Integerramspeed: Add - Floating Pointramspeed: Scale - Floating Pointramspeed: Average - Floating Pointnpb: EP.Cnpb: EP.Dscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationcachebench: Readcachebench: Writecachebench: Read / Modify / Writeonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUapache: Static Web Page Servingctx-clock: Context Switch Timehackbench: 16 - Threadhackbench: 16 - Processmbw: Memory Copy - 1024 MiBmbw: Memory Copy, Fixed Block Size - 1024 MiBopenssl: RSA 4096-bit Performanceperf-bench: Epoll Waitperf-bench: Futex Hashperf-bench: Memcpy 1MBperf-bench: Memset 1MBperf-bench: Sched Pipeperf-bench: Futex Lock-Piperf-bench: Syscall Basicstress-ng: MMAPstress-ng: NUMAstress-ng: MEMFDstress-ng: Atomicstress-ng: Cryptostress-ng: Mallocstress-ng: Forkingstress-ng: SENDFILEstress-ng: CPU Cachestress-ng: CPU Stressstress-ng: Semaphoresstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Memory Copyingstress-ng: Socket Activitystress-ng: Context Switchingstress-ng: Glibc C String Functionsstress-ng: Glibc Qsort Data Sortingstress-ng: System V Message Passingschbench: 8 - 16t-test1: 1t-test1: 2sysbench2004Ph10graphics-magick2004Ph10ipc-benchmark2004Ph10amg2004Ph10ramspeed2004PH1-npb2004Ph10scimark2004Ph10cachebench2004Ph10onednn2004Ph10apache2004Ph10ctx-clock2004Ph10hackbench1004Ph10mbw2004Ph10openssl2004Ph10perf-bench2004Ph10stress-ng2004Ph10schbench2004Ph10t-test1-2004Ph107131.3712224.3130851088122622158715188723913054001966470158977818832951476384124955292213817864986720720.6017114.7118296.3921039.8516118.9718556.23384.06382.85417.27100.33135.04451.72539.42859.832145.00517917416.29521130368.70522211.157214.01039.730303.5859423.780014.333321.082027.215210.070213.883410707.55916.1310785.15899.046.3481110780.45929.136.5918420782.2117576.57869.54711376.9905994.7511191.561141377510713.47579230.002033337658411300716440.45129.56459.93234591.401367.2134500114.1945987.9796321.9718.352312.351396806.2028675.9949739.061037.124594.772715180.95644776.3194.758365788.2510474751.67214.527OpenBenchmarking.org

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memorysysbench2004Ph1015003000450060007500SE +/- 45.49, N = 37131.371. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUsysbench2004Ph103K6K9K12K15KSE +/- 42.84, N = 312224.311. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirlgraphics-magick2004Ph1070140210280350SE +/- 0.88, N = 33081. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotategraphics-magick2004Ph10110220330440550SE +/- 0.88, N = 35101. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpengraphics-magick2004Ph1020406080100881. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhancedgraphics-magick2004Ph103060901201501221. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizinggraphics-magick2004Ph10130260390520650SE +/- 0.33, N = 36221. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussiangraphics-magick2004Ph10306090120150SE +/- 0.33, N = 31581. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spacegraphics-magick2004Ph10150300450600750SE +/- 0.88, N = 37151. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread

IPC_benchmark

Type: TCP Socket - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: TCP Socket - Message Bytes: 128ipc-benchmark2004Ph10400K800K1200K1600K2000KSE +/- 865.47, N = 31887239

IPC_benchmark

Type: TCP Socket - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: TCP Socket - Message Bytes: 1024ipc-benchmark2004Ph10300K600K900K1200K1500KSE +/- 1589.09, N = 31305400

IPC_benchmark

Type: Unnamed Pipe - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Pipe - Message Bytes: 128ipc-benchmark2004Ph10400K800K1200K1600K2000KSE +/- 10510.23, N = 31966470

IPC_benchmark

Type: Unnamed Pipe - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Pipe - Message Bytes: 1024ipc-benchmark2004Ph10300K600K900K1200K1500KSE +/- 7171.13, N = 31589778

IPC_benchmark

Type: FIFO Named Pipe - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: FIFO Named Pipe - Message Bytes: 128ipc-benchmark2004Ph10400K800K1200K1600K2000KSE +/- 20185.32, N = 41883295

IPC_benchmark

Type: FIFO Named Pipe - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: FIFO Named Pipe - Message Bytes: 1024ipc-benchmark2004Ph10300K600K900K1200K1500KSE +/- 2419.86, N = 31476384

IPC_benchmark

Type: Unnamed Unix Domain Socket - Message Bytes: 128

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Unix Domain Socket - Message Bytes: 128ipc-benchmark2004Ph10300K600K900K1200K1500KSE +/- 10605.40, N = 31249552

IPC_benchmark

Type: Unnamed Unix Domain Socket - Message Bytes: 1024

OpenBenchmarking.orgMessages Per Second, More Is BetterIPC_benchmarkType: Unnamed Unix Domain Socket - Message Bytes: 1024ipc-benchmark2004Ph10200K400K600K800K1000KSE +/- 10653.07, N = 3922138

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2amg2004Ph1040M80M120M160M200MSE +/- 69024.21, N = 31786498671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integerramspeed2004PH1-4K8K12K16K20KSE +/- 9.28, N = 320720.601. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integerramspeed2004PH1-4K8K12K16K20KSE +/- 132.01, N = 317114.711. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integerramspeed2004PH1-4K8K12K16K20KSE +/- 47.75, N = 318296.391. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating Pointramspeed2004PH1-5K10K15K20K25KSE +/- 28.29, N = 321039.851. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating Pointramspeed2004PH1-3K6K9K12K15KSE +/- 49.73, N = 316118.971. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating Pointramspeed2004PH1-4K8K12K16K20KSE +/- 26.66, N = 318556.231. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cnpb2004Ph1080160240320400SE +/- 1.59, N = 3384.061. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dnpb2004Ph1080160240320400SE +/- 4.54, N = 3382.851. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositescimark2004Ph1090180270360450SE +/- 2.85, N = 3417.271. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carloscimark2004Ph1020406080100SE +/- 0.14, N = 3100.331. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transformscimark2004Ph10306090120150SE +/- 0.89, N = 3135.041. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiplyscimark2004Ph10100200300400500SE +/- 1.12, N = 3451.721. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorizationscimark2004Ph10120240360480600SE +/- 28.52, N = 3539.421. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxationscimark2004Ph102004006008001000SE +/- 15.70, N = 3859.831. (CC) gcc options: -lm

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Readcachebench2004Ph105001000150020002500SE +/- 0.01, N = 32145.01MIN: 2143.44 / MAX: 2145.171. (CC) gcc options: -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Writecachebench2004Ph104K8K12K16K20KSE +/- 28.41, N = 317416.30MIN: 14119.7 / MAX: 18996.351. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / Writecachebench2004Ph107K14K21K28K35KSE +/- 201.26, N = 330368.71MIN: 26009.51 / MAX: 36088.451. (CC) gcc options: -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUonednn2004Ph103691215SE +/- 0.00, N = 311.16MIN: 10.851. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUonednn2004Ph1048121620SE +/- 0.01, N = 314.01MIN: 13.731. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUonednn2004Ph103691215SE +/- 0.00567, N = 39.73030MIN: 8.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUonednn2004Ph100.80681.61362.42043.22724.034SE +/- 0.00038, N = 33.58594MIN: 3.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUonednn2004Ph10612182430SE +/- 0.02, N = 323.78MIN: 23.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUonednn2004Ph1048121620SE +/- 0.05, N = 314.33MIN: 13.051. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUonednn2004Ph10510152025SE +/- 0.29, N = 321.08MIN: 19.621. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUonednn2004Ph10612182430SE +/- 0.08, N = 327.22MIN: 25.931. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUonednn2004Ph103691215SE +/- 0.01, N = 310.07MIN: 9.731. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUonednn2004Ph1048121620SE +/- 0.09, N = 1413.88MIN: 13.291. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUonednn2004Ph102K4K6K8K10KSE +/- 33.23, N = 310707.5MIN: 10641.11. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUonednn2004Ph1013002600390052006500SE +/- 7.60, N = 35916.13MIN: 5896.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUonednn2004Ph102K4K6K8K10KSE +/- 12.33, N = 310785.1MIN: 107631. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUonednn2004Ph1013002600390052006500SE +/- 10.29, N = 35899.04MIN: 5875.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUonednn2004Ph10246810SE +/- 0.00111, N = 36.34811MIN: 6.241. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUonednn2004Ph102K4K6K8K10KSE +/- 22.39, N = 310780.4MIN: 10727.91. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUonednn2004Ph1013002600390052006500SE +/- 8.66, N = 35929.13MIN: 5906.391. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUonednn2004Ph10246810SE +/- 0.00334, N = 36.59184MIN: 6.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page Servingapache2004Ph104K8K12K16K20KSE +/- 28.82, N = 320782.211. (CC) gcc options: -shared -fPIC -O2 -pthread

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch Timectx-clock2004Ph104080120160200175

Hackbench

Count: 16 - Type: Thread

OpenBenchmarking.orgSeconds, Fewer Is BetterHackbenchCount: 16 - Type: Threadhackbench1004Ph1020406080100SE +/- 0.57, N = 376.581. (CC) gcc options: -lpthread

Hackbench

Count: 16 - Type: Process

OpenBenchmarking.orgSeconds, Fewer Is BetterHackbenchCount: 16 - Type: Processhackbench1004Ph101530456075SE +/- 1.01, N = 1569.551. (CC) gcc options: -lpthread

MBW

Test: Memory Copy - Array Size: 1024 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 1024 MiBmbw2004Ph102K4K6K8K10KSE +/- 41.51, N = 311376.991. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiBmbw2004Ph1013002600390052006500SE +/- 10.45, N = 35994.751. (CC) gcc options: -O3 -march=native

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performanceopenssl2004Ph1030060090012001500SE +/- 2.38, N = 31191.51. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

perf-bench

Benchmark: Epoll Wait

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll Waitperf-bench2004Ph1013K26K39K52K65KSE +/- 109.01, N = 3611411. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Futex Hash

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Hashperf-bench2004Ph10800K1600K2400K3200K4000KSE +/- 1489.17, N = 337751071. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Memcpy 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBperf-bench2004Ph103691215SE +/- 0.05, N = 313.481. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Memset 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBperf-bench2004Ph10714212835SE +/- 0.31, N = 330.001. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched Pipeperf-bench2004Ph107K14K21K28K35KSE +/- 285.95, N = 12337651. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Futex Lock-Pi

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-Piperf-bench2004Ph102004006008001000SE +/- 2.65, N = 38411. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

perf-bench

Benchmark: Syscall Basic

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall Basicperf-bench2004Ph103M6M9M12M15MSE +/- 117757.00, N = 7130071641. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lcrypto -lz -lnuma

Stress-NG

Test: MMAP

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MMAPstress-ng2004Ph10918273645SE +/- 1.00, N = 1540.451. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: NUMA

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: NUMAstress-ng2004Ph10306090120150SE +/- 0.92, N = 3129.561. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: MEMFD

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MEMFDstress-ng2004Ph10100200300400500SE +/- 2.39, N = 3459.931. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Atomic

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Atomicstress-ng2004Ph1050K100K150K200K250KSE +/- 100.00, N = 3234591.401. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Cryptostress-ng2004Ph1030060090012001500SE +/- 1.13, N = 31367.211. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Malloc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Mallocstress-ng2004Ph107M14M21M28M35MSE +/- 35888.73, N = 334500114.191. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Forking

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Forkingstress-ng2004Ph1010K20K30K40K50KSE +/- 256.35, N = 345987.971. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: SENDFILE

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: SENDFILEstress-ng2004Ph1020K40K60K80K100KSE +/- 49.20, N = 396321.971. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU Cachestress-ng2004Ph10510152025SE +/- 0.19, N = 318.351. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU Stressstress-ng2004Ph105001000150020002500SE +/- 0.45, N = 32312.351. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Semaphores

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Semaphoresstress-ng2004Ph10300K600K900K1200K1500KSE +/- 647.20, N = 31396806.201. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Matrix Mathstress-ng2004Ph106K12K18K24K30KSE +/- 52.11, N = 328675.991. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Vector Mathstress-ng2004Ph1011K22K33K44K55KSE +/- 32.31, N = 349739.061. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Memory Copyingstress-ng2004Ph102004006008001000SE +/- 0.64, N = 31037.121. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Socket Activity

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Socket Activitystress-ng2004Ph1010002000300040005000SE +/- 19.14, N = 34594.771. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Context Switching

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Context Switchingstress-ng2004Ph10600K1200K1800K2400K3000KSE +/- 156049.06, N = 152715180.951. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc C String Functions

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc C String Functionsstress-ng2004Ph10140K280K420K560K700KSE +/- 6688.33, N = 3644776.311. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc Qsort Data Sorting

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc Qsort Data Sortingstress-ng2004Ph1020406080100SE +/- 0.48, N = 394.751. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: System V Message Passing

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: System V Message Passingstress-ng2004Ph102M4M6M8M10MSE +/- 92479.51, N = 38365788.251. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Schbench

Message Threads: 8 - Workers Per Message Thread: 16

OpenBenchmarking.orgusec, 99.9th Latency Percentile, Fewer Is BetterSchbenchMessage Threads: 8 - Workers Per Message Thread: 16schbench2004Ph1020K40K60K80K100KSE +/- 682.67, N = 31047471. (CC) gcc options: -O2 -lpthread

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1t-test1-2004Ph101224364860SE +/- 0.22, N = 351.671. (CC) gcc options: -pthread

t-test1

Threads: 2

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 2t-test1-2004Ph1048121620SE +/- 0.09, N = 314.531. (CC) gcc options: -pthread


Phoronix Test Suite v10.8.0