AMD EPYC 3255 8-Core Temp testing with a congatec conga-B7E3 (5.13 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2107059-IB-SS324963552 ss3 - Phoronix Test Suite ss3 AMD EPYC 3255 8-Core Temp testing with a congatec conga-B7E3 (5.13 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2107059-IB-SS324963552 .
ss3 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel OpenGL Compiler File-System Screen Resolution sysbench-BIOS12Ph10 graphics-magick-BIOS12Ph10 ipc-benchmarkBIOS12Ph10 ipc-benchmarkBIOS12Ph10-2 amgBIOS12Ph10 ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 npbBIOSPh10 onednnBIOS12Ph10 scimarkBIOS12Ph10 AMD EPYC 3255 8-Core Temp @ 2.50GHz (8 Cores / 16 Threads) congatec conga-B7E3 (5.13 BIOS) AMD 17h 32GB 2000GB Samsung SSD 970 EVO 2TB + 2000GB Portable SSD T5 llvmpipe NVIDIA GP107GL HD Audio Intel I210 + Intel I211 + 2 x AMD Device 1458 + 2 x AMD Device 1459 Ubuntu 20.04 5.4.0-77-generic (x86_64) 4.5 Mesa 20.2.6 (LLVM 11.0.0 256 bits) GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800126c Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
ss3 sysbench: RAM / Memory sysbench: CPU graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space ipc-benchmark: TCP Socket - 128 ipc-benchmark: TCP Socket - 1024 ipc-benchmark: Unnamed Pipe - 128 ipc-benchmark: Unnamed Pipe - 1024 ipc-benchmark: FIFO Named Pipe - 128 ipc-benchmark: FIFO Named Pipe - 1024 ipc-benchmark: Unnamed Unix Domain Socket - 128 ipc-benchmark: Unnamed Unix Domain Socket - 1024 amg: ramspeed: Add - Integer ramspeed: Scale - Integer ramspeed: Average - Integer ramspeed: Add - Floating Point ramspeed: Scale - Floating Point ramspeed: Average - Floating Point npb: EP.C npb: EP.D onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU scimark2: Composite scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation sysbench-BIOS12Ph10 graphics-magick-BIOS12Ph10 ipc-benchmarkBIOS12Ph10 ipc-benchmarkBIOS12Ph10-2 amgBIOS12Ph10 ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 npbBIOSPh10 onednnBIOS12Ph10 scimarkBIOS12Ph10 7141.63 12261.9 308 504 88 122 615 160 743 1891570 1297815 1882887 1307426 1975783 1584521 1935827 1485293 1244161 943475 111657933 12551.97 11076.42 11732.05 12589.03 11033.62 12548.83 11085.72 11690.21 12569.01 11031.65 11762.06 388.82 376.71 15.7907 15.7193 9.55680 4.17270 34.2821 14.1336 21.0568 39.2196 9.98448 13.7120 11458.5 7582.44 11504.0 7568.70 9.52426 11507.7 7638.63 6.65550 393.94 99.85 130.34 451.62 423.88 873.39 OpenBenchmarking.org
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory sysbench-BIOS12Ph10 1500 3000 4500 6000 7500 SE +/- 3.02, N = 3 7141.63 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU sysbench-BIOS12Ph10 3K 6K 9K 12K 15K SE +/- 47.92, N = 3 12261.9 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl graphics-magick-BIOS12Ph10 70 140 210 280 350 SE +/- 1.45, N = 3 308 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate graphics-magick-BIOS12Ph10 110 220 330 440 550 SE +/- 0.88, N = 3 504 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen graphics-magick-BIOS12Ph10 20 40 60 80 100 88 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced graphics-magick-BIOS12Ph10 30 60 90 120 150 122 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing graphics-magick-BIOS12Ph10 130 260 390 520 650 615 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian graphics-magick-BIOS12Ph10 40 80 120 160 200 160 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space graphics-magick-BIOS12Ph10 160 320 480 640 800 SE +/- 6.24, N = 3 743 1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
IPC_benchmark Type: TCP Socket - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: TCP Socket - Message Bytes: 128 ipc-benchmarkBIOS12Ph10 ipc-benchmarkBIOS12Ph10-2 400K 800K 1200K 1600K 2000K SE +/- 9664.73, N = 3 SE +/- 9029.51, N = 3 1891570 1882887
IPC_benchmark Type: TCP Socket - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: TCP Socket - Message Bytes: 1024 ipc-benchmarkBIOS12Ph10 ipc-benchmarkBIOS12Ph10-2 300K 600K 900K 1200K 1500K SE +/- 13127.08, N = 3 SE +/- 3088.73, N = 3 1297815 1307426
IPC_benchmark Type: Unnamed Pipe - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: Unnamed Pipe - Message Bytes: 128 ipc-benchmarkBIOS12Ph10-2 400K 800K 1200K 1600K 2000K SE +/- 14775.40, N = 3 1975783
IPC_benchmark Type: Unnamed Pipe - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: Unnamed Pipe - Message Bytes: 1024 ipc-benchmarkBIOS12Ph10-2 300K 600K 900K 1200K 1500K SE +/- 2363.90, N = 3 1584521
IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 128 ipc-benchmarkBIOS12Ph10-2 400K 800K 1200K 1600K 2000K SE +/- 13209.68, N = 3 1935827
IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 1024 ipc-benchmarkBIOS12Ph10-2 300K 600K 900K 1200K 1500K SE +/- 17308.67, N = 3 1485293
IPC_benchmark Type: Unnamed Unix Domain Socket - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: Unnamed Unix Domain Socket - Message Bytes: 128 ipc-benchmarkBIOS12Ph10-2 300K 600K 900K 1200K 1500K SE +/- 1579.62, N = 3 1244161
IPC_benchmark Type: Unnamed Unix Domain Socket - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: Unnamed Unix Domain Socket - Message Bytes: 1024 ipc-benchmarkBIOS12Ph10-2 200K 400K 600K 800K 1000K SE +/- 4573.65, N = 3 943475
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 amgBIOS12Ph10 20M 40M 60M 80M 100M SE +/- 16862.61, N = 3 111657933 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
RAMspeed SMP Type: Add - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 3K 6K 9K 12K 15K SE +/- 10.48, N = 3 SE +/- 1.73, N = 3 12551.97 12548.83 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Scale - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 2K 4K 6K 8K 10K SE +/- 4.59, N = 3 SE +/- 3.66, N = 3 11076.42 11085.72 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Average - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 3K 6K 9K 12K 15K SE +/- 10.39, N = 3 SE +/- 14.62, N = 3 11732.05 11690.21 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Add - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Floating Point ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 3K 6K 9K 12K 15K SE +/- 5.91, N = 3 SE +/- 11.04, N = 3 12589.03 12569.01 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Scale - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Floating Point ramspeedBIOS12Ph10 ramspeedBIOS12Ph10-2 2K 4K 6K 8K 10K SE +/- 9.63, N = 3 SE +/- 21.83, N = 3 11033.62 11031.65 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Average - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Floating Point ramspeedBIOS12Ph10-2 3K 6K 9K 12K 15K SE +/- 4.68, N = 3 11762.06 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C npbBIOSPh10 80 160 240 320 400 SE +/- 2.05, N = 3 388.82 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D npbBIOSPh10 80 160 240 320 400 SE +/- 2.04, N = 3 376.71 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 4 8 12 16 20 SE +/- 0.07, N = 3 15.79 MIN: 15.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 4 8 12 16 20 SE +/- 0.00, N = 3 15.72 MIN: 15.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 3 6 9 12 15 SE +/- 0.04084, N = 3 9.55680 MIN: 9.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 0.9389 1.8778 2.8167 3.7556 4.6945 SE +/- 0.00235, N = 3 4.17270 MIN: 4.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 8 16 24 32 40 SE +/- 0.03, N = 3 34.28 MIN: 33.71 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 4 8 12 16 20 SE +/- 0.01, N = 3 14.13 MIN: 12.98 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 5 10 15 20 25 SE +/- 0.09, N = 3 21.06 MIN: 19.51 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 9 18 27 36 45 SE +/- 0.04, N = 3 39.22 MIN: 39.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 3 6 9 12 15 SE +/- 0.00489, N = 3 9.98448 MIN: 9.69 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 4 8 12 16 20 SE +/- 0.12, N = 3 13.71 MIN: 13.24 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 2K 4K 6K 8K 10K SE +/- 30.96, N = 3 11458.5 MIN: 11416.6 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 1600 3200 4800 6400 8000 SE +/- 12.55, N = 3 7582.44 MIN: 7545.45 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 2K 4K 6K 8K 10K SE +/- 17.96, N = 3 11504.0 MIN: 11466.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 1600 3200 4800 6400 8000 SE +/- 7.26, N = 3 7568.70 MIN: 7541.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU onednnBIOS12Ph10 3 6 9 12 15 SE +/- 0.00726, N = 3 9.52426 MIN: 9.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU onednnBIOS12Ph10 2K 4K 6K 8K 10K SE +/- 11.38, N = 3 11507.7 MIN: 11479.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU onednnBIOS12Ph10 1600 3200 4800 6400 8000 SE +/- 13.12, N = 3 7638.63 MIN: 7598.05 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU onednnBIOS12Ph10 2 4 6 8 10 SE +/- 0.00869, N = 3 6.65550 MIN: 6.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite scimarkBIOS12Ph10 90 180 270 360 450 SE +/- 4.74, N = 15 393.94 1. (CC) gcc options: -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo scimarkBIOS12Ph10 20 40 60 80 100 SE +/- 0.26, N = 3 99.85 1. (CC) gcc options: -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform scimarkBIOS12Ph10 30 60 90 120 150 SE +/- 2.33, N = 3 130.34 1. (CC) gcc options: -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply scimarkBIOS12Ph10 100 200 300 400 500 SE +/- 0.72, N = 3 451.62 1. (CC) gcc options: -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization scimarkBIOS12Ph10 90 180 270 360 450 SE +/- 88.49, N = 3 423.88 1. (CC) gcc options: -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation scimarkBIOS12Ph10 200 400 600 800 1000 SE +/- 3.06, N = 3 873.39 1. (CC) gcc options: -lm
Phoronix Test Suite v10.8.4