sa AMD EPYC 7262 8-Core testing with a GIGABYTE MZ32-AR0-00 v01000100 (R21 BIOS) and Gigabyte NVIDIA GeForce RTX 3080 Lite Hash Rate 10GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2401224-NE-SA824249507&grw .
sa

System: AMD EPYC 7262 8-Core - Gigabyte NVIDIA GeForce RTX

Processor: AMD EPYC 7262 8-Core @ 3.20GHz (8 Cores / 16 Threads)
Motherboard: GIGABYTE MZ32-AR0-00 v01000100 (R21 BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: 1000GB Samsung SSD 980 PRO 1TB
Graphics: Gigabyte NVIDIA GeForce RTX 3080 Lite Hash Rate 10GB
Audio: NVIDIA GA102 HD Audio
Network: 2 x Intel I350
OS: Ubuntu 22.04
Kernel: 6.5.0-14-generic (x86_64)
Desktop: GNOME Shell 42.9
Display Server: X Server 1.21.1.4
Display Driver: NVIDIA
OpenCL: OpenCL 3.0 CUDA 12.2.148
Vulkan: 1.3.242
Compiler: GCC 11.4.0 + CUDA 11.8
File-System: ext4
Screen Resolution: 800x600

Notes:
- Transparent Huge Pages: madvise
- Compiler configure options: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
- CPU Microcode: 0x830107a
- Security mitigations:
  gather_data_sampling: Not affected
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  mmio_stale_data: Not affected
  retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection
  spec_rstack_overflow: Mitigation of safe RET
  spec_store_bypass: Mitigation of SSB disabled via prctl
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
  srbds: Not affected
  tsx_async_abort: Not affected
sa

Result summary for AMD EPYC 7262 8-Core - Gigabyte NVIDIA GeForce RTX:

viennacl: OpenCL BLAS - sCOPY: 342 GB/s
viennacl: OpenCL BLAS - sAXPY: 460 GB/s
viennacl: OpenCL BLAS - sDOT: 350 GB/s
viennacl: OpenCL BLAS - dCOPY: 543 GB/s
viennacl: OpenCL BLAS - dAXPY: 621 GB/s
viennacl: OpenCL BLAS - dDOT: 579 GB/s
viennacl: OpenCL BLAS - dGEMV-N: 181 GB/s
viennacl: OpenCL BLAS - dGEMV-T: 356 GB/s
viennacl: OpenCL BLAS - dGEMM-NN: 487 GFLOPs/s
viennacl: OpenCL BLAS - dGEMM-NT: 490 GFLOPs/s
viennacl: OpenCL BLAS - dGEMM-TN: 489 GFLOPs/s
viennacl: OpenCL BLAS - dGEMM-TT: 486 GFLOPs/s
ViennaCL 1.7.1 per-test results for AMD EPYC 7262 8-Core - Gigabyte NVIDIA GeForce RTX (more is better):

Test: OpenCL BLAS - sCOPY: 342 GB/s (SE +/- 0.33, N = 3)
Test: OpenCL BLAS - sAXPY: 460 GB/s (SE +/- 0.58, N = 3)
Test: OpenCL BLAS - sDOT: 350 GB/s (SE +/- 0.33, N = 3)
Test: OpenCL BLAS - dCOPY: 543 GB/s (SE +/- 0.33, N = 3)
Test: OpenCL BLAS - dAXPY: 621 GB/s (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dDOT: 579 GB/s (SE +/- 0.33, N = 3)
Test: OpenCL BLAS - dGEMV-N: 181 GB/s (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dGEMV-T: 356 GB/s (SE +/- 0.58, N = 3)
Test: OpenCL BLAS - dGEMM-NN: 487 GFLOPs/s (SE +/- 1.45, N = 3)
Test: OpenCL BLAS - dGEMM-NT: 490 GFLOPs/s (SE +/- 1.20, N = 3)
Test: OpenCL BLAS - dGEMM-TN: 489 GFLOPs/s (SE +/- 1.20, N = 3)
Test: OpenCL BLAS - dGEMM-TT: 486 GFLOPs/s

1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
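The SE and N figures reported with each result can be put in proportion to the result itself. The following is a minimal Python sketch, not part of the Phoronix Test Suite, that computes the relative standard error for each test and the sample standard deviation it would imply. It assumes the reported SE is the standard error of the mean, SE = s / sqrt(N); the function names `relative_se_percent` and `implied_sample_stddev` are hypothetical helpers introduced here. dGEMM-TT is left out because no SE is reported for it.

```python
import math

# (result, standard error, run count) copied from the results above.
# dGEMM-TT is omitted: the export reports no SE for it.
RESULTS = {
    "sCOPY": (342, 0.33, 3),
    "sAXPY": (460, 0.58, 3),
    "sDOT": (350, 0.33, 3),
    "dCOPY": (543, 0.33, 3),
    "dAXPY": (621, 0.00, 3),
    "dDOT": (579, 0.33, 3),
    "dGEMV-N": (181, 0.00, 3),
    "dGEMV-T": (356, 0.58, 3),
    "dGEMM-NN": (487, 1.45, 3),
    "dGEMM-NT": (490, 1.20, 3),
    "dGEMM-TN": (489, 1.20, 3),
}


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the reported result."""
    return 100.0 * se / mean


def implied_sample_stddev(se, n):
    """Sample standard deviation implied by SE, assuming SE = s / sqrt(N)."""
    return se * math.sqrt(n)


if __name__ == "__main__":
    for name, (mean, se, n) in RESULTS.items():
        rel = relative_se_percent(mean, se)
        sd = implied_sample_stddev(se, n)
        print(f"{name:9s} relative SE {rel:.2f}%  implied stddev {sd:.2f}")
```

Under that assumption, every listed test has a standard error well below 1% of its result, so the three runs per test agree closely.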
Phoronix Test Suite v10.8.5