2022-08-09_viennacl
AMD Ryzen 9 5900X 12-Core testing with an ASUS Pro WS X570-ACE (4201 BIOS) and MSI NVIDIA GeForce RTX 3080 10GB on Fedora Linux 35 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2208098-NE-20220809V28
2022-08-09_viennacl
Processor: AMD Ryzen 9 5900X 12-Core @ 5.16GHz (12 Cores / 24 Threads)
Motherboard: ASUS Pro WS X570-ACE (4201 BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: 3 x 1000GB Samsung SSD 980 PRO 1TB + 1000GB Samsung SSD 970 EVO 1TB + 32GB OCZ VERTEX + 256GB OCZ VERTEX4
Graphics: MSI NVIDIA GeForce RTX 3080 10GB
Audio: NVIDIA GA102 HD Audio
Monitor: 2 x BenQ PD2720U
Network: Mellanox MT27520 + 7 x Mellanox MT27500/MT27520 + Intel I211
OS: Fedora Linux 35
Kernel: 5.18.16-100.fc35.x86_64 (x86_64)
Desktop: GNOME Shell 41.8.1
Display Server: X Server 1.20.11
Display Driver: NVIDIA 515.57
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 11.7.99
Compiler: GCC 11.3.1 20220421 + Clang 13.0.0 + LLVM 13.0.0
File-System: ext4
Screen Resolution: 7680x2160

- Transparent Huge Pages: madvise
- DEBUGINFOD_URLS=https://debuginfod.fedoraproject.org/
- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
- Scaling Governor: amd-pstate schedutil (Boost: Enabled)
- CPU Microcode: 0xa201016
- GPU Compute Cores: 8704
- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Test                              Unit        2022-08-09_viennacl
viennacl: OpenCL BLAS - sCOPY     GB/s        339
viennacl: OpenCL BLAS - sAXPY     GB/s        453
viennacl: OpenCL BLAS - sDOT      GB/s        353
viennacl: OpenCL BLAS - dCOPY     GB/s        526
viennacl: OpenCL BLAS - dAXPY     GB/s        598
viennacl: OpenCL BLAS - dDOT      GB/s        574
viennacl: OpenCL BLAS - dGEMV-N   GB/s        228
viennacl: OpenCL BLAS - dGEMV-T   GB/s        356
viennacl: OpenCL BLAS - dGEMM-NN  GFLOPs/s    464
viennacl: OpenCL BLAS - dGEMM-NT  GFLOPs/s    463
viennacl: OpenCL BLAS - dGEMM-TN  GFLOPs/s    457
viennacl: OpenCL BLAS - dGEMM-TT  GFLOPs/s    465
ViennaCL 1.7.1, Test: OpenCL BLAS - sCOPY (GB/s, More Is Better)
2022-08-09_viennacl: 339 (SE +/- 1.76, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - sAXPY (GB/s, More Is Better)
2022-08-09_viennacl: 453 (SE +/- 0.67, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - sDOT (GB/s, More Is Better)
2022-08-09_viennacl: 353 (SE +/- 0.88, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dCOPY (GB/s, More Is Better)
2022-08-09_viennacl: 526 (SE +/- 3.71, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dAXPY (GB/s, More Is Better)
2022-08-09_viennacl: 598 (SE +/- 3.84, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dDOT (GB/s, More Is Better)
2022-08-09_viennacl: 574 (SE +/- 0.67, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMV-N (GB/s, More Is Better)
2022-08-09_viennacl: 228 (SE +/- 2.40, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMV-T (GB/s, More Is Better)
2022-08-09_viennacl: 356 (SE +/- 1.67, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
2022-08-09_viennacl: 464 (SE +/- 3.21, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
2022-08-09_viennacl: 463 (SE +/- 1.20, N = 3)

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
2022-08-09_viennacl: 457 (SE +/- 1.50, N = 2)

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
2022-08-09_viennacl: 465 (SE +/- 2.00, N = 2)

All tests above: (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4