OpenCL MiniTest Intel Core i9-9900K testing with a ASRock Z390 Pro4 (P4.70 BIOS) and Sapphire AMD Radeon R9 290/390 on ManjaroLinux 21.3.6 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2208035-EIRI-220803618&grw .
OpenCL MiniTest Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenCL Vulkan Compiler File-System Screen Resolution OpenGL Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads) ASRock Z390 Pro4 (P4.70 BIOS) Intel Cannon Lake PCH 32GB 1024GB ADATA SX8100NP Sapphire AMD Radeon R9 290/390 (1200MHz) Realtek ALC892 32S305 Intel I219-V ManjaroLinux 21.3.6 5.15.57-2-MANJARO (x86_64) GNOME Shell 42.3.1 X Server OpenCL 2.1 AMD-APP (3188.4) 1.3.211 GCC 12.1.0 + Clang 14.0.6 + LLVM 14.0.6 ext4 1920x1080 4.6 Mesa 22.1.4 (LLVM 14.0.6 DRM 3.42) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details - Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0xf0 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of Enhanced IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Vulnerable: eIBRS with unprivileged eBPF + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled
OpenCL MiniTest lczero: OpenCL neatbench: GPU cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Kernel Latency clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT smallpt-gpu: GPU - Complex smallpt-gpu: GPU - Cornell smallpt-gpu: GPU - Caustic3 xsbench-cl: Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 5162 200 249.7 251.6 192.2 22.12 1679.41 4961.56 632.17 269.59 16.64 40.12 121 157 116 175 208 187 43.8 121 457 378 390 451 85317597 5689 200 254.8 251.8 196.8 21.34 1677.72 5250.22 664.14 276.80 16.61 40.77 126 161 119 179 214 188 45.5 125 479 417 401 452 1659539532 1659539657 1659539791 87831037 OpenBenchmarking.org
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: OpenCL Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 1200 2400 3600 4800 6000 SE +/- 70.64, N = 3 SE +/- 68.09, N = 3 5162 5689 1. (CXX) g++ options: -flto -pthread
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 200 200
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 60 120 180 240 300 SE +/- 0.46, N = 3 SE +/- 0.70, N = 3 249.7 254.8 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 60 120 180 240 300 SE +/- 2.57, N = 3 SE +/- 1.79, N = 3 251.6 251.8 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 40 80 120 160 200 SE +/- 0.59, N = 3 SE +/- 0.42, N = 3 192.2 196.8 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 22.12 21.34 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 400 800 1200 1600 2000 SE +/- 0.02, N = 3 SE +/- 0.58, N = 3 1679.41 1677.72 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 1100 2200 3300 4400 5500 SE +/- 0.10, N = 3 SE +/- 0.24, N = 3 4961.56 5250.22 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 140 280 420 560 700 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 632.17 664.14 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 60 120 180 240 300 SE +/- 0.09, N = 3 SE +/- 0.01, N = 3 269.59 276.80 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 16.64 16.61 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.21, N = 3 40.12 40.77 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 30 60 90 120 150 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 121 126 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 157 161 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 30 60 90 120 150 SE +/- 0.58, N = 3 SE +/- 0.88, N = 3 116 119 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 1.15, N = 3 175 179 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 50 100 150 200 250 SE +/- 0.58, N = 3 SE +/- 0.58, N = 3 208 214 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 40 80 120 160 200 SE +/- 1.86, N = 3 SE +/- 0.58, N = 3 187 188 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 10 20 30 40 50 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 43.8 45.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 30 60 90 120 150 SE +/- 0.88, N = 3 SE +/- 2.08, N = 3 121 125 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 100 200 300 400 500 SE +/- 0.00, N = 3 SE +/- 0.58, N = 3 457 479 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 90 180 270 360 450 SE +/- 1.00, N = 3 SE +/- 0.00, N = 3 378 417 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 390 401 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 100 200 300 400 500 SE +/- 1.20, N = 3 SE +/- 0.33, N = 3 451 452 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
SmallPT GPU OpenCL Device: GPU - Scene: Complex OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Complex Sapphire Radeon R9 290X 4GB w/lib 400M 800M 1200M 1600M 2000M SE +/- 20.21, N = 3 1659539532 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Cornell Sapphire Radeon R9 290X 4GB w/lib 400M 800M 1200M 1600M 2000M SE +/- 23.96, N = 3 1659539657 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Caustic3 Sapphire Radeon R9 290X 4GB w/lib 400M 800M 1200M 1600M 2000M SE +/- 24.83, N = 3 1659539791 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Xsbench OpenCL OpenBenchmarking.org Lookups/s, More Is Better Xsbench OpenCL 2017-07-06 Sapphire Radeon R9 290X 4GB Sapphire Radeon R9 290X 4GB w/lib 20M 40M 60M 80M 100M SE +/- 30592.17, N = 3 SE +/- 18062.59, N = 3 85317597 87831037 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm -lOpenCL
Phoronix Test Suite v10.8.4