OpenCL MiniTest

Intel Core i9-9900K testing with a ASRock Z390 Pro4 (P4.70 BIOS) and Sapphire AMD Radeon R9 290/390 on ManjaroLinux 21.3.6 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2208035-EIRI-220803618&rdt&gru.

OpenCL MiniTestProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenCLVulkanCompilerFile-SystemScreen ResolutionOpenGLSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/libIntel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads)ASRock Z390 Pro4 (P4.70 BIOS)Intel Cannon Lake PCH32GB1024GB ADATA SX8100NPSapphire AMD Radeon R9 290/390 (1200MHz)Realtek ALC89232S305Intel I219-VManjaroLinux 21.3.65.15.57-2-MANJARO (x86_64)GNOME Shell 42.3.1X ServerOpenCL 2.1 AMD-APP (3188.4)1.3.211GCC 12.1.0 + Clang 14.0.6 + LLVM 14.0.6ext41920x10804.6 Mesa 22.1.4 (LLVM 14.0.6 DRM 3.42)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details- Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0xf0Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of Enhanced IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Vulnerable: eIBRS with unprivileged eBPF + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled

OpenCL MiniTestneatbench: GPUcl-mem: Copycl-mem: Readcl-mem: Writeviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Single-Precision Floatclpeak: Double-Precision Doubleviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTclpeak: Integer Compute INTxsbench-cl: lczero: OpenCLsmallpt-gpu: GPU - Complexsmallpt-gpu: GPU - Cornellsmallpt-gpu: GPU - Caustic3clpeak: Kernel LatencySapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib200249.7251.6192.212115711617520818743.8121269.5916.6440.124961.56632.174573783904511679.4185317597516222.12200254.8251.8196.812616111917921418845.5125276.8016.6140.775250.22664.144794174014521677.7287831037568916595395321659539657165953979121.34OpenBenchmarking.org

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPUSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3200200

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopySapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib60120180240300SE +/- 0.46, N = 3SE +/- 0.70, N = 3249.7254.81. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib60120180240300SE +/- 2.57, N = 3SE +/- 1.79, N = 3251.6251.81. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib4080120160200SE +/- 0.59, N = 3SE +/- 0.42, N = 3192.2196.81. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib306090120150SE +/- 0.67, N = 3SE +/- 0.33, N = 31211261. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib4080120160200SE +/- 0.33, N = 3SE +/- 0.88, N = 31571611. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib306090120150SE +/- 0.58, N = 3SE +/- 0.88, N = 31161191. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib4080120160200SE +/- 0.67, N = 3SE +/- 1.15, N = 31751791. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib50100150200250SE +/- 0.58, N = 3SE +/- 0.58, N = 32082141. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib4080120160200SE +/- 1.86, N = 3SE +/- 0.58, N = 31871881. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib1020304050SE +/- 0.15, N = 3SE +/- 0.06, N = 343.845.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib306090120150SE +/- 0.88, N = 3SE +/- 2.08, N = 31211251. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib60120180240300SE +/- 0.09, N = 3SE +/- 0.01, N = 3269.59276.801. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib48121620SE +/- 0.11, N = 3SE +/- 0.10, N = 316.6416.611. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib918273645SE +/- 0.09, N = 3SE +/- 0.21, N = 340.1240.771. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib11002200330044005500SE +/- 0.10, N = 3SE +/- 0.24, N = 34961.565250.221. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib140280420560700SE +/- 0.08, N = 3SE +/- 0.10, N = 3632.17664.141. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib100200300400500SE +/- 0.00, N = 3SE +/- 0.58, N = 34574791. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib90180270360450SE +/- 1.00, N = 3SE +/- 0.00, N = 33784171. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib90180270360450SE +/- 0.33, N = 3SE +/- 0.67, N = 33904011. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib100200300400500SE +/- 1.20, N = 3SE +/- 0.33, N = 34514521. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib400800120016002000SE +/- 0.02, N = 3SE +/- 0.58, N = 31679.411677.721. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Xsbench OpenCL

OpenBenchmarking.orgLookups/s, More Is BetterXsbench OpenCL 2017-07-06Sapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib20M40M60M80M100MSE +/- 30592.17, N = 3SE +/- 18062.59, N = 385317597878310371. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm -lOpenCL

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLSapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib12002400360048006000SE +/- 70.64, N = 3SE +/- 68.09, N = 3516256891. (CXX) g++ options: -flto -pthread

SmallPT GPU

OpenCL Device: GPU - Scene: Complex

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: ComplexSapphire Radeon R9 290X 4GB w/lib400M800M1200M1600M2000MSE +/- 20.21, N = 316595395321. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Scene: Cornell

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: CornellSapphire Radeon R9 290X 4GB w/lib400M800M1200M1600M2000MSE +/- 23.96, N = 316595396571. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: Caustic3Sapphire Radeon R9 290X 4GB w/lib400M800M1200M1600M2000MSE +/- 24.83, N = 316595397911. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencySapphire Radeon R9 290X 4GBSapphire Radeon R9 290X 4GB w/lib510152025SE +/- 0.09, N = 3SE +/- 0.09, N = 322.1221.341. (CXX) g++ options: -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.4