open-cl-suite-fedora-34 AMD Ryzen Threadripper 3960X 24-Core testing with a ASUS ROG STRIX TRX40-XE GAMING (1502 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111091-AS-OPENCLSUI63&grw .
open-cl-suite-fedora-34 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 09.11.21 AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) ASUS ROG STRIX TRX40-XE GAMING (1502 BIOS) AMD Starship/Matisse 32768MB 3 x 1000GB Samsung SSD 980 PRO 1TB AMD Radeon VII 16GB (1801/1000MHz) AMD Vega 20 HDMI Audio ASUS MG278 + S242HL + GT-191 Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Fedora 34 5.14.16-201.fc34.x86_64 (x86_64) KDE Plasma 5.22.5 X Server 1.20.11 amdgpu 21.0.0 4.6 Mesa 22.0.0-devel (LLVM 12.0.1 DRM 3.42 5.14.16-201.fc34.x86_64) OpenCL 2.2 AMD-APP (3361.0) 1.2.197 Clang 12.0.1 ext4 4480x2160 OpenBenchmarking.org - kvm_amd.sev=1 amdgpu.ppfeaturemask=0xffffffff amdgpu.exp_hw_support=1 amdgpu.gpu_recovery=1 amdgpu.deep_color=1 amdgpu.async_gfx_ring=1 amdgpu.mes=1 amdgpu.debug_largebar=1 amdgpu.tmz=1 - Scaling Governor: acpi-cpufreq ondemand - GLAMOR - Python 3.9.7 - SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
open-cl-suite-fedora-34 darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth parboil: OpenCL BFS parboil: OpenCL LBM parboil: OpenCL TPACF rodinia: OpenCL Myocyte rodinia: OpenCL Heartwall rodinia: OpenCL Leukocyte blender: BMW27 - OpenCL blender: Barbershop - OpenCL cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Kernel Latency clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer luxmark: GPU - Hotel luxmark: GPU - Microphone luxmark: GPU - Luxball HDR smallpt-gpu: GPU - 4480 x 2160 - Caustic smallpt-gpu: GPU - 4480 x 2160 - Cornell smallpt-gpu: GPU - 4480 x 2160 - Caustic3 09.11.21 1.52 2.38 0.20 0.63 12.58 2375.38 16.60 8751698 14.34 14.48 451.23 1.38 6.07 1.19 104.09 2.35 4.76 59.13 286.83 308.90 818.03 698.80 10.97 4474.05 13681.51 3431.62 801.36 16.53 25.35 3411 29494 51880 1636489887 1636490023 1636490160 OpenBenchmarking.org
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.1 Test: Boat - Acceleration: OpenCL 09.11.21 0.342 0.684 1.026 1.368 1.71 SE +/- 0.02, N = 3 1.52
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.1 Test: Masskrug - Acceleration: OpenCL 09.11.21 0.5355 1.071 1.6065 2.142 2.6775 SE +/- 0.03, N = 12 2.38
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.1 Test: Server Rack - Acceleration: OpenCL 09.11.21 0.045 0.09 0.135 0.18 0.225 SE +/- 0.00, N = 3 0.20
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.1 Test: Server Room - Acceleration: OpenCL 09.11.21 0.1418 0.2836 0.4254 0.5672 0.709 SE +/- 0.00, N = 3 0.63
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad 09.11.21 3 6 9 12 15 SE +/- 0.12, N = 15 12.58 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP 09.11.21 500 1000 1500 2000 2500 SE +/- 0.70, N = 3 2375.38 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash 09.11.21 4 8 12 16 20 SE +/- 0.00, N = 11 16.60 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops 09.11.21 2M 4M 6M 8M 10M SE +/- 768261.22, N = 12 8751698 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download 09.11.21 4 8 12 16 20 SE +/- 0.00, N = 3 14.34 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback 09.11.21 4 8 12 16 20 SE +/- 0.00, N = 3 14.48 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth 09.11.21 100 200 300 400 500 SE +/- 0.33, N = 3 451.23 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Parboil Test: OpenCL BFS OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL BFS 09.11.21 0.3105 0.621 0.9315 1.242 1.5525 SE +/- 0.01, N = 3 1.38 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenCL LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL LBM 09.11.21 2 4 6 8 10 SE +/- 0.02, N = 3 6.07 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenCL TPACF OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL TPACF 09.11.21 0.2678 0.5356 0.8034 1.0712 1.339 SE +/- 0.02, N = 3 1.19 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Myocyte 09.11.21 20 40 60 80 100 SE +/- 0.95, N = 3 104.09 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Heartwall OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Heartwall 09.11.21 0.5288 1.0576 1.5864 2.1152 2.644 SE +/- 0.01, N = 3 2.35 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Leukocyte 09.11.21 1.071 2.142 3.213 4.284 5.355 SE +/- 0.03, N = 3 4.76 1. (CXX) g++ options: -O2 -lOpenCL
Blender Blend File: BMW27 - Compute: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: OpenCL 09.11.21 13 26 39 52 65 SE +/- 1.24, N = 15 59.13
Blender Blend File: Barbershop - Compute: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Barbershop - Compute: OpenCL 09.11.21 60 120 180 240 300 SE +/- 4.04, N = 9 286.83
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy 09.11.21 70 140 210 280 350 SE +/- 2.41, N = 3 308.90 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read 09.11.21 200 400 600 800 1000 SE +/- 1.07, N = 3 818.03 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write 09.11.21 150 300 450 600 750 SE +/- 5.00, N = 3 698.80 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency 09.11.21 3 6 9 12 15 SE +/- 0.09, N = 3 10.97 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT 09.11.21 1000 2000 3000 4000 5000 SE +/- 1.49, N = 3 4474.05 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float 09.11.21 3K 6K 9K 12K 15K SE +/- 1.46, N = 3 13681.51 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double 09.11.21 700 1400 2100 2800 3500 SE +/- 0.97, N = 3 3431.62 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth 09.11.21 200 400 600 800 1000 SE +/- 0.10, N = 3 801.36 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer 09.11.21 4 8 12 16 20 SE +/- 0.06, N = 3 16.53 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer 09.11.21 6 12 18 24 30 SE +/- 0.04, N = 3 25.35 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
LuxMark OpenCL Device: GPU - Scene: Hotel OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Hotel 09.11.21 700 1400 2100 2800 3500 SE +/- 2.85, N = 3 3411
LuxMark OpenCL Device: GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone 09.11.21 6K 12K 18K 24K 30K SE +/- 104.67, N = 3 29494
LuxMark OpenCL Device: GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Luxball HDR 09.11.21 11K 22K 33K 44K 55K SE +/- 90.83, N = 3 51880
SmallPT GPU OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic 09.11.21 400M 800M 1200M 1600M 2000M SE +/- 25.12, N = 3 1636489887 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Cornell 09.11.21 400M 800M 1200M 1600M 2000M SE +/- 24.83, N = 3 1636490023 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic3 09.11.21 400M 800M 1200M 1600M 2000M SE +/- 25.40, N = 3 1636490160 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Phoronix Test Suite v10.8.4