open-cl-suite-fedora-34

AMD Ryzen Threadripper 3960X 24-Core testing with a ASUS ROG STRIX TRX40-XE GAMING (1502 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2111091-AS-OPENCLSUI63&grw.

open-cl-suite-fedora-34ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution09.11.21AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)ASUS ROG STRIX TRX40-XE GAMING (1502 BIOS)AMD Starship/Matisse32768MB3 x 1000GB Samsung SSD 980 PRO 1TBAMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioASUS MG278 + S242HL + GT-191Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Fedora 345.14.16-201.fc34.x86_64 (x86_64)KDE Plasma 5.22.5X Server 1.20.11amdgpu 21.0.04.6 Mesa 22.0.0-devel (LLVM 12.0.1 DRM 3.42 5.14.16-201.fc34.x86_64)OpenCL 2.2 AMD-APP (3361.0)1.2.197Clang 12.0.1ext44480x2160OpenBenchmarking.org- kvm_amd.sev=1 amdgpu.ppfeaturemask=0xffffffff amdgpu.exp_hw_support=1 amdgpu.gpu_recovery=1 amdgpu.deep_color=1 amdgpu.async_gfx_ring=1 amdgpu.mes=1 amdgpu.debug_largebar=1 amdgpu.tmz=1 - Scaling Governor: acpi-cpufreq ondemand- GLAMOR- Python 3.9.7- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

open-cl-suite-fedora-34darktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthparboil: OpenCL BFSparboil: OpenCL LBMparboil: OpenCL TPACFrodinia: OpenCL Myocyterodinia: OpenCL Heartwallrodinia: OpenCL Leukocyteblender: BMW27 - OpenCLblender: Barbershop - OpenCLcl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRsmallpt-gpu: GPU - 4480 x 2160 - Causticsmallpt-gpu: GPU - 4480 x 2160 - Cornellsmallpt-gpu: GPU - 4480 x 2160 - Caustic309.11.211.522.380.200.6312.582375.3816.60875169814.3414.48451.231.386.071.19104.092.354.7659.13286.83308.90818.03698.8010.974474.0513681.513431.62801.3616.5325.3534112949451880163648988716364900231636490160OpenBenchmarking.org

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Boat - Acceleration: OpenCL09.11.210.3420.6841.0261.3681.71SE +/- 0.02, N = 31.52

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Masskrug - Acceleration: OpenCL09.11.210.53551.0711.60652.1422.6775SE +/- 0.03, N = 122.38

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Server Rack - Acceleration: OpenCL09.11.210.0450.090.1350.180.225SE +/- 0.00, N = 30.20

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Server Room - Acceleration: OpenCL09.11.210.14180.28360.42540.56720.709SE +/- 0.00, N = 30.63

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad09.11.213691215SE +/- 0.12, N = 1512.581. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SP09.11.215001000150020002500SE +/- 0.70, N = 32375.381. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hash09.11.2148121620SE +/- 0.00, N = 1116.601. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flops09.11.212M4M6M8M10MSE +/- 768261.22, N = 1287516981. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Download09.11.2148121620SE +/- 0.00, N = 314.341. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readback09.11.2148121620SE +/- 0.00, N = 314.481. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidth09.11.21100200300400500SE +/- 0.33, N = 3451.231. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Parboil

Test: OpenCL BFS

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFS09.11.210.31050.6210.93151.2421.5525SE +/- 0.01, N = 31.381. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenCL LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBM09.11.21246810SE +/- 0.02, N = 36.071. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenCL TPACF

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL TPACF09.11.210.26780.53560.80341.07121.339SE +/- 0.02, N = 31.191. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte09.11.2120406080100SE +/- 0.95, N = 3104.091. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Heartwall09.11.210.52881.05761.58642.11522.644SE +/- 0.01, N = 32.351. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Leukocyte09.11.211.0712.1423.2134.2845.355SE +/- 0.03, N = 34.761. (CXX) g++ options: -O2 -lOpenCL

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: OpenCL09.11.211326395265SE +/- 1.24, N = 1559.13

Blender

Blend File: Barbershop - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: OpenCL09.11.2160120180240300SE +/- 4.04, N = 9286.83

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copy09.11.2170140210280350SE +/- 2.41, N = 3308.901. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Read09.11.212004006008001000SE +/- 1.07, N = 3818.031. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Write09.11.21150300450600750SE +/- 5.00, N = 3698.801. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latency09.11.213691215SE +/- 0.09, N = 310.971. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INT09.11.2110002000300040005000SE +/- 1.49, N = 34474.051. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Float09.11.213K6K9K12K15KSE +/- 1.46, N = 313681.511. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Double09.11.217001400210028003500SE +/- 0.97, N = 33431.621. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidth09.11.212004006008001000SE +/- 0.10, N = 3801.361. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBuffer09.11.2148121620SE +/- 0.06, N = 316.531. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBuffer09.11.21612182430SE +/- 0.04, N = 325.351. (CXX) g++ options: -O3 -rdynamic -lOpenCL

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Hotel09.11.217001400210028003500SE +/- 2.85, N = 33411

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Microphone09.11.216K12K18K24K30KSE +/- 104.67, N = 329494

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDR09.11.2111K22K33K44K55KSE +/- 90.83, N = 351880

SmallPT GPU

OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic09.11.21400M800M1200M1600M2000MSE +/- 25.12, N = 316364898871. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Cornell

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Cornell09.11.21400M800M1200M1600M2000MSE +/- 24.83, N = 316364900231. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic309.11.21400M800M1200M1600M2000MSE +/- 25.40, N = 316364901601. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.4