open-cl-suite-fedora-34

AMD Ryzen Threadripper 3960X 24-Core testing with a ASUS ROG STRIX TRX40-XE GAMING (1502 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2111091-AS-OPENCLSUI63
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
09.11.21
November 09 2021
  1 Hour, 57 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


open-cl-suite-fedora-34OpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)ASUS ROG STRIX TRX40-XE GAMING (1502 BIOS)AMD Starship/Matisse32768MB3 x 1000GB Samsung SSD 980 PRO 1TBAMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioASUS MG278 + S242HL + GT-191Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Fedora 345.14.16-201.fc34.x86_64 (x86_64)KDE Plasma 5.22.5X Server 1.20.11amdgpu 21.0.04.6 Mesa 22.0.0-devel (LLVM 12.0.1 DRM 3.42 5.14.16-201.fc34.x86_64)OpenCL 2.2 AMD-APP (3361.0)1.2.197Clang 12.0.1ext44480x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionOpen-cl-suite-fedora-34 BenchmarksSystem Logs- kvm_amd.sev=1 amdgpu.ppfeaturemask=0xffffffff amdgpu.exp_hw_support=1 amdgpu.gpu_recovery=1 amdgpu.deep_color=1 amdgpu.async_gfx_ring=1 amdgpu.mes=1 amdgpu.debug_largebar=1 amdgpu.tmz=1 - Scaling Governor: acpi-cpufreq ondemand- GLAMOR- Python 3.9.7- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

open-cl-suite-fedora-34shoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writeparboil: OpenCL BFSparboil: OpenCL LBMparboil: OpenCL TPACFrodinia: OpenCL Myocyterodinia: OpenCL Heartwallrodinia: OpenCL Leukocytedarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLblender: BMW27 - OpenCLblender: Barbershop - OpenCLsmallpt-gpu: GPU - 4480 x 2160 - Causticsmallpt-gpu: GPU - 4480 x 2160 - Cornellsmallpt-gpu: GPU - 4480 x 2160 - Caustic3luxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBuffer09.11.2112.582375.3816.60875169814.3414.48451.23308.90818.03698.801.386.071.19104.092.354.761.522.380.200.6359.13286.831636489887163649002316364901603411294945188010.974474.0513681.513431.62801.3616.5325.35OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad09.11.213691215SE +/- 0.12, N = 1512.581. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SP09.11.215001000150020002500SE +/- 0.70, N = 32375.381. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hash09.11.2148121620SE +/- 0.00, N = 1116.601. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flops09.11.212M4M6M8M10MSE +/- 768261.22, N = 1287516981. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Download09.11.2148121620SE +/- 0.00, N = 314.341. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readback09.11.2148121620SE +/- 0.00, N = 314.481. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidth09.11.21100200300400500SE +/- 0.33, N = 3451.231. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copy09.11.2170140210280350SE +/- 2.41, N = 3308.901. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Read09.11.212004006008001000SE +/- 1.07, N = 3818.031. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Write09.11.21150300450600750SE +/- 5.00, N = 3698.801. (CC) gcc options: -O2 -flto -lOpenCL

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFS09.11.210.31050.6210.93151.2421.5525SE +/- 0.01, N = 31.381. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBM09.11.21246810SE +/- 0.02, N = 36.071. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL TPACF09.11.210.26780.53560.80341.07121.339SE +/- 0.02, N = 31.191. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte09.11.2120406080100SE +/- 0.95, N = 3104.091. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Heartwall09.11.210.52881.05761.58642.11522.644SE +/- 0.01, N = 32.351. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Leukocyte09.11.211.0712.1423.2134.2845.355SE +/- 0.03, N = 34.761. (CXX) g++ options: -O2 -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Boat - Acceleration: OpenCL09.11.210.3420.6841.0261.3681.71SE +/- 0.02, N = 31.52

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Masskrug - Acceleration: OpenCL09.11.210.53551.0711.60652.1422.6775SE +/- 0.03, N = 122.38

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Server Rack - Acceleration: OpenCL09.11.210.0450.090.1350.180.225SE +/- 0.00, N = 30.20

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.6.1Test: Server Room - Acceleration: OpenCL09.11.210.14180.28360.42540.56720.709SE +/- 0.00, N = 30.63

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: OpenCL09.11.211326395265SE +/- 1.24, N = 1559.13

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: OpenCL09.11.2160120180240300SE +/- 4.04, N = 9286.83

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic09.11.21400M800M1200M1600M2000MSE +/- 25.12, N = 316364898871. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Cornell09.11.21400M800M1200M1600M2000MSE +/- 24.83, N = 316364900231. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 4480 x 2160 - Scene: Caustic309.11.21400M800M1200M1600M2000MSE +/- 25.40, N = 316364901601. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Hotel09.11.217001400210028003500SE +/- 2.85, N = 33411

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Microphone09.11.216K12K18K24K30KSE +/- 104.67, N = 329494

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDR09.11.2111K22K33K44K55KSE +/- 90.83, N = 351880

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latency09.11.213691215SE +/- 0.09, N = 310.971. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INT09.11.2110002000300040005000SE +/- 1.49, N = 34474.051. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Float09.11.213K6K9K12K15KSE +/- 1.46, N = 313681.511. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Double09.11.217001400210028003500SE +/- 0.97, N = 33431.621. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidth09.11.212004006008001000SE +/- 0.10, N = 3801.361. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBuffer09.11.2148121620SE +/- 0.06, N = 316.531. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBuffer09.11.21612182430SE +/- 0.04, N = 325.351. (CXX) g++ options: -O3 -rdynamic -lOpenCL