generalopencl

AMD Ryzen Threadripper 3960X 24-Core testing with a MSI Creator TRX40 (MS-7C59) v1.0 (1.71 BIOS) and AMD Radeon RX 6900 XT 16GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2204062-TJ-GENERALOP05
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
opencltest1
April 06 2022
  1 Hour, 15 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


generalopenclOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)MSI Creator TRX40 (MS-7C59) v1.0 (1.71 BIOS)AMD Starship/Matisse252GB2 x 1000GB Sabrent Rocket 4.0 Plus + 2 x 1000GB Samsung SSD 970 EVO Plus 1TB + 4001GB ArmorATDAMD Radeon RX 6900 XT 16GB (2660/1000MHz)AMD Device ab28LG Ultra HDAquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-107-generic (x86_64)MATE 1.24.0X Server 1.20.134.6 Mesa 21.3.0-devel (LLVM 12.0.1)OpenCL 2.2 AMD-APP (3361.0)GCC 9.4.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionGeneralopencl BenchmarksSystem Logs- amdgpu.ppfeaturemask=0xffffffff - Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301039- GLAMOR - BAR1 / Visible vRAM Size: 256 MB- Python 2.7.18 + Python 3.8.10- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of LFENCE IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

generalopenclshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writerodinia: OpenCL Myocytedarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLblender: BMW27 - OpenCLblender: Barbershop - OpenCLsmallpt-gpu: GPU - 3840 x 2160 - Cornellsmallpt-gpu: GPU - 3840 x 2160 - Caustic3luxmark: GPU - Hotelclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferopencltest114.45091689.9535.3842288190728.777212.7011853.408367.8465.8414.3120.8394.9572.9920.0961.96150.36556.2916492228471649222942894018.775673.2728186.801788.88451.4219.0225.34OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triadopencltest148121620SE +/- 0.03, N = 314.451. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPopencltest1400800120016002000SE +/- 1.91, N = 31689.951. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hashopencltest1816243240SE +/- 0.02, N = 335.381. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flopsopencltest1600K1200K1800K2400K3000KSE +/- 5385.44, N = 328819071. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Downloadopencltest1714212835SE +/- 0.00, N = 328.781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readbackopencltest13691215SE +/- 0.02, N = 312.701. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidthopencltest12004006008001000SE +/- 4.04, N = 3853.411. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copyopencltest180160240320400SE +/- 0.07, N = 3367.81. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readopencltest1100200300400500SE +/- 0.09, N = 3465.81. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writeopencltest190180270360450SE +/- 1.33, N = 3414.31. (CC) gcc options: -O2 -flto -lOpenCL

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL BFS

opencltest1: The test quit with a non-zero exit status.

Test: OpenCL LBM

opencltest1: The test quit with a non-zero exit status.

Test: OpenCL Histo

opencltest1: The test quit with a non-zero exit status.

Test: OpenCL TPACF

opencltest1: The test quit with a non-zero exit status.

Test: OpenCL MRI Gridding

opencltest1: The test quit with a non-zero exit status.

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL LavaMD

opencltest1: Error: Failed to compile source (from CL or HIP source to LLVM IR).

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyteopencltest1306090120150SE +/- 8.61, N = 10120.841. (CXX) g++ options: -O2 -lOpenCL

Test: OpenCL Heartwall

opencltest1: Test failed to run.

Test: OpenCL Leukocyte

opencltest1: The test quit with a non-zero exit status.

Test: OpenCL Particle Filter

opencltest1: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Boat - Acceleration: OpenCLopencltest11.11532.23063.34594.46125.5765SE +/- 0.059, N = 34.957

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Masskrug - Acceleration: OpenCLopencltest10.67321.34642.01962.69283.366SE +/- 0.003, N = 32.992

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Rack - Acceleration: OpenCLopencltest10.02160.04320.06480.08640.108SE +/- 0.005, N = 150.096

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Room - Acceleration: OpenCLopencltest10.44120.88241.32361.76482.206SE +/- 0.012, N = 31.961

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: BMW27 - Compute: OpenCLopencltest11122334455SE +/- 0.03, N = 350.36

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Barbershop - Compute: OpenCLopencltest1120240360480600SE +/- 0.24, N = 3556.29

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

opencltest1: The test quit with a non-zero exit status.

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

opencltest1: The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

opencltest1: The test quit with a non-zero exit status.

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Caustic

opencltest1: The test quit with a non-zero exit status.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornellopencltest1400M800M1200M1600M2000MSE +/- 25.12, N = 316492228471. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3opencltest1400M800M1200M1600M2000M16492229421. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Hotelopencltest12K4K6K8K10KSE +/- 14.01, N = 38940

OpenCL Device: GPU - Scene: Microphone

opencltest1: The test quit with a non-zero exit status.

OpenCL Device: GPU - Scene: Luxball HDR

opencltest1: The test quit with a non-zero exit status.

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latencyopencltest1510152025SE +/- 0.65, N = 1218.771. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTopencltest112002400360048006000SE +/- 1.51, N = 35673.271. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Floatopencltest16K12K18K24K30KSE +/- 39.41, N = 328186.801. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Doubleopencltest1400800120016002000SE +/- 0.24, N = 31788.881. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidthopencltest1100200300400500SE +/- 2.14, N = 3451.421. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferopencltest1510152025SE +/- 0.08, N = 319.021. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferopencltest1612182430SE +/- 0.01, N = 325.341. (CXX) g++ options: -O3 -rdynamic -lOpenCL