20231222um79064thunderbolt.txt

AMD Ryzen 9 7940HS testing with a Shenzhen Meigao Electronic Equipment F7BSC (1.07 BIOS) and AMD Radeon PRO W6800 8GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312228-NE-20231222U30
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
AMD Radeon PRO W6800
December 22 2023
  2 Hours, 36 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


20231222um79064thunderbolt.txtOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 7940HS @ 4.00GHz (8 Cores / 16 Threads)Shenzhen Meigao Electronic Equipment F7BSC (1.07 BIOS)AMD Device 14e856GB4097GB HP SSD FX900 Pro 4TB + 1024GB KINGSTON OM8PGP41024Q-A0AMD Radeon PRO W6800 8GB (2555/1000MHz)AMD Navi 21 HDMI AudioDELL ST2210Realtek RTL8125 2.5GbE + Intel I210 + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.046.2.0-39-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.3 + Wayland4.6 Mesa 23.0.4-0ubuntu1~22.04.1 (LLVM 15.0.7 DRM 3.56)OpenCL 2.1 AMD-APP (3602.0)1.3.238GCC 11.4.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution20231222um79064thunderbolt.txt BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa704103- BAR1 / Visible vRAM Size: 8192 MB - vBIOS Version: 113-D4300100-103- Python 3.10.12- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

20231222um79064thunderbolt.txtcl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Kernel Latencyclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLfluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Slulesh-cl: luxmark: GPU - Hotelluxmark: CPU+GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRluxmark: CPU+GPU - Microphoneluxmark: CPU+GPU - Luxball HDRrodinia: OpenCL Myocyterodinia: OpenCL Leukocyteshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthsmallpt-gpu: GPU - 1920 x 1080 - Causticsmallpt-gpu: GPU - 1920 x 1080 - Cornellsmallpt-gpu: GPU - 1920 x 1080 - Caustic3viennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTxsbench-cl: AMD Radeon PRO W6800326.9413.5369.719.263654.2415708.24373.031172.3717191.604.9921.562.8293.6260.4471.0993406516653761952.293619388193171125631431101113111422149.6793.77595.95001.90841469.8824.2202589.7184847.52280757781.99721.8595913.65017032735621703273699170327383943.164.745.737.756.642.141.548.347.452.850.65117374973263593461404699721010941991OpenBenchmarking.org

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyAMD Radeon PRO W680070140210280350SE +/- 0.50, N = 3326.91. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadAMD Radeon PRO W680090180270360450SE +/- 1.72, N = 3413.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteAMD Radeon PRO W680080160240320400SE +/- 1.59, N = 3369.71. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyAMD Radeon PRO W6800510152025SE +/- 0.20, N = 1519.261. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeAMD Radeon PRO W68008001600240032004000SE +/- 2.74, N = 33654.241. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeAMD Radeon PRO W68003K6K9K12K15KSE +/- 115.06, N = 315708.241. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthAMD Radeon PRO W680080160240320400SE +/- 0.33, N = 3373.031. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeAMD Radeon PRO W680030060090012001500SE +/- 0.55, N = 31172.371. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeAMD Radeon PRO W68004K8K12K16K20KSE +/- 82.20, N = 317191.601. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferAMD Radeon PRO W68001.12282.24563.36844.49125.614SE +/- 0.03, N = 34.991. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferAMD Radeon PRO W6800510152025SE +/- 0.18, N = 1521.561. (CXX) g++ options: -O3

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Boat - Acceleration: OpenCLAMD Radeon PRO W68000.63651.2731.90952.5463.1825SE +/- 0.005, N = 32.829

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Masskrug - Acceleration: OpenCLAMD Radeon PRO W68000.81591.63182.44773.26364.0795SE +/- 0.033, N = 73.626

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Rack - Acceleration: OpenCLAMD Radeon PRO W68000.10060.20120.30180.40240.503SE +/- 0.005, N = 150.447

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Room - Acceleration: OpenCLAMD Radeon PRO W68000.24730.49460.74190.98921.2365SE +/- 0.016, N = 31.099

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP32AMD Radeon PRO W68007001400210028003500SE +/- 7.69, N = 33406

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16CAMD Radeon PRO W680011002200330044005500SE +/- 28.21, N = 35166

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16SAMD Radeon PRO W680012002400360048006000SE +/- 66.77, N = 35376

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Lulesh OpenCL

Lulesh OpenCL benchmark: Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06AMD Radeon PRO W6800400800120016002000SE +/- 21.61, N = 51952.291. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Hotel

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelAMD Radeon PRO W68004K8K12K16K20KSE +/- 187.40, N = 619388

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: HotelAMD Radeon PRO W68004K8K12K16K20KSE +/- 65.58, N = 319317

OpenCL Device: CPU - Scene: Microphone

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneAMD Radeon PRO W680020K40K60K80K100KSE +/- 1027.00, N = 7112563

OpenCL Device: CPU - Scene: Luxball HDR

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRAMD Radeon PRO W680030K60K90K120K150KSE +/- 505.54, N = 3143110

OpenCL Device: Hybrid GPU - Scene: Hotel

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: MicrophoneAMD Radeon PRO W680020K40K60K80K100KSE +/- 156.19, N = 3111311

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Luxball HDRAMD Radeon PRO W680030K60K90K120K150KSE +/- 359.67, N = 3142214

OpenCL Device: Hybrid GPU - Scene: Microphone

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenCL Device: Hybrid GPU - Scene: Luxball HDR

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

Test: LU Decomposition

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: Compressed Sparse Row

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: Cyclic Redundancy Check

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL BFS

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL LBM

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL Histo

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL TPACF

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL MRI Gridding

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL LavaMD

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: Error: Failed to compile source (from CL or HIP source to LLVM IR).

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteAMD Radeon PRO W68003691215SE +/- 0.110, N = 159.6791. (CXX) g++ options: -O2 -lOpenCL

Test: OpenCL Heartwall

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL LeukocyteAMD Radeon PRO W68000.84941.69882.54823.39764.247SE +/- 0.033, N = 153.7751. (CXX) g++ options: -O2 -lOpenCL

Test: OpenCL Particle Filter

AMD Radeon PRO W6800: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DAMD Radeon PRO W680020406080100SE +/- 0.71, N = 395.951. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadAMD Radeon PRO W68000.42940.85881.28821.71762.147SE +/- 0.0030, N = 31.90841. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPAMD Radeon PRO W680030060090012001500SE +/- 4.15, N = 31469.881. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashAMD Radeon PRO W6800612182430SE +/- 0.02, N = 324.221. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionAMD Radeon PRO W6800130260390520650SE +/- 0.29, N = 3589.721. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NAMD Radeon PRO W680010002000300040005000SE +/- 50.10, N = 34847.521. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsAMD Radeon PRO W68006M12M18M24M30MSE +/- 320864.78, N = 9280757781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadAMD Radeon PRO W68000.44940.89881.34821.79762.247SE +/- 0.0000, N = 31.99721. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackAMD Radeon PRO W68000.41840.83681.25521.67362.092SE +/- 0.0000, N = 31.85951. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthAMD Radeon PRO W68002004006008001000SE +/- 0.93, N = 3913.651. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1080 - Scene: CausticAMD Radeon PRO W6800400M800M1200M1600M2000MSE +/- 25.12, N = 317032735621. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1080 - Scene: CornellAMD Radeon PRO W6800400M800M1200M1600M2000MSE +/- 25.12, N = 317032736991. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1080 - Scene: Caustic3AMD Radeon PRO W6800400M800M1200M1600M2000MSE +/- 25.40, N = 317032738391. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYAMD Radeon PRO W68001020304050SE +/- 0.09, N = 343.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYAMD Radeon PRO W68001428425670SE +/- 0.18, N = 364.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTAMD Radeon PRO W68001020304050SE +/- 0.17, N = 345.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYAMD Radeon PRO W6800918273645SE +/- 0.00, N = 337.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYAMD Radeon PRO W68001326395265SE +/- 0.09, N = 356.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTAMD Radeon PRO W68001020304050SE +/- 0.03, N = 342.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NAMD Radeon PRO W6800918273645SE +/- 0.00, N = 341.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNAMD Radeon PRO W68001122334455SE +/- 0.03, N = 348.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTAMD Radeon PRO W68001122334455SE +/- 0.00, N = 347.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNAMD Radeon PRO W68001224364860SE +/- 0.00, N = 352.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTAMD Radeon PRO W68001122334455SE +/- 0.19, N = 350.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYAMD Radeon PRO W6800110220330440550SE +/- 0.58, N = 35111. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYAMD Radeon PRO W6800160320480640800SE +/- 0.33, N = 37371. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTAMD Radeon PRO W6800110220330440550SE +/- 1.86, N = 34971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYAMD Radeon PRO W680070140210280350SE +/- 0.67, N = 33261. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYAMD Radeon PRO W680080160240320400SE +/- 0.88, N = 33591. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTAMD Radeon PRO W680080160240320400SE +/- 0.67, N = 33461. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NAMD Radeon PRO W6800306090120150SE +/- 1.15, N = 31401. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TAMD Radeon PRO W6800100200300400500SE +/- 0.33, N = 34691. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNAMD Radeon PRO W68002004006008001000SE +/- 0.67, N = 39721. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTAMD Radeon PRO W68002004006008001000SE +/- 0.00, N = 310101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNAMD Radeon PRO W68002004006008001000SE +/- 1.15, N = 39411. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTAMD Radeon PRO W68002004006008001000SE +/- 0.88, N = 39911. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Xsbench OpenCL

Xsbench benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

AMD Radeon PRO W6800: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

63 Results Shown

cl-mem:
  Copy
  Read
  Write
clpeak:
  Kernel Latency
  Integer Compute
  Integer 24-bit Compute
  Global Memory Bandwidth
  Double-Precision Compute
  Single-Precision Compute
  Transfer Bandwidth enqueueReadBuffer
  Transfer Bandwidth enqueueWriteBuffer
Darktable:
  Boat - OpenCL
  Masskrug - OpenCL
  Server Rack - OpenCL
  Server Room - OpenCL
FluidX3D:
  FP32-FP32
  FP32-FP16C
  FP32-FP16S
Lulesh OpenCL
LuxMark:
  GPU - Hotel
  CPU+GPU - Hotel
  GPU - Microphone
  GPU - Luxball HDR
  CPU+GPU - Microphone
  CPU+GPU - Luxball HDR
Rodinia:
  OpenCL Myocyte
  OpenCL Leukocyte
SHOC Scalable HeterOgeneous Computing:
  OpenCL - S3D
  OpenCL - Triad
  OpenCL - FFT SP
  OpenCL - MD5 Hash
  OpenCL - Reduction
  OpenCL - GEMM SGEMM_N
  OpenCL - Max SP Flops
  OpenCL - Bus Speed Download
  OpenCL - Bus Speed Readback
  OpenCL - Texture Read Bandwidth
SmallPT GPU:
  GPU - 1920 x 1080 - Caustic
  GPU - 1920 x 1080 - Cornell
  GPU - 1920 x 1080 - Caustic3
ViennaCL:
  CPU BLAS - sCOPY
  CPU BLAS - sAXPY
  CPU BLAS - sDOT
  CPU BLAS - dCOPY
  CPU BLAS - dAXPY
  CPU BLAS - dDOT
  CPU BLAS - dGEMV-N
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-TN
  CPU BLAS - dGEMM-TT
  OpenCL BLAS - sCOPY
  OpenCL BLAS - sAXPY
  OpenCL BLAS - sDOT
  OpenCL BLAS - dCOPY
  OpenCL BLAS - dAXPY
  OpenCL BLAS - dDOT
  OpenCL BLAS - dGEMV-N
  OpenCL BLAS - dGEMV-T
  OpenCL BLAS - dGEMM-NN
  OpenCL BLAS - dGEMM-NT
  OpenCL BLAS - dGEMM-TN
  OpenCL BLAS - dGEMM-TT