20231227.txt AMD Ryzen 9 7940HS testing with a Shenzhen Meigao Electronic Equipment F7BSC (1.07 BIOS) and AMD Radeon PRO W6800 30GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2312270-NE-20231227T93&grs .
20231227.txt Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution AMD Radeon PRO W6800 AMD Ryzen 9 7940HS @ 4.00GHz (8 Cores / 16 Threads) Shenzhen Meigao Electronic Equipment F7BSC (1.07 BIOS) AMD Device 14e8 56GB 4097GB HP SSD FX900 Pro 4TB + 1024GB KINGSTON OM8PGP41024Q-A0 AMD Radeon PRO W6800 30GB AMD Navi 21 HDMI Audio DELL U2412M Realtek RTL8125 2.5GbE + Intel I210 + Intel Wi-Fi 6 AX210/AX211/AX411 Ubuntu 22.04 6.2.0-39-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.3 + Wayland 4.6 Mesa 23.0.4-0ubuntu1~22.04.1 (LLVM 15.0.7 DRM 3.56) OpenCL 2.1 AMD-APP (3602.0) 1.3.238 GCC 12.1.0 ext4 1920x1200 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-conda-linux-gnu --disable-bootstrap --disable-libmudflap --disable-libssp --disable-multilib --disable-nls --enable-__cxa_atexit --enable-default-pie --enable-gold --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libgomp --enable-libquadmath --enable-libquadmath-support --enable-libsanitizer --enable-long-long --enable-lto --enable-plugin --enable-target-optspace --enable-threads=posix --host=x86_64-conda-linux-gnu --mandir=/home/conda/feedstock_root/build_artifacts/gcc_compilers_1665882792052/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho/man --target=x86_64-conda-linux-gnu --with-build-sysroot=/home/conda/feedstock_root/build_artifacts/gcc_compilers_1665882792052/_build_env/x86_64-conda-linux-gnu/sysroot --with-slibdir=/home/conda/feedstock_root/build_artifacts/gcc_compilers_1665882792052/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho/lib - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa704103 - BAR1 / Visible vRAM Size: 30704 MB - vBIOS Version: 113-D4300100-103 - Python 3.11.5 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
20231227.txt lulesh-cl: luxmark: CPU+GPU - Luxball HDR luxmark: CPU+GPU - Microphone luxmark: GPU - Luxball HDR luxmark: GPU - Microphone luxmark: CPU+GPU - Hotel luxmark: GPU - Hotel darktable: Server Room - OpenCL darktable: Server Rack - OpenCL darktable: Masskrug - OpenCL darktable: Boat - OpenCL viennacl: OpenCL BLAS - dGEMM-TT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sCOPY viennacl: CPU BLAS - dGEMM-TT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sCOPY clpeak: Transfer Bandwidth enqueueWriteBuffer clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Single-Precision Compute clpeak: Double-Precision Compute clpeak: Global Memory Bandwidth clpeak: Integer 24-bit Compute clpeak: Integer Compute fluidx3d: FP32-FP16S fluidx3d: FP32-FP16C fluidx3d: FP32-FP32 shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Bus Speed Download shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Reduction shoc: OpenCL - MD5 Hash shoc: OpenCL - FFT SP shoc: OpenCL - Triad shoc: OpenCL - S3D clpeak: Kernel Latency shoc: OpenCL - Max SP Flops AMD Radeon PRO W6800 2016.3927 142184 111531 142986 112279 19205 19365 1.105 0.452 3.573 2.829 1010 989 1020 998 465 142 350 355 325 490 725 495 50.4 52.6 47.1 48.2 42.3 41.5 42.1 56.5 37.7 45.4 65.2 43.5 21.49 5.10 16882.56 1172.89 368.32 14797.68 3655.35 5346 5196 3428 917.980 1.8570 1.9976 6122.47 575.989 24.2961 1472.83 1.9015 83.3888 21.84 10629233 OpenBenchmarking.org
Lulesh OpenCL OpenBenchmarking.org z/s, More Is Better Lulesh OpenCL 2017-07-06 AMD Radeon PRO W6800 400 800 1200 1600 2000 SE +/- 5.43, N = 3 2016.39 1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
LuxMark OpenCL Device: CPU+GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Luxball HDR AMD Radeon PRO W6800 30K 60K 90K 120K 150K SE +/- 336.85, N = 3 142184
LuxMark OpenCL Device: CPU+GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Microphone AMD Radeon PRO W6800 20K 40K 60K 80K 100K SE +/- 45.06, N = 3 111531
LuxMark OpenCL Device: GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Luxball HDR AMD Radeon PRO W6800 30K 60K 90K 120K 150K SE +/- 318.13, N = 3 142986
LuxMark OpenCL Device: GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone AMD Radeon PRO W6800 20K 40K 60K 80K 100K SE +/- 969.07, N = 8 112279
LuxMark OpenCL Device: CPU+GPU - Scene: Hotel OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Hotel AMD Radeon PRO W6800 4K 8K 12K 16K 20K SE +/- 1.20, N = 3 19205
LuxMark OpenCL Device: GPU - Scene: Hotel OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Hotel AMD Radeon PRO W6800 4K 8K 12K 16K 20K SE +/- 157.99, N = 9 19365
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Server Room - Acceleration: OpenCL AMD Radeon PRO W6800 0.2486 0.4972 0.7458 0.9944 1.243 SE +/- 0.002, N = 3 1.105
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Server Rack - Acceleration: OpenCL AMD Radeon PRO W6800 0.1017 0.2034 0.3051 0.4068 0.5085 SE +/- 0.005, N = 15 0.452
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Masskrug - Acceleration: OpenCL AMD Radeon PRO W6800 0.8039 1.6078 2.4117 3.2156 4.0195 SE +/- 0.015, N = 3 3.573
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Boat - Acceleration: OpenCL AMD Radeon PRO W6800 0.6365 1.273 1.9095 2.546 3.1825 SE +/- 0.031, N = 5 2.829
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT AMD Radeon PRO W6800 200 400 600 800 1000 SE +/- 0.00, N = 2 1010 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN AMD Radeon PRO W6800 200 400 600 800 1000 SE +/- 1.45, N = 3 989 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT AMD Radeon PRO W6800 200 400 600 800 1000 SE +/- 0.00, N = 3 1020 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN AMD Radeon PRO W6800 200 400 600 800 1000 SE +/- 0.88, N = 3 998 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T AMD Radeon PRO W6800 100 200 300 400 500 SE +/- 1.53, N = 3 465 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N AMD Radeon PRO W6800 30 60 90 120 150 SE +/- 1.00, N = 3 142 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT AMD Radeon PRO W6800 80 160 240 320 400 SE +/- 2.73, N = 3 350 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY AMD Radeon PRO W6800 80 160 240 320 400 SE +/- 2.91, N = 3 355 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY AMD Radeon PRO W6800 70 140 210 280 350 SE +/- 1.33, N = 3 325 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT AMD Radeon PRO W6800 110 220 330 440 550 SE +/- 1.76, N = 3 490 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY AMD Radeon PRO W6800 160 320 480 640 800 SE +/- 1.20, N = 3 725 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY AMD Radeon PRO W6800 110 220 330 440 550 SE +/- 2.60, N = 3 495 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT AMD Radeon PRO W6800 11 22 33 44 55 SE +/- 0.09, N = 3 50.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN AMD Radeon PRO W6800 12 24 36 48 60 SE +/- 0.03, N = 3 52.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT AMD Radeon PRO W6800 11 22 33 44 55 SE +/- 0.12, N = 3 47.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN AMD Radeon PRO W6800 11 22 33 44 55 SE +/- 0.10, N = 3 48.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T AMD Radeon PRO W6800 10 20 30 40 50 42.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N AMD Radeon PRO W6800 9 18 27 36 45 SE +/- 0.03, N = 3 41.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT AMD Radeon PRO W6800 10 20 30 40 50 SE +/- 0.00, N = 3 42.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY AMD Radeon PRO W6800 13 26 39 52 65 SE +/- 0.03, N = 3 56.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY AMD Radeon PRO W6800 9 18 27 36 45 SE +/- 0.00, N = 3 37.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT AMD Radeon PRO W6800 10 20 30 40 50 SE +/- 0.13, N = 3 45.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY AMD Radeon PRO W6800 15 30 45 60 75 SE +/- 0.06, N = 3 65.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY AMD Radeon PRO W6800 10 20 30 40 50 SE +/- 0.09, N = 3 43.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer AMD Radeon PRO W6800 5 10 15 20 25 SE +/- 0.19, N = 15 21.49 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer AMD Radeon PRO W6800 1.1475 2.295 3.4425 4.59 5.7375 SE +/- 0.01, N = 3 5.10 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Compute AMD Radeon PRO W6800 4K 8K 12K 16K 20K SE +/- 74.46, N = 3 16882.56 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute AMD Radeon PRO W6800 300 600 900 1200 1500 SE +/- 0.11, N = 3 1172.89 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth AMD Radeon PRO W6800 80 160 240 320 400 SE +/- 0.39, N = 3 368.32 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer 24-bit Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute AMD Radeon PRO W6800 3K 6K 9K 12K 15K SE +/- 133.95, N = 3 14797.68 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute AMD Radeon PRO W6800 800 1600 2400 3200 4000 SE +/- 1.28, N = 3 3655.35 1. (CXX) g++ options: -O3
FluidX3D Test: FP32-FP16S OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.9 Test: FP32-FP16S AMD Radeon PRO W6800 1100 2200 3300 4400 5500 SE +/- 30.89, N = 3 5346
FluidX3D Test: FP32-FP16C OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.9 Test: FP32-FP16C AMD Radeon PRO W6800 1100 2200 3300 4400 5500 SE +/- 23.62, N = 3 5196
FluidX3D Test: FP32-FP32 OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.9 Test: FP32-FP32 AMD Radeon PRO W6800 700 1400 2100 2800 3500 SE +/- 12.45, N = 3 3428
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth AMD Radeon PRO W6800 200 400 600 800 1000 SE +/- 3.81, N = 3 917.98 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback AMD Radeon PRO W6800 0.4178 0.8356 1.2534 1.6712 2.089 SE +/- 0.0002, N = 3 1.8570 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download AMD Radeon PRO W6800 0.4495 0.899 1.3485 1.798 2.2475 SE +/- 0.0001, N = 3 1.9976 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N AMD Radeon PRO W6800 1300 2600 3900 5200 6500 SE +/- 8.35, N = 3 6122.47 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction AMD Radeon PRO W6800 120 240 360 480 600 SE +/- 0.63, N = 3 575.99 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash AMD Radeon PRO W6800 6 12 18 24 30 SE +/- 0.01, N = 3 24.30 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP AMD Radeon PRO W6800 300 600 900 1200 1500 SE +/- 4.42, N = 3 1472.83 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad AMD Radeon PRO W6800 0.4278 0.8556 1.2834 1.7112 2.139 SE +/- 0.0023, N = 3 1.9015 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D AMD Radeon PRO W6800 20 40 60 80 100 SE +/- 0.84, N = 3 83.39 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak 1.1.2 OpenCL Test: Kernel Latency AMD Radeon PRO W6800 5 10 15 20 25 SE +/- 0.34, N = 15 21.84 1. (CXX) g++ options: -O3
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops AMD Radeon PRO W6800 2M 4M 6M 8M 10M SE +/- 5725855.20, N = 12 10629233 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Phoronix Test Suite v10.8.5