1
AMD Ryzen 5 7600X 6-Core testing with an ASRock B650 PG Lightning (1.30.AS05 BIOS) and MSI AMD Radeon RX 6900 XT 16GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2403143-NE-18190240171&grs .
6900XT First Run
Processor: AMD Ryzen 5 7600X 6-Core @ 5.45GHz (6 Cores / 12 Threads)
Motherboard: ASRock B650 PG Lightning (1.30.AS05 BIOS)
Chipset: AMD Device 14d8
Memory: 32GB
Disk: 2 x 1000GB PCIe SSD
Graphics: MSI AMD Radeon RX 6900 XT 16GB (2200/2800MHz)
Audio: AMD Navi 21/23
Monitor: VG27A
Network: Realtek RTL8125 2.5GbE
OS: EndeavourOS rolling
Kernel: 6.7.9-arch1-1 (x86_64)
Desktop: KDE Plasma 6.0.2
Display Server: X Server 1.21.1.11
OpenGL: 4.6 Mesa 24.0.2-arch1.2 (LLVM 17.0.6 DRM 3.57)
OpenCL: OpenCL 2.1 AMD-APP.dbg (3602.0)
Compiler: GCC 13.2.1 20230801
File-System: ext4
Screen Resolution: 2560x1440
- Transparent Huge Pages: always
- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
- Scaling Governor: amd-pstate-epp performance (EPP: performance)
- CPU Microcode: 0xa601203
- GLAMOR
- BAR1 / Visible vRAM Size: 16368 MB
- vBIOS Version: 102-RAPHAEL-008
- Python 3.11.8
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
6900XT First Run
lulesh-cl: 4447.1642
luxmark: CPU+GPU - Luxball HDR: 90302
luxmark: CPU+GPU - Microphone: 55844
luxmark: GPU - Luxball HDR: 90896
luxmark: GPU - Microphone: 56358
luxmark: CPU+GPU - Hotel: 9536
luxmark: GPU - Hotel: 9648
smallpt-gpu: GPU - 2560 x 1440 - Caustic3: 1710392573
smallpt-gpu: GPU - 2560 x 1440 - Cornell: 1710392437
smallpt-gpu: GPU - 2560 x 1440 - Caustic: 1710392302
darktable: Server Room - OpenCL: 0.610
darktable: Server Rack - OpenCL: 0.056
darktable: Masskrug - OpenCL: 1.756
darktable: Boat - OpenCL: 1.423
viennacl: OpenCL BLAS - dGEMM-TT: 1340
viennacl: OpenCL BLAS - dGEMM-TN: 1290
viennacl: OpenCL BLAS - dGEMM-NT: 1350
viennacl: OpenCL BLAS - dGEMM-NN: 1320
viennacl: OpenCL BLAS - dGEMV-T: 516
viennacl: OpenCL BLAS - dGEMV-N: 163
viennacl: OpenCL BLAS - dDOT: 571
viennacl: OpenCL BLAS - dAXPY: 640
viennacl: OpenCL BLAS - dCOPY: 436
viennacl: OpenCL BLAS - sDOT: 578
viennacl: OpenCL BLAS - sAXPY: 821
viennacl: OpenCL BLAS - sCOPY: 572
viennacl: CPU BLAS - dGEMM-TT: 38.9
viennacl: CPU BLAS - dGEMM-TN: 40.5
viennacl: CPU BLAS - dGEMM-NT: 35.8
viennacl: CPU BLAS - dGEMM-NN: 37.7
viennacl: CPU BLAS - dGEMV-T: 70.8
viennacl: CPU BLAS - dGEMV-N: 72.1
viennacl: CPU BLAS - dDOT: 66.7
viennacl: CPU BLAS - dAXPY: 70.2
viennacl: CPU BLAS - dCOPY: 46.6
viennacl: CPU BLAS - sDOT: 83.7
viennacl: CPU BLAS - sAXPY: 89.4
viennacl: CPU BLAS - sCOPY: 59.6
rodinia: OpenCL Leukocyte: 2.306
rodinia: OpenCL Myocyte: 5.424
clpeak: Transfer Bandwidth enqueueWriteBuffer: 21.99
clpeak: Transfer Bandwidth enqueueReadBuffer: 5.30
clpeak: Single-Precision Compute: 24650.53
clpeak: Double-Precision Compute: 1580.95
clpeak: Global Memory Bandwidth: 422.73
clpeak: Integer 24-bit Compute: 23607.58
clpeak: Integer Compute: 4856.17
clpeak: Kernel Latency: 9.11
fluidx3d: FP32-FP16S: 4231
fluidx3d: FP32-FP16C: 4214
fluidx3d: FP32-FP32: 2008
cl-mem: Write: 416.5
cl-mem: Read: 471.4
cl-mem: Copy: 367.3
shoc: OpenCL - Texture Read Bandwidth: 1135.80
shoc: OpenCL - Bus Speed Readback: 28.4150
shoc: OpenCL - Bus Speed Download: 28.7631
shoc: OpenCL - GEMM SGEMM_N: 8720.47
shoc: OpenCL - Reduction: 645.292
shoc: OpenCL - MD5 Hash: 32.1352
shoc: OpenCL - FFT SP: 1697.07
shoc: OpenCL - Triad: 25.0029
shoc: OpenCL - S3D: 308.197
shoc: OpenCL - Max SP Flops: 67300017
Lulesh OpenCL 2017-07-06, 6900XT First Run
Lulesh OpenCL: 4447.16 z/s, More Is Better (SE +/- 19.10, N = 3)
1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
LuxMark 3.1, 6900XT First Run
OpenCL Device: CPU+GPU - Scene: Luxball HDR: 90302 Score, More Is Better (SE +/- 134.99, N = 3)
OpenCL Device: CPU+GPU - Scene: Microphone: 55844 Score, More Is Better (SE +/- 69.67, N = 3)
OpenCL Device: GPU - Scene: Luxball HDR: 90896 Score, More Is Better (SE +/- 489.00, N = 3)
OpenCL Device: GPU - Scene: Microphone: 56358 Score, More Is Better (SE +/- 538.50, N = 6)
OpenCL Device: CPU+GPU - Scene: Hotel: 9536 Score, More Is Better (SE +/- 6.51, N = 3)
OpenCL Device: GPU - Scene: Hotel: 9648 Score, More Is Better (SE +/- 116.29, N = 4)
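The CPU+GPU and GPU-only LuxMark scores are close, so it can help to express each gap in units of the reported standard errors. The following is a minimal sketch, assuming the two runs of each scene are independent so their standard errors combine in quadrature; the helper name `gap_in_se` is illustrative and not part of the Phoronix Test Suite.

```python
import math

# (score, standard error) pairs copied from the LuxMark 3.1 results above.
luxmark = {
    "Luxball HDR": {"cpu_gpu": (90302, 134.99), "gpu": (90896, 489.00)},
    "Microphone": {"cpu_gpu": (55844, 69.67), "gpu": (56358, 538.50)},
    "Hotel": {"cpu_gpu": (9536, 6.51), "gpu": (9648, 116.29)},
}


def gap_in_se(a, b):
    """Return (b - a) divided by the combined standard error of a and b."""
    (score_a, se_a), (score_b, se_b) = a, b
    combined = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    return (score_b - score_a) / combined


for scene, r in luxmark.items():
    gap = gap_in_se(r["cpu_gpu"], r["gpu"])
    print(f"{scene}: GPU-only minus CPU+GPU = {gap:.2f} combined SE")
```

Under that independence assumption, all three gaps come out at roughly one combined standard error, so these figures alone do not clearly separate the two device configurations.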
SmallPT GPU 1.6pts1, 6900XT First Run
OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic3: 1710392573 Samples/sec, More Is Better (SE +/- 24.83, N = 3)
OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Cornell: 1710392437 Samples/sec, More Is Better (SE +/- 24.83, N = 3)
OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic: 1710392302 Samples/sec, More Is Better (SE +/- 24.83, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
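The three SmallPT GPU values differ by only 135 to 136 and have the magnitude of Unix epoch seconds, so one possible reading is that a timestamp was recorded in place of a samples/sec figure. That is an assumption; the export does not say so. A small sketch to see which dates the values would correspond to if read that way:

```python
from datetime import datetime, timezone

# Values copied from the SmallPT GPU 1.6pts1 results above.
reported = {
    "Caustic3": 1710392573,
    "Cornell": 1710392437,
    "Caustic": 1710392302,
}

# Assumption: interpret each value as seconds since the Unix epoch (UTC).
as_times = {
    scene: datetime.fromtimestamp(value, tz=timezone.utc)
    for scene, value in reported.items()
}

for scene, when in as_times.items():
    print(f"{scene}: {reported[scene]} -> {when.isoformat()}")
```

Read this way, all three values fall on 2024-03-14 within a few minutes of each other, so the SmallPT figures are best treated with caution.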
Darktable 4.6.1, 6900XT First Run
Test: Server Room - Acceleration: OpenCL: 0.610 Seconds, Fewer Is Better (SE +/- 0.002, N = 3)
Test: Server Rack - Acceleration: OpenCL: 0.056 Seconds, Fewer Is Better (SE +/- 0.000, N = 3)
Test: Masskrug - Acceleration: OpenCL: 1.756 Seconds, Fewer Is Better (SE +/- 0.007, N = 3)
Test: Boat - Acceleration: OpenCL: 1.423 Seconds, Fewer Is Better (SE +/- 0.001, N = 3)
ViennaCL 1.7.1 (OpenCL BLAS), 6900XT First Run
Test: OpenCL BLAS - dGEMM-TT: 1340 GFLOPs/s, More Is Better (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dGEMM-TN: 1290 GFLOPs/s, More Is Better (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dGEMM-NT: 1350 GFLOPs/s, More Is Better (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dGEMM-NN: 1320 GFLOPs/s, More Is Better (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dGEMV-T: 516 GB/s, More Is Better (SE +/- 0.00, N = 3)
Test: OpenCL BLAS - dGEMV-N: 163 GB/s, More Is Better (SE +/- 0.33, N = 3)
Test: OpenCL BLAS - dDOT: 571 GB/s, More Is Better (SE +/- 1.45, N = 3)
Test: OpenCL BLAS - dAXPY: 640 GB/s, More Is Better (SE +/- 0.88, N = 3)
Test: OpenCL BLAS - dCOPY: 436 GB/s, More Is Better (SE +/- 0.33, N = 3)
Test: OpenCL BLAS - sDOT: 578 GB/s, More Is Better (SE +/- 1.00, N = 3)
Test: OpenCL BLAS - sAXPY: 821 GB/s, More Is Better (SE +/- 0.88, N = 3)
Test: OpenCL BLAS - sCOPY: 572 GB/s, More Is Better (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 (CPU BLAS), 6900XT First Run
Test: CPU BLAS - dGEMM-TT: 38.9 GFLOPs/s, More Is Better (SE +/- 0.09, N = 3)
Test: CPU BLAS - dGEMM-TN: 40.5 GFLOPs/s, More Is Better (SE +/- 0.03, N = 3)
Test: CPU BLAS - dGEMM-NT: 35.8 GFLOPs/s, More Is Better (SE +/- 0.06, N = 3)
Test: CPU BLAS - dGEMM-NN: 37.7 GFLOPs/s, More Is Better (SE +/- 0.00, N = 3)
Test: CPU BLAS - dGEMV-T: 70.8 GB/s, More Is Better (SE +/- 0.23, N = 3)
Test: CPU BLAS - dGEMV-N: 72.1 GB/s, More Is Better (SE +/- 0.37, N = 3)
Test: CPU BLAS - dDOT: 66.7 GB/s, More Is Better (SE +/- 0.52, N = 3)
Test: CPU BLAS - dAXPY: 70.2 GB/s, More Is Better (SE +/- 0.15, N = 3)
Test: CPU BLAS - dCOPY: 46.6 GB/s, More Is Better (SE +/- 0.21, N = 3)
Test: CPU BLAS - sDOT: 83.7 GB/s, More Is Better (SE +/- 0.09, N = 3)
Test: CPU BLAS - sAXPY: 89.4 GB/s, More Is Better (SE +/- 0.49, N = 3)
Test: CPU BLAS - sCOPY: 59.6 GB/s, More Is Better (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
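Because ViennaCL runs the same twelve BLAS tests on both the OpenCL and CPU back ends, the two sets of results can be paired up as ratios. A minimal sketch using only the values reported above (the dictionary layout is just for illustration):

```python
# ViennaCL 1.7.1 results copied from above as (OpenCL BLAS, CPU BLAS).
# dGEMM tests are in GFLOPs/s; all other tests are in GB/s.
viennacl = {
    "dGEMM-TT": (1340, 38.9),
    "dGEMM-TN": (1290, 40.5),
    "dGEMM-NT": (1350, 35.8),
    "dGEMM-NN": (1320, 37.7),
    "dGEMV-T": (516, 70.8),
    "dGEMV-N": (163, 72.1),
    "dDOT": (571, 66.7),
    "dAXPY": (640, 70.2),
    "dCOPY": (436, 46.6),
    "sDOT": (578, 83.7),
    "sAXPY": (821, 89.4),
    "sCOPY": (572, 59.6),
}

# Ratio of the OpenCL result to the CPU result for each test.
ratios = {test: opencl / cpu for test, (opencl, cpu) in viennacl.items()}

for test, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{test}: OpenCL / CPU = {ratio:.1f}x")
```

On these numbers the dGEMM tests show the largest OpenCL advantage (roughly 32x to 38x), the vector tests sit around 7x to 10x, and dGEMV-N is the smallest at about 2.3x.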
Rodinia 3.1, 6900XT First Run
Test: OpenCL Leukocyte: 2.306 Seconds, Fewer Is Better (SE +/- 0.004, N = 3)
Test: OpenCL Myocyte: 5.424 Seconds, Fewer Is Better (SE +/- 0.068, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
clpeak 1.1.2, 6900XT First Run
OpenCL Test: Transfer Bandwidth enqueueWriteBuffer: 21.99 GBPS, More Is Better (SE +/- 0.09, N = 3)
OpenCL Test: Transfer Bandwidth enqueueReadBuffer: 5.30 GBPS, More Is Better (SE +/- 0.03, N = 3)
OpenCL Test: Single-Precision Compute: 24650.53 GFLOPS, More Is Better (SE +/- 104.45, N = 3)
OpenCL Test: Double-Precision Compute: 1580.95 GFLOPS, More Is Better (SE +/- 0.40, N = 3)
OpenCL Test: Global Memory Bandwidth: 422.73 GBPS, More Is Better (SE +/- 1.63, N = 3)
OpenCL Test: Integer 24-bit Compute: 23607.58 GIOPS, More Is Better (SE +/- 64.44, N = 3)
OpenCL Test: Integer Compute: 4856.17 GIOPS, More Is Better (SE +/- 8.60, N = 3)
OpenCL Test: Kernel Latency: 9.11 us, Fewer Is Better (SE +/- 0.08, N = 7)
1. (CXX) g++ options: -O3
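Two ratios follow directly from the clpeak figures and make the results easier to read at a glance: single- versus double-precision throughput, and host-to-device versus device-to-host transfer bandwidth. A short sketch of the arithmetic, using only the values reported above (variable names are illustrative):

```python
# clpeak 1.1.2 results copied from above.
single_precision_gflops = 24650.53
double_precision_gflops = 1580.95
write_buffer_gbps = 21.99  # enqueueWriteBuffer
read_buffer_gbps = 5.30    # enqueueReadBuffer

# How many times faster single precision is than double precision.
sp_to_dp = single_precision_gflops / double_precision_gflops

# How many times faster buffer writes are than buffer reads.
write_to_read = write_buffer_gbps / read_buffer_gbps

print(f"Single / double precision compute: {sp_to_dp:.2f}x")
print(f"Write / read transfer bandwidth:   {write_to_read:.2f}x")
```

This gives about 15.6x for single over double precision and about 4.1x for enqueueWriteBuffer over enqueueReadBuffer on this system.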
FluidX3D 2.9, 6900XT First Run
Test: FP32-FP16S: 4231 MLUPs/s, More Is Better (SE +/- 7.22, N = 3)
Test: FP32-FP16C: 4214 MLUPs/s, More Is Better (SE +/- 6.43, N = 3)
Test: FP32-FP32: 2008 MLUPs/s, More Is Better (SE +/- 2.60, N = 3)
cl-mem 2017-01-13, 6900XT First Run
Benchmark: Write: 416.5 GB/s, More Is Better (SE +/- 0.53, N = 3)
Benchmark: Read: 471.4 GB/s, More Is Better (SE +/- 0.09, N = 3)
Benchmark: Copy: 367.3 GB/s, More Is Better (SE +/- 0.10, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL
SHOC Scalable HeterOgeneous Computing 2020-04-17, 6900XT First Run
Target: OpenCL - Benchmark: Texture Read Bandwidth: 1135.80 GB/s, More Is Better (SE +/- 6.38, N = 3)
Target: OpenCL - Benchmark: Bus Speed Readback: 28.42 GB/s, More Is Better (SE +/- 0.00, N = 3)
Target: OpenCL - Benchmark: Bus Speed Download: 28.76 GB/s, More Is Better (SE +/- 0.00, N = 3)
Target: OpenCL - Benchmark: GEMM SGEMM_N: 8720.47 GFLOPS, More Is Better (SE +/- 102.63, N = 4)
Target: OpenCL - Benchmark: Reduction: 645.29 GB/s, More Is Better (SE +/- 0.64, N = 3)
Target: OpenCL - Benchmark: MD5 Hash: 32.14 GHash/s, More Is Better (SE +/- 0.04, N = 3)
Target: OpenCL - Benchmark: FFT SP: 1697.07 GFLOPS, More Is Better (SE +/- 5.03, N = 3)
Target: OpenCL - Benchmark: Triad: 25.00 GB/s, More Is Better (SE +/- 0.09, N = 3)
Target: OpenCL - Benchmark: S3D: 308.20 GFLOPS, More Is Better (SE +/- 0.39, N = 3)
Target: OpenCL - Benchmark: Max SP Flops: 67300017 GFLOPS, More Is Better (SE +/- 4918035.27, N = 6)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi
Phoronix Test Suite v10.8.5