Ubuntu 22.04.02 LTS 7900X 7900XTX opencl

AMD Ryzen 9 7900X 12-Core testing with an ASUS ROG STRIX B650E-F GAMING WIFI (1410 BIOS) and ASUS NVIDIA GeForce RTX 4080 16GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2305210-NE-2305163NE65

Result Identifier | Date Run | Test Duration
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl | May 16 2023 | 14 Minutes
Ubuntu 22.04.02 LTS 7900X 4080 opencl | May 21 2023 | 2 Hours, 32 Minutes
Average test duration: 1 Hour, 23 Minutes

HTML result view exported from: https://openbenchmarking.org/result/2305210-NE-2305163NE65.

Ubuntu 22.04.02 LTS 7900X 7900XTX opencl
  Processor: AMD Ryzen 9 7900X 12-Core @ 4.70GHz (12 Cores / 24 Threads)
  Motherboard: ASUS ROG STRIX B650E-F GAMING WIFI (1410 BIOS)
  Chipset: AMD Device 14d8
  Memory: 64GB
  Disk: 2000GB SHPP41-2000GM + 120GB TOSHIBA RC100 + 1000GB Western Digital WD_BLACK SN750 SE NVMe 1TB + 32GB Flash Drive
  Graphics: AMD Radeon RX 7900 XTX 24GB (3220/1249MHz)
  Audio: AMD Device ab30
  Monitor: LG HDR 4K + LG Ultra HD
  Network: Intel I225-V + MEDIATEK Device 0608
  OS: Ubuntu 22.04
  Kernel: 5.19.0-41-generic (x86_64)
  Desktop: Budgie 10.6.1
  Display Server: X Server 1.21.1.4
  OpenGL: 4.6 Mesa 22.3.0-devel (LLVM 15.0.3 DRM 3.48)
  OpenCL: OpenCL 2.1 AMD-APP (3513.0)
  Compiler: GCC 11.3.0
  File-System: ext4
  Screen Resolution: 7680x2160

Ubuntu 22.04.02 LTS 7900X 4080 opencl (fields that differ from the run above)
  Memory: 32GB
  Disk: 2000GB SHPP41-2000GM + 120GB TOSHIBA RC100 + 1000GB Western Digital WD_BLACK SN750 SE NVMe 1TB
  Graphics: ASUS NVIDIA GeForce RTX 4080 16GB
  Audio: NVIDIA Device 22bb
  Display Driver: NVIDIA 530.41.03
  OpenGL: 4.6.0
  OpenCL: OpenCL 3.0 CUDA 12.1.98

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
  - CPU Microcode: 0xa601203

Graphics Details
  - Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: GLAMOR - BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-TIC106615-100
  - Ubuntu 22.04.02 LTS 7900X 4080 opencl: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 95.03.2b.00.8c

Python Details
  - Python 3.10.6

Security Details
  - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

OpenCL Details
  - Ubuntu 22.04.02 LTS 7900X 4080 opencl: GPU Compute Cores: 9728

Result Overview ("-" marks a test with no result for that run)

Test | Ubuntu 22.04.02 LTS 7900X 7900XTX opencl | Ubuntu 22.04.02 LTS 7900X 4080 opencl
fluidx3d: FP32-FP32 | - | 3772
fluidx3d: FP32-FP16C | - | 7705
fluidx3d: FP32-FP16S | - | 7786
rodinia: OpenCL Myocyte | - | 19.916
rodinia: OpenCL Leukocyte | - | 2.393
rodinia: OpenCL Particle Filter | - | 2.929
viennacl: CPU BLAS - sCOPY | 179 | 103
viennacl: CPU BLAS - sAXPY | 289 | 164
viennacl: CPU BLAS - sDOT | 287 | 178
viennacl: CPU BLAS - dCOPY | 63.5 | 36.0
viennacl: CPU BLAS - dAXPY | 95.4 | 54.4
viennacl: CPU BLAS - dDOT | 99.2 | 59.9
viennacl: CPU BLAS - dGEMV-N | 115 | 73.3
viennacl: CPU BLAS - dGEMV-T | 132 | 78.8
viennacl: CPU BLAS - dGEMM-NN | 64.1 | 64.4
viennacl: CPU BLAS - dGEMM-NT | 61.6 | 61.9
viennacl: CPU BLAS - dGEMM-TN | 69.9 | 70.3
viennacl: CPU BLAS - dGEMM-TT | 66.2 | 66.2
darktable: Boat - OpenCL | 2.758 | 1.187
darktable: Masskrug - OpenCL | 3.027 | 1.864
darktable: Server Rack - OpenCL | 0.121 | 0.111
darktable: Server Room - OpenCL | 2.504 | 0.647
luxmark: GPU - Hotel | - | 22310
luxmark: CPU+GPU - Hotel | - | 22294
luxmark: GPU - Microphone | - | 75029
luxmark: GPU - Luxball HDR | - | 99385
luxmark: CPU+GPU - Microphone | - | 75300
luxmark: CPU+GPU - Luxball HDR | - | 99792
lulesh-cl | - | 9189.9160
shoc: OpenCL - S3D | - | 422.946
shoc: OpenCL - Triad | - | 25.9744
shoc: OpenCL - FFT SP | - | 1812.90
shoc: OpenCL - MD5 Hash | - | 57.0867
shoc: OpenCL - Reduction | - | 946.679
shoc: OpenCL - GEMM SGEMM_N | - | 16861.6
shoc: OpenCL - Max SP Flops | - | 53452.7
shoc: OpenCL - Bus Speed Download | - | 26.8876
shoc: OpenCL - Bus Speed Readback | - | 26.3975
shoc: OpenCL - Texture Read Bandwidth | - | 2970.92
cl-mem: Copy | - | 374.9
cl-mem: Read | - | 620.7
cl-mem: Write | - | 520.5
clpeak: Kernel Latency | - | 3.89
clpeak: Integer Compute | - | 23782.67
clpeak: Integer 24-bit Compute | - | 23875.52
clpeak: Global Memory Bandwidth | - | 577.23
clpeak: Double-Precision Compute | - | 817.20
clpeak: Single-Precision Compute | - | 46337.02
clpeak: Transfer Bandwidth enqueueReadBuffer | - | 11.41
clpeak: Transfer Bandwidth enqueueWriteBuffer | - | 12.63
viennacl: OpenCL BLAS - sCOPY | - | 367
viennacl: OpenCL BLAS - sAXPY | - | 483
viennacl: OpenCL BLAS - sDOT | - | 412
viennacl: OpenCL BLAS - dCOPY | - | 524
viennacl: OpenCL BLAS - dAXPY | - | 605
viennacl: OpenCL BLAS - dDOT | - | 596
viennacl: OpenCL BLAS - dGEMV-N | - | 219
viennacl: OpenCL BLAS - dGEMV-T | - | 427
viennacl: OpenCL BLAS - dGEMM-NN | - | 743
viennacl: OpenCL BLAS - dGEMM-NT | - | 766
viennacl: OpenCL BLAS - dGEMM-TN | - | 801
viennacl: OpenCL BLAS - dGEMM-TT | - | 817
smallpt-gpu: GPU - 7680 x 2160 - Caustic | - | 1684649225
smallpt-gpu: GPU - 7680 x 2160 - Cornell | - | 1684649363
smallpt-gpu: GPU - 7680 x 2160 - Caustic3 | - | 1684649501

FluidX3D

FluidX3D 2.3. Unit: MLUPs/s, more is better.

Test: FP32-FP32
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 3772 (SE +/- 4.37, N = 3)

Test: FP32-FP16C
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 7705 (SE +/- 1.00, N = 3)

Test: FP32-FP16S
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 7786 (SE +/- 0.67, N = 3)

Rodinia

Rodinia 3.1. Unit: Seconds, fewer is better.

Test: OpenCL Myocyte
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 19.92 (SE +/- 0.09, N = 3)

Test: OpenCL Leukocyte
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 2.393 (SE +/- 0.030, N = 12)

Test: OpenCL Particle Filter
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 2.929 (SE +/- 0.028, N = 7)

1. (CXX) g++ options: -O2 -lOpenCL

ViennaCL

ViennaCL 1.7.1, CPU BLAS tests. More is better.

Test: CPU BLAS - sCOPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 179 (SE +/- 0.58, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 103 (SE +/- 1.00, N = 3)

Test: CPU BLAS - sAXPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 289 (SE +/- 0.33, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 164 (SE +/- 1.20, N = 3)

Test: CPU BLAS - sDOT (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 287 (SE +/- 0.00, N = 2)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 178 (SE +/- 0.67, N = 3)

Test: CPU BLAS - dCOPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 63.5 (SE +/- 0.09, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 36.0 (SE +/- 0.12, N = 3)

Test: CPU BLAS - dAXPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 95.4 (SE +/- 0.12, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 54.4 (SE +/- 0.03, N = 3)

Test: CPU BLAS - dDOT (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 99.2 (SE +/- 0.19, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 59.9 (SE +/- 0.03, N = 3)

Test: CPU BLAS - dGEMV-N (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 115.0 (SE +/- 0.67, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 73.3 (SE +/- 0.07, N = 3)

Test: CPU BLAS - dGEMV-T (GB/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 132.0 (SE +/- 0.33, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 78.8 (SE +/- 0.12, N = 3)

Test: CPU BLAS - dGEMM-NN (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 64.1 (SE +/- 0.20, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 64.4 (SE +/- 0.20, N = 3)

Test: CPU BLAS - dGEMM-NT (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 61.6 (SE +/- 0.17, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 61.9 (SE +/- 0.19, N = 3)

Test: CPU BLAS - dGEMM-TN (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 69.9 (SE +/- 0.12, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 70.3 (SE +/- 0.17, N = 3)

Test: CPU BLAS - dGEMM-TT (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 66.2 (SE +/- 0.12, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 66.2 (SE +/- 0.17, N = 3)

1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Darktable

Darktable 3.8.1. Unit: Seconds, fewer is better.

Test: Boat - Acceleration: OpenCL
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 2.758 (SE +/- 0.022, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1.187 (SE +/- 0.006, N = 3)

Test: Masskrug - Acceleration: OpenCL
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 3.027 (SE +/- 0.007, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1.864 (SE +/- 0.008, N = 3)

Test: Server Rack - Acceleration: OpenCL
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 0.121 (SE +/- 0.002, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 0.111 (SE +/- 0.000, N = 3)

Test: Server Room - Acceleration: OpenCL
  Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 2.504 (SE +/- 0.006, N = 3)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 0.647 (SE +/- 0.003, N = 3)
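
Darktable is one of the two benchmarks here with results from both runs, so the four times above can be read as ratios. The following is a minimal sketch of that arithmetic, not something the Phoronix Test Suite produces: the names `DARKTABLE_SECONDS` and `time_ratios` are hypothetical, and summarising the ratios with a geometric mean is an assumption made for illustration, not necessarily how OpenBenchmarking.org computes its own summaries.

```python
from statistics import geometric_mean

# Darktable 3.8.1 OpenCL times in seconds, copied from the results above.
# Each tuple is (7900XTX run, 4080 run); fewer seconds is better.
DARKTABLE_SECONDS = {
    "Boat": (2.758, 1.187),
    "Masskrug": (3.027, 1.864),
    "Server Rack": (0.121, 0.111),
    "Server Room": (2.504, 0.647),
}


def time_ratios(results):
    """Return first-run time divided by second-run time for each test.

    A ratio above 1 means the second run finished faster.
    """
    return {name: first / second for name, (first, second) in results.items()}


ratios = time_ratios(DARKTABLE_SECONDS)
# Geometric mean is used here as an assumption; it is a common way to
# average ratios, but it is not taken from the result file.
overall = geometric_mean(list(ratios.values()))

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
print(f"Geometric mean: {overall:.2f}x")
```

The same helper could be applied to the ViennaCL CPU BLAS results, where more is better, by inverting the ratio.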

LuxMark

LuxMark 3.1. Unit: Score, more is better.

OpenCL Device: GPU - Scene: Hotel
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 22310 (SE +/- 37.00, N = 3)

OpenCL Device: CPU+GPU - Scene: Hotel
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 22294 (SE +/- 11.26, N = 3)

OpenCL Device: GPU - Scene: Microphone
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 75029 (SE +/- 252.33, N = 3)

OpenCL Device: GPU - Scene: Luxball HDR
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 99385 (SE +/- 399.21, N = 3)

OpenCL Device: CPU+GPU - Scene: Microphone
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 75300 (SE +/- 12.53, N = 3)

OpenCL Device: CPU+GPU - Scene: Luxball HDR
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 99792 (SE +/- 6.36, N = 3)

Lulesh OpenCL

Lulesh OpenCL 2017-07-06. Unit: z/s, more is better.
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 9189.92 (SE +/- 33.49, N = 3)

1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2020-04-17. More is better.

Target: OpenCL - Benchmark: S3D (GFLOPS)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 422.95 (SE +/- 0.21, N = 3)

Target: OpenCL - Benchmark: Triad (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 25.97 (SE +/- 0.00, N = 3)

Target: OpenCL - Benchmark: FFT SP (GFLOPS)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1812.90 (SE +/- 2.20, N = 3)

Target: OpenCL - Benchmark: MD5 Hash (GHash/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 57.09 (SE +/- 0.32, N = 3)

Target: OpenCL - Benchmark: Reduction (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 946.68 (SE +/- 8.26, N = 8)

Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 16861.6 (SE +/- 152.29, N = 15)

Target: OpenCL - Benchmark: Max SP Flops (GFLOPS)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 53452.7 (SE +/- 92.32, N = 3)

Target: OpenCL - Benchmark: Bus Speed Download (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 26.89 (SE +/- 0.00, N = 3)

Target: OpenCL - Benchmark: Bus Speed Readback (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 26.40 (SE +/- 0.00, N = 3)

Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 2970.92 (SE +/- 2.71, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

cl-mem

cl-mem 2017-01-13. Unit: GB/s, more is better.

Benchmark: Copy
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 374.9 (SE +/- 0.12, N = 3)

Benchmark: Read
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 620.7 (SE +/- 0.22, N = 3)

Benchmark: Write
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 520.5 (SE +/- 1.33, N = 3)

1. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

clpeak 1.1.2.

OpenCL Test: Kernel Latency (us, fewer is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 3.89 (SE +/- 0.05, N = 15)

OpenCL Test: Integer Compute (GIOPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 23782.67 (SE +/- 61.13, N = 3)

OpenCL Test: Integer 24-bit Compute (GIOPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 23875.52 (SE +/- 101.49, N = 3)

OpenCL Test: Global Memory Bandwidth (GBPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 577.23 (SE +/- 3.45, N = 3)

OpenCL Test: Double-Precision Compute (GFLOPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 817.20 (SE +/- 1.38, N = 3)

OpenCL Test: Single-Precision Compute (GFLOPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 46337.02 (SE +/- 82.14, N = 3)

OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 11.41 (SE +/- 0.03, N = 3)

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, more is better)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 12.63 (SE +/- 0.16, N = 3)

1. (CXX) g++ options: -O3

ViennaCL

ViennaCL 1.7.1, OpenCL BLAS tests. More is better.

Test: OpenCL BLAS - sCOPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 367 (SE +/- 1.53, N = 3)

Test: OpenCL BLAS - sAXPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 483 (SE +/- 0.00, N = 3)

Test: OpenCL BLAS - sDOT (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 412 (SE +/- 0.33, N = 3)

Test: OpenCL BLAS - dCOPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 524 (SE +/- 0.00, N = 3)

Test: OpenCL BLAS - dAXPY (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 605 (SE +/- 0.33, N = 3)

Test: OpenCL BLAS - dDOT (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 596 (SE +/- 0.33, N = 3)

Test: OpenCL BLAS - dGEMV-N (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 219 (SE +/- 0.00, N = 3)

Test: OpenCL BLAS - dGEMV-T (GB/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 427 (SE +/- 0.33, N = 3)

Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 743 (SE +/- 1.53, N = 3)

Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 766 (SE +/- 1.33, N = 3)

Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 801 (SE +/- 1.20, N = 3)

Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s)
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 817 (SE +/- 1.53, N = 3)

1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SmallPT GPU

SmallPT GPU 1.6pts1. Unit: Samples/sec, more is better.

OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Caustic
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1684649225 (SE +/- 25.69, N = 3)

OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Cornell
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1684649363 (SE +/- 25.12, N = 3)

OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Caustic3
  Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1684649501 (SE +/- 25.40, N = 3)

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.4