Ubuntu 22.04.02 LTS 7900X 7900XTX opencl

AMD Ryzen 9 7900X 12-Core testing with an ASUS ROG STRIX B650E-F GAMING WIFI (1410 BIOS) and ASUS NVIDIA GeForce RTX 4080 16GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2305210-NE-2305163NE65&sor&grt.

Ubuntu 22.04.02 LTS 7900X 7900XTX opencl
Processor: AMD Ryzen 9 7900X 12-Core @ 4.70GHz (12 Cores / 24 Threads)
Motherboard: ASUS ROG STRIX B650E-F GAMING WIFI (1410 BIOS)
Chipset: AMD Device 14d8
Memory: 64GB
Disk: 2000GB SHPP41-2000GM + 120GB TOSHIBA RC100 + 1000GB Western Digital WD_BLACK SN750 SE NVMe 1TB + 32GB Flash Drive
Graphics: AMD Radeon RX 7900 XTX 24GB (3220/1249MHz)
Audio: AMD Device ab30
Monitor: LG HDR 4K + LG Ultra HD
Network: Intel I225-V + MEDIATEK Device 0608
OS: Ubuntu 22.04
Kernel: 5.19.0-41-generic (x86_64)
Desktop: Budgie 10.6.1
Display Server: X Server 1.21.1.4
OpenGL: 4.6 Mesa 22.3.0-devel (LLVM 15.0.3 DRM 3.48)
OpenCL: OpenCL 2.1 AMD-APP (3513.0)
Compiler: GCC 11.3.0
File-System: ext4
Screen Resolution: 7680x2160

Ubuntu 22.04.02 LTS 7900X 4080 opencl (only the fields listed for this system)
Memory: 32GB
Disk: 2000GB SHPP41-2000GM + 120GB TOSHIBA RC100 + 1000GB Western Digital WD_BLACK SN750 SE NVMe 1TB
Graphics: ASUS NVIDIA GeForce RTX 4080 16GB
Audio: NVIDIA Device 22bb
Display Driver: NVIDIA 530.41.03
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.1.98

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0xa601203

Graphics Details
- Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: GLAMOR - BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-TIC106615-100
- Ubuntu 22.04.02 LTS 7900X 4080 opencl: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 95.03.2b.00.8c

Python Details
- Python 3.10.6

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

OpenCL Details
- Ubuntu 22.04.02 LTS 7900X 4080 opencl: GPU Compute Cores: 9728

Result overview (a hyphen marks a test with no value for that system in this export)
Test | Ubuntu 22.04.02 LTS 7900X 7900XTX opencl | Ubuntu 22.04.02 LTS 7900X 4080 opencl
cl-mem: Copy | - | 374.9
cl-mem: Read | - | 620.7
cl-mem: Write | - | 520.5
clpeak: Kernel Latency | - | 3.89
clpeak: Integer Compute | - | 23782.67
clpeak: Integer 24-bit Compute | - | 23875.52
clpeak: Global Memory Bandwidth | - | 577.23
clpeak: Double-Precision Compute | - | 817.20
clpeak: Single-Precision Compute | - | 46337.02
clpeak: Transfer Bandwidth enqueueReadBuffer | - | 11.41
clpeak: Transfer Bandwidth enqueueWriteBuffer | - | 12.63
darktable: Boat - OpenCL | 2.758 | 1.187
darktable: Masskrug - OpenCL | 3.027 | 1.864
darktable: Server Rack - OpenCL | 0.121 | 0.111
darktable: Server Room - OpenCL | 2.504 | 0.647
fluidx3d: FP32-FP32 | - | 3772
fluidx3d: FP32-FP16C | - | 7705
fluidx3d: FP32-FP16S | - | 7786
lulesh-cl | - | 9189.9160
luxmark: GPU - Hotel | - | 22310
luxmark: CPU+GPU - Hotel | - | 22294
luxmark: GPU - Microphone | - | 75029
luxmark: GPU - Luxball HDR | - | 99385
luxmark: CPU+GPU - Microphone | - | 75300
luxmark: CPU+GPU - Luxball HDR | - | 99792
rodinia: OpenCL Myocyte | - | 19.916
rodinia: OpenCL Leukocyte | - | 2.393
rodinia: OpenCL Particle Filter | - | 2.929
shoc: OpenCL - S3D | - | 422.946
shoc: OpenCL - Triad | - | 25.9744
shoc: OpenCL - FFT SP | - | 1812.90
shoc: OpenCL - MD5 Hash | - | 57.0867
shoc: OpenCL - Reduction | - | 946.679
shoc: OpenCL - GEMM SGEMM_N | - | 16861.6
shoc: OpenCL - Max SP Flops | - | 53452.7
shoc: OpenCL - Bus Speed Download | - | 26.8876
shoc: OpenCL - Bus Speed Readback | - | 26.3975
shoc: OpenCL - Texture Read Bandwidth | - | 2970.92
smallpt-gpu: GPU - 7680 x 2160 - Caustic | - | 1684649225
smallpt-gpu: GPU - 7680 x 2160 - Cornell | - | 1684649363
smallpt-gpu: GPU - 7680 x 2160 - Caustic3 | - | 1684649501
viennacl: CPU BLAS - sCOPY | 179 | 103
viennacl: CPU BLAS - sAXPY | 289 | 164
viennacl: CPU BLAS - sDOT | 287 | 178
viennacl: CPU BLAS - dCOPY | 63.5 | 36.0
viennacl: CPU BLAS - dAXPY | 95.4 | 54.4
viennacl: CPU BLAS - dDOT | 99.2 | 59.9
viennacl: CPU BLAS - dGEMV-N | 115 | 73.3
viennacl: CPU BLAS - dGEMV-T | 132 | 78.8
viennacl: CPU BLAS - dGEMM-NN | 64.1 | 64.4
viennacl: CPU BLAS - dGEMM-NT | 61.6 | 61.9
viennacl: CPU BLAS - dGEMM-TN | 69.9 | 70.3
viennacl: CPU BLAS - dGEMM-TT | 66.2 | 66.2
viennacl: OpenCL BLAS - sCOPY | - | 367
viennacl: OpenCL BLAS - sAXPY | - | 483
viennacl: OpenCL BLAS - sDOT | - | 412
viennacl: OpenCL BLAS - dCOPY | - | 524
viennacl: OpenCL BLAS - dAXPY | - | 605
viennacl: OpenCL BLAS - dDOT | - | 596
viennacl: OpenCL BLAS - dGEMV-N | - | 219
viennacl: OpenCL BLAS - dGEMV-T | - | 427
viennacl: OpenCL BLAS - dGEMM-NN | - | 743
viennacl: OpenCL BLAS - dGEMM-NT | - | 766
viennacl: OpenCL BLAS - dGEMM-TN | - | 801
viennacl: OpenCL BLAS - dGEMM-TT | - | 817
xsbench-cl | - | -

cl-mem

Benchmark: Copy

cl-mem 2017-01-13, Benchmark: Copy (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 374.9 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 620.7 (SE +/- 0.22, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 520.5 (SE +/- 1.33, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Kernel Latency

clpeak 1.1.2, OpenCL Test: Kernel Latency (us, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 3.89 (SE +/- 0.05, N = 15)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

clpeak 1.1.2, OpenCL Test: Integer Compute (GIOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 23782.67 (SE +/- 61.13, N = 3)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer 24-bit Compute

clpeak 1.1.2, OpenCL Test: Integer 24-bit Compute (GIOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 23875.52 (SE +/- 101.49, N = 3)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak 1.1.2, OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 577.23 (SE +/- 3.45, N = 3)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Compute

clpeak 1.1.2, OpenCL Test: Double-Precision Compute (GFLOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 817.20 (SE +/- 1.38, N = 3)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Compute

clpeak 1.1.2, OpenCL Test: Single-Precision Compute (GFLOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 46337.02 (SE +/- 82.14, N = 3)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

clpeak 1.1.2, OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 11.41 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

clpeak 1.1.2, OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 12.63 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3

Darktable

Test: Boat - Acceleration: OpenCL

Darktable 3.8.1, Test: Boat - Acceleration: OpenCL (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1.187 (SE +/- 0.006, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 2.758 (SE +/- 0.022, N = 3)

Darktable

Test: Masskrug - Acceleration: OpenCL

Darktable 3.8.1, Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1.864 (SE +/- 0.008, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 3.027 (SE +/- 0.007, N = 3)

Darktable

Test: Server Rack - Acceleration: OpenCL

Darktable 3.8.1, Test: Server Rack - Acceleration: OpenCL (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 0.111 (SE +/- 0.000, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 0.121 (SE +/- 0.002, N = 3)

Darktable

Test: Server Room - Acceleration: OpenCL

Darktable 3.8.1, Test: Server Room - Acceleration: OpenCL (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 0.647 (SE +/- 0.003, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 2.504 (SE +/- 0.006, N = 3)
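
As a reading aid for the four Darktable results, the gap between the two runs can be worked out directly from the reported mean times. The following is a minimal Python sketch and not part of the Phoronix Test Suite export: the names `darktable_seconds` and `ratios` are illustrative, and the timings are copied from the Darktable charts in this result file. Because fewer seconds is better, a ratio above 1 means the 4080 run finished faster.

```python
# Mean Darktable 3.8.1 OpenCL times in seconds, copied from this result file.
# The variable names are illustrative and do not come from the export.
darktable_seconds = {
    "Boat": {"4080": 1.187, "7900XTX": 2.758},
    "Masskrug": {"4080": 1.864, "7900XTX": 3.027},
    "Server Rack": {"4080": 0.111, "7900XTX": 0.121},
    "Server Room": {"4080": 0.647, "7900XTX": 2.504},
}

# Fewer seconds is better, so (7900XTX time / 4080 time) > 1
# means the 4080 run completed the test faster.
ratios = {
    test: times["7900XTX"] / times["4080"]
    for test, times in darktable_seconds.items()
}

for test, ratio in ratios.items():
    print(f"{test}: 7900XTX time / 4080 time = {ratio:.2f}")
```

The ratios only compare the reported means; they do not account for the SE values listed with each result.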

FluidX3D

Test: FP32-FP32

FluidX3D 2.3, Test: FP32-FP32 (MLUPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 3772 (SE +/- 4.37, N = 3)

FluidX3D

Test: FP32-FP16C

FluidX3D 2.3, Test: FP32-FP16C (MLUPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 7705 (SE +/- 1.00, N = 3)

FluidX3D

Test: FP32-FP16S

FluidX3D 2.3, Test: FP32-FP16S (MLUPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 7786 (SE +/- 0.67, N = 3)

Lulesh OpenCL

Lulesh OpenCL 2017-07-06 (z/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 9189.92 (SE +/- 33.49, N = 3)
1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.1, OpenCL Device: GPU - Scene: Hotel (Score, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 22310 (SE +/- 37.00, N = 3)

LuxMark

OpenCL Device: CPU+GPU - Scene: Hotel

LuxMark 3.1, OpenCL Device: CPU+GPU - Scene: Hotel (Score, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 22294 (SE +/- 11.26, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.1, OpenCL Device: GPU - Scene: Microphone (Score, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 75029 (SE +/- 252.33, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.1, OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 99385 (SE +/- 399.21, N = 3)

LuxMark

OpenCL Device: CPU+GPU - Scene: Microphone

LuxMark 3.1, OpenCL Device: CPU+GPU - Scene: Microphone (Score, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 75300 (SE +/- 12.53, N = 3)

LuxMark

OpenCL Device: CPU+GPU - Scene: Luxball HDR

LuxMark 3.1, OpenCL Device: CPU+GPU - Scene: Luxball HDR (Score, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 99792 (SE +/- 6.36, N = 3)

Rodinia

Test: OpenCL Myocyte

Rodinia 3.1, Test: OpenCL Myocyte (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 19.92 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

Rodinia 3.1, Test: OpenCL Leukocyte (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 2.393 (SE +/- 0.030, N = 12)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

Rodinia 3.1, Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 2.929 (SE +/- 0.028, N = 7)
1. (CXX) g++ options: -O2 -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: S3D (GFLOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 422.95 (SE +/- 0.21, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 25.97 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1812.90 (SE +/- 2.20, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 57.09 (SE +/- 0.32, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Reduction (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 946.68 (SE +/- 8.26, N = 8)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 16861.6 (SE +/- 152.29, N = 15)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 53452.7 (SE +/- 92.32, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 26.89 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 26.40 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 2970.92 (SE +/- 2.71, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SmallPT GPU

OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Caustic

SmallPT GPU 1.6pts1, OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Caustic (Samples/sec, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1684649225 (SE +/- 25.69, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Cornell

SmallPT GPU 1.6pts1, OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Cornell (Samples/sec, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1684649363 (SE +/- 25.12, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Caustic3

SmallPT GPU 1.6pts1, OpenCL Device: GPU - Resolution: 7680 x 2160 - Scene: Caustic3 (Samples/sec, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 1684649501 (SE +/- 25.40, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
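
The three SmallPT GPU values (1684649225, 1684649363, 1684649501) are ten-digit integers spaced exactly 138 apart, which is unusual for a throughput figure. One possible reading, offered here purely as an assumption that the export itself does not state, is that a Unix timestamp was recorded in place of a samples/sec figure. The Python sketch below only decodes the reported numbers as seconds since the Unix epoch so the reader can judge; the names `reported` and `decoded` are illustrative.

```python
from datetime import datetime, timezone

# SmallPT GPU values as reported in this result file, per scene.
# The dictionary names are illustrative, not part of the export.
reported = {
    "Caustic": 1684649225,
    "Cornell": 1684649363,
    "Caustic3": 1684649501,
}

# ASSUMPTION: interpret each value as seconds since the Unix epoch (UTC).
decoded = {
    scene: datetime.fromtimestamp(value, tz=timezone.utc)
    for scene, value in reported.items()
}

for scene, when in decoded.items():
    print(f"{scene}: {reported[scene]} -> {when.isoformat()}")

# Gap between consecutive scenes, in the same unit as the reported values.
values = list(reported.values())
gaps = [later - earlier for earlier, later in zip(values, values[1:])]
print("gaps:", gaps)
```

Decoded this way, all three values fall on 2023-05-21. If that reading is correct, these three entries should not be treated as samples/sec measurements; the export alone cannot confirm it either way.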

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1, Test: CPU BLAS - sCOPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 179 (SE +/- 0.58, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 103 (SE +/- 1.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1, Test: CPU BLAS - sAXPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 289 (SE +/- 0.33, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 164 (SE +/- 1.20, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1, Test: CPU BLAS - sDOT (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 287 (SE +/- 0.00, N = 2)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 178 (SE +/- 0.67, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1, Test: CPU BLAS - dCOPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 63.5 (SE +/- 0.09, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 36.0 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1, Test: CPU BLAS - dAXPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 95.4 (SE +/- 0.12, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 54.4 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1, Test: CPU BLAS - dDOT (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 99.2 (SE +/- 0.19, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 59.9 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1, Test: CPU BLAS - dGEMV-N (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 115.0 (SE +/- 0.67, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 73.3 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1, Test: CPU BLAS - dGEMV-T (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 132.0 (SE +/- 0.33, N = 3)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 78.8 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 64.4 (SE +/- 0.20, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 64.1 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 61.9 (SE +/- 0.19, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 61.6 (SE +/- 0.17, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 70.3 (SE +/- 0.17, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 69.9 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 66.2 (SE +/- 0.17, N = 3)
Ubuntu 22.04.02 LTS 7900X 7900XTX opencl: 66.2 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
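
The twelve ViennaCL CPU BLAS results fall into two groups: the tests reported in GB/s differ noticeably between the two runs, while the dGEMM tests reported in GFLOPs/s are nearly identical. The Python sketch below computes the ratio of the 7900XTX run to the 4080 run for each test so that the split is easy to see. The names `cpu_blas`, `gbps_tests`, `gemm_tests` and `ratios` are illustrative, the values are copied from the CPU BLAS charts in this result file, and the sketch makes no claim about why the two groups behave differently.

```python
# ViennaCL 1.7.1 CPU BLAS means from this result file as
# (7900XTX run, 4080 run). More is better for every test.
# All names here are illustrative, not part of the export.
cpu_blas = {
    "sCOPY": (179, 103),
    "sAXPY": (289, 164),
    "sDOT": (287, 178),
    "dCOPY": (63.5, 36.0),
    "dAXPY": (95.4, 54.4),
    "dDOT": (99.2, 59.9),
    "dGEMV-N": (115.0, 73.3),
    "dGEMV-T": (132.0, 78.8),
    "dGEMM-NN": (64.1, 64.4),
    "dGEMM-NT": (61.6, 61.9),
    "dGEMM-TN": (69.9, 70.3),
    "dGEMM-TT": (66.2, 66.2),
}

# The dGEMM tests are reported in GFLOPs/s; the rest are in GB/s.
gemm_tests = [name for name in cpu_blas if name.startswith("dGEMM")]
gbps_tests = [name for name in cpu_blas if name not in gemm_tests]

# Ratio above 1 means the 7900XTX run reported the higher figure.
ratios = {name: xtx / rtx for name, (xtx, rtx) in cpu_blas.items()}

for name in gbps_tests + gemm_tests:
    unit = "GFLOPs/s" if name in gemm_tests else "GB/s"
    print(f"{name} ({unit}): 7900XTX run / 4080 run = {ratios[name]:.3f}")
```

Since these are CPU tests, the installed GPU is unlikely to be the direct cause of the gap. The system table lists 64GB of memory for the 7900XTX run and 32GB for the 4080 run, but this export does not establish whether that is related.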

ViennaCL

Test: OpenCL BLAS - sCOPY

ViennaCL 1.7.1, Test: OpenCL BLAS - sCOPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 367 (SE +/- 1.53, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

ViennaCL 1.7.1, Test: OpenCL BLAS - sAXPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 483 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

ViennaCL 1.7.1, Test: OpenCL BLAS - sDOT (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 412 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

ViennaCL 1.7.1, Test: OpenCL BLAS - dCOPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 524 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

ViennaCL 1.7.1, Test: OpenCL BLAS - dAXPY (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 605 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

ViennaCL 1.7.1, Test: OpenCL BLAS - dDOT (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 596 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMV-N (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 219 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMV-T (GB/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 427 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 743 (SE +/- 1.53, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 766 (SE +/- 1.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 801 (SE +/- 1.20, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
Ubuntu 22.04.02 LTS 7900X 4080 opencl: 817 (SE +/- 1.53, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5