clpeak-4

AMD EPYC 7F52 16-Core testing with a ASRockRack ROMED8-2T v1.03 (P3.80 BIOS) and llvmpipe on Debian 12 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2501215-NE-CLPEAK40616
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
gpu-4
January 21
  4 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


clpeak-4OpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 7F52 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASRockRack ROMED8-2T v1.03 (P3.80 BIOS)AMD Starship/Matisse8 x 32GB DDR4-3200MT/s Samsung M393A4K40EB3-CWE1000GB Samsung SSD 980 PRO 1TBllvmpipeNVIDIA GA102 HD AudioSDM-X932 x Intel X550Debian 126.1.0-30-amd64 (x86_64)Cinnamon 5.6.8X Server 1.21.1.7NVIDIA 565.57.014.5 Mesa 22.3.6 (LLVM 15.0.6 256 bits)OpenCL 3.0 CUDA 12.7.33GCC 12.2.0 + CUDA 12.6ext41280x1024ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionClpeak-4 BenchmarksSystem Logs- Transparent Huge Pages: always- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-bTRWOB/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-bTRWOB/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107c - BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.59.00.05- GPU Compute Cores: 10496- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

clpeak-4clpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Kernel Latencygpu-4838.5412.2215.91631.5834456.1317790.5217647.386.36OpenBenchmarking.org

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory Bandwidthgpu-42004006008001000SE +/- 0.06, N = 3838.541. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBuffergpu-43691215SE +/- 0.02, N = 312.221. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBuffergpu-448121620SE +/- 0.01, N = 315.911. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision Computegpu-4140280420560700SE +/- 1.81, N = 3631.581. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision Computegpu-47K14K21K28K35KSE +/- 66.06, N = 334456.131. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer Computegpu-44K8K12K16K20KSE +/- 176.84, N = 317790.521. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit Computegpu-44K8K12K16K20KSE +/- 7.95, N = 317647.381. (CXX) g++ options: -O3

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel Latencygpu-4246810SE +/- 0.00, N = 36.361. (CXX) g++ options: -O3