clpeak bench

Intel Xeon E5-2696 v4 testing with a ASRock X99 WS (P3.50 BIOS) and Gigabyte NVIDIA GeForce RTX 3080 10GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2401319-NE-CLPEAKBEN96
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
Gigabyte NVIDIA GeForce RTX 3080
January 31
  4 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


clpeak benchOpenBenchmarking.orgPhoronix Test SuiteIntel Xeon E5-2696 v4 @ 3.70GHz (22 Cores / 44 Threads)ASRock X99 WS (P3.50 BIOS)Intel Xeon E7 v4/Xeon8 x 16 GB DDR4-2400MT/s Samsung M393A2G40EB1-CRC500GB SHGP31-500GMGigabyte NVIDIA GeForce RTX 3080 10GBRealtek ALC1150DELL P2715QIntel I217-LM + Intel I210Ubuntu 22.046.5.0-15-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.4NVIDIA 535.154.054.6.0OpenCL 3.0 CUDA 12.2.1481.3.242GCC 11.4.0ext42160x3840ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionClpeak Bench BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0xb000040- BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.26.08.0d- GPU Compute Cores: 8704- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

clpeak benchclpeak: Kernel Latencyclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferGigabyte NVIDIA GeForce RTX 30803.9614808.8914965.52663.38528.2828771.4612.739.18OpenBenchmarking.org

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyGigabyte NVIDIA GeForce RTX 30800.8911.7822.6733.5644.455SE +/- 0.02, N = 33.961. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeGigabyte NVIDIA GeForce RTX 30803K6K9K12K15KSE +/- 68.39, N = 314808.891. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeGigabyte NVIDIA GeForce RTX 30803K6K9K12K15KSE +/- 166.30, N = 414965.521. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthGigabyte NVIDIA GeForce RTX 3080140280420560700SE +/- 0.09, N = 3663.381. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeGigabyte NVIDIA GeForce RTX 3080110220330440550SE +/- 1.14, N = 3528.281. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeGigabyte NVIDIA GeForce RTX 30806K12K18K24K30KSE +/- 2.43, N = 328771.461. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferGigabyte NVIDIA GeForce RTX 30803691215SE +/- 0.01, N = 312.731. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferGigabyte NVIDIA GeForce RTX 30803691215SE +/- 0.11, N = 39.181. (CXX) g++ options: -O3