7601_projectpysx_opencl 2 x AMD EPYC 7601 32-Core testing with a Supermicro Super Server H11DSi v2.00 (2.1 BIOS) and NVIDIA GeForce GTX 1070 8GB on Ubuntu 24.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2412212-NE-7601PROJE69&grr .
7601_projectpysx_opencl Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Compiler File-System Screen Resolution 7601_projet_opencl 2 x AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores / 128 Threads) Supermicro Super Server H11DSi v2.00 (2.1 BIOS) AMD 17h 2 x 16GB DDR4-2400MT/s 1024GB P20A E1TB NVIDIA GeForce GTX 1070 8GB NVIDIA GP104 HD Audio SL17-01 2 x Intel I350 Ubuntu 24.04 6.8.0-50-generic (x86_64) GNOME Shell 46.0 X Server 1.21.1.11 NVIDIA 565.77 4.6.0 OpenCL 3.0 LINUX + OpenCL 3.0 CUDA 12.7.33 GCC 13.3.0 + CUDA 12.4 ext4 1920x1080 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fG75Ri/gcc-13-13.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fG75Ri/gcc-13-13.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x800126f - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 86.04.50.00.80 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
7601_projectpysx_opencl opencl-benchmark: Memory Bandwidth Coalesced Write opencl-benchmark: Memory Bandwidth Coalesced Read opencl-benchmark: INT8 Compute opencl-benchmark: INT16 Compute opencl-benchmark: INT32 Compute opencl-benchmark: INT64 Compute opencl-benchmark: FP32 Compute opencl-benchmark: FP64 Compute 7601_projet_opencl 6.39 10.81 1.693 1.799 0.683 0.061 1.063 0.849 OpenBenchmarking.org
ProjectPhysX OpenCL-Benchmark Operation: Memory Bandwidth Coalesced Write OpenBenchmarking.org GB/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: Memory Bandwidth Coalesced Write 7601_projet_opencl 2 4 6 8 10 SE +/- 0.10, N = 3 6.39 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: Memory Bandwidth Coalesced Read OpenBenchmarking.org GB/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: Memory Bandwidth Coalesced Read 7601_projet_opencl 3 6 9 12 15 SE +/- 1.05, N = 3 10.81 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT8 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: INT8 Compute 7601_projet_opencl 0.3809 0.7618 1.1427 1.5236 1.9045 SE +/- 0.005, N = 3 1.693 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT16 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: INT16 Compute 7601_projet_opencl 0.4048 0.8096 1.2144 1.6192 2.024 SE +/- 0.010, N = 3 1.799 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT32 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: INT32 Compute 7601_projet_opencl 0.1537 0.3074 0.4611 0.6148 0.7685 SE +/- 0.000, N = 3 0.683 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT64 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: INT64 Compute 7601_projet_opencl 0.0137 0.0274 0.0411 0.0548 0.0685 SE +/- 0.000, N = 3 0.061 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: FP32 Compute OpenBenchmarking.org TFLOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: FP32 Compute 7601_projet_opencl 0.2392 0.4784 0.7176 0.9568 1.196 SE +/- 0.000, N = 3 1.063 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: FP64 Compute OpenBenchmarking.org TFLOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.6 Operation: FP64 Compute 7601_projet_opencl 0.191 0.382 0.573 0.764 0.955 SE +/- 0.003, N = 3 0.849 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
Phoronix Test Suite v10.8.5