nvrun1
2 x AMD EPYC 7763 64-Core testing with a Supermicro H12DSG-O-CPU (2.4 BIOS) and NVIDIA GA102GL [RTX A6000] 48GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2411062-NE-NVRUN146498&grt .
nvrun1 system details (result identifier: NVIDIA GA102GL)

Processor: 2 x AMD EPYC 7763 64-Core @ 2.45GHz (128 Cores / 256 Threads)
Motherboard: Supermicro H12DSG-O-CPU (2.4 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16 x GB DDR4-3200MT/s 18ASF2G72PDZ-3G2R1
Disk: 2 x 960GB SAMSUNG MZQL2960HCJR-00A07 + 2 x 7682GB SAMSUNG MZQL27T6HBLA-00A07 + 2 x 3841GB SAMSUNG MZ7L33T8
Graphics: NVIDIA GA102GL [RTX A6000] 48GB
Audio: NVIDIA GA102 HD Audio
Network: Intel 10-Gigabit X540-AT2 + 2 x Intel I350
OS: Ubuntu 22.04
Kernel: 5.15.0-124-generic (x86_64)
Display Server: X Server
Display Driver: NVIDIA
OpenCL: OpenCL 3.0 CUDA 12.7.33
Vulkan: 1.3.289
Compiler: GCC 11.4.0 + CUDA 12.6
File-System: btrfs
Screen Resolution: 1024x768

Notes:
- Transparent Huge Pages: madvise
- Compiler configuration: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0xa0011d5
- BAR1 / Visible vRAM Size: 256 MiB
- vBIOS Version: 94.02.5c.00.02
- Security mitigations:
  gather_data_sampling: Not affected
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  mmio_stale_data: Not affected
  reg_file_data_sampling: Not affected
  retbleed: Not affected
  spec_rstack_overflow: Mitigation of safe RET
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
  srbds: Not affected
  tsx_async_abort: Not affected
nvrun1 results summary (result identifier: NVIDIA GA102GL)

mixbench: NVIDIA CUDA - Integer: 17371.68 GIOPS
mixbench: NVIDIA CUDA - Half Precision: 34767.31 GFLOPS
mixbench: NVIDIA CUDA - Double Precision: 524.19 GFLOPS
mixbench: NVIDIA CUDA - Single Precision: 33891.37 GFLOPS
Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Integer
GIOPS, More Is Better
NVIDIA GA102GL: 17371.68 (SE +/- 11.96, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Half Precision
GFLOPS, More Is Better
NVIDIA GA102GL: 34767.31 (SE +/- 20.76, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Double Precision
GFLOPS, More Is Better
NVIDIA GA102GL: 524.19 (SE +/- 4.50, N = 15)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Single Precision
GFLOPS, More Is Better
NVIDIA GA102GL: 33891.37 (SE +/- 68.19, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
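
The SE and N figures reported with each Mixbench result can be put on a common scale for comparison. The sketch below is illustrative only and is not part of the exported result. It copies the four means, standard errors and run counts from the results above and computes the standard error as a percentage of the mean. It also derives an implied run-to-run standard deviation, assuming the reported SE is the sample standard deviation divided by sqrt(N); that definition is an assumption here, not something the export states. The function names are hypothetical.

```python
import math

# (mean, standard error, number of runs), copied from the Mixbench results above
results = {
    "Integer (GIOPS)": (17371.68, 11.96, 3),
    "Half Precision (GFLOPS)": (34767.31, 20.76, 3),
    "Double Precision (GFLOPS)": (524.19, 4.50, 15),
    "Single Precision (GFLOPS)": (33891.37, 68.19, 3),
}


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def implied_std_dev(se, n):
    """Run-to-run standard deviation, ASSUMING SE = s / sqrt(N)."""
    return se * math.sqrt(n)


if __name__ == "__main__":
    for name, (mean, se, n) in results.items():
        rel = relative_se_percent(mean, se)
        sd = implied_std_dev(se, n)
        print(f"{name}: relative SE {rel:.2f}%, implied std dev {sd:.2f} (N = {n})")
```

Under that assumption, the double-precision result shows the largest relative standard error of the four (a little under 1% of the mean) despite using N = 15 runs rather than 3.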
Phoronix Test Suite v10.8.5