peak AMD Ryzen 9 7900X 12-Core testing with a ASUS TUF GAMING X670E-PLUS (1223 BIOS) and MSI NVIDIA GeForce RTX 3060 Ti 8GB on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2312016-NE-PEAK7141641&grr .
peak Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Compiler File-System Screen Resolution peak_1 AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads) ASUS TUF GAMING X670E-PLUS (1223 BIOS) AMD Device 14d8 32GB 1000GB Samsung SSD 990 PRO 1TB + 2000GB Samsung SSD 980 PRO 2TB MSI NVIDIA GeForce RTX 3060 Ti 8GB NVIDIA GA104 HD Audio U32R59x Realtek RTL8125 2.5GbE Ubuntu 23.10 6.5.0-13-generic (x86_64) GNOME Shell 45.1 X Server NVIDIA 525.147.05 4.6.0 OpenCL 3.0 CUDA 12.0.151 GCC 13.2.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203 - BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.6b.00.a0 - GPU Compute Cores: 4864 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
peak clpeak: Transfer Bandwidth enqueueWriteBuffer clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Double-Precision Compute clpeak: Global Memory Bandwidth clpeak: Single-Precision Compute clpeak: Integer 24-bit Compute clpeak: Integer Compute clpeak: Kernel Latency peak_1 19.47 17.29 291.11 391.98 15937.30 8180.20 8196.97 4.14 OpenBenchmarking.org
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer peak_1 5 10 15 20 25 SE +/- 0.02, N = 3 19.47 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer peak_1 4 8 12 16 20 SE +/- 0.13, N = 3 17.29 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute peak_1 60 120 180 240 300 SE +/- 0.03, N = 3 291.11 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth peak_1 90 180 270 360 450 SE +/- 0.03, N = 3 391.98 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Compute peak_1 3K 6K 9K 12K 15K SE +/- 38.98, N = 3 15937.30 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer 24-bit Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute peak_1 2K 4K 6K 8K 10K SE +/- 25.73, N = 3 8180.20 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute peak_1 2K 4K 6K 8K 10K SE +/- 80.77, N = 3 8196.97 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak 1.1.2 OpenCL Test: Kernel Latency peak_1 0.9315 1.863 2.7945 3.726 4.6575 SE +/- 0.02, N = 3 4.14 1. (CXX) g++ options: -O3
Phoronix Test Suite v10.8.5