3090-clpeak AMD Ryzen Threadripper 1950X 16-Core testing with a ASRock X399 Taichi (P3.90 BIOS) and NVIDIA GeForce RTX 3090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310094-NE-3090CLPEA36&grs .
3090-clpeak Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 3090-clpeak AMD Ryzen Threadripper 1950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASRock X399 Taichi (P3.90 BIOS) AMD 17h 64GB 1024GB ADATA SX8200PNP + 1000GB KINGSTON SA2000M81000G + Samsung SSD 970 EVO Plus 500GB + 256GB SAMSUNG MZVLW256HEHP-000L7 + 2000GB Samsung SSD 970 EVO Plus 2TB NVIDIA GeForce RTX 3090 24GB NVIDIA GA102 HD Audio C27JG5x 2 x Intel I211 + Intel Dual Band-AC 3168NGW Ubuntu 22.04 6.2.0-34-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.4 NVIDIA 535.113.01 4.6.0 OpenCL 3.0 CUDA 12.2.146 1.3.242 GCC 11.4.0 ext4 2560x1440 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8001137 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.59.00.42 - GPU Compute Cores: 10496 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
3090-clpeak clpeak: Transfer Bandwidth enqueueWriteBuffer clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Single-Precision Compute clpeak: Double-Precision Compute clpeak: Global Memory Bandwidth clpeak: Integer 24-bit Compute clpeak: Integer Compute clpeak: Kernel Latency 3090-clpeak 12.76 7.32 34505.80 635.74 812.86 17703.89 17411.94 5.32 OpenBenchmarking.org
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer 3090-clpeak 3 6 9 12 15 SE +/- 0.00, N = 3 12.76 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer 3090-clpeak 2 4 6 8 10 SE +/- 0.02, N = 3 7.32 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Compute 3090-clpeak 7K 14K 21K 28K 35K SE +/- 247.00, N = 3 34505.80 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute 3090-clpeak 140 280 420 560 700 SE +/- 1.98, N = 3 635.74 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth 3090-clpeak 200 400 600 800 1000 SE +/- 2.10, N = 3 812.86 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer 24-bit Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute 3090-clpeak 4K 8K 12K 16K 20K SE +/- 181.86, N = 3 17703.89 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute 3090-clpeak 4K 8K 12K 16K 20K SE +/- 50.71, N = 3 17411.94 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak 1.1.2 OpenCL Test: Kernel Latency 3090-clpeak 1.197 2.394 3.591 4.788 5.985 SE +/- 0.04, N = 15 5.32 1. (CXX) g++ options: -O3
Phoronix Test Suite v10.8.5