Benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2402096-NE-ZLUDA405505 ZLUDA Radeon Benchmarks - Phoronix Test Suite ZLUDA Radeon Benchmarks Benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2402096-NE-ZLUDA405505&export=pdf&gru&sro&rro .
ZLUDA Radeon Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution RTX 3080 RTX 4070 SUPER RTX 4080 RX 6800 RX 6800 XT RX 7700 XT RX 7900 XT RX 7900 XTX AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads) ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) AMD Device 14d8 2 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G 2000GB Samsung SSD 980 PRO 2TB NVIDIA GeForce RTX 3080 10GB NVIDIA GA102 HD Audio DELL U2723QE Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411 Ubuntu 22.04 6.2.0-26-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.4 NVIDIA 550.40.07 4.6.0 OpenCL 3.0 CUDA 12.4.74 1.3.271 GCC 11.4.0 ext4 3840x2160 NVIDIA GeForce RTX 4070 SUPER 12GB NVIDIA Device 22bc NVIDIA GeForce RTX 4080 16GB NVIDIA Device 22bb AMD Radeon RX 6800 16GB (2475/1000MHz) AMD Navi 21 HDMI Audio X Server 1.21.1.4 + Wayland 4.6 Mesa 23.0.4-0ubuntu1~22.04.1 (LLVM 15.0.7 DRM 3.54) OpenCL 2.1 AMD-APP (3590.0) 1.3.238 AMD Radeon RX 6800 XT 16GB (2575/1000MHz) XFX AMD Radeon RX 7700 XT 12GB (2276/1124MHz) AMD Device ab30 AMD Radeon RX 7900 XT 20GB (2025/1249MHz) AMD Radeon RX 7900 XTX 24GB (2304/1249MHz) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203 Graphics Details - RTX 3080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07 - RTX 4070 SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.69.00.01 - RTX 4080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04 - RX 6800: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101 - RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101 - RX 7700 XT: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-EXT90040-100 - RX 7900 XT: BAR1 / Visible vRAM Size: 20464 MB - vBIOS Version: 113-D70401-00 - RX 7900 XTX: BAR1 / Visible vRAM Size: 24560 MB - vBIOS Version: 113-D7020100-102 OpenCL Details - RTX 3080: GPU Compute Cores: 8704 - RTX 4070 SUPER: GPU Compute Cores: 7168 - RTX 4080: GPU Compute Cores: 9728 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
ZLUDA Radeon Benchmarks namd-cuda: ATPase Simulation - 327,506 Atoms blender: BMW27 - NVIDIA CUDA blender: Classroom - NVIDIA CUDA blender: Fishy Cat - NVIDIA CUDA blender: Barbershop - NVIDIA CUDA blender: Pabellon Barcelona - NVIDIA CUDA blender: BMW27 - Radeon HIP blender: Classroom - Radeon HIP RTX 3080 RTX 4070 SUPER RTX 4080 RX 6800 RX 6800 XT RX 7700 XT RX 7900 XT RX 7900 XTX 0.08498 12.68 25.71 24.05 97.87 59.02 0.08435 10.17 19.38 20.12 78.93 43.29 0.08105 7.82 14.50 15.61 60.01 32.31 0.13652 25.81 36.88 38.95 147.09 82.67 23.45 39.82 19.09 31.82 32.68 122.46 68.42 19.01 33.47 20.99 33.05 30.32 0.09004 11.42 21.03 20.25 81.98 45.13 14.88 24.25 0.09031 10.05 18.44 18.04 39.00 12.91 20.89 OpenBenchmarking.org
NAMD CUDA ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms RX 7900 XTX RX 7900 XT RX 6800 RTX 4080 RTX 4070 SUPER RTX 3080 0.0307 0.0614 0.0921 0.1228 0.1535 SE +/- 0.00096, N = 5 SE +/- 0.00055, N = 3 SE +/- 0.00117, N = 3 SE +/- 0.00046, N = 3 SE +/- 0.00007, N = 3 SE +/- 0.00017, N = 3 0.09031 0.09004 0.13652 0.08105 0.08435 0.08498
Blender Blend File: BMW27 - Compute: NVIDIA CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: BMW27 - Compute: NVIDIA CUDA RX 7900 XTX RX 7900 XT RX 7700 XT RX 6800 XT RX 6800 RTX 4080 RTX 4070 SUPER RTX 3080 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 3.60, N = 15 SE +/- 0.02, N = 3 SE +/- 3.44, N = 15 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 10.05 11.42 20.99 19.09 25.81 7.82 10.17 12.68
Blender Blend File: Classroom - Compute: NVIDIA CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Classroom - Compute: NVIDIA CUDA RX 7900 XTX RX 7900 XT RX 7700 XT RX 6800 XT RX 6800 RTX 4080 RTX 4070 SUPER RTX 3080 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 18.44 21.03 33.05 31.82 36.88 14.50 19.38 25.71
Blender Blend File: Fishy Cat - Compute: NVIDIA CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Fishy Cat - Compute: NVIDIA CUDA RX 7900 XTX RX 7900 XT RX 7700 XT RX 6800 XT RX 6800 RTX 4080 RTX 4070 SUPER RTX 3080 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 18.04 20.25 30.32 32.68 38.95 15.61 20.12 24.05
Blender Blend File: Barbershop - Compute: NVIDIA CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Barbershop - Compute: NVIDIA CUDA RX 7900 XT RX 6800 XT RX 6800 RTX 4080 RTX 4070 SUPER RTX 3080 30 60 90 120 150 SE +/- 0.01, N = 2 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 81.98 122.46 147.09 60.01 78.93 97.87
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA RX 7900 XTX RX 7900 XT RX 6800 XT RX 6800 RTX 4080 RTX 4070 SUPER RTX 3080 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 39.00 45.13 68.42 82.67 32.31 43.29 59.02
Blender Blend File: BMW27 - Compute: Radeon HIP OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: BMW27 - Compute: Radeon HIP RX 7900 XTX RX 7900 XT RX 6800 XT RX 6800 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.91 14.88 19.01 23.45
Blender Blend File: Classroom - Compute: Radeon HIP OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Classroom - Compute: Radeon HIP RX 7900 XTX RX 7900 XT RX 6800 XT RX 6800 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 20.89 24.25 33.47 39.82
Phoronix Test Suite v10.8.4