Benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2402096-NE-ZLUDA405505

ZLUDA Radeon Benchmarks - Phoronix Test Suite
HTML result view exported from: https://openbenchmarking.org/result/2402096-NE-ZLUDA405505&rdt&export=pdf&gru .
System Details

The RX 6800 entry lists the full system; for the other configurations the export lists only the fields shown.

RX 6800
  Processor: AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)
  Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
  Chipset: AMD Device 14d8
  Memory: 2 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G
  Disk: 2000GB Samsung SSD 980 PRO 2TB
  Graphics: AMD Radeon RX 6800 16GB (2475/1000MHz)
  Audio: AMD Navi 21 HDMI Audio
  Monitor: DELL U2723QE
  Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
  OS: Ubuntu 22.04
  Kernel: 6.2.0-26-generic (x86_64)
  Desktop: GNOME Shell 42.9
  Display Server: X Server 1.21.1.4 + Wayland
  OpenGL: 4.6 Mesa 23.0.4-0ubuntu1~22.04.1 (LLVM 15.0.7 DRM 3.54)
  OpenCL: OpenCL 2.1 AMD-APP (3590.0)
  Vulkan: 1.3.238
  Compiler: GCC 11.4.0
  File-System: ext4
  Screen Resolution: 3840x2160

RX 7900 XT
  Graphics: AMD Radeon RX 7900 XT 20GB (2025/1249MHz)
  Audio: AMD Device ab30

RX 6800 XT
  Graphics: AMD Radeon RX 6800 XT 16GB (2575/1000MHz)
  Audio: AMD Navi 21 HDMI Audio

RX 7700 XT
  Graphics: XFX AMD Radeon RX 7700 XT 12GB (2276/1124MHz)
  Audio: AMD Device ab30

RX 7900 XTX
  Graphics: AMD Radeon RX 7900 XTX 24GB (2304/1249MHz)

RTX 3080
  Graphics: NVIDIA GeForce RTX 3080 10GB
  Audio: NVIDIA GA102 HD Audio
  Display Server: X Server 1.21.1.4
  Display Driver: NVIDIA 550.40.07
  OpenGL: 4.6.0
  OpenCL: OpenCL 3.0 CUDA 12.4.74
  Vulkan: 1.3.271

RTX 4080
  Graphics: NVIDIA GeForce RTX 4080 16GB
  Audio: NVIDIA Device 22bb

RTX 4070 SUPER
  Graphics: NVIDIA GeForce RTX 4070 SUPER 12GB
  Audio: NVIDIA Device 22bc

Kernel Details
  - Transparent Huge Pages: madvise

Processor Details
  - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
  - CPU Microcode: 0xa601203

Graphics Details
  - RX 6800: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
  - RX 7900 XT: BAR1 / Visible vRAM Size: 20464 MB - vBIOS Version: 113-D70401-00
  - RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101
  - RX 7700 XT: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-EXT90040-100
  - RX 7900 XTX: BAR1 / Visible vRAM Size: 24560 MB - vBIOS Version: 113-D7020100-102
  - RTX 3080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07
  - RTX 4080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
  - RTX 4070 SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.69.00.01

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - mmio_stale_data: Not affected
  - retbleed: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
  - srbds: Not affected
  - tsx_async_abort: Not affected

OpenCL Details
  - RTX 3080: GPU Compute Cores: 8704
  - RTX 4080: GPU Compute Cores: 9728
  - RTX 4070 SUPER: GPU Compute Cores: 7168
Result Overview

Test                                          RX 6800  RX 7900 XT  RX 6800 XT  RX 7700 XT  RX 7900 XTX  RTX 3080  RTX 4080  RTX 4070 SUPER
namd-cuda: ATPase Simulation - 327,506 Atoms  0.13652  0.09004     -           -           0.09031      0.08498   0.08105   0.08435
blender: BMW27 - NVIDIA CUDA                  25.81    11.42       19.09       20.99       10.05        12.68     7.82      10.17
blender: Classroom - NVIDIA CUDA              36.88    21.03       31.82       33.05       18.44        25.71     14.50     19.38
blender: Fishy Cat - NVIDIA CUDA              38.95    20.25       32.68       30.32       18.04        24.05     15.61     20.12
blender: Barbershop - NVIDIA CUDA             147.09   81.98       122.46      -           -            97.87     60.01     78.93
blender: Pabellon Barcelona - NVIDIA CUDA     82.67    45.13       68.42       -           39.00        59.02     32.31     43.29
blender: BMW27 - Radeon HIP                   23.45    14.88       19.01       -           12.91        -         -         -
blender: Classroom - Radeon HIP               39.82    24.25       33.47       -           20.89        -         -         -

A "-" marks a test with no result for that configuration.
NAMD CUDA 2.14
ATPase Simulation - 327,506 Atoms
days/ns, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          0.13652   0.00117   3
RX 7900 XT       0.09004   0.00055   3
RX 7900 XTX      0.09031   0.00096   5
RTX 3080         0.08498   0.00017   3
RTX 4080         0.08105   0.00046   3
RTX 4070 SUPER   0.08435   0.00007   3

Blender 4.0
Blend File: BMW27 - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          25.81     3.44      15
RX 7900 XT       11.42     0.00      3
RX 6800 XT       19.09     0.02      3
RX 7700 XT       20.99     3.60      15
RX 7900 XTX      10.05     0.03      3
RTX 3080         12.68     0.02      3
RTX 4080         7.82      0.04      3
RTX 4070 SUPER   10.17     0.02      3

Blender 4.0
Blend File: Classroom - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          36.88     0.07      3
RX 7900 XT       21.03     0.02      3
RX 6800 XT       31.82     0.02      3
RX 7700 XT       33.05     0.00      3
RX 7900 XTX      18.44     0.03      3
RTX 3080         25.71     0.03      3
RTX 4080         14.50     0.01      3
RTX 4070 SUPER   19.38     0.02      3

Blender 4.0
Blend File: Fishy Cat - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          38.95     0.04      3
RX 7900 XT       20.25     0.00      3
RX 6800 XT       32.68     0.02      3
RX 7700 XT       30.32     0.01      3
RX 7900 XTX      18.04     0.03      3
RTX 3080         24.05     0.02      3
RTX 4080         15.61     0.00      3
RTX 4070 SUPER   20.12     0.01      3

Blender 4.0
Blend File: Barbershop - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          147.09    0.05      3
RX 7900 XT       81.98     0.01      2
RX 6800 XT       122.46    0.03      3
RTX 3080         97.87     0.05      3
RTX 4080         60.01     0.06      3
RTX 4070 SUPER   78.93     0.03      3

Blender 4.0
Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          82.67     0.02      3
RX 7900 XT       45.13     0.05      3
RX 6800 XT       68.42     0.03      3
RX 7900 XTX      39.00     0.06      3
RTX 3080         59.02     0.03      3
RTX 4080         32.31     0.01      3
RTX 4070 SUPER   43.29     0.01      3

Blender 4.0
Blend File: BMW27 - Compute: Radeon HIP
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          23.45     0.01      3
RX 7900 XT       14.88     0.03      3
RX 6800 XT       19.01     0.01      3
RX 7900 XTX      12.91     0.01      3

Blender 4.0
Blend File: Classroom - Compute: Radeon HIP
Seconds, Fewer Is Better

GPU              Result    SE +/-    N
RX 6800          39.82     0.06      3
RX 7900 XT       24.25     0.01      3
RX 6800 XT       33.47     0.03      3
RX 7900 XTX      20.89     0.06      3
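
The BMW27 and Classroom scenes were rendered on the Radeon cards with both the "NVIDIA CUDA" and the "Radeon HIP" compute back-ends, so the two code paths can be compared per GPU. The following is a minimal Python sketch of that arithmetic, not part of the result file. It assumes the "NVIDIA CUDA" rows for Radeon cards are the CUDA path running through ZLUDA, as the result title suggests. The names cuda_seconds, hip_seconds, hip_to_cuda_ratio and ratio_table are hypothetical, introduced here for illustration; the numbers are copied from the results above.

```python
# Blender 4.0 render times in seconds (fewer is better), copied from the
# result tables above. Only GPUs with both a CUDA and a HIP result are listed.
cuda_seconds = {
    "BMW27": {"RX 6800": 25.81, "RX 7900 XT": 11.42,
              "RX 6800 XT": 19.09, "RX 7900 XTX": 10.05},
    "Classroom": {"RX 6800": 36.88, "RX 7900 XT": 21.03,
                  "RX 6800 XT": 31.82, "RX 7900 XTX": 18.44},
}
hip_seconds = {
    "BMW27": {"RX 6800": 23.45, "RX 7900 XT": 14.88,
              "RX 6800 XT": 19.01, "RX 7900 XTX": 12.91},
    "Classroom": {"RX 6800": 39.82, "RX 7900 XT": 24.25,
                  "RX 6800 XT": 33.47, "RX 7900 XTX": 20.89},
}


def hip_to_cuda_ratio(scene, gpu):
    """Return HIP time divided by CUDA time for one scene and GPU.

    A value above 1.0 means the CUDA path finished sooner than HIP;
    a value below 1.0 means HIP finished sooner.
    """
    return hip_seconds[scene][gpu] / cuda_seconds[scene][gpu]


def ratio_table():
    """Build a nested dict {scene: {gpu: ratio}} rounded to three decimals."""
    return {
        scene: {gpu: round(hip_to_cuda_ratio(scene, gpu), 3) for gpu in gpus}
        for scene, gpus in hip_seconds.items()
    }


if __name__ == "__main__":
    for scene, row in ratio_table().items():
        for gpu, ratio in row.items():
            print(f"{scene:10s} {gpu:12s} HIP/CUDA = {ratio:.3f}")
```

When reading the ratios, note that the RX 6800 BMW27 CUDA result carries SE +/- 3.44 over 15 runs, so its ratio is far less certain than the others, which have standard errors of 0.07 seconds or less.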
Phoronix Test Suite v10.8.4