satty
Tests for a future article. AMD Ryzen AI 9 365 testing with an ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 (UM5606WA.308 BIOS) and AMD Radeon 512MB on Ubuntu 24.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2411238-NE-SATTY938104&grs&sor .
satty: system details (runs a and b)
Processor: AMD Ryzen AI 9 365 @ 4.31GHz (10 Cores / 20 Threads)
Motherboard: ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 (UM5606WA.308 BIOS)
Chipset: AMD Device 1507
Memory: 4 x 6GB LPDDR5-7500MT/s Micron MT62F1536M32D4DS-026
Disk: 1024GB MTFDKBA1T0QFM-1BD1AABGB
Graphics: AMD Radeon 512MB
Audio: AMD Rembrandt Radeon HD Audio
Network: MEDIATEK Device 7925
OS: Ubuntu 24.10
Kernel: 6.12.0-rc7-phx-eraps (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.2.3-1ubuntu1 (LLVM 19.1.0 DRM 3.59)
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 2880x1800
Kernel Details
- amdgpu.dcdebugmask=0x600
- Transparent Huge Pages: madvise
Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details
- Scaling Governor: amd-pstate-epp powersave (Boost: Enabled EPP: balance_performance)
- Platform Profile: balanced
- CPU Microcode: 0xb204011
- ACPI Profile: balanced
Graphics Details
- BAR1 / Visible vRAM Size: 512 MB
Java Details
- OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)
Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; PBRSB-eIBRS: Not affected; BHI: Not affected; ERAPS hardware RSB flush
- srbds: Not affected
- tsx_async_abort: Not affected
satty: results overview ("-" marks a test with no result for that run)
Test                                                     a         b
renaissance: Savina Reactors.IO                          6475.4    8639.2
blender: Classroom - CPU-Only                            446.29    414.83
blender: Fishy Cat - CPU-Only                            224.53    214.69
vkpeak: int32-scalar                                     841.54    815.61
vkpeak: fp32-scalar                                      3323.53   3222.08
vkpeak: int16-vec4                                       5832.89   5668.59
renaissance: Gaussian Mixture Model                      3916.7    4018.9
renaissance: Genetic Algorithm Using Jenetics + Futures  1046.0    1072.1
vkpeak: fp32-vec4                                        2706.74   2641.07
renaissance: Random Forest                               600.5     586.0
renaissance: Finagle HTTP Requests                       2565.3    2624.4
primesieve: 1e12                                         17.542    17.935
blender: BMW27 - CPU-Only                                156.25    152.95
vkpeak: int16-scalar                                     2917.26   2862.04
vkpeak: fp16-scalar                                      3124.48   3071.59
renaissance: In-Memory Database Shootout                 4451.2    4523.0
renaissance: Apache Spark PageRank                       3001.1    2954.7
blender: Junkshop - CPU-Only                             204.83    201.68
vkpeak: int32-vec4                                       784.28    773.20
vkpeak: fp16-vec4                                        5740.07   5662.97
renaissance: Akka Unbalanced Cobwebbed Tree              5269.7    5335.9
renaissance: Apache Spark Bayes                          470.4     467.3
renaissance: ALS Movie Lens                              10965.2   11035.4
primesieve: 1e13                                         260.792   262.069
vkpeak: fp64-scalar                                      141.2     141.13
vkpeak: fp64-vec4                                        141.16    141.13
blender: Pabellon Barcelona - CPU-Only                   553.42    -
blender: Barbershop - CPU-Only                           1682.01   -
renaissance: Scala Dotty                                 490.7     538.0
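To compare the two runs in the overview above, one simple approach is the relative change of b against a, read together with each test's direction (fewer or more is better). The sketch below does this for three of the tests, with the values copied from the overview. The names `RESULTS`, `percent_change`, and `better_run` are illustrative helpers, not part of the Phoronix Test Suite.

```python
# Values copied from the results overview; the boolean is True when the
# test is labelled "Fewer Is Better" and False for "More Is Better".
RESULTS = {
    "renaissance: Savina Reactors.IO": (6475.4, 8639.2, True),   # ms
    "blender: Classroom - CPU-Only": (446.29, 414.83, True),     # seconds
    "vkpeak: fp32-scalar": (3323.53, 3222.08, False),            # GFLOPS
}


def percent_change(a: float, b: float) -> float:
    """Change of run b relative to run a, in percent."""
    return (b - a) / a * 100.0


def better_run(a: float, b: float, lower_is_better: bool) -> str:
    """Name of the run with the better score, or 'tie' if they are equal."""
    if a == b:
        return "tie"
    a_wins = (a < b) if lower_is_better else (a > b)
    return "a" if a_wins else "b"


for test, (a, b, lower_is_better) in RESULTS.items():
    delta = percent_change(a, b)
    winner = better_run(a, b, lower_is_better)
    print(f"{test}: b vs a = {delta:+.2f}%, better run: {winner}")
```

A raw percentage says nothing about run-to-run noise, so small gaps should be read against the SE figures reported with each test before drawing conclusions.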
Renaissance 0.16, Test: Savina Reactors.IO (ms, Fewer Is Better)
  a: 6475.4 (MIN: 6475.39 / MAX: 9563.93)
  b: 8639.2 (MIN: 7778.63 / MAX: 9255.52)
  SE +/- 67.05, N = 3
Blender 4.3, Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
  b: 414.83
  a: 446.29
  SE +/- 1.99, N = 3
Blender 4.3, Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
  b: 214.69
  a: 224.53
  SE +/- 2.63, N = 4
vkpeak 20240505, int32-scalar (GIOPS, More Is Better)
  a: 841.54
  b: 815.61
  SE +/- 17.67, N = 3
vkpeak 20240505, fp32-scalar (GFLOPS, More Is Better)
  a: 3323.53
  b: 3222.08
  SE +/- 43.70, N = 3
vkpeak 20240505, int16-vec4 (GIOPS, More Is Better)
  a: 5832.89
  b: 5668.59
  SE +/- 107.55, N = 3
Renaissance 0.16, Test: Gaussian Mixture Model (ms, Fewer Is Better)
  a: 3916.7 (MIN: 3761.78 / MAX: 4230.11)
  b: 4018.9 (MIN: 3769.46 / MAX: 4727.13)
  SE +/- 43.50, N = 5
Renaissance 0.16, Test: Genetic Algorithm Using Jenetics + Futures (ms, Fewer Is Better)
  a: 1046.0 (MIN: 1002.13 / MAX: 1099.2)
  b: 1072.1 (MIN: 1011.51 / MAX: 1107.8)
  SE +/- 6.78, N = 3
vkpeak 20240505, fp32-vec4 (GFLOPS, More Is Better)
  a: 2706.74
  b: 2641.07
  SE +/- 16.30, N = 3
Renaissance 0.16, Test: Random Forest (ms, Fewer Is Better)
  b: 586.0 (MIN: 462.27 / MAX: 735.73)
  a: 600.5 (MIN: 485.96 / MAX: 687.48)
  SE +/- 5.30, N = 15
Renaissance 0.16, Test: Finagle HTTP Requests (ms, Fewer Is Better)
  a: 2565.3 (MIN: 1909.64 / MAX: 2565.32)
  b: 2624.4 (MIN: 1971.05 / MAX: 2660.42)
  SE +/- 33.52, N = 3
Primesieve 12.6, Length: 1e12 (Seconds, Fewer Is Better)
  a: 17.54
  b: 17.94
  SE +/- 0.03, N = 3
  1. (CXX) g++ options: -O3
Blender 4.3, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
  b: 152.95
  a: 156.25
  SE +/- 1.02, N = 3
vkpeak 20240505, int16-scalar (GIOPS, More Is Better)
  a: 2917.26
  b: 2862.04
  SE +/- 31.89, N = 3
vkpeak 20240505, fp16-scalar (GFLOPS, More Is Better)
  a: 3124.48
  b: 3071.59
  SE +/- 14.05, N = 3
Renaissance 0.16, Test: In-Memory Database Shootout (ms, Fewer Is Better)
  a: 4451.2 (MIN: 3907.86 / MAX: 4865.95)
  b: 4523.0 (MIN: 3907.18 / MAX: 5166.19)
  SE +/- 52.90, N = 4
Renaissance 0.16, Test: Apache Spark PageRank (ms, Fewer Is Better)
  b: 2954.7 (MIN: 2649.65 / MAX: 3020.91)
  a: 3001.1 (MIN: 2757.14 / MAX: 3164.97)
  SE +/- 26.53, N = 3
Blender 4.3, Blend File: Junkshop - Compute: CPU-Only (Seconds, Fewer Is Better)
  b: 201.68
  a: 204.83
  SE +/- 1.63, N = 3
vkpeak 20240505, int32-vec4 (GIOPS, More Is Better)
  a: 784.28
  b: 773.20
  SE +/- 3.49, N = 3
vkpeak 20240505, fp16-vec4 (GFLOPS, More Is Better)
  a: 5740.07
  b: 5662.97
  SE +/- 13.38, N = 3
Renaissance 0.16, Test: Akka Unbalanced Cobwebbed Tree (ms, Fewer Is Better)
  a: 5269.7 (MIN: 5269.67 / MAX: 6807.28)
  b: 5335.9 (MIN: 5275.66 / MAX: 7161.99)
  SE +/- 41.51, N = 3
Renaissance 0.16, Test: Apache Spark Bayes (ms, Fewer Is Better)
  b: 467.3 (MIN: 431.04 / MAX: 540.41)
  a: 470.4 (MIN: 431.53 / MAX: 527.73)
  SE +/- 4.08, N = 3
Renaissance 0.16, Test: ALS Movie Lens (ms, Fewer Is Better)
  a: 10965.2 (MIN: 10354.76 / MAX: 11196.1)
  b: 11035.4 (MIN: 10460.76 / MAX: 11067.81)
  SE +/- 13.27, N = 3
Primesieve 12.6, Length: 1e13 (Seconds, Fewer Is Better)
  a: 260.79
  b: 262.07
  SE +/- 2.97, N = 3
  1. (CXX) g++ options: -O3
vkpeak 20240505, fp64-scalar (GFLOPS, More Is Better)
  a: 141.20
  b: 141.13
  SE +/- 0.00, N = 3
vkpeak 20240505, fp64-vec4 (GFLOPS, More Is Better)
  a: 141.16
  b: 141.13
  SE +/- 0.03, N = 3
Blender 4.3, Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
  a: 553.42
Blender 4.3, Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
  a: 1682.01
Renaissance 0.16, Test: Scala Dotty (ms, Fewer Is Better)
  a: 490.7 (MIN: 416.75 / MAX: 1036.31)
  b: 538.0 (MIN: 408.07 / MAX: 1153.92)
  SE +/- 9.06, N = 15
Phoronix Test Suite v10.8.5