epyc

2 x AMD EPYC 9654 96-Core testing with a AMD Titanite_4G (RTI1007B BIOS) and ASPEED on Ubuntu 23.04 via the Phoronix Test Suite.

a:

  Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS, Graphics: ASPEED, Monitor: VGA HDMI, Network: Broadcom NetXtreme BCM5720 PCIe

  OS: Ubuntu 23.04, Kernel: 6.2.0-23-generic (x86_64), Desktop: GNOME Shell 44.0, Display Server: X Server 1.21.1.7, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 1920x1080

Laghos 3.1
Test: Triple Point Problem
Major Kernels Total Rate > Higher Is Better
a . 257.33 |===================================================================

Laghos 3.1
Test: Sedov Blast Wave, ube_922_hex.mesh
Major Kernels Total Rate > Higher Is Better
a . 500.25 |===================================================================

libxsmm 2-1.17-3645
M N K: 32
GFLOPS/s > Higher Is Better
a . 1711.6 |===================================================================

libxsmm 2-1.17-3645
M N K: 64
GFLOPS/s > Higher Is Better
a . 3012.5 |===================================================================

libxsmm 2-1.17-3645
M N K: 128
GFLOPS/s > Higher Is Better
a . 4822.3 |===================================================================

libxsmm 2-1.17-3645
M N K: 256
GFLOPS/s > Higher Is Better
a . 6067.4 |===================================================================

nekRS 23.0
Input: Kershaw
flops/rank > Higher Is Better
a . 8976140000 |===============================================================

nekRS 23.0
Input: TurboPipe Periodic
flops/rank > Higher Is Better
a . 5439060000 |===============================================================

RELION 4.0.1
Test: Basic - Device: CPU
Seconds < Lower Is Better
a . 131.79 |===================================================================

High Performance Conjugate Gradient 3.1
X Y Z: 104 104 104 - RT: 60
GFLOP/s > Higher Is Better
a . 61.81 |====================================================================

High Performance Conjugate Gradient 3.1
X Y Z: 144 144 144 - RT: 60
GFLOP/s > Higher Is Better
a . 44.53 |====================================================================

High Performance Conjugate Gradient 3.1
X Y Z: 160 160 160 - RT: 60
GFLOP/s > Higher Is Better
a . 44.06 |====================================================================

High Performance Conjugate Gradient 3.1
X Y Z: 192 192 192 - RT: 60
GFLOP/s > Higher Is Better
a . 43.98 |====================================================================

Kripke 1.2.6
Throughput FoM > Higher Is Better

OpenFOAM 10
Input: drivaerFastback, Small Mesh Size - Mesh Time
Seconds < Lower Is Better
a . 24.33 |====================================================================

OpenFOAM 10
Input: drivaerFastback, Small Mesh Size - Execution Time
Seconds < Lower Is Better
a . 19.49 |====================================================================

OpenFOAM 10
Input: drivaerFastback, Medium Mesh Size - Mesh Time
Seconds < Lower Is Better
a . 125.79 |===================================================================

OpenFOAM 10
Input: drivaerFastback, Medium Mesh Size - Execution Time
Seconds < Lower Is Better
a . 103.48 |===================================================================

Monte Carlo Simulations of Ionised Nebulae 2.02.73.3
Input: Gas HII40
Seconds < Lower Is Better
a . 12.76 |====================================================================

Monte Carlo Simulations of Ionised Nebulae 2.02.73.3
Input: Dust 2D tau100.0
Seconds < Lower Is Better
a . 85.69 |====================================================================

QMCPACK 3.16
Input: Li2_STO_ae
Total Execution Time - Seconds < Lower Is Better
a . 82.21 |====================================================================

SPECFEM3D 4.0
Model: Mount St. Helens
Seconds < Lower Is Better
a . 3.85004073 |===============================================================

QMCPACK 3.16
Input: simple-H2O
Total Execution Time - Seconds < Lower Is Better
a . 25.44 |====================================================================

QMCPACK 3.16
Input: FeCO6_b3lyp_gms
Total Execution Time - Seconds < Lower Is Better
a . 109.81 |===================================================================

GPAW 23.6
Input: Carbon Nanotube
Seconds < Lower Is Better
a . 20.30 |====================================================================

CP2K Molecular Dynamics 2023.1
Input: H20-64
Seconds < Lower Is Better
a . 20.81 |====================================================================

CP2K Molecular Dynamics 2023.1
Input: H2O-DFT-LS
Seconds < Lower Is Better
a . 2135 |=====================================================================

CP2K Molecular Dynamics 2023.1
Input: Fayalite-FIST
Seconds < Lower Is Better
a . 85.35 |====================================================================

Z3 Theorem Prover 4.12.1
SMT File: 1.smt2
Seconds < Lower Is Better
a . 26.35 |====================================================================

Z3 Theorem Prover 4.12.1
SMT File: 2.smt2
Seconds < Lower Is Better
a . 70.10 |====================================================================

SPECFEM3D 4.0
Model: Homogeneous Halfspace
Seconds < Lower Is Better
a . 5.041686571 |==============================================================

SPECFEM3D 4.0
Model: Water-layered Halfspace
Seconds < Lower Is Better
a . 8.473585088 |==============================================================

SPECFEM3D 4.0
Model: Tomographic Model
Seconds < Lower Is Better
a . 4.258437194 |==============================================================

SPECFEM3D 4.0
Model: Layered Halfspace
Seconds < Lower Is Better
a . 9.963704381 |==============================================================

Remhos 1.0
Test: Sample Remap Example
Seconds < Lower Is Better
a . 6.077 |====================================================================

QMCPACK 3.16
Input: FeCO6_b3lyp_gms
Total Execution Time - Seconds < Lower Is Better
a . 180.89 |===================================================================

SVT-AV1 1.6
Encoder Mode: Preset 4 - Input: Bosphorus 4K
Frames Per Second > Higher Is Better
a . 6.09 |=====================================================================

SVT-AV1 1.6
Encoder Mode: Preset 8 - Input: Bosphorus 4K
Frames Per Second > Higher Is Better
a . 94.11 |====================================================================

SVT-AV1 1.6
Encoder Mode: Preset 12 - Input: Bosphorus 4K
Frames Per Second > Higher Is Better
a . 218.40 |===================================================================

SVT-AV1 1.6
Encoder Mode: Preset 13 - Input: Bosphorus 4K
Frames Per Second > Higher Is Better
a . 219.87 |===================================================================

Blender 3.6
Blend File: BMW27 - Compute: CPU-Only
Seconds < Lower Is Better
a . 7.86 |=====================================================================

Blender 3.6
Blend File: Classroom - Compute: CPU-Only
Seconds < Lower Is Better
a . 18.08 |====================================================================

Blender 3.6
Blend File: Fishy Cat - Compute: CPU-Only
Seconds < Lower Is Better
a . 10.24 |====================================================================

Blender 3.6
Blend File: Barbershop - Compute: CPU-Only
Seconds < Lower Is Better
a . 72.04 |====================================================================

Blender 3.6
Blend File: Pabellon Barcelona - Compute: CPU-Only
Seconds < Lower Is Better
a . 22.6 |=====================================================================

Embree 4.1
Binary: Pathtracer - Model: Crown
Frames Per Second > Higher Is Better
a . 203.04 |===================================================================

Embree 4.1
Binary: Pathtracer ISPC - Model: Crown
Frames Per Second > Higher Is Better
a . 209.59 |===================================================================

Embree 4.1
Binary: Pathtracer - Model: Asian Dragon
Frames Per Second > Higher Is Better
a . 220.28 |===================================================================

Embree 4.1
Binary: Pathtracer - Model: Asian Dragon Obj
Frames Per Second > Higher Is Better
a . 195.61 |===================================================================

Embree 4.1
Binary: Pathtracer ISPC - Model: Asian Dragon
Frames Per Second > Higher Is Better
a . 241.20 |===================================================================

Embree 4.1
Binary: Pathtracer ISPC - Model: Asian Dragon Obj
Frames Per Second > Higher Is Better
a . 206.94 |===================================================================

OSPRay 2.12
Benchmark: particle_volume/ao/real_time
Items Per Second > Higher Is Better
a . 49.95 |====================================================================

OSPRay 2.12
Benchmark: particle_volume/scivis/real_time
Items Per Second > Higher Is Better
a . 49.75 |====================================================================

OSPRay 2.12
Benchmark: particle_volume/pathtracer/real_time
Items Per Second > Higher Is Better
a . 195.13 |===================================================================

OSPRay 2.12
Benchmark: gravity_spheres_volume/dim_512/ao/real_time
Items Per Second > Higher Is Better
a . 51.49 |====================================================================

OSPRay 2.12
Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
Items Per Second > Higher Is Better
a . 49.65 |====================================================================

OSPRay 2.12
Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time
Items Per Second > Higher Is Better
a . 31.81 |====================================================================

Liquid-DSP 1.6
Threads: 32 - Buffer Length: 256 - Filter Length: 32
samples/s > Higher Is Better
a . 1102500000 |===============================================================

Liquid-DSP 1.6
Threads: 32 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
a . 1323000000 |===============================================================

Liquid-DSP 1.6
Threads: 64 - Buffer Length: 256 - Filter Length: 32
samples/s > Higher Is Better
a . 2211100000 |===============================================================

Liquid-DSP 1.6
Threads: 64 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
a . 2544400000 |===============================================================

Liquid-DSP 1.6
Threads: 128 - Buffer Length: 256 - Filter Length: 32
samples/s > Higher Is Better
a . 4422000000 |===============================================================

Liquid-DSP 1.6
Threads: 128 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
a . 5010600000 |===============================================================

Liquid-DSP 1.6
Threads: 256 - Buffer Length: 256 - Filter Length: 32
samples/s > Higher Is Better
a . 8571500000 |===============================================================

Liquid-DSP 1.6
Threads: 256 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
a . 8855500000 |===============================================================

Liquid-DSP 1.6
Threads: 32 - Buffer Length: 256 - Filter Length: 512
samples/s > Higher Is Better
a . 423300000 |================================================================

Liquid-DSP 1.6
Threads: 384 - Buffer Length: 256 - Filter Length: 32
samples/s > Higher Is Better
a . 11901000000 |==============================================================

Liquid-DSP 1.6
Threads: 384 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
a . 10888000000 |==============================================================

Faiss 1.7.4
Test: demo_sift1M
Seconds < Lower Is Better

Faiss 1.7.4
Test: bench_polysemous_sift1m - PQ baseline
ms per query < Lower Is Better
a . 3.847 |====================================================================

Liquid-DSP 1.6
Threads: 64 - Buffer Length: 256 - Filter Length: 512
samples/s > Higher Is Better
a . 848760000 |================================================================

PETSc 3.19
Test: Streams
MB/s > Higher Is Better
a . 395951.00 |================================================================

Liquid-DSP 1.6
Threads: 128 - Buffer Length: 256 - Filter Length: 512
samples/s > Higher Is Better
a . 1698500000 |===============================================================

Liquid-DSP 1.6
Threads: 256 - Buffer Length: 256 - Filter Length: 512
samples/s > Higher Is Better
a . 2720300000 |===============================================================

Liquid-DSP 1.6
Threads: 384 - Buffer Length: 256 - Filter Length: 512
samples/s > Higher Is Better
a . 3043700000 |===============================================================

srsRAN Project 23.5
Test: Downlink Processor Benchmark
Mbps > Higher Is Better
a . 752.9 |====================================================================

srsRAN Project 23.5
Test: PUSCH Processor Benchmark, Throughput Total
Mbps > Higher Is Better
a . 39684.1 |==================================================================

srsRAN Project 23.5
Test: PUSCH Processor Benchmark, Throughput Thread
Mbps > Higher Is Better
a . 239.4 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 64
ms per query < Lower Is Better
a . 6.083 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 62
ms per query < Lower Is Better
a . 5.128 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 34
ms per query < Lower Is Better
a . 0.725 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 30
ms per query < Lower Is Better
a . 0.72 |=====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 42
ms per query < Lower Is Better
a . 0.786 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 38
ms per query < Lower Is Better
a . 0.739 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 46
ms per query < Lower Is Better
a . 0.912 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 50
ms per query < Lower Is Better
a . 1.236 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 54
ms per query < Lower Is Better
a . 1.924 |====================================================================

Faiss 1.7.4
Test: bench_polysemous_sift1m - Polysemous 58
ms per query < Lower Is Better
a . 3.194 |====================================================================