satty Intel Xeon Platinum 8490H testing with a Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite. a: Processor: Intel Xeon Platinum 8490H @ 3.50GHz (60 Cores / 120 Threads), Motherboard: Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 3 x 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Network: 4 x Intel E810-C for QSFP OS: Ubuntu 22.04, Kernel: 5.15.0-47-generic (x86_64), Desktop: GNOME Shell 42.4, Display Server: X Server 1.21.1.3, Vulkan: 1.2.204, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 1024x768 b: Processor: Intel Xeon Platinum 8490H @ 3.50GHz (60 Cores / 120 Threads), Motherboard: Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 3 x 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Network: 4 x Intel E810-C for QSFP OS: Ubuntu 22.04, Kernel: 5.15.0-47-generic (x86_64), Desktop: GNOME Shell 42.4, Display Server: X Server 1.21.1.3, Vulkan: 1.2.204, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 1600x1200 libxsmm 2-1.17-3645 M N K: 128 GFLOPS/s > Higher Is Better a . 1741.8 |=================================================================== b . 1691.1 |================================================================= libxsmm 2-1.17-3645 M N K: 256 GFLOPS/s > Higher Is Better a . 883.5 |==================================================================== b . 793.5 |============================================================= libxsmm 2-1.17-3645 M N K: 32 GFLOPS/s > Higher Is Better a . 496.0 |==================================================================== b . 486.4 |=================================================================== libxsmm 2-1.17-3645 M N K: 64 GFLOPS/s > Higher Is Better a . 978.8 |==================================================================== b . 960.3 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 232.00 |=================================================================== b . 233.51 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 110.36 |=================================================================== b . 108.38 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 GFLOP/s > Higher Is Better a . 99.54 |==================================================================== b . 98.79 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 336.17 |=================================================================== b . 337.50 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 255.35 |=================================================================== b . 246.11 |================================================================= HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 GFLOP/s > Higher Is Better a . 177.48 |=================================================================== b . 174.66 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 132.73 |=================================================================== b . 130.66 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 49.00 |==================================================================== b . 48.81 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 GFLOP/s > Higher Is Better a . 53.12 |==================================================================== b . 52.99 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 147.84 |=================================================================== b . 148.33 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 109.37 |=================================================================== b . 108.23 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 GFLOP/s > Higher Is Better a . 98.90 |==================================================================== b . 98.16 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 232.79 |=================================================================== b . 230.65 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 97.29 |==================================================================== b . 96.09 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 GFLOP/s > Higher Is Better a . 95.09 |==================================================================== b . 94.46 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 232.16 |=================================================================== b . 228.74 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 263.23 |=================================================================== b . 251.98 |================================================================ HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 GFLOP/s > Higher Is Better a . 183.40 |=================================================================== b . 180.79 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 93.93 |==================================================================== b . 92.04 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 49.29 |==================================================================== b . 49.32 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 GFLOP/s > Higher Is Better a . 52.95 |==================================================================== b . 52.71 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 164.54 |=================================================================== b . 161.21 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 105.98 |=================================================================== b . 106.15 |=================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 GFLOP/s > Higher Is Better a . 101.54 |=================================================================== b . 100.72 |================================================================== Palabos 2.3 Grid Size: 100 Mega Site Updates Per Second > Higher Is Better a . 314.53 |=================================================================== b . 308.95 |================================================================== Palabos 2.3 Grid Size: 400 Mega Site Updates Per Second > Higher Is Better a . 333.86 |=================================================================== b . 334.05 |=================================================================== Palabos 2.3 Grid Size: 500 Mega Site Updates Per Second > Higher Is Better a . 346.12 |=================================================================== b . 346.20 |=================================================================== Palabos 2.3 Grid Size: 1000 Mega Site Updates Per Second > Higher Is Better a . 391.95 |=================================================================== b . 389.11 |=================================================================== Palabos 2.3 Grid Size: 4000 Mega Site Updates Per Second > Higher Is Better