new stuff Apple M2 testing with a Apple MacBook Air (13 h M2 2022) and llvmpipe on Arch rolling via the Phoronix Test Suite. a: Processor: Apple M2 @ 2.42GHz (4 Cores / 8 Threads), Motherboard: Apple MacBook Air (13 h M2 2022), Chipset: Apple Silicon, Memory: 8GB, Disk: 251GB APPLE SSD AP0256Z + 2 x 0GB APPLE SSD AP0256Z, Graphics: llvmpipe, Network: Broadcom Device 4433 + Broadcom BRCM4387 Bluetooth OS: Arch rolling, Kernel: 6.3.0-asahi-6-1-ARCH (aarch64), Desktop: KDE Plasma 5.27.5, Display Server: X Server 1.21.1.8, OpenGL: 4.5 Mesa 23.0.3 (LLVM 15.0.7 128 bits), Compiler: GCC 12.1.0 + Clang 15.0.7, File-System: ext4, Screen Resolution: 2560x1600 b: Processor: Apple M2 @ 2.42GHz (4 Cores / 8 Threads), Motherboard: Apple MacBook Air (13 h M2 2022), Chipset: Apple Silicon, Memory: 8GB, Disk: 251GB APPLE SSD AP0256Z + 2 x 0GB APPLE SSD AP0256Z, Graphics: llvmpipe, Network: Broadcom Device 4433 + Broadcom BRCM4387 Bluetooth OS: Arch rolling, Kernel: 6.3.0-asahi-6-1-ARCH (aarch64), Desktop: KDE Plasma 5.27.5, Display Server: X Server 1.21.1.8, OpenGL: 4.5 Mesa 23.0.3 (LLVM 15.0.7 128 bits), Compiler: GCC 12.1.0 + Clang 15.0.7, File-System: ext4, Screen Resolution: 2560x1600 c: Processor: Apple M2 @ 2.42GHz (4 Cores / 8 Threads), Motherboard: Apple MacBook Air (13 h M2 2022), Chipset: Apple Silicon, Memory: 8GB, Disk: 251GB APPLE SSD AP0256Z + 2 x 0GB APPLE SSD AP0256Z, Graphics: llvmpipe, Network: Broadcom Device 4433 + Broadcom BRCM4387 Bluetooth OS: Arch rolling, Kernel: 6.3.0-asahi-6-1-ARCH (aarch64), Desktop: KDE Plasma 5.27.5, Display Server: X Server 1.21.1.8, OpenGL: 4.5 Mesa 23.0.3 (LLVM 15.0.7 128 bits), Compiler: GCC 12.1.0 + Clang 15.0.7, File-System: ext4, Screen Resolution: 2560x1600 QMCPACK 3.16 Input: simple-H2O Total Execution Time - Seconds < Lower Is Better a . 31.65 |==================================================================== b . 24.61 |===================================================== c . 25.29 |====================================================== Stress-NG 0.15.10 Test: Function Call Bogo Ops/s > Higher Is Better a . 14573.26 |======================================================== a . 14944.20 |========================================================= b . 16963.18 |================================================================= Stress-NG 0.15.10 Test: SENDFILE Bogo Ops/s > Higher Is Better a . 561001.69 |======================================================= a . 577567.57 |========================================================= b . 651207.81 |================================================================ QMCPACK 3.16 Input: Li2_STO_ae Total Execution Time - Seconds < Lower Is Better a . 673.57 |=================================================================== b . 583.64 |========================================================== c . 602.89 |============================================================ Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 32 samples/s > Higher Is Better a . 193560667 |======================================================== b . 222810000 |================================================================ Stress-NG 0.15.10 Test: Memory Copying Bogo Ops/s > Higher Is Better a . 2186.34 |========================================================== a . 2239.30 |=========================================================== b . 2489.80 |================================================================== Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 32 samples/s > Higher Is Better a . 151962667 |========================================================= b . 170690000 |================================================================ c . 170520000 |================================================================ Stress-NG 0.15.10 Test: Wide Vector Math Bogo Ops/s > Higher Is Better a . 134118.46 |========================================================= a . 133805.81 |========================================================= b . 150029.77 |================================================================ Stress-NG 0.15.10 Test: Glibc Qsort Data Sorting Bogo Ops/s > Higher Is Better a . 123.24 |============================================================ a . 123.59 |============================================================ b . 137.66 |=================================================================== Stress-NG 0.15.10 Test: Fused Multiply-Add Bogo Ops/s > Higher Is Better a . 8204160.15 |======================================================== a . 8292871.43 |========================================================= b . 9154152.72 |=============================================================== Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 Input: Gas HII40 Seconds < Lower Is Better a . 28.09 |==================================================================== b . 25.30 |============================================================= c . 25.70 |============================================================== QMCPACK 3.16 Input: FeCO6_b3lyp_gms Total Execution Time - Seconds < Lower Is Better a . 224.55 |=================================================================== b . 202.68 |============================================================ c . 205.77 |============================================================= Stress-NG 0.15.10 Test: Context Switching Bogo Ops/s > Higher Is Better a . 1631840.55 |========================================================= a . 1641110.35 |========================================================= b . 1799353.76 |=============================================================== Stress-NG 0.15.10 Test: Vector Floating Point Bogo Ops/s > Higher Is Better a . 10658.50 |=========================================================== a . 10677.45 |=========================================================== b . 11720.45 |================================================================= Stress-NG 0.15.10 Test: Glibc C String Functions Bogo Ops/s > Higher Is Better a . 2186759.52 |========================================================== a . 2172245.93 |========================================================== b . 2365946.27 |=============================================================== Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 Input: Dust 2D tau100.0 Seconds < Lower Is Better a . 224.32 |=================================================================== b . 218.25 |================================================================= c . 206.80 |============================================================== Stress-NG 0.15.10 Test: Socket Activity Bogo Ops/s > Higher Is Better a . 4555.91 |================================================================ a . 4416.07 |============================================================== b . 4732.69 |================================================================== Stress-NG 0.15.10 Test: Vector Shuffle Bogo Ops/s > Higher Is Better a . 5424.22 |============================================================== a . 5503.75 |=============================================================== b . 5777.27 |================================================================== Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better a . 123490000 |============================================================= b . 129430000 |================================================================ Stress-NG 0.15.10 Test: CPU Cache Bogo Ops/s > Higher Is Better a . 1505499.01 |============================================================== a . 1498847.29 |============================================================= b . 1539109.56 |=============================================================== Stress-NG 0.15.10 Test: Malloc Bogo Ops/s > Higher Is Better a . 7331849.00 |============================================================= a . 7500169.53 |=============================================================== b . 7519020.96 |=============================================================== Stress-NG 0.15.10 Test: Pipe Bogo Ops/s > Higher Is Better a . 3273224.86 |============================================================== a . 3352013.11 |=============================================================== b . 3315715.43 |============================================================== Stress-NG 0.15.10 Test: Matrix 3D Math Bogo Ops/s > Higher Is Better a . 1502.72 |================================================================ a . 1538.05 |================================================================== b . 1510.48 |================================================================= Stress-NG 0.15.10 Test: IO_uring Bogo Ops/s > Higher Is Better a . 202926.30 |================================================================ a . 203685.29 |================================================================ b . 199030.94 |=============================================================== Stress-NG 0.15.10 Test: Semaphores Bogo Ops/s > Higher Is Better a . 12263477.55 |============================================================= a . 12255874.78 |============================================================= b . 12519037.43 |============================================================== Stress-NG 0.15.10 Test: Atomic Bogo Ops/s > Higher Is Better a . 219.58 |=================================================================== a . 218.20 |=================================================================== b . 215.67 |================================================================== Stress-NG 0.15.10 Test: Mutex Bogo Ops/s > Higher Is Better a . 2348077.86 |=============================================================== a . 2336991.86 |=============================================================== b . 2320821.60 |============================================================== Stress-NG 0.15.10 Test: Floating Point Bogo Ops/s > Higher Is Better a . 1895.17 |================================================================= a . 1903.08 |================================================================== b . 1915.81 |================================================================== Stress-NG 0.15.10 Test: Pthread Bogo Ops/s > Higher Is Better a . 97030.30 |================================================================ a . 97673.45 |================================================================= b . 97985.05 |================================================================= HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 30.39 |==================================================================== b . 30.30 |==================================================================== c . 30.11 |=================================================================== Stress-NG 0.15.10 Test: CPU Stress Bogo Ops/s > Higher Is Better a . 2316.08 |================================================================= a . 2330.80 |================================================================== b . 2334.18 |================================================================== Stress-NG 0.15.10 Test: Zlib Bogo Ops/s > Higher Is Better a . 370.42 |=================================================================== a . 367.86 |=================================================================== b . 369.64 |=================================================================== Stress-NG 0.15.10 Test: Cloning Bogo Ops/s > Higher Is Better a . 415.06 |=================================================================== a . 413.10 |=================================================================== b . 413.63 |=================================================================== Stress-NG 0.15.10 Test: Poll Bogo Ops/s > Higher Is Better a . 498126.68 |================================================================ a . 500333.58 |================================================================ b . 500127.10 |================================================================ HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 15.37 |==================================================================== b . 15.30 |==================================================================== c . 15.30 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 9.25296 |================================================================== b . 9.29368 |================================================================== c . 9.27947 |================================================================== Stress-NG 0.15.10 Test: MMAP Bogo Ops/s > Higher Is Better a . 181.26 |=================================================================== a . 180.50 |=================================================================== b . 181.04 |=================================================================== Stress-NG 0.15.10 Test: MEMFD Bogo Ops/s > Higher Is Better a . 337.10 |=================================================================== a . 336.69 |=================================================================== b . 335.73 |=================================================================== Stress-NG 0.15.10 Test: System V Message Passing Bogo Ops/s > Higher Is Better a . 8003395.90 |=============================================================== a . 7994886.68 |=============================================================== b . 7972859.60 |=============================================================== Stress-NG 0.15.10 Test: Matrix Math Bogo Ops/s > Higher Is Better a . 28788.80 |================================================================= a . 28785.99 |================================================================= b . 28680.50 |================================================================= Stress-NG 0.15.10 Test: AVL Tree Bogo Ops/s > Higher Is Better a . 58.74 |==================================================================== a . 58.79 |==================================================================== b . 58.57 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 16.56 |==================================================================== b . 16.54 |==================================================================== c . 16.49 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 12.01 |==================================================================== b . 12.06 |==================================================================== c . 12.01 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 6.51491 |================================================================== b . 6.52483 |================================================================== c . 6.53833 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 12.54 |==================================================================== b . 12.52 |==================================================================== c . 12.50 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 GFLOP/s > Higher Is Better a . 24.42 |==================================================================== b . 24.47 |==================================================================== c . 24.39 |==================================================================== Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 512 samples/s > Higher Is Better a . 1816667 |================================================================== b . 1811700 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 7.41321 |================================================================== b . 7.39435 |================================================================== c . 7.40031 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 12.36 |==================================================================== b . 12.38 |==================================================================== c . 12.37 |==================================================================== Stress-NG 0.15.10 Test: Crypto Bogo Ops/s > Higher Is Better a . 15660.68 |================================================================= a . 15694.58 |================================================================= b . 15678.78 |================================================================= Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better a . 46837333 |================================================================= b . 46766000 |================================================================= c . 46863000 |================================================================= Stress-NG 0.15.10 Test: Forking Bogo Ops/s > Higher Is Better a . 20947.11 |================================================================= a . 20977.45 |================================================================= b . 20984.93 |================================================================= HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 12.89 |==================================================================== b . 12.88 |==================================================================== c . 12.90 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 GFLOP/s > Higher Is Better a . 6.81731 |================================================================== b . 6.80673 |================================================================== c . 6.81684 |================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 4.84819 |================================================================== b . 4.84075 |================================================================== c . 4.84551 |================================================================== High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 GFLOP/s > Higher Is Better a . 5.91198 |================================================================== b . 5.90448 |================================================================== c . 5.91284 |================================================================== Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 32 samples/s > Higher Is Better a . 89599667 |================================================================= b . 89679000 |================================================================= c . 89584000 |================================================================= Stress-NG 0.15.10 Test: Hash Bogo Ops/s > Higher Is Better a . 1031331.61 |=============================================================== a . 1031467.06 |=============================================================== b . 1032298.82 |=============================================================== Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 512 samples/s > Higher Is Better a . 3539900 |================================================================== b . 3543000 |================================================================== Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 32 samples/s > Higher Is Better a . 45898000 |================================================================= b . 45909000 |================================================================= c . 45872000 |================================================================= HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 GFLOP/s > Higher Is Better a . 9.36400 |================================================================== b . 9.36231 |================================================================== c . 9.36845 |================================================================== Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better a . 24009333 |================================================================= b . 24023000 |================================================================= c . 24021000 |================================================================= Stress-NG 0.15.10 Test: Vector Math Bogo Ops/s > Higher Is Better a . 34179.57 |================================================================= a . 34161.15 |================================================================= b . 34177.46 |================================================================= HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 15.19 |==================================================================== b . 15.19 |==================================================================== c . 15.19 |==================================================================== HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 GFLOP/s > Higher Is Better a . 7.71602 |================================================================== b . 7.71682 |================================================================== c . 7.71863 |================================================================== Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 samples/s > Higher Is Better a . 11523333 |================================================================= b . 11520000 |================================================================= QMCPACK 3.16 Input: FeCO6_b3lyp_gms Total Execution Time - Seconds < Lower Is Better a . 722.61 |=================================================================== b . 722.60 |=================================================================== c . 722.58 |=================================================================== Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better a . 90442667 |================================================================= b . 90443000 |================================================================= c . 90441000 |================================================================= Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 512 samples/s > Higher Is Better a . 6839467 |================================================================== b . 6839600 |================================================================== Stress-NG 0.15.10 Test: Futex Bogo Ops/s > Higher Is Better a . 854204.69 |=============================================================== a . 871823.11 |================================================================ b . 832545.15 |============================================================= Stress-NG 0.15.10 Test: NUMA Bogo Ops/s > Higher Is Better a . 1685.59 |================================================================== a . 1574.19 |============================================================== b . 1678.28 |==================================================================