AMD AOCC 3.1 Compiler Benchmarking AMD EPYC 7543 testing of AMD AOCC 3.1 compiler benchmarks by Michael Larabel. AOCC 3.0: Processor: AMD EPYC 7543 32-Core @ 2.80GHz (32 Cores / 64 Threads), Motherboard: TYAN S8036GM2NE-LE (V2.00.B21 BIOS), Chipset: AMD Starship/Matisse, Memory: 64GB, Disk: 1000GB Corsair Force MP600, Graphics: ASPEED, Monitor: VE228, Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server, Compiler: Clang 12.0.0, File-System: ext4, Screen Resolution: 1920x1080 AOCC 3.1: Processor: AMD EPYC 7543 32-Core @ 2.80GHz (32 Cores / 64 Threads), Motherboard: TYAN S8036GM2NE-LE (V2.00.B21 BIOS), Chipset: AMD Starship/Matisse, Memory: 64GB, Disk: 1000GB Corsair Force MP600, Graphics: ASPEED, Monitor: VE228, Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server, Compiler: Clang 12.0.0, File-System: ext4, Screen Resolution: 1920x1080 C-Blosc 2.0 Compressor: blosclz MB/s > Higher Is Better AOCC 3.0 . 25045.5 |=========================================================== AOCC 3.1 . 25069.6 |=========================================================== QuantLib 1.21 MFLOPS > Higher Is Better AOCC 3.0 . 2857.6 |============================================================ AOCC 3.1 . 2854.9 |============================================================ Etcpak 0.7 Configuration: DXT1 Mpx/s > Higher Is Better AOCC 3.0 . 2799.90 |========================================================= AOCC 3.1 . 2897.28 |=========================================================== Etcpak 0.7 Configuration: ETC2 Mpx/s > Higher Is Better AOCC 3.0 . 188.82 |======================================================= AOCC 3.1 . 207.55 |============================================================ LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed MB/s > Higher Is Better AOCC 3.0 . 10359.14 |======================================================= AOCC 3.1 . 10964.11 |========================================================== Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed MB/s > Higher Is Better AOCC 3.0 . 2752.3 |========================================================== AOCC 3.1 . 2852.4 |============================================================ Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better AOCC 3.0 . 42.4 |============================================================== AOCC 3.1 . 42.1 |============================================================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better AOCC 3.0 . 3270.0 |============================================================ AOCC 3.1 . 3264.4 |============================================================ JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 MP/s > Higher Is Better AOCC 3.0 . 9.01 |============================================================== AOCC 3.1 . 9.04 |============================================================== JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 MP/s > Higher Is Better AOCC 3.0 . 64.23 |=========================================================== AOCC 3.1 . 65.89 |============================================================= JPEG XL 0.3.3 Input: JPEG - Encode Speed: 8 MP/s > Higher Is Better AOCC 3.0 . 27.44 |=========================================================== AOCC 3.1 . 28.30 |============================================================= Botan 2.17.3 Test: AES-256 MiB/s > Higher Is Better AOCC 3.0 . 4869.92 |=========================================================== AOCC 3.1 . 4859.66 |=========================================================== Botan 2.17.3 Test: AES-256 - Decrypt MiB/s > Higher Is Better AOCC 3.0 . 4866.53 |=========================================================== AOCC 3.1 . 4876.96 |=========================================================== Botan 2.17.3 Test: Twofish MiB/s > Higher Is Better AOCC 3.0 . 332.45 |============================================================ AOCC 3.1 . 332.81 |============================================================ Botan 2.17.3 Test: Twofish - Decrypt MiB/s > Higher Is Better AOCC 3.0 . 339.97 |============================================================ AOCC 3.1 . 339.97 |============================================================ Botan 2.17.3 Test: Blowfish MiB/s > Higher Is Better AOCC 3.0 . 417.29 |============================================================ AOCC 3.1 . 417.05 |============================================================ Botan 2.17.3 Test: Blowfish - Decrypt MiB/s > Higher Is Better AOCC 3.0 . 404.41 |============================================================ AOCC 3.1 . 405.31 |============================================================ Botan 2.17.3 Test: CAST-256 MiB/s > Higher Is Better AOCC 3.0 . 134.38 |============================================================ AOCC 3.1 . 134.37 |============================================================ Botan 2.17.3 Test: CAST-256 - Decrypt MiB/s > Higher Is Better AOCC 3.0 . 137.88 |============================================================ AOCC 3.1 . 137.91 |============================================================ LibRaw 0.20 Post-Processing Benchmark Mpix/sec > Higher Is Better AOCC 3.0 . 44.54 |============================================================ AOCC 3.1 . 45.11 |============================================================= John The Ripper 1.9.0-jumbo-1 Test: Blowfish Real C/S > Higher Is Better AOCC 3.0 . 61893 |============================================================= AOCC 3.1 . 61842 |============================================================= John The Ripper 1.9.0-jumbo-1 Test: MD5 Real C/S > Higher Is Better AOCC 3.0 . 2237667 |=========================================================== AOCC 3.1 . 2240000 |=========================================================== GraphicsMagick 1.3.33 Operation: Rotate Iterations Per Minute > Higher Is Better AOCC 3.0 . 792 |=============================================================== AOCC 3.1 . 792 |=============================================================== GraphicsMagick 1.3.33 Operation: Sharpen Iterations Per Minute > Higher Is Better AOCC 3.0 . 357 |=============================================================== AOCC 3.1 . 357 |=============================================================== GraphicsMagick 1.3.33 Operation: Enhanced Iterations Per Minute > Higher Is Better AOCC 3.0 . 553 |================================================== AOCC 3.1 . 696 |=============================================================== GraphicsMagick 1.3.33 Operation: Noise-Gaussian Iterations Per Minute > Higher Is Better AOCC 3.0 . 422 |=============================================================== AOCC 3.1 . 420 |=============================================================== GraphicsMagick 1.3.33 Operation: HWB Color Space Iterations Per Minute > Higher Is Better AOCC 3.0 . 667 |============================================================== AOCC 3.1 . 677 |=============================================================== SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K Frames Per Second > Higher Is Better AOCC 3.1 . 1.987 |============================================================= SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K Frames Per Second > Higher Is Better AOCC 3.1 . 19.47 |============================================================= SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better AOCC 3.0 . 292.33 |========================================================== AOCC 3.1 . 303.75 |============================================================ SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better AOCC 3.0 . 399.61 |=========================================================== AOCC 3.1 . 408.51 |============================================================ SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better AOCC 3.0 . 234.66 |============================================================ AOCC 3.1 . 234.18 |============================================================ VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K Frames Per Second > Higher Is Better AOCC 3.0 . 15.61 |============================================================= AOCC 3.1 . 15.62 |============================================================= Himeno Benchmark 3.0 Poisson Pressure Solver MFLOPS > Higher Is Better AOCC 3.0 . 3779.48 |========================================================== AOCC 3.1 . 3819.99 |=========================================================== Stockfish 13 Total Time Nodes Per Second > Higher Is Better AOCC 3.0 . 87952290 |======================================================== AOCC 3.1 . 90448030 |========================================================== libavif avifenc 0.9.0 Encoder Speed: 2 Seconds < Lower Is Better AOCC 3.0 . 24.56 |============================================================ AOCC 3.1 . 24.77 |============================================================= libavif avifenc 0.9.0 Encoder Speed: 6 Seconds < Lower Is Better AOCC 3.0 . 9.731 |============================================================= AOCC 3.1 . 9.672 |============================================================= libavif avifenc 0.9.0 Encoder Speed: 10 Seconds < Lower Is Better AOCC 3.0 . 3.542 |============================================================= AOCC 3.1 . 3.497 |============================================================ libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Seconds < Lower Is Better AOCC 3.0 . 26.73 |============================================================= AOCC 3.1 . 26.67 |============================================================= libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Seconds < Lower Is Better AOCC 3.0 . 5.842 |============================================================= AOCC 3.1 . 5.800 |============================================================= POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better AOCC 3.0 . 17.01 |============================================================= AOCC 3.1 . 14.78 |===================================================== oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 1.39552 |=========================================================== AOCC 3.1 . 1.40164 |=========================================================== oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 4.97905 |========================================================== AOCC 3.1 . 5.07547 |=========================================================== oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 2.11715 |=========================================================== AOCC 3.1 . 1.40674 |======================================= oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 1.79719 |=========================================================== AOCC 3.1 . 1.80096 |=========================================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 3.08589 |========================================================== AOCC 3.1 . 3.13186 |=========================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 6386.24 |=========================================================== AOCC 3.1 . 6432.49 |=========================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 665.97 |============================================================ AOCC 3.1 . 668.79 |============================================================ oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU ms < Lower Is Better AOCC 3.0 . 0.434393 |========================================================== AOCC 3.1 . 0.435999 |========================================================== FLAC Audio Encoding 1.3.2 WAV To FLAC Seconds < Lower Is Better AOCC 3.0 . 8.697 |============================================================= AOCC 3.1 . 8.700 |============================================================= LAME MP3 Encoding 3.100 WAV To MP3 Seconds < Lower Is Better AOCC 3.0 . 7.877 |============================================================= AOCC 3.1 . 7.828 |============================================================= Ngspice 34 Circuit: C2670 Seconds < Lower Is Better AOCC 3.0 . 92.53 |============================================================= AOCC 3.1 . 91.52 |============================================================ Ngspice 34 Circuit: C7552 Seconds < Lower Is Better AOCC 3.0 . 83.12 |============================================================= AOCC 3.1 . 82.28 |============================================================ RNNoise 2020-06-28 Seconds < Lower Is Better AOCC 3.0 . 18.66 |============================================================= AOCC 3.1 . 18.73 |============================================================= Tachyon 0.99b6 Total Time Seconds < Lower Is Better AOCC 3.0 . 28.89 |============================================================= AOCC 3.1 . 28.66 |============================================================= Google SynthMark 20201109 Test: VoiceMark_100 Voices > Higher Is Better AOCC 3.0 . 622.23 |============================================================ AOCC 3.1 . 621.63 |============================================================ SecureMark 1.0.4 Benchmark: SecureMark-TLS marks > Higher Is Better AOCC 3.0 . 275488 |=========================================================== AOCC 3.1 . 279931 |============================================================ Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better AOCC 3.0 . 797476667 |======================================================== AOCC 3.1 . 806510000 |========================================================= Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better AOCC 3.0 . 1507633333 |======================================================= AOCC 3.1 . 1527933333 |======================================================== Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better AOCC 3.0 . 1891833333 |======================================================== AOCC 3.1 . 1894966667 |======================================================== FinanceBench 2016-07-25 Benchmark: Repo OpenMP ms < Lower Is Better AOCC 3.0 . 33706.02 |========================================================== AOCC 3.1 . 33763.58 |========================================================== FinanceBench 2016-07-25 Benchmark: Bonds OpenMP ms < Lower Is Better AOCC 3.0 . 51304.15 |========================================================== AOCC 3.1 . 51185.49 |========================================================== libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput Megapixels/sec > Higher Is Better AOCC 3.0 . 222.01 |============================================================ AOCC 3.1 . 220.67 |============================================================ ASTC Encoder 3.0 Preset: Exhaustive Seconds < Lower Is Better AOCC 3.0 . 21.95 |============================================================= AOCC 3.1 . 21.89 |============================================================= SQLite Speedtest 3.30 Timed Time - Size 1,000 Seconds < Lower Is Better AOCC 3.1 . 55.24 |============================================================= Google Draco 1.4.1 Model: Lion ms < Lower Is Better AOCC 3.0 . 5303 |============================================================== AOCC 3.1 . 4960 |========================================================== Google Draco 1.4.1 Model: Church Facade ms < Lower Is Better AOCC 3.0 . 7044 |============================================================== AOCC 3.1 . 6488 |========================================================= NCNN 20210525 Target: CPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better AOCC 3.0 . 5.79 |============================================================== AOCC 3.1 . 5.74 |============================================================= NCNN 20210525 Target: CPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better AOCC 3.0 . 5.73 |============================================================== AOCC 3.1 . 5.60 |============================================================= NCNN 20210525 Target: CPU - Model: vgg16 ms < Lower Is Better AOCC 3.0 . 61.79 |============================================================= AOCC 3.1 . 51.60 |=================================================== NCNN 20210525 Target: CPU - Model: resnet18 ms < Lower Is Better AOCC 3.0 . 12.21 |============================================================= AOCC 3.1 . 11.88 |=========================================================== TNN 0.3 Target: CPU - Model: SqueezeNet v2 ms < Lower Is Better AOCC 3.0 . 59.51 |============================================================ AOCC 3.1 . 60.66 |============================================================= TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 ms < Lower Is Better AOCC 3.0 . 262.36 |=========================================================== AOCC 3.1 . 266.24 |============================================================ Facebook RocksDB 6.22.1 Test: Update Random Op/s > Higher Is Better AOCC 3.0 . 820188 |=========================================================== AOCC 3.1 . 832651 |============================================================ Facebook RocksDB 6.22.1 Test: Read While Writing Op/s > Higher Is Better AOCC 3.0 . 6048186 |========================================================== AOCC 3.1 . 6135560 |=========================================================== Facebook RocksDB 6.22.1 Test: Read Random Write Random Op/s > Higher Is Better AOCC 3.0 . 2970829 |========================================================= AOCC 3.1 . 3059068 |=========================================================== ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better AOCC 3.0 . 474 |============================================================= AOCC 3.1 . 487 |=============================================================== ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better AOCC 3.0 . 86 |============================================================= AOCC 3.1 . 90 |================================================================ ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better AOCC 3.0 . 4759 |============================================================== AOCC 3.1 . 4568 |============================================================ WavPack Audio Encoding 5.3 WAV To WavPack Seconds < Lower Is Better AOCC 3.0 . 13.16 |============================================================= AOCC 3.1 . 13.18 |============================================================= GnuPG 2.2.27 2.7GB Sample File Encryption Seconds < Lower Is Better AOCC 3.0 . 67.74 |============================================================= AOCC 3.1 . 67.61 |=============================================================