EPYC 7763 LLVM Clang Compiler Tests AMD EPYC 7763 64-Core testing with a Supermicro H12SSL-i v1.01 (2.0 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite. Clang 12.0: Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS), Chipset: AMD Starship/Matisse, Memory: 126GB, Disk: 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe OS: Ubuntu 20.04, Kernel: 5.12.0-051200rc6daily20210408-generic (x86_64) 20210407, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Compiler: Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73, File-System: ext4, Screen Resolution: 1024x768 Clang 11.0: Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS), Chipset: AMD Starship/Matisse, Memory: 126GB, Disk: 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe OS: Ubuntu 20.04, Kernel: 5.12.0-051200rc6daily20210408-generic (x86_64) 20210407, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Compiler: Clang 11.0.0-2~ubuntu20.04.1, File-System: ext4, Screen Resolution: 1024x768 Clang 12.0 LTO: Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS), Chipset: AMD Starship/Matisse, Memory: 126GB, Disk: 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe OS: Ubuntu 20.04, Kernel: 5.12.0-051200rc6daily20210408-generic (x86_64) 20210407, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Compiler: Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73, File-System: ext4, Screen Resolution: 1024x768 QuantLib 1.21 MFLOPS > Higher Is Better Clang 12.0 ..... 2653.8 |====================================================== Clang 11.0 ..... 2640.2 |====================================================== Clang 12.0 LTO . 2657.8 |====================================================== Etcpak 0.7 Configuration: DXT1 Mpx/s > Higher Is Better Clang 12.0 ..... 2718.53 |===================================================== Clang 11.0 ..... 1872.76 |==================================== Clang 12.0 LTO . 2719.99 |===================================================== Etcpak 0.7 Configuration: ETC1 Mpx/s > Higher Is Better Clang 12.0 ..... 284.64 |====================================================== Clang 11.0 ..... 205.07 |======================================= Clang 12.0 LTO . 284.76 |====================================================== Etcpak 0.7 Configuration: ETC2 Mpx/s > Higher Is Better Clang 12.0 ..... 202.09 |====================================================== Clang 11.0 ..... 168.82 |============================================= Clang 12.0 LTO . 202.10 |====================================================== toyBrot Fractal Generator 2020-11-18 Implementation: TBB ms < Lower Is Better Clang 12.0 ..... 6780 |====================================================== Clang 11.0 ..... 6247 |================================================= Clang 12.0 LTO . 7085 |======================================================== toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP ms < Lower Is Better Clang 12.0 . 7507 |============================================================ Clang 11.0 . 7029 |======================================================== toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks ms < Lower Is Better Clang 12.0 ..... 7437 |======================================================== Clang 11.0 ..... 6836 |=================================================== Clang 12.0 LTO . 7367 |======================================================= toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads ms < Lower Is Better Clang 12.0 ..... 7220 |======================================================== Clang 11.0 ..... 6395 |================================================== Clang 12.0 LTO . 7143 |======================================================= FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 Mflops > Higher Is Better Clang 12.0 . 13333 |=========================================================== Clang 11.0 . 13324 |=========================================================== FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 Mflops > Higher Is Better Clang 12.0 . 10805 |=========================================================== Clang 11.0 . 10564 |========================================================== FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 2048 Mflops > Higher Is Better Clang 12.0 . 10467.0 |========================================================= Clang 11.0 . 10004.2 |====================================================== FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 Mflops > Higher Is Better Clang 12.0 . 9862.0 |========================================================== Clang 11.0 . 9438.6 |======================================================== FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 Mflops > Higher Is Better Clang 12.0 . 9088.3 |========================================================== Clang 11.0 . 8809.6 |======================================================== FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 2048 Mflops > Higher Is Better Clang 12.0 . 7789.9 |========================================================= Clang 11.0 . 7878.5 |========================================================== FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Mflops > Higher Is Better Clang 12.0 . 6744.1 |========================================================= Clang 11.0 . 6823.8 |========================================================== FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 Mflops > Higher Is Better Clang 12.0 . 15649 |=========================================================== Clang 11.0 . 14590 |======================================================= FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 1024 Mflops > Higher Is Better Clang 12.0 . 50350 |=========================================================== Clang 11.0 . 50740 |=========================================================== FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 Mflops > Higher Is Better Clang 12.0 . 51254 |=========================================================== Clang 11.0 . 50084 |========================================================== FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 Mflops > Higher Is Better Clang 12.0 . 45428 |========================================================= Clang 11.0 . 46676 |=========================================================== FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 1024 Mflops > Higher Is Better Clang 12.0 . 36239 |=========================================================== Clang 11.0 . 36181 |=========================================================== FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 Mflops > Higher Is Better Clang 12.0 . 31935 |=========================================================== Clang 11.0 . 31741 |=========================================================== FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Mflops > Higher Is Better Clang 12.0 . 22797 |=========================================================== Clang 11.0 . 22913 |=========================================================== Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Seconds < Lower Is Better Clang 12.0 ..... 89.12 |==================================================== Clang 11.0 ..... 88.62 |==================================================== Clang 12.0 LTO . 93.63 |======================================================= WebP Image Encode 1.1 Encode Settings: Default Encode Time - Seconds < Lower Is Better Clang 12.0 . 1.331 |=========================================================== Clang 11.0 . 1.336 |=========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100 Encode Time - Seconds < Lower Is Better Clang 12.0 . 2.199 |========================================================== Clang 11.0 . 2.240 |=========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Encode Time - Seconds < Lower Is Better Clang 12.0 . 19.02 |=========================================================== Clang 11.0 . 18.57 |========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Encode Time - Seconds < Lower Is Better Clang 12.0 . 6.309 |=========================================================== Clang 11.0 . 6.243 |========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Encode Time - Seconds < Lower Is Better Clang 12.0 . 38.45 |=========================================================== Clang 11.0 . 37.73 |========================================================== simdjson 0.8.2 Throughput Test: Kostya GB/s > Higher Is Better Clang 12.0 . 2.75 |============================================================ Clang 11.0 . 2.68 |========================================================== simdjson 0.8.2 Throughput Test: LargeRandom GB/s > Higher Is Better Clang 12.0 . 0.84 |============================================================ Clang 11.0 . 0.81 |========================================================== simdjson 0.8.2 Throughput Test: PartialTweets GB/s > Higher Is Better Clang 12.0 . 4.60 |============================================================ Clang 11.0 . 4.41 |========================================================== simdjson 0.8.2 Throughput Test: DistinctUserID GB/s > Higher Is Better Clang 12.0 . 4.62 |============================================================ Clang 11.0 . 4.41 |========================================================= LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed MB/s > Higher Is Better Clang 12.0 ..... 52.07 |======================================================= Clang 11.0 ..... 52.35 |======================================================= Clang 12.0 LTO . 50.93 |====================================================== LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better Clang 12.0 ..... 13911.5 |===================================================== Clang 11.0 ..... 13840.3 |===================================================== Clang 12.0 LTO . 13715.0 |==================================================== LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed MB/s > Higher Is Better Clang 12.0 ..... 48.50 |====================================================== Clang 11.0 ..... 49.01 |======================================================= Clang 12.0 LTO . 48.47 |====================================================== LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed MB/s > Higher Is Better Clang 12.0 ..... 13926.5 |===================================================== Clang 11.0 ..... 13927.9 |===================================================== Clang 12.0 LTO . 13698.7 |==================================================== JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 MP/s > Higher Is Better Clang 12.0 . 74.27 |======================================================== Clang 11.0 . 78.41 |=========================================================== JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 MP/s > Higher Is Better Clang 12.0 . 12.15 |=========================================================== Clang 11.0 . 12.01 |========================================================== JPEG XL 0.3.3 Input: PNG - Encode Speed: 8 MP/s > Higher Is Better Clang 12.0 . 0.82 |============================================================ Clang 11.0 . 0.80 |=========================================================== JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 MP/s > Higher Is Better Clang 12.0 . 66.66 |=========================================================== Clang 11.0 . 65.58 |========================================================== JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 MP/s > Higher Is Better Clang 12.0 . 66.38 |=========================================================== Clang 11.0 . 65.43 |========================================================== JPEG XL 0.3.3 Input: JPEG - Encode Speed: 8 MP/s > Higher Is Better Clang 12.0 . 28.13 |=========================================================== Clang 11.0 . 27.24 |========================================================= SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better Clang 12.0 . 3190.62 |======================================================= Clang 11.0 . 3319.34 |========================================================= SciMark 2.0 Computational Test: Monte Carlo Mflops > Higher Is Better Clang 12.0 . 675.13 |========================================================== Clang 11.0 . 674.86 |========================================================== SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better Clang 12.0 . 363.85 |===================================================== Clang 11.0 . 399.16 |========================================================== SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better Clang 12.0 . 4280.22 |===================================================== Clang 11.0 . 4590.37 |========================================================= SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better Clang 12.0 . 8848.40 |======================================================= Clang 11.0 . 9146.88 |========================================================= SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better Clang 12.0 . 1785.50 |========================================================= Clang 11.0 . 1785.42 |========================================================= Botan 2.17.3 Test: KASUMI MiB/s > Higher Is Better Clang 12.0 . 82.64 |=========================================================== Clang 11.0 . 79.15 |========================================================= Botan 2.17.3 Test: KASUMI - Decrypt MiB/s > Higher Is Better Clang 12.0 . 84.23 |=========================================================== Clang 11.0 . 80.22 |======================================================== Botan 2.17.3 Test: AES-256 MiB/s > Higher Is Better Clang 12.0 . 4659.34 |====================================================== Clang 11.0 . 4901.13 |========================================================= Botan 2.17.3 Test: AES-256 - Decrypt MiB/s > Higher Is Better Clang 12.0 . 4682.46 |======================================================= Clang 11.0 . 4895.56 |========================================================= Botan 2.17.3 Test: Twofish MiB/s > Higher Is Better Clang 12.0 . 315.41 |========================================================== Clang 11.0 . 299.21 |======================================================= Botan 2.17.3 Test: Twofish - Decrypt MiB/s > Higher Is Better Clang 12.0 . 321.19 |========================================================== Clang 11.0 . 302.41 |======================================================= Botan 2.17.3 Test: Blowfish MiB/s > Higher Is Better Clang 12.0 . 380.05 |========================================================== Clang 11.0 . 319.23 |================================================= Botan 2.17.3 Test: Blowfish - Decrypt MiB/s > Higher Is Better Clang 12.0 . 351.28 |========================================================== Clang 11.0 . 351.08 |========================================================== Botan 2.17.3 Test: CAST-256 MiB/s > Higher Is Better Clang 12.0 . 132.82 |========================================================== Clang 11.0 . 128.59 |======================================================== Botan 2.17.3 Test: CAST-256 - Decrypt MiB/s > Higher Is Better Clang 12.0 . 133.05 |========================================================== Clang 11.0 . 127.74 |======================================================== Botan 2.17.3 Test: ChaCha20Poly1305 MiB/s > Higher Is Better Clang 12.0 . 850.50 |========================================================== Clang 11.0 . 848.24 |========================================================== Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt MiB/s > Higher Is Better Clang 12.0 . 843.40 |========================================================== Clang 11.0 . 840.64 |========================================================== LibRaw 0.20 Post-Processing Benchmark Mpix/sec > Higher Is Better Clang 12.0 . 41.78 |=========================================================== Clang 11.0 . 38.71 |======================================================= TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better Clang 12.0 . 1570966 |======================================================= Clang 11.0 . 1638265 |========================================================= GraphicsMagick 1.3.33 Operation: Swirl Iterations Per Minute > Higher Is Better Clang 12.0 . 1993 |============================================================ Clang 11.0 . 1915 |========================================================== GraphicsMagick 1.3.33 Operation: Rotate Iterations Per Minute > Higher Is Better Clang 12.0 . 712 |============================================================= Clang 11.0 . 665 |========================================================= GraphicsMagick 1.3.33 Operation: Sharpen Iterations Per Minute > Higher Is Better Clang 12.0 . 614 |============================================================= Clang 11.0 . 613 |============================================================= GraphicsMagick 1.3.33 Operation: Enhanced Iterations Per Minute > Higher Is Better Clang 12.0 . 1076 |============================================================ Clang 11.0 . 1068 |============================================================ GraphicsMagick 1.3.33 Operation: Resizing Iterations Per Minute > Higher Is Better Clang 12.0 . 2136 |============================================================ Clang 11.0 . 2034 |========================================================= GraphicsMagick 1.3.33 Operation: Noise-Gaussian Iterations Per Minute > Higher Is Better Clang 12.0 . 457 |============================================================ Clang 11.0 . 463 |============================================================= GraphicsMagick 1.3.33 Operation: HWB Color Space Iterations Per Minute > Higher Is Better Clang 12.0 . 605 |============================================================ Clang 11.0 . 616 |============================================================= dav1d 0.8.2 Video Input: Chimera 1080p FPS > Higher Is Better Clang 12.0 . 1198.22 |========================================================= Clang 11.0 . 1190.41 |========================================================= dav1d 0.8.2 Video Input: Summer Nature 4K FPS > Higher Is Better Clang 12.0 . 541.56 |========================================================== Clang 11.0 . 543.43 |========================================================== dav1d 0.8.2 Video Input: Summer Nature 1080p FPS > Higher Is Better Clang 12.0 . 1244.11 |========================================================= Clang 11.0 . 1251.25 |========================================================= dav1d 0.8.2 Video Input: Chimera 1080p 10-bit FPS > Higher Is Better Clang 12.0 . 308.32 |========================================================== Clang 11.0 . 184.19 |=================================== AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 0.21 |============================================================ Clang 11.0 . 0.21 |============================================================ AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 4.87 |=========================================================== Clang 11.0 . 4.95 |============================================================ AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 17.22 |=========================================================== Clang 11.0 . 17.13 |=========================================================== AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 8.99 |=========================================================== Clang 11.0 . 9.14 |============================================================ AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 33.39 |=========================================================== Clang 11.0 . 33.14 |=========================================================== AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 38.11 |=========================================================== Clang 11.0 . 37.28 |========================================================== AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 0.53 |============================================================ Clang 11.0 . 0.53 |============================================================ AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 7.10 |=========================================================== Clang 11.0 . 7.20 |============================================================ AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 26.85 |=========================================================== Clang 11.0 . 26.61 |========================================================== AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 22.13 |=========================================================== Clang 11.0 . 22.00 |=========================================================== AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 88.78 |=========================================================== Clang 11.0 . 86.09 |========================================================= AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 103.17 |========================================================== Clang 11.0 . 100.55 |========================================================= SVT-AV1 0.8 Encoder Mode: Enc Mode 0 - Input: 1080p Frames Per Second > Higher Is Better Clang 12.0 . 0.183 |=========================================================== Clang 11.0 . 0.181 |========================================================== SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p Frames Per Second > Higher Is Better Clang 12.0 . 11.47 |========================================================= Clang 11.0 . 11.82 |=========================================================== SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p Frames Per Second > Higher Is Better Clang 12.0 . 118.07 |========================================================== Clang 11.0 . 117.39 |========================================================== SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 41.09 |=========================================================== Clang 11.0 . 41.01 |=========================================================== SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 345.30 |========================================================== Clang 11.0 . 346.89 |========================================================== SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 643.58 |========================================================= Clang 11.0 . 652.74 |========================================================== SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 487.43 |========================================================== Clang 11.0 . 481.05 |========================================================= SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 488.23 |========================================================== Clang 11.0 . 482.02 |========================================================= SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 372.49 |========================================================== Clang 11.0 . 373.99 |========================================================== x265 3.4 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better Clang 12.0 . 30.32 |=========================================================== Clang 11.0 . 29.94 |========================================================== x265 3.4 Video Input: Bosphorus 1080p Frames Per Second > Higher Is Better Clang 12.0 . 74.00 |=========================================================== Clang 11.0 . 73.36 |========================================================== Coremark 1.0 CoreMark Size 666 - Iterations Per Second Iterations/Sec > Higher Is Better Clang 12.0 . 1785466.28 |====================================================== Clang 11.0 . 1790837.01 |====================================================== libavif avifenc 0.9.0 Encoder Speed: 0 Seconds < Lower Is Better Clang 12.0 . 47.88 |=========================================================== Clang 11.0 . 47.89 |=========================================================== libavif avifenc 0.9.0 Encoder Speed: 2 Seconds < Lower Is Better Clang 12.0 . 25.18 |========================================================== Clang 11.0 . 25.47 |=========================================================== libavif avifenc 0.9.0 Encoder Speed: 6 Seconds < Lower Is Better Clang 12.0 . 9.510 |=========================================================== Clang 11.0 . 9.536 |=========================================================== libavif avifenc 0.9.0 Encoder Speed: 10 Seconds < Lower Is Better Clang 12.0 . 3.361 |========================================================== Clang 11.0 . 3.429 |=========================================================== libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Seconds < Lower Is Better Clang 12.0 . 25.22 |========================================================= Clang 11.0 . 26.03 |=========================================================== libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Seconds < Lower Is Better Clang 12.0 . 5.746 |========================================================== Clang 11.0 . 5.879 |=========================================================== C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better Clang 12.0 . 15.87 |=========================================================== Clang 11.0 . 15.60 |========================================================== POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better Clang 12.0 . 9.296 |========================================================== Clang 11.0 . 9.408 |=========================================================== oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1.07701 |========================================================= Clang 11.0 . 1.08011 |========================================================= oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 3.28507 |===================================================== Clang 11.0 . 3.52787 |========================================================= oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1.07507 |========================================================= Clang 11.0 . 1.07577 |========================================================= oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 0.710124 |======================================================== Clang 11.0 . 0.594729 |=============================================== oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1.221320 |======================================================== Clang 11.0 . 0.841169 |======================================= oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1.44425 |======================================================== Clang 11.0 . 1.45757 |========================================================= oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 2.36797 |========================================================= Clang 11.0 . 2.31859 |======================================================== oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 2.03606 |========================================================= Clang 11.0 . 1.60540 |============================================= oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 0.491940 |======================================================== Clang 11.0 . 0.489278 |======================================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 0.779776 |======================================================== Clang 11.0 . 0.779101 |======================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1302.70 |========================================================= Clang 11.0 . 1276.04 |======================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 593.97 |========================================================== Clang 11.0 . 563.20 |======================================================= oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1307.49 |========================================================= Clang 11.0 . 1277.62 |======================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 590.18 |========================================================== Clang 11.0 . 562.97 |======================================================= oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 0.313689 |======================================================== Clang 11.0 . 0.315522 |======================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1305.10 |========================================================= Clang 11.0 . 1271.91 |======================================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better Clang 12.0 . 597.48 |========================================================== Clang 11.0 . 563.25 |======================================================= oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better Clang 12.0 . 1.17258 |========================================================= Clang 11.0 . 1.15140 |======================================================== FLAC Audio Encoding 1.3.2 WAV To FLAC Seconds < Lower Is Better Clang 12.0 . 7.854 |========================================================== Clang 11.0 . 7.979 |=========================================================== LAME MP3 Encoding 3.100 WAV To MP3 Seconds < Lower Is Better Clang 12.0 . 8.256 |=========================================================== Clang 11.0 . 8.250 |=========================================================== Opus Codec Encoding 1.3.1 WAV To Opus Encode Seconds < Lower Is Better Clang 12.0 . 7.567 |=========================================================== Clang 11.0 . 7.392 |========================================================== Gcrypt Library 1.9 Seconds < Lower Is Better Clang 12.0 . 236.92 |========================================================= Clang 11.0 . 240.21 |========================================================== Ngspice 34 Circuit: C2670 Seconds < Lower Is Better Clang 12.0 . 118.87 |========================================================== Clang 11.0 . 103.83 |=================================================== Ngspice 34 Circuit: C7552 Seconds < Lower Is Better Clang 12.0 . 95.96 |=========================================================== Clang 11.0 . 90.53 |======================================================== Tachyon 0.99b6 Total Time Seconds < Lower Is Better Clang 12.0 . 16.05 |========================================================== Clang 11.0 . 16.41 |=========================================================== WebP2 Image Encode 20210126 Encode Settings: Default Seconds < Lower Is Better Clang 12.0 . 2.739 |=========================================================== Clang 11.0 . 2.743 |=========================================================== WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 Seconds < Lower Is Better Clang 12.0 . 109.53 |========================================================== Clang 11.0 . 109.64 |========================================================== WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 Seconds < Lower Is Better Clang 12.0 . 207.01 |========================================================== Clang 11.0 . 203.63 |========================================================= WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Seconds < Lower Is Better Clang 12.0 . 6.690 |====================================================== Clang 11.0 . 7.366 |=========================================================== WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression Seconds < Lower Is Better Clang 12.0 . 374.04 |======================================================= Clang 11.0 . 392.85 |========================================================== Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better Clang 12.0 . 55663000 |======================================================= Clang 11.0 . 56307000 |======================================================== Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better Clang 12.0 . 1564833333 |====================================================== Clang 11.0 . 1578400000 |====================================================== Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better Clang 12.0 . 3070633333 |====================================================== Clang 11.0 . 3051366667 |====================================================== Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better Clang 12.0 . 3643766667 |====================================================== Clang 11.0 . 3596533333 |===================================================== FinanceBench 2016-07-25 Benchmark: Repo OpenMP ms < Lower Is Better Clang 12.0 . 33246.84 |======================================================== Clang 11.0 . 33178.50 |======================================================== FinanceBench 2016-07-25 Benchmark: Bonds OpenMP ms < Lower Is Better Clang 12.0 . 51596.87 |======================================================== Clang 11.0 . 51900.43 |======================================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better Clang 12.0 . 471 |========================================================== Clang 11.0 . 495 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better Clang 12.0 . 357 |===================================================== Clang 11.0 . 412 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better Clang 12.0 . 434 |========================================================= Clang 11.0 . 462 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better Clang 12.0 . 604 |=================== Clang 11.0 . 1877 |============================================================ ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better Clang 12.0 . 878 |=================================================== Clang 11.0 . 1043 |============================================================ ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better Clang 12.0 . 819 |====================================================== Clang 11.0 . 933 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better Clang 12.0 . 69.1 |============================================================ Clang 11.0 . 51.2 |============================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better Clang 12.0 . 626 |======================================================== Clang 11.0 . 677 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better Clang 12.0 . 48.6 |=================================== Clang 11.0 . 83.6 |============================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better Clang 12.0 . 65.7 |================================================== Clang 11.0 . 79.3 |============================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better Clang 12.0 . 51.9 |=================================== Clang 11.0 . 88.3 |============================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better Clang 12.0 . 73.0 |==================================================== Clang 11.0 . 84.0 |============================================================ ASTC Encoder 2.4 Preset: Medium Seconds < Lower Is Better Clang 12.0 . 4.0058 |========================================================== Clang 11.0 . 3.9837 |========================================================== ASTC Encoder 2.4 Preset: Thorough Seconds < Lower Is Better Clang 12.0 . 6.7647 |========================================================== Clang 11.0 . 6.7674 |========================================================== ASTC Encoder 2.4 Preset: Exhaustive Seconds < Lower Is Better Clang 12.0 . 18.99 |=========================================================== Clang 11.0 . 19.03 |=========================================================== ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better Clang 12.0 . 333 |=========================================================== Clang 11.0 . 346 |============================================================= ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better Clang 12.0 . 498 |============================================================= Clang 11.0 . 471 |========================================================== ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better Clang 12.0 . 112 |============================================================= Clang 11.0 . 108 |=========================================================== ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better Clang 12.0 . 9904 |============================================================ Clang 11.0 . 9797 |=========================================================== ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Inferences Per Minute > Higher Is Better Clang 12.0 . 4456 |=========================================================== Clang 11.0 . 4523 |============================================================ SecureMark 1.0.4 Benchmark: SecureMark-TLS marks > Higher Is Better Clang 12.0 . 265204 |========================================================== Clang 11.0 . 260119 |========================================================= PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 1 - Mode: Read Only TPS > Higher Is Better Clang 12.0 . 24310 |========================================================== Clang 11.0 . 24943 |=========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency ms < Lower Is Better Clang 12.0 . 0.041 |=========================================================== Clang 11.0 . 0.040 |========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 1 - Mode: Read Write TPS > Higher Is Better Clang 12.0 . 3281 |=========================================================== Clang 11.0 . 3312 |============================================================ PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency ms < Lower Is Better Clang 12.0 . 0.305 |=========================================================== Clang 11.0 . 0.302 |========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only TPS > Higher Is Better Clang 12.0 . 1069022 |========================================================= Clang 11.0 . 1069367 |========================================================= PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency ms < Lower Is Better Clang 12.0 . 0.094 |=========================================================== Clang 11.0 . 0.094 |=========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only TPS > Higher Is Better Clang 12.0 . 1071209 |========================================================= Clang 11.0 . 1065506 |========================================================= PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency ms < Lower Is Better Clang 12.0 . 0.234 |=========================================================== Clang 11.0 . 0.235 |=========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Write TPS > Higher Is Better Clang 12.0 . 62319 |=========================================================== Clang 11.0 . 61616 |========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency ms < Lower Is Better Clang 12.0 . 1.607 |========================================================== Clang 11.0 . 1.626 |=========================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write TPS > Higher Is Better Clang 12.0 . 56684 |=========================================================== Clang 11.0 . 54488 |========================================================= PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency ms < Lower Is Better Clang 12.0 . 4.431 |========================================================= Clang 11.0 . 4.603 |===========================================================