AMD AOCC 4.0 Compiler Benchmarks

Initial AOCC 4.0 compiler benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2211114-PTS-AMDAOCC455
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 3 Tests
C++ Boost Tests 2 Tests
C/C++ Compiler Tests 9 Tests
CPU Massive 10 Tests
Creator Workloads 13 Tests
Cryptography 2 Tests
Encoding 7 Tests
Game Development 2 Tests
Imaging 3 Tests
Multi-Core 11 Tests
Raytracing 2 Tests
Renderers 2 Tests
Software Defined Radio 2 Tests
Server CPU Tests 8 Tests
Texture Compression 2 Tests
Video Encoding 7 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
LLVM Clang 14
November 11 2022
  1 Hour, 48 Minutes
AOCC 4.0
November 11 2022
  1 Hour, 47 Minutes
Invert Hiding All Results Option
  1 Hour, 48 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD AOCC 4.0 Compiler BenchmarksOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (0703 BIOS)AMD Device 14d832GB1000GB Sabrent Rocket 4.0 PlusAMD Radeon RX 6800 16GB (2475/1000MHz)AMD Navi 21/23ASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.106.1.0-060100rc3daily20221103-generic (x86_64)GNOME Shell 43.0X Server + Wayland4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49)1.3.224Clang 14.0.6-2Clang 14.0.6ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilersFile-SystemScreen ResolutionAMD AOCC 4.0 Compiler Benchmarks PerformanceSystem Logs- Transparent Huge Pages: madvise- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa601203 - Python 3.10.7- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - AOCC 4.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver4

LLVM Clang 14 vs. AOCC 4.0 ComparisonPhoronix Test SuiteBaseline+9.1%+9.1%+18.2%+18.2%+27.3%+27.3%+36.4%+36.4%36.5%7.2%7%6.2%6.1%5.4%5.1%4.3%4.2%3.8%3.7%3.6%3.5%3.3%3.2%3.2%2.9%2.1%2.1%Sharpen32 - 256 - 57Q.1.LEigenQ.1.L.H.CQ.1.H.C16 - 256 - 57Preset 12 - Bosphorus 4K2Medium6BLASPreset 10 - Bosphorus 4KBosphorus 4K6, LosslessThorough7 - Bosphorus 4KEnhancedBosphorus 4K - Very FastGraphicsMagickLiquid-DSPWebP Image EncodeLeelaChessZeroWebP Image EncodeWebP Image EncodeLiquid-DSPSVT-AV1libavif avifencASTC Encoderlibavif avifencLeelaChessZeroSVT-AV1x265libavif avifencASTC EncoderSVT-HEVCGraphicsMagickKvazaarLLVM Clang 14AOCC 4.0

AMD AOCC 4.0 Compiler Benchmarksquantlib: lczero: BLASlczero: Eigenxmrig: Monero - 1Mcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedwebp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionsrsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAMsrsran: 4G PHY_DL_Test 100 PRB SISO 64-QAMgraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizingdav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitquadray: 1 - 4Kquadray: 1 - 1080pkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastsvt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-hevc: 1 - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 4Ksvt-vp9: VMAF Optimized - Bosphorus 4Ksvt-vp9: PSNR/SSIM Optimized - Bosphorus 4Ksvt-vp9: Visual Quality Optimized - Bosphorus 4Kx265: Bosphorus 4Kavifenc: 2avifenc: 6avifenc: 6, Losslessc-ray: Total Time - 4K, 16 Rays Per Pixelopenssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57astcenc: Mediumastcenc: Thoroughastcenc: Exhaustivedraco: Church FacadeLLVM Clang 14AOCC 4.04535.61694184412543.478.94941.655.14966.32.015.890.82199.3209.93375812470393.37825.0026.23102.1248.0787.37139.029200.7876.43104.05169.39111.30120.22109.7336.9133.6353.0804.65027.097371079907705998.4391464.016661000001742933333156.196319.24431.969547044475.31755195912571.979.24981.155.45017.22.156.210.87201.3210.74605932474394.20821.2626.44101.5649.0688.76143.901209.4006.43107.09169.54112.09120.86110.2438.1232.2842.9714.50426.974372854928475971.9393216.417509333331868800000162.170719.86281.99794705OpenBenchmarking.org

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21AOCC 4.0LLVM Clang 1410002000300040005000SE +/- 32.16, N = 12SE +/- 44.47, N = 124475.34535.61. (CXX) g++ options: -O3 -march=native -rdynamic
OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21AOCC 4.0LLVM Clang 148001600240032004000Min: 4127.1 / Avg: 4475.28 / Max: 4531.2Min: 4053.4 / Avg: 4535.6 / Max: 4635.61. (CXX) g++ options: -O3 -march=native -rdynamic

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASAOCC 4.0LLVM Clang 14400800120016002000SE +/- 19.19, N = 4SE +/- 7.42, N = 3175516941. (CXX) g++ options: -flto -O3 -march=native -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASAOCC 4.0LLVM Clang 1430060090012001500Min: 1729 / Avg: 1754.5 / Max: 1811Min: 1679 / Avg: 1693.67 / Max: 17031. (CXX) g++ options: -flto -O3 -march=native -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenAOCC 4.0LLVM Clang 14400800120016002000SE +/- 7.67, N = 3SE +/- 8.95, N = 3195918441. (CXX) g++ options: -flto -O3 -march=native -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenAOCC 4.0LLVM Clang 1430060090012001500Min: 1951 / Avg: 1958.67 / Max: 1974Min: 1826 / Avg: 1843.67 / Max: 18551. (CXX) g++ options: -flto -O3 -march=native -pthread

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MAOCC 4.0LLVM Clang 143K6K9K12K15KSE +/- 26.06, N = 3SE +/- 30.70, N = 312571.912543.41. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -funroll-loops -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MAOCC 4.0LLVM Clang 142K4K6K8K10KMin: 12524.6 / Avg: 12571.93 / Max: 12614.5Min: 12484.7 / Avg: 12543.43 / Max: 12588.31. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -funroll-loops -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedAOCC 4.0LLVM Clang 1420406080100SE +/- 0.28, N = 3SE +/- 0.06, N = 379.278.91. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedAOCC 4.0LLVM Clang 141530456075Min: 78.9 / Avg: 79.23 / Max: 79.8Min: 78.8 / Avg: 78.9 / Max: 791. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedAOCC 4.0LLVM Clang 1411002200330044005500SE +/- 26.61, N = 3SE +/- 59.66, N = 34981.14941.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedAOCC 4.0LLVM Clang 149001800270036004500Min: 4929.6 / Avg: 4981.13 / Max: 5018.4Min: 4824.5 / Avg: 4941.57 / Max: 5020.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedAOCC 4.0LLVM Clang 141224364860SE +/- 0.06, N = 3SE +/- 0.17, N = 355.455.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedAOCC 4.0LLVM Clang 141122334455Min: 55.3 / Avg: 55.4 / Max: 55.5Min: 54.8 / Avg: 55.13 / Max: 55.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedAOCC 4.0LLVM Clang 1411002200330044005500SE +/- 14.78, N = 3SE +/- 4.60, N = 35017.24966.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedAOCC 4.0LLVM Clang 149001800270036004500Min: 4998.3 / Avg: 5017.17 / Max: 5046.3Min: 4961.1 / Avg: 4966.33 / Max: 4975.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessAOCC 4.0LLVM Clang 140.48380.96761.45141.93522.419SE +/- 0.01, N = 3SE +/- 0.00, N = 32.152.011. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessAOCC 4.0LLVM Clang 14246810Min: 2.12 / Avg: 2.15 / Max: 2.16Min: 2.01 / Avg: 2.01 / Max: 2.021. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionAOCC 4.0LLVM Clang 14246810SE +/- 0.01, N = 3SE +/- 0.00, N = 36.215.891. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionAOCC 4.0LLVM Clang 14246810Min: 6.2 / Avg: 6.21 / Max: 6.23Min: 5.89 / Avg: 5.89 / Max: 5.91. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionAOCC 4.0LLVM Clang 140.19580.39160.58740.78320.979SE +/- 0.00, N = 3SE +/- 0.00, N = 30.870.821. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionAOCC 4.0LLVM Clang 14246810Min: 0.87 / Avg: 0.87 / Max: 0.87Min: 0.82 / Avg: 0.82 / Max: 0.821. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

srsRAN

srsRAN is an open-source LTE/5G software radio suite created by Software Radio Systems (SRS). The srsRAN radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAMAOCC 4.0LLVM Clang 144080120160200SE +/- 0.84, N = 3SE +/- 1.03, N = 3201.3199.31. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm
OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAMAOCC 4.0LLVM Clang 144080120160200Min: 199.7 / Avg: 201.33 / Max: 202.5Min: 197.8 / Avg: 199.33 / Max: 201.31. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm

OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB SISO 64-QAMAOCC 4.0LLVM Clang 1450100150200250SE +/- 0.99, N = 3SE +/- 0.21, N = 3210.7209.91. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm
OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB SISO 64-QAMAOCC 4.0LLVM Clang 144080120160200Min: 208.9 / Avg: 210.7 / Max: 212.3Min: 209.5 / Avg: 209.9 / Max: 210.21. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenAOCC 4.0LLVM Clang 14100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 34603371. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenAOCC 4.0LLVM Clang 1480160240320400Min: 459 / Avg: 459.67 / Max: 460Min: 337 / Avg: 337.33 / Max: 3381. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedAOCC 4.0LLVM Clang 14130260390520650SE +/- 0.33, N = 3SE +/- 0.00, N = 35935811. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedAOCC 4.0LLVM Clang 14100200300400500Min: 593 / Avg: 593.33 / Max: 594Min: 581 / Avg: 581 / Max: 5811. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingAOCC 4.0LLVM Clang 145001000150020002500SE +/- 3.51, N = 3SE +/- 0.88, N = 3247424701. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingAOCC 4.0LLVM Clang 14400800120016002000Min: 2470 / Avg: 2474 / Max: 2481Min: 2468 / Avg: 2469.67 / Max: 24711. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Summer Nature 4KAOCC 4.0LLVM Clang 1490180270360450SE +/- 2.47, N = 3SE +/- 3.69, N = 3394.20393.371. (CC) gcc options: -O3 -march=native -pthread -lm
OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Summer Nature 4KAOCC 4.0LLVM Clang 1470140210280350Min: 390.79 / Avg: 394.2 / Max: 399Min: 386 / Avg: 393.37 / Max: 397.361. (CC) gcc options: -O3 -march=native -pthread -lm

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Chimera 1080p 10-bitAOCC 4.0LLVM Clang 142004006008001000SE +/- 1.11, N = 3SE +/- 1.48, N = 3821.26825.001. (CC) gcc options: -O3 -march=native -pthread -lm
OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Chimera 1080p 10-bitAOCC 4.0LLVM Clang 14140280420560700Min: 819.5 / Avg: 821.26 / Max: 823.32Min: 822.92 / Avg: 825 / Max: 827.851. (CC) gcc options: -O3 -march=native -pthread -lm

QuadRay

VectorChief's QuadRay is a real-time ray-tracing engine written to support SIMD across ARM, MIPS, PPC, and x86/x86_64 processors. QuadRay supports SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 usage on Intel/AMD CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 4KAOCC 4.0LLVM Clang 14612182430SE +/- 0.06, N = 3SE +/- 0.10, N = 326.4426.231. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread
OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 4KAOCC 4.0LLVM Clang 14612182430Min: 26.33 / Avg: 26.44 / Max: 26.52Min: 26.04 / Avg: 26.23 / Max: 26.391. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread

OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 1080pAOCC 4.0LLVM Clang 1420406080100SE +/- 0.40, N = 3SE +/- 0.20, N = 3101.56102.121. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread
OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 1080pAOCC 4.0LLVM Clang 1420406080100Min: 100.93 / Avg: 101.56 / Max: 102.31Min: 101.75 / Avg: 102.12 / Max: 102.431. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very FastAOCC 4.0LLVM Clang 141122334455SE +/- 0.36, N = 3SE +/- 0.32, N = 349.0648.071. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very FastAOCC 4.0LLVM Clang 141020304050Min: 48.37 / Avg: 49.06 / Max: 49.59Min: 47.43 / Avg: 48.07 / Max: 48.411. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Ultra FastAOCC 4.0LLVM Clang 1420406080100SE +/- 0.04, N = 3SE +/- 0.05, N = 388.7687.371. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Ultra FastAOCC 4.0LLVM Clang 1420406080100Min: 88.71 / Avg: 88.76 / Max: 88.83Min: 87.29 / Avg: 87.37 / Max: 87.451. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 10 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 14306090120150SE +/- 3.87, N = 15SE +/- 2.91, N = 15143.90139.03
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 10 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 14306090120150Min: 104.2 / Avg: 143.9 / Max: 150.89Min: 107.18 / Avg: 139.03 / Max: 144.31

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 12 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1450100150200250SE +/- 1.07, N = 3SE +/- 1.63, N = 3209.40200.79
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 12 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 144080120160200Min: 208.2 / Avg: 209.4 / Max: 211.54Min: 197.57 / Avg: 200.79 / Max: 202.82

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 14246810SE +/- 0.02, N = 3SE +/- 0.02, N = 36.436.431. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 143691215Min: 6.4 / Avg: 6.43 / Max: 6.47Min: 6.4 / Avg: 6.43 / Max: 6.471. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1420406080100SE +/- 0.23, N = 3SE +/- 0.06, N = 3107.09104.051. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1420406080100Min: 106.7 / Avg: 107.09 / Max: 107.49Min: 103.95 / Avg: 104.05 / Max: 104.171. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 144080120160200SE +/- 0.44, N = 3SE +/- 0.32, N = 3169.54169.391. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4KAOCC 4.0LLVM Clang 14306090120150Min: 168.78 / Avg: 169.54 / Max: 170.31Min: 168.82 / Avg: 169.39 / Max: 169.921. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4KAOCC 4.0LLVM Clang 14306090120150SE +/- 1.05, N = 15SE +/- 0.93, N = 15112.09111.301. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1420406080100Min: 102.08 / Avg: 112.09 / Max: 114.04Min: 101.67 / Avg: 111.3 / Max: 112.851. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4KAOCC 4.0LLVM Clang 14306090120150SE +/- 0.23, N = 3SE +/- 0.09, N = 3120.86120.221. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1420406080100Min: 120.41 / Avg: 120.86 / Max: 121.13Min: 120.04 / Avg: 120.22 / Max: 120.331. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1420406080100SE +/- 0.19, N = 3SE +/- 0.08, N = 3110.24109.731. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4KAOCC 4.0LLVM Clang 1420406080100Min: 109.92 / Avg: 110.24 / Max: 110.58Min: 109.64 / Avg: 109.73 / Max: 109.891. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KAOCC 4.0LLVM Clang 14918273645SE +/- 0.03, N = 3SE +/- 0.34, N = 338.1236.911. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KAOCC 4.0LLVM Clang 14816243240Min: 38.09 / Avg: 38.12 / Max: 38.17Min: 36.29 / Avg: 36.91 / Max: 37.471. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2AOCC 4.0LLVM Clang 14816243240SE +/- 0.08, N = 3SE +/- 0.22, N = 1532.2833.641. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2AOCC 4.0LLVM Clang 14714212835Min: 32.17 / Avg: 32.28 / Max: 32.44Min: 33.13 / Avg: 33.63 / Max: 36.171. (CXX) g++ options: -O3 -fPIC -march=native -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6AOCC 4.0LLVM Clang 140.6931.3862.0792.7723.465SE +/- 0.004, N = 3SE +/- 0.027, N = 82.9713.0801. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6AOCC 4.0LLVM Clang 14246810Min: 2.96 / Avg: 2.97 / Max: 2.98Min: 3.03 / Avg: 3.08 / Max: 3.271. (CXX) g++ options: -O3 -fPIC -march=native -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessAOCC 4.0LLVM Clang 141.04632.09263.13894.18525.2315SE +/- 0.011, N = 3SE +/- 0.032, N = 34.5044.6501. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessAOCC 4.0LLVM Clang 14246810Min: 4.48 / Avg: 4.5 / Max: 4.52Min: 4.61 / Avg: 4.65 / Max: 4.711. (CXX) g++ options: -O3 -fPIC -march=native -lm

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelAOCC 4.0LLVM Clang 14612182430SE +/- 0.01, N = 3SE +/- 0.15, N = 326.9727.101. (CC) gcc options: -lm -lpthread -O3 -march=native
OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelAOCC 4.0LLVM Clang 14612182430Min: 26.95 / Avg: 26.97 / Max: 27Min: 26.9 / Avg: 27.1 / Max: 27.391. (CC) gcc options: -lm -lpthread -O3 -march=native

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256AOCC 4.0LLVM Clang 148000M16000M24000M32000M40000MSE +/- 19509012.02, N = 3SE +/- 5864900.49, N = 337285492847371079907701. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256AOCC 4.0LLVM Clang 146000M12000M18000M24000M30000MMin: 37246557080 / Avg: 37285492846.67 / Max: 37307153720Min: 37096325130 / Avg: 37107990770 / Max: 371148846301. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096AOCC 4.0LLVM Clang 1413002600390052006500SE +/- 27.80, N = 3SE +/- 5.11, N = 35971.95998.41. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096AOCC 4.0LLVM Clang 1410002000300040005000Min: 5916.8 / Avg: 5971.87 / Max: 6006.1Min: 5989 / Avg: 5998.37 / Max: 6006.61. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096AOCC 4.0LLVM Clang 1480K160K240K320K400KSE +/- 114.01, N = 3SE +/- 1647.21, N = 3393216.4391464.01. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096AOCC 4.0LLVM Clang 1470K140K210K280K350KMin: 393020.4 / Avg: 393216.43 / Max: 393415.3Min: 388169.7 / Avg: 391464 / Max: 393136.11. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57AOCC 4.0LLVM Clang 14400M800M1200M1600M2000MSE +/- 2171277.15, N = 3SE +/- 1386842.94, N = 3175093333316661000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57AOCC 4.0LLVM Clang 14300M600M900M1200M1500MMin: 1748100000 / Avg: 1750933333.33 / Max: 1755200000Min: 1664200000 / Avg: 1666100000 / Max: 16688000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57AOCC 4.0LLVM Clang 14400M800M1200M1600M2000MSE +/- 1442220.51, N = 3SE +/- 5397633.23, N = 3186880000017429333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57AOCC 4.0LLVM Clang 14300M600M900M1200M1500MMin: 1866000000 / Avg: 1868800000 / Max: 1870800000Min: 1732200000 / Avg: 1742933333.33 / Max: 17493000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumAOCC 4.0LLVM Clang 144080120160200SE +/- 0.83, N = 3SE +/- 0.85, N = 3162.17156.201. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumAOCC 4.0LLVM Clang 14306090120150Min: 160.51 / Avg: 162.17 / Max: 163.06Min: 154.5 / Avg: 156.2 / Max: 157.211. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughAOCC 4.0LLVM Clang 14510152025SE +/- 0.05, N = 3SE +/- 0.08, N = 319.8619.241. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughAOCC 4.0LLVM Clang 14510152025Min: 19.76 / Avg: 19.86 / Max: 19.92Min: 19.09 / Avg: 19.24 / Max: 19.331. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveAOCC 4.0LLVM Clang 140.44950.8991.34851.7982.2475SE +/- 0.0025, N = 3SE +/- 0.0036, N = 31.99791.96951. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveAOCC 4.0LLVM Clang 14246810Min: 2 / Avg: 2 / Max: 2Min: 1.96 / Avg: 1.97 / Max: 1.981. (CXX) g++ options: -O3 -march=native -flto -pthread

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Church FacadeAOCC 4.0LLVM Clang 1410002000300040005000SE +/- 36.11, N = 3SE +/- 19.68, N = 3470547041. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Church FacadeAOCC 4.0LLVM Clang 148001600240032004000Min: 4654 / Avg: 4705.33 / Max: 4775Min: 4665 / Avg: 4704.33 / Max: 47251. (CXX) g++ options: -O3 -march=native