AMD AOCC 4.0 Compiler Benchmarks

Initial AOCC 4.0 compiler benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2211114-PTS-AMDAOCC455
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 3 Tests
C++ Boost Tests 2 Tests
C/C++ Compiler Tests 9 Tests
CPU Massive 10 Tests
Creator Workloads 13 Tests
Cryptography 2 Tests
Encoding 7 Tests
Game Development 2 Tests
Imaging 3 Tests
Multi-Core 11 Tests
Raytracing 2 Tests
Renderers 2 Tests
Software Defined Radio 2 Tests
Server CPU Tests 8 Tests
Texture Compression 2 Tests
Video Encoding 7 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
LLVM Clang 14
November 11 2022
  1 Hour, 48 Minutes
AOCC 4.0
November 11 2022
  1 Hour, 47 Minutes
Invert Hiding All Results Option
  1 Hour, 48 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD AOCC 4.0 Compiler BenchmarksOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (0703 BIOS)AMD Device 14d832GB1000GB Sabrent Rocket 4.0 PlusAMD Radeon RX 6800 16GB (2475/1000MHz)AMD Navi 21/23ASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.106.1.0-060100rc3daily20221103-generic (x86_64)GNOME Shell 43.0X Server + Wayland4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49)1.3.224Clang 14.0.6-2Clang 14.0.6ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilersFile-SystemScreen ResolutionAMD AOCC 4.0 Compiler Benchmarks PerformanceSystem Logs- Transparent Huge Pages: madvise- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa601203 - Python 3.10.7- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - AOCC 4.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver4

LLVM Clang 14 vs. AOCC 4.0 ComparisonPhoronix Test SuiteBaseline+9.1%+9.1%+18.2%+18.2%+27.3%+27.3%+36.4%+36.4%36.5%7.2%7%6.2%6.1%5.4%5.1%4.3%4.2%3.8%3.7%3.6%3.5%3.3%3.2%3.2%2.9%2.1%2.1%Sharpen32 - 256 - 57Q.1.LEigenQ.1.L.H.CQ.1.H.C16 - 256 - 57Preset 12 - Bosphorus 4K2Medium6BLASPreset 10 - Bosphorus 4KBosphorus 4K6, LosslessThorough7 - Bosphorus 4KEnhancedBosphorus 4K - Very FastGraphicsMagickLiquid-DSPWebP Image EncodeLeelaChessZeroWebP Image EncodeWebP Image EncodeLiquid-DSPSVT-AV1libavif avifencASTC Encoderlibavif avifencLeelaChessZeroSVT-AV1x265libavif avifencASTC EncoderSVT-HEVCGraphicsMagickKvazaarLLVM Clang 14AOCC 4.0

AMD AOCC 4.0 Compiler Benchmarksquantlib: lczero: BLASlczero: Eigenxmrig: Monero - 1Mcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedwebp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionsrsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAMsrsran: 4G PHY_DL_Test 100 PRB SISO 64-QAMgraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizingdav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitquadray: 1 - 4Kquadray: 1 - 1080pkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastsvt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-hevc: 1 - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 4Ksvt-vp9: VMAF Optimized - Bosphorus 4Ksvt-vp9: PSNR/SSIM Optimized - Bosphorus 4Ksvt-vp9: Visual Quality Optimized - Bosphorus 4Kx265: Bosphorus 4Kavifenc: 2avifenc: 6avifenc: 6, Losslessc-ray: Total Time - 4K, 16 Rays Per Pixelopenssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57astcenc: Mediumastcenc: Thoroughastcenc: Exhaustivedraco: Church FacadeLLVM Clang 14AOCC 4.04535.61694184412543.478.94941.655.14966.32.015.890.82199.3209.93375812470393.37825.0026.23102.1248.0787.37139.029200.7876.43104.05169.39111.30120.22109.7336.9133.6353.0804.65027.097371079907705998.4391464.016661000001742933333156.196319.24431.969547044475.31755195912571.979.24981.155.45017.22.156.210.87201.3210.74605932474394.20821.2626.44101.5649.0688.76143.901209.4006.43107.09169.54112.09120.86110.2438.1232.2842.9714.50426.974372854928475971.9393216.417509333331868800000162.170719.86281.99794705OpenBenchmarking.org

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21LLVM Clang 14AOCC 4.010002000300040005000SE +/- 44.47, N = 12SE +/- 32.16, N = 124535.64475.31. (CXX) g++ options: -O3 -march=native -rdynamic
OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21LLVM Clang 14AOCC 4.08001600240032004000Min: 4053.4 / Avg: 4535.6 / Max: 4635.6Min: 4127.1 / Avg: 4475.28 / Max: 4531.21. (CXX) g++ options: -O3 -march=native -rdynamic

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASLLVM Clang 14AOCC 4.0400800120016002000SE +/- 7.42, N = 3SE +/- 19.19, N = 4169417551. (CXX) g++ options: -flto -O3 -march=native -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASLLVM Clang 14AOCC 4.030060090012001500Min: 1679 / Avg: 1693.67 / Max: 1703Min: 1729 / Avg: 1754.5 / Max: 18111. (CXX) g++ options: -flto -O3 -march=native -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenLLVM Clang 14AOCC 4.0400800120016002000SE +/- 8.95, N = 3SE +/- 7.67, N = 3184419591. (CXX) g++ options: -flto -O3 -march=native -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenLLVM Clang 14AOCC 4.030060090012001500Min: 1826 / Avg: 1843.67 / Max: 1855Min: 1951 / Avg: 1958.67 / Max: 19741. (CXX) g++ options: -flto -O3 -march=native -pthread

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MLLVM Clang 14AOCC 4.03K6K9K12K15KSE +/- 30.70, N = 3SE +/- 26.06, N = 312543.412571.91. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -funroll-loops -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MLLVM Clang 14AOCC 4.02K4K6K8K10KMin: 12484.7 / Avg: 12543.43 / Max: 12588.3Min: 12524.6 / Avg: 12571.93 / Max: 12614.51. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -funroll-loops -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedLLVM Clang 14AOCC 4.020406080100SE +/- 0.06, N = 3SE +/- 0.28, N = 378.979.21. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedLLVM Clang 14AOCC 4.01530456075Min: 78.8 / Avg: 78.9 / Max: 79Min: 78.9 / Avg: 79.23 / Max: 79.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedLLVM Clang 14AOCC 4.011002200330044005500SE +/- 59.66, N = 3SE +/- 26.61, N = 34941.64981.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedLLVM Clang 14AOCC 4.09001800270036004500Min: 4824.5 / Avg: 4941.57 / Max: 5020.1Min: 4929.6 / Avg: 4981.13 / Max: 5018.41. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedLLVM Clang 14AOCC 4.01224364860SE +/- 0.17, N = 3SE +/- 0.06, N = 355.155.41. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedLLVM Clang 14AOCC 4.01122334455Min: 54.8 / Avg: 55.13 / Max: 55.3Min: 55.3 / Avg: 55.4 / Max: 55.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedLLVM Clang 14AOCC 4.011002200330044005500SE +/- 4.60, N = 3SE +/- 14.78, N = 34966.35017.21. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedLLVM Clang 14AOCC 4.09001800270036004500Min: 4961.1 / Avg: 4966.33 / Max: 4975.5Min: 4998.3 / Avg: 5017.17 / Max: 5046.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessLLVM Clang 14AOCC 4.00.48380.96761.45141.93522.419SE +/- 0.00, N = 3SE +/- 0.01, N = 32.012.151. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessLLVM Clang 14AOCC 4.0246810Min: 2.01 / Avg: 2.01 / Max: 2.02Min: 2.12 / Avg: 2.15 / Max: 2.161. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionLLVM Clang 14AOCC 4.0246810SE +/- 0.00, N = 3SE +/- 0.01, N = 35.896.211. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionLLVM Clang 14AOCC 4.0246810Min: 5.89 / Avg: 5.89 / Max: 5.9Min: 6.2 / Avg: 6.21 / Max: 6.231. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionLLVM Clang 14AOCC 4.00.19580.39160.58740.78320.979SE +/- 0.00, N = 3SE +/- 0.00, N = 30.820.871. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionLLVM Clang 14AOCC 4.0246810Min: 0.82 / Avg: 0.82 / Max: 0.82Min: 0.87 / Avg: 0.87 / Max: 0.871. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

srsRAN

srsRAN is an open-source LTE/5G software radio suite created by Software Radio Systems (SRS). The srsRAN radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAMLLVM Clang 14AOCC 4.04080120160200SE +/- 1.03, N = 3SE +/- 0.84, N = 3199.3201.31. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm
OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAMLLVM Clang 14AOCC 4.04080120160200Min: 197.8 / Avg: 199.33 / Max: 201.3Min: 199.7 / Avg: 201.33 / Max: 202.51. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm

OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB SISO 64-QAMLLVM Clang 14AOCC 4.050100150200250SE +/- 0.21, N = 3SE +/- 0.99, N = 3209.9210.71. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm
OpenBenchmarking.orgUE Mb/s, More Is BettersrsRAN 22.04.1Test: 4G PHY_DL_Test 100 PRB SISO 64-QAMLLVM Clang 14AOCC 4.04080120160200Min: 209.5 / Avg: 209.9 / Max: 210.2Min: 208.9 / Avg: 210.7 / Max: 212.31. (CXX) g++ options: -O3 -march=native -std=c++14 -fno-strict-aliasing -mfpmath=sse -mavx2 -fvisibility=hidden -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -latomic -ldl -lpthread -lm

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenLLVM Clang 14AOCC 4.0100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 33374601. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenLLVM Clang 14AOCC 4.080160240320400Min: 337 / Avg: 337.33 / Max: 338Min: 459 / Avg: 459.67 / Max: 4601. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedLLVM Clang 14AOCC 4.0130260390520650SE +/- 0.00, N = 3SE +/- 0.33, N = 35815931. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedLLVM Clang 14AOCC 4.0100200300400500Min: 581 / Avg: 581 / Max: 581Min: 593 / Avg: 593.33 / Max: 5941. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingLLVM Clang 14AOCC 4.05001000150020002500SE +/- 0.88, N = 3SE +/- 3.51, N = 3247024741. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingLLVM Clang 14AOCC 4.0400800120016002000Min: 2468 / Avg: 2469.67 / Max: 2471Min: 2470 / Avg: 2474 / Max: 24811. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Summer Nature 4KLLVM Clang 14AOCC 4.090180270360450SE +/- 3.69, N = 3SE +/- 2.47, N = 3393.37394.201. (CC) gcc options: -O3 -march=native -pthread -lm
OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Summer Nature 4KLLVM Clang 14AOCC 4.070140210280350Min: 386 / Avg: 393.37 / Max: 397.36Min: 390.79 / Avg: 394.2 / Max: 3991. (CC) gcc options: -O3 -march=native -pthread -lm

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Chimera 1080p 10-bitLLVM Clang 14AOCC 4.02004006008001000SE +/- 1.48, N = 3SE +/- 1.11, N = 3825.00821.261. (CC) gcc options: -O3 -march=native -pthread -lm
OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Chimera 1080p 10-bitLLVM Clang 14AOCC 4.0140280420560700Min: 822.92 / Avg: 825 / Max: 827.85Min: 819.5 / Avg: 821.26 / Max: 823.321. (CC) gcc options: -O3 -march=native -pthread -lm

QuadRay

VectorChief's QuadRay is a real-time ray-tracing engine written to support SIMD across ARM, MIPS, PPC, and x86/x86_64 processors. QuadRay supports SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 usage on Intel/AMD CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 4KLLVM Clang 14AOCC 4.0612182430SE +/- 0.10, N = 3SE +/- 0.06, N = 326.2326.441. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread
OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 4KLLVM Clang 14AOCC 4.0612182430Min: 26.04 / Avg: 26.23 / Max: 26.39Min: 26.33 / Avg: 26.44 / Max: 26.521. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread

OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 1080pLLVM Clang 14AOCC 4.020406080100SE +/- 0.20, N = 3SE +/- 0.40, N = 3102.12101.561. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread
OpenBenchmarking.orgFPS, More Is BetterQuadRay 2022.05.25Scene: 1 - Resolution: 1080pLLVM Clang 14AOCC 4.020406080100Min: 101.75 / Avg: 102.12 / Max: 102.43Min: 100.93 / Avg: 101.56 / Max: 102.311. (CXX) g++ options: -O3 -pthread -lm -lstdc++ -lX11 -lXext -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very FastLLVM Clang 14AOCC 4.01122334455SE +/- 0.32, N = 3SE +/- 0.36, N = 348.0749.061. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very FastLLVM Clang 14AOCC 4.01020304050Min: 47.43 / Avg: 48.07 / Max: 48.41Min: 48.37 / Avg: 49.06 / Max: 49.591. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Ultra FastLLVM Clang 14AOCC 4.020406080100SE +/- 0.05, N = 3SE +/- 0.04, N = 387.3788.761. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Ultra FastLLVM Clang 14AOCC 4.020406080100Min: 87.29 / Avg: 87.37 / Max: 87.45Min: 88.71 / Avg: 88.76 / Max: 88.831. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 10 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.0306090120150SE +/- 2.91, N = 15SE +/- 3.87, N = 15139.03143.90
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 10 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.0306090120150Min: 107.18 / Avg: 139.03 / Max: 144.31Min: 104.2 / Avg: 143.9 / Max: 150.89

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 12 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.050100150200250SE +/- 1.63, N = 3SE +/- 1.07, N = 3200.79209.40
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.2Encoder Mode: Preset 12 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.04080120160200Min: 197.57 / Avg: 200.79 / Max: 202.82Min: 208.2 / Avg: 209.4 / Max: 211.54

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.0246810SE +/- 0.02, N = 3SE +/- 0.02, N = 36.436.431. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.03691215Min: 6.4 / Avg: 6.43 / Max: 6.47Min: 6.4 / Avg: 6.43 / Max: 6.471. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.020406080100SE +/- 0.06, N = 3SE +/- 0.23, N = 3104.05107.091. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.020406080100Min: 103.95 / Avg: 104.05 / Max: 104.17Min: 106.7 / Avg: 107.09 / Max: 107.491. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.04080120160200SE +/- 0.32, N = 3SE +/- 0.44, N = 3169.39169.541. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4KLLVM Clang 14AOCC 4.0306090120150Min: 168.82 / Avg: 169.39 / Max: 169.92Min: 168.78 / Avg: 169.54 / Max: 170.311. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4KLLVM Clang 14AOCC 4.0306090120150SE +/- 0.93, N = 15SE +/- 1.05, N = 15111.30112.091. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4KLLVM Clang 14AOCC 4.020406080100Min: 101.67 / Avg: 111.3 / Max: 112.85Min: 102.08 / Avg: 112.09 / Max: 114.041. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4KLLVM Clang 14AOCC 4.0306090120150SE +/- 0.09, N = 3SE +/- 0.23, N = 3120.22120.861. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4KLLVM Clang 14AOCC 4.020406080100Min: 120.04 / Avg: 120.22 / Max: 120.33Min: 120.41 / Avg: 120.86 / Max: 121.131. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4KLLVM Clang 14AOCC 4.020406080100SE +/- 0.08, N = 3SE +/- 0.19, N = 3109.73110.241. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4KLLVM Clang 14AOCC 4.020406080100Min: 109.64 / Avg: 109.73 / Max: 109.89Min: 109.92 / Avg: 110.24 / Max: 110.581. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KLLVM Clang 14AOCC 4.0918273645SE +/- 0.34, N = 3SE +/- 0.03, N = 336.9138.121. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KLLVM Clang 14AOCC 4.0816243240Min: 36.29 / Avg: 36.91 / Max: 37.47Min: 38.09 / Avg: 38.12 / Max: 38.171. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2LLVM Clang 14AOCC 4.0816243240SE +/- 0.22, N = 15SE +/- 0.08, N = 333.6432.281. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2LLVM Clang 14AOCC 4.0714212835Min: 33.13 / Avg: 33.63 / Max: 36.17Min: 32.17 / Avg: 32.28 / Max: 32.441. (CXX) g++ options: -O3 -fPIC -march=native -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6LLVM Clang 14AOCC 4.00.6931.3862.0792.7723.465SE +/- 0.027, N = 8SE +/- 0.004, N = 33.0802.9711. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6LLVM Clang 14AOCC 4.0246810Min: 3.03 / Avg: 3.08 / Max: 3.27Min: 2.96 / Avg: 2.97 / Max: 2.981. (CXX) g++ options: -O3 -fPIC -march=native -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessLLVM Clang 14AOCC 4.01.04632.09263.13894.18525.2315SE +/- 0.032, N = 3SE +/- 0.011, N = 34.6504.5041. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessLLVM Clang 14AOCC 4.0246810Min: 4.61 / Avg: 4.65 / Max: 4.71Min: 4.48 / Avg: 4.5 / Max: 4.521. (CXX) g++ options: -O3 -fPIC -march=native -lm

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelLLVM Clang 14AOCC 4.0612182430SE +/- 0.15, N = 3SE +/- 0.01, N = 327.1026.971. (CC) gcc options: -lm -lpthread -O3 -march=native
OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelLLVM Clang 14AOCC 4.0612182430Min: 26.9 / Avg: 27.1 / Max: 27.39Min: 26.95 / Avg: 26.97 / Max: 271. (CC) gcc options: -lm -lpthread -O3 -march=native

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256LLVM Clang 14AOCC 4.08000M16000M24000M32000M40000MSE +/- 5864900.49, N = 3SE +/- 19509012.02, N = 337107990770372854928471. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256LLVM Clang 14AOCC 4.06000M12000M18000M24000M30000MMin: 37096325130 / Avg: 37107990770 / Max: 37114884630Min: 37246557080 / Avg: 37285492846.67 / Max: 373071537201. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096LLVM Clang 14AOCC 4.013002600390052006500SE +/- 5.11, N = 3SE +/- 27.80, N = 35998.45971.91. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096LLVM Clang 14AOCC 4.010002000300040005000Min: 5989 / Avg: 5998.37 / Max: 6006.6Min: 5916.8 / Avg: 5971.87 / Max: 6006.11. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096LLVM Clang 14AOCC 4.080K160K240K320K400KSE +/- 1647.21, N = 3SE +/- 114.01, N = 3391464.0393216.41. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096LLVM Clang 14AOCC 4.070K140K210K280K350KMin: 388169.7 / Avg: 391464 / Max: 393136.1Min: 393020.4 / Avg: 393216.43 / Max: 393415.31. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=native -lssl -lcrypto -ldl

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57LLVM Clang 14AOCC 4.0400M800M1200M1600M2000MSE +/- 1386842.94, N = 3SE +/- 2171277.15, N = 3166610000017509333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57LLVM Clang 14AOCC 4.0300M600M900M1200M1500MMin: 1664200000 / Avg: 1666100000 / Max: 1668800000Min: 1748100000 / Avg: 1750933333.33 / Max: 17552000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57LLVM Clang 14AOCC 4.0400M800M1200M1600M2000MSE +/- 5397633.23, N = 3SE +/- 1442220.51, N = 3174293333318688000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57LLVM Clang 14AOCC 4.0300M600M900M1200M1500MMin: 1732200000 / Avg: 1742933333.33 / Max: 1749300000Min: 1866000000 / Avg: 1868800000 / Max: 18708000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumLLVM Clang 14AOCC 4.04080120160200SE +/- 0.85, N = 3SE +/- 0.83, N = 3156.20162.171. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumLLVM Clang 14AOCC 4.0306090120150Min: 154.5 / Avg: 156.2 / Max: 157.21Min: 160.51 / Avg: 162.17 / Max: 163.061. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughLLVM Clang 14AOCC 4.0510152025SE +/- 0.08, N = 3SE +/- 0.05, N = 319.2419.861. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughLLVM Clang 14AOCC 4.0510152025Min: 19.09 / Avg: 19.24 / Max: 19.33Min: 19.76 / Avg: 19.86 / Max: 19.921. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveLLVM Clang 14AOCC 4.00.44950.8991.34851.7982.2475SE +/- 0.0036, N = 3SE +/- 0.0025, N = 31.96951.99791. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveLLVM Clang 14AOCC 4.0246810Min: 1.96 / Avg: 1.97 / Max: 1.98Min: 2 / Avg: 2 / Max: 21. (CXX) g++ options: -O3 -march=native -flto -pthread

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Church FacadeLLVM Clang 14AOCC 4.010002000300040005000SE +/- 19.68, N = 3SE +/- 36.11, N = 3470447051. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Church FacadeLLVM Clang 14AOCC 4.08001600240032004000Min: 4665 / Avg: 4704.33 / Max: 4725Min: 4654 / Avg: 4705.33 / Max: 47751. (CXX) g++ options: -O3 -march=native