AmpereOne GCC Clang Compiler Benchmarking

AmpereOne compiler testing by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409062-NE-AMPEREONE13

Result Identifier | Date Run | Test Duration
GCC 13.2 - Default | September 05 | 2 Hours, 50 Minutes
Clang 18.1.3 | September 05 | 2 Hours, 35 Minutes
Clang 19.1.0 | September 06 | 2 Hours, 12 Minutes
Clang 20.0 Git | September 06 | 2 Hours, 23 Minutes
Average test duration: 2 Hours, 30 Minutes



AmpereOne GCC Clang Compiler Benchmarking

Processor: AmpereOne @ 3.20GHz (192 Cores)
Motherboard: Supermicro ARS-211M-NR R13SPD v1.02 (T20240726102529 BIOS)
Chipset: Ampere Computing LLC Device e208
Memory: 8 x 64GB DDR5-5200MT/s
Disk: 3841GB SAMSUNG MZQL23T8HCLS-00A07 + 960GB SAMSUNG MZ1L2960HCJR-00A07
Graphics: ASPEED
Monitor: VGA HDMI
Network: 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb + 2 x Mellanox MT2892
OS: Ubuntu 24.04
Kernel: 6.11.0-061100rc6daily20240904-generic-64k (aarch64)
Compilers: GCC 13.2.0, Clang 18.1.3, Clang 19.1.0, Clang 20.0.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: madvise
- CXXFLAGS="-O3 -mcpu=ampere1 -flto" CFLAGS="-O3 -mcpu=ampere1 -flto"
- GCC 13.2 - Default: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result Overview (chart): relative performance of GCC 13.2 - Default, Clang 18.1.3, Clang 19.1.0, and Clang 20.0 Git on a 100% to 133% scale, covering Liquid-DSP, libavif avifenc, C-Ray, WebP Image Encode, TSCP, POV-Ray, miniBUDE, GROMACS, LAMMPS Molecular Dynamics Simulator, Opus Codec Encoding, FLAC Audio Encoding, QuantLib, Gcrypt Library, Helsing, 7-Zip Compression, GraphicsMagick, Etcpak, SecureMark, and Primesieve.

AmpereOne GCC Clang Compiler Benchmarking

Test | GCC 13.2 - Default | Clang 18.1.3 | Clang 19.1.0 | Clang 20.0 Git
compress-7zip: Compression Rating | 786951 | 755498 | 748668 | 750631
compress-7zip: Decompression Rating | 891624 | 975049 | 986522 | 978715
c-ray: 4K - 16 | 21.693 | 18.635 | 18.765 | 20.190
c-ray: 5K - 16 | 38.179 | 32.802 | 33.061 | 35.555
etcpak: Multi-Threaded - ETC2 | 4303.508 | 4339.779 | 4381.343 | 4385.161
encode-flac: WAV To FLAC | 23.999 | 23.046 | 22.802 | 22.818
gcrypt | 324.220 | 310.174 | 310.962 | 310.331
graphics-magick: Swirl | 990 | 995 | 1001 | 989
graphics-magick: Rotate | 258 | 268 | 267 | 256
graphics-magick: Sharpen | 466 | 475 | 473 | 478
graphics-magick: Enhanced | 373 | 368 | 353 | 356
graphics-magick: Resizing | 219 | 215 | 217 | 216
graphics-magick: Noise-Gaussian | 259 | 247 | 245 | 245
graphics-magick: HWB Color Space | 429 | 397 | 402 | 404
gromacs: MPI CPU - water_GMX50_bare | 7.497 | 6.888 | 6.894 | 6.884
helsing: 14 digit | 33.567 | 34.373 | 34.447 | 34.456
lammps: 20k Atoms | 55.018 | 55.330 | 55.049 | 55.224
lammps: Rhodopsin Protein | 47.620 | 54.526 | 54.408 | 47.414
avifenc: 0 | 185.855 | 264.168 | 235.759 | 235.730
avifenc: 2 | 115.602 | 197.154 | 170.819 | 171.375
avifenc: 6 | 2.908 | 2.992 | 2.949 | 2.954
avifenc: 6, Lossless | 5.687 | 5.730 | 5.676 | 5.685
avifenc: 10, Lossless | 4.411 | 4.411 | 4.385 | 4.369
liquid-dsp: 128 - 256 - 32 | 2197800000 | 3500833333 | 3499066667 | 3499300000
liquid-dsp: 192 - 256 - 32 | 3148866667 | 5249800000 | 5246666667 | 5247633333
liquid-dsp: 128 - 256 - 512 | 152760000 | 164420000 | 164456667 | 164463333
liquid-dsp: 192 - 256 - 512 | 224140000 | 246590000 | 246603333 | 246653333
minibude: OpenMP - BM1 (GFInst/s) | 746.973 | 690.388 | 684.546 | 684.363
minibude: OpenMP - BM1 (Billion Interactions/s) | 29.879 | 27.616 | 27.382 | 27.375
minibude: OpenMP - BM2 (GFInst/s) | 783.599 | 719.313 | 719.624 | 719.802
minibude: OpenMP - BM2 (Billion Interactions/s) | 31.344 | 28.773 | 28.785 | 28.792
encode-opus: WAV To Opus Encode | 72.952 | 68.876 | 68.142 | 68.570
povray: Trace Time | 7.494 | 8.255 | 7.973 | 7.971
primesieve: 1e13 | 14.006 | 14.039 | 14.008 | 13.995
quantlib: Multi-Threaded | 300689.6 | 313766.7 | 314045.6 | 314473.7
securemark: SecureMark-TLS | 171949 | 172428 | 174043 | 173477
simdjson: Kostya | 1.18 | 1.15 | - | -
simdjson: TopTweet | 2.21 | 2.18 | - | -
simdjson: LargeRand | 0.65 | 0.64 | - | -
simdjson: PartialTweets | 2.15 | 2.12 | - | -
simdjson: DistinctUserID | 2.22 | 2.2 | - | -
tscp: AI Chess Performance | 1387350 | 1549749 | 1578160 | 1579974
webp: Default | 9.26 | 11.00 | 11.20 | 11.28
webp: Quality 100 | 6.60 | 7.43 | 7.56 | 7.56
webp: Quality 100, Lossless | 1.12 | 1.19 | 1.18 | 1.18
webp: Quality 100, Highest Compression | 2.80 | 3.33 | 3.43 | 3.44
webp: Quality 100, Lossless, Highest Compression | 0.39 | 0.46 | 0.46 | 0.46
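Results like these can be condensed into one relative score per compiler, in the spirit of a normalized overview. The sketch below is only an illustration: it takes four results from this file, normalizes each to the GCC 13.2 run (inverting the ratio for "Fewer Is Better" tests), and takes the geometric mean. The names `RESULTS` and `relative_scores` are hypothetical, the subset is arbitrary, and this is not necessarily how the Phoronix Test Suite computes its own overview.

```python
from math import prod

# (test name, higher_is_better, {compiler: result}); values copied from this result file.
RESULTS = [
    ("c-ray: 4K - 16", False, {
        "GCC 13.2 - Default": 21.693, "Clang 18.1.3": 18.635,
        "Clang 19.1.0": 18.765, "Clang 20.0 Git": 20.190}),
    ("etcpak: Multi-Threaded - ETC2", True, {
        "GCC 13.2 - Default": 4303.508, "Clang 18.1.3": 4339.779,
        "Clang 19.1.0": 4381.343, "Clang 20.0 Git": 4385.161}),
    ("povray: Trace Time", False, {
        "GCC 13.2 - Default": 7.494, "Clang 18.1.3": 8.255,
        "Clang 19.1.0": 7.973, "Clang 20.0 Git": 7.971}),
    ("tscp: AI Chess Performance", True, {
        "GCC 13.2 - Default": 1387350, "Clang 18.1.3": 1549749,
        "Clang 19.1.0": 1578160, "Clang 20.0 Git": 1579974}),
]


def relative_scores(results, baseline):
    """Geometric mean of per-test ratios against the baseline compiler."""
    ratios = {}
    for _name, higher_is_better, values in results:
        base = values[baseline]
        for compiler, value in values.items():
            # For time-based tests a smaller value is better, so invert the ratio.
            ratio = value / base if higher_is_better else base / value
            ratios.setdefault(compiler, []).append(ratio)
    return {c: prod(r) ** (1.0 / len(r)) for c, r in ratios.items()}


scores = relative_scores(RESULTS, "GCC 13.2 - Default")
for compiler, score in scores.items():
    print(f"{compiler}: {score * 100:.1f}%")
```

A geometric mean is used here so that a single test with a large ratio does not dominate the summary the way it would with an arithmetic mean; a score above 100% means the compiler was faster than the baseline on balance across the chosen subset.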

7-Zip Compression

7-Zip Compression 24.05, Test: Compression Rating (MIPS, More Is Better)
Clang 19.1.0: 748668 (SE +/- 1725.30, N = 3)
Clang 20.0 Git: 750631 (SE +/- 594.21, N = 3)
Clang 18.1.3: 755498 (SE +/- 700.10, N = 3)
GCC 13.2 - Default: 786951 (SE +/- 61.46, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 24.05, Test: Decompression Rating (MIPS, More Is Better)
GCC 13.2 - Default: 891624 (SE +/- 373.10, N = 3)
Clang 18.1.3: 975049 (SE +/- 325.41, N = 3)
Clang 20.0 Git: 978715 (SE +/- 1082.06, N = 3)
Clang 19.1.0: 986522 (SE +/- 763.73, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

C-Ray

C-Ray 2.0, Resolution: 4K - Rays Per Pixel: 16 (Seconds, Fewer Is Better)
GCC 13.2 - Default: 21.69 (SE +/- 0.01, N = 3)
Clang 20.0 Git: 20.19 (SE +/- 0.01, N = 3)
Clang 19.1.0: 18.77 (SE +/- 0.00, N = 3)
Clang 18.1.3: 18.64 (SE +/- 0.00, N = 3)
(CC) gcc options: -lpthread -lm

C-Ray 2.0, Resolution: 5K - Rays Per Pixel: 16 (Seconds, Fewer Is Better)
GCC 13.2 - Default: 38.18 (SE +/- 0.01, N = 3)
Clang 20.0 Git: 35.56 (SE +/- 0.00, N = 3)
Clang 19.1.0: 33.06 (SE +/- 0.02, N = 3)
Clang 18.1.3: 32.80 (SE +/- 0.01, N = 3)
(CC) gcc options: -lpthread -lm

Etcpak

Etcpak 2.0, Benchmark: Multi-Threaded - Configuration: ETC2 (Mpx/s, More Is Better)
GCC 13.2 - Default: 4303.51 (SE +/- 2.14, N = 3)
Clang 18.1.3: 4339.78 (SE +/- 1.86, N = 3)
Clang 19.1.0: 4381.34 (SE +/- 4.76, N = 3)
Clang 20.0 Git: 4385.16 (SE +/- 5.15, N = 3)
(CXX) g++ options: -flto -O3 -mcpu=ampere1 -pthread

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC audio format ten times using the --best preset settings. Learn more via the OpenBenchmarking.org test page.

FLAC Audio Encoding 1.4, WAV To FLAC (Seconds, Fewer Is Better)
GCC 13.2 - Default: 24.00 (SE +/- 0.01, N = 5)
Clang 18.1.3: 23.05 (SE +/- 0.01, N = 5)
Clang 20.0 Git: 22.82 (SE +/- 0.01, N = 5)
Clang 19.1.0: 22.80 (SE +/- 0.01, N = 5)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Gcrypt Library

Gcrypt Library 1.10.3 (Seconds, Fewer Is Better)
GCC 13.2 - Default: 324.22 (SE +/- 0.60, N = 3)
Clang 19.1.0: 310.96 (SE +/- 0.72, N = 3)
Clang 20.0 Git: 310.33 (SE +/- 0.36, N = 3)
Clang 18.1.3: 310.17 (SE +/- 0.77, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.43, Operation: Swirl (Iterations Per Minute, More Is Better)
Clang 20.0 Git: 989 (SE +/- 0.33, N = 3; -lxml2 -lomp)
GCC 13.2 - Default: 990 (SE +/- 1.00, N = 3; -lgomp)
Clang 18.1.3: 995 (SE +/- 0.88, N = 3; -lxml2 -lomp)
Clang 19.1.0: 1001 (SE +/- 1.67, N = 3; -lxml2 -lomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43, Operation: Rotate (Iterations Per Minute, More Is Better)
Clang 20.0 Git: 256 (SE +/- 13.19, N = 12; -lxml2 -lomp)
GCC 13.2 - Default: 258 (SE +/- 6.33, N = 12; -lgomp)
Clang 19.1.0: 267 (SE +/- 1.00, N = 3; -lxml2 -lomp)
Clang 18.1.3: 268 (SE +/- 2.00, N = 3; -lxml2 -lomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43, Operation: Sharpen (Iterations Per Minute, More Is Better)
GCC 13.2 - Default: 466 (SE +/- 3.13, N = 15; -lgomp)
Clang 19.1.0: 473 (SE +/- 0.33, N = 3; -lxml2 -lomp)
Clang 18.1.3: 475 (SE +/- 0.58, N = 3; -lxml2 -lomp)
Clang 20.0 Git: 478 (SE +/- 0.58, N = 3; -lxml2 -lomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43, Operation: Enhanced (Iterations Per Minute, More Is Better)
Clang 19.1.0: 353 (SE +/- 0.33, N = 3; -lxml2 -lomp)
Clang 20.0 Git: 356 (SE +/- 1.00, N = 3; -lxml2 -lomp)
Clang 18.1.3: 368 (SE +/- 0.67, N = 3; -lxml2 -lomp)
GCC 13.2 - Default: 373 (SE +/- 0.58, N = 3; -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43, Operation: Resizing (Iterations Per Minute, More Is Better)
Clang 18.1.3: 215 (SE +/- 2.33, N = 3; -lxml2 -lomp)
Clang 20.0 Git: 216 (SE +/- 1.20, N = 3; -lxml2 -lomp)
Clang 19.1.0: 217 (SE +/- 0.58, N = 3; -lxml2 -lomp)
GCC 13.2 - Default: 219 (SE +/- 2.53, N = 4; -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
Clang 19.1.0: 245 (SE +/- 0.33, N = 3; -lxml2 -lomp)
Clang 20.0 Git: 245 (SE +/- 0.88, N = 3; -lxml2 -lomp)
Clang 18.1.3: 247 (SE +/- 0.33, N = 3; -lxml2 -lomp)
GCC 13.2 - Default: 259 (SE +/- 0.33, N = 3; -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
Clang 18.1.3: 397 (SE +/- 0.58, N = 3; -lxml2 -lomp)
Clang 19.1.0: 402 (SE +/- 0.33, N = 3; -lxml2 -lomp)
Clang 20.0 Git: 404 (SE +/- 2.67, N = 3; -lxml2 -lomp)
GCC 13.2 - Default: 429 (SE +/- 0.33, N = 3; -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GROMACS

GROMACS (GROningen MAchine for Chemical Simulations) is a molecular dynamics package, tested here with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2024, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
Clang 20.0 Git: 6.884 (SE +/- 0.010, N = 3)
Clang 18.1.3: 6.888 (SE +/- 0.002, N = 3)
Clang 19.1.0: 6.894 (SE +/- 0.001, N = 3)
GCC 13.2 - Default: 7.497 (SE +/- 0.014, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.

Helsing 1.0-beta, Digit Range: 14 digit (Seconds, Fewer Is Better)
Clang 20.0 Git: 34.46 (SE +/- 0.05, N = 3)
Clang 19.1.0: 34.45 (SE +/- 0.06, N = 3)
Clang 18.1.3: 34.37 (SE +/- 0.06, N = 3)
GCC 13.2 - Default: 33.57 (SE +/- 0.02, N = 3)
(CC) gcc options: -O2 -pthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: 20k Atoms (ns/day, More Is Better)
GCC 13.2 - Default: 55.02 (SE +/- 0.13, N = 3)
Clang 19.1.0: 55.05 (SE +/- 0.06, N = 3)
Clang 20.0 Git: 55.22 (SE +/- 0.05, N = 3)
Clang 18.1.3: 55.33 (SE +/- 0.07, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: Rhodopsin Protein (ns/day, More Is Better)
Clang 20.0 Git: 47.41 (SE +/- 3.02, N = 15)
GCC 13.2 - Default: 47.62 (SE +/- 4.33, N = 12)
Clang 19.1.0: 54.41 (SE +/- 0.19, N = 3)
Clang 18.1.3: 54.53 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 1.0, Encoder Speed: 0 (Seconds, Fewer Is Better)
Clang 18.1.3: 264.17 (SE +/- 0.07, N = 3)
Clang 19.1.0: 235.76 (SE +/- 0.14, N = 3)
Clang 20.0 Git: 235.73 (SE +/- 0.13, N = 3)
GCC 13.2 - Default: 185.86 (SE +/- 2.22, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0, Encoder Speed: 2 (Seconds, Fewer Is Better)
Clang 18.1.3: 197.15 (SE +/- 0.10, N = 3)
Clang 20.0 Git: 171.38 (SE +/- 0.18, N = 3)
Clang 19.1.0: 170.82 (SE +/- 0.30, N = 3)
GCC 13.2 - Default: 115.60 (SE +/- 0.98, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0, Encoder Speed: 6 (Seconds, Fewer Is Better)
Clang 18.1.3: 2.992 (SE +/- 0.002, N = 3)
Clang 20.0 Git: 2.954 (SE +/- 0.006, N = 3)
Clang 19.1.0: 2.949 (SE +/- 0.003, N = 3)
GCC 13.2 - Default: 2.908 (SE +/- 0.004, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0, Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
Clang 18.1.3: 5.730 (SE +/- 0.004, N = 3)
GCC 13.2 - Default: 5.687 (SE +/- 0.007, N = 3)
Clang 20.0 Git: 5.685 (SE +/- 0.006, N = 3)
Clang 19.1.0: 5.676 (SE +/- 0.009, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0, Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
Clang 18.1.3: 4.411 (SE +/- 0.001, N = 3)
GCC 13.2 - Default: 4.411 (SE +/- 0.003, N = 3)
Clang 19.1.0: 4.385 (SE +/- 0.002, N = 3)
Clang 20.0 Git: 4.369 (SE +/- 0.004, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6, Threads: 128 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
GCC 13.2 - Default: 2197800000 (SE +/- 5892367.95, N = 3)
Clang 19.1.0: 3499066667 (SE +/- 66666.67, N = 3)
Clang 20.0 Git: 3499300000 (SE +/- 57735.03, N = 3)
Clang 18.1.3: 3500833333 (SE +/- 120185.04, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP 1.6, Threads: 192 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
GCC 13.2 - Default: 3148866667 (SE +/- 440958.55, N = 3)
Clang 19.1.0: 5246666667 (SE +/- 470224.53, N = 3)
Clang 20.0 Git: 5247633333 (SE +/- 33333.33, N = 3)
Clang 18.1.3: 5249800000 (SE +/- 100000.00, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP 1.6, Threads: 128 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
GCC 13.2 - Default: 152760000 (SE +/- 448441.75, N = 3)
Clang 18.1.3: 164420000 (SE +/- 5773.50, N = 3)
Clang 19.1.0: 164456667 (SE +/- 3333.33, N = 3)
Clang 20.0 Git: 164463333 (SE +/- 6666.67, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP 1.6, Threads: 192 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
GCC 13.2 - Default: 224140000 (SE +/- 5773.50, N = 3)
Clang 18.1.3: 246590000 (SE +/- 15275.25, N = 3)
Clang 19.1.0: 246603333 (SE +/- 8819.17, N = 3)
Clang 20.0 Git: 246653333 (SE +/- 3333.33, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

miniBUDE

MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 (GFInst/s, More Is Better)
Clang 20.0 Git: 684.36 (SE +/- 2.49, N = 3)
Clang 19.1.0: 684.55 (SE +/- 1.59, N = 3)
Clang 18.1.3: 690.39 (SE +/- 7.29, N = 3)
GCC 13.2 - Default: 746.97 (SE +/- 4.39, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 (Billion Interactions/s, More Is Better)
Clang 20.0 Git: 27.38 (SE +/- 0.10, N = 3)
Clang 19.1.0: 27.38 (SE +/- 0.06, N = 3)
Clang 18.1.3: 27.62 (SE +/- 0.29, N = 3)
GCC 13.2 - Default: 29.88 (SE +/- 0.18, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (GFInst/s, More Is Better)
Clang 18.1.3: 719.31 (SE +/- 0.57, N = 3)
Clang 19.1.0: 719.62 (SE +/- 0.64, N = 3)
Clang 20.0 Git: 719.80 (SE +/- 0.71, N = 3)
GCC 13.2 - Default: 783.60 (SE +/- 0.44, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, More Is Better)
Clang 18.1.3: 28.77 (SE +/- 0.02, N = 3)
Clang 19.1.0: 28.79 (SE +/- 0.03, N = 3)
Clang 20.0 Git: 28.79 (SE +/- 0.03, N = 3)
GCC 13.2 - Default: 31.34 (SE +/- 0.02, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus five times. Learn more via the OpenBenchmarking.org test page.

Opus Codec Encoding 1.4, WAV To Opus Encode (Seconds, Fewer Is Better)
GCC 13.2 - Default: 72.95 (SE +/- 0.00, N = 5)
Clang 18.1.3: 68.88 (SE +/- 0.00, N = 5)
Clang 20.0 Git: 68.57 (SE +/- 0.00, N = 5)
Clang 19.1.0: 68.14 (SE +/- 0.00, N = 5)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7, Trace Time (Seconds, Fewer Is Better)
Clang 18.1.3: 8.255 (SE +/- 0.011, N = 3)
Clang 19.1.0: 7.973 (SE +/- 0.026, N = 3)
Clang 20.0 Git: 7.971 (SE +/- 0.026, N = 3)
GCC 13.2 - Default: 7.494 (SE +/- 0.024, N = 3)
(CXX) g++ options: -pipe -O3 -ffast-math -mcpu=ampere1 -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

Primesieve 12.1, Length: 1e13 (Seconds, Fewer Is Better)
Clang 18.1.3: 14.04 (SE +/- 0.02, N = 3)
Clang 19.1.0: 14.01 (SE +/- 0.04, N = 3)
GCC 13.2 - Default: 14.01 (SE +/- 0.08, N = 3)
Clang 20.0 Git: 14.00 (SE +/- 0.09, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.32, Configuration: Multi-Threaded (MFLOPS, More Is Better)
GCC 13.2 - Default: 300689.6 (SE +/- 98.22, N = 3)
Clang 18.1.3: 313766.7 (SE +/- 37.36, N = 3)
Clang 19.1.0: 314045.6 (SE +/- 31.64, N = 3)
Clang 20.0 Git: 314473.7 (SE +/- 88.76, N = 3)
(CXX) g++ options: -O3 -march=native -fPIE -pie

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

SecureMark 1.0.4, Benchmark: SecureMark-TLS (marks, More Is Better)
GCC 13.2 - Default: 171949 (SE +/- 68.80, N = 3)
Clang 18.1.3: 172428 (SE +/- 134.78, N = 3)
Clang 20.0 Git: 173477 (SE +/- 56.65, N = 3)
Clang 19.1.0: 174043 (SE +/- 78.47, N = 3)
(CC) gcc options: -pedantic -O3

simdjson

simdjson 3.10, Throughput Test: Kostya (GB/s, More Is Better)
Clang 18.1.3: 1.15 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 1.18 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: Kostya

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10, Throughput Test: TopTweet (GB/s, More Is Better)
Clang 18.1.3: 2.18 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.21 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: TopTweet

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10, Throughput Test: LargeRandom (GB/s, More Is Better)
Clang 18.1.3: 0.64 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 0.65 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: LargeRandom

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10, Throughput Test: PartialTweets (GB/s, More Is Better)
Clang 18.1.3: 2.12 (SE +/- 0.01, N = 3)
GCC 13.2 - Default: 2.15 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: PartialTweets

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10, Throughput Test: DistinctUserID (GB/s, More Is Better)
Clang 18.1.3: 2.20 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.22 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: DistinctUserID

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
GCC 13.2 - Default: 1387350 (SE +/- 0.00, N = 5)
Clang 18.1.3: 1549749 (SE +/- 1747.58, N = 5)
Clang 19.1.0: 1578160 (SE +/- 0.00, N = 5)
Clang 20.0 Git: 1579974 (SE +/- 1110.84, N = 5)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -march=native

WebP Image Encode

WebP Image Encode 1.4, Encode Settings: Default (MP/s, More Is Better)
GCC 13.2 - Default: 9.26 (SE +/- 0.07, N = 3)
Clang 18.1.3: 11.00 (SE +/- 0.00, N = 3)
Clang 19.1.0: 11.20 (SE +/- 0.01, N = 3)
Clang 20.0 Git: 11.28 (SE +/- 0.00, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg
Extra option listed for two of the results (which two is not recoverable here): -ltiff

WebP Image Encode 1.4, Encode Settings: Quality 100 (MP/s, More Is Better)
GCC 13.2 - Default: 6.60 (SE +/- 0.02, N = 3)
Clang 18.1.3: 7.43 (SE +/- 0.00, N = 3)
Clang 19.1.0: 7.56 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 7.56 (SE +/- 0.00, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg
Extra option listed for two of the results (which two is not recoverable here): -ltiff

WebP Image Encode 1.4, Encode Settings: Quality 100, Lossless (MP/s, More Is Better)
GCC 13.2 - Default: 1.12 (SE +/- 0.00, N = 3)
Clang 19.1.0: 1.18 (SE +/- 0.01, N = 3)
Clang 20.0 Git: 1.18 (SE +/- 0.01, N = 3)
Clang 18.1.3: 1.19 (SE +/- 0.00, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg
Extra option listed for two of the results (which two is not recoverable here): -ltiff

WebP Image Encode 1.4, Encode Settings: Quality 100, Highest Compression (MP/s, More Is Better)
GCC 13.2 - Default: 2.80 (SE +/- 0.00, N = 3)
Clang 18.1.3: 3.33 (SE +/- 0.00, N = 3)
Clang 19.1.0: 3.43 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 3.44 (SE +/- 0.00, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg
Extra option listed for two of the results (which two is not recoverable here): -ltiff

WebP Image Encode 1.4, Encode Settings: Quality 100, Lossless, Highest Compression (MP/s, More Is Better)
GCC 13.2 - Default: 0.39 (SE +/- 0.00, N = 3)
Clang 18.1.3: 0.46 (SE +/- 0.00, N = 3)
Clang 19.1.0: 0.46 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 0.46 (SE +/- 0.00, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg
Extra option listed for two of the results (which two is not recoverable here): -ltiff