AmpereOne GCC Clang Compiler Benchmarking

AmpereOne compiler testing by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409062-NE-AMPEREONE13

Run Management

Result Identifier     Date           Test Duration
GCC 13.2 - Default    September 05   2 Hours, 50 Minutes
Clang 18.1.3          September 05   2 Hours, 35 Minutes
Clang 19.1.0          September 06   2 Hours, 12 Minutes
Clang 20.0 Git        September 06   2 Hours, 23 Minutes
Average                              2 Hours, 30 Minutes

Processor: AmpereOne @ 3.20GHz (192 Cores)
Motherboard: Supermicro ARS-211M-NR R13SPD v1.02 (T20240726102529 BIOS)
Chipset: Ampere Computing LLC Device e208
Memory: 8 x 64GB DDR5-5200MT/s
Disk: 3841GB SAMSUNG MZQL23T8HCLS-00A07 + 960GB SAMSUNG MZ1L2960HCJR-00A07
Graphics: ASPEED
Monitor: VGA HDMI
Network: 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb + 2 x Mellanox MT2892
OS: Ubuntu 24.04
Kernel: 6.11.0-061100rc6daily20240904-generic-64k (aarch64)
Compilers: GCC 13.2.0, Clang 18.1.3, Clang 19.1.0, Clang 20.0.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: madvise
- CXXFLAGS="-O3 -mcpu=ampere1 -flto" CFLAGS="-O3 -mcpu=ampere1 -flto"
- GCC 13.2 - Default: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
- Security: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
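
The system logs list the compiler flags exported for every run, and the page header gives the command for reproducing the comparison. A minimal Python sketch of combining the two is shown here. The helper name `build_invocation` is hypothetical, and the use of the `CC`/`CXX` environment variables to switch between GCC and Clang is an assumption, since the result file does not say how the compiler was selected.

```python
import os

# Result identifier and flags as they appear in this result file.
RESULT_ID = "2409062-NE-AMPEREONE13"
FLAGS = "-O3 -mcpu=ampere1 -flto"


def build_invocation(cc=None, cxx=None, base_env=None):
    """Return (command, env) for a Phoronix Test Suite run with the logged flags.

    cc / cxx are optional compiler names; exporting them as CC / CXX is an
    assumption about how the compiler would be switched, not something the
    result file documents.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CFLAGS"] = FLAGS
    env["CXXFLAGS"] = FLAGS
    if cc:
        env["CC"] = cc
    if cxx:
        env["CXX"] = cxx
    command = ["phoronix-test-suite", "benchmark", RESULT_ID]
    return command, env


if __name__ == "__main__":
    command, env = build_invocation(cc="clang", cxx="clang++")
    print("CFLAGS=" + env["CFLAGS"], "CXXFLAGS=" + env["CXXFLAGS"])
    print(" ".join(command))
    # To actually start the run, pass both to subprocess.run(command, env=env).
```

The sketch only prints the command so that it can be inspected before a multi-hour benchmark run is started.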

[Figure: Result Overview (Phoronix Test Suite). Relative performance of GCC 13.2 - Default, Clang 18.1.3, Clang 19.1.0 and Clang 20.0 Git on a 100% to 133% axis for Liquid-DSP, libavif avifenc, C-Ray, WebP Image Encode, TSCP, POV-Ray, miniBUDE, GROMACS, LAMMPS Molecular Dynamics Simulator, Opus Codec Encoding, FLAC Audio Encoding, QuantLib, Gcrypt Library, Helsing, 7-Zip Compression, GraphicsMagick, Etcpak, SecureMark and Primesieve.]

AmpereOne GCC Clang Compiler Benchmarking (OpenBenchmarking.org)

Test                                                GCC 13.2 - Default  Clang 18.1.3  Clang 19.1.0  Clang 20.0 Git
minibude: OpenMP - BM1 (Billion Interactions/s)     29.879              27.616        27.382        27.375
minibude: OpenMP - BM2 (Billion Interactions/s)     31.344              28.773        28.785        28.792
simdjson: Kostya                                    1.18                1.15          -             -
simdjson: TopTweet                                  2.21                2.18          -             -
simdjson: LargeRandom                               0.65                0.64          -             -
simdjson: PartialTweets                             2.15                2.12          -             -
simdjson: DistinctUserID                            2.22                2.20          -             -
minibude: OpenMP - BM1 (GFInst/s)                   746.973             690.388       684.546       684.363
minibude: OpenMP - BM2 (GFInst/s)                   783.599             719.313       719.624       719.802
graphics-magick: Swirl                              990                 995           1001          989
graphics-magick: Rotate                             258                 268           267           256
graphics-magick: Sharpen                            466                 475           473           478
graphics-magick: Enhanced                           373                 368           353           356
graphics-magick: Resizing                           219                 215           217           216
graphics-magick: Noise-Gaussian                     259                 247           245           245
graphics-magick: HWB Color Space                    429                 397           402           404
securemark: SecureMark-TLS                          171949              172428        174043        173477
quantlib: Multi-Threaded                            300689.6            313766.7      314045.6      314473.7
compress-7zip: Compression Rating                   786951              755498        748668        750631
compress-7zip: Decompression Rating                 891624              975049        986522        978715
webp: Default                                       9.26                11.00         11.20         11.28
webp: Quality 100                                   6.60                7.43          7.56          7.56
webp: Quality 100, Lossless                         1.12                1.19          1.18          1.18
webp: Quality 100, Highest Compression              2.80                3.33          3.43          3.44
webp: Quality 100, Lossless, Highest Compression    0.39                0.46          0.46          0.46
etcpak: Multi-Threaded - ETC2                       4303.508            4339.779      4381.343      4385.161
tscp: AI Chess Performance                          1387350             1549749       1578160       1579974
gromacs: MPI CPU - water_GMX50_bare                 7.497               6.888         6.894         6.884
lammps: 20k Atoms                                   55.018              55.330        55.049        55.224
lammps: Rhodopsin Protein                           47.620              54.526        54.408        47.414
liquid-dsp: 128 - 256 - 32                          2197800000          3500833333    3499066667    3499300000
liquid-dsp: 192 - 256 - 32                          3148866667          5249800000    5246666667    5247633333
liquid-dsp: 128 - 256 - 512                         152760000           164420000     164456667     164463333
liquid-dsp: 192 - 256 - 512                         224140000           246590000     246603333     246653333
avifenc: 0                                          185.855             264.168       235.759       235.730
avifenc: 2                                          115.602             197.154       170.819       171.375
avifenc: 6                                          2.908               2.992         2.949         2.954
avifenc: 6, Lossless                                5.687               5.730         5.676         5.685
avifenc: 10, Lossless                               4.411               4.411         4.385         4.369
c-ray: 4K - 16                                      21.693              18.635        18.765        20.190
c-ray: 5K - 16                                      38.179              32.802        33.061        35.555
povray: Trace Time                                  7.494               8.255         7.973         7.971
primesieve: 1e13                                    14.006              14.039        14.008        13.995
encode-flac: WAV To FLAC                            23.999              23.046        22.802        22.818
encode-opus: WAV To Opus Encode                     72.952              68.876        68.142        68.570
gcrypt:                                             324.220             310.174       310.962       310.331
helsing: 14 digit                                   33.567              34.373        34.447        34.456
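
The results table mixes "More Is Better" and "Fewer Is Better" tests, so the raw values cannot be averaged directly. How the Phoronix Test Suite derives its overview percentages is not stated on this page. The following is a minimal Python sketch, assuming each run is normalized to GCC 13.2 - Default, that time-based results are inverted, and that the ratios are combined with a geometric mean. The four rows are copied from the table; the function names are hypothetical.

```python
from math import prod

BASELINE = "GCC 13.2 - Default"

# (test, higher_is_better, {run: value}); values copied from the results table.
RESULTS = [
    ("minibude: OpenMP - BM1 (Billion Interactions/s)", True,
     {"GCC 13.2 - Default": 29.879, "Clang 18.1.3": 27.616,
      "Clang 19.1.0": 27.382, "Clang 20.0 Git": 27.375}),
    ("webp: Default (MP/s)", True,
     {"GCC 13.2 - Default": 9.26, "Clang 18.1.3": 11.00,
      "Clang 19.1.0": 11.20, "Clang 20.0 Git": 11.28}),
    ("avifenc: 0 (Seconds)", False,
     {"GCC 13.2 - Default": 185.855, "Clang 18.1.3": 264.168,
      "Clang 19.1.0": 235.759, "Clang 20.0 Git": 235.730}),
    ("c-ray: 4K - 16 (Seconds)", False,
     {"GCC 13.2 - Default": 21.693, "Clang 18.1.3": 18.635,
      "Clang 19.1.0": 18.765, "Clang 20.0 Git": 20.190}),
]


def relative_performance(value, baseline, higher_is_better):
    """Performance relative to the baseline; 1.0 means parity, above 1.0 is faster."""
    return value / baseline if higher_is_better else baseline / value


def geometric_mean(values):
    """Geometric mean of a non-empty sequence of positive numbers."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


def overview(results, baseline=BASELINE):
    """Geometric mean of baseline-relative performance for every run."""
    runs = list(results[0][2])
    return {
        run: geometric_mean(
            relative_performance(vals[run], vals[baseline], hib)
            for _, hib, vals in results
        )
        for run in runs
    }


if __name__ == "__main__":
    for run, score in overview(RESULTS).items():
        print(f"{run}: {score * 100:.1f}%")
```

With only four tests the printed percentages are illustrative; extending `RESULTS` to every row of the table would give a figure comparable in spirit to the site's overall geometric mean.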

miniBUDE

MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1
Billion Interactions/s, More Is Better
Clang 20.0 Git: 27.38 (SE +/- 0.10, N = 3)
Clang 19.1.0: 27.38 (SE +/- 0.06, N = 3)
Clang 18.1.3: 27.62 (SE +/- 0.29, N = 3)
GCC 13.2 - Default: 29.88 (SE +/- 0.18, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM2
Billion Interactions/s, More Is Better
Clang 20.0 Git: 28.79 (SE +/- 0.03, N = 3)
Clang 19.1.0: 28.79 (SE +/- 0.03, N = 3)
Clang 18.1.3: 28.77 (SE +/- 0.02, N = 3)
GCC 13.2 - Default: 31.34 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm
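
Each result is reported with an "SE +/- x, N = y" annotation. The page does not define it; a minimal Python sketch follows, assuming SE is the standard error of the mean over N runs, computed from the sample standard deviation. The individual run values are not published in this result file, so the samples below are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def standard_error(samples):
    """Standard error of the mean: sample standard deviation divided by sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("at least two samples are needed")
    return stdev(samples) / sqrt(n)


if __name__ == "__main__":
    # Hypothetical per-run values, NOT taken from the result file.
    samples = [29.6, 29.9, 30.2]
    print(f"{mean(samples):.2f} (SE +/- {standard_error(samples):.2f}, N = {len(samples)})")
```

A small SE relative to the gap between two compilers suggests the gap is not run-to-run noise.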

simdjson

simdjson 3.10 - Throughput Test: Kostya
GB/s, More Is Better
Clang 18.1.3: 1.15 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 1.18 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: Kostya

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10 - Throughput Test: TopTweet
GB/s, More Is Better
Clang 18.1.3: 2.18 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.21 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: TopTweet

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10 - Throughput Test: LargeRandom
GB/s, More Is Better
Clang 18.1.3: 0.64 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 0.65 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: LargeRandom

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10 - Throughput Test: PartialTweets
GB/s, More Is Better
Clang 18.1.3: 2.12 (SE +/- 0.01, N = 3)
GCC 13.2 - Default: 2.15 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: PartialTweets

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

simdjson 3.10 - Throughput Test: DistinctUserID
GB/s, More Is Better
Clang 18.1.3: 2.20 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.22 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

Throughput Test: DistinctUserID

Clang 19.1.0: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Clang 20.0 Git: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

miniBUDE

MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1
GFInst/s, More Is Better
Clang 20.0 Git: 684.36 (SE +/- 2.49, N = 3)
Clang 19.1.0: 684.55 (SE +/- 1.59, N = 3)
Clang 18.1.3: 690.39 (SE +/- 7.29, N = 3)
GCC 13.2 - Default: 746.97 (SE +/- 4.39, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM2
GFInst/s, More Is Better
Clang 20.0 Git: 719.80 (SE +/- 0.71, N = 3)
Clang 19.1.0: 719.62 (SE +/- 0.64, N = 3)
Clang 18.1.3: 719.31 (SE +/- 0.57, N = 3)
GCC 13.2 - Default: 783.60 (SE +/- 0.44, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.43 - Operation: Swirl
Iterations Per Minute, More Is Better
Clang 20.0 Git: 989 (SE +/- 0.33, N = 3) [-lxml2 -lomp]
Clang 19.1.0: 1001 (SE +/- 1.67, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 995 (SE +/- 0.88, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 990 (SE +/- 1.00, N = 3) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: Rotate
Iterations Per Minute, More Is Better
Clang 20.0 Git: 256 (SE +/- 13.19, N = 12) [-lxml2 -lomp]
Clang 19.1.0: 267 (SE +/- 1.00, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 268 (SE +/- 2.00, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 258 (SE +/- 6.33, N = 12) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: Sharpen
Iterations Per Minute, More Is Better
Clang 20.0 Git: 478 (SE +/- 0.58, N = 3) [-lxml2 -lomp]
Clang 19.1.0: 473 (SE +/- 0.33, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 475 (SE +/- 0.58, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 466 (SE +/- 3.13, N = 15) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: Enhanced
Iterations Per Minute, More Is Better
Clang 20.0 Git: 356 (SE +/- 1.00, N = 3) [-lxml2 -lomp]
Clang 19.1.0: 353 (SE +/- 0.33, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 368 (SE +/- 0.67, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 373 (SE +/- 0.58, N = 3) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: Resizing
Iterations Per Minute, More Is Better
Clang 20.0 Git: 216 (SE +/- 1.20, N = 3) [-lxml2 -lomp]
Clang 19.1.0: 217 (SE +/- 0.58, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 215 (SE +/- 2.33, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 219 (SE +/- 2.53, N = 4) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: Noise-Gaussian
Iterations Per Minute, More Is Better
Clang 20.0 Git: 245 (SE +/- 0.88, N = 3) [-lxml2 -lomp]
Clang 19.1.0: 245 (SE +/- 0.33, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 247 (SE +/- 0.33, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 259 (SE +/- 0.33, N = 3) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: HWB Color Space
Iterations Per Minute, More Is Better
Clang 20.0 Git: 404 (SE +/- 2.67, N = 3) [-lxml2 -lomp]
Clang 19.1.0: 402 (SE +/- 0.33, N = 3) [-lxml2 -lomp]
Clang 18.1.3: 397 (SE +/- 0.58, N = 3) [-lxml2 -lomp]
GCC 13.2 - Default: 429 (SE +/- 0.33, N = 3) [-lgomp]
1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

SecureMark 1.0.4 - Benchmark: SecureMark-TLS
marks, More Is Better
Clang 20.0 Git: 173477 (SE +/- 56.65, N = 3)
Clang 19.1.0: 174043 (SE +/- 78.47, N = 3)
Clang 18.1.3: 172428 (SE +/- 134.78, N = 3)
GCC 13.2 - Default: 171949 (SE +/- 68.80, N = 3)
1. (CC) gcc options: -pedantic -O3

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.32 - Configuration: Multi-Threaded
MFLOPS, More Is Better
Clang 20.0 Git: 314473.7 (SE +/- 88.76, N = 3)
Clang 19.1.0: 314045.6 (SE +/- 31.64, N = 3)
Clang 18.1.3: 313766.7 (SE +/- 37.36, N = 3)
GCC 13.2 - Default: 300689.6 (SE +/- 98.22, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

7-Zip Compression

7-Zip Compression 24.05 - Test: Compression Rating
MIPS, More Is Better
Clang 20.0 Git: 750631 (SE +/- 594.21, N = 3)
Clang 19.1.0: 748668 (SE +/- 1725.30, N = 3)
Clang 18.1.3: 755498 (SE +/- 700.10, N = 3)
GCC 13.2 - Default: 786951 (SE +/- 61.46, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 24.05 - Test: Decompression Rating
MIPS, More Is Better
Clang 20.0 Git: 978715 (SE +/- 1082.06, N = 3)
Clang 19.1.0: 986522 (SE +/- 763.73, N = 3)
Clang 18.1.3: 975049 (SE +/- 325.41, N = 3)
GCC 13.2 - Default: 891624 (SE +/- 373.10, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

WebP Image Encode

WebP Image Encode 1.4 - Encode Settings: Default
MP/s, More Is Better
Clang 20.0 Git: 11.28 (SE +/- 0.00, N = 3)
Clang 19.1.0: 11.20 (SE +/- 0.01, N = 3)
Clang 18.1.3: 11.00 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 9.26 (SE +/- 0.07, N = 3)
Additional option listed for two of the runs: -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode 1.4 - Encode Settings: Quality 100
MP/s, More Is Better
Clang 20.0 Git: 7.56 (SE +/- 0.00, N = 3)
Clang 19.1.0: 7.56 (SE +/- 0.00, N = 3)
Clang 18.1.3: 7.43 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 6.60 (SE +/- 0.02, N = 3)
Additional option listed for two of the runs: -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode 1.4 - Encode Settings: Quality 100, Lossless
MP/s, More Is Better
Clang 20.0 Git: 1.18 (SE +/- 0.01, N = 3)
Clang 19.1.0: 1.18 (SE +/- 0.01, N = 3)
Clang 18.1.3: 1.19 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 1.12 (SE +/- 0.00, N = 3)
Additional option listed for two of the runs: -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode 1.4 - Encode Settings: Quality 100, Highest Compression
MP/s, More Is Better
Clang 20.0 Git: 3.44 (SE +/- 0.00, N = 3)
Clang 19.1.0: 3.43 (SE +/- 0.00, N = 3)
Clang 18.1.3: 3.33 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.80 (SE +/- 0.00, N = 3)
Additional option listed for two of the runs: -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode 1.4 - Encode Settings: Quality 100, Lossless, Highest Compression
MP/s, More Is Better
Clang 20.0 Git: 0.46 (SE +/- 0.00, N = 3)
Clang 19.1.0: 0.46 (SE +/- 0.00, N = 3)
Clang 18.1.3: 0.46 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 0.39 (SE +/- 0.00, N = 3)
Additional option listed for two of the runs: -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

Etcpak

Etcpak 2.0 - Benchmark: Multi-Threaded - Configuration: ETC2
Mpx/s, More Is Better
Clang 20.0 Git: 4385.16 (SE +/- 5.15, N = 3)
Clang 19.1.0: 4381.34 (SE +/- 4.76, N = 3)
Clang 18.1.3: 4339.78 (SE +/- 1.86, N = 3)
GCC 13.2 - Default: 4303.51 (SE +/- 2.14, N = 3)
1. (CXX) g++ options: -flto -O3 -mcpu=ampere1 -pthread

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81 - AI Chess Performance
Nodes Per Second, More Is Better
Clang 20.0 Git: 1579974 (SE +/- 1110.84, N = 5)
Clang 19.1.0: 1578160 (SE +/- 0.00, N = 5)
Clang 18.1.3: 1549749 (SE +/- 1747.58, N = 5)
GCC 13.2 - Default: 1387350 (SE +/- 0.00, N = 5)
1. (CC) gcc options: -O3 -mcpu=ampere1 -flto -march=native

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2024 - Implementation: MPI CPU - Input: water_GMX50_bare
Ns Per Day, More Is Better
Clang 20.0 Git: 6.884 (SE +/- 0.010, N = 3)
Clang 19.1.0: 6.894 (SE +/- 0.001, N = 3)
Clang 18.1.3: 6.888 (SE +/- 0.002, N = 3)
GCC 13.2 - Default: 7.497 (SE +/- 0.014, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 23Jun2022 - Model: 20k Atoms
ns/day, More Is Better
Clang 20.0 Git: 55.22 (SE +/- 0.05, N = 3)
Clang 19.1.0: 55.05 (SE +/- 0.06, N = 3)
Clang 18.1.3: 55.33 (SE +/- 0.07, N = 3)
GCC 13.2 - Default: 55.02 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

LAMMPS Molecular Dynamics Simulator 23Jun2022 - Model: Rhodopsin Protein
ns/day, More Is Better
Clang 20.0 Git: 47.41 (SE +/- 3.02, N = 15)
Clang 19.1.0: 54.41 (SE +/- 0.19, N = 3)
Clang 18.1.3: 54.53 (SE +/- 0.14, N = 3)
GCC 13.2 - Default: 47.62 (SE +/- 4.33, N = 12)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl
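
The Rhodopsin Protein result has much larger standard errors and run counts (N = 12 and N = 15) for two of the compilers, so some of the gaps may be within noise. One rough way to check is to divide the difference between two means by their combined standard error. The Python sketch below does this under the assumptions that the runs are independent and that SE is the standard error of the mean; the function name is hypothetical and this is not a formal significance test.

```python
from math import sqrt


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error."""
    combined = sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined


if __name__ == "__main__":
    # (mean ns/day, SE) for LAMMPS Rhodopsin Protein, copied from the results.
    gcc = (47.62, 4.33)
    clang19 = (54.41, 0.19)
    clang20 = (47.41, 3.02)

    print(f"Clang 20.0 Git vs GCC: {difference_in_se_units(*clang20, *gcc):+.2f}")
    print(f"Clang 19.1.0 vs GCC:   {difference_in_se_units(*clang19, *gcc):+.2f}")
```

Values near zero indicate a difference that is small compared with the measurement noise; larger magnitudes suggest a gap that is less likely to be noise alone.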

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 32
samples/s, More Is Better
Clang 20.0 Git: 3499300000 (SE +/- 57735.03, N = 3)
Clang 19.1.0: 3499066667 (SE +/- 66666.67, N = 3)
Clang 18.1.3: 3500833333 (SE +/- 120185.04, N = 3)
GCC 13.2 - Default: 2197800000 (SE +/- 5892367.95, N = 3)
1. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP 1.6 - Threads: 192 - Buffer Length: 256 - Filter Length: 32
samples/s, More Is Better
Clang 20.0 Git: 5247633333 (SE +/- 33333.33, N = 3)
Clang 19.1.0: 5246666667 (SE +/- 470224.53, N = 3)
Clang 18.1.3: 5249800000 (SE +/- 100000.00, N = 3)
GCC 13.2 - Default: 3148866667 (SE +/- 440958.55, N = 3)
1. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 512
samples/s, More Is Better
Clang 20.0 Git: 164463333 (SE +/- 6666.67, N = 3)
Clang 19.1.0: 164456667 (SE +/- 3333.33, N = 3)
Clang 18.1.3: 164420000 (SE +/- 5773.50, N = 3)
GCC 13.2 - Default: 152760000 (SE +/- 448441.75, N = 3)
1. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP 1.6 - Threads: 192 - Buffer Length: 256 - Filter Length: 512
samples/s, More Is Better
Clang 20.0 Git: 246653333 (SE +/- 3333.33, N = 3)
Clang 19.1.0: 246603333 (SE +/- 8819.17, N = 3)
Clang 18.1.3: 246590000 (SE +/- 15275.25, N = 3)
GCC 13.2 - Default: 224140000 (SE +/- 5773.50, N = 3)
1. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 1.0 - Encoder Speed: 0
Seconds, Fewer Is Better
Clang 20.0 Git: 235.73 (SE +/- 0.13, N = 3)
Clang 19.1.0: 235.76 (SE +/- 0.14, N = 3)
Clang 18.1.3: 264.17 (SE +/- 0.07, N = 3)
GCC 13.2 - Default: 185.86 (SE +/- 2.22, N = 3)
1. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0 - Encoder Speed: 2
Seconds, Fewer Is Better
Clang 20.0 Git: 171.38 (SE +/- 0.18, N = 3)
Clang 19.1.0: 170.82 (SE +/- 0.30, N = 3)
Clang 18.1.3: 197.15 (SE +/- 0.10, N = 3)
GCC 13.2 - Default: 115.60 (SE +/- 0.98, N = 3)
1. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0 - Encoder Speed: 6
Seconds, Fewer Is Better
Clang 20.0 Git: 2.954 (SE +/- 0.006, N = 3)
Clang 19.1.0: 2.949 (SE +/- 0.003, N = 3)
Clang 18.1.3: 2.992 (SE +/- 0.002, N = 3)
GCC 13.2 - Default: 2.908 (SE +/- 0.004, N = 3)
1. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0 - Encoder Speed: 6, Lossless
Seconds, Fewer Is Better
Clang 20.0 Git: 5.685 (SE +/- 0.006, N = 3)
Clang 19.1.0: 5.676 (SE +/- 0.009, N = 3)
Clang 18.1.3: 5.730 (SE +/- 0.004, N = 3)
GCC 13.2 - Default: 5.687 (SE +/- 0.007, N = 3)
1. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc 1.0 - Encoder Speed: 10, Lossless
Seconds, Fewer Is Better
Clang 20.0 Git: 4.369 (SE +/- 0.004, N = 3)
Clang 19.1.0: 4.385 (SE +/- 0.002, N = 3)
Clang 18.1.3: 4.411 (SE +/- 0.001, N = 3)
GCC 13.2 - Default: 4.411 (SE +/- 0.003, N = 3)
1. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

C-Ray

C-Ray 2.0 - Resolution: 4K - Rays Per Pixel: 16
Seconds, Fewer Is Better
Clang 20.0 Git: 20.19 (SE +/- 0.01, N = 3)
Clang 19.1.0: 18.77 (SE +/- 0.00, N = 3)
Clang 18.1.3: 18.64 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 21.69 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -lpthread -lm

C-Ray 2.0 - Resolution: 5K - Rays Per Pixel: 16
Seconds, Fewer Is Better
Clang 20.0 Git: 35.56 (SE +/- 0.00, N = 3)
Clang 19.1.0: 33.06 (SE +/- 0.02, N = 3)
Clang 18.1.3: 32.80 (SE +/- 0.01, N = 3)
GCC 13.2 - Default: 38.18 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -lpthread -lm

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7 - Trace Time
Seconds, Fewer Is Better
Clang 20.0 Git: 7.971 (SE +/- 0.026, N = 3)
Clang 19.1.0: 7.973 (SE +/- 0.026, N = 3)
Clang 18.1.3: 8.255 (SE +/- 0.011, N = 3)
GCC 13.2 - Default: 7.494 (SE +/- 0.024, N = 3)
1. (CXX) g++ options: -pipe -O3 -ffast-math -mcpu=ampere1 -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

Primesieve 12.1 - Length: 1e13
Seconds, Fewer Is Better
Clang 20.0 Git: 14.00 (SE +/- 0.09, N = 3)
Clang 19.1.0: 14.01 (SE +/- 0.04, N = 3)
Clang 18.1.3: 14.04 (SE +/- 0.02, N = 3)
GCC 13.2 - Default: 14.01 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC audio format ten times using the --best preset settings. Learn more via the OpenBenchmarking.org test page.

FLAC Audio Encoding 1.4 - WAV To FLAC
Seconds, Fewer Is Better
Clang 20.0 Git: 22.82 (SE +/- 0.01, N = 5)
Clang 19.1.0: 22.80 (SE +/- 0.01, N = 5)
Clang 18.1.3: 23.05 (SE +/- 0.01, N = 5)
GCC 13.2 - Default: 24.00 (SE +/- 0.01, N = 5)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus five times. Learn more via the OpenBenchmarking.org test page.

Opus Codec Encoding 1.4 - WAV To Opus Encode
Seconds, Fewer Is Better
Clang 20.0 Git: 68.57 (SE +/- 0.00, N = 5)
Clang 19.1.0: 68.14 (SE +/- 0.00, N = 5)
Clang 18.1.3: 68.88 (SE +/- 0.00, N = 5)
GCC 13.2 - Default: 72.95 (SE +/- 0.00, N = 5)
1. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Gcrypt Library

Gcrypt Library 1.10.3
Seconds, Fewer Is Better
Clang 20.0 Git: 310.33 (SE +/- 0.36, N = 3)
Clang 19.1.0: 310.96 (SE +/- 0.72, N = 3)
Clang 18.1.3: 310.17 (SE +/- 0.77, N = 3)
GCC 13.2 - Default: 324.22 (SE +/- 0.60, N = 3)
1. (CC) gcc options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.

Helsing 1.0-beta - Digit Range: 14 digit
Seconds, Fewer Is Better
Clang 20.0 Git: 34.46 (SE +/- 0.05, N = 3)
Clang 19.1.0: 34.45 (SE +/- 0.06, N = 3)
Clang 18.1.3: 34.37 (SE +/- 0.06, N = 3)
GCC 13.2 - Default: 33.57 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -O2 -pthread