AmpereOne GCC Clang Compiler Benchmarking

AmpereOne compiler testing by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2409062-NE-AMPEREONE13&sro&grt.

System Details (identical for GCC 13.2 - Default, Clang 18.1.3, Clang 19.1.0 and Clang 20.0 Git unless noted)

Processor: AmpereOne @ 3.20GHz (192 Cores)
Motherboard: Supermicro ARS-211M-NR R13SPD v1.02 (T20240726102529 BIOS)
Chipset: Ampere Computing LLC Device e208
Memory: 8 x 64GB DDR5-5200MT/s
Disk: 3841GB SAMSUNG MZQL23T8HCLS-00A07 + 960GB SAMSUNG MZ1L2960HCJR-00A07
Graphics: ASPEED
Monitor: VGA HDMI
Network: 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb + 2 x Mellanox MT2892
OS: Ubuntu 24.04
Kernel: 6.11.0-061100rc6daily20240904-generic-64k (aarch64)
Compiler: GCC 13.2.0 (GCC 13.2 - Default), Clang 18.1.3, Clang 19.1.0, Clang 20.0.0 (Clang 20.0 Git)
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- CXXFLAGS="-O3 -mcpu=ampere1 -flto" CFLAGS="-O3 -mcpu=ampere1 -flto"

Compiler Details
- GCC 13.2 - Default: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v

Processor Details
- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)

Security Details
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Results Overview

Test | GCC 13.2 - Default | Clang 18.1.3 | Clang 19.1.0 | Clang 20.0 Git
compress-7zip: Compression Rating | 786951 | 755498 | 748668 | 750631
compress-7zip: Decompression Rating | 891624 | 975049 | 986522 | 978715
c-ray: 4K - 16 | 21.693 | 18.635 | 18.765 | 20.190
c-ray: 5K - 16 | 38.179 | 32.802 | 33.061 | 35.555
etcpak: Multi-Threaded - ETC2 | 4303.508 | 4339.779 | 4381.343 | 4385.161
encode-flac: WAV To FLAC | 23.999 | 23.046 | 22.802 | 22.818
gcrypt | 324.220 | 310.174 | 310.962 | 310.331
graphics-magick: Swirl | 990 | 995 | 1001 | 989
graphics-magick: Rotate | 258 | 268 | 267 | 256
graphics-magick: Sharpen | 466 | 475 | 473 | 478
graphics-magick: Enhanced | 373 | 368 | 353 | 356
graphics-magick: Resizing | 219 | 215 | 217 | 216
graphics-magick: Noise-Gaussian | 259 | 247 | 245 | 245
graphics-magick: HWB Color Space | 429 | 397 | 402 | 404
gromacs: MPI CPU - water_GMX50_bare | 7.497 | 6.888 | 6.894 | 6.884
helsing: 14 digit | 33.567 | 34.373 | 34.447 | 34.456
lammps: 20k Atoms | 55.018 | 55.330 | 55.049 | 55.224
lammps: Rhodopsin Protein | 47.620 | 54.526 | 54.408 | 47.414
avifenc: 0 | 185.855 | 264.168 | 235.759 | 235.730
avifenc: 2 | 115.602 | 197.154 | 170.819 | 171.375
avifenc: 6 | 2.908 | 2.992 | 2.949 | 2.954
avifenc: 6, Lossless | 5.687 | 5.730 | 5.676 | 5.685
avifenc: 10, Lossless | 4.411 | 4.411 | 4.385 | 4.369
liquid-dsp: 128 - 256 - 32 | 2197800000 | 3500833333 | 3499066667 | 3499300000
liquid-dsp: 192 - 256 - 32 | 3148866667 | 5249800000 | 5246666667 | 5247633333
liquid-dsp: 128 - 256 - 512 | 152760000 | 164420000 | 164456667 | 164463333
liquid-dsp: 192 - 256 - 512 | 224140000 | 246590000 | 246603333 | 246653333
minibude: OpenMP - BM1 (GFInst/s) | 746.973 | 690.388 | 684.546 | 684.363
minibude: OpenMP - BM1 (Billion Interactions/s) | 29.879 | 27.616 | 27.382 | 27.375
minibude: OpenMP - BM2 (GFInst/s) | 783.599 | 719.313 | 719.624 | 719.802
minibude: OpenMP - BM2 (Billion Interactions/s) | 31.344 | 28.773 | 28.785 | 28.792
encode-opus: WAV To Opus Encode | 72.952 | 68.876 | 68.142 | 68.570
povray: Trace Time | 7.494 | 8.255 | 7.973 | 7.971
primesieve: 1e13 | 14.006 | 14.039 | 14.008 | 13.995
quantlib: Multi-Threaded | 300689.6 | 313766.7 | 314045.6 | 314473.7
securemark: SecureMark-TLS | 171949 | 172428 | 174043 | 173477
simdjson: Kostya | 1.18 | 1.15 | - | -
simdjson: TopTweet | 2.21 | 2.18 | - | -
simdjson: LargeRand | 0.65 | 0.64 | - | -
simdjson: PartialTweets | 2.15 | 2.12 | - | -
simdjson: DistinctUserID | 2.22 | 2.20 | - | -
tscp: AI Chess Performance | 1387350 | 1549749 | 1578160 | 1579974
webp: Default | 9.26 | 11.00 | 11.20 | 11.28
webp: Quality 100 | 6.60 | 7.43 | 7.56 | 7.56
webp: Quality 100, Lossless | 1.12 | 1.19 | 1.18 | 1.18
webp: Quality 100, Highest Compression | 2.80 | 3.33 | 3.43 | 3.44
webp: Quality 100, Lossless, Highest Compression | 0.39 | 0.46 | 0.46 | 0.46
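
Because the overview mixes "More Is Better" and "Fewer Is Better" metrics, raw values cannot be compared across tests directly. The following is a minimal sketch, not part of the original export, of one common way to normalize them: express each Clang 18.1.3 result relative to GCC 13.2 so that values above 1.0 always mean "faster than GCC", then take a geometric mean. The chosen subset of tests, the dictionary layout, and the helper names `relative_performance` and `geometric_mean` are illustrative assumptions; the numbers are copied from this result file.

```python
from math import prod

# (GCC 13.2 value, Clang 18.1.3 value, higher_is_better) copied from this export.
# The selection of tests is an arbitrary illustrative subset.
RESULTS = {
    "7-Zip Compression Rating (MIPS)": (786951, 755498, True),
    "C-Ray 4K (s)": (21.693, 18.635, False),
    "avifenc speed 0 (s)": (185.855, 264.168, False),
    "WebP Default (MP/s)": (9.26, 11.00, True),
}


def relative_performance(baseline, candidate, higher_is_better):
    """Candidate performance relative to baseline (1.0 = equal, >1.0 = candidate faster)."""
    if higher_is_better:
        return candidate / baseline
    # For time-based metrics a smaller value is better, so invert the ratio.
    return baseline / candidate


def geometric_mean(values):
    """Geometric mean of positive ratios."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


ratios = {
    name: relative_performance(gcc, clang, hib)
    for name, (gcc, clang, hib) in RESULTS.items()
}

for name, ratio in ratios.items():
    print(f"{name}: Clang 18.1.3 at {ratio:.3f}x of GCC 13.2")
print(f"Geometric mean over this subset: {geometric_mean(ratios.values()):.3f}")
```

The geometric mean is used here because it treats a 2x win and a 2x loss symmetrically; a summary over only four tests says little about the full suite.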

7-Zip Compression

Test: Compression Rating

7-Zip Compression 24.05 - MIPS, More Is Better
Clang 18.1.3: 755498 (SE +/- 700.10, N = 3)
Clang 19.1.0: 748668 (SE +/- 1725.30, N = 3)
Clang 20.0 Git: 750631 (SE +/- 594.21, N = 3)
GCC 13.2 - Default: 786951 (SE +/- 61.46, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 24.05 - MIPS, More Is Better
Clang 18.1.3: 975049 (SE +/- 325.41, N = 3)
Clang 19.1.0: 986522 (SE +/- 763.73, N = 3)
Clang 20.0 Git: 978715 (SE +/- 1082.06, N = 3)
GCC 13.2 - Default: 891624 (SE +/- 373.10, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

C-Ray

Resolution: 4K - Rays Per Pixel: 16

C-Ray 2.0 - Seconds, Fewer Is Better
Clang 18.1.3: 18.64 (SE +/- 0.00, N = 3)
Clang 19.1.0: 18.77 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 20.19 (SE +/- 0.01, N = 3)
GCC 13.2 - Default: 21.69 (SE +/- 0.01, N = 3)
(CC) gcc options: -lpthread -lm

C-Ray

Resolution: 5K - Rays Per Pixel: 16

C-Ray 2.0 - Seconds, Fewer Is Better
Clang 18.1.3: 32.80 (SE +/- 0.01, N = 3)
Clang 19.1.0: 33.06 (SE +/- 0.02, N = 3)
Clang 20.0 Git: 35.56 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 38.18 (SE +/- 0.01, N = 3)
(CC) gcc options: -lpthread -lm

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

Etcpak 2.0 - Mpx/s, More Is Better
Clang 18.1.3: 4339.78 (SE +/- 1.86, N = 3)
Clang 19.1.0: 4381.34 (SE +/- 4.76, N = 3)
Clang 20.0 Git: 4385.16 (SE +/- 5.15, N = 3)
GCC 13.2 - Default: 4303.51 (SE +/- 2.14, N = 3)
(CXX) g++ options: -flto -O3 -mcpu=ampere1 -pthread

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.4 - Seconds, Fewer Is Better
Clang 18.1.3: 23.05 (SE +/- 0.01, N = 5)
Clang 19.1.0: 22.80 (SE +/- 0.01, N = 5)
Clang 20.0 Git: 22.82 (SE +/- 0.01, N = 5)
GCC 13.2 - Default: 24.00 (SE +/- 0.01, N = 5)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Gcrypt Library

Gcrypt Library 1.10.3 - Seconds, Fewer Is Better
Clang 18.1.3: 310.17 (SE +/- 0.77, N = 3)
Clang 19.1.0: 310.96 (SE +/- 0.72, N = 3)
Clang 20.0 Git: 310.33 (SE +/- 0.36, N = 3)
GCC 13.2 - Default: 324.22 (SE +/- 0.60, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 995 (SE +/- 0.88, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 1001 (SE +/- 1.67, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 989 (SE +/- 0.33, N = 3; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 990 (SE +/- 1.00, N = 3; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 268 (SE +/- 2.00, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 267 (SE +/- 1.00, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 256 (SE +/- 13.19, N = 12; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 258 (SE +/- 6.33, N = 12; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 475 (SE +/- 0.58, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 473 (SE +/- 0.33, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 478 (SE +/- 0.58, N = 3; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 466 (SE +/- 3.13, N = 15; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 368 (SE +/- 0.67, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 353 (SE +/- 0.33, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 356 (SE +/- 1.00, N = 3; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 373 (SE +/- 0.58, N = 3; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 215 (SE +/- 2.33, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 217 (SE +/- 0.58, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 216 (SE +/- 1.20, N = 3; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 219 (SE +/- 2.53, N = 4; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 247 (SE +/- 0.33, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 245 (SE +/- 0.33, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 245 (SE +/- 0.88, N = 3; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 259 (SE +/- 0.33, N = 3; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
Clang 18.1.3: 397 (SE +/- 0.58, N = 3; extra options: -lxml2 -lomp)
Clang 19.1.0: 402 (SE +/- 0.33, N = 3; extra options: -lxml2 -lomp)
Clang 20.0 Git: 404 (SE +/- 2.67, N = 3; extra options: -lxml2 -lomp)
GCC 13.2 - Default: 429 (SE +/- 0.33, N = 3; extra options: -lgomp)
(CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 - Ns Per Day, More Is Better
Clang 18.1.3: 6.888 (SE +/- 0.002, N = 3)
Clang 19.1.0: 6.894 (SE +/- 0.001, N = 3)
Clang 20.0 Git: 6.884 (SE +/- 0.010, N = 3)
GCC 13.2 - Default: 7.497 (SE +/- 0.014, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm

Helsing

Digit Range: 14 digit

Helsing 1.0-beta - Seconds, Fewer Is Better
Clang 18.1.3: 34.37 (SE +/- 0.06, N = 3)
Clang 19.1.0: 34.45 (SE +/- 0.06, N = 3)
Clang 20.0 Git: 34.46 (SE +/- 0.05, N = 3)
GCC 13.2 - Default: 33.57 (SE +/- 0.02, N = 3)
(CC) gcc options: -O2 -pthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 23Jun2022 - ns/day, More Is Better
Clang 18.1.3: 55.33 (SE +/- 0.07, N = 3)
Clang 19.1.0: 55.05 (SE +/- 0.06, N = 3)
Clang 20.0 Git: 55.22 (SE +/- 0.05, N = 3)
GCC 13.2 - Default: 55.02 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 23Jun2022 - ns/day, More Is Better
Clang 18.1.3: 54.53 (SE +/- 0.14, N = 3)
Clang 19.1.0: 54.41 (SE +/- 0.19, N = 3)
Clang 20.0 Git: 47.41 (SE +/- 3.02, N = 15)
GCC 13.2 - Default: 47.62 (SE +/- 4.33, N = 12)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl
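
The Rhodopsin Protein result is one of the few here where two configurations report a large standard error and a higher run count, so the gap between compilers is worth reading against that noise. The sketch below is an illustrative rule of thumb, not something the Phoronix Test Suite computes: it divides the difference between two means by their combined standard error. The helper name `difference_in_se_units` is hypothetical, and the means and SE values are copied from the result above.

```python
from math import sqrt


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error."""
    combined_se = sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# LAMMPS Rhodopsin Protein (ns/day): (mean, standard error) from this export.
clang18 = (54.53, 0.14)
clang19 = (54.41, 0.19)
gcc13 = (47.62, 4.33)

clang18_vs_gcc = difference_in_se_units(*clang18, *gcc13)
clang18_vs_clang19 = difference_in_se_units(*clang18, *clang19)

print(f"Clang 18.1.3 vs GCC 13.2: {clang18_vs_gcc:.2f} combined SEs apart")
print(f"Clang 18.1.3 vs Clang 19.1.0: {clang18_vs_clang19:.2f} combined SEs apart")
```

A small value suggests the two results are hard to distinguish from run-to-run variation. With so few runs per configuration this is only a rough screen and not a formal significance test.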

libavif avifenc

Encoder Speed: 0

libavif avifenc 1.0 - Seconds, Fewer Is Better
Clang 18.1.3: 264.17 (SE +/- 0.07, N = 3)
Clang 19.1.0: 235.76 (SE +/- 0.14, N = 3)
Clang 20.0 Git: 235.73 (SE +/- 0.13, N = 3)
GCC 13.2 - Default: 185.86 (SE +/- 2.22, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 2

libavif avifenc 1.0 - Seconds, Fewer Is Better
Clang 18.1.3: 197.15 (SE +/- 0.10, N = 3)
Clang 19.1.0: 170.82 (SE +/- 0.30, N = 3)
Clang 20.0 Git: 171.38 (SE +/- 0.18, N = 3)
GCC 13.2 - Default: 115.60 (SE +/- 0.98, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 6

libavif avifenc 1.0 - Seconds, Fewer Is Better
Clang 18.1.3: 2.992 (SE +/- 0.002, N = 3)
Clang 19.1.0: 2.949 (SE +/- 0.003, N = 3)
Clang 20.0 Git: 2.954 (SE +/- 0.006, N = 3)
GCC 13.2 - Default: 2.908 (SE +/- 0.004, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 1.0 - Seconds, Fewer Is Better
Clang 18.1.3: 5.730 (SE +/- 0.004, N = 3)
Clang 19.1.0: 5.676 (SE +/- 0.009, N = 3)
Clang 20.0 Git: 5.685 (SE +/- 0.006, N = 3)
GCC 13.2 - Default: 5.687 (SE +/- 0.007, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 1.0 - Seconds, Fewer Is Better
Clang 18.1.3: 4.411 (SE +/- 0.001, N = 3)
Clang 19.1.0: 4.385 (SE +/- 0.002, N = 3)
Clang 20.0 Git: 4.369 (SE +/- 0.004, N = 3)
GCC 13.2 - Default: 4.411 (SE +/- 0.003, N = 3)
(CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 - samples/s, More Is Better
Clang 18.1.3: 3500833333 (SE +/- 120185.04, N = 3)
Clang 19.1.0: 3499066667 (SE +/- 66666.67, N = 3)
Clang 20.0 Git: 3499300000 (SE +/- 57735.03, N = 3)
GCC 13.2 - Default: 2197800000 (SE +/- 5892367.95, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 - samples/s, More Is Better
Clang 18.1.3: 5249800000 (SE +/- 100000.00, N = 3)
Clang 19.1.0: 5246666667 (SE +/- 470224.53, N = 3)
Clang 20.0 Git: 5247633333 (SE +/- 33333.33, N = 3)
GCC 13.2 - Default: 3148866667 (SE +/- 440958.55, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 - samples/s, More Is Better
Clang 18.1.3: 164420000 (SE +/- 5773.50, N = 3)
Clang 19.1.0: 164456667 (SE +/- 3333.33, N = 3)
Clang 20.0 Git: 164463333 (SE +/- 6666.67, N = 3)
GCC 13.2 - Default: 152760000 (SE +/- 448441.75, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 - samples/s, More Is Better
Clang 18.1.3: 246590000 (SE +/- 15275.25, N = 3)
Clang 19.1.0: 246603333 (SE +/- 8819.17, N = 3)
Clang 20.0 Git: 246653333 (SE +/- 3333.33, N = 3)
GCC 13.2 - Default: 224140000 (SE +/- 5773.50, N = 3)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901 - GFInst/s, More Is Better
Clang 18.1.3: 690.39 (SE +/- 7.29, N = 3)
Clang 19.1.0: 684.55 (SE +/- 1.59, N = 3)
Clang 20.0 Git: 684.36 (SE +/- 2.49, N = 3)
GCC 13.2 - Default: 746.97 (SE +/- 4.39, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901 - Billion Interactions/s, More Is Better
Clang 18.1.3: 27.62 (SE +/- 0.29, N = 3)
Clang 19.1.0: 27.38 (SE +/- 0.06, N = 3)
Clang 20.0 Git: 27.38 (SE +/- 0.10, N = 3)
GCC 13.2 - Default: 29.88 (SE +/- 0.18, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 - GFInst/s, More Is Better
Clang 18.1.3: 719.31 (SE +/- 0.57, N = 3)
Clang 19.1.0: 719.62 (SE +/- 0.64, N = 3)
Clang 20.0 Git: 719.80 (SE +/- 0.71, N = 3)
GCC 13.2 - Default: 783.60 (SE +/- 0.44, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 - Billion Interactions/s, More Is Better
Clang 18.1.3: 28.77 (SE +/- 0.02, N = 3)
Clang 19.1.0: 28.79 (SE +/- 0.03, N = 3)
Clang 20.0 Git: 28.79 (SE +/- 0.03, N = 3)
GCC 13.2 - Default: 31.34 (SE +/- 0.02, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.4 - Seconds, Fewer Is Better
Clang 18.1.3: 68.88 (SE +/- 0.00, N = 5)
Clang 19.1.0: 68.14 (SE +/- 0.00, N = 5)
Clang 20.0 Git: 68.57 (SE +/- 0.00, N = 5)
GCC 13.2 - Default: 72.95 (SE +/- 0.00, N = 5)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

POV-Ray

Trace Time

POV-Ray 3.7.0.7 - Seconds, Fewer Is Better
Clang 18.1.3: 8.255 (SE +/- 0.011, N = 3)
Clang 19.1.0: 7.973 (SE +/- 0.026, N = 3)
Clang 20.0 Git: 7.971 (SE +/- 0.026, N = 3)
GCC 13.2 - Default: 7.494 (SE +/- 0.024, N = 3)
(CXX) g++ options: -pipe -O3 -ffast-math -mcpu=ampere1 -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

Length: 1e13

Primesieve 12.1 - Seconds, Fewer Is Better
Clang 18.1.3: 14.04 (SE +/- 0.02, N = 3)
Clang 19.1.0: 14.01 (SE +/- 0.04, N = 3)
Clang 20.0 Git: 14.00 (SE +/- 0.09, N = 3)
GCC 13.2 - Default: 14.01 (SE +/- 0.08, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto

QuantLib

Configuration: Multi-Threaded

QuantLib 1.32 - MFLOPS, More Is Better
Clang 18.1.3: 313766.7 (SE +/- 37.36, N = 3)
Clang 19.1.0: 314045.6 (SE +/- 31.64, N = 3)
Clang 20.0 Git: 314473.7 (SE +/- 88.76, N = 3)
GCC 13.2 - Default: 300689.6 (SE +/- 98.22, N = 3)
(CXX) g++ options: -O3 -march=native -fPIE -pie

SecureMark

Benchmark: SecureMark-TLS

SecureMark 1.0.4 - marks, More Is Better
Clang 18.1.3: 172428 (SE +/- 134.78, N = 3)
Clang 19.1.0: 174043 (SE +/- 78.47, N = 3)
Clang 20.0 Git: 173477 (SE +/- 56.65, N = 3)
GCC 13.2 - Default: 171949 (SE +/- 68.80, N = 3)
(CC) gcc options: -pedantic -O3

simdjson

Throughput Test: Kostya

simdjson 3.10 - GB/s, More Is Better
Clang 18.1.3: 1.15 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 1.18 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: TopTweet

simdjson 3.10 - GB/s, More Is Better
Clang 18.1.3: 2.18 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.21 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: LargeRandom

simdjson 3.10 - GB/s, More Is Better
Clang 18.1.3: 0.64 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 0.65 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: PartialTweets

simdjson 3.10 - GB/s, More Is Better
Clang 18.1.3: 2.12 (SE +/- 0.01, N = 3)
GCC 13.2 - Default: 2.15 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: DistinctUserID

simdjson 3.10 - GB/s, More Is Better
Clang 18.1.3: 2.20 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.22 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

TSCP

AI Chess Performance

TSCP 1.81 - Nodes Per Second, More Is Better
Clang 18.1.3: 1549749 (SE +/- 1747.58, N = 5)
Clang 19.1.0: 1578160 (SE +/- 0.00, N = 5)
Clang 20.0 Git: 1579974 (SE +/- 1110.84, N = 5)
GCC 13.2 - Default: 1387350 (SE +/- 0.00, N = 5)
(CC) gcc options: -O3 -mcpu=ampere1 -flto -march=native

WebP Image Encode

Encode Settings: Default

WebP Image Encode 1.4 - MP/s, More Is Better
Clang 18.1.3: 11.00 (SE +/- 0.00, N = 3)
Clang 19.1.0: 11.20 (SE +/- 0.01, N = 3)
Clang 20.0 Git: 11.28 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 9.26 (SE +/- 0.07, N = 3)
Extra option listed for two of the four results: -ltiff
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

WebP Image Encode 1.4 - MP/s, More Is Better
Clang 18.1.3: 7.43 (SE +/- 0.00, N = 3)
Clang 19.1.0: 7.56 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 7.56 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 6.60 (SE +/- 0.02, N = 3)
Extra option listed for two of the four results: -ltiff
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

WebP Image Encode 1.4 - MP/s, More Is Better
Clang 18.1.3: 1.19 (SE +/- 0.00, N = 3)
Clang 19.1.0: 1.18 (SE +/- 0.01, N = 3)
Clang 20.0 Git: 1.18 (SE +/- 0.01, N = 3)
GCC 13.2 - Default: 1.12 (SE +/- 0.00, N = 3)
Extra option listed for two of the four results: -ltiff
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

WebP Image Encode 1.4 - MP/s, More Is Better
Clang 18.1.3: 3.33 (SE +/- 0.00, N = 3)
Clang 19.1.0: 3.43 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 3.44 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 2.80 (SE +/- 0.00, N = 3)
Extra option listed for two of the four results: -ltiff
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

WebP Image Encode 1.4 - MP/s, More Is Better
Clang 18.1.3: 0.46 (SE +/- 0.00, N = 3)
Clang 19.1.0: 0.46 (SE +/- 0.00, N = 3)
Clang 20.0 Git: 0.46 (SE +/- 0.00, N = 3)
GCC 13.2 - Default: 0.39 (SE +/- 0.00, N = 3)
Extra option listed for two of the four results: -ltiff
(CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg


Phoronix Test Suite v10.8.5