AmpereOne GCC Clang Compiler Benchmarking

AmpereOne compiler testing by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2409062-NE-AMPEREONE13&grt.

AmpereOne GCC Clang Compiler BenchmarkingProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 GitAmpereOne @ 3.20GHz (192 Cores)Supermicro ARS-211M-NR R13SPD v1.02 (T20240726102529 BIOS)Ampere Computing LLC Device e2088 x 64GB DDR5-5200MT/s3841GB SAMSUNG MZQL23T8HCLS-00A07 + 960GB SAMSUNG MZ1L2960HCJR-00A07ASPEEDVGA HDMI2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb + 2 x Mellanox MT2892Ubuntu 24.046.11.0-061100rc6daily20240904-generic-64k (aarch64)GCC 13.2.0ext41920x1080Clang 18.1.3Clang 19.1.0Clang 20.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -mcpu=ampere1 -flto" CFLAGS="-O3 -mcpu=ampere1 -flto"Compiler Details- GCC 13.2 - Default: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v Processor Details- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AmpereOne GCC Clang Compiler Benchmarkingcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingc-ray: 4K - 16c-ray: 5K - 16etcpak: Multi-Threaded - ETC2encode-flac: WAV To FLACgcrypt: graphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacegromacs: MPI CPU - water_GMX50_barehelsing: 14 digitlammps: 20k Atomslammps: Rhodopsin Proteinavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessavifenc: 10, Losslessliquid-dsp: 128 - 256 - 32liquid-dsp: 192 - 256 - 32liquid-dsp: 128 - 256 - 512liquid-dsp: 192 - 256 - 512minibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2encode-opus: WAV To Opus Encodepovray: Trace Timeprimesieve: 1e13quantlib: Multi-Threadedsecuremark: SecureMark-TLSsimdjson: Kostyasimdjson: TopTweetsimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDtscp: AI Chess Performancewebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest CompressionGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git78695189162421.69338.1794303.50823.999324.2209902584663732192594297.49733.56755.01847.620185.855115.6022.9085.6874.41121978000003148866667152760000224140000746.97329.879783.59931.34472.9527.49414.006300689.61719491.182.210.652.152.2213873509.266.601.122.800.3975549897504918.63532.8024339.77923.046310.1749952684753682152473976.88834.37355.33054.526264.168197.1542.9925.7304.41135008333335249800000164420000246590000690.38827.616719.31328.77368.8768.25514.039313766.71724281.152.180.642.122.2154974911.007.431.193.330.4674866898652218.76533.0614381.34322.802310.96210012674733532172454026.89434.44755.04954.408235.759170.8192.9495.6764.38534990666675246666667164456667246603333684.54627.382719.62428.78568.1427.97314.008314045.6174043157816011.207.561.183.430.4675063197871520.19035.5554385.16122.818310.3319892564783562162454046.88434.45655.22447.414235.730171.3752.9545.6854.36934993000005247633333164463333246653333684.36327.375719.80228.79268.5707.97113.995314473.7173477157997411.287.561.183.440.46OpenBenchmarking.org

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git200K400K600K800K1000KSE +/- 61.46, N = 3SE +/- 700.10, N = 3SE +/- 1725.30, N = 3SE +/- 594.21, N = 37869517554987486687506311. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git200K400K600K800K1000KSE +/- 373.10, N = 3SE +/- 325.41, N = 3SE +/- 763.73, N = 3SE +/- 1082.06, N = 38916249750499865229787151. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

C-Ray

Resolution: 4K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 321.6918.6418.7720.191. (CC) gcc options: -lpthread -lm

C-Ray

Resolution: 5K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 338.1832.8033.0635.561. (CC) gcc options: -lpthread -lm

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC2GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git9001800270036004500SE +/- 2.14, N = 3SE +/- 1.86, N = 3SE +/- 4.76, N = 3SE +/- 5.15, N = 34303.514339.784381.344385.161. (CXX) g++ options: -flto -O3 -mcpu=ampere1 -pthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.4WAV To FLACGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git612182430SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 524.0023.0522.8022.821. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.10.3GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git70140210280350SE +/- 0.60, N = 3SE +/- 0.77, N = 3SE +/- 0.72, N = 3SE +/- 0.36, N = 3324.22310.17310.96310.331. (CC) gcc options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SwirlGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git2004006008001000SE +/- 1.00, N = 3SE +/- 0.88, N = 3SE +/- 1.67, N = 3SE +/- 0.33, N = 39909951001989-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: RotateGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git60120180240300SE +/- 6.33, N = 12SE +/- 2.00, N = 3SE +/- 1.00, N = 3SE +/- 13.19, N = 12258268267256-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git100200300400500SE +/- 3.13, N = 15SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3466475473478-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git80160240320400SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 1.00, N = 3373368353356-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: ResizingGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git50100150200250SE +/- 2.53, N = 4SE +/- 2.33, N = 3SE +/- 0.58, N = 3SE +/- 1.20, N = 3219215217216-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git60120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3259247245245-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git90180270360450SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 2.67, N = 3429397402404-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git246810SE +/- 0.014, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.010, N = 37.4976.8886.8946.8841. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm

Helsing

Digit Range: 14 digit

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git816243240SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 333.5734.3734.4534.461. (CC) gcc options: -O2 -pthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git1224364860SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 355.0255.3355.0555.221. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git1224364860SE +/- 4.33, N = 12SE +/- 0.14, N = 3SE +/- 0.19, N = 3SE +/- 3.02, N = 1547.6254.5354.4147.411. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git60120180240300SE +/- 2.22, N = 3SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.13, N = 3185.86264.17235.76235.731. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git4080120160200SE +/- 0.98, N = 3SE +/- 0.10, N = 3SE +/- 0.30, N = 3SE +/- 0.18, N = 3115.60197.15170.82171.381. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git0.67321.34642.01962.69283.366SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 32.9082.9922.9492.9541. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, LosslessGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git1.28932.57863.86795.15726.4465SE +/- 0.007, N = 3SE +/- 0.004, N = 3SE +/- 0.009, N = 3SE +/- 0.006, N = 35.6875.7305.6765.6851. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 10, LosslessGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git0.99251.9852.97753.974.9625SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 34.4114.4114.3854.3691. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git700M1400M2100M2800M3500MSE +/- 5892367.95, N = 3SE +/- 120185.04, N = 3SE +/- 66666.67, N = 3SE +/- 57735.03, N = 321978000003500833333349906666734993000001. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 192 - Buffer Length: 256 - Filter Length: 32GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git1100M2200M3300M4400M5500MSE +/- 440958.55, N = 3SE +/- 100000.00, N = 3SE +/- 470224.53, N = 3SE +/- 33333.33, N = 331488666675249800000524666666752476333331. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git40M80M120M160M200MSE +/- 448441.75, N = 3SE +/- 5773.50, N = 3SE +/- 3333.33, N = 3SE +/- 6666.67, N = 31527600001644200001644566671644633331. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 192 - Buffer Length: 256 - Filter Length: 512GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git50M100M150M200M250MSE +/- 5773.50, N = 3SE +/- 15275.25, N = 3SE +/- 8819.17, N = 3SE +/- 3333.33, N = 32241400002465900002466033332466533331. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git160320480640800SE +/- 4.39, N = 3SE +/- 7.29, N = 3SE +/- 1.59, N = 3SE +/- 2.49, N = 3746.97690.39684.55684.361. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git714212835SE +/- 0.18, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 329.8827.6227.3827.381. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git2004006008001000SE +/- 0.44, N = 3SE +/- 0.57, N = 3SE +/- 0.64, N = 3SE +/- 0.71, N = 3783.60719.31719.62719.801. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git714212835SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 331.3428.7728.7928.791. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.4WAV To Opus EncodeGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git1632486480SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 572.9568.8868.1468.571. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git246810SE +/- 0.024, N = 3SE +/- 0.011, N = 3SE +/- 0.026, N = 3SE +/- 0.026, N = 37.4948.2557.9737.9711. (CXX) g++ options: -pipe -O3 -ffast-math -mcpu=ampere1 -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

Length: 1e13

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 12.1Length: 1e13GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git48121620SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 314.0114.0414.0114.001. (CXX) g++ options: -O3 -mcpu=ampere1 -flto

QuantLib

Configuration: Multi-Threaded

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Multi-ThreadedGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git70K140K210K280K350KSE +/- 98.22, N = 3SE +/- 37.36, N = 3SE +/- 31.64, N = 3SE +/- 88.76, N = 3300689.6313766.7314045.6314473.71. (CXX) g++ options: -O3 -march=native -fPIE -pie

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git40K80K120K160K200KSE +/- 68.80, N = 3SE +/- 134.78, N = 3SE +/- 78.47, N = 3SE +/- 56.65, N = 31719491724281740431734771. (CC) gcc options: -pedantic -O3

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: KostyaGCC 13.2 - DefaultClang 18.1.30.26550.5310.79651.0621.3275SE +/- 0.00, N = 3SE +/- 0.00, N = 31.181.151. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweetGCC 13.2 - DefaultClang 18.1.30.49730.99461.49191.98922.4865SE +/- 0.00, N = 3SE +/- 0.00, N = 32.212.181. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandomGCC 13.2 - DefaultClang 18.1.30.14630.29260.43890.58520.7315SE +/- 0.00, N = 3SE +/- 0.00, N = 30.650.641. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: PartialTweetsGCC 13.2 - DefaultClang 18.1.30.48380.96761.45141.93522.419SE +/- 0.00, N = 3SE +/- 0.01, N = 32.152.121. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: DistinctUserIDGCC 13.2 - DefaultClang 18.1.30.49950.9991.49851.9982.4975SE +/- 0.00, N = 3SE +/- 0.00, N = 32.222.201. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git300K600K900K1200K1500KSE +/- 0.00, N = 5SE +/- 1747.58, N = 5SE +/- 0.00, N = 5SE +/- 1110.84, N = 513873501549749157816015799741. (CC) gcc options: -O3 -mcpu=ampere1 -flto -march=native

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: DefaultGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git3691215SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 39.2611.0011.2011.28-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git246810SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.607.437.567.56-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, LosslessGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git0.26780.53560.80341.07121.339SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.121.191.181.18-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, Highest CompressionGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git0.7741.5482.3223.0963.87SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.803.333.433.44-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, Lossless, Highest CompressionGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git0.10350.2070.31050.4140.5175SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.460.460.46-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg


Phoronix Test Suite v10.8.5