AmpereOne GCC Clang Compiler Benchmarking

AmpereOne compiler testing by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2409062-NE-AMPEREONE13&sor.

AmpereOne GCC Clang Compiler BenchmarkingProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 GitAmpereOne @ 3.20GHz (192 Cores)Supermicro ARS-211M-NR R13SPD v1.02 (T20240726102529 BIOS)Ampere Computing LLC Device e2088 x 64GB DDR5-5200MT/s3841GB SAMSUNG MZQL23T8HCLS-00A07 + 960GB SAMSUNG MZ1L2960HCJR-00A07ASPEEDVGA HDMI2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb + 2 x Mellanox MT2892Ubuntu 24.046.11.0-061100rc6daily20240904-generic-64k (aarch64)GCC 13.2.0ext41920x1080Clang 18.1.3Clang 19.1.0Clang 20.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -mcpu=ampere1 -flto" CFLAGS="-O3 -mcpu=ampere1 -flto"Compiler Details- GCC 13.2 - Default: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v Processor Details- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AmpereOne GCC Clang Compiler Benchmarkingquantlib: Multi-Threadedetcpak: Multi-Threaded - ETC2minibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2lammps: 20k Atomslammps: Rhodopsin Proteinsimdjson: Kostyasimdjson: TopTweetsimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDwebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressiontscp: AI Chess Performancegraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessavifenc: 10, Losslessc-ray: 4K - 16c-ray: 5K - 16povray: Trace Timeprimesieve: 1e13encode-flac: WAV To FLACencode-opus: WAV To Opus Encodegcrypt: helsing: 14 digitsecuremark: SecureMark-TLSliquid-dsp: 128 - 256 - 32liquid-dsp: 192 - 256 - 32liquid-dsp: 128 - 256 - 512liquid-dsp: 192 - 256 - 512gromacs: MPI CPU - water_GMX50_bareGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git300689.64303.508746.97329.879783.59931.34455.01847.6201.182.210.652.152.229.266.601.122.800.391387350990258466373219259429786951891624185.855115.6022.9085.6874.41121.69338.1797.49414.00623.99972.952324.22033.567171949219780000031488666671527600002241400007.497313766.74339.779690.38827.616719.31328.77355.33054.5261.152.180.642.122.211.007.431.193.330.461549749995268475368215247397755498975049264.168197.1542.9925.7304.41118.63532.8028.25514.03923.04668.876310.17434.373172428350083333352498000001644200002465900006.888314045.64381.343684.54627.382719.62428.78555.04954.40811.207.561.183.430.4615781601001267473353217245402748668986522235.759170.8192.9495.6764.38518.76533.0617.97314.00822.80268.142310.96234.447174043349906666752466666671644566672466033336.894314473.74385.161684.36327.375719.80228.79255.22447.41411.287.561.183.440.461579974989256478356216245404750631978715235.730171.3752.9545.6854.36920.19035.5557.97113.99522.81868.570310.33134.456173477349930000052476333331644633332466533336.884OpenBenchmarking.org

QuantLib

Configuration: Multi-Threaded

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Multi-ThreadedClang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default70K140K210K280K350KSE +/- 88.76, N = 3SE +/- 31.64, N = 3SE +/- 37.36, N = 3SE +/- 98.22, N = 3314473.7314045.6313766.7300689.61. (CXX) g++ options: -O3 -march=native -fPIE -pie

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC2Clang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default9001800270036004500SE +/- 5.15, N = 3SE +/- 4.76, N = 3SE +/- 1.86, N = 3SE +/- 2.14, N = 34385.164381.344339.784303.511. (CXX) g++ options: -flto -O3 -mcpu=ampere1 -pthread

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git160320480640800SE +/- 4.39, N = 3SE +/- 7.29, N = 3SE +/- 1.59, N = 3SE +/- 2.49, N = 3746.97690.39684.55684.361. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git714212835SE +/- 0.18, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 329.8827.6227.3827.381. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.2 - DefaultClang 20.0 GitClang 19.1.0Clang 18.1.32004006008001000SE +/- 0.44, N = 3SE +/- 0.71, N = 3SE +/- 0.64, N = 3SE +/- 0.57, N = 3783.60719.80719.62719.311. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.2 - DefaultClang 20.0 GitClang 19.1.0Clang 18.1.3714212835SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 331.3428.7928.7928.771. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -mcpu=native -lm

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsClang 18.1.3Clang 20.0 GitClang 19.1.0GCC 13.2 - Default1224364860SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 355.3355.2255.0555.021. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinClang 18.1.3Clang 19.1.0GCC 13.2 - DefaultClang 20.0 Git1224364860SE +/- 0.14, N = 3SE +/- 0.19, N = 3SE +/- 4.33, N = 12SE +/- 3.02, N = 1554.5354.4147.6247.411. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: KostyaGCC 13.2 - DefaultClang 18.1.30.26550.5310.79651.0621.3275SE +/- 0.00, N = 3SE +/- 0.00, N = 31.181.151. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweetGCC 13.2 - DefaultClang 18.1.30.49730.99461.49191.98922.4865SE +/- 0.00, N = 3SE +/- 0.00, N = 32.212.181. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandomGCC 13.2 - DefaultClang 18.1.30.14630.29260.43890.58520.7315SE +/- 0.00, N = 3SE +/- 0.00, N = 30.650.641. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: PartialTweetsGCC 13.2 - DefaultClang 18.1.30.48380.96761.45141.93522.419SE +/- 0.00, N = 3SE +/- 0.01, N = 32.152.121. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: DistinctUserIDGCC 13.2 - DefaultClang 18.1.30.49950.9991.49851.9982.4975SE +/- 0.00, N = 3SE +/- 0.00, N = 32.222.201. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lrt

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: DefaultClang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 311.2811.2011.009.26-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100Clang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 37.567.567.436.60-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, LosslessClang 18.1.3Clang 20.0 GitClang 19.1.0GCC 13.2 - Default0.26780.53560.80341.07121.339SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 31.191.181.181.12-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, Highest CompressionClang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default0.7741.5482.3223.0963.87SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.443.433.332.80-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, Lossless, Highest CompressionClang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default0.10350.2070.31050.4140.5175SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.460.460.460.39-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -mcpu=ampere1 -flto -lm -lpng16 -ljpeg

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default300K600K900K1200K1500KSE +/- 1110.84, N = 5SE +/- 0.00, N = 5SE +/- 1747.58, N = 5SE +/- 0.00, N = 515799741578160154974913873501. (CC) gcc options: -O3 -mcpu=ampere1 -flto -march=native

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SwirlClang 19.1.0Clang 18.1.3GCC 13.2 - DefaultClang 20.0 Git2004006008001000SE +/- 1.67, N = 3SE +/- 0.88, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 31001995990989-lxml2 -lomp-lxml2 -lomp-lgomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: RotateClang 18.1.3Clang 19.1.0GCC 13.2 - DefaultClang 20.0 Git60120180240300SE +/- 2.00, N = 3SE +/- 1.00, N = 3SE +/- 6.33, N = 12SE +/- 13.19, N = 12268267258256-lxml2 -lomp-lxml2 -lomp-lgomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenClang 20.0 GitClang 18.1.3Clang 19.1.0GCC 13.2 - Default100200300400500SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 3.13, N = 15478475473466-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp-lgomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedGCC 13.2 - DefaultClang 18.1.3Clang 20.0 GitClang 19.1.080160240320400SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 3373368356353-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: ResizingGCC 13.2 - DefaultClang 19.1.0Clang 20.0 GitClang 18.1.350100150200250SE +/- 2.53, N = 4SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 2.33, N = 3219217216215-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianGCC 13.2 - DefaultClang 18.1.3Clang 20.0 GitClang 19.1.060120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3259247245245-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceGCC 13.2 - DefaultClang 20.0 GitClang 19.1.0Clang 18.1.390180270360450SE +/- 0.33, N = 3SE +/- 2.67, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3429404402397-lgomp-lxml2 -lomp-lxml2 -lomp-lxml2 -lomp1. (CC) gcc options: -fopenmp -O3 -mcpu=ampere1 -flto -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lX11 -lzstd -llzma -lz -lm -lpthread

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingGCC 13.2 - DefaultClang 18.1.3Clang 20.0 GitClang 19.1.0200K400K600K800K1000KSE +/- 61.46, N = 3SE +/- 700.10, N = 3SE +/- 594.21, N = 3SE +/- 1725.30, N = 37869517554987506317486681. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingClang 19.1.0Clang 20.0 GitClang 18.1.3GCC 13.2 - Default200K400K600K800K1000KSE +/- 763.73, N = 3SE +/- 1082.06, N = 3SE +/- 325.41, N = 3SE +/- 373.10, N = 39865229787159750498916241. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0GCC 13.2 - DefaultClang 20.0 GitClang 19.1.0Clang 18.1.360120180240300SE +/- 2.22, N = 3SE +/- 0.13, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 3185.86235.73235.76264.171. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2GCC 13.2 - DefaultClang 19.1.0Clang 20.0 GitClang 18.1.34080120160200SE +/- 0.98, N = 3SE +/- 0.30, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 3115.60170.82171.38197.151. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6GCC 13.2 - DefaultClang 19.1.0Clang 20.0 GitClang 18.1.30.67321.34642.01962.69283.366SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 3SE +/- 0.002, N = 32.9082.9492.9542.9921. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, LosslessClang 19.1.0Clang 20.0 GitGCC 13.2 - DefaultClang 18.1.31.28932.57863.86795.15726.4465SE +/- 0.009, N = 3SE +/- 0.006, N = 3SE +/- 0.007, N = 3SE +/- 0.004, N = 35.6765.6855.6875.7301. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 10, LosslessClang 20.0 GitClang 19.1.0GCC 13.2 - DefaultClang 18.1.30.99251.9852.97753.974.9625SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 34.3694.3854.4114.4111. (CXX) g++ options: -O3 -fPIC -mcpu=ampere1 -flto -lm

C-Ray

Resolution: 4K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16Clang 18.1.3Clang 19.1.0Clang 20.0 GitGCC 13.2 - Default510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 318.6418.7720.1921.691. (CC) gcc options: -lpthread -lm

C-Ray

Resolution: 5K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16Clang 18.1.3Clang 19.1.0Clang 20.0 GitGCC 13.2 - Default918273645SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 332.8033.0635.5638.181. (CC) gcc options: -lpthread -lm

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeGCC 13.2 - DefaultClang 20.0 GitClang 19.1.0Clang 18.1.3246810SE +/- 0.024, N = 3SE +/- 0.026, N = 3SE +/- 0.026, N = 3SE +/- 0.011, N = 37.4947.9717.9738.2551. (CXX) g++ options: -pipe -O3 -ffast-math -mcpu=ampere1 -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

Length: 1e13

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 12.1Length: 1e13Clang 20.0 GitGCC 13.2 - DefaultClang 19.1.0Clang 18.1.348121620SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 314.0014.0114.0114.041. (CXX) g++ options: -O3 -mcpu=ampere1 -flto

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.4WAV To FLACClang 19.1.0Clang 20.0 GitClang 18.1.3GCC 13.2 - Default612182430SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 522.8022.8223.0524.001. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.4WAV To Opus EncodeClang 19.1.0Clang 20.0 GitClang 18.1.3GCC 13.2 - Default1632486480SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 568.1468.5768.8872.951. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden -logg -lm

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.10.3Clang 18.1.3Clang 20.0 GitClang 19.1.0GCC 13.2 - Default70140210280350SE +/- 0.77, N = 3SE +/- 0.36, N = 3SE +/- 0.72, N = 3SE +/- 0.60, N = 3310.17310.33310.96324.221. (CC) gcc options: -O3 -mcpu=ampere1 -flto -fvisibility=hidden

Helsing

Digit Range: 14 digit

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitGCC 13.2 - DefaultClang 18.1.3Clang 19.1.0Clang 20.0 Git816243240SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 333.5734.3734.4534.461. (CC) gcc options: -O2 -pthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 19.1.0Clang 20.0 GitClang 18.1.3GCC 13.2 - Default40K80K120K160K200KSE +/- 78.47, N = 3SE +/- 56.65, N = 3SE +/- 134.78, N = 3SE +/- 68.80, N = 31740431734771724281719491. (CC) gcc options: -pedantic -O3

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32Clang 18.1.3Clang 20.0 GitClang 19.1.0GCC 13.2 - Default700M1400M2100M2800M3500MSE +/- 120185.04, N = 3SE +/- 57735.03, N = 3SE +/- 66666.67, N = 3SE +/- 5892367.95, N = 335008333333499300000349906666721978000001. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 192 - Buffer Length: 256 - Filter Length: 32Clang 18.1.3Clang 20.0 GitClang 19.1.0GCC 13.2 - Default1100M2200M3300M4400M5500MSE +/- 100000.00, N = 3SE +/- 33333.33, N = 3SE +/- 470224.53, N = 3SE +/- 440958.55, N = 352498000005247633333524666666731488666671. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512Clang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default40M80M120M160M200MSE +/- 6666.67, N = 3SE +/- 3333.33, N = 3SE +/- 5773.50, N = 3SE +/- 448441.75, N = 31644633331644566671644200001527600001. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 192 - Buffer Length: 256 - Filter Length: 512Clang 20.0 GitClang 19.1.0Clang 18.1.3GCC 13.2 - Default50M100M150M200M250MSE +/- 3333.33, N = 3SE +/- 8819.17, N = 3SE +/- 15275.25, N = 3SE +/- 5773.50, N = 32466533332466033332465900002241400001. (CC) gcc options: -O3 -mcpu=ampere1 -flto -pthread -lm -lc -lliquid

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareGCC 13.2 - DefaultClang 19.1.0Clang 18.1.3Clang 20.0 Git246810SE +/- 0.014, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.010, N = 37.4976.8946.8886.8841. (CXX) g++ options: -O3 -mcpu=ampere1 -flto -lm


Phoronix Test Suite v10.8.5