AMD EPYC Genoa Compilers

Genoa compiler benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2305211-NE-EPYCGENOA71&grw&sor.

AMD EPYC Genoa CompilersProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 13.1.1Clang 16.0.32 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)AMD Titanite_4G (RTI1004D BIOS)AMD Device 14a41520GB2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007ASPEEDVGA HDMIBroadcom NetXtreme BCM5720 PCIeFedora Linux 386.2.15-300.fc38.x86_64 (x86_64)GCC 13.1.1 20230511xfs640x480Clang 16.0.3 + LLVM 16.0.3OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 13.1.1: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Disk Details- none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096 Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111 Python Details- Python 3.11.3Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC Genoa Compilerstscp: AI Chess Performancecryptopp: Keyed Algorithmscryptopp: Unkeyed Algorithmssmhasher: wyhashsmhasher: SHA3-256smhasher: Spooky32smhasher: fasthash32smhasher: t1ha2_atoncesmhasher: t1ha0_aes_avx2 x86_64smhasher: MeowHash x86_64 AES-NIcompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedz3: 2.smt2compress-lz4: 9 - Compression Speedz3: 1.smt2compress-lz4: 9 - Decompression Speedtjbench: Decompression Throughputstargate: 44100 - 512stargate: 192000 - 512stargate: 44100 - 1024stargate: 192000 - 1024astcenc: Fastastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivedraco: Liondraco: Church Facadeopenjpeg: NASA Curiosity Panorama M34webp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionespeak: Text-To-Speech Synthesissecuremark: SecureMark-TLSquantlib: minibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2mrbayes: Primate Phylogeny Analysislammps: 20k Atomslammps: Rhodopsin Proteinonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUkripke: gpaw: Carbon Nanotubecoremark: CoreMark Size 666 - Iterations Per Secondstockfish: Total Timecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingjohn-the-ripper: bcryptjohn-the-ripper: WPA PSKjohn-the-ripper: Blowfishjohn-the-ripper: HMAC-SHA512john-the-ripper: MD5compress-zstd: 12 - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Super Fastkvazaar: Bosphorus 4K - Ultra Fastaobench: 2048 x 2048 - Total Timegraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacesvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Kx265: Bosphorus 4Kc-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timeuvg266: Bosphorus 4K - Mediumuvg266: Bosphorus 1080p - Mediumuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastuvg266: Bosphorus 1080p - Very Fastuvg266: Bosphorus 1080p - Super Fastuvg266: Bosphorus 1080p - Ultra Fastavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessavifenc: 10, Losslessliquid-dsp: 64 - 256 - 57liquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 57liquid-dsp: 384 - 256 - 57openssl: SHA256openssl: SHA512openssl: RSA4096openssl: RSA4096openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305memcached: 1:5memcached: 1:10memcached: 1:100sqlite: 1faiss: demo_sift1Msqlite: 2pgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencymysqlslap: 2048mysqlslap: 4096simdjson: Kostyasimdjson: TopTweetsimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDsmhasher: wyhashsmhasher: SHA3-256smhasher: Spooky32smhasher: fasthash32smhasher: t1ha2_atoncesmhasher: t1ha0_aes_avx2 x86_64smhasher: MeowHash x86_64 AES-NIGCC 13.1.1Clang 16.0.31661988615.450688435.81009930125.16208.7720584.578721.0621054.87101956.3153237.3515466.8717800.846.2416409.972.23045.8425.89516305.1219.3887385.8371832.5168566.2705592.855186718.8068552.0326117.027512.83945243675017982418.9311.541.353.210.5527.4902887533238.86902.231276.0898251.340330.054113.33653.79549.3702191.222406.70324078880140.8597872619.7253375322135926912461155277322795126133332325628662566728819667312.61662.616.81413.08.311341.434.7735.9468.5770.1070.5128.94029586601311221612098110895.52754.952176.031143.42921.593.0636.12327.4339.0846.6346.9746.72112.43136.19147.3761.49533.8072.5224.6483.7084641933333849066666711285333333116476666672608925602008003585475271935.92921019.61014036223867180135296559715440539847207088735249272033827.483302722.114105286.362.02859.4022.49633621930.2393853944.156.881.266.566.7814.9682113.41326.66122.16721.09720.68345.2231758567703.995610396.46125429923.05363.9021089.017052.2422604.26102085.1852151.9715501.5517350.552.3116246.750.8516340.0217.4742656.6911752.8803587.1613003.257372780.9510570.6249117.753612.76565106655916208619.7211.751.524.160.6328.2663125703391.14285.585171.4248072.488322.899112.10754.31049.689468.269885.030217323727140.2827173897.068183517643140309.81632.716.81387.58.261343.537.3738.3769.8568.5470.1631.1572207659172319011288358755.79656.135170.787146.21625.383.2035.65429.6542.4947.4046.8447.56133.05146.20156.5058.88532.6562.4964.2203.5274680200000866523333311762333333124183333332644451698278270162569071901.72931038.9645125858407179635113652315383506105404404465918872.0452.44234571730.2323733794.437.331.228.228.5214.0651041.17526.71221.87325.66720.32244.965OpenBenchmarking.org

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 16.0.3GCC 13.1.1400K800K1200K1600K2000KSE +/- 2743.38, N = 5SE +/- 2555.19, N = 5175856716619881. (CC) gcc options: -O3 -march=native -flto

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsClang 16.0.3GCC 13.1.1150300450600750SE +/- 0.08, N = 3SE +/- 0.39, N = 3704.00615.451. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsGCC 13.1.1Clang 16.0.390180270360450SE +/- 0.35, N = 3SE +/- 0.05, N = 3435.81396.461. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe

SMHasher

Hash: wyhash

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: wyhashGCC 13.1.1Clang 16.0.36K12K18K24K30KSE +/- 67.62, N = 3SE +/- 22.88, N = 330309.1229923.05-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: SHA3-256Clang 16.0.3GCC 13.1.180160240320400SE +/- 1.16, N = 3SE +/- 0.05, N = 3363.90228.02-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: Spooky32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: Spooky32Clang 16.0.3GCC 13.1.15K10K15K20K25KSE +/- 2.94, N = 3SE +/- 0.25, N = 321089.0120584.57-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: fasthash32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: fasthash32GCC 13.1.1Clang 16.0.32K4K6K8K10KSE +/- 0.28, N = 3SE +/- 0.01, N = 38812.447052.24-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceClang 16.0.3GCC 13.1.15K10K15K20K25KSE +/- 0.45, N = 3SE +/- 2.58, N = 322604.2621054.87-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64Clang 16.0.3GCC 13.1.120K40K60K80K100KSE +/- 33.80, N = 3SE +/- 10.93, N = 3102085.18101972.62-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIGCC 13.1.1Clang 16.0.311K22K33K44K55KSE +/- 3.22, N = 3SE +/- 6.98, N = 353264.2352151.97-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedClang 16.0.3GCC 13.1.13K6K9K12K15KSE +/- 139.51, N = 3SE +/- 222.63, N = 315501.5515466.871. (CC) gcc options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 51.33, N = 3SE +/- 77.84, N = 317800.817350.51. (CC) gcc options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedClang 16.0.3GCC 13.1.11224364860SE +/- 0.54, N = 3SE +/- 0.04, N = 352.3146.241. (CC) gcc options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 141.14, N = 3SE +/- 103.96, N = 316409.916246.71. (CC) gcc options: -O3 -march=native -flto

Z3 Theorem Prover

SMT File: 2.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 2.smt2GCC 13.1.11632486480SE +/- 0.06, N = 372.231. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedClang 16.0.3GCC 13.1.11122334455SE +/- 0.53, N = 3SE +/- 0.49, N = 350.8545.841. (CC) gcc options: -O3 -march=native -flto

Z3 Theorem Prover

SMT File: 1.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 1.smt2GCC 13.1.1612182430SE +/- 0.19, N = 325.901. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedClang 16.0.3GCC 13.1.14K8K12K16K20KSE +/- 50.74, N = 3SE +/- 80.69, N = 316340.016305.11. (CC) gcc options: -O3 -march=native -flto

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 13.1.1Clang 16.0.350100150200250SE +/- 0.85, N = 3SE +/- 0.20, N = 3219.39217.471. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 44100 - Buffer Size: 512Clang 16.0.3GCC 13.1.1246810SE +/- 0.001401, N = 3SE +/- 0.001560, N = 36.6911755.8371831. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 512Clang 16.0.3GCC 13.1.10.64811.29621.94432.59243.2405SE +/- 0.007221, N = 3SE +/- 0.007283, N = 32.8803582.5168561. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 44100 - Buffer Size: 1024Clang 16.0.3GCC 13.1.1246810SE +/- 0.005456, N = 3SE +/- 0.000734, N = 37.1613006.2705591. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 1024Clang 16.0.3GCC 13.1.10.73291.46582.19872.93163.6645SE +/- 0.007486, N = 3SE +/- 0.006270, N = 33.2573722.8551861. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: FastClang 16.0.3GCC 13.1.12004006008001000SE +/- 5.60, N = 3SE +/- 3.96, N = 3780.95718.811. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumClang 16.0.3GCC 13.1.1120240360480600SE +/- 2.38, N = 3SE +/- 2.85, N = 3570.62552.031. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughClang 16.0.3GCC 13.1.1306090120150SE +/- 0.09, N = 3SE +/- 0.12, N = 3117.75117.031. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveGCC 13.1.1Clang 16.0.33691215SE +/- 0.01, N = 3SE +/- 0.05, N = 312.8412.771. (CXX) g++ options: -O3 -march=native -flto -pthread

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: LionClang 16.0.3GCC 13.1.111002200330044005500SE +/- 44.34, N = 3SE +/- 29.18, N = 3510652431. (CXX) g++ options: -O3 -march=native -flto

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Church FacadeClang 16.0.3GCC 13.1.114002800420056007000SE +/- 4.84, N = 3SE +/- 13.35, N = 3655967501. (CXX) g++ options: -O3 -march=native -flto

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34Clang 16.0.3GCC 13.1.140K80K120K160K200KSE +/- 571.61, N = 3SE +/- 812.57, N = 31620861798241. (CXX) g++ options: -O3 -march=native -flto -rdynamic

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: DefaultClang 16.0.3GCC 13.1.1510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 319.7218.93-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100Clang 16.0.3GCC 13.1.13691215SE +/- 0.02, N = 3SE +/- 0.00, N = 311.7511.54-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessClang 16.0.3GCC 13.1.10.3420.6841.0261.3681.71SE +/- 0.00, N = 3SE +/- 0.00, N = 31.521.35-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionClang 16.0.3GCC 13.1.10.9361.8722.8083.7444.68SE +/- 0.00, N = 3SE +/- 0.00, N = 34.163.21-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionClang 16.0.3GCC 13.1.10.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 30.630.55-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 1.51Text-To-Speech SynthesisGCC 13.1.1Clang 16.0.3714212835SE +/- 0.02, N = 4SE +/- 0.15, N = 427.4928.271. (CXX) g++ options: -O3 -march=native -flto -lpthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 16.0.3GCC 13.1.170K140K210K280K350KSE +/- 1404.09, N = 3SE +/- 872.41, N = 33125702887531. (CC) gcc options: -pedantic -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.30Clang 16.0.3GCC 13.1.17001400210028003500SE +/- 7.81, N = 3SE +/- 2.59, N = 33391.13238.81. (CXX) g++ options: -O3 -march=native -fPIE -pie

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.1.1Clang 16.0.315003000450060007500SE +/- 7.29, N = 3SE +/- 185.48, N = 156902.234285.591. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.1.1Clang 16.0.360120180240300SE +/- 0.29, N = 3SE +/- 7.42, N = 15276.09171.421. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.1.1Clang 16.0.32K4K6K8K10KSE +/- 43.34, N = 3SE +/- 80.21, N = 38251.348072.491. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.1.1Clang 16.0.370140210280350SE +/- 1.73, N = 3SE +/- 3.21, N = 3330.05322.901. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisClang 16.0.3GCC 13.1.1306090120150SE +/- 0.66, N = 3SE +/- 0.12, N = 3112.11113.34-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -flto -lm

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsClang 16.0.3GCC 13.1.11224364860SE +/- 0.04, N = 3SE +/- 0.07, N = 354.3153.801. (CXX) g++ options: -O3 -march=native -flto -lm -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinClang 16.0.3GCC 13.1.11122334455SE +/- 0.45, N = 15SE +/- 0.43, N = 1549.6949.371. (CXX) g++ options: -O3 -march=native -flto -lm -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClang 16.0.3GCC 13.1.15001000150020002500SE +/- 8.52, N = 15SE +/- 42.69, N = 15468.272191.22-fopenmp=libomp - MIN: 441.14-fopenmp - MIN: 1875.061. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClang 16.0.3GCC 13.1.15001000150020002500SE +/- 2.20, N = 3SE +/- 26.76, N = 15885.032406.70-fopenmp=libomp - MIN: 871.44-fopenmp - MIN: 2191.91. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4GCC 13.1.1Clang 16.0.370M140M210M280M350MSE +/- 6307414.85, N = 15SE +/- 4076394.65, N = 15324078880217323727-fopenmp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -flto

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon NanotubeClang 16.0.3GCC 13.1.1306090120150SE +/- 0.24, N = 3SE +/- 0.24, N = 3140.28140.861. (CC) gcc options: -shared -O3 -march=native -flto -lxc -lblas -lmpi

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 13.1.1Clang 16.0.32M4M6M8M10MSE +/- 3457.10, N = 3SE +/- 7211.60, N = 37872619.737173897.071. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimeGCC 13.1.1Clang 16.0.3110M220M330M440M550MSE +/- 3102500.18, N = 3SE +/- 8586129.93, N = 15532213592517643140-lgcov -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -mbmi2 -fno-peel-loops -fno-tracer -flto=jobserver-fprofile-use1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -flto -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingGCC 13.1.1150K300K450K600K750KSE +/- 7447.92, N = 36912461. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingGCC 13.1.1200K400K600K800K1000KSE +/- 4772.36, N = 311552771. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptGCC 13.1.170K140K210K280K350KSE +/- 288.40, N = 33227951. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGCC 13.1.1300K600K900K1200K1500KSE +/- 4630.81, N = 312613331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishGCC 13.1.170K140K210K280K350KSE +/- 401.92, N = 33232561. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512GCC 13.1.160M120M180M240M300MSE +/- 2757367.12, N = 32866256671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5GCC 13.1.16M12M18M24M30MSE +/- 29475.04, N = 3288196671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Zstd Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression SpeedGCC 13.1.1Clang 16.0.370140210280350SE +/- 4.38, N = 3SE +/- 3.40, N = 3312.6309.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression SpeedGCC 13.1.1Clang 16.0.3400800120016002000SE +/- 3.39, N = 3SE +/- 6.60, N = 31662.61632.7-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression SpeedClang 16.0.3GCC 13.1.148121620SE +/- 0.00, N = 3SE +/- 0.03, N = 316.816.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression SpeedGCC 13.1.1Clang 16.0.330060090012001500SE +/- 9.49, N = 3SE +/- 1.96, N = 31413.01387.5-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression SpeedGCC 13.1.1Clang 16.0.3246810SE +/- 0.04, N = 3SE +/- 0.01, N = 38.318.26-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression SpeedClang 16.0.3GCC 13.1.130060090012001500SE +/- 4.51, N = 3SE +/- 4.13, N = 31343.51341.4-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowClang 16.0.3GCC 13.1.1918273645SE +/- 0.14, N = 3SE +/- 0.04, N = 337.3734.77-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumClang 16.0.3GCC 13.1.1918273645SE +/- 0.17, N = 3SE +/- 0.06, N = 338.3735.94-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastClang 16.0.3GCC 13.1.11632486480SE +/- 0.57, N = 3SE +/- 0.74, N = 469.8568.57-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.34, N = 3SE +/- 0.16, N = 370.1068.54-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.43, N = 3SE +/- 0.62, N = 370.5170.16-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 13.1.1Clang 16.0.3714212835SE +/- 0.00, N = 3SE +/- 0.00, N = 328.9431.161. (CC) gcc options: -lm -O3 -march=native -flto

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SwirlGCC 13.1.1Clang 16.0.36001200180024003000SE +/- 15.88, N = 3SE +/- 3.84, N = 3295822071. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: RotateGCC 13.1.1Clang 16.0.3140280420560700SE +/- 3.28, N = 3SE +/- 0.33, N = 36606591. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenClang 16.0.3GCC 13.1.1400800120016002000SE +/- 12.49, N = 3SE +/- 1.00, N = 3172313111. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedGCC 13.1.1Clang 16.0.35001000150020002500SE +/- 4.26, N = 3SE +/- 2.33, N = 3221619011. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingClang 16.0.3GCC 13.1.1306090120150SE +/- 2.70, N = 15SE +/- 6.54, N = 121281201. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: Noise-GaussianGCC 13.1.1Clang 16.0.32004006008001000SE +/- 9.66, N = 6SE +/- 11.26, N = 39818351. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: HWB Color SpaceGCC 13.1.1Clang 16.0.32004006008001000SE +/- 15.72, N = 3SE +/- 5.51, N = 310898751. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 4KClang 16.0.3GCC 13.1.11.30412.60823.91235.21646.5205SE +/- 0.025, N = 3SE +/- 0.067, N = 35.7965.5271. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 4KClang 16.0.3GCC 13.1.11326395265SE +/- 0.58, N = 5SE +/- 0.42, N = 356.1454.951. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 4KGCC 13.1.1Clang 16.0.34080120160200SE +/- 1.61, N = 3SE +/- 1.96, N = 12176.03170.791. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 4KClang 16.0.3GCC 13.1.1306090120150SE +/- 1.04, N = 12SE +/- 1.27, N = 7146.22143.431. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KClang 16.0.3GCC 13.1.1612182430SE +/- 0.34, N = 3SE +/- 0.22, N = 625.3821.591. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 13.1.1Clang 16.0.30.72071.44142.16212.88283.6035SE +/- 0.058, N = 15SE +/- 0.042, N = 33.0633.2031. (CC) gcc options: -lm -lpthread -O3 -march=native -flto

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeClang 16.0.3GCC 13.1.1246810SE +/- 0.053, N = 7SE +/- 0.053, N = 85.6546.1231. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: MediumClang 16.0.3GCC 13.1.1714212835SE +/- 0.07, N = 3SE +/- 0.09, N = 329.6527.431. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: MediumClang 16.0.3GCC 13.1.11020304050SE +/- 0.29, N = 3SE +/- 0.09, N = 342.4939.081. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Very FastClang 16.0.3GCC 13.1.11122334455SE +/- 0.42, N = 3SE +/- 0.50, N = 347.4046.631. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Super FastGCC 13.1.1Clang 16.0.31122334455SE +/- 0.12, N = 3SE +/- 0.11, N = 346.9746.841. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Ultra FastClang 16.0.3GCC 13.1.11122334455SE +/- 0.56, N = 3SE +/- 0.37, N = 347.5646.721. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Very FastClang 16.0.3GCC 13.1.1306090120150SE +/- 0.64, N = 3SE +/- 0.05, N = 3133.05112.431. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 1080p - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Super FastClang 16.0.3GCC 13.1.1306090120150SE +/- 0.55, N = 3SE +/- 0.33, N = 3146.20136.191. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Ultra FastClang 16.0.3GCC 13.1.1306090120150SE +/- 2.25, N = 3SE +/- 1.08, N = 11156.50147.371. (CXX) g++ options: -O3 -march=native -flto

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 0Clang 16.0.3GCC 13.1.11428425670SE +/- 0.13, N = 3SE +/- 0.05, N = 358.8961.501. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2Clang 16.0.3GCC 13.1.1816243240SE +/- 0.29, N = 3SE +/- 0.05, N = 332.6633.811. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6Clang 16.0.3GCC 13.1.10.56751.1351.70252.272.8375SE +/- 0.020, N = 3SE +/- 0.025, N = 52.4962.5221. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessClang 16.0.3GCC 13.1.11.04582.09163.13744.18325.229SE +/- 0.035, N = 3SE +/- 0.059, N = 34.2204.6481. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 10, LosslessClang 16.0.3GCC 13.1.10.83431.66862.50293.33724.1715SE +/- 0.020, N = 3SE +/- 0.023, N = 33.5273.7081. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.11000M2000M3000M4000M5000MSE +/- 1808314.13, N = 3SE +/- 8346922.25, N = 3468020000046419333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.12000M4000M6000M8000M10000MSE +/- 913479.31, N = 3SE +/- 13312441.97, N = 3866523333384906666671. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.13000M6000M9000M12000M15000MSE +/- 2603416.56, N = 3SE +/- 4096068.58, N = 311762333333112853333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 384 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 384 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.13000M6000M9000M12000M15000MSE +/- 9527737.29, N = 3SE +/- 7264831.57, N = 312418333333116476666671. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256Clang 16.0.3GCC 13.1.160000M120000M180000M240000M300000MSE +/- 64283484.40, N = 3SE +/- 48171917.42, N = 3264445169827260892560200-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512Clang 16.0.3GCC 13.1.120000M40000M60000M80000M100000MSE +/- 4419075.00, N = 3SE +/- 845012614.25, N = 58270162569080035854752-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096GCC 13.1.1Clang 16.0.315K30K45K60K75KSE +/- 24.90, N = 3SE +/- 10.37, N = 371935.971901.7-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096Clang 16.0.3GCC 13.1.1600K1200K1800K2400K3000KSE +/- 437.47, N = 3SE +/- 105.74, N = 32931038.92921019.6-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20GCC 13.1.1Clang 16.0.3200000M400000M600000M800000M1000000MSE +/- 152062084.95, N = 3SE +/- 51499603.07, N = 31014036223867645125858407-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMGCC 13.1.1Clang 16.0.3400000M800000M1200000M1600000M2000000MSE +/- 1317927821.70, N = 3SE +/- 616604050.97, N = 318013529655971796351136523-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMGCC 13.1.1Clang 16.0.3300000M600000M900000M1200000M1500000MSE +/- 293680009.95, N = 3SE +/- 2176141824.81, N = 315440539847201538350610540-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305GCC 13.1.1Clang 16.0.3150000M300000M450000M600000M750000MSE +/- 116220547.85, N = 3SE +/- 48134544.35, N = 3708873524927440446591887-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Memcached

Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:5GCC 13.1.1400K800K1200K1600K2000KSE +/- 6222.06, N = 32033827.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10GCC 13.1.1700K1400K2100K2800K3500KSE +/- 12793.89, N = 33302722.111. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100GCC 13.1.1900K1800K2700K3600K4500KSE +/- 15774.50, N = 34105286.361. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 1GCC 13.1.1Clang 16.0.30.46010.92021.38031.84042.3005SE +/- 0.002, N = 3SE +/- 0.015, N = 32.0282.0451. (CC) gcc options: -O3 -march=native -flto -lz -lm

Faiss

Test: demo_sift1M

OpenBenchmarking.orgSeconds, Fewer Is BetterFaiss 1.7.4Test: demo_sift1MGCC 13.1.11326395265SE +/- 0.19, N = 259.401. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

SQLite

Threads / Copies: 2

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 2Clang 16.0.3GCC 13.1.10.56161.12321.68482.24642.808SE +/- 0.032, N = 3SE +/- 0.027, N = 32.4422.4961. (CC) gcc options: -O3 -march=native -flto -lz -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read OnlyClang 16.0.3GCC 13.1.1700K1400K2100K2800K3500KSE +/- 34767.93, N = 12SE +/- 57281.48, N = 12345717333621931. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average LatencyClang 16.0.3GCC 13.1.10.05380.10760.16140.21520.269SE +/- 0.002, N = 12SE +/- 0.004, N = 120.2320.2391. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 2048GCC 13.1.1Clang 16.0.380160240320400SE +/- 0.65, N = 3SE +/- 2.37, N = 3385373-fPIC -pie -shared -lrt -lstdc++-lcrypt -lssl -lcrypto1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lpthread -lz -ldl -lm

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 4096GCC 13.1.1Clang 16.0.390180270360450SE +/- 2.65, N = 3SE +/- 3.39, N = 3394379-fPIC -pie -shared -lrt -lstdc++-lcrypt -lssl -lcrypto1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lpthread -lz -ldl -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: KostyaClang 16.0.3GCC 13.1.10.99681.99362.99043.98724.984SE +/- 0.00, N = 3SE +/- 0.00, N = 34.434.151. (CXX) g++ options: -O3 -march=native -flto

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweetClang 16.0.3GCC 13.1.1246810SE +/- 0.06, N = 15SE +/- 0.01, N = 37.336.881. (CXX) g++ options: -O3 -march=native -flto

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandomGCC 13.1.1Clang 16.0.30.28350.5670.85051.1341.4175SE +/- 0.00, N = 3SE +/- 0.00, N = 31.261.221. (CXX) g++ options: -O3 -march=native -flto

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsClang 16.0.3GCC 13.1.1246810SE +/- 0.12, N = 12SE +/- 0.02, N = 38.226.561. (CXX) g++ options: -O3 -march=native -flto

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserIDClang 16.0.3GCC 13.1.1246810SE +/- 0.07, N = 15SE +/- 0.02, N = 38.526.781. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: wyhash

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: wyhashClang 16.0.3GCC 13.1.148121620SE +/- 0.00, N = 3SE +/- 0.00, N = 314.0714.97-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: SHA3-256Clang 16.0.3GCC 13.1.15001000150020002500SE +/- 8.90, N = 3SE +/- 0.15, N = 31041.181727.90-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: Spooky32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: Spooky32GCC 13.1.1Clang 16.0.3612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 326.6626.71-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: fasthash32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: fasthash32Clang 16.0.3GCC 13.1.1510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 521.8722.17-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceGCC 13.1.1Clang 16.0.3612182430SE +/- 0.00, N = 3SE +/- 0.01, N = 321.1025.67-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64Clang 16.0.3GCC 13.1.1510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 320.3220.68-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIClang 16.0.3GCC 13.1.11020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 344.9745.22-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto


Phoronix Test Suite v10.8.5