AMD EPYC Genoa Compilers

Genoa compiler benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2305211-NE-EPYCGENOA71&grs&sor.

AMD EPYC Genoa CompilersProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 13.1.1Clang 16.0.32 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)AMD Titanite_4G (RTI1004D BIOS)AMD Device 14a41520GB2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007ASPEEDVGA HDMIBroadcom NetXtreme BCM5720 PCIeFedora Linux 386.2.15-300.fc38.x86_64 (x86_64)GCC 13.1.1 20230511xfs640x480Clang 16.0.3 + LLVM 16.0.3OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 13.1.1: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Disk Details- none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096 Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111 Python Details- Python 3.11.3Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC Genoa Compilersonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUopenssl: ChaCha20-Poly1305openssl: ChaCha20graphics-magick: Swirlgraphics-magick: Sharpenwebp: Quality 100, Highest Compressionsimdjson: DistinctUserIDsimdjson: PartialTweetssmhasher: fasthash32graphics-magick: HWB Color Spaceuvg266: Bosphorus 1080p - Very Fastx265: Bosphorus 4Kgraphics-magick: Noise-Gaussiangraphics-magick: Enhancedstargate: 44100 - 512webp: Quality 100, Lossless, Highest Compressionstargate: 192000 - 512cryptopp: Keyed Algorithmsstargate: 44100 - 1024stargate: 192000 - 1024compress-lz4: 3 - Compression Speedwebp: Quality 100, Losslessopenjpeg: NASA Curiosity Panorama M34compress-lz4: 9 - Compression Speedavifenc: 6, Losslesscryptopp: Unkeyed Algorithmscoremark: CoreMark Size 666 - Iterations Per Seconduvg266: Bosphorus 1080p - Mediumastcenc: Fastpovray: Trace Timesecuremark: SecureMark-TLSuvg266: Bosphorus 4K - Mediumsmhasher: t1ha2_atonceaobench: 2048 x 2048 - Total Timekvazaar: Bosphorus 4K - Slowuvg266: Bosphorus 1080p - Super Fastkvazaar: Bosphorus 4K - Mediumsimdjson: Kostyaliquid-dsp: 384 - 256 - 57simdjson: TopTweetuvg266: Bosphorus 1080p - Ultra Fasttscp: AI Chess Performanceavifenc: 10, Losslesssvt-av1: Preset 4 - Bosphorus 4Kquantlib: avifenc: 0liquid-dsp: 256 - 256 - 57webp: Defaultmysqlslap: 4096avifenc: 2astcenc: Mediumopenssl: SHA512simdjson: LargeRandmysqlslap: 2048svt-av1: Preset 12 - Bosphorus 4Kdraco: Church Facadepgbench: 100 - 800 - Read Onlyespeak: Text-To-Speech Synthesisdraco: Lioncompress-lz4: 1 - Decompression Speedsmhasher: Spooky32kvazaar: Bosphorus 4K - Super Fastminibude: OpenMP - BM2minibude: OpenMP - BM2sqlite: 2svt-av1: Preset 8 - Bosphorus 4Ksmhasher: MeowHash x86_64 AES-NIliquid-dsp: 128 - 256 - 57svt-av1: Preset 13 - Bosphorus 4Kkvazaar: Bosphorus 4K - Very Fastcompress-zstd: 19 - Decompression Speedcompress-zstd: 12 - Decompression Speedwebp: Quality 100uvg266: Bosphorus 4K - Ultra Fastuvg266: Bosphorus 4K - Very Fastopenssl: SHA256smhasher: wyhashmrbayes: Primate Phylogeny Analysisavifenc: 6compress-lz4: 3 - Decompression Speedlammps: 20k Atomscompress-zstd: 12 - Compression Speedtjbench: Decompression Throughputsqlite: 1liquid-dsp: 64 - 256 - 57lammps: Rhodopsin Proteinastcenc: Thoroughcompress-zstd: 19, Long Mode - Compression Speedastcenc: Exhaustivekvazaar: Bosphorus 4K - Ultra Fastgpaw: Carbon Nanotubeopenssl: AES-256-GCMopenssl: RSA4096openssl: AES-128-GCMuvg266: Bosphorus 4K - Super Fastcompress-lz4: 1 - Compression Speedcompress-lz4: 9 - Decompression Speedcompress-zstd: 19, Long Mode - Decompression Speedgraphics-magick: Rotatesmhasher: t1ha0_aes_avx2 x86_64openssl: RSA4096faiss: demo_sift1Mmemcached: 1:100memcached: 1:10memcached: 1:5compress-7zip: Decompression Ratingcompress-7zip: Compression Ratingjohn-the-ripper: MD5john-the-ripper: HMAC-SHA512john-the-ripper: Blowfishjohn-the-ripper: WPA PSKjohn-the-ripper: bcryptcompress-zstd: 19 - Compression Speedz3: 2.smt2z3: 1.smt2kripke: pgbench: 100 - 800 - Read Only - Average Latencyonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUc-ray: Total Time - 4K, 16 Rays Per Pixelstockfish: Total Timegraphics-magick: Resizingsmhasher: MeowHash x86_64 AES-NIsmhasher: t1ha0_aes_avx2 x86_64smhasher: t1ha2_atoncesmhasher: fasthash32smhasher: Spooky32smhasher: SHA3-256smhasher: SHA3-256smhasher: wyhashminibude: OpenMP - BM1minibude: OpenMP - BM1GCC 13.1.1Clang 16.0.32406.707088735249271014036223867295813113.216.786.568721.061089112.4321.5998122165.8371830.552.516856615.4506886.2705592.85518646.241.3517982445.844.648435.8100997872619.72533739.08718.80686.12328875327.4321054.8728.94034.77136.1935.944.15116476666676.88147.3716619883.7085.5273238.861.4951128533333318.9339433.807552.0326800358547521.26385176.0316750336219327.490524317800.820584.5770.10330.0548251.3402.49654.95253237.358490666667143.42968.571413.01662.611.5446.7246.6326089256020030125.16113.3362.52216409.953.795312.6219.3887382.028464193333349.370117.02758.3112.839470.51140.85915440539847202921019.6180135296559746.9715466.8716305.11341.4660101956.3171935.959.4024105286.363302722.112033827.48115527769124628819667286625667323256126133332279516.872.23025.8953240788800.2392191.223.06353221359212045.22320.68321.09722.16726.6612113.413208.7714.968276.0896902.231885.030440446591887645125858407220717234.168.528.227052.24875133.0525.3883519016.6911750.632.880358703.9956107.1613003.25737252.311.5216208650.854.220396.4612547173897.06818342.49780.95105.65431257029.6522604.2631.15737.37146.2038.374.43124183333337.33156.5017585673.5275.7963391.158.8851176233333319.7237932.656570.6249827016256901.22373170.7876559345717328.266510617350.521089.0168.54322.8998072.4882.44256.13552151.978665233333146.21669.851387.51632.711.7547.5647.4026444516982729923.05112.1072.49616246.754.310309.8217.4742652.045468020000049.689117.75368.2612.765670.16140.28215383506105402931038.9179635113652346.8415501.5516340.01343.5659102085.1871901.716.82173237270.232468.2693.20351764314012844.96520.32225.66721.87326.7121041.175363.9014.065171.4244285.585OpenBenchmarking.org

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClang 16.0.3GCC 13.1.15001000150020002500SE +/- 2.20, N = 3SE +/- 26.76, N = 15885.032406.70-fopenmp=libomp - MIN: 871.44-fopenmp - MIN: 2191.91. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305GCC 13.1.1Clang 16.0.3150000M300000M450000M600000M750000MSE +/- 116220547.85, N = 3SE +/- 48134544.35, N = 3708873524927440446591887-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20GCC 13.1.1Clang 16.0.3200000M400000M600000M800000M1000000MSE +/- 152062084.95, N = 3SE +/- 51499603.07, N = 31014036223867645125858407-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SwirlGCC 13.1.1Clang 16.0.36001200180024003000SE +/- 15.88, N = 3SE +/- 3.84, N = 3295822071. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenClang 16.0.3GCC 13.1.1400800120016002000SE +/- 12.49, N = 3SE +/- 1.00, N = 3172313111. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionClang 16.0.3GCC 13.1.10.9361.8722.8083.7444.68SE +/- 0.00, N = 3SE +/- 0.00, N = 34.163.21-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserIDClang 16.0.3GCC 13.1.1246810SE +/- 0.07, N = 15SE +/- 0.02, N = 38.526.781. (CXX) g++ options: -O3 -march=native -flto

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsClang 16.0.3GCC 13.1.1246810SE +/- 0.12, N = 12SE +/- 0.02, N = 38.226.561. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: fasthash32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: fasthash32GCC 13.1.1Clang 16.0.32K4K6K8K10KSE +/- 0.28, N = 3SE +/- 0.01, N = 38812.447052.24-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: HWB Color SpaceGCC 13.1.1Clang 16.0.32004006008001000SE +/- 15.72, N = 3SE +/- 5.51, N = 310898751. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

uvg266

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Very FastClang 16.0.3GCC 13.1.1306090120150SE +/- 0.64, N = 3SE +/- 0.05, N = 3133.05112.431. (CXX) g++ options: -O3 -march=native -flto

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KClang 16.0.3GCC 13.1.1612182430SE +/- 0.34, N = 3SE +/- 0.22, N = 625.3821.591. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: Noise-GaussianGCC 13.1.1Clang 16.0.32004006008001000SE +/- 9.66, N = 6SE +/- 11.26, N = 39818351. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedGCC 13.1.1Clang 16.0.35001000150020002500SE +/- 4.26, N = 3SE +/- 2.33, N = 3221619011. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 44100 - Buffer Size: 512Clang 16.0.3GCC 13.1.1246810SE +/- 0.001401, N = 3SE +/- 0.001560, N = 36.6911755.8371831. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionClang 16.0.3GCC 13.1.10.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 30.630.55-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 512Clang 16.0.3GCC 13.1.10.64811.29621.94432.59243.2405SE +/- 0.007221, N = 3SE +/- 0.007283, N = 32.8803582.5168561. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsClang 16.0.3GCC 13.1.1150300450600750SE +/- 0.08, N = 3SE +/- 0.39, N = 3704.00615.451. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 44100 - Buffer Size: 1024Clang 16.0.3GCC 13.1.1246810SE +/- 0.005456, N = 3SE +/- 0.000734, N = 37.1613006.2705591. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 1024Clang 16.0.3GCC 13.1.10.73291.46582.19872.93163.6645SE +/- 0.007486, N = 3SE +/- 0.006270, N = 33.2573722.8551861. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedClang 16.0.3GCC 13.1.11224364860SE +/- 0.54, N = 3SE +/- 0.04, N = 352.3146.241. (CC) gcc options: -O3 -march=native -flto

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessClang 16.0.3GCC 13.1.10.3420.6841.0261.3681.71SE +/- 0.00, N = 3SE +/- 0.00, N = 31.521.35-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34Clang 16.0.3GCC 13.1.140K80K120K160K200KSE +/- 571.61, N = 3SE +/- 812.57, N = 31620861798241. (CXX) g++ options: -O3 -march=native -flto -rdynamic

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedClang 16.0.3GCC 13.1.11122334455SE +/- 0.53, N = 3SE +/- 0.49, N = 350.8545.841. (CC) gcc options: -O3 -march=native -flto

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessClang 16.0.3GCC 13.1.11.04582.09163.13744.18325.229SE +/- 0.035, N = 3SE +/- 0.059, N = 34.2204.6481. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsGCC 13.1.1Clang 16.0.390180270360450SE +/- 0.35, N = 3SE +/- 0.05, N = 3435.81396.461. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 13.1.1Clang 16.0.32M4M6M8M10MSE +/- 3457.10, N = 3SE +/- 7211.60, N = 37872619.737173897.071. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt

uvg266

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: MediumClang 16.0.3GCC 13.1.11020304050SE +/- 0.29, N = 3SE +/- 0.09, N = 342.4939.081. (CXX) g++ options: -O3 -march=native -flto

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: FastClang 16.0.3GCC 13.1.12004006008001000SE +/- 5.60, N = 3SE +/- 3.96, N = 3780.95718.811. (CXX) g++ options: -O3 -march=native -flto -pthread

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeClang 16.0.3GCC 13.1.1246810SE +/- 0.053, N = 7SE +/- 0.053, N = 85.6546.1231. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 16.0.3GCC 13.1.170K140K210K280K350KSE +/- 1404.09, N = 3SE +/- 872.41, N = 33125702887531. (CC) gcc options: -pedantic -O3

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: MediumClang 16.0.3GCC 13.1.1714212835SE +/- 0.07, N = 3SE +/- 0.09, N = 329.6527.431. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceClang 16.0.3GCC 13.1.15K10K15K20K25KSE +/- 0.45, N = 3SE +/- 2.58, N = 322604.2621054.87-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 13.1.1Clang 16.0.3714212835SE +/- 0.00, N = 3SE +/- 0.00, N = 328.9431.161. (CC) gcc options: -lm -O3 -march=native -flto

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowClang 16.0.3GCC 13.1.1918273645SE +/- 0.14, N = 3SE +/- 0.04, N = 337.3734.77-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

uvg266

Video Input: Bosphorus 1080p - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Super FastClang 16.0.3GCC 13.1.1306090120150SE +/- 0.55, N = 3SE +/- 0.33, N = 3146.20136.191. (CXX) g++ options: -O3 -march=native -flto

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumClang 16.0.3GCC 13.1.1918273645SE +/- 0.17, N = 3SE +/- 0.06, N = 338.3735.94-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: KostyaClang 16.0.3GCC 13.1.10.99681.99362.99043.98724.984SE +/- 0.00, N = 3SE +/- 0.00, N = 34.434.151. (CXX) g++ options: -O3 -march=native -flto

Liquid-DSP

Threads: 384 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 384 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.13000M6000M9000M12000M15000MSE +/- 9527737.29, N = 3SE +/- 7264831.57, N = 312418333333116476666671. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweetClang 16.0.3GCC 13.1.1246810SE +/- 0.06, N = 15SE +/- 0.01, N = 37.336.881. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Ultra FastClang 16.0.3GCC 13.1.1306090120150SE +/- 2.25, N = 3SE +/- 1.08, N = 11156.50147.371. (CXX) g++ options: -O3 -march=native -flto

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 16.0.3GCC 13.1.1400K800K1200K1600K2000KSE +/- 2743.38, N = 5SE +/- 2555.19, N = 5175856716619881. (CC) gcc options: -O3 -march=native -flto

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 10, LosslessClang 16.0.3GCC 13.1.10.83431.66862.50293.33724.1715SE +/- 0.020, N = 3SE +/- 0.023, N = 33.5273.7081. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 4KClang 16.0.3GCC 13.1.11.30412.60823.91235.21646.5205SE +/- 0.025, N = 3SE +/- 0.067, N = 35.7965.5271. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.30Clang 16.0.3GCC 13.1.17001400210028003500SE +/- 7.81, N = 3SE +/- 2.59, N = 33391.13238.81. (CXX) g++ options: -O3 -march=native -fPIE -pie

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 0Clang 16.0.3GCC 13.1.11428425670SE +/- 0.13, N = 3SE +/- 0.05, N = 358.8961.501. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.13000M6000M9000M12000M15000MSE +/- 2603416.56, N = 3SE +/- 4096068.58, N = 311762333333112853333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: DefaultClang 16.0.3GCC 13.1.1510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 319.7218.93-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 4096GCC 13.1.1Clang 16.0.390180270360450SE +/- 2.65, N = 3SE +/- 3.39, N = 3394379-fPIC -pie -shared -lrt -lstdc++-lcrypt -lssl -lcrypto1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lpthread -lz -ldl -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2Clang 16.0.3GCC 13.1.1816243240SE +/- 0.29, N = 3SE +/- 0.05, N = 332.6633.811. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumClang 16.0.3GCC 13.1.1120240360480600SE +/- 2.38, N = 3SE +/- 2.85, N = 3570.62552.031. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512Clang 16.0.3GCC 13.1.120000M40000M60000M80000M100000MSE +/- 4419075.00, N = 3SE +/- 845012614.25, N = 58270162569080035854752-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandomGCC 13.1.1Clang 16.0.30.28350.5670.85051.1341.4175SE +/- 0.00, N = 3SE +/- 0.00, N = 31.261.221. (CXX) g++ options: -O3 -march=native -flto

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 2048GCC 13.1.1Clang 16.0.380160240320400SE +/- 0.65, N = 3SE +/- 2.37, N = 3385373-fPIC -pie -shared -lrt -lstdc++-lcrypt -lssl -lcrypto1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lpthread -lz -ldl -lm

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 4KGCC 13.1.1Clang 16.0.34080120160200SE +/- 1.61, N = 3SE +/- 1.96, N = 12176.03170.791. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Church FacadeClang 16.0.3GCC 13.1.114002800420056007000SE +/- 4.84, N = 3SE +/- 13.35, N = 3655967501. (CXX) g++ options: -O3 -march=native -flto

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read OnlyClang 16.0.3GCC 13.1.1700K1400K2100K2800K3500KSE +/- 34767.93, N = 12SE +/- 57281.48, N = 12345717333621931. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 1.51Text-To-Speech SynthesisGCC 13.1.1Clang 16.0.3714212835SE +/- 0.02, N = 4SE +/- 0.15, N = 427.4928.271. (CXX) g++ options: -O3 -march=native -flto -lpthread

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: LionClang 16.0.3GCC 13.1.111002200330044005500SE +/- 44.34, N = 3SE +/- 29.18, N = 3510652431. (CXX) g++ options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 51.33, N = 3SE +/- 77.84, N = 317800.817350.51. (CC) gcc options: -O3 -march=native -flto

SMHasher

Hash: Spooky32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: Spooky32Clang 16.0.3GCC 13.1.15K10K15K20K25KSE +/- 2.94, N = 3SE +/- 0.25, N = 321089.0120584.57-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.34, N = 3SE +/- 0.16, N = 370.1068.54-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.1.1Clang 16.0.370140210280350SE +/- 1.73, N = 3SE +/- 3.21, N = 3330.05322.901. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.1.1Clang 16.0.32K4K6K8K10KSE +/- 43.34, N = 3SE +/- 80.21, N = 38251.348072.491. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

SQLite

Threads / Copies: 2

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 2Clang 16.0.3GCC 13.1.10.56161.12321.68482.24642.808SE +/- 0.032, N = 3SE +/- 0.027, N = 32.4422.4961. (CC) gcc options: -O3 -march=native -flto -lz -lm

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 4KClang 16.0.3GCC 13.1.11326395265SE +/- 0.58, N = 5SE +/- 0.42, N = 356.1454.951. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIGCC 13.1.1Clang 16.0.311K22K33K44K55KSE +/- 3.22, N = 3SE +/- 6.98, N = 353264.2352151.97-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.12000M4000M6000M8000M10000MSE +/- 913479.31, N = 3SE +/- 13312441.97, N = 3866523333384906666671. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 4KClang 16.0.3GCC 13.1.1306090120150SE +/- 1.04, N = 12SE +/- 1.27, N = 7146.22143.431. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastClang 16.0.3GCC 13.1.11632486480SE +/- 0.57, N = 3SE +/- 0.74, N = 469.8568.57-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression SpeedGCC 13.1.1Clang 16.0.330060090012001500SE +/- 9.49, N = 3SE +/- 1.96, N = 31413.01387.5-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression SpeedGCC 13.1.1Clang 16.0.3400800120016002000SE +/- 3.39, N = 3SE +/- 6.60, N = 31662.61632.7-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100Clang 16.0.3GCC 13.1.13691215SE +/- 0.02, N = 3SE +/- 0.00, N = 311.7511.54-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Ultra FastClang 16.0.3GCC 13.1.11122334455SE +/- 0.56, N = 3SE +/- 0.37, N = 347.5646.721. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Very FastClang 16.0.3GCC 13.1.11122334455SE +/- 0.42, N = 3SE +/- 0.50, N = 347.4046.631. (CXX) g++ options: -O3 -march=native -flto

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256Clang 16.0.3GCC 13.1.160000M120000M180000M240000M300000MSE +/- 64283484.40, N = 3SE +/- 48171917.42, N = 3264445169827260892560200-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

SMHasher

Hash: wyhash

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: wyhashGCC 13.1.1Clang 16.0.36K12K18K24K30KSE +/- 67.62, N = 3SE +/- 22.88, N = 330309.1229923.05-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisClang 16.0.3GCC 13.1.1306090120150SE +/- 0.66, N = 3SE +/- 0.12, N = 3112.11113.34-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6Clang 16.0.3GCC 13.1.10.56751.1351.70252.272.8375SE +/- 0.020, N = 3SE +/- 0.025, N = 52.4962.5221. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 141.14, N = 3SE +/- 103.96, N = 316409.916246.71. (CC) gcc options: -O3 -march=native -flto

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsClang 16.0.3GCC 13.1.11224364860SE +/- 0.04, N = 3SE +/- 0.07, N = 354.3153.801. (CXX) g++ options: -O3 -march=native -flto -lm -ldl

Zstd Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression SpeedGCC 13.1.1Clang 16.0.370140210280350SE +/- 4.38, N = 3SE +/- 3.40, N = 3312.6309.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 13.1.1Clang 16.0.350100150200250SE +/- 0.85, N = 3SE +/- 0.20, N = 3219.39217.471. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 1GCC 13.1.1Clang 16.0.30.46010.92021.38031.84042.3005SE +/- 0.002, N = 3SE +/- 0.015, N = 32.0282.0451. (CC) gcc options: -O3 -march=native -flto -lz -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57Clang 16.0.3GCC 13.1.11000M2000M3000M4000M5000MSE +/- 1808314.13, N = 3SE +/- 8346922.25, N = 3468020000046419333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinClang 16.0.3GCC 13.1.11122334455SE +/- 0.45, N = 15SE +/- 0.43, N = 1549.6949.371. (CXX) g++ options: -O3 -march=native -flto -lm -ldl

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughClang 16.0.3GCC 13.1.1306090120150SE +/- 0.09, N = 3SE +/- 0.12, N = 3117.75117.031. (CXX) g++ options: -O3 -march=native -flto -pthread

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression SpeedGCC 13.1.1Clang 16.0.3246810SE +/- 0.04, N = 3SE +/- 0.01, N = 38.318.26-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveGCC 13.1.1Clang 16.0.33691215SE +/- 0.01, N = 3SE +/- 0.05, N = 312.8412.771. (CXX) g++ options: -O3 -march=native -flto -pthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.43, N = 3SE +/- 0.62, N = 370.5170.16-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon NanotubeClang 16.0.3GCC 13.1.1306090120150SE +/- 0.24, N = 3SE +/- 0.24, N = 3140.28140.861. (CC) gcc options: -shared -O3 -march=native -flto -lxc -lblas -lmpi

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMGCC 13.1.1Clang 16.0.3300000M600000M900000M1200000M1500000MSE +/- 293680009.95, N = 3SE +/- 2176141824.81, N = 315440539847201538350610540-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096Clang 16.0.3GCC 13.1.1600K1200K1800K2400K3000KSE +/- 437.47, N = 3SE +/- 105.74, N = 32931038.92921019.6-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMGCC 13.1.1Clang 16.0.3400000M800000M1200000M1600000M2000000MSE +/- 1317927821.70, N = 3SE +/- 616604050.97, N = 318013529655971796351136523-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Super FastGCC 13.1.1Clang 16.0.31122334455SE +/- 0.12, N = 3SE +/- 0.11, N = 346.9746.841. (CXX) g++ options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedClang 16.0.3GCC 13.1.13K6K9K12K15KSE +/- 139.51, N = 3SE +/- 222.63, N = 315501.5515466.871. (CC) gcc options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedClang 16.0.3GCC 13.1.14K8K12K16K20KSE +/- 50.74, N = 3SE +/- 80.69, N = 316340.016305.11. (CC) gcc options: -O3 -march=native -flto

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression SpeedClang 16.0.3GCC 13.1.130060090012001500SE +/- 4.51, N = 3SE +/- 4.13, N = 31343.51341.4-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: RotateGCC 13.1.1Clang 16.0.3140280420560700SE +/- 3.28, N = 3SE +/- 0.33, N = 36606591. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64Clang 16.0.3GCC 13.1.120K40K60K80K100KSE +/- 33.80, N = 3SE +/- 10.93, N = 3102085.18101972.62-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096GCC 13.1.1Clang 16.0.315K30K45K60K75KSE +/- 24.90, N = 3SE +/- 10.37, N = 371935.971901.7-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Faiss

Test: demo_sift1M

OpenBenchmarking.orgSeconds, Fewer Is BetterFaiss 1.7.4Test: demo_sift1MGCC 13.1.11326395265SE +/- 0.19, N = 259.401. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100GCC 13.1.1900K1800K2700K3600K4500KSE +/- 15774.50, N = 34105286.361. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10GCC 13.1.1700K1400K2100K2800K3500KSE +/- 12793.89, N = 33302722.111. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:5GCC 13.1.1400K800K1200K1600K2000KSE +/- 6222.06, N = 32033827.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingGCC 13.1.1200K400K600K800K1000KSE +/- 4772.36, N = 311552771. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingGCC 13.1.1150K300K450K600K750KSE +/- 7447.92, N = 36912461. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5GCC 13.1.16M12M18M24M30MSE +/- 29475.04, N = 3288196671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512GCC 13.1.160M120M180M240M300MSE +/- 2757367.12, N = 32866256671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishGCC 13.1.170K140K210K280K350KSE +/- 401.92, N = 33232561. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGCC 13.1.1300K600K900K1200K1500KSE +/- 4630.81, N = 312613331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptGCC 13.1.170K140K210K280K350KSE +/- 288.40, N = 33227951. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression SpeedClang 16.0.3GCC 13.1.148121620SE +/- 0.00, N = 3SE +/- 0.03, N = 316.816.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Z3 Theorem Prover

SMT File: 2.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 2.smt2GCC 13.1.11632486480SE +/- 0.06, N = 372.231. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC

Z3 Theorem Prover

SMT File: 1.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 1.smt2GCC 13.1.1612182430SE +/- 0.19, N = 325.901. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4GCC 13.1.1Clang 16.0.370M140M210M280M350MSE +/- 6307414.85, N = 15SE +/- 4076394.65, N = 15324078880217323727-fopenmp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -flto

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average LatencyClang 16.0.3GCC 13.1.10.05380.10760.16140.21520.269SE +/- 0.002, N = 12SE +/- 0.004, N = 120.2320.2391. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClang 16.0.3GCC 13.1.15001000150020002500SE +/- 8.52, N = 15SE +/- 42.69, N = 15468.272191.22-fopenmp=libomp - MIN: 441.14-fopenmp - MIN: 1875.061. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 13.1.1Clang 16.0.30.72071.44142.16212.88283.6035SE +/- 0.058, N = 15SE +/- 0.042, N = 33.0633.2031. (CC) gcc options: -lm -lpthread -O3 -march=native -flto

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimeGCC 13.1.1Clang 16.0.3110M220M330M440M550MSE +/- 3102500.18, N = 3SE +/- 8586129.93, N = 15532213592517643140-lgcov -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -mbmi2 -fno-peel-loops -fno-tracer -flto=jobserver-fprofile-use1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -flto -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingClang 16.0.3GCC 13.1.1306090120150SE +/- 2.70, N = 15SE +/- 6.54, N = 121281201. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIClang 16.0.3GCC 13.1.11020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 344.9745.22-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64Clang 16.0.3GCC 13.1.1510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 320.3220.68-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceGCC 13.1.1Clang 16.0.3612182430SE +/- 0.00, N = 3SE +/- 0.01, N = 321.1025.67-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: fasthash32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: fasthash32Clang 16.0.3GCC 13.1.1510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 521.8722.17-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: Spooky32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: Spooky32GCC 13.1.1Clang 16.0.3612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 326.6626.71-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: SHA3-256Clang 16.0.3GCC 13.1.15001000150020002500SE +/- 8.90, N = 3SE +/- 0.15, N = 31041.181727.90-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: SHA3-256Clang 16.0.3GCC 13.1.180160240320400SE +/- 1.16, N = 3SE +/- 0.05, N = 3363.90228.02-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: wyhash

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: wyhashClang 16.0.3GCC 13.1.148121620SE +/- 0.00, N = 3SE +/- 0.00, N = 314.0714.97-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native -flto

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.1.1Clang 16.0.360120180240300SE +/- 0.29, N = 3SE +/- 7.42, N = 15276.09171.421. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.1.1Clang 16.0.315003000450060007500SE +/- 7.29, N = 3SE +/- 185.48, N = 156902.234285.591. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm


Phoronix Test Suite v10.8.5