AMD EPYC Genoa Compilers

Genoa compiler benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2305211-NE-EPYCGENOA71&rdt&grs.

AMD EPYC Genoa CompilersProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 13.1.1Clang 16.0.32 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)AMD Titanite_4G (RTI1004D BIOS)AMD Device 14a41520GB2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007ASPEEDVGA HDMIBroadcom NetXtreme BCM5720 PCIeFedora Linux 386.2.15-300.fc38.x86_64 (x86_64)GCC 13.1.1 20230511xfs640x480Clang 16.0.3 + LLVM 16.0.3OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 13.1.1: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Disk Details- none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096 Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111 Python Details- Python 3.11.3Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC Genoa Compilersonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUopenssl: ChaCha20-Poly1305openssl: ChaCha20graphics-magick: Swirlgraphics-magick: Sharpenwebp: Quality 100, Highest Compressionsimdjson: DistinctUserIDsimdjson: PartialTweetssmhasher: fasthash32graphics-magick: HWB Color Spaceuvg266: Bosphorus 1080p - Very Fastx265: Bosphorus 4Kgraphics-magick: Noise-Gaussiangraphics-magick: Enhancedstargate: 44100 - 512webp: Quality 100, Lossless, Highest Compressionstargate: 192000 - 512cryptopp: Keyed Algorithmsstargate: 44100 - 1024stargate: 192000 - 1024compress-lz4: 3 - Compression Speedwebp: Quality 100, Losslessopenjpeg: NASA Curiosity Panorama M34compress-lz4: 9 - Compression Speedavifenc: 6, Losslesscryptopp: Unkeyed Algorithmscoremark: CoreMark Size 666 - Iterations Per Seconduvg266: Bosphorus 1080p - Mediumastcenc: Fastpovray: Trace Timesecuremark: SecureMark-TLSuvg266: Bosphorus 4K - Mediumsmhasher: t1ha2_atonceaobench: 2048 x 2048 - Total Timekvazaar: Bosphorus 4K - Slowuvg266: Bosphorus 1080p - Super Fastkvazaar: Bosphorus 4K - Mediumsimdjson: Kostyaliquid-dsp: 384 - 256 - 57simdjson: TopTweetuvg266: Bosphorus 1080p - Ultra Fasttscp: AI Chess Performanceavifenc: 10, Losslesssvt-av1: Preset 4 - Bosphorus 4Kquantlib: avifenc: 0liquid-dsp: 256 - 256 - 57webp: Defaultmysqlslap: 4096avifenc: 2astcenc: Mediumopenssl: SHA512simdjson: LargeRandmysqlslap: 2048svt-av1: Preset 12 - Bosphorus 4Kdraco: Church Facadepgbench: 100 - 800 - Read Onlyespeak: Text-To-Speech Synthesisdraco: Lioncompress-lz4: 1 - Decompression Speedsmhasher: Spooky32kvazaar: Bosphorus 4K - Super Fastminibude: OpenMP - BM2minibude: OpenMP - BM2sqlite: 2svt-av1: Preset 8 - Bosphorus 4Ksmhasher: MeowHash x86_64 AES-NIliquid-dsp: 128 - 256 - 57svt-av1: Preset 13 - Bosphorus 4Kkvazaar: Bosphorus 4K - Very Fastcompress-zstd: 19 - Decompression Speedcompress-zstd: 12 - Decompression Speedwebp: Quality 100uvg266: Bosphorus 4K - Ultra Fastuvg266: Bosphorus 4K - Very Fastopenssl: SHA256smhasher: wyhashmrbayes: Primate Phylogeny Analysisavifenc: 6compress-lz4: 3 - Decompression Speedlammps: 20k Atomscompress-zstd: 12 - Compression Speedtjbench: Decompression Throughputsqlite: 1liquid-dsp: 64 - 256 - 57lammps: Rhodopsin Proteinastcenc: Thoroughcompress-zstd: 19, Long Mode - Compression Speedastcenc: Exhaustivekvazaar: Bosphorus 4K - Ultra Fastgpaw: Carbon Nanotubeopenssl: AES-256-GCMopenssl: RSA4096openssl: AES-128-GCMuvg266: Bosphorus 4K - Super Fastcompress-lz4: 1 - Compression Speedcompress-lz4: 9 - Decompression Speedcompress-zstd: 19, Long Mode - Decompression Speedgraphics-magick: Rotatesmhasher: t1ha0_aes_avx2 x86_64openssl: RSA4096faiss: demo_sift1Mmemcached: 1:100memcached: 1:10memcached: 1:5compress-7zip: Decompression Ratingcompress-7zip: Compression Ratingjohn-the-ripper: MD5john-the-ripper: HMAC-SHA512john-the-ripper: Blowfishjohn-the-ripper: WPA PSKjohn-the-ripper: bcryptcompress-zstd: 19 - Compression Speedz3: 2.smt2z3: 1.smt2kripke: pgbench: 100 - 800 - Read Only - Average Latencyonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUc-ray: Total Time - 4K, 16 Rays Per Pixelstockfish: Total Timegraphics-magick: Resizingsmhasher: MeowHash x86_64 AES-NIsmhasher: t1ha0_aes_avx2 x86_64smhasher: t1ha2_atoncesmhasher: fasthash32smhasher: Spooky32smhasher: SHA3-256smhasher: SHA3-256smhasher: wyhashminibude: OpenMP - BM1minibude: OpenMP - BM1GCC 13.1.1Clang 16.0.32406.707088735249271014036223867295813113.216.786.568812.441089112.4321.5998122165.8371830.552.516856615.4506886.2705592.85518646.241.3517982445.844.648435.8100997872619.72533739.08718.80686.12328875327.4320995.4628.94034.77136.1935.944.15116476666676.88147.3716619883.7085.5273238.861.4951128533333318.9339433.807552.0326800358547521.26385176.0316750336219327.490524317800.820581.6070.10330.0548251.3402.49654.95253264.238490666667143.42968.571413.01662.611.5446.7246.6326089256020030309.12113.3362.52216409.953.795312.6219.3887382.028464193333349.370117.02758.3112.839470.51140.85915440539847202921019.6180135296559746.9715466.8716305.11341.4660101972.6271935.959.4024105286.363302722.112033827.48115527769124628819667286625667323256126133332279516.872.23025.8953240788800.2392191.223.06353221359212045.24120.69921.09722.54426.6591727.902228.0214.968276.0896902.231885.030440446591887645125858407220717234.168.528.227052.24875133.0525.3883519016.6911750.632.880358703.9956107.1613003.25737252.311.5216208650.854.220396.4612547173897.06818342.49780.95105.65431257029.6522604.2631.15737.37146.2038.374.43124183333337.33156.5017585673.5275.7963391.158.8851176233333319.7237932.656570.6249827016256901.22373170.7876559345717328.266510617350.521089.0168.54322.8998072.4882.44256.13552151.978665233333146.21669.851387.51632.711.7547.5647.4026444516982729923.05112.1072.49616246.754.310309.8217.4742652.045468020000049.689117.75368.2612.765670.16140.28215383506105402931038.9179635113652346.8415501.5516340.01343.5659102085.1871901.716.82173237270.232468.2693.20351764314012844.96520.32225.66721.87326.7121041.175363.9014.065171.4244285.585OpenBenchmarking.org

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUGCC 13.1.1Clang 16.0.35001000150020002500SE +/- 26.76, N = 15SE +/- 2.20, N = 32406.70885.03-fopenmp - MIN: 2191.9-fopenmp=libomp - MIN: 871.441. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305GCC 13.1.1Clang 16.0.3150000M300000M450000M600000M750000MSE +/- 116220547.85, N = 3SE +/- 48134544.35, N = 3708873524927440446591887-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20GCC 13.1.1Clang 16.0.3200000M400000M600000M800000M1000000MSE +/- 152062084.95, N = 3SE +/- 51499603.07, N = 31014036223867645125858407-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SwirlGCC 13.1.1Clang 16.0.36001200180024003000SE +/- 15.88, N = 3SE +/- 3.84, N = 3295822071. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: SharpenGCC 13.1.1Clang 16.0.3400800120016002000SE +/- 1.00, N = 3SE +/- 12.49, N = 3131117231. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionGCC 13.1.1Clang 16.0.30.9361.8722.8083.7444.68SE +/- 0.00, N = 3SE +/- 0.00, N = 33.214.16-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserIDGCC 13.1.1Clang 16.0.3246810SE +/- 0.02, N = 3SE +/- 0.07, N = 156.788.521. (CXX) g++ options: -O3 -march=native -flto

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsGCC 13.1.1Clang 16.0.3246810SE +/- 0.02, N = 3SE +/- 0.12, N = 126.568.221. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: fasthash32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: fasthash32GCC 13.1.1Clang 16.0.32K4K6K8K10KSE +/- 90.46, N = 5SE +/- 0.01, N = 38721.067052.24-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: HWB Color SpaceGCC 13.1.1Clang 16.0.32004006008001000SE +/- 15.72, N = 3SE +/- 5.51, N = 310898751. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

uvg266

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Very FastGCC 13.1.1Clang 16.0.3306090120150SE +/- 0.05, N = 3SE +/- 0.64, N = 3112.43133.051. (CXX) g++ options: -O3 -march=native -flto

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KGCC 13.1.1Clang 16.0.3612182430SE +/- 0.22, N = 6SE +/- 0.34, N = 321.5925.381. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: Noise-GaussianGCC 13.1.1Clang 16.0.32004006008001000SE +/- 9.66, N = 6SE +/- 11.26, N = 39818351. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: EnhancedGCC 13.1.1Clang 16.0.35001000150020002500SE +/- 4.26, N = 3SE +/- 2.33, N = 3221619011. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 44100 - Buffer Size: 512GCC 13.1.1Clang 16.0.3246810SE +/- 0.001560, N = 3SE +/- 0.001401, N = 35.8371836.6911751. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionGCC 13.1.1Clang 16.0.30.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 30.550.63-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 512GCC 13.1.1Clang 16.0.30.64811.29621.94432.59243.2405SE +/- 0.007283, N = 3SE +/- 0.007221, N = 32.5168562.8803581. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsGCC 13.1.1Clang 16.0.3150300450600750SE +/- 0.39, N = 3SE +/- 0.08, N = 3615.45704.001. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 44100 - Buffer Size: 1024GCC 13.1.1Clang 16.0.3246810SE +/- 0.000734, N = 3SE +/- 0.005456, N = 36.2705597.1613001. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 1024GCC 13.1.1Clang 16.0.30.73291.46582.19872.93163.6645SE +/- 0.006270, N = 3SE +/- 0.007486, N = 32.8551863.2573721. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedGCC 13.1.1Clang 16.0.31224364860SE +/- 0.04, N = 3SE +/- 0.54, N = 346.2452.311. (CC) gcc options: -O3 -march=native -flto

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessGCC 13.1.1Clang 16.0.30.3420.6841.0261.3681.71SE +/- 0.00, N = 3SE +/- 0.00, N = 31.351.52-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34GCC 13.1.1Clang 16.0.340K80K120K160K200KSE +/- 812.57, N = 3SE +/- 571.61, N = 31798241620861. (CXX) g++ options: -O3 -march=native -flto -rdynamic

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedGCC 13.1.1Clang 16.0.31122334455SE +/- 0.49, N = 3SE +/- 0.53, N = 345.8450.851. (CC) gcc options: -O3 -march=native -flto

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, LosslessGCC 13.1.1Clang 16.0.31.04582.09163.13744.18325.229SE +/- 0.059, N = 3SE +/- 0.035, N = 34.6484.2201. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsGCC 13.1.1Clang 16.0.390180270360450SE +/- 0.35, N = 3SE +/- 0.05, N = 3435.81396.461. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 13.1.1Clang 16.0.32M4M6M8M10MSE +/- 3457.10, N = 3SE +/- 7211.60, N = 37872619.737173897.071. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt

uvg266

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: MediumGCC 13.1.1Clang 16.0.31020304050SE +/- 0.09, N = 3SE +/- 0.29, N = 339.0842.491. (CXX) g++ options: -O3 -march=native -flto

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: FastGCC 13.1.1Clang 16.0.32004006008001000SE +/- 3.96, N = 3SE +/- 5.60, N = 3718.81780.951. (CXX) g++ options: -O3 -march=native -flto -pthread

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeGCC 13.1.1Clang 16.0.3246810SE +/- 0.053, N = 8SE +/- 0.053, N = 76.1235.6541. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSGCC 13.1.1Clang 16.0.370K140K210K280K350KSE +/- 872.41, N = 3SE +/- 1404.09, N = 32887533125701. (CC) gcc options: -pedantic -O3

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: MediumGCC 13.1.1Clang 16.0.3714212835SE +/- 0.09, N = 3SE +/- 0.07, N = 327.4329.651. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceGCC 13.1.1Clang 16.0.35K10K15K20K25KSE +/- 2.58, N = 3SE +/- 0.45, N = 321054.8722604.26-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 13.1.1Clang 16.0.3714212835SE +/- 0.00, N = 3SE +/- 0.00, N = 328.9431.161. (CC) gcc options: -lm -O3 -march=native -flto

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowGCC 13.1.1Clang 16.0.3918273645SE +/- 0.04, N = 3SE +/- 0.14, N = 334.7737.37-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

uvg266

Video Input: Bosphorus 1080p - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Super FastGCC 13.1.1Clang 16.0.3306090120150SE +/- 0.33, N = 3SE +/- 0.55, N = 3136.19146.201. (CXX) g++ options: -O3 -march=native -flto

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumGCC 13.1.1Clang 16.0.3918273645SE +/- 0.06, N = 3SE +/- 0.17, N = 335.9438.37-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: KostyaGCC 13.1.1Clang 16.0.30.99681.99362.99043.98724.984SE +/- 0.00, N = 3SE +/- 0.00, N = 34.154.431. (CXX) g++ options: -O3 -march=native -flto

Liquid-DSP

Threads: 384 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 384 - Buffer Length: 256 - Filter Length: 57GCC 13.1.1Clang 16.0.33000M6000M9000M12000M15000MSE +/- 7264831.57, N = 3SE +/- 9527737.29, N = 311647666667124183333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweetGCC 13.1.1Clang 16.0.3246810SE +/- 0.01, N = 3SE +/- 0.06, N = 156.887.331. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Ultra FastGCC 13.1.1Clang 16.0.3306090120150SE +/- 1.08, N = 11SE +/- 2.25, N = 3147.37156.501. (CXX) g++ options: -O3 -march=native -flto

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 13.1.1Clang 16.0.3400K800K1200K1600K2000KSE +/- 2555.19, N = 5SE +/- 2743.38, N = 5166198817585671. (CC) gcc options: -O3 -march=native -flto

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 10, LosslessGCC 13.1.1Clang 16.0.30.83431.66862.50293.33724.1715SE +/- 0.023, N = 3SE +/- 0.020, N = 33.7083.5271. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 4KGCC 13.1.1Clang 16.0.31.30412.60823.91235.21646.5205SE +/- 0.067, N = 3SE +/- 0.025, N = 35.5275.7961. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.30GCC 13.1.1Clang 16.0.37001400210028003500SE +/- 2.59, N = 3SE +/- 7.81, N = 33238.83391.11. (CXX) g++ options: -O3 -march=native -fPIE -pie

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 0GCC 13.1.1Clang 16.0.31428425670SE +/- 0.05, N = 3SE +/- 0.13, N = 361.5058.891. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57GCC 13.1.1Clang 16.0.33000M6000M9000M12000M15000MSE +/- 4096068.58, N = 3SE +/- 2603416.56, N = 311285333333117623333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: DefaultGCC 13.1.1Clang 16.0.3510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 318.9319.72-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 4096GCC 13.1.1Clang 16.0.390180270360450SE +/- 2.65, N = 3SE +/- 3.39, N = 3394379-fPIC -pie -shared -lrt -lstdc++-lcrypt -lssl -lcrypto1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lpthread -lz -ldl -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 2GCC 13.1.1Clang 16.0.3816243240SE +/- 0.05, N = 3SE +/- 0.29, N = 333.8132.661. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumGCC 13.1.1Clang 16.0.3120240360480600SE +/- 2.85, N = 3SE +/- 2.38, N = 3552.03570.621. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512GCC 13.1.1Clang 16.0.320000M40000M60000M80000M100000MSE +/- 845012614.25, N = 5SE +/- 4419075.00, N = 38003585475282701625690-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandomGCC 13.1.1Clang 16.0.30.28350.5670.85051.1341.4175SE +/- 0.00, N = 3SE +/- 0.00, N = 31.261.221. (CXX) g++ options: -O3 -march=native -flto

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 2048GCC 13.1.1Clang 16.0.380160240320400SE +/- 0.65, N = 3SE +/- 2.37, N = 3385373-fPIC -pie -shared -lrt -lstdc++-lcrypt -lssl -lcrypto1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lpthread -lz -ldl -lm

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 4KGCC 13.1.1Clang 16.0.34080120160200SE +/- 1.61, N = 3SE +/- 1.96, N = 12176.03170.791. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Church FacadeGCC 13.1.1Clang 16.0.314002800420056007000SE +/- 13.35, N = 3SE +/- 4.84, N = 3675065591. (CXX) g++ options: -O3 -march=native -flto

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read OnlyGCC 13.1.1Clang 16.0.3700K1400K2100K2800K3500KSE +/- 57281.48, N = 12SE +/- 34767.93, N = 12336219334571731. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 1.51Text-To-Speech SynthesisGCC 13.1.1Clang 16.0.3714212835SE +/- 0.02, N = 4SE +/- 0.15, N = 427.4928.271. (CXX) g++ options: -O3 -march=native -flto -lpthread

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: LionGCC 13.1.1Clang 16.0.311002200330044005500SE +/- 29.18, N = 3SE +/- 44.34, N = 3524351061. (CXX) g++ options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 51.33, N = 3SE +/- 77.84, N = 317800.817350.51. (CC) gcc options: -O3 -march=native -flto

SMHasher

Hash: Spooky32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: Spooky32GCC 13.1.1Clang 16.0.35K10K15K20K25KSE +/- 0.25, N = 3SE +/- 2.94, N = 320584.5721089.01-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.34, N = 3SE +/- 0.16, N = 370.1068.54-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.1.1Clang 16.0.370140210280350SE +/- 1.73, N = 3SE +/- 3.21, N = 3330.05322.901. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2GCC 13.1.1Clang 16.0.32K4K6K8K10KSE +/- 43.34, N = 3SE +/- 80.21, N = 38251.348072.491. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

SQLite

Threads / Copies: 2

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 2GCC 13.1.1Clang 16.0.30.56161.12321.68482.24642.808SE +/- 0.027, N = 3SE +/- 0.032, N = 32.4962.4421. (CC) gcc options: -O3 -march=native -flto -lz -lm

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 4KGCC 13.1.1Clang 16.0.31326395265SE +/- 0.42, N = 3SE +/- 0.58, N = 554.9556.141. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIGCC 13.1.1Clang 16.0.311K22K33K44K55KSE +/- 31.79, N = 3SE +/- 6.98, N = 353237.3552151.97-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57GCC 13.1.1Clang 16.0.32000M4000M6000M8000M10000MSE +/- 13312441.97, N = 3SE +/- 913479.31, N = 3849066666786652333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 4KGCC 13.1.1Clang 16.0.3306090120150SE +/- 1.27, N = 7SE +/- 1.04, N = 12143.43146.221. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.74, N = 4SE +/- 0.57, N = 368.5769.85-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression SpeedGCC 13.1.1Clang 16.0.330060090012001500SE +/- 9.49, N = 3SE +/- 1.96, N = 31413.01387.5-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Zstd Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression SpeedGCC 13.1.1Clang 16.0.3400800120016002000SE +/- 3.39, N = 3SE +/- 6.60, N = 31662.61632.7-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100GCC 13.1.1Clang 16.0.33691215SE +/- 0.00, N = 3SE +/- 0.02, N = 311.5411.75-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Ultra FastGCC 13.1.1Clang 16.0.31122334455SE +/- 0.37, N = 3SE +/- 0.56, N = 346.7247.561. (CXX) g++ options: -O3 -march=native -flto

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Very FastGCC 13.1.1Clang 16.0.31122334455SE +/- 0.50, N = 3SE +/- 0.42, N = 346.6347.401. (CXX) g++ options: -O3 -march=native -flto

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256GCC 13.1.1Clang 16.0.360000M120000M180000M240000M300000MSE +/- 48171917.42, N = 3SE +/- 64283484.40, N = 3260892560200264445169827-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

SMHasher

Hash: wyhash

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: wyhashGCC 13.1.1Clang 16.0.36K12K18K24K30KSE +/- 169.17, N = 3SE +/- 22.88, N = 330125.1629923.05-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 13.1.1Clang 16.0.3306090120150SE +/- 0.12, N = 3SE +/- 0.66, N = 3113.34112.11-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6GCC 13.1.1Clang 16.0.30.56751.1351.70252.272.8375SE +/- 0.025, N = 5SE +/- 0.020, N = 32.5222.4961. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 141.14, N = 3SE +/- 103.96, N = 316409.916246.71. (CC) gcc options: -O3 -march=native -flto

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsGCC 13.1.1Clang 16.0.31224364860SE +/- 0.07, N = 3SE +/- 0.04, N = 353.8054.311. (CXX) g++ options: -O3 -march=native -flto -lm -ldl

Zstd Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression SpeedGCC 13.1.1Clang 16.0.370140210280350SE +/- 4.38, N = 3SE +/- 3.40, N = 3312.6309.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 13.1.1Clang 16.0.350100150200250SE +/- 0.85, N = 3SE +/- 0.20, N = 3219.39217.471. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 1GCC 13.1.1Clang 16.0.30.46010.92021.38031.84042.3005SE +/- 0.002, N = 3SE +/- 0.015, N = 32.0282.0451. (CC) gcc options: -O3 -march=native -flto -lz -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57GCC 13.1.1Clang 16.0.31000M2000M3000M4000M5000MSE +/- 8346922.25, N = 3SE +/- 1808314.13, N = 3464193333346802000001. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinGCC 13.1.1Clang 16.0.31122334455SE +/- 0.43, N = 15SE +/- 0.45, N = 1549.3749.691. (CXX) g++ options: -O3 -march=native -flto -lm -ldl

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughGCC 13.1.1Clang 16.0.3306090120150SE +/- 0.12, N = 3SE +/- 0.09, N = 3117.03117.751. (CXX) g++ options: -O3 -march=native -flto -pthread

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression SpeedGCC 13.1.1Clang 16.0.3246810SE +/- 0.04, N = 3SE +/- 0.01, N = 38.318.26-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveGCC 13.1.1Clang 16.0.33691215SE +/- 0.01, N = 3SE +/- 0.05, N = 312.8412.771. (CXX) g++ options: -O3 -march=native -flto -pthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastGCC 13.1.1Clang 16.0.31632486480SE +/- 0.43, N = 3SE +/- 0.62, N = 370.5170.16-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon NanotubeGCC 13.1.1Clang 16.0.3306090120150SE +/- 0.24, N = 3SE +/- 0.24, N = 3140.86140.281. (CC) gcc options: -shared -O3 -march=native -flto -lxc -lblas -lmpi

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMGCC 13.1.1Clang 16.0.3300000M600000M900000M1200000M1500000MSE +/- 293680009.95, N = 3SE +/- 2176141824.81, N = 315440539847201538350610540-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096GCC 13.1.1Clang 16.0.3600K1200K1800K2400K3000KSE +/- 105.74, N = 3SE +/- 437.47, N = 32921019.62931038.9-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMGCC 13.1.1Clang 16.0.3400000M800000M1200000M1600000M2000000MSE +/- 1317927821.70, N = 3SE +/- 616604050.97, N = 318013529655971796351136523-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Super FastGCC 13.1.1Clang 16.0.31122334455SE +/- 0.12, N = 3SE +/- 0.11, N = 346.9746.841. (CXX) g++ options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedGCC 13.1.1Clang 16.0.33K6K9K12K15KSE +/- 222.63, N = 3SE +/- 139.51, N = 315466.8715501.551. (CC) gcc options: -O3 -march=native -flto

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedGCC 13.1.1Clang 16.0.34K8K12K16K20KSE +/- 80.69, N = 3SE +/- 50.74, N = 316305.116340.01. (CC) gcc options: -O3 -march=native -flto

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression SpeedGCC 13.1.1Clang 16.0.330060090012001500SE +/- 4.13, N = 3SE +/- 4.51, N = 31341.41343.5-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: RotateGCC 13.1.1Clang 16.0.3140280420560700SE +/- 3.28, N = 3SE +/- 0.33, N = 36606591. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64GCC 13.1.1Clang 16.0.320K40K60K80K100KSE +/- 7.16, N = 3SE +/- 33.80, N = 3101956.31102085.18-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096GCC 13.1.1Clang 16.0.315K30K45K60K75KSE +/- 24.90, N = 3SE +/- 10.37, N = 371935.971901.7-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Faiss

Test: demo_sift1M

OpenBenchmarking.orgSeconds, Fewer Is BetterFaiss 1.7.4Test: demo_sift1MGCC 13.1.11326395265SE +/- 0.19, N = 259.401. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100GCC 13.1.1900K1800K2700K3600K4500KSE +/- 15774.50, N = 34105286.361. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10GCC 13.1.1700K1400K2100K2800K3500KSE +/- 12793.89, N = 33302722.111. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:5GCC 13.1.1400K800K1200K1600K2000KSE +/- 6222.06, N = 32033827.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingGCC 13.1.1200K400K600K800K1000KSE +/- 4772.36, N = 311552771. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingGCC 13.1.1150K300K450K600K750KSE +/- 7447.92, N = 36912461. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5GCC 13.1.16M12M18M24M30MSE +/- 29475.04, N = 3288196671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512GCC 13.1.160M120M180M240M300MSE +/- 2757367.12, N = 32866256671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishGCC 13.1.170K140K210K280K350KSE +/- 401.92, N = 33232561. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGCC 13.1.1300K600K900K1200K1500KSE +/- 4630.81, N = 312613331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptGCC 13.1.170K140K210K280K350KSE +/- 288.40, N = 33227951. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression SpeedGCC 13.1.1Clang 16.0.348121620SE +/- 0.03, N = 3SE +/- 0.00, N = 316.816.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -flto -pthread -lz

Z3 Theorem Prover

SMT File: 2.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 2.smt2GCC 13.1.11632486480SE +/- 0.06, N = 372.231. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC

Z3 Theorem Prover

SMT File: 1.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 1.smt2GCC 13.1.1612182430SE +/- 0.19, N = 325.901. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4GCC 13.1.1Clang 16.0.370M140M210M280M350MSE +/- 6307414.85, N = 15SE +/- 4076394.65, N = 15324078880217323727-fopenmp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -flto

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average LatencyGCC 13.1.1Clang 16.0.30.05380.10760.16140.21520.269SE +/- 0.004, N = 12SE +/- 0.002, N = 120.2390.2321. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUGCC 13.1.1Clang 16.0.35001000150020002500SE +/- 42.69, N = 15SE +/- 8.52, N = 152191.22468.27-fopenmp - MIN: 1875.06-fopenmp=libomp - MIN: 441.141. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 13.1.1Clang 16.0.30.72071.44142.16212.88283.6035SE +/- 0.058, N = 15SE +/- 0.042, N = 33.0633.2031. (CC) gcc options: -lm -lpthread -O3 -march=native -flto

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimeGCC 13.1.1Clang 16.0.3110M220M330M440M550MSE +/- 3102500.18, N = 3SE +/- 8586129.93, N = 15532213592517643140-lgcov -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -mbmi2 -fno-peel-loops -fno-tracer -flto=jobserver-fprofile-use1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -flto -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingGCC 13.1.1Clang 16.0.3306090120150SE +/- 6.54, N = 12SE +/- 2.70, N = 151201281. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIGCC 13.1.1Clang 16.0.31020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 345.2244.97-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64GCC 13.1.1Clang 16.0.3510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 320.6820.32-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceGCC 13.1.1Clang 16.0.3612182430SE +/- 0.00, N = 3SE +/- 0.01, N = 321.1025.67-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: fasthash32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: fasthash32GCC 13.1.1Clang 16.0.3510152025SE +/- 0.01, N = 5SE +/- 0.01, N = 322.1721.87-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: Spooky32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: Spooky32GCC 13.1.1Clang 16.0.3612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 326.6626.71-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: SHA3-256GCC 13.1.1Clang 16.0.35001000150020002500SE +/- 368.15, N = 9SE +/- 8.90, N = 32113.411041.18-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: SHA3-256GCC 13.1.1Clang 16.0.380160240320400SE +/- 16.57, N = 9SE +/- 1.16, N = 3208.77363.90-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

SMHasher

Hash: wyhash

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: wyhashGCC 13.1.1Clang 16.0.348121620SE +/- 0.00, N = 3SE +/- 0.00, N = 314.9714.07-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native -flto

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.1.1Clang 16.0.360120180240300SE +/- 0.29, N = 3SE +/- 7.42, N = 15276.09171.421. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 13.1.1Clang 16.0.315003000450060007500SE +/- 7.29, N = 3SE +/- 185.48, N = 156902.234285.591. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm


Phoronix Test Suite v10.8.5