GCC Clang Compiler Benchmarks Zen 4 Threadripper

GCC and Clang compiler benchmarks by Michael Larabel for a future year-end 2023 article.

HTML result view exported from: https://openbenchmarking.org/result/2312240-PTS-COMPILER73&grs&sor.

System details (shared by all four configurations: GCC 13.2, Clang 17.0.2, Clang 18 23 Dec, GCC 14 23 Dec)

Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads)
Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS)
Chipset: AMD Device 14a4
Memory: 128GB
Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1
Graphics: NVIDIA RTX A4000 16GB
Audio: NVIDIA GA104 HD Audio
Monitor: ASUS VP28U
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 23.10
Kernel: 6.5.0-14-generic (x86_64)
Desktop: GNOME Shell 45.0
Display Server: X Server 1.21.1.7
Display Driver: NVIDIA 535.129.03
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.2.147
File-System: ext4
Screen Resolution: 3840x2160

Compiler (per configuration)
- GCC 13.2: GCC 13.2.0
- Clang 17.0.2: Clang 17.0.2
- Clang 18 23 Dec: Clang 18.0.0
- GCC 14 23 Dec: GCC 14.0.0 20231224

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- GCC 13.2: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- GCC 14 23 Dec: --disable-multilib --enable-checking=release --enable-languages=c,c++

Processor Details
- Scaling Governor: amd-pstate-epp performance (EPP: performance)
- CPU Microcode: 0xa108105

OpenCL Details
- GPU Compute Cores: 6144

Python Details
- Python 3.11.6

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Mitigation of safe RET
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

Results overview: one row per test, with columns for GCC 13.2, Clang 17.0.2, Clang 18 23 Dec, and GCC 14 23 Dec. Per-test results follow.

SMHasher

Hash: SHA3-256

SMHasher 2022-08-22, Hash: SHA3-256 (MiB/sec, More Is Better)
- Clang 18 23 Dec: 463.94 (SE +/- 3.98, N = 3; -flto=thin)
- Clang 17.0.2: 459.02 (SE +/- 4.14, N = 7; -flto=thin)
- GCC 13.2: 290.65 (SE +/- 3.50, N = 4; -flto=auto -fno-fat-lto-objects)
- GCC 14 23 Dec: 287.86 (SE +/- 2.29, N = 3; -flto=auto -fno-fat-lto-objects)
1. (CXX) g++ options: -O3 -march=native
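
To read a "More Is Better" chart as relative performance, each result can be divided by a chosen baseline. The sketch below does that for the SHA3-256 numbers above, using GCC 13.2 as the baseline. The `relative_to` helper and the choice of baseline are illustrative assumptions, not part of the exported result file.

```python
# Results copied from the SMHasher SHA3-256 chart (MiB/sec, more is better).
results = {
    "Clang 18 23 Dec": 463.94,
    "Clang 17.0.2": 459.02,
    "GCC 13.2": 290.65,
    "GCC 14 23 Dec": 287.86,
}


def relative_to(baseline, values):
    """Divide every result by the baseline's result (1.0 means equal to baseline)."""
    base = values[baseline]
    return {name: value / base for name, value in values.items()}


# Hypothetical choice: treat the distribution compiler (GCC 13.2) as the baseline.
ratios = relative_to("GCC 13.2", results)

# Print the configurations from fastest to slowest.
for name, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {ratio:.3f}x of GCC 13.2")
```

A ratio above 1.0 means the configuration hashed more data per second than the baseline.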

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.1, Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
- GCC 14 23 Dec: 364041629243 (SE +/- 38965461.92, N = 3)
- GCC 13.2: 361301039787 (SE +/- 138801171.04, N = 3)
- Clang 17.0.2: 229128259903 (SE +/- 74912722.41, N = 3)
- Clang 18 23 Dec: 228903633260 (SE +/- 143870310.16, N = 3)
Additional flag reported for two of the four configurations: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.1, Algorithm: ChaCha20 (byte/s, More Is Better)
- GCC 14 23 Dec: 513536630053 (SE +/- 289414516.97, N = 3)
- GCC 13.2: 512442016473 (SE +/- 264097935.23, N = 3)
- Clang 17.0.2: 335909931727 (SE +/- 201482071.29, N = 3)
- Clang 18 23 Dec: 335886221587 (SE +/- 195079194.35, N = 3)
Additional flag reported for two of the four configurations: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.38, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
- GCC 13.2: 1069 (SE +/- 5.24, N = 3)
- GCC 14 23 Dec: 1048 (SE +/- 5.13, N = 3)
- Clang 17.0.2: 751 (SE +/- 3.18, N = 3)
- Clang 18 23 Dec: 747 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
- GCC 14 23 Dec: 4.912 (SE +/- 0.029, N = 3)
- GCC 13.2: 4.950 (SE +/- 0.029, N = 3)
- Clang 17.0.2: 6.848 (SE +/- 0.068, N = 3)
- Clang 18 23 Dec: 6.896 (SE +/- 0.067, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native
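
For "Fewer Is Better" results such as this one, the ratio has to be inverted: a speedup is the slower time divided by the faster time. The following is a minimal sketch using the C-Ray times above; the `speedup_over` helper is a hypothetical name introduced only for illustration.

```python
# C-Ray total times in seconds, copied from the chart (fewer is better).
times = {
    "GCC 14 23 Dec": 4.912,
    "GCC 13.2": 4.950,
    "Clang 17.0.2": 6.848,
    "Clang 18 23 Dec": 6.896,
}


def speedup_over(slower, faster, values):
    """Return how many times faster `faster` finished compared with `slower`.

    For time-based metrics the ratio is slower_time / faster_time,
    the inverse of what is used for throughput metrics.
    """
    return values[slower] / values[faster]


gcc14_vs_clang18 = speedup_over("Clang 18 23 Dec", "GCC 14 23 Dec", times)
print(f"GCC 14 23 Dec is {gcc14_vs_clang18:.3f}x as fast as Clang 18 23 Dec")
```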

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.38, Operation: Swirl (Iterations Per Minute, More Is Better)
- GCC 13.2: 2600 (SE +/- 3.93, N = 3)
- GCC 14 23 Dec: 2446 (SE +/- 0.88, N = 3)
- Clang 17.0.2: 1987 (SE +/- 1.33, N = 3)
- Clang 18 23 Dec: 1966 (SE +/- 2.89, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.38, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
- GCC 13.2: 817 (SE +/- 4.33, N = 3)
- GCC 14 23 Dec: 807 (SE +/- 5.04, N = 3)
- Clang 18 23 Dec: 623 (SE +/- 0.33, N = 3)
- Clang 17.0.2: 623 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.30, Backend: Eigen (Nodes Per Second, More Is Better)
- Clang 18 23 Dec: 857 (SE +/- 7.84, N = 3)
- Clang 17.0.2: 816 (SE +/- 11.58, N = 9)
- GCC 13.2: 671 (SE +/- 6.34, N = 9)
1. (CXX) g++ options: -flto -O3 -march=native -pthread

SMHasher

Hash: FarmHash32 x86_64 AVX

SMHasher 2022-08-22, Hash: FarmHash32 x86_64 AVX (MiB/sec, More Is Better)
- GCC 13.2: 51654.97 (SE +/- 264.89, N = 3; -flto=auto -fno-fat-lto-objects)
- GCC 14 23 Dec: 50400.25 (SE +/- 531.14, N = 3; -flto=auto -fno-fat-lto-objects)
- Clang 18 23 Dec: 41578.48 (SE +/- 372.44, N = 3; -flto=thin)
- Clang 17.0.2: 40846.89 (SE +/- 439.18, N = 3; -flto=thin)
1. (CXX) g++ options: -O3 -march=native

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6, Threads: 1 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
- Clang 18 23 Dec: 22239000 (SE +/- 100374.30, N = 3)
- Clang 17.0.2: 21974667 (SE +/- 88706.13, N = 3)
- GCC 14 23 Dec: 17950067 (SE +/- 235451.13, N = 15)
- GCC 13.2: 17781400 (SE +/- 127219.34, N = 15)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

WebP Image Encode 1.2.4, Encode Settings: Quality 100, Highest Compression (MP/s, More Is Better)
- Clang 17.0.2: 5.60 (SE +/- 0.04, N = 3)
- Clang 18 23 Dec: 5.39 (SE +/- 0.02, N = 3)
- GCC 13.2: 4.51 (SE +/- 0.03, N = 3)
- GCC 14 23 Dec: 4.48 (SE +/- 0.01, N = 3)
Additional flags reported for two of the four configurations: -lpng16 -ljpeg -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.38, Operation: Enhanced (Iterations Per Minute, More Is Better)
- GCC 13.2: 1638 (SE +/- 4.04, N = 3)
- Clang 17.0.2: 1479 (SE +/- 1.53, N = 3)
- Clang 18 23 Dec: 1457 (SE +/- 2.65, N = 3)
- GCC 14 23 Dec: 1331 (SE +/- 5.84, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

TSCP

AI Chess Performance

TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
- Clang 18 23 Dec: 2708002 (SE +/- 9910.25, N = 5)
- Clang 17.0.2: 2458869 (SE +/- 4909.07, N = 5)
- GCC 14 23 Dec: 2246260 (SE +/- 3431.38, N = 5)
- GCC 13.2: 2216345 (SE +/- 21601.14, N = 5)
1. (CC) gcc options: -O3 -march=native

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (GFInst/s, More Is Better)
- Clang 18 23 Dec: 5572.33 (SE +/- 24.10, N = 3)
- Clang 17.0.2: 5376.45 (SE +/- 62.02, N = 3)
- GCC 14 23 Dec: 4789.29 (SE +/- 52.09, N = 5)
- GCC 13.2: 4581.39 (SE +/- 51.09, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, More Is Better)
- Clang 18 23 Dec: 222.89 (SE +/- 0.96, N = 3)
- Clang 17.0.2: 215.06 (SE +/- 2.48, N = 3)
- GCC 14 23 Dec: 191.57 (SE +/- 2.08, N = 5)
- GCC 13.2: 183.26 (SE +/- 2.04, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14, Test: WPA PSK (Real C/S, More Is Better)
- GCC 14 23 Dec: 616447 (SE +/- 3178.61, N = 3)
- GCC 13.2: 613307 (SE +/- 2497.30, N = 3)
- Clang 18 23 Dec: 518594 (SE +/- 5465.43, N = 3)
- Clang 17.0.2: 511546 (SE +/- 4559.35, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.8, Test: Unkeyed Algorithms (MiB/second, More Is Better)
- GCC 13.2: 539.17 (SE +/- 0.95, N = 3)
- Clang 18 23 Dec: 533.23 (SE +/- 2.74, N = 3)
- Clang 17.0.2: 450.46 (SE +/- 0.49, N = 3)
Additional flag reported for one of the three configurations: -fno-devirtualize
1. (CXX) g++ options: -g2 -fPIC -pthread -pipe -O3 -march=native

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6, Threads: 128 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
- Clang 17.0.2: 5040300000 (SE +/- 38722904.51, N = 3)
- Clang 18 23 Dec: 4994600000 (SE +/- 37430513.40, N = 3)
- GCC 14 23 Dec: 4262500000 (SE +/- 43594839.14, N = 3)
- GCC 13.2: 4228033333 (SE +/- 12640191.63, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
- GCC 14 23 Dec: 3955659.29 (SE +/- 38604.00, N = 3)
- GCC 13.2: 3955168.75 (SE +/- 27949.92, N = 3)
- Clang 18 23 Dec: 3642203.84 (SE +/- 34986.71, N = 3)
- Clang 17.0.2: 3332695.38 (SE +/- 19620.88, N = 3)
1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6, Threads: 192 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
- Clang 17.0.2: 6543933333 (SE +/- 23899256.15, N = 3)
- Clang 18 23 Dec: 6541433333 (SE +/- 42403157.64, N = 3)
- GCC 14 23 Dec: 5587766667 (SE +/- 36511383.31, N = 3)
- GCC 13.2: 5518300000 (SE +/- 32255852.18, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6, Threads: 1 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
- Clang 17.0.2: 56815750 (SE +/- 705129.82, N = 4)
- Clang 18 23 Dec: 56732000 (SE +/- 61098.28, N = 3)
- GCC 13.2: 48601000 (SE +/- 13747.73, N = 3)
- GCC 14 23 Dec: 48068000 (SE +/- 134747.91, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Medium

ASTC Encoder 4.0, Preset: Medium (MT/s, More Is Better)
- Clang 17.0.2: 499.39 (SE +/- 0.56, N = 3)
- Clang 18 23 Dec: 482.38 (SE +/- 1.39, N = 3)
- GCC 13.2: 426.49 (SE +/- 0.80, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -pthread

simdjson

Throughput Test: LargeRandom

simdjson 2.0, Throughput Test: LargeRandom (GB/s, More Is Better)
- GCC 13.2: 1.70 (SE +/- 0.00, N = 3)
- Clang 17.0.2: 1.59 (SE +/- 0.01, N = 3)
- Clang 18 23 Dec: 1.46 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -march=native

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5, Sample Rate: 96000 - Buffer Size: 1024 (Render Ratio, More Is Better)
- Clang 17.0.2: 6.474312 (SE +/- 0.004464, N = 3)
- Clang 18 23 Dec: 6.371475 (SE +/- 0.009480, N = 3)
- GCC 14 23 Dec: 5.951828 (SE +/- 0.007988, N = 3)
- GCC 13.2: 5.601831 (SE +/- 0.009825, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5, Sample Rate: 96000 - Buffer Size: 512 (Render Ratio, More Is Better)
- Clang 17.0.2: 5.922697 (SE +/- 0.004937, N = 3)
- Clang 18 23 Dec: 5.827354 (SE +/- 0.003453, N = 3)
- GCC 14 23 Dec: 5.388126 (SE +/- 0.012082, N = 3)
- GCC 13.2: 5.131378 (SE +/- 0.011976, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5, Sample Rate: 44100 - Buffer Size: 512 (Render Ratio, More Is Better)
- Clang 17.0.2: 8.527328 (SE +/- 0.051637, N = 3)
- Clang 18 23 Dec: 8.415251 (SE +/- 0.038112, N = 3)
- GCC 14 23 Dec: 7.530593 (SE +/- 0.023956, N = 3)
- GCC 13.2: 7.403037 (SE +/- 0.067600, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5, Sample Rate: 192000 - Buffer Size: 1024 (Render Ratio, More Is Better)
- Clang 17.0.2: 4.204693 (SE +/- 0.018365, N = 3)
- Clang 18 23 Dec: 4.122784 (SE +/- 0.020650, N = 3)
- GCC 14 23 Dec: 3.955202 (SE +/- 0.006090, N = 3)
- GCC 13.2: 3.657915 (SE +/- 0.008117, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5, Sample Rate: 192000 - Buffer Size: 512 (Render Ratio, More Is Better)
- Clang 17.0.2: 3.738894 (SE +/- 0.004927, N = 3)
- Clang 18 23 Dec: 3.703294 (SE +/- 0.002560, N = 3)
- GCC 14 23 Dec: 3.479722 (SE +/- 0.001538, N = 3)
- GCC 13.2: 3.258674 (SE +/- 0.007583, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5, Sample Rate: 44100 - Buffer Size: 1024 (Render Ratio, More Is Better)
- Clang 17.0.2: 9.124527 (SE +/- 0.009153, N = 3)
- Clang 18 23 Dec: 9.030747 (SE +/- 0.018602, N = 3)
- GCC 14 23 Dec: 8.111928 (SE +/- 0.013114, N = 3)
- GCC 13.2: 7.987047 (SE +/- 0.011137, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6, Threads: 128 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
- Clang 18 23 Dec: 1469833333 (SE +/- 3295114.23, N = 3)
- Clang 17.0.2: 1447966667 (SE +/- 5446813.54, N = 3)
- GCC 14 23 Dec: 1301200000 (SE +/- 1882374.39, N = 3)
- GCC 13.2: 1293233333 (SE +/- 3447865.68, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

SecureMark

Benchmark: SecureMark-TLS

SecureMark 1.0.4, Benchmark: SecureMark-TLS (marks, More Is Better)
- Clang 17.0.2: 442902 (SE +/- 3387.50, N = 3)
- Clang 18 23 Dec: 420687 (SE +/- 3191.95, N = 3)
- GCC 14 23 Dec: 406282 (SE +/- 996.80, N = 3)
- GCC 13.2: 392447 (SE +/- 25.96, N = 3)
1. (CC) gcc options: -pedantic -O3

simdjson

Throughput Test: DistinctUserID

simdjson 2.0, Throughput Test: DistinctUserID (GB/s, More Is Better)
- Clang 17.0.2: 9.73 (SE +/- 0.09, N = 7)
- Clang 18 23 Dec: 9.72 (SE +/- 0.08, N = 15)
- GCC 13.2: 8.67 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native

C-Blosc

Test: blosclz shuffle - Buffer Size: 256MB

C-Blosc 2.11, Test: blosclz shuffle - Buffer Size: 256MB (MB/s, More Is Better)
- GCC 13.2: 8137.1 (SE +/- 15.55, N = 3; -std=gnu99 -lrt -ldl -lm)
- Clang 17.0.2: 8125.5 (SE +/- 23.87, N = 3; -march=native)
- Clang 18 23 Dec: 7795.9 (SE +/- 34.63, N = 3; -march=native)
- GCC 14 23 Dec: 7258.5 (SE +/- 37.76, N = 3; -std=gnu99 -ldl -lrt -lm)
1. (CC) gcc options: -O3

simdjson

Throughput Test: Kostya

simdjson 2.0, Throughput Test: Kostya (GB/s, More Is Better)
- GCC 13.2: 5.45 (SE +/- 0.03, N = 3)
- Clang 17.0.2: 5.25 (SE +/- 0.06, N = 3)
- Clang 18 23 Dec: 4.87 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -march=native

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenJPEG 2.4, Encode: NASA Curiosity Panorama M34 (ms, Fewer Is Better)
- Clang 18 23 Dec: 74343 (SE +/- 627.99, N = 3)
- Clang 17.0.2: 75630 (SE +/- 34.27, N = 3)
- GCC 13.2: 82008 (SE +/- 357.19, N = 3)
- GCC 14 23 Dec: 82985 (SE +/- 1022.49, N = 4)
1. (CXX) g++ options: -O3 -march=native -rdynamic

QuantLib

Configuration: Single-Threaded

QuantLib 1.32, Configuration: Single-Threaded (MFLOPS, More Is Better)
- Clang 17.0.2: 4578.8 (SE +/- 40.89, N = 3)
- GCC 14 23 Dec: 4504.6 (SE +/- 30.04, N = 3)
- GCC 13.2: 4151.0 (SE +/- 33.97, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.4, WAV To FLAC (Seconds, Fewer Is Better)
- Clang 17.0.2: 11.12 (SE +/- 0.02, N = 5)
- Clang 18 23 Dec: 11.14 (SE +/- 0.05, N = 5)
- GCC 14 23 Dec: 11.85 (SE +/- 0.06, N = 5)
- GCC 13.2: 12.23 (SE +/- 0.04, N = 5)
1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Kvazaar 2.2, Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
- Clang 17.0.2: 46.65 (SE +/- 0.05, N = 3)
- Clang 18 23 Dec: 45.54 (SE +/- 0.10, N = 3)
- GCC 14 23 Dec: 43.43 (SE +/- 0.16, N = 3)
- GCC 13.2: 42.44 (SE +/- 0.46, N = 3)
Additional flag reported for two of the four configurations: -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better)
- GCC 13.2: 4.747 (SE +/- 0.051, N = 3)
- GCC 14 23 Dec: 4.819 (SE +/- 0.059, N = 3)
- Clang 18 23 Dec: 5.074 (SE +/- 0.047, N = 3)
- Clang 17.0.2: 5.209 (SE +/- 0.022, N = 3)
Additional flags reported for two of the four configurations: -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr
1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

SMHasher

Hash: FarmHash128

SMHasher 2022-08-22, Hash: FarmHash128 (MiB/sec, More Is Better)
- GCC 13.2: 29054.81 (SE +/- 187.58, N = 15; -flto=auto -fno-fat-lto-objects)
- GCC 14 23 Dec: 28278.49 (SE +/- 311.86, N = 3; -flto=auto -fno-fat-lto-objects)
- Clang 17.0.2: 26974.45 (SE +/- 227.32, N = 3; -flto=thin)
- Clang 18 23 Dec: 26494.22 (SE +/- 100.64, N = 3; -flto=thin)
1. (CXX) g++ options: -O3 -march=native

Redis

Test: GET - Parallel Connections: 500

Redis 7.0.4, Test: GET - Parallel Connections: 500 (Requests Per Second, More Is Better)
- Clang 18 23 Dec: 4649256.80 (SE +/- 27204.42, N = 3)
- GCC 14 23 Dec: 4589226.30 (SE +/- 33944.52, N = 15)
- GCC 13.2: 4484373.80 (SE +/- 47771.21, N = 4)
- Clang 17.0.2: 4242865.92 (SE +/- 37859.85, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.2, Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
- Clang 17.0.2: 47.38 (SE +/- 0.23, N = 3)
- Clang 18 23 Dec: 46.76 (SE +/- 0.22, N = 3)
- GCC 14 23 Dec: 44.32 (SE +/- 0.11, N = 3)
- GCC 13.2: 43.54 (SE +/- 0.21, N = 3)
Additional flag reported for two of the four configurations: -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
- GCC 13.2: 3780387 (SE +/- 14867.69, N = 3)
- GCC 14 23 Dec: 3593163 (SE +/- 7434.99, N = 3)
- Clang 17.0.2: 3570369 (SE +/- 34138.17, N = 3)
- Clang 18 23 Dec: 3511928 (SE +/- 23217.69, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm

QuantLib

Configuration: Multi-Threaded

QuantLib 1.32, Configuration: Multi-Threaded (MFLOPS, More Is Better)
- GCC 14 23 Dec: 331562.3 (SE +/- 1487.52, N = 3)
- Clang 17.0.2: 329638.6 (SE +/- 1401.25, N = 3)
- GCC 13.2: 308069.5 (SE +/- 819.02, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
- GCC 13.2: 0.265 (SE +/- 0.001, N = 3)
- GCC 14 23 Dec: 0.278 (SE +/- 0.000, N = 3)
- Clang 17.0.2: 0.280 (SE +/- 0.003, N = 3)
- Clang 18 23 Dec: 0.285 (SE +/- 0.002, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm
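
The average latency figures above appear to track the read-only TPS results for the same test. As a rough cross-check, the sketch below assumes each of the 1000 clients keeps one transaction in flight, so that average latency is approximately clients divided by TPS. That relationship is an assumption made here for illustration; the result file does not state how the latency was derived.

```python
# Clients: 1000, taken from the test configuration line.
CLIENTS = 1000

# Read-only TPS results for the same PostgreSQL test.
tps = {
    "GCC 13.2": 3780387,
    "GCC 14 23 Dec": 3593163,
    "Clang 17.0.2": 3570369,
    "Clang 18 23 Dec": 3511928,
}

# Average latency in milliseconds as reported in the chart.
reported_latency_ms = {
    "GCC 13.2": 0.265,
    "GCC 14 23 Dec": 0.278,
    "Clang 17.0.2": 0.280,
    "Clang 18 23 Dec": 0.285,
}


def estimated_latency_ms(clients, transactions_per_second):
    """Estimate average latency, assuming one in-flight transaction per client."""
    return clients / transactions_per_second * 1000.0


estimates = {name: estimated_latency_ms(CLIENTS, rate) for name, rate in tps.items()}

for name, estimate in estimates.items():
    print(f"{name}: estimated {estimate:.3f} ms, reported {reported_latency_ms[name]:.3f} ms")
```

Under that assumption the estimates round to the reported values, which suggests the two charts present the same measurement in two forms.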

C-Blosc

Test: blosclz noshuffle - Buffer Size: 256MB

C-Blosc 2.11, Test: blosclz noshuffle - Buffer Size: 256MB (MB/s, More Is Better)
- GCC 13.2: 7804.7 (SE +/- 17.86, N = 3; -std=gnu99 -lrt -ldl -lm)
- Clang 17.0.2: 7735.7 (SE +/- 10.30, N = 3; -march=native)
- Clang 18 23 Dec: 7496.2 (SE +/- 0.87, N = 3; -march=native)
- GCC 14 23 Dec: 7291.9 (SE +/- 3.92, N = 3; -std=gnu99 -ldl -lrt -lm)
1. (CC) gcc options: -O3

simdjson

Throughput Test: PartialTweets

simdjson 2.0, Throughput Test: PartialTweets (GB/s, More Is Better)
- Clang 17.0.2: 9.64 (SE +/- 0.12, N = 15)
- Clang 18 23 Dec: 9.51 (SE +/- 0.11, N = 3)
- GCC 13.2: 9.01 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -march=native

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 1.0, Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
- Clang 17.0.2: 3.917 (SE +/- 0.004, N = 3)
- Clang 18 23 Dec: 4.053 (SE +/- 0.027, N = 3)
- GCC 14 23 Dec: 4.106 (SE +/- 0.014, N = 3)
- GCC 13.2: 4.176 (SE +/- 0.004, N = 3)
1. (CXX) g++ options: -O3 -fPIC -march=native -lm

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

uvg266 0.4.1, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
- Clang 18 23 Dec: 71.76 (SE +/- 0.19, N = 3)
- Clang 17.0.2: 70.81 (SE +/- 0.12, N = 3)
- GCC 14 23 Dec: 67.98 (SE +/- 0.27, N = 3)
- GCC 13.2: 67.31 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -march=native

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.38, Operation: Sharpen (Iterations Per Minute, More Is Better)
- GCC 13.2: 1038 (SE +/- 1.73, N = 3)
- GCC 14 23 Dec: 1003 (SE +/- 0.67, N = 3)
- Clang 17.0.2: 985 (SE +/- 2.00, N = 3)
- Clang 18 23 Dec: 977 (SE +/- 4.10, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.3, Compression Level: 9 - Compression Speed (MB/s, More Is Better)
- Clang 18 23 Dec: 69.19 (SE +/- 0.58, N = 3)
- GCC 13.2: 67.01 (SE +/- 0.67, N = 5)
- GCC 14 23 Dec: 66.32 (SE +/- 0.69, N = 3)
- Clang 17.0.2: 65.18 (SE +/- 0.89, N = 3)
1. (CC) gcc options: -O3
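
When results sit this close together, the reported standard errors help judge whether a gap is meaningful. The sketch below divides the difference between two means by the combined standard error, sqrt(SE_a^2 + SE_b^2). This is a rough heuristic chosen for illustration, not a calculation the result file performs, and with only 3 to 5 runs per configuration it should be read loosely. The `separation` helper is a hypothetical name.

```python
import math

# (mean MB/s, standard error) copied from the LZ4 level 9 compression chart.
results = {
    "Clang 18 23 Dec": (69.19, 0.58),
    "GCC 13.2": (67.01, 0.67),
    "GCC 14 23 Dec": (66.32, 0.69),
    "Clang 17.0.2": (65.18, 0.89),
}


def separation(a, b, values):
    """Difference between two means in units of their combined standard error."""
    mean_a, se_a = values[a]
    mean_b, se_b = values[b]
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


gcc_pair = separation("GCC 13.2", "GCC 14 23 Dec", results)
clang_pair = separation("Clang 18 23 Dec", "Clang 17.0.2", results)

print(f"GCC 13.2 vs GCC 14 23 Dec: {gcc_pair:.2f} combined SEs apart")
print(f"Clang 18 23 Dec vs Clang 17.0.2: {clang_pair:.2f} combined SEs apart")
```

A value below about 1 suggests the two results are hard to tell apart from run-to-run noise, while larger values point to a more consistent gap.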

x265

Video Input: Bosphorus 4K

x265 3.4, Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
- Clang 18 23 Dec: 41.05 (SE +/- 0.09, N = 3)
- Clang 17.0.2: 40.43 (SE +/- 0.29, N = 3)
- GCC 13.2: 39.38 (SE +/- 0.17, N = 3)
- GCC 14 23 Dec: 38.70 (SE +/- 0.18, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

John The Ripper

Test: MD5

John The Ripper 2023.03.14, Test: MD5 (Real C/S, More Is Better)
- Clang 18 23 Dec: 15399667 (SE +/- 133559.39, N = 3)
- Clang 17.0.2: 15165000 (SE +/- 114054.08, N = 3)
- GCC 14 23 Dec: 14730000 (SE +/- 75020.00, N = 3)
- GCC 13.2: 14526667 (SE +/- 43498.40, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

C-Blosc

Test: blosclz bitshuffle - Buffer Size: 256MB

C-Blosc 2.11 (MB/s, More Is Better)
GCC 13.2: 7649.3 (SE +/- 30.09, N = 3; options: -std=gnu99 -lrt -ldl -lm)
Clang 17.0.2: 7628.0 (SE +/- 30.25, N = 3; options: -march=native)
Clang 18 23 Dec: 7454.2 (SE +/- 15.49, N = 3; options: -march=native)
GCC 14 23 Dec: 7218.1 (SE +/- 20.20, N = 3; options: -std=gnu99 -ldl -lrt -lm)
1. (CC) gcc options: -O3

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

VVenC 1.9 (Frames Per Second, More Is Better)
GCC 13.2: 16.49 (SE +/- 0.03, N = 3; options: -flto=auto -fno-fat-lto-objects)
GCC 14 23 Dec: 16.35 (SE +/- 0.04, N = 3; options: -flto=auto -fno-fat-lto-objects)
Clang 17.0.2: 15.58 (SE +/- 0.05, N = 3; options: -flto=thin)
1. (CXX) g++ options: -O3 -march=native

WebP Image Encode

Encode Settings: Default

WebP Image Encode 1.2.4 (MP/s, More Is Better)
Clang 17.0.2: 27.08 (SE +/- 0.06, N = 3)
Clang 18 23 Dec: 26.81 (SE +/- 0.14, N = 3)
GCC 14 23 Dec: 26.07 (SE +/- 0.09, N = 3)
GCC 13.2: 25.65 (SE +/- 0.22, N = 3)
Per-run options as exported: -lpng16 -ljpeg -ltiff, -lpng16 -ljpeg -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

Crypto++

Test: Keyed Algorithms

Crypto++ 8.8 (MiB/second, More Is Better)
Clang 18 23 Dec: 894.49 (SE +/- 0.92, N = 3)
Clang 17.0.2: 873.13 (SE +/- 0.30, N = 3)
GCC 13.2: 848.07 (SE +/- 1.20, N = 3)
Per-run options as exported: -fno-devirtualize
1. (CXX) g++ options: -g2 -fPIC -pthread -pipe -O3 -march=native

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.4 (Seconds, Fewer Is Better)
Clang 17.0.2: 19.13 (SE +/- 0.10, N = 5)
Clang 18 23 Dec: 19.29 (SE +/- 0.07, N = 5)
GCC 13.2: 20.03 (SE +/- 0.13, N = 5)
GCC 14 23 Dec: 20.15 (SE +/- 0.01, N = 5)
1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1 1.8 (Frames Per Second, More Is Better)
Clang 17.0.2: 99.21 (SE +/- 0.81, N = 3)
GCC 14 23 Dec: 96.13 (SE +/- 1.18, N = 3)
GCC 13.2: 94.71 (SE +/- 0.65, N = 15)
Clang 18 23 Dec: 94.44 (SE +/- 1.06, N = 4)
1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

GPAW

Input: Carbon Nanotube

GPAW 23.6 (Seconds, Fewer Is Better)
Clang 18 23 Dec: 36.40 (SE +/- 0.07, N = 3)
Clang 17.0.2: 37.62 (SE +/- 0.20, N = 3)
GCC 13.2: 37.89 (SE +/- 0.52, N = 3)
GCC 14 23 Dec: 38.23 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -O3 -march=native -lxc -lblas -lmpi

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Clang 17.0.2: 182570 (SE +/- 2054.27, N = 3)
Clang 18 23 Dec: 181400 (SE +/- 1771.90, N = 6)
GCC 14 23 Dec: 177062 (SE +/- 2123.51, N = 4)
GCC 13.2: 173913 (SE +/- 798.06, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

Kvazaar 2.2 (Frames Per Second, More Is Better)
Clang 17.0.2: 92.52 (SE +/- 0.65, N = 3)
Clang 18 23 Dec: 92.02 (SE +/- 0.58, N = 3)
GCC 13.2: 89.12 (SE +/- 0.68, N = 3)
GCC 14 23 Dec: 88.23 (SE +/- 0.29, N = 3)
Per-run options as exported: -lpthread, -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
GCC 13.2: 5480.25 (SE +/- 61.07, N = 4)
Clang 18 23 Dec: 5271.52 (SE +/- 37.06, N = 3)
Clang 17.0.2: 5230.49 (SE +/- 37.89, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023 (Ns Per Day, More Is Better)
GCC 13.2: 11.20 (SE +/- 0.03, N = 3)
Clang 17.0.2: 10.82 (SE +/- 0.03, N = 3)
Clang 18 23 Dec: 10.70 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
GCC 13.2: 8.75 (SE +/- 0.10, N = 4; MIN: 5.37 / MAX: 26.49; options: -fno-strict-overflow -fwrapv)
Clang 18 23 Dec: 9.09 (SE +/- 0.06, N = 3; MIN: 5.51 / MAX: 25.76)
Clang 17.0.2: 9.16 (SE +/- 0.07, N = 3; MIN: 5.67 / MAX: 27.63)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

SVT-VP9 0.3 (Frames Per Second, More Is Better)
GCC 13.2: 199.54 (SE +/- 2.26, N = 15)
GCC 14 23 Dec: 199.29 (SE +/- 2.39, N = 4)
Clang 18 23 Dec: 194.92 (SE +/- 1.78, N = 15)
Clang 17.0.2: 190.79 (SE +/- 1.99, N = 3)
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

uvg266 0.4.1 (Frames Per Second, More Is Better)
Clang 18 23 Dec: 72.62 (SE +/- 0.29, N = 3)
Clang 17.0.2: 71.84 (SE +/- 0.25, N = 3)
GCC 14 23 Dec: 69.71 (SE +/- 0.45, N = 3)
GCC 13.2: 69.54 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O3 -march=native

OpenSSL

Algorithm: SHA256

OpenSSL 3.1 (byte/s, More Is Better)
Clang 18 23 Dec: 129800447513 (SE +/- 366849309.72, N = 3)
GCC 13.2: 128629387223 (SE +/- 321788378.42, N = 3)
GCC 14 23 Dec: 125384626833 (SE +/- 271320016.95, N = 3)
Clang 17.0.2: 124526197107 (SE +/- 323275121.51, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.2 (Frames Per Second, More Is Better)
Clang 17.0.2: 89.69 (SE +/- 0.37, N = 3)
Clang 18 23 Dec: 89.00 (SE +/- 0.40, N = 3)
GCC 13.2: 86.89 (SE +/- 1.06, N = 3)
GCC 14 23 Dec: 86.05 (SE +/- 0.65, N = 3)
Per-run options as exported: -lpthread, -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Clang 18 23 Dec: 181091 (SE +/- 1422.62, N = 3)
Clang 17.0.2: 180170 (SE +/- 1555.73, N = 3)
GCC 14 23 Dec: 175544 (SE +/- 258.75, N = 3)
GCC 13.2: 173760 (SE +/- 164.19, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

WebP Image Encode

Encode Settings: Quality 100, Lossless

WebP Image Encode 1.2.4 (MP/s, More Is Better)
GCC 13.2: 2.04 (SE +/- 0.01, N = 3)
GCC 14 23 Dec: 2.01 (SE +/- 0.02, N = 3)
Clang 17.0.2: 1.98 (SE +/- 0.01, N = 3)
Clang 18 23 Dec: 1.96 (SE +/- 0.01, N = 3)
Per-run options as exported: -lpng16 -ljpeg -ltiff, -lpng16 -ljpeg -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

VVenC 1.9 (Frames Per Second, More Is Better)
GCC 14 23 Dec: 9.102 (SE +/- 0.019, N = 3; options: -flto=auto -fno-fat-lto-objects)
GCC 13.2: 9.038 (SE +/- 0.090, N = 3; options: -flto=auto -fno-fat-lto-objects)
Clang 17.0.2: 8.747 (SE +/- 0.023, N = 3; options: -flto=thin)
1. (CXX) g++ options: -O3 -march=native

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
Clang 18 23 Dec: 38.94 (SE +/- 0.22, N = 3; MIN: 26.83 / MAX: 59.8)
Clang 17.0.2: 39.38 (SE +/- 0.04, N = 3; MIN: 24.17 / MAX: 80.88)
GCC 13.2: 40.45 (SE +/- 0.43, N = 5; MIN: 20.38 / MAX: 57.81; options: -fno-strict-overflow -fwrapv)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

uvg266

Video Input: Bosphorus 4K - Video Preset: Slow

uvg266 0.4.1 (Frames Per Second, More Is Better)
Clang 17.0.2: 32.01 (SE +/- 0.20, N = 3)
GCC 14 23 Dec: 31.60 (SE +/- 0.18, N = 3)
Clang 18 23 Dec: 31.58 (SE +/- 0.17, N = 3)
GCC 13.2: 30.82 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -march=native

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
Clang 18 23 Dec: 2463.91 (SE +/- 13.82, N = 3)
Clang 17.0.2: 2436.02 (SE +/- 2.50, N = 3)
GCC 13.2: 2372.40 (SE +/- 24.51, N = 5)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

libavif avifenc

Encoder Speed: 0

libavif avifenc 1.0 (Seconds, Fewer Is Better)
Clang 18 23 Dec: 47.37 (SE +/- 0.10, N = 3)
Clang 17.0.2: 47.79 (SE +/- 0.04, N = 3)
GCC 14 23 Dec: 48.05 (SE +/- 0.42, N = 3)
GCC 13.2: 49.18 (SE +/- 0.28, N = 3)
1. (CXX) g++ options: -O3 -fPIC -march=native -lm

libavif avifenc

Encoder Speed: 2

libavif avifenc 1.0 (Seconds, Fewer Is Better)
Clang 18 23 Dec: 25.62 (SE +/- 0.08, N = 3)
Clang 17.0.2: 25.65 (SE +/- 0.15, N = 3)
GCC 14 23 Dec: 26.17 (SE +/- 0.23, N = 3)
GCC 13.2: 26.60 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -O3 -fPIC -march=native -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 (Seconds, Fewer Is Better)
Clang 17.0.2: 109.12 (SE +/- 0.58, N = 3)
Clang 18 23 Dec: 109.58 (SE +/- 0.68, N = 3)
GCC 14 23 Dec: 110.56 (SE +/- 0.79, N = 3)
GCC 13.2: 113.11 (SE +/- 1.70, N = 15)
Per-run options as exported: -mabm, -mabm
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm -lreadline

Zstd Compression

Compression Level: 12 - Decompression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
GCC 13.2: 2212.9 (SE +/- 26.26, N = 3)
Clang 18 23 Dec: 2173.3 (SE +/- 30.79, N = 3)
Clang 17.0.2: 2151.0 (SE +/- 29.73, N = 3)
GCC 14 23 Dec: 2136.2 (SE +/- 10.68, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

uvg266 0.4.1 (Frames Per Second, More Is Better)
Clang 17.0.2: 35.23 (SE +/- 0.11, N = 3)
GCC 14 23 Dec: 34.91 (SE +/- 0.10, N = 3)
Clang 18 23 Dec: 34.66 (SE +/- 0.11, N = 3)
GCC 13.2: 34.05 (SE +/- 0.18, N = 3)
1. (CXX) g++ options: -O3 -march=native

POV-Ray

Trace Time

POV-Ray 3.7.0.7 (Seconds, Fewer Is Better)
Clang 18 23 Dec: 5.848 (SE +/- 0.061, N = 3)
Clang 17.0.2: 5.868 (SE +/- 0.033, N = 3)
GCC 13.2: 6.050 (SE +/- 0.037, N = 3)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.38 (Iterations Per Minute, More Is Better)
GCC 13.2: 690 (SE +/- 0.67, N = 3)
Clang 17.0.2: 685 (SE +/- 1.73, N = 3)
Clang 18 23 Dec: 675 (SE +/- 2.33, N = 3)
GCC 14 23 Dec: 667 (SE +/- 1.20, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 1.8 (Frames Per Second, More Is Better)
GCC 13.2: 212.00 (SE +/- 0.51, N = 3)
Clang 17.0.2: 208.92 (SE +/- 1.47, N = 13)
Clang 18 23 Dec: 205.93 (SE +/- 1.52, N = 3)
GCC 14 23 Dec: 205.10 (SE +/- 1.12, N = 3)
1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

uvg266 0.4.1 (Frames Per Second, More Is Better)
Clang 18 23 Dec: 73.72 (SE +/- 0.24, N = 3)
Clang 17.0.2: 73.25 (SE +/- 0.27, N = 3)
GCC 14 23 Dec: 71.62 (SE +/- 0.15, N = 3)
GCC 13.2: 71.37 (SE +/- 0.51, N = 3)
1. (CXX) g++ options: -O3 -march=native

WebP Image Encode

Encode Settings: Quality 100

WebP Image Encode 1.2.4 (MP/s, More Is Better)
Clang 17.0.2: 15.73 (SE +/- 0.05, N = 3)
GCC 14 23 Dec: 15.65 (SE +/- 0.09, N = 3)
Clang 18 23 Dec: 15.37 (SE +/- 0.01, N = 3)
GCC 13.2: 15.24 (SE +/- 0.12, N = 3)
Per-run options as exported: -lpng16 -ljpeg -ltiff, -lpng16 -ljpeg -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
GCC 13.2: 1895.8 (SE +/- 2.12, N = 3)
Clang 18 23 Dec: 1866.6 (SE +/- 5.93, N = 3)
Clang 17.0.2: 1856.6 (SE +/- 14.06, N = 3)
GCC 14 23 Dec: 1837.3 (SE +/- 18.21, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.19 (Ops/sec, More Is Better)
Clang 18 23 Dec: 6108890.58 (SE +/- 27110.93, N = 3)
Clang 17.0.2: 6092998.10 (SE +/- 18523.18, N = 3)
GCC 14 23 Dec: 5956401.51 (SE +/- 50152.31, N = 3)
GCC 13.2: 5921751.45 (SE +/- 44027.28, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

libavif avifenc

Encoder Speed: 6

libavif avifenc 1.0 (Seconds, Fewer Is Better)
Clang 17.0.2: 2.087 (SE +/- 0.002, N = 3)
GCC 13.2: 2.116 (SE +/- 0.004, N = 3)
GCC 14 23 Dec: 2.142 (SE +/- 0.005, N = 3)
Clang 18 23 Dec: 2.149 (SE +/- 0.006, N = 3)
1. (CXX) g++ options: -O3 -fPIC -march=native -lm

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

SVT-AV1 1.8 (Frames Per Second, More Is Better)
Clang 17.0.2: 9.872 (SE +/- 0.063, N = 3)
GCC 14 23 Dec: 9.717 (SE +/- 0.096, N = 5)
Clang 18 23 Dec: 9.667 (SE +/- 0.065, N = 3)
GCC 13.2: 9.589 (SE +/- 0.076, N = 3)
1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenSSL

Algorithm: SHA512

OpenSSL 3.1 (byte/s, More Is Better)
Clang 18 23 Dec: 43912408803 (SE +/- 11847693.41, N = 3)
Clang 17.0.2: 43131468240 (SE +/- 29437974.38, N = 3)
GCC 13.2: 43028797080 (SE +/- 55003825.70, N = 3)
GCC 14 23 Dec: 42698789480 (SE +/- 7258386.62, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

Zstd Compression

Compression Level: 12 - Compression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
GCC 13.2: 416.1 (SE +/- 4.45, N = 3)
Clang 17.0.2: 414.4 (SE +/- 2.30, N = 3)
GCC 14 23 Dec: 411.4 (SE +/- 3.96, N = 3)
Clang 18 23 Dec: 405.5 (SE +/- 4.56, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
Clang 18 23 Dec: 1528766667 (SE +/- 4115148.17, N = 3)
GCC 13.2: 1496100000 (SE +/- 5921430.01, N = 3)
Clang 17.0.2: 1495833333 (SE +/- 4870432.33, N = 3)
GCC 14 23 Dec: 1491466667 (SE +/- 7348998.42, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

SMHasher

Hash: MeowHash x86_64 AES-NI

SMHasher 2022-08-22 (MiB/sec, More Is Better)
GCC 13.2: 68597.56 (SE +/- 715.67, N = 3; options: -flto=auto -fno-fat-lto-objects)
GCC 14 23 Dec: 68248.31 (SE +/- 626.56, N = 6; options: -flto=auto -fno-fat-lto-objects)
Clang 18 23 Dec: 67945.38 (SE +/- 980.00, N = 3; options: -flto=thin)
Clang 17.0.2: 67078.43 (SE +/- 966.63, N = 3; options: -flto=thin)
1. (CXX) g++ options: -O3 -march=native

simdjson

Throughput Test: TopTweet

simdjson 2.0 (GB/s, More Is Better)
GCC 13.2: 9.72 (SE +/- 0.04, N = 3)
Clang 17.0.2: 9.59 (SE +/- 0.10, N = 3)
Clang 18 23 Dec: 9.51 (SE +/- 0.09, N = 6)
1. (CXX) g++ options: -O3 -march=native

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

SVT-AV1 1.8 (Frames Per Second, More Is Better)
GCC 13.2: 208.82 (SE +/- 0.96, N = 3)
Clang 17.0.2: 208.71 (SE +/- 0.72, N = 3)
GCC 14 23 Dec: 205.98 (SE +/- 1.82, N = 15)
Clang 18 23 Dec: 204.33 (SE +/- 2.03, N = 3)
1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better)
GCC 13.2: 42.45 (SE +/- 0.20, N = 3)
Clang 17.0.2: 42.37 (SE +/- 0.22, N = 3)
Clang 18 23 Dec: 41.66 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -O3 -march=native -lm -ldl

Xmrig

Variant: KawPow - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
GCC 14 23 Dec: 57660.1 (SE +/- 668.49, N = 3; options: -static-libgcc -static-libstdc++)
Clang 17.0.2: 57018.3 (SE +/- 640.77, N = 3; options: -funroll-loops)
GCC 13.2: 56883.5 (SE +/- 417.45, N = 3; options: -static-libgcc -static-libstdc++)
Clang 18 23 Dec: 56651.0 (SE +/- 529.15, N = 3; options: -funroll-loops)
1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
GCC 13.2: 18002.34 (SE +/- 35.94, N = 3)
Clang 17.0.2: 17698.62 (SE +/- 14.36, N = 3)
Clang 18 23 Dec: 17696.86 (SE +/- 15.90, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Clang 17.0.2: 17310.2 (SE +/- 45.07, N = 3)
GCC 14 23 Dec: 17246.0 (SE +/- 98.79, N = 3)
Clang 18 23 Dec: 17044.4 (SE +/- 42.11, N = 3)
GCC 13.2: 17018.5 (SE +/- 22.99, N = 5)
1. (CC) gcc options: -O3

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
GCC 13.2: 5.32 (SE +/- 0.01, N = 3; MIN: 3.18 / MAX: 23.99; options: -fno-strict-overflow -fwrapv)
Clang 17.0.2: 5.41 (SE +/- 0.01, N = 3; MIN: 2.92 / MAX: 24.06)
Clang 18 23 Dec: 5.41 (SE +/- 0.01, N = 3; MIN: 3.22 / MAX: 23.18)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
GCC 13.2: 0.62 (SE +/- 0.00, N = 3; MIN: 0.2 / MAX: 20.88; options: -fno-strict-overflow -fwrapv)
Clang 17.0.2: 0.63 (SE +/- 0.01, N = 3; MIN: 0.18 / MAX: 18.68)
Clang 18 23 Dec: 0.63 (SE +/- 0.00, N = 3; MIN: 0.21 / MAX: 18.55)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

ASTC Encoder

Preset: Thorough

ASTC Encoder 4.0 (MT/s, More Is Better)
Clang 17.0.2: 70.01 (SE +/- 0.06, N = 3)
GCC 13.2: 69.84 (SE +/- 0.06, N = 3)
Clang 18 23 Dec: 68.90 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
GCC 13.2: 25.45 (SE +/- 0.06, N = 3; MIN: 13.56 / MAX: 47.82; options: -fno-strict-overflow -fwrapv)
Clang 18 23 Dec: 25.80 (SE +/- 0.16, N = 3; MIN: 13.56 / MAX: 46.4)
Clang 17.0.2: 25.82 (SE +/- 0.13, N = 3; MIN: 12.71 / MAX: 45.92)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
GCC 13.2: 1883.57 (SE +/- 4.24, N = 3)
Clang 18 23 Dec: 1857.83 (SE +/- 11.87, N = 3)
Clang 17.0.2: 1856.67 (SE +/- 9.44, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

Xmrig

Variant: GhostRider - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Clang 17.0.2: 13628.9 (SE +/- 36.45, N = 3; options: -funroll-loops)
GCC 14 23 Dec: 13625.2 (SE +/- 11.65, N = 3; options: -static-libgcc -static-libstdc++)
GCC 13.2: 13592.0 (SE +/- 43.77, N = 3; options: -static-libgcc -static-libstdc++)
Clang 18 23 Dec: 13438.4 (SE +/- 17.51, N = 3; options: -funroll-loops)
1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
GCC 13.2: 141.26 (SE +/- 0.40, N = 3; MIN: 43.59 / MAX: 209.89; options: -fno-strict-overflow -fwrapv)
Clang 17.0.2: 142.62 (SE +/- 0.11, N = 3; MIN: 60.05 / MAX: 210.86)
Clang 18 23 Dec: 143.24 (SE +/- 0.01, N = 3; MIN: 53.57 / MAX: 210.54)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
GCC 13.2: 339.37 (SE +/- 1.01, N = 3)
Clang 17.0.2: 336.16 (SE +/- 0.30, N = 3)
Clang 18 23 Dec: 334.69 (SE +/- 0.04, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (verify/s, More Is Better)
Clang 18 23 Dec: 1552460.9 (SE +/- 595.23, N = 3)
GCC 13.2: 1541833.5 (SE +/- 1093.03, N = 3)
Clang 17.0.2: 1538339.5 (SE +/- 3224.17, N = 3)
GCC 14 23 Dec: 1532147.3 (SE +/- 4067.66, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

WebP Image Encode 1.2.4 (MP/s, More Is Better)
Clang 18 23 Dec: 0.79 (SE +/- 0.00, N = 3)
Clang 17.0.2: 0.79 (SE +/- 0.00, N = 3)
GCC 13.2: 0.79 (SE +/- 0.01, N = 3)
GCC 14 23 Dec: 0.78 (SE +/- 0.01, N = 3)
Per-run options as exported: -lpng16 -ljpeg -ltiff, -lpng16 -ljpeg -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
GCC 13.2: 1795.5 (SE +/- 19.09, N = 3)
Clang 18 23 Dec: 1776.6 (SE +/- 21.26, N = 3)
Clang 17.0.2: 1774.3 (SE +/- 23.70, N = 3)
GCC 14 23 Dec: 1773.2 (SE +/- 21.40, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
Clang 17.0.2: 7774340.30 (SE +/- 54728.52, N = 3)
GCC 14 23 Dec: 7735938.66 (SE +/- 44302.45, N = 3)
GCC 13.2: 7695223.15 (SE +/- 9027.10, N = 3)
Clang 18 23 Dec: 7679397.09 (SE +/- 7633.51, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
Clang 17.0.2: 25.0 (SE +/- 0.12, N = 3)
GCC 13.2: 24.9 (SE +/- 0.09, N = 3)
GCC 14 23 Dec: 24.7 (SE +/- 0.06, N = 3)
Clang 18 23 Dec: 24.7 (SE +/- 0.03, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 4.0 (MT/s, More Is Better)
GCC 13.2: 7.2724 (SE +/- 0.0327, N = 3)
Clang 17.0.2: 7.2323 (SE +/- 0.0190, N = 3)
Clang 18 23 Dec: 7.1900 (SE +/- 0.0210, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -pthread

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.1 (byte/s, More Is Better)
GCC 14 23 Dec: 821035343690 (SE +/- 458494386.59, N = 3)
GCC 13.2: 819163695240 (SE +/- 617348183.36, N = 3)
Clang 17.0.2: 816930832563 (SE +/- 367647705.83, N = 3)
Clang 18 23 Dec: 811977109867 (SE +/- 745364268.21, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

SMHasher

Hash: t1ha0_aes_avx2 x86_64

SMHasher 2022-08-22 (MiB/sec, More Is Better)
GCC 14 23 Dec: 132218.26 (SE +/- 1656.41, N = 3; options: -flto=auto -fno-fat-lto-objects)
Clang 17.0.2: 132159.77 (SE +/- 1814.82, N = 3; options: -flto=thin)
GCC 13.2: 131074.98 (SE +/- 759.57, N = 3; options: -flto=auto -fno-fat-lto-objects)
Clang 18 23 Dec: 130792.83 (SE +/- 954.34, N = 3; options: -flto=thin)
1. (CXX) g++ options: -O3 -march=native

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.2 (Frames Per Second, More Is Better)
GCC 13.2: 94.70 (SE +/- 0.61, N = 3)
Clang 18 23 Dec: 94.59 (SE +/- 0.18, N = 3)
Clang 17.0.2: 94.29 (SE +/- 0.35, N = 3)
GCC 14 23 Dec: 93.72 (SE +/- 0.73, N = 3)
Per-run options as exported: -lpthread, -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

Xmrig

Variant: CryptoNight-Femto UPX2 - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
GCC 13.2: 56592.8 (SE +/- 150.17, N = 3; options: -static-libgcc -static-libstdc++)
Clang 17.0.2: 56300.8 (SE +/- 155.80, N = 3; options: -funroll-loops)
GCC 14 23 Dec: 56155.2 (SE +/- 136.66, N = 3; options: -static-libgcc -static-libstdc++)
Clang 18 23 Dec: 56023.0 (SE +/- 216.50, N = 3; options: -funroll-loops)
1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
GCC 14 23 Dec: 11.8 (SE +/- 0.07, N = 3)
Clang 18 23 Dec: 11.8 (SE +/- 0.06, N = 3)
Clang 17.0.2: 11.8 (SE +/- 0.03, N = 3)
GCC 13.2: 11.7 (SE +/- 0.03, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Xmrig

Variant: Monero - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
GCC 13.2: 56279.6 (SE +/- 228.27, N = 3; options: -static-libgcc -static-libstdc++)
Clang 17.0.2: 55888.6 (SE +/- 154.69, N = 3; options: -funroll-loops)
GCC 14 23 Dec: 55867.9 (SE +/- 234.92, N = 3; options: -static-libgcc -static-libstdc++)
Clang 18 23 Dec: 55839.5 (SE +/- 215.49, N = 3; options: -funroll-loops)
1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: CryptoNight-Heavy - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
GCC 13.2: 56361.0 (SE +/- 147.72, N = 3; options: -static-libgcc -static-libstdc++)
GCC 14 23 Dec: 56126.3 (SE +/- 185.10, N = 3; options: -static-libgcc -static-libstdc++)
Clang 17.0.2: 56105.9 (SE +/- 146.78, N = 3; options: -funroll-loops)
Clang 18 23 Dec: 55924.5 (SE +/- 91.23, N = 3; options: -funroll-loops)
1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
Clang 17.0.2: 93.05 (SE +/- 0.07, N = 3; MIN: 43.63 / MAX: 147.75)
GCC 13.2: 93.18 (SE +/- 0.08, N = 3; MIN: 45.56 / MAX: 146.45; options: -fno-strict-overflow -fwrapv)
Clang 18 23 Dec: 93.71 (SE +/- 0.33, N = 3; MIN: 39.51 / MAX: 151.45)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
Clang 17.0.2: 515.15 (SE +/- 0.40, N = 3)
GCC 13.2: 514.54 (SE +/- 0.42, N = 3)
Clang 18 23 Dec: 511.63 (SE +/- 1.83, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
GCC 13.2: 113554.50 (SE +/- 733.05, N = 3)
Clang 18 23 Dec: 113505.56 (SE +/- 50.71, N = 3)
Clang 17.0.2: 112991.73 (SE +/- 567.66, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
GCC 13.2: 487.79 (SE +/- 0.75, N = 3; MIN: 394.16 / MAX: 515.77; options: -fno-strict-overflow -fwrapv)
Clang 18 23 Dec: 489.66 (SE +/- 0.22, N = 3; MIN: 381.31 / MAX: 514.98)
Clang 17.0.2: 490.20 (SE +/- 0.30, N = 3; MIN: 249.93 / MAX: 516.28)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
GCC 13.2: 98.10 (SE +/- 0.18, N = 3)
Clang 18 23 Dec: 97.72 (SE +/- 0.04, N = 3)
Clang 17.0.2: 97.62 (SE +/- 0.05, N = 3)
Per-run options as exported: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.1 (byte/s, More Is Better)
GCC 14 23 Dec: 948573686080 (SE +/- 227590943.13, N = 3)
GCC 13.2: 946233938740 (SE +/- 926361343.56, N = 3)
Clang 18 23 Dec: 944539664203 (SE +/- 712012274.52, N = 3)
Clang 17.0.2: 944090667870 (SE +/- 1012640703.78, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

PETSc

Test: Streams

PETSc 3.19 (MB/s, More Is Better)
GCC 13.2: 183886.95 (SE +/- 272.88, N = 3)
Clang 17.0.2: 183268.53 (SE +/- 138.55, N = 3)
Clang 18 23 Dec: 183176.56 (SE +/- 20.67, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -fPIC -O3 -O2 -lpthread -lpciaccess -lm

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (sign/s, More Is Better)
GCC 13.2: 50251.7 (SE +/- 84.27, N = 3)
Clang 18 23 Dec: 50206.9 (SE +/- 80.27, N = 3)
Clang 17.0.2: 50097.7 (SE +/- 91.28, N = 3)
GCC 14 23 Dec: 50065.0 (SE +/- 103.69, N = 3)
Per-run options as exported: -Qunused-arguments, -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
Clang 17.0.2: 9.67 (SE +/- 0.03, N = 3; MIN: 4.96 / MAX: 29.01)
Clang 18 23 Dec: 9.67 (SE +/- 0.03, N = 3; MIN: 5.1 / MAX: 28.3)
GCC 13.2: 9.70 (SE +/- 0.03, N = 3; MIN: 4.82 / MAX: 28.19; options: -fno-strict-overflow -fwrapv)
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

Xmrig

Variant: Wownero - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
GCC 14 23 Dec: 71393.6 (SE +/- 161.46, N = 3; options: -static-libgcc -static-libstdc++)
GCC 13.2: 71242.7 (SE +/- 161.43, N = 3; options: -static-libgcc -static-libstdc++)
Clang 18 23 Dec: 71240.6 (SE +/- 267.90, N = 3; options: -funroll-loops)
Clang 17.0.2: 71216.1 (SE +/- 205.42, N = 3; options: -funroll-loops)
1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)

Compiler          Result   SE +/-   N   MIN    MAX     Additional flags
GCC 13.2          8.03     0.03     3   4.11   25.57   -fno-strict-overflow -fwrapv
Clang 17.0.2      8.05     0.04     3   4.4    25.17   (none)
Clang 18 23 Dec   8.05     0.03     3   4.56   25.34   (none)

1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better)

Compiler          Result     SE +/-   N
GCC 13.2          5964.98    20.52    3
Clang 17.0.2      5952.83    28.17    3
Clang 18 23 Dec   5951.95    23.74    3

Additional flags reported for one of the builds: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better)

Compiler          Result     SE +/-   N
GCC 13.2          9919.18    32.34    3
Clang 17.0.2      9912.53    35.14    3
Clang 18 23 Dec   9909.42    27.69    3

Additional flags reported for one of the builds: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

Redis

Test: SET - Parallel Connections: 500

Redis 7.0.4, Test: SET - Parallel Connections: 500 (Requests Per Second, More Is Better)

Compiler          Result        SE +/-     N
GCC 14 23 Dec     3175906.70    22198.55   3
Clang 18 23 Dec   3090972.67    21571.11   3
GCC 13.2          3079293.25    53058.07   15
Clang 17.0.2      2829400.83    37397.13   15

1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

SVT-VP9 0.3, Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K (Frames Per Second, More Is Better)

Compiler          Result    SE +/-   N
Clang 18 23 Dec   235.76    1.77     3
Clang 17.0.2      232.28    0.19     3
GCC 13.2          224.12    8.63     12
GCC 14 23 Dec     218.02    0.67     3

1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SMHasher

Hash: MeowHash x86_64 AES-NI

SMHasher 2022-08-22, Hash: MeowHash x86_64 AES-NI (cycles/hash, Fewer Is Better)

Compiler          Result   SE +/-   N   Additional flags
Clang 17.0.2      34.11    0.40     3   -flto=thin
GCC 14 23 Dec     34.18    0.24     6   -flto=auto -fno-fat-lto-objects
GCC 13.2          34.26    0.24     3   -flto=auto -fno-fat-lto-objects
Clang 18 23 Dec   34.76    0.45     3   -flto=thin

1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: t1ha0_aes_avx2 x86_64

SMHasher 2022-08-22, Hash: t1ha0_aes_avx2 x86_64 (cycles/hash, Fewer Is Better)

Compiler          Result   SE +/-   N   Additional flags
Clang 17.0.2      15.25    0.15     3   -flto=thin
GCC 14 23 Dec     15.43    0.05     3   -flto=auto -fno-fat-lto-objects
Clang 18 23 Dec   15.57    0.08     3   -flto=thin
GCC 13.2          15.64    0.01     3   -flto=auto -fno-fat-lto-objects

1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: FarmHash32 x86_64 AVX

SMHasher 2022-08-22, Hash: FarmHash32 x86_64 AVX (cycles/hash, Fewer Is Better)

Compiler          Result   SE +/-   N   Additional flags
GCC 14 23 Dec     19.86    0.15     3   -flto=auto -fno-fat-lto-objects
GCC 13.2          20.19    0.03     3   -flto=auto -fno-fat-lto-objects
Clang 18 23 Dec   22.07    0.21     3   -flto=thin
Clang 17.0.2      22.56    0.09     3   -flto=thin

1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: FarmHash128

SMHasher 2022-08-22, Hash: FarmHash128 (cycles/hash, Fewer Is Better)

Compiler          Result   SE +/-   N    Additional flags
Clang 17.0.2      25.13    0.07     3    -flto=thin
Clang 18 23 Dec   25.25    0.14     3    -flto=thin
GCC 13.2          35.54    0.17     15   -flto=auto -fno-fat-lto-objects
GCC 14 23 Dec     35.68    0.12     3    -flto=auto -fno-fat-lto-objects

1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: SHA3-256

SMHasher 2022-08-22, Hash: SHA3-256 (cycles/hash, Fewer Is Better)

Compiler          Result     SE +/-   N   Additional flags
Clang 17.0.2      812.79     7.46     7   -flto=thin
Clang 18 23 Dec   817.90     8.35     3   -flto=thin
GCC 13.2          1353.92    14.31    4   -flto=auto -fno-fat-lto-objects
GCC 14 23 Dec     1356.53    8.88     3   -flto=auto -fno-fat-lto-objects

1. (CXX) g++ options: -O3 -march=native


Phoronix Test Suite v10.8.4