AMD Ryzen Threadripper 7980X compiler benchmarking on Fedora 40 by Michael Larabel for a future article.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2404242-NE-GCC14AMDR04
GCC 14 vs. Clang 18 - AMD Ryzen Threadripper 7980X
HTML result view exported from: https://openbenchmarking.org/result/2404242-NE-GCC14AMDR04&grs&rdt .
Processor: AMD Ryzen Threadripper 7980X 64-Cores @ 7.79GHz (64 Cores / 128 Threads)
Motherboard: System76 Thelio Major (FA Z5 BIOS)
Chipset: AMD Device 14a4
Memory: 4 x 32GB DDR5-4800MT/s Micron MTC20F1045S1RC48BA2
Disk: 1000GB CT1000T700SSD5
Graphics: AMD Radeon Pro W7900 45GB
Audio: AMD Device 14cc
Monitor: DELL P2415Q
Network: Aquantia AQC113C NBase-T/IEEE + Realtek RTL8125 2.5GbE + Intel Wi-Fi 6E
OS: Fedora Linux 40
Kernel: 6.8.5-301.fc40.x86_64 (x86_64)
Desktop: GNOME Shell 46.1
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.0.5 (LLVM 18.1.1 DRM 3.57)
Compiler: GCC 14.0.1 20240411 (GCC 14.0.1 20240411 run); Clang 18.1.1 + LLVM 18.1.1 (Clang 18.1.1 run)
File-System: btrfs
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- GCC 14.0.1 20240411: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver

Processor Details
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
- CPU Microcode: 0xa108105

Python Details
- Python 3.12.2

Security Details
- SELinux
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Mitigation of Safe RET
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected
Test | GCC 14.0.1 20240411 | Clang 18.1.1
smhasher: SHA3-256 | 237.33 | 380.27
openssl: ChaCha20-Poly1305 | 310352617927 | 196215871807
openssl: ChaCha20 | 437829686883 | 288123851390
c-ray: Total Time - 4K, 16 Rays Per Pixel | 6.179 | 8.492
graphics-magick: HWB Color Space | 259 | 209
webp: Quality 100, Highest Compression | 4.57 | 5.64
helsing: 14 digit | 63.285 | 78.057
quicksilver: CORAL2 P1 | 28573333 | 23570000
graphics-magick: Swirl | 554 | 457
smhasher: FarmHash32 x86_64 AVX | 37461.46 | 31033.62
minibude: OpenMP - BM2 | 4376.244 | 5269.550
minibude: OpenMP - BM2 | 175.050 | 210.782
liquid-dsp: 64 - 256 - 512 | 1025733333 | 1231066667
john-the-ripper: WPA PSK | 541461 | 460525
graphics-magick: Noise-Gaussian | 185 | 159
openvino: Noise Suppression Poconet-Like FP16 - CPU | 18.28 | 21.12
openvino: Noise Suppression Poconet-Like FP16 - CPU | 3367.26 | 2933.12
smhasher: wyhash | 26137.98 | 29888.69
quicksilver: CTS2 | 21853333 | 19186667
openvino: Machine Translation EN To DE FP16 - CPU | 89.77 | 100.60
openvino: Machine Translation EN To DE FP16 - CPU | 355.93 | 317.98
john-the-ripper: MD5 | 13318667 | 14840333
coremark: CoreMark Size 666 - Iterations Per Second | 3365815.032822 | 3044772.380526
jpegxl: JPEG - 90 | 46.969 | 51.254
graphics-magick: Sharpen | 216 | 199
graphics-magick: Resizing | 193 | 178
tscp: AI Chess Performance | 1960084 | 2118389
srsran: PUSCH Processor Benchmark, Throughput Total | 4801.9 | 4446.1
compress-lz4: 9 - Compression Speed | 49.02 | 45.86
smhasher: FarmHash128 | 21159.37 | 19825.36
encode-flac: WAV To FLAC | 11.392 | 10.680
stargate: 96000 - 1024 | 6.161616 | 6.550766
gromacs: MPI CPU - water_GMX50_bare | 5.571 | 5.247
kvazaar: Bosphorus 4K - Very Fast | 79.49 | 84.16
uvg266: Bosphorus 4K - Very Fast | 70.73 | 74.38
encode-opus: WAV To Opus Encode | 19.328 | 18.416
liquid-dsp: 128 - 256 - 512 | 1242200000 | 1297866667
kvazaar: Bosphorus 4K - Medium | 37.05 | 38.67
compress-lz4: 9 - Decompression Speed | 5882.1 | 5640.1
compress-zstd: 12 - Compression Speed | 399.1 | 383.4
john-the-ripper: bcrypt | 153446 | 159726
securemark: SecureMark-TLS | 416097 | 432909
compress-zstd: 12 - Decompression Speed | 2251.4 | 2340.9
uvg266: Bosphorus 4K - Super Fast | 72.65 | 75.27
compress-lz4: 3 - Compression Speed | 146.67 | 141.72
kvazaar: Bosphorus 4K - Super Fast | 93.12 | 96.37
lammps: 20k Atoms | 27.242 | 26.333
stargate: 192000 - 1024 | 4.097617 | 4.238755
john-the-ripper: Blowfish | 152866 | 157785
uvg266: Bosphorus 4K - Ultra Fast | 74.07 | 76.45
astcenc: Exhaustive | 5.3446 | 5.5130
openvino: Road Segmentation ADAS FP16-INT8 - CPU | 24.44 | 25.20
openvino: Road Segmentation ADAS FP16-INT8 - CPU | 1304.85 | 1266.10
uvg266: Bosphorus 4K - Medium | 30.39 | 29.50
compress-lz4: 3 - Decompression Speed | 5658.0 | 5493.7
smhasher: Spooky32 | 20053.07 | 20643.45
openssl: SHA256 | 112513527163 | 109345824360
astcenc: Very Thorough | 8.7120 | 8.9615
smhasher: fasthash32 | 8833.71 | 8593.78
draco: Lion | 4049 | 3942
uvg266: Bosphorus 4K - Slow | 27.16 | 26.45
draco: Church Facade | 5240 | 5112
graphics-magick: Enhanced | 250 | 256
x265: Bosphorus 4K | 39.16 | 40.08
compress-zstd: 19 - Decompression Speed | 1917.8 | 1962.8
smhasher: MeowHash x86_64 AES-NI | 50680.40 | 51816.54
svt-av1: Preset 8 - Bosphorus 4K | 95.246 | 93.194
astcenc: Medium | 450.1098 | 459.5831
compress-zstd: 19, Long Mode - Decompression Speed | 1825.3 | 1861.0
quantlib: Single-Threaded | 4611.6 | 4697.6
openvino: Face Detection Retail FP16-INT8 - CPU | 4.38 | 4.46
jpegxl: PNG - 90 | 48.575 | 49.454
astcenc: Thorough | 62.4484 | 63.5606
compress-zstd: 19, Long Mode - Compression Speed | 12.1 | 12.3
rnnoise: 26 Minute Long Talking Sample | 7.607 | 7.484
openvino: Face Detection Retail FP16-INT8 - CPU | 13824.92 | 13601.85
openvino: Person Vehicle Bike Detection FP16 - CPU | 4299.75 | 4232.67
quicksilver: CORAL2 P2 | 21786667 | 22130000
webp: Quality 100 | 14.80 | 14.58
openvino: Person Vehicle Bike Detection FP16 - CPU | 7.33 | 7.44
openvino: Person Re-Identification Retail FP16 - CPU | 5969.17 | 5885.26
srsran: PDSCH Processor Benchmark, Throughput Total | 20064.5 | 20338.7
svt-av1: Preset 12 - Bosphorus 4K | 197.492 | 194.885
openvino: Person Re-Identification Retail FP16 - CPU | 5.33 | 5.40
graphics-magick: Rotate | 155 | 153
webp: Quality 100, Lossless, Highest Compression | 0.81 | 0.82
synthmark: VoiceMark_100 | 990.570 | 1002.515
jpegxl: PNG - 100 | 41.175 | 40.707
openvino: Vehicle Detection FP16-INT8 - CPU | 6.38 | 6.45
openvino: Vehicle Detection FP16-INT8 - CPU | 4938.07 | 4888.56
webp: Quality 100, Lossless | 2.07 | 2.05
openssl: AES-256-GCM | 707658246553 | 701069882433
smhasher: t1ha0_aes_avx2 x86_64 | 96787.57 | 97667.26
openvino: Face Detection FP16-INT8 - CPU | 84.53 | 83.82
openvino: Weld Porosity Detection FP16-INT8 - CPU | 8089.55 | 8024.94
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU | 116609.44 | 115684.85
svt-av1: Preset 13 - Bosphorus 4K | 193.779 | 195.307
openvino: Weld Porosity Detection FP16-INT8 - CPU | 7.64 | 7.70
openvino: Face Detection FP16-INT8 - CPU | 377.58 | 380.50
kvazaar: Bosphorus 4K - Ultra Fast | 97.05 | 97.80
quantlib: Multi-Threaded | 293011.2 | 295240.5
openssl: AES-128-GCM | 822088248977 | 816719157710
minibude: OpenMP - BM1 | 4397.544 | 4369.140
minibude: OpenMP - BM1 | 175.902 | 174.766
jpegxl-decode: All | 600.868 | 597.362
openssl: SHA512 | 37084177003 | 37246099377
compress-zstd: 19 - Compression Speed | 25.9 | 25.8
smhasher: t1ha2_atonce | 19662.33 | 19737.39
openssl: RSA4096 | 1280463.3 | 1275596.8
primesieve: 1e13 | 26.351 | 26.431
openssl: RSA4096 | 44305.1 | 44172.3
openvino: Person Detection FP16 - CPU | 200.06 | 199.47
openvino: Person Detection FP16 - CPU | 159.76 | 160.20
openvino: Handwritten English Recognition FP16-INT8 - CPU | 27.86 | 27.93
openvino: Handwritten English Recognition FP16-INT8 - CPU | 2288.66 | 2283.11
jpegxl: JPEG - 100 | 41.694 | 41.653
svt-av1: Preset 4 - Bosphorus 4K | 9.848 | 9.853
encode-wavpack: WAV To WavPack | 3.702 | 3.701
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU | 0.43 | 0.43
liquid-dsp: 1 - 256 - 512 | 18258917 | 23426308
john-the-ripper: HMAC-SHA512 | 298853000 | 214448333
smhasher: MeowHash x86_64 AES-NI | 41.836 | 42.393
smhasher: t1ha0_aes_avx2 x86_64 | 19.085 | 18.839
smhasher: FarmHash32 x86_64 AVX | 24.114 | 26.999
smhasher: t1ha2_atonce | 19.110 | 19.351
smhasher: FarmHash128 | 43.847 | 31.024
smhasher: fasthash32 | 20.797 | 20.564
smhasher: Spooky32 | 25.094 | 25.146
smhasher: SHA3-256 | 1648.905 | 996.372
smhasher: wyhash | 13.442 | 12.968
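The side-by-side figures are easier to compare once each pair is reduced to a single ratio. The following is a minimal sketch, not part of the result file: it takes a handful of rows from the overview, uses the "More Is Better" / "Fewer Is Better" label from each test's detailed result to orient the ratio, and summarizes with a geometric mean. The function names and the choice of rows are illustrative assumptions.

```python
import math

# (test, GCC result, Clang result, more_is_better)
# Values are copied from the overview; the direction flag comes from each
# test's "More Is Better" / "Fewer Is Better" label. The subset is arbitrary.
ROWS = [
    ("smhasher: SHA3-256 (MiB/sec)", 237.33, 380.27, True),
    ("c-ray: 4K, 16 Rays Per Pixel (Seconds)", 6.179, 8.492, False),
    ("minibude: OpenMP - BM2 (GFInst/s)", 4376.244, 5269.550, True),
    ("encode-flac: WAV To FLAC (Seconds)", 11.392, 10.680, False),
]


def clang_speedup(gcc, clang, more_is_better):
    """Clang performance relative to GCC; above 1.0 means Clang is faster."""
    return clang / gcc if more_is_better else gcc / clang


def geometric_mean(values):
    """Geometric mean, the usual way to average performance ratios."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


speedups = [clang_speedup(g, c, m) for _, g, c, m in ROWS]
for (name, _, _, _), ratio in zip(ROWS, speedups):
    print(f"{name}: Clang/GCC = {ratio:.3f}")
print(f"geometric mean over {len(ROWS)} tests: {geometric_mean(speedups):.3f}")
```

Inverting the ratio for time-based tests keeps every entry on the same "higher favors Clang" scale, so results with different units can share one summary number.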
SMHasher 2022-08-22 - Hash: SHA3-256 (MiB/sec, More Is Better)
GCC 14.0.1 20240411: 237.33 (SE +/- 1.54, N = 3)
Clang 18.1.1: 380.27 (SE +/- 3.36, N = 3)
Per-run options: -flto=auto -fno-fat-lto-objects -flto=thin
1. (CXX) g++ options: -O3 -march=native

OpenSSL 3.3 - Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
GCC 14.0.1 20240411: 310352617927 (SE +/- 34389860.61, N = 3)
Clang 18.1.1: 196215871807 (SE +/- 32266915.14, N = 3)
Per-run options: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL 3.3 - Algorithm: ChaCha20 (byte/s, More Is Better)
GCC 14.0.1 20240411: 437829686883 (SE +/- 56533218.78, N = 3)
Clang 18.1.1: 288123851390 (SE +/- 161435637.78, N = 3)
Per-run options: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
GCC 14.0.1 20240411: 6.179 (SE +/- 0.026, N = 3)
Clang 18.1.1: 8.492 (SE +/- 0.018, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native

GraphicsMagick 1.3.43 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
GCC 14.0.1 20240411: 259 (SE +/- 0.67, N = 3)
Clang 18.1.1: 209 (SE +/- 0.33, N = 3)
Per-run options: -lgomp -lomp
1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

WebP Image Encode 1.2.4 - Encode Settings: Quality 100, Highest Compression (MP/s, More Is Better)
GCC 14.0.1 20240411: 4.57 (SE +/- 0.00, N = 3)
Clang 18.1.1: 5.64 (SE +/- 0.01, N = 3)
Per-run options: -lpng16 -ljpeg
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

Helsing 1.0-beta - Digit Range: 14 digit (Seconds, Fewer Is Better)
GCC 14.0.1 20240411: 63.29 (SE +/- 0.21, N = 3)
Clang 18.1.1: 78.06 (SE +/- 0.30, N = 3)
1. (CC) gcc options: -O2 -pthread

Quicksilver 20230818 - Input: CORAL2 P1 (Figure Of Merit, More Is Better)
GCC 14.0.1 20240411: 28573333 (SE +/- 26034.17, N = 3)
Clang 18.1.1: 23570000 (SE +/- 32145.50, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native

GraphicsMagick 1.3.43 - Operation: Swirl (Iterations Per Minute, More Is Better)
GCC 14.0.1 20240411: 554 (SE +/- 0.88, N = 3)
Clang 18.1.1: 457 (SE +/- 0.58, N = 3)
Per-run options: -lgomp -lomp
1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

SMHasher 2022-08-22 - Hash: FarmHash32 x86_64 AVX (MiB/sec, More Is Better)
GCC 14.0.1 20240411: 37461.46 (SE +/- 13.98, N = 3)
Clang 18.1.1: 31033.62 (SE +/- 154.09, N = 3)
Per-run options: -flto=auto -fno-fat-lto-objects -flto=thin
1. (CXX) g++ options: -O3 -march=native

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM2 (GFInst/s, More Is Better)
GCC 14.0.1 20240411: 4376.24 (SE +/- 5.94, N = 3)
Clang 18.1.1: 5269.55 (SE +/- 6.63, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, More Is Better)
GCC 14.0.1 20240411: 175.05 (SE +/- 0.24, N = 3)
Clang 18.1.1: 210.78 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
GCC 14.0.1 20240411: 1025733333 (SE +/- 3868821.24, N = 3)
Clang 18.1.1: 1231066667 (SE +/- 5417358.93, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

John The Ripper 2023.03.14 - Test: WPA PSK (Real C/S, More Is Better)
GCC 14.0.1 20240411: 541461 (SE +/- 668.00, N = 3)
Clang 18.1.1: 460525 (SE +/- 3691.65, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

GraphicsMagick 1.3.43 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
GCC 14.0.1 20240411: 185 (SE +/- 0.33, N = 3)
Clang 18.1.1: 159 (SE +/- 0.00, N = 3)
Per-run options: -lgomp -lomp
1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

OpenVINO 2024.0 - Model: Noise Suppression Poconet-Like FP16 - Device: CPU (ms, Fewer Is Better)
GCC 14.0.1 20240411: 18.28 (SE +/- 0.14, N = 15; MIN: 7.31 / MAX: 109.14)
Clang 18.1.1: 21.12 (SE +/- 0.18, N = 15; MIN: 7.43 / MAX: 121.23)
Per-run options: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO 2024.0 - Model: Noise Suppression Poconet-Like FP16 - Device: CPU (FPS, More Is Better)
GCC 14.0.1 20240411: 3367.26 (SE +/- 28.58, N = 15)
Clang 18.1.1: 2933.12 (SE +/- 27.73, N = 15)
Per-run options: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

SMHasher 2022-08-22 - Hash: wyhash (MiB/sec, More Is Better)
GCC 14.0.1 20240411: 26137.98 (SE +/- 77.66, N = 3)
Clang 18.1.1: 29888.69 (SE +/- 26.97, N = 3)
Per-run options: -flto=auto -fno-fat-lto-objects -flto=thin
1. (CXX) g++ options: -O3 -march=native

Quicksilver 20230818 - Input: CTS2 (Figure Of Merit, More Is Better)
GCC 14.0.1 20240411: 21853333 (SE +/- 6666.67, N = 3)
Clang 18.1.1: 19186667 (SE +/- 75351.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native

OpenVINO 2024.0 - Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
GCC 14.0.1 20240411: 89.77 (SE +/- 0.13, N = 3; MIN: 33.39 / MAX: 179.49)
Clang 18.1.1: 100.60 (SE +/- 0.85, N = 15; MIN: 33.14 / MAX: 247.72)
Per-run options: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO 2024.0 - Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better)
GCC 14.0.1 20240411: 355.93 (SE +/- 0.54, N = 3)
Clang 18.1.1: 317.98 (SE +/- 2.83, N = 15)
Per-run options: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

John The Ripper 2023.03.14 - Test: MD5 (Real C/S, More Is Better)
GCC 14.0.1 20240411: 13318667 (SE +/- 15878.01, N = 3)
Clang 18.1.1: 14840333 (SE +/- 17975.29, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
GCC 14.0.1 20240411: 3365815.03 (SE +/- 1831.80, N = 3)
Clang 18.1.1: 3044772.38 (SE +/- 24583.22, N = 15)
1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

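The two CoreMark runs report different sample counts (N = 3 and N = 15), so their SE figures are not directly comparable as a measure of run-to-run spread. As a rough sketch, and assuming the reported SE is the usual standard error of the mean (SD divided by the square root of N), the spread can be recovered as follows. The helper names are illustrative and not part of the Phoronix Test Suite.

```python
import math


def std_dev_from_se(se, n):
    """Recover the sample standard deviation, assuming SE = SD / sqrt(N)."""
    return se * math.sqrt(n)


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


# CoreMark figures from the result above: (mean, SE, N)
runs = {
    "GCC 14.0.1 20240411": (3365815.03, 1831.80, 3),
    "Clang 18.1.1": (3044772.38, 24583.22, 15),
}

for name, (mean, se, n) in runs.items():
    sd = std_dev_from_se(se, n)
    rel = relative_se_percent(mean, se)
    print(f"{name}: N = {n}, SD ~ {sd:.0f}, SE = {rel:.2f}% of mean")
```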
JPEG-XL libjxl 0.10.1 - Input: JPEG - Quality: 90 (MP/s, More Is Better)
GCC 14.0.1 20240411: 46.97 (SE +/- 0.51, N = 5)
Clang 18.1.1: 51.25 (SE +/- 0.62, N = 4)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm

GraphicsMagick 1.3.43 - Operation: Sharpen (Iterations Per Minute, More Is Better)
GCC 14.0.1 20240411: 216 (SE +/- 0.33, N = 3)
Clang 18.1.1: 199 (SE +/- 1.00, N = 3)
Per-run options: -lgomp -lomp
1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick 1.3.43 - Operation: Resizing (Iterations Per Minute, More Is Better)
GCC 14.0.1 20240411: 193 (SE +/- 0.58, N = 3)
Clang 18.1.1: 178 (SE +/- 0.33, N = 3)
Per-run options: -lgomp -lomp
1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

TSCP 1.81 - AI Chess Performance (Nodes Per Second, More Is Better)
GCC 14.0.1 20240411: 1960084 (SE +/- 3119.54, N = 5)
Clang 18.1.1: 2118389 (SE +/- 2576.48, N = 5)
1. (CC) gcc options: -O3 -march=native

srsRAN Project 23.10.1-20240325 - Test: PUSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
GCC 14.0.1 20240411: 4801.9 (SE +/- 0.38, N = 3; MIN: 3393.9 / MAX: 4802.5)
Clang 18.1.1: 4446.1 (SE +/- 0.23, N = 3; MIN: 2910.7 / MAX: 4446.5)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

LZ4 Compression 1.9.4 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
GCC 14.0.1 20240411: 49.02 (SE +/- 0.30, N = 3)
Clang 18.1.1: 45.86 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -O3 -march=native

SMHasher 2022-08-22 - Hash: FarmHash128 (MiB/sec, More Is Better)
GCC 14.0.1 20240411: 21159.37 (SE +/- 114.70, N = 3)
Clang 18.1.1: 19825.36 (SE +/- 58.42, N = 3)
Per-run options: -flto=auto -fno-fat-lto-objects -flto=thin
1. (CXX) g++ options: -O3 -march=native

FLAC Audio Encoding 1.4 - WAV To FLAC (Seconds, Fewer Is Better)
GCC 14.0.1 20240411: 11.39 (SE +/- 0.02, N = 5)
Clang 18.1.1: 10.68 (SE +/- 0.03, N = 5)
1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

Stargate Digital Audio Workstation 22.11.5 - Sample Rate: 96000 - Buffer Size: 1024 (Render Ratio, More Is Better)
GCC 14.0.1 20240411: 6.161616 (SE +/- 0.000716, N = 3)
Clang 18.1.1: 6.550766 (SE +/- 0.013288, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

GROMACS 2024 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
GCC 14.0.1 20240411: 5.571 (SE +/- 0.005, N = 3)
Clang 18.1.1: 5.247 (SE +/- 0.008, N = 3)
1. (CXX) g++ options: -O3 -march=native -lm

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 79.49 (SE +/- 0.12, N = 3)
Clang 18.1.1: 84.16 (SE +/- 0.17, N = 3)
Per-run options: -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 70.73 (SE +/- 0.15, N = 3)
Clang 18.1.1: 74.38 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -O3 -march=native

Opus Codec Encoding 1.4 - WAV To Opus Encode (Seconds, Fewer Is Better)
GCC 14.0.1 20240411: 19.33 (SE +/- 0.02, N = 5)
Clang 18.1.1: 18.42 (SE +/- 0.02, N = 5)
1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
GCC 14.0.1 20240411: 1242200000 (SE +/- 4106498.91, N = 3)
Clang 18.1.1: 1297866667 (SE +/- 7521155.35, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 37.05 (SE +/- 0.04, N = 3)
Clang 18.1.1: 38.67 (SE +/- 0.06, N = 3)
Per-run options: -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

LZ4 Compression 1.9.4 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
GCC 14.0.1 20240411: 5882.1 (SE +/- 36.97, N = 3)
Clang 18.1.1: 5640.1 (SE +/- 4.45, N = 3)
1. (CC) gcc options: -O3 -march=native

Zstd Compression 1.5.4 - Compression Level: 12 - Compression Speed (MB/s, More Is Better)
GCC 14.0.1 20240411: 399.1 (SE +/- 3.11, N = 3)
Clang 18.1.1: 383.4 (SE +/- 0.40, N = 3)
Per-run options: -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz

John The Ripper 2023.03.14 - Test: bcrypt (Real C/S, More Is Better)
GCC 14.0.1 20240411: 153446 (SE +/- 239.95, N = 3)
Clang 18.1.1: 159726 (SE +/- 1053.28, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

SecureMark 1.0.4 - Benchmark: SecureMark-TLS (marks, More Is Better)
GCC 14.0.1 20240411: 416097 (SE +/- 1794.60, N = 3)
Clang 18.1.1: 432909 (SE +/- 1948.09, N = 3)
1. (CC) gcc options: -pedantic -O3

Zstd Compression 1.5.4 - Compression Level: 12 - Decompression Speed (MB/s, More Is Better)
GCC 14.0.1 20240411: 2251.4 (SE +/- 21.94, N = 3)
Clang 18.1.1: 2340.9 (SE +/- 2.17, N = 3)
Per-run options: -Qunused-arguments
1. (CC) gcc options: -O3 -march=native -pthread -lz

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 72.65 (SE +/- 0.25, N = 3)
Clang 18.1.1: 75.27 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O3 -march=native

LZ4 Compression 1.9.4 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
GCC 14.0.1 20240411: 146.67 (SE +/- 0.46, N = 3)
Clang 18.1.1: 141.72 (SE +/- 1.63, N = 4)
1. (CC) gcc options: -O3 -march=native

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 93.12 (SE +/- 0.91, N = 5)
Clang 18.1.1: 96.37 (SE +/- 0.32, N = 3)
Per-run options: -lpthread
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

LAMMPS Molecular Dynamics Simulator 23Jun2022 - Model: 20k Atoms (ns/day, More Is Better)
GCC 14.0.1 20240411: 27.24 (SE +/- 0.28, N = 3)
Clang 18.1.1: 26.33 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -march=native -lm -ldl

Stargate Digital Audio Workstation 22.11.5 - Sample Rate: 192000 - Buffer Size: 1024 (Render Ratio, More Is Better)
GCC 14.0.1 20240411: 4.097617 (SE +/- 0.009323, N = 3)
Clang 18.1.1: 4.238755 (SE +/- 0.003706, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

John The Ripper 2023.03.14 - Test: Blowfish (Real C/S, More Is Better)
GCC 14.0.1 20240411: 152866 (SE +/- 79.08, N = 3)
Clang 18.1.1: 157785 (SE +/- 609.99, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 74.07 (SE +/- 0.13, N = 3)
Clang 18.1.1: 76.45 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -march=native

ASTC Encoder 4.7 - Preset: Exhaustive (MT/s, More Is Better)
GCC 14.0.1 20240411: 5.3446 (SE +/- 0.0104, N = 3)
Clang 18.1.1: 5.5130 (SE +/- 0.0146, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

OpenVINO 2024.0 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GCC 14.0.1 20240411: 24.44 (SE +/- 0.18, N = 3; MIN: 11.45 / MAX: 51.44)
Clang 18.1.1: 25.20 (SE +/- 0.33, N = 3; MIN: 10.11 / MAX: 73)
Per-run options: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO 2024.0 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (FPS, More Is Better)
GCC 14.0.1 20240411: 1304.85 (SE +/- 9.55, N = 3)
Clang 18.1.1: 1266.10 (SE +/- 16.52, N = 3)
Per-run options: -fno-strict-overflow -fwrapv
1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 30.39 (SE +/- 0.06, N = 3)
Clang 18.1.1: 29.50 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -march=native

LZ4 Compression 1.9.4 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
GCC 14.0.1 20240411: 5658.0 (SE +/- 20.86, N = 3)
Clang 18.1.1: 5493.7 (SE +/- 60.76, N = 4)
1. (CC) gcc options: -O3 -march=native

SMHasher 2022-08-22 - Hash: Spooky32 (MiB/sec, More Is Better)
GCC 14.0.1 20240411: 20053.07 (SE +/- 51.73, N = 3)
Clang 18.1.1: 20643.45 (SE +/- 64.55, N = 3)
Per-run options: -flto=auto -fno-fat-lto-objects -flto=thin
1. (CXX) g++ options: -O3 -march=native

OpenSSL 3.3 - Algorithm: SHA256 (byte/s, More Is Better)
GCC 14.0.1 20240411: 112513527163 (SE +/- 254544356.50, N = 3)
Clang 18.1.1: 109345824360 (SE +/- 281782565.20, N = 3)
Per-run options: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

ASTC Encoder 4.7 - Preset: Very Thorough (MT/s, More Is Better)
GCC 14.0.1 20240411: 8.7120 (SE +/- 0.0183, N = 3)
Clang 18.1.1: 8.9615 (SE +/- 0.0212, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

SMHasher 2022-08-22 - Hash: fasthash32 (MiB/sec, More Is Better)
GCC 14.0.1 20240411: 8833.71 (SE +/- 25.94, N = 3)
Clang 18.1.1: 8593.78 (SE +/- 13.72, N = 3)
Per-run options: -flto=auto -fno-fat-lto-objects -flto=thin
1. (CXX) g++ options: -O3 -march=native

Google Draco 1.5.6 - Model: Lion (ms, Fewer Is Better)
GCC 14.0.1 20240411: 4049 (SE +/- 16.90, N = 3)
Clang 18.1.1: 3942 (SE +/- 18.35, N = 3)
1. (CXX) g++ options: -O3 -march=native

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
GCC 14.0.1 20240411: 27.16 (SE +/- 0.02, N = 3)
Clang 18.1.1: 26.45 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native

Google Draco 1.5.6 - Model: Church Facade (ms, Fewer Is Better)
GCC 14.0.1 20240411: 5240 (SE +/- 9.33, N = 3)
Clang 18.1.1: 5112 (SE +/- 9.17, N = 3)
1. (CXX) g++ options: -O3 -march=native

GraphicsMagick 1.3.43 - Operation: Enhanced (Iterations Per Minute, More Is Better)
GCC 14.0.1 20240411: 250 (SE +/- 0.33, N = 3)
Clang 18.1.1: 256 (SE +/- 0.33, N = 3)
Per-run options: -lgomp -lomp
1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.6 Video Input: Bosphorus 4K GCC 14.0.1 20240411 Clang 18.1.1 9 18 27 36 45 SE +/- 0.17, N = 3 SE +/- 0.12, N = 3 39.16 40.08 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed GCC 14.0.1 20240411 Clang 18.1.1 400 800 1200 1600 2000 SE +/- 2.52, N = 3 SE +/- 2.11, N = 3 1917.8 1962.8 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz
SMHasher Hash: MeowHash x86_64 AES-NI OpenBenchmarking.org MiB/sec, More Is Better SMHasher 2022-08-22 Hash: MeowHash x86_64 AES-NI GCC 14.0.1 20240411 Clang 18.1.1 11K 22K 33K 44K 55K SE +/- 105.54, N = 3 SE +/- 397.13, N = 3 50680.40 51816.54 -flto=auto -fno-fat-lto-objects -flto=thin 1. (CXX) g++ options: -O3 -march=native
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K GCC 14.0.1 20240411 Clang 18.1.1 20 40 60 80 100 SE +/- 0.71, N = 3 SE +/- 0.39, N = 3 95.25 93.19 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
ASTC Encoder Preset: Medium OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.7 Preset: Medium GCC 14.0.1 20240411 Clang 18.1.1 100 200 300 400 500 SE +/- 0.79, N = 3 SE +/- 0.34, N = 3 450.11 459.58 1. (CXX) g++ options: -O3 -flto -pthread
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed GCC 14.0.1 20240411 Clang 18.1.1 400 800 1200 1600 2000 SE +/- 10.14, N = 3 SE +/- 3.56, N = 3 1825.3 1861.0 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz
QuantLib 1.32 - Configuration: Single-Threaded (MFLOPS, More Is Better): GCC 14.0.1 20240411: 4611.6 (SE +/- 36.95, N = 3); Clang 18.1.1: 4697.6 (SE +/- 22.05, N = 3). 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
OpenVINO 2024.0 - Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 4.38 (SE +/- 0.00, N = 3; MIN: 2.05 / MAX: 26.42); Clang 18.1.1: 4.46 (SE +/- 0.00, N = 3; MIN: 2.1 / MAX: 27.33). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
JPEG-XL libjxl 0.10.1 - Input: PNG - Quality: 90 (MP/s, More Is Better): GCC 14.0.1 20240411: 48.58 (SE +/- 0.41, N = 15); Clang 18.1.1: 49.45 (SE +/- 0.47, N = 15). 1. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm
ASTC Encoder 4.7 - Preset: Thorough (MT/s, More Is Better): GCC 14.0.1 20240411: 62.45 (SE +/- 0.03, N = 3); Clang 18.1.1: 63.56 (SE +/- 0.11, N = 3). 1. (CXX) g++ options: -O3 -flto -pthread
Zstd Compression 1.5.4 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better): GCC 14.0.1 20240411: 12.1 (SE +/- 0.07, N = 3); Clang 18.1.1: 12.3 (SE +/- 0.03, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -O3 -march=native -pthread -lz
RNNoise 0.2 - Input: 26 Minute Long Talking Sample (Seconds, Fewer Is Better): GCC 14.0.1 20240411: 7.607 (SE +/- 0.035, N = 3); Clang 18.1.1: 7.484 (SE +/- 0.046, N = 3). 1. (CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden
OpenVINO 2024.0 - Model: Face Detection Retail FP16-INT8 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 13824.92 (SE +/- 11.99, N = 3); Clang 18.1.1: 13601.85 (SE +/- 5.86, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 4299.75 (SE +/- 5.60, N = 3); Clang 18.1.1: 4232.67 (SE +/- 5.05, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
Quicksilver 20230818 - Input: CORAL2 P2 (Figure Of Merit, More Is Better): GCC 14.0.1 20240411: 21786667 (SE +/- 80069.41, N = 3); Clang 18.1.1: 22130000 (SE +/- 10000.00, N = 3). 1. (CXX) g++ options: -fopenmp -O3 -march=native
WebP Image Encode 1.2.4 - Encode Settings: Quality 100 (MP/s, More Is Better): GCC 14.0.1 20240411: 14.80 (SE +/- 0.03, N = 3); Clang 18.1.1: 14.58 (SE +/- 0.04, N = 3). Additional flags: -lpng16 -ljpeg. 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenVINO 2024.0 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 7.33 (SE +/- 0.01, N = 3; MIN: 3.9 / MAX: 28.61); Clang 18.1.1: 7.44 (SE +/- 0.01, N = 3; MIN: 4.21 / MAX: 29.46). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Person Re-Identification Retail FP16 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 5969.17 (SE +/- 3.36, N = 3); Clang 18.1.1: 5885.26 (SE +/- 2.64, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
srsRAN Project 23.10.1-20240325 - Test: PDSCH Processor Benchmark, Throughput Total (Mbps, More Is Better): GCC 14.0.1 20240411: 20064.5 (SE +/- 160.00, N = 15); Clang 18.1.1: 20338.7 (SE +/- 173.47, N = 3). 1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl
SVT-AV1 2.0 - Encoder Mode: Preset 12 - Input: Bosphorus 4K (Frames Per Second, More Is Better): GCC 14.0.1 20240411: 197.49 (SE +/- 1.09, N = 3); Clang 18.1.1: 194.89 (SE +/- 1.72, N = 8). 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenVINO 2024.0 - Model: Person Re-Identification Retail FP16 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 5.33 (SE +/- 0.00, N = 3; MIN: 3.15 / MAX: 26.39); Clang 18.1.1: 5.40 (SE +/- 0.00, N = 3; MIN: 3.38 / MAX: 23.08). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
GraphicsMagick 1.3.43 - Operation: Rotate (Iterations Per Minute, More Is Better): GCC 14.0.1 20240411: 155 (SE +/- 0.00, N = 3); Clang 18.1.1: 153 (SE +/- 0.00, N = 3). Additional flags: -lgomp -lomp. 1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread
WebP Image Encode 1.2.4 - Encode Settings: Quality 100, Lossless, Highest Compression (MP/s, More Is Better): GCC 14.0.1 20240411: 0.81 (SE +/- 0.00, N = 3); Clang 18.1.1: 0.82 (SE +/- 0.00, N = 3). Additional flags: -lpng16 -ljpeg. 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better): GCC 14.0.1 20240411: 990.57 (SE +/- 2.19, N = 3); Clang 18.1.1: 1002.52 (SE +/- 2.74, N = 3). 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
JPEG-XL libjxl 0.10.1 - Input: PNG - Quality: 100 (MP/s, More Is Better): GCC 14.0.1 20240411: 41.18 (SE +/- 0.13, N = 3); Clang 18.1.1: 40.71 (SE +/- 0.04, N = 3). 1. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm
OpenVINO 2024.0 - Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 6.38 (SE +/- 0.01, N = 3; MIN: 3.23 / MAX: 27.63); Clang 18.1.1: 6.45 (SE +/- 0.01, N = 3; MIN: 3.45 / MAX: 34.45). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 4938.07 (SE +/- 6.00, N = 3); Clang 18.1.1: 4888.56 (SE +/- 8.31, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
WebP Image Encode 1.2.4 - Encode Settings: Quality 100, Lossless (MP/s, More Is Better): GCC 14.0.1 20240411: 2.07 (SE +/- 0.01, N = 3); Clang 18.1.1: 2.05 (SE +/- 0.01, N = 3). Additional flags: -lpng16 -ljpeg. 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenSSL 3.3 - Algorithm: AES-256-GCM (byte/s, More Is Better): GCC 14.0.1 20240411: 707658246553 (SE +/- 612129667.02, N = 3); Clang 18.1.1: 701069882433 (SE +/- 847452668.98, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
SMHasher 2022-08-22 - Hash: t1ha0_aes_avx2 x86_64 (MiB/sec, More Is Better): GCC 14.0.1 20240411: 96787.57 (SE +/- 144.34, N = 3); Clang 18.1.1: 97667.26 (SE +/- 51.02, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
OpenVINO 2024.0 - Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 84.53 (SE +/- 0.07, N = 3); Clang 18.1.1: 83.82 (SE +/- 0.08, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 8089.55 (SE +/- 6.57, N = 3); Clang 18.1.1: 8024.94 (SE +/- 9.39, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 116609.44 (SE +/- 108.76, N = 3); Clang 18.1.1: 115684.85 (SE +/- 47.46, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
SVT-AV1 2.0 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better): GCC 14.0.1 20240411: 193.78 (SE +/- 1.61, N = 3); Clang 18.1.1: 195.31 (SE +/- 1.67, N = 15). 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenVINO 2024.0 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 7.64 (SE +/- 0.01, N = 3; MIN: 3.6 / MAX: 33.44); Clang 18.1.1: 7.70 (SE +/- 0.01, N = 3; MIN: 3.41 / MAX: 30.42). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 377.58 (SE +/- 0.25, N = 3; MIN: 186.1 / MAX: 401.52); Clang 18.1.1: 380.50 (SE +/- 0.31, N = 3; MIN: 321.62 / MAX: 405.78). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better): GCC 14.0.1 20240411: 97.05 (SE +/- 0.89, N = 3); Clang 18.1.1: 97.80 (SE +/- 0.86, N = 3). Additional flags: -lpthread. 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
QuantLib 1.32 - Configuration: Multi-Threaded (MFLOPS, More Is Better): GCC 14.0.1 20240411: 293011.2 (SE +/- 975.48, N = 3); Clang 18.1.1: 295240.5 (SE +/- 1085.73, N = 3). 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
OpenSSL 3.3 - Algorithm: AES-128-GCM (byte/s, More Is Better): GCC 14.0.1 20240411: 822088248977 (SE +/- 665285492.45, N = 3); Clang 18.1.1: 816719157710 (SE +/- 246618519.80, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1 (GFInst/s, More Is Better): GCC 14.0.1 20240411: 4397.54 (SE +/- 2.34, N = 3); Clang 18.1.1: 4369.14 (SE +/- 37.96, N = 15). 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1 (Billion Interactions/s, More Is Better): GCC 14.0.1 20240411: 175.90 (SE +/- 0.09, N = 3); Clang 18.1.1: 174.77 (SE +/- 1.52, N = 15). 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
JPEG-XL Decoding libjxl 0.10.1 - CPU Threads: All (MP/s, More Is Better): GCC 14.0.1 20240411: 600.87 (SE +/- 1.93, N = 3); Clang 18.1.1: 597.36 (SE +/- 3.24, N = 3).
OpenSSL 3.3 - Algorithm: SHA512 (byte/s, More Is Better): GCC 14.0.1 20240411: 37084177003 (SE +/- 40543653.85, N = 3); Clang 18.1.1: 37246099377 (SE +/- 38811972.88, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
Zstd Compression 1.5.4 - Compression Level: 19 - Compression Speed (MB/s, More Is Better): GCC 14.0.1 20240411: 25.9 (SE +/- 0.03, N = 3); Clang 18.1.1: 25.8 (SE +/- 0.07, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -O3 -march=native -pthread -lz
SMHasher 2022-08-22 - Hash: t1ha2_atonce (MiB/sec, More Is Better): GCC 14.0.1 20240411: 19662.33 (SE +/- 10.73, N = 3); Clang 18.1.1: 19737.39 (SE +/- 62.32, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
OpenSSL 3.3 - Algorithm: RSA4096 (verify/s, More Is Better): GCC 14.0.1 20240411: 1280463.3 (SE +/- 1971.44, N = 3); Clang 18.1.1: 1275596.8 (SE +/- 967.52, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
Primesieve 12.1 - Length: 1e13 (Seconds, Fewer Is Better): GCC 14.0.1 20240411: 26.35 (SE +/- 0.06, N = 3); Clang 18.1.1: 26.43 (SE +/- 0.02, N = 3). 1. (CXX) g++ options: -O3 -march=native
OpenSSL 3.3 - Algorithm: RSA4096 (sign/s, More Is Better): GCC 14.0.1 20240411: 44305.1 (SE +/- 89.80, N = 3); Clang 18.1.1: 44172.3 (SE +/- 71.24, N = 3). Additional flags: -Qunused-arguments. 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenVINO 2024.0 - Model: Person Detection FP16 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 200.06 (SE +/- 0.27, N = 3); Clang 18.1.1: 199.47 (SE +/- 0.66, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 159.76 (SE +/- 0.22, N = 3; MIN: 54.41 / MAX: 238.04); Clang 18.1.1: 160.20 (SE +/- 0.53, N = 3; MIN: 50.55 / MAX: 283.62). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 27.86 (SE +/- 0.02, N = 3; MIN: 16.47 / MAX: 52.7); Clang 18.1.1: 27.93 (SE +/- 0.06, N = 3; MIN: 16.08 / MAX: 54.79). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenVINO 2024.0 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better): GCC 14.0.1 20240411: 2288.66 (SE +/- 1.62, N = 3); Clang 18.1.1: 2283.11 (SE +/- 4.45, N = 3). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
JPEG-XL libjxl 0.10.1 - Input: JPEG - Quality: 100 (MP/s, More Is Better): GCC 14.0.1 20240411: 41.69 (SE +/- 0.17, N = 3); Clang 18.1.1: 41.65 (SE +/- 0.04, N = 3). 1. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm
SVT-AV1 2.0 - Encoder Mode: Preset 4 - Input: Bosphorus 4K (Frames Per Second, More Is Better): GCC 14.0.1 20240411: 9.848 (SE +/- 0.023, N = 3); Clang 18.1.1: 9.853 (SE +/- 0.023, N = 3). 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
WavPack Audio Encoding 5.7 - WAV To WavPack (Seconds, Fewer Is Better): GCC 14.0.1 20240411: 3.702 (SE +/- 0.009, N = 5); Clang 18.1.1: 3.701 (SE +/- 0.002, N = 5). 1. (CXX) g++ options: -O3 -march=native
OpenVINO 2024.0 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better): GCC 14.0.1 20240411: 0.43 (SE +/- 0.00, N = 3; MIN: 0.19 / MAX: 22.84); Clang 18.1.1: 0.43 (SE +/- 0.00, N = 3; MIN: 0.2 / MAX: 25.55). Additional flags: -fno-strict-overflow -fwrapv. 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
Liquid-DSP 1.6 - Threads: 1 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better): GCC 14.0.1 20240411: 18258917 (SE +/- 417025.62, N = 12); Clang 18.1.1: 23426308 (SE +/- 343818.42, N = 13). 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
John The Ripper 2023.03.14 - Test: HMAC-SHA512 (Real C/S, More Is Better): GCC 14.0.1 20240411: 298853000 (SE +/- 1504066.82, N = 3); Clang 18.1.1: 214448333 (SE +/- 6642498.38, N = 15). 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt
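Because the results above mix "More Is Better" and "Fewer Is Better" metrics, a single helper that normalizes the direction can make the gaps easier to compare. The following is a minimal Python sketch, not part of the Phoronix Test Suite: the function name clang_vs_gcc_percent is hypothetical, and it assumes the first value of each result belongs to GCC 14.0.1 and the second to Clang 18.1.1, following the legend order.

```python
def clang_vs_gcc_percent(gcc, clang, more_is_better=True):
    """Return how far Clang is ahead (+) or behind (-) GCC, in percent.

    Hypothetical helper for illustration; assumes `gcc` and `clang` are the
    two averaged values of one result, in legend order.
    """
    # For "Fewer Is Better" metrics (seconds, ms, cycles/hash) a lower value
    # wins, so the ratio is inverted to keep "positive means Clang ahead".
    ratio = clang / gcc if more_is_better else gcc / clang
    return (ratio - 1.0) * 100.0


# Values copied from the Liquid-DSP and John The Ripper results above.
liquid_dsp = clang_vs_gcc_percent(18258917, 23426308)
john_hmac_sha512 = clang_vs_gcc_percent(298853000, 214448333)
# Primesieve reports seconds, so fewer is better.
primesieve = clang_vs_gcc_percent(26.35, 26.43, more_is_better=False)

print(f"Liquid-DSP: {liquid_dsp:+.1f}%")
print(f"John The Ripper HMAC-SHA512: {john_hmac_sha512:+.1f}%")
print(f"Primesieve 1e13: {primesieve:+.1f}%")
```

With these inputs Clang comes out roughly 28% ahead on Liquid-DSP and roughly 28% behind on John The Ripper, while Primesieve is well under 1% apart. Both large gaps also carry high run counts (N = 12 to 15), so they are worth reading next to their standard errors.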
SMHasher 2022-08-22 - Hash: MeowHash x86_64 AES-NI (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 41.84 (SE +/- 0.08, N = 3); Clang 18.1.1: 42.39 (SE +/- 0.59, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: t1ha0_aes_avx2 x86_64 (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 19.09 (SE +/- 0.01, N = 3); Clang 18.1.1: 18.84 (SE +/- 0.00, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: FarmHash32 x86_64 AVX (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 24.11 (SE +/- 0.06, N = 3); Clang 18.1.1: 27.00 (SE +/- 0.04, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: t1ha2_atonce (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 19.11 (SE +/- 0.01, N = 3); Clang 18.1.1: 19.35 (SE +/- 0.00, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: FarmHash128 (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 43.85 (SE +/- 0.03, N = 3); Clang 18.1.1: 31.02 (SE +/- 0.01, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: fasthash32 (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 20.80 (SE +/- 0.03, N = 3); Clang 18.1.1: 20.56 (SE +/- 0.07, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: Spooky32 (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 25.09 (SE +/- 0.13, N = 3); Clang 18.1.1: 25.15 (SE +/- 0.14, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: SHA3-256 (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 1648.91 (SE +/- 5.66, N = 3); Clang 18.1.1: 996.37 (SE +/- 3.42, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
SMHasher 2022-08-22 - Hash: wyhash (cycles/hash, Fewer Is Better): GCC 14.0.1 20240411: 13.44 (SE +/- 0.01, N = 3); Clang 18.1.1: 12.97 (SE +/- 0.00, N = 3). Additional flags: -flto=auto -fno-fat-lto-objects -flto=thin. 1. (CXX) g++ options: -O3 -march=native
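The nine SMHasher cycles/hash results above can be condensed into one figure with a geometric mean of per-test ratios, the usual way to average ratios. The sketch below is illustrative only and assumes the first value of each pair is GCC 14.0.1 and the second Clang 18.1.1 (legend order). The names SMHASHER_CYCLES and geometric_mean are hypothetical and do not come from the Phoronix Test Suite.

```python
import math

# (hash name, GCC cycles/hash, Clang cycles/hash), copied from the SMHasher
# cycles/hash results above. Pairing by legend order is an assumption.
SMHASHER_CYCLES = [
    ("MeowHash x86_64 AES-NI", 41.84, 42.39),
    ("t1ha0_aes_avx2 x86_64", 19.09, 18.84),
    ("FarmHash32 x86_64 AVX", 24.11, 27.00),
    ("t1ha2_atonce", 19.11, 19.35),
    ("FarmHash128", 43.85, 31.02),
    ("fasthash32", 20.80, 20.56),
    ("Spooky32", 25.09, 25.15),
    ("SHA3-256", 1648.91, 996.37),
    ("wyhash", 13.44, 12.97),
]


def geometric_mean(values):
    """Geometric mean via the mean of logarithms (all values must be > 0)."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# cycles/hash is "Fewer Is Better", so GCC / Clang > 1 means the Clang build
# needed fewer cycles on that hash.
ratios = [gcc / clang for _, gcc, clang in SMHASHER_CYCLES]
overall = geometric_mean(ratios)

for (name, _, _), ratio in zip(SMHASHER_CYCLES, ratios):
    print(f"{name}: {ratio:.3f}")
print(f"Geometric mean of GCC/Clang cycles-per-hash ratios: {overall:.3f}")
```

The mean is pulled up mainly by SHA3-256 and FarmHash128, where the Clang build needs far fewer cycles, while FarmHash32 goes the other way. It is therefore best read alongside the individual results rather than in place of them.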
Phoronix Test Suite v10.8.4