GCC and Clang compiler benchmarks by Michael Larabel for year end 2023 future article.
GCC 13.2 Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Clang 17.0.2 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: Clang 17.0.2, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Clang 18 23 Dec OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: Clang 18.0.0, File-System: ext4, Screen Resolution: 3840x2160
GCC 14 23 Dec OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 14.0.0 20231224, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Medium GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 110 220 330 440 550 SE +/- 0.80, N = 3 SE +/- 0.56, N = 3 SE +/- 1.39, N = 3 426.49 499.39 482.38 1. (CXX) g++ options: -O3 -march=native -flto -pthread
Preset: Medium
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./astcenc: 2: ./astc-encoder-4.0.0/build/Source/astcenc-avx2: not found
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Thorough GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 16 32 48 64 80 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 69.84 70.01 68.90 1. (CXX) g++ options: -O3 -march=native -flto -pthread
Preset: Thorough
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./astcenc: 2: ./astc-encoder-4.0.0/build/Source/astcenc-avx2: not found
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Exhaustive GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 2 4 6 8 10 SE +/- 0.0327, N = 3 SE +/- 0.0190, N = 3 SE +/- 0.0210, N = 3 7.2724 7.2323 7.1900 1. (CXX) g++ options: -O3 -march=native -flto -pthread
Preset: Exhaustive
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./astcenc: 2: ./astc-encoder-4.0.0/build/Source/astcenc-avx2: not found
C-Blosc C-Blosc (c-blosc2) simple, compressed, fast and persistent data store library for C that focuses on compression of binary data. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 256MB GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 2K 4K 6K 8K 10K SE +/- 15.55, N = 3 SE +/- 23.87, N = 3 SE +/- 34.63, N = 3 SE +/- 37.76, N = 3 8137.1 8125.5 7795.9 7258.5 -std=gnu99 -lrt -ldl -lm -march=native -march=native -std=gnu99 -ldl -lrt -lm 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 256MB GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 2K 4K 6K 8K 10K SE +/- 17.86, N = 3 SE +/- 10.30, N = 3 SE +/- 0.87, N = 3 SE +/- 3.92, N = 3 7804.7 7735.7 7496.2 7291.9 -std=gnu99 -lrt -ldl -lm -march=native -march=native -std=gnu99 -ldl -lrt -lm 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 256MB GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1600 3200 4800 6400 8000 SE +/- 30.09, N = 3 SE +/- 30.25, N = 3 SE +/- 15.49, N = 3 SE +/- 20.20, N = 3 7649.3 7628.0 7454.2 7218.1 -std=gnu99 -lrt -ldl -lm -march=native -march=native -std=gnu99 -ldl -lrt -lm 1. (CC) gcc options: -O3
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 2 4 6 8 10 SE +/- 0.029, N = 3 SE +/- 0.068, N = 3 SE +/- 0.067, N = 3 SE +/- 0.029, N = 3 4.950 6.848 6.896 4.912 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 800K 1600K 2400K 3200K 4000K SE +/- 27949.92, N = 3 SE +/- 19620.88, N = 3 SE +/- 34986.71, N = 3 SE +/- 38604.00, N = 3 3955168.75 3332695.38 3642203.84 3955659.29 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
Crypto++ Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.8 Test: Keyed Algorithms GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 200 400 600 800 1000 SE +/- 1.20, N = 3 SE +/- 0.30, N = 3 SE +/- 0.92, N = 3 848.07 873.13 894.49 -fno-devirtualize 1. (CXX) g++ options: -g2 -fPIC -pthread -pipe -O3 -march=native
Test: Keyed Algorithms
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./cryptest.exe: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./cryptest.exe)
OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.8 Test: Unkeyed Algorithms GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 120 240 360 480 600 SE +/- 0.95, N = 3 SE +/- 0.49, N = 3 SE +/- 2.74, N = 3 539.17 450.46 533.23 -fno-devirtualize 1. (CXX) g++ options: -g2 -fPIC -pthread -pipe -O3 -march=native
Test: Unkeyed Algorithms
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./cryptest.exe: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./cryptest.exe)
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC audio format ten times using the --best preset settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.4 WAV To FLAC GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 3 6 9 12 15 SE +/- 0.04, N = 5 SE +/- 0.02, N = 5 SE +/- 0.05, N = 5 SE +/- 0.06, N = 5 12.23 11.12 11.14 11.85 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GPAW 23.6 Input: Carbon Nanotube GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 9 18 27 36 45 SE +/- 0.52, N = 3 SE +/- 0.20, N = 3 SE +/- 0.07, N = 3 SE +/- 0.27, N = 3 37.89 37.62 36.40 38.23 1. (CC) gcc options: -shared -fwrapv -O2 -O3 -march=native -lxc -lblas -lmpi
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Swirl GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 600 1200 1800 2400 3000 SE +/- 3.93, N = 3 SE +/- 1.33, N = 3 SE +/- 2.89, N = 3 SE +/- 0.88, N = 3 2600 1987 1966 2446 1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Rotate GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 150 300 450 600 750 SE +/- 0.67, N = 3 SE +/- 1.73, N = 3 SE +/- 2.33, N = 3 SE +/- 1.20, N = 3 690 685 675 667 1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Sharpen GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 200 400 600 800 1000 SE +/- 1.73, N = 3 SE +/- 2.00, N = 3 SE +/- 4.10, N = 3 SE +/- 0.67, N = 3 1038 985 977 1003 1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Enhanced GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 400 800 1200 1600 2000 SE +/- 4.04, N = 3 SE +/- 1.53, N = 3 SE +/- 2.65, N = 3 SE +/- 5.84, N = 3 1638 1479 1457 1331 1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Noise-Gaussian GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 200 400 600 800 1000 SE +/- 4.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 5.04, N = 3 817 623 623 807 1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: HWB Color Space GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 200 400 600 800 1000 SE +/- 5.24, N = 3 SE +/- 3.18, N = 3 SE +/- 0.58, N = 3 SE +/- 5.13, N = 3 1069 751 747 1048 1. (CC) gcc options: -fopenmp -O3 -march=native -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 11.20 10.82 10.70 1. (CXX) g++ options: -O3 -march=native
Implementation: MPI CPU - Input: water_GMX50_bare
GCC 14 23 Dec: The test quit with a non-zero exit status. E: /mpi-build/bin/gmx_mpi: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by mpi-build/bin/../lib/libgromacs_mpi.so.8)
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 130K 260K 390K 520K 650K SE +/- 2497.30, N = 3 SE +/- 4559.35, N = 3 SE +/- 5465.43, N = 3 SE +/- 3178.61, N = 3 613307 511546 518594 616447 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 40K 80K 120K 160K 200K SE +/- 164.19, N = 3 SE +/- 1555.73, N = 3 SE +/- 1422.62, N = 3 SE +/- 258.75, N = 3 173760 180170 181091 175544 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 3M 6M 9M 12M 15M SE +/- 43498.40, N = 3 SE +/- 114054.08, N = 3 SE +/- 133559.39, N = 3 SE +/- 75020.00, N = 3 14526667 15165000 15399667 14730000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Slow GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 11 22 33 44 55 SE +/- 0.46, N = 3 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 SE +/- 0.16, N = 3 42.44 46.65 45.54 43.43 -lpthread -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 11 22 33 44 55 SE +/- 0.21, N = 3 SE +/- 0.23, N = 3 SE +/- 0.22, N = 3 SE +/- 0.11, N = 3 43.54 47.38 46.76 44.32 -lpthread -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 20 40 60 80 100 SE +/- 1.06, N = 3 SE +/- 0.37, N = 3 SE +/- 0.40, N = 3 SE +/- 0.65, N = 3 86.89 89.69 89.00 86.05 -lpthread -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 20 40 60 80 100 SE +/- 0.68, N = 3 SE +/- 0.65, N = 3 SE +/- 0.58, N = 3 SE +/- 0.29, N = 3 89.12 92.52 92.02 88.23 -lpthread -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 20 40 60 80 100 SE +/- 0.61, N = 3 SE +/- 0.35, N = 3 SE +/- 0.18, N = 3 SE +/- 0.73, N = 3 94.70 94.29 94.59 93.72 -lpthread -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1.172 2.344 3.516 4.688 5.86 SE +/- 0.051, N = 3 SE +/- 0.022, N = 3 SE +/- 0.047, N = 3 SE +/- 0.059, N = 3 4.747 5.209 5.074 4.819 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr 1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm
LAMMPS Molecular Dynamics Simulator LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: 20k Atoms GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 10 20 30 40 50 SE +/- 0.20, N = 3 SE +/- 0.22, N = 3 SE +/- 0.23, N = 3 42.45 42.37 41.66 1. (CXX) g++ options: -O3 -march=native -lm -ldl
Model: 20k Atoms
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ../b/lmp: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ../b/lmp)
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.30 Backend: Eigen GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 200 400 600 800 1000 SE +/- 6.34, N = 9 SE +/- 11.58, N = 9 SE +/- 7.84, N = 3 671 816 857 1. (CXX) g++ options: -flto -O3 -march=native -pthread
Backend: Eigen
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./lc0: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./lc0)
libavif avifenc This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 11 22 33 44 55 SE +/- 0.28, N = 3 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.42, N = 3 49.18 47.79 47.37 48.05 1. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 6 12 18 24 30 SE +/- 0.23, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.23, N = 3 26.60 25.65 25.62 26.17 1. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.4835 0.967 1.4505 1.934 2.4175 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 SE +/- 0.006, N = 3 SE +/- 0.005, N = 3 2.116 2.087 2.149 2.142 1. (CXX) g++ options: -O3 -fPIC -march=native -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.9396 1.8792 2.8188 3.7584 4.698 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.027, N = 3 SE +/- 0.014, N = 3 4.176 3.917 4.053 4.106 1. (CXX) g++ options: -O3 -fPIC -march=native -lm
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 32 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 12M 24M 36M 48M 60M SE +/- 13747.73, N = 3 SE +/- 705129.82, N = 4 SE +/- 61098.28, N = 3 SE +/- 134747.91, N = 3 48601000 56815750 56732000 48068000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 5M 10M 15M 20M 25M SE +/- 127219.34, N = 15 SE +/- 88706.13, N = 3 SE +/- 100374.30, N = 3 SE +/- 235451.13, N = 15 17781400 21974667 22239000 17950067 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 128 - Buffer Length: 256 - Filter Length: 32 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1100M 2200M 3300M 4400M 5500M SE +/- 12640191.63, N = 3 SE +/- 38722904.51, N = 3 SE +/- 37430513.40, N = 3 SE +/- 43594839.14, N = 3 4228033333 5040300000 4994600000 4262500000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 192 - Buffer Length: 256 - Filter Length: 32 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1400M 2800M 4200M 5600M 7000M SE +/- 32255852.18, N = 3 SE +/- 23899256.15, N = 3 SE +/- 42403157.64, N = 3 SE +/- 36511383.31, N = 3 5518300000 6543933333 6541433333 5587766667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 128 - Buffer Length: 256 - Filter Length: 512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 300M 600M 900M 1200M 1500M SE +/- 3447865.68, N = 3 SE +/- 5446813.54, N = 3 SE +/- 3295114.23, N = 3 SE +/- 1882374.39, N = 3 1293233333 1447966667 1469833333 1301200000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 192 - Buffer Length: 256 - Filter Length: 512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 300M 600M 900M 1200M 1500M SE +/- 5921430.01, N = 3 SE +/- 4870432.33, N = 3 SE +/- 4115148.17, N = 3 SE +/- 7348998.42, N = 3 1496100000 1495833333 1528766667 1491466667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 4K 8K 12K 16K 20K SE +/- 22.99, N = 5 SE +/- 45.07, N = 3 SE +/- 42.11, N = 3 SE +/- 98.79, N = 3 17018.5 17310.2 17044.4 17246.0 1. (CC) gcc options: -O3
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 44027.28, N = 3 SE +/- 18523.18, N = 3 SE +/- 27110.93, N = 3 SE +/- 50152.31, N = 3 5921751.45 6092998.10 6108890.58 5956401.51 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1.7M 3.4M 5.1M 6.8M 8.5M SE +/- 9027.10, N = 3 SE +/- 54728.52, N = 3 SE +/- 7633.51, N = 3 SE +/- 44302.45, N = 3 7695223.15 7774340.30 7679397.09 7735938.66 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1200 2400 3600 4800 6000 SE +/- 51.09, N = 3 SE +/- 62.02, N = 3 SE +/- 24.10, N = 3 SE +/- 52.09, N = 5 4581.39 5376.45 5572.33 4789.29 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 50 100 150 200 250 SE +/- 2.04, N = 3 SE +/- 2.48, N = 3 SE +/- 0.96, N = 3 SE +/- 2.08, N = 5 183.26 215.06 222.89 191.57 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenJPEG OpenJPEG is an open-source JPEG 2000 codec written in the C programming language. The default input for this test profile is the NASA/JPL-Caltech/MSSS Curiosity panorama 717MB TIFF image file converting to JPEG2000 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenJPEG 2.4 Encode: NASA Curiosity Panorama M34 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 20K 40K 60K 80K 100K SE +/- 357.19, N = 3 SE +/- 34.27, N = 3 SE +/- 627.99, N = 3 SE +/- 1022.49, N = 4 82008 75630 74343 82985 1. (CXX) g++ options: -O3 -march=native -rdynamic
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 30000M 60000M 90000M 120000M 150000M SE +/- 321788378.42, N = 3 SE +/- 323275121.51, N = 3 SE +/- 366849309.72, N = 3 SE +/- 271320016.95, N = 3 128629387223 124526197107 129800447513 125384626833 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 9000M 18000M 27000M 36000M 45000M SE +/- 55003825.70, N = 3 SE +/- 29437974.38, N = 3 SE +/- 11847693.41, N = 3 SE +/- 7258386.62, N = 3 43028797080 43131468240 43912408803 42698789480 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 11K 22K 33K 44K 55K SE +/- 84.27, N = 3 SE +/- 91.28, N = 3 SE +/- 80.27, N = 3 SE +/- 103.69, N = 3 50251.7 50097.7 50206.9 50065.0 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 300K 600K 900K 1200K 1500K SE +/- 1093.03, N = 3 SE +/- 3224.17, N = 3 SE +/- 595.23, N = 3 SE +/- 4067.66, N = 3 1541833.5 1538339.5 1552460.9 1532147.3 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 110000M 220000M 330000M 440000M 550000M SE +/- 264097935.23, N = 3 SE +/- 201482071.29, N = 3 SE +/- 195079194.35, N = 3 SE +/- 289414516.97, N = 3 512442016473 335909931727 335886221587 513536630053 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 200000M 400000M 600000M 800000M 1000000M SE +/- 926361343.56, N = 3 SE +/- 1012640703.78, N = 3 SE +/- 712012274.52, N = 3 SE +/- 227590943.13, N = 3 946233938740 944090667870 944539664203 948573686080 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 200000M 400000M 600000M 800000M 1000000M SE +/- 617348183.36, N = 3 SE +/- 367647705.83, N = 3 SE +/- 745364268.21, N = 3 SE +/- 458494386.59, N = 3 819163695240 816930832563 811977109867 821035343690 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 80000M 160000M 240000M 320000M 400000M SE +/- 138801171.04, N = 3 SE +/- 74912722.41, N = 3 SE +/- 143870310.16, N = 3 SE +/- 38965461.92, N = 3 361301039787 229128259903 228903633260 364041629243 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 70 140 210 280 350 SE +/- 1.01, N = 3 SE +/- 0.30, N = 3 SE +/- 0.04, N = 3 339.37 336.16 334.69 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 30 60 90 120 150 SE +/- 0.40, N = 3 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 141.26 142.62 143.24 -fno-strict-overflow -fwrapv - MIN: 43.59 / MAX: 209.89 MIN: 60.05 / MAX: 210.86 MIN: 53.57 / MAX: 210.54 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 98.10 97.62 97.72 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 110 220 330 440 550 SE +/- 0.75, N = 3 SE +/- 0.30, N = 3 SE +/- 0.22, N = 3 487.79 490.20 489.66 -fno-strict-overflow -fwrapv - MIN: 394.16 / MAX: 515.77 MIN: 249.93 / MAX: 516.28 MIN: 381.31 / MAX: 514.98 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 1300 2600 3900 5200 6500 SE +/- 20.52, N = 3 SE +/- 28.17, N = 3 SE +/- 23.74, N = 3 5964.98 5952.83 5951.95 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 8.03 8.05 8.05 -fno-strict-overflow -fwrapv - MIN: 4.11 / MAX: 25.57 MIN: 4.4 / MAX: 25.17 MIN: 4.56 / MAX: 25.34 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 4K 8K 12K 16K 20K SE +/- 35.94, N = 3 SE +/- 14.36, N = 3 SE +/- 15.90, N = 3 18002.34 17698.62 17696.86 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 1.2173 2.4346 3.6519 4.8692 6.0865 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.32 5.41 5.41 -fno-strict-overflow -fwrapv - MIN: 3.18 / MAX: 23.99 MIN: 2.92 / MAX: 24.06 MIN: 3.22 / MAX: 23.18 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 400 800 1200 1600 2000 SE +/- 4.24, N = 3 SE +/- 9.44, N = 3 SE +/- 11.87, N = 3 1883.57 1856.67 1857.83 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 25.45 25.82 25.80 -fno-strict-overflow -fwrapv - MIN: 13.56 / MAX: 47.82 MIN: 12.71 / MAX: 45.92 MIN: 13.56 / MAX: 46.4 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 110 220 330 440 550 SE +/- 0.42, N = 3 SE +/- 0.40, N = 3 SE +/- 1.83, N = 3 514.54 515.15 511.63 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.33, N = 3 93.18 93.05 93.71 -fno-strict-overflow -fwrapv - MIN: 45.56 / MAX: 146.45 MIN: 43.63 / MAX: 147.75 MIN: 39.51 / MAX: 151.45 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 2K 4K 6K 8K 10K SE +/- 32.34, N = 3 SE +/- 35.14, N = 3 SE +/- 27.69, N = 3 9919.18 9912.53 9909.42 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 9.70 9.67 9.67 -fno-strict-overflow -fwrapv - MIN: 4.82 / MAX: 28.19 MIN: 4.96 / MAX: 29.01 MIN: 5.1 / MAX: 28.3 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 1200 2400 3600 4800 6000 SE +/- 61.07, N = 4 SE +/- 37.89, N = 3 SE +/- 37.06, N = 3 5480.25 5230.49 5271.52 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 3 6 9 12 15 SE +/- 0.10, N = 4 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 8.75 9.16 9.09 -fno-strict-overflow -fwrapv - MIN: 5.37 / MAX: 26.49 MIN: 5.67 / MAX: 27.63 MIN: 5.51 / MAX: 25.76 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 500 1000 1500 2000 2500 SE +/- 24.51, N = 5 SE +/- 2.50, N = 3 SE +/- 13.82, N = 3 2372.40 2436.02 2463.91 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 9 18 27 36 45 SE +/- 0.43, N = 5 SE +/- 0.04, N = 3 SE +/- 0.22, N = 3 40.45 39.38 38.94 -fno-strict-overflow -fwrapv - MIN: 20.38 / MAX: 57.81 MIN: 24.17 / MAX: 80.88 MIN: 26.83 / MAX: 59.8 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 20K 40K 60K 80K 100K SE +/- 733.05, N = 3 SE +/- 567.66, N = 3 SE +/- 50.71, N = 3 113554.50 112991.73 113505.56 -fno-strict-overflow -fwrapv 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 0.1418 0.2836 0.4254 0.5672 0.709 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 0.62 0.63 0.63 -fno-strict-overflow -fwrapv - MIN: 0.2 / MAX: 20.88 MIN: 0.18 / MAX: 18.68 MIN: 0.21 / MAX: 18.55 1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.4 WAV To Opus Encode GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 5 10 15 20 25 SE +/- 0.13, N = 5 SE +/- 0.10, N = 5 SE +/- 0.07, N = 5 SE +/- 0.01, N = 5 20.03 19.13 19.29 20.15 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
PETSc PETSc, the Portable, Extensible Toolkit for Scientific Computation, is for the scalable (parallel) solution of scientific applications modeled by partial differential equations. This test profile runs the PETSc "make streams" benchmark and records the throughput rate when all available cores are utilized for the MPI Streams build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better PETSc 3.19 Test: Streams GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 40K 80K 120K 160K 200K SE +/- 272.88, N = 3 SE +/- 138.55, N = 3 SE +/- 20.67, N = 3 183886.95 183268.53 183176.56 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -fPIC -O3 -O2 -lpthread -lpciaccess -lm
Test: Streams
GCC 14 23 Dec: The test run did not produce a result. E: /usr/bin/ld: petsc-3.19.0/arch-linux-c-opt/lib/libpetsc.so: undefined reference to `__cxa_call_terminate'
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 800K 1600K 2400K 3200K 4000K SE +/- 14867.69, N = 3 SE +/- 34138.17, N = 3 SE +/- 23217.69, N = 3 SE +/- 7434.99, N = 3 3780387 3570369 3511928 3593163 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.0641 0.1282 0.1923 0.2564 0.3205 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.000, N = 3 0.265 0.280 0.285 0.278 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm
POV-Ray This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 2 4 6 8 10 SE +/- 0.037, N = 3 SE +/- 0.033, N = 3 SE +/- 0.061, N = 3 6.050 5.868 5.848 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Trace Time
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./povray: 3: ./unix/povray: not found
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Multi-Threaded GCC 13.2 Clang 17.0.2 GCC 14 23 Dec 70K 140K 210K 280K 350K SE +/- 819.02, N = 3 SE +/- 1401.25, N = 3 SE +/- 1487.52, N = 3 308069.5 329638.6 331562.3 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
Configuration: Multi-Threaded
Clang 18 23 Dec: The test quit with a non-zero exit status. E: ./quantlib: line 3: ./test-suite/quantlib-benchmark: No such file or directory
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Single-Threaded GCC 13.2 Clang 17.0.2 GCC 14 23 Dec 1000 2000 3000 4000 5000 SE +/- 33.97, N = 3 SE +/- 40.89, N = 3 SE +/- 30.04, N = 3 4151.0 4578.8 4504.6 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
Configuration: Single-Threaded
Clang 18 23 Dec: The test quit with a non-zero exit status. E: ./quantlib: line 3: ./test-suite/quantlib-benchmark: No such file or directory
Redis Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: GET - Parallel Connections: 500 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1000K 2000K 3000K 4000K 5000K SE +/- 47771.21, N = 4 SE +/- 37859.85, N = 15 SE +/- 27204.42, N = 3 SE +/- 33944.52, N = 15 4484373.80 4242865.92 4649256.80 4589226.30 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 500 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 700K 1400K 2100K 2800K 3500K SE +/- 53058.07, N = 15 SE +/- 37397.13, N = 15 SE +/- 21571.11, N = 3 SE +/- 22198.55, N = 3 3079293.25 2829400.83 3090972.67 3175906.70 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
SecureMark SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 90K 180K 270K 360K 450K SE +/- 25.96, N = 3 SE +/- 3387.50, N = 3 SE +/- 3191.95, N = 3 SE +/- 996.80, N = 3 392447 442902 420687 406282 1. (CC) gcc options: -pedantic -O3
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 5.45 5.25 4.87 1. (CXX) g++ options: -O3 -march=native
Throughput Test: Kostya
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./bench_ondemand: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./bench_ondemand)
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: TopTweet GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 6 9.72 9.59 9.51 1. (CXX) g++ options: -O3 -march=native
Throughput Test: TopTweet
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./bench_ondemand: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./bench_ondemand)
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 0.3825 0.765 1.1475 1.53 1.9125 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.70 1.59 1.46 1. (CXX) g++ options: -O3 -march=native
Throughput Test: LargeRandom
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./bench_ondemand: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./bench_ondemand)
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: PartialTweets GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.12, N = 15 SE +/- 0.11, N = 3 9.01 9.64 9.51 1. (CXX) g++ options: -O3 -march=native
Throughput Test: PartialTweets
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./bench_ondemand: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./bench_ondemand)
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID GCC 13.2 Clang 17.0.2 Clang 18 23 Dec 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.09, N = 7 SE +/- 0.08, N = 15 8.67 9.73 9.72 1. (CXX) g++ options: -O3 -march=native
Throughput Test: DistinctUserID
GCC 14 23 Dec: The test quit with a non-zero exit status. E: ./bench_ondemand: /lib/x86_64-linux-gnu/libstdc++.so.6: version `CXXABI_1.3.15' not found (required by ./bench_ondemand)
SMHasher SMHasher is a hash function tester supporting various algorithms and able to make use of AVX and other modern CPU instruction set extensions. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org MiB/sec, More Is Better SMHasher 2022-08-22 Hash: SHA3-256 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 100 200 300 400 500 SE +/- 3.50, N = 4 SE +/- 4.14, N = 7 SE +/- 3.98, N = 3 SE +/- 2.29, N = 3 290.65 459.02 463.94 287.86 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
cycles/hash
OpenBenchmarking.org cycles/hash, Fewer Is Better SMHasher 2022-08-22 Hash: SHA3-256 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 300 600 900 1200 1500 SE +/- 14.31, N = 4 SE +/- 7.46, N = 7 SE +/- 8.35, N = 3 SE +/- 8.88, N = 3 1353.92 812.79 817.90 1356.53 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Result
OpenBenchmarking.org MiB/sec, More Is Better SMHasher 2022-08-22 Hash: FarmHash128 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 6K 12K 18K 24K 30K SE +/- 187.58, N = 15 SE +/- 227.32, N = 3 SE +/- 100.64, N = 3 SE +/- 311.86, N = 3 29054.81 26974.45 26494.22 28278.49 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
cycles/hash
OpenBenchmarking.org cycles/hash, Fewer Is Better SMHasher 2022-08-22 Hash: FarmHash128 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 8 16 24 32 40 SE +/- 0.17, N = 15 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 35.54 25.13 25.25 35.68 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Result
OpenBenchmarking.org MiB/sec, More Is Better SMHasher 2022-08-22 Hash: FarmHash32 x86_64 AVX GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 11K 22K 33K 44K 55K SE +/- 264.89, N = 3 SE +/- 439.18, N = 3 SE +/- 372.44, N = 3 SE +/- 531.14, N = 3 51654.97 40846.89 41578.48 50400.25 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
cycles/hash
OpenBenchmarking.org cycles/hash, Fewer Is Better SMHasher 2022-08-22 Hash: FarmHash32 x86_64 AVX GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.21, N = 3 SE +/- 0.15, N = 3 20.19 22.56 22.07 19.86 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Result
OpenBenchmarking.org MiB/sec, More Is Better SMHasher 2022-08-22 Hash: t1ha0_aes_avx2 x86_64 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 30K 60K 90K 120K 150K SE +/- 759.57, N = 3 SE +/- 1814.82, N = 3 SE +/- 954.34, N = 3 SE +/- 1656.41, N = 3 131074.98 132159.77 130792.83 132218.26 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
cycles/hash
OpenBenchmarking.org cycles/hash, Fewer Is Better SMHasher 2022-08-22 Hash: t1ha0_aes_avx2 x86_64 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 15.64 15.25 15.57 15.43 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Result
OpenBenchmarking.org MiB/sec, More Is Better SMHasher 2022-08-22 Hash: MeowHash x86_64 AES-NI GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 15K 30K 45K 60K 75K SE +/- 715.67, N = 3 SE +/- 966.63, N = 3 SE +/- 980.00, N = 3 SE +/- 626.56, N = 6 68597.56 67078.43 67945.38 68248.31 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
cycles/hash
OpenBenchmarking.org cycles/hash, Fewer Is Better SMHasher 2022-08-22 Hash: MeowHash x86_64 AES-NI GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 8 16 24 32 40 SE +/- 0.24, N = 3 SE +/- 0.40, N = 3 SE +/- 0.45, N = 3 SE +/- 0.24, N = 6 34.26 34.11 34.76 34.18 -flto=auto -fno-fat-lto-objects -flto=thin -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Stargate Digital Audio Workstation Stargate is an open-source, cross-platform digital audio workstation (DAW) software package with "a unique and carefully curated experience" with scalability from old systems up through modern multi-core systems. Stargate is GPLv3 licensed and makes use of Qt5 (PyQt5) for its user-interface. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 44100 - Buffer Size: 512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 2 4 6 8 10 SE +/- 0.067600, N = 3 SE +/- 0.051637, N = 3 SE +/- 0.038112, N = 3 SE +/- 0.023956, N = 3 7.403037 8.527328 8.415251 7.530593 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 96000 - Buffer Size: 512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1.3326 2.6652 3.9978 5.3304 6.663 SE +/- 0.011976, N = 3 SE +/- 0.004937, N = 3 SE +/- 0.003453, N = 3 SE +/- 0.012082, N = 3 5.131378 5.922697 5.827354 5.388126 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 192000 - Buffer Size: 512 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.8413 1.6826 2.5239 3.3652 4.2065 SE +/- 0.007583, N = 3 SE +/- 0.004927, N = 3 SE +/- 0.002560, N = 3 SE +/- 0.001538, N = 3 3.258674 3.738894 3.703294 3.479722 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 44100 - Buffer Size: 1024 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 3 6 9 12 15 SE +/- 0.011137, N = 3 SE +/- 0.009153, N = 3 SE +/- 0.018602, N = 3 SE +/- 0.013114, N = 3 7.987047 9.124527 9.030747 8.111928 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 96000 - Buffer Size: 1024 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 2 4 6 8 10 SE +/- 0.009825, N = 3 SE +/- 0.004464, N = 3 SE +/- 0.009480, N = 3 SE +/- 0.007988, N = 3 5.601831 6.474312 6.371475 5.951828 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 192000 - Buffer Size: 1024 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.9461 1.8922 2.8383 3.7844 4.7305 SE +/- 0.008117, N = 3 SE +/- 0.018365, N = 3 SE +/- 0.020650, N = 3 SE +/- 0.006090, N = 3 3.657915 4.204693 4.122784 3.955202 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 4 - Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 3 6 9 12 15 SE +/- 0.076, N = 3 SE +/- 0.063, N = 3 SE +/- 0.065, N = 3 SE +/- 0.096, N = 5 9.589 9.872 9.667 9.717 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 8 - Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 20 40 60 80 100 SE +/- 0.65, N = 15 SE +/- 0.81, N = 3 SE +/- 1.06, N = 4 SE +/- 1.18, N = 3 94.71 99.21 94.44 96.13 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 12 - Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 50 100 150 200 250 SE +/- 0.51, N = 3 SE +/- 1.47, N = 13 SE +/- 1.52, N = 3 SE +/- 1.12, N = 3 212.00 208.92 205.93 205.10 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 13 - Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 50 100 150 200 250 SE +/- 0.96, N = 3 SE +/- 0.72, N = 3 SE +/- 2.03, N = 3 SE +/- 1.82, N = 15 208.82 208.71 204.33 205.98 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 50 100 150 200 250 SE +/- 8.63, N = 12 SE +/- 0.19, N = 3 SE +/- 1.77, N = 3 SE +/- 0.67, N = 3 224.12 232.28 235.76 218.02 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 40 80 120 160 200 SE +/- 2.26, N = 15 SE +/- 1.99, N = 3 SE +/- 1.78, N = 15 SE +/- 2.39, N = 4 199.54 190.79 194.92 199.29 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 30 60 90 120 150 SE +/- 1.70, N = 15 SE +/- 0.58, N = 3 SE +/- 0.68, N = 3 SE +/- 0.79, N = 3 113.11 109.12 109.58 110.56 -mabm -mabm 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm -lreadline
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 600K 1200K 1800K 2400K 3000K SE +/- 21601.14, N = 5 SE +/- 4909.07, N = 5 SE +/- 9910.25, N = 5 SE +/- 3431.38, N = 5 2216345 2458869 2708002 2246260 1. (CC) gcc options: -O3 -march=native
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Slow GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.20, N = 3 SE +/- 0.17, N = 3 SE +/- 0.18, N = 3 30.82 32.01 31.58 31.60 1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 8 16 24 32 40 SE +/- 0.18, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 34.05 35.23 34.66 34.91 1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 16 32 48 64 80 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 SE +/- 0.19, N = 3 SE +/- 0.27, N = 3 67.31 70.81 71.76 67.98 1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 16 32 48 64 80 SE +/- 0.14, N = 3 SE +/- 0.25, N = 3 SE +/- 0.29, N = 3 SE +/- 0.45, N = 3 69.54 71.84 72.62 69.71 1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 16 32 48 64 80 SE +/- 0.51, N = 3 SE +/- 0.27, N = 3 SE +/- 0.24, N = 3 SE +/- 0.15, N = 3 71.37 73.25 73.72 71.62 1. (CXX) g++ options: -O3 -march=native
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast GCC 13.2 Clang 17.0.2 GCC 14 23 Dec 3 6 9 12 15 SE +/- 0.090, N = 3 SE +/- 0.023, N = 3 SE +/- 0.019, N = 3 9.038 8.747 9.102 -flto=auto -fno-fat-lto-objects -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Video Input: Bosphorus 4K - Video Preset: Fast
Clang 18 23 Dec: The test quit with a non-zero exit status. E: ./vvenc: 2: ./vvenc-1.9.0/bin/release-static/vvencapp: not found
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster GCC 13.2 Clang 17.0.2 GCC 14 23 Dec 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 16.49 15.58 16.35 -flto=auto -fno-fat-lto-objects -flto=thin -flto=auto -fno-fat-lto-objects 1. (CXX) g++ options: -O3 -march=native
Video Input: Bosphorus 4K - Video Preset: Faster
Clang 18 23 Dec: The test quit with a non-zero exit status. E: ./vvenc: 2: ./vvenc-1.9.0/bin/release-static/vvencapp: not found
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Default GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 6 12 18 24 30 SE +/- 0.22, N = 3 SE +/- 0.06, N = 3 SE +/- 0.14, N = 3 SE +/- 0.09, N = 3 25.65 27.08 26.81 26.07 -lpng16 -ljpeg -ltiff -lpng16 -ljpeg -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100 GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 4 8 12 16 20 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 15.24 15.73 15.37 15.65 -lpng16 -ljpeg -ltiff -lpng16 -ljpeg -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.459 0.918 1.377 1.836 2.295 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 2.04 1.98 1.96 2.01 -lpng16 -ljpeg -ltiff -lpng16 -ljpeg -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 1.26 2.52 3.78 5.04 6.3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 4.51 5.60 5.39 4.48 -lpng16 -ljpeg -ltiff -lpng16 -ljpeg -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless, Highest Compression GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 0.1778 0.3556 0.5334 0.7112 0.889 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.79 0.79 0.79 0.78 -lpng16 -ljpeg -ltiff -lpng16 -ljpeg -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 9 18 27 36 45 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 39.38 40.43 41.05 38.70 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
Xmrig Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: KawPow - Hash Count: 1M GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 12K 24K 36K 48K 60K SE +/- 417.45, N = 3 SE +/- 640.77, N = 3 SE +/- 529.15, N = 3 SE +/- 668.49, N = 3 56883.5 57018.3 56651.0 57660.1 -static-libgcc -static-libstdc++ -funroll-loops -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: Monero - Hash Count: 1M GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 12K 24K 36K 48K 60K SE +/- 228.27, N = 3 SE +/- 154.69, N = 3 SE +/- 215.49, N = 3 SE +/- 234.92, N = 3 56279.6 55888.6 55839.5 55867.9 -static-libgcc -static-libstdc++ -funroll-loops -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: Wownero - Hash Count: 1M GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 15K 30K 45K 60K 75K SE +/- 161.43, N = 3 SE +/- 205.42, N = 3 SE +/- 267.90, N = 3 SE +/- 161.46, N = 3 71242.7 71216.1 71240.6 71393.6 -static-libgcc -static-libstdc++ -funroll-loops -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: GhostRider - Hash Count: 1M GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 3K 6K 9K 12K 15K SE +/- 43.77, N = 3 SE +/- 36.45, N = 3 SE +/- 17.51, N = 3 SE +/- 11.65, N = 3 13592.0 13628.9 13438.4 13625.2 -static-libgcc -static-libstdc++ -funroll-loops -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: CryptoNight-Heavy - Hash Count: 1M GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 12K 24K 36K 48K 60K SE +/- 147.72, N = 3 SE +/- 146.78, N = 3 SE +/- 91.23, N = 3 SE +/- 185.10, N = 3 56361.0 56105.9 55924.5 56126.3 -static-libgcc -static-libstdc++ -funroll-loops -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: CryptoNight-Femto UPX2 - Hash Count: 1M GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 12K 24K 36K 48K 60K SE +/- 150.17, N = 3 SE +/- 155.80, N = 3 SE +/- 216.50, N = 3 SE +/- 136.66, N = 3 56592.8 56300.8 56023.0 56155.2 -static-libgcc -static-libstdc++ -funroll-loops -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 90 180 270 360 450 SE +/- 4.45, N = 3 SE +/- 2.30, N = 3 SE +/- 4.56, N = 3 SE +/- 3.96, N = 3 416.1 414.4 405.5 411.4 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 500 1000 1500 2000 2500 SE +/- 26.26, N = 3 SE +/- 29.73, N = 3 SE +/- 30.79, N = 3 SE +/- 10.68, N = 3 2212.9 2151.0 2173.3 2136.2 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 24.9 25.0 24.7 24.7 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 400 800 1200 1600 2000 SE +/- 2.12, N = 3 SE +/- 14.06, N = 3 SE +/- 5.93, N = 3 SE +/- 18.21, N = 3 1895.8 1856.6 1866.6 1837.3 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 11.7 11.8 11.8 11.8 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed GCC 13.2 Clang 17.0.2 Clang 18 23 Dec GCC 14 23 Dec 400 800 1200 1600 2000 SE +/- 19.09, N = 3 SE +/- 23.70, N = 3 SE +/- 21.26, N = 3 SE +/- 21.40, N = 3 1795.5 1774.3 1776.6 1773.2 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
GCC 13.2 Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 December 2023 20:26 by user pts.
Clang 17.0.2 Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: Clang 17.0.2, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 December 2023 17:43 by user pts.
Clang 18 23 Dec Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: Clang 18.0.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 23 December 2023 11:11 by user pts.
GCC 14 23 Dec Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 14.0.0 20231224, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105OpenCL Notes: GPU Compute Cores: 6144Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 24 December 2023 02:35 by user pts.