Genoa compiler benchmarks by Michael Larabel for a future article.
GCC 13.1.1 Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1004D BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Monitor: VGA HDMI, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Fedora Linux 38, Kernel: 6.2.15-300.fc38.x86_64 (x86_64), Compiler: GCC 13.1.1 20230511, File-System: xfs, Screen Resolution: 640x480
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driverDisk Notes: none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Clang 16.0.3 OS: Fedora Linux 38, Kernel: 6.2.15-300.fc38.x86_64 (x86_64), Compiler: Clang 16.0.3 + LLVM 16.0.3, File-System: xfs, Screen Resolution: 640x480
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Disk Notes: none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 Clang 16.0.3 GCC 13.1.1 60 120 180 240 300 SE +/- 7.42, N = 15 SE +/- 0.29, N = 3 171.42 276.09 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 Clang 16.0.3 GCC 13.1.1 70 140 210 280 350 SE +/- 3.21, N = 3 SE +/- 1.73, N = 3 322.90 330.05 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 GCC 13.1.1 Clang 16.0.3 60000M 120000M 180000M 240000M 300000M SE +/- 48171917.42, N = 3 SE +/- 64283484.40, N = 3 260892560200 264445169827 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 GCC 13.1.1 Clang 16.0.3 20000M 40000M 60000M 80000M 100000M SE +/- 845012614.25, N = 5 SE +/- 4419075.00, N = 3 80035854752 82701625690 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 Clang 16.0.3 GCC 13.1.1 200000M 400000M 600000M 800000M 1000000M SE +/- 51499603.07, N = 3 SE +/- 152062084.95, N = 3 645125858407 1014036223867 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM Clang 16.0.3 GCC 13.1.1 400000M 800000M 1200000M 1600000M 2000000M SE +/- 616604050.97, N = 3 SE +/- 1317927821.70, N = 3 1796351136523 1801352965597 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM Clang 16.0.3 GCC 13.1.1 300000M 600000M 900000M 1200000M 1500000M SE +/- 2176141824.81, N = 3 SE +/- 293680009.95, N = 3 1538350610540 1544053984720 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 Clang 16.0.3 GCC 13.1.1 150000M 300000M 450000M 600000M 750000M SE +/- 48134544.35, N = 3 SE +/- 116220547.85, N = 3 440446591887 708873524927 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Slow GCC 13.1.1 Clang 16.0.3 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 34.77 37.37 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium GCC 13.1.1 Clang 16.0.3 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.17, N = 3 35.94 38.37 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 13.1.1 Clang 16.0.3 16 32 48 64 80 SE +/- 0.74, N = 4 SE +/- 0.57, N = 3 68.57 69.85 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast Clang 16.0.3 GCC 13.1.1 16 32 48 64 80 SE +/- 0.16, N = 3 SE +/- 0.34, N = 3 68.54 70.10 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Clang 16.0.3 GCC 13.1.1 16 32 48 64 80 SE +/- 0.62, N = 3 SE +/- 0.43, N = 3 70.16 70.51 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.5 Encoder Mode: Preset 4 - Input: Bosphorus 4K GCC 13.1.1 Clang 16.0.3 1.3041 2.6082 3.9123 5.2164 6.5205 SE +/- 0.067, N = 3 SE +/- 0.025, N = 3 5.527 5.796 1. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.5 Encoder Mode: Preset 8 - Input: Bosphorus 4K GCC 13.1.1 Clang 16.0.3 13 26 39 52 65 SE +/- 0.42, N = 3 SE +/- 0.58, N = 5 54.95 56.14 1. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.5 Encoder Mode: Preset 12 - Input: Bosphorus 4K Clang 16.0.3 GCC 13.1.1 40 80 120 160 200 SE +/- 1.96, N = 12 SE +/- 1.61, N = 3 170.79 176.03 1. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.5 Encoder Mode: Preset 13 - Input: Bosphorus 4K GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 1.27, N = 7 SE +/- 1.04, N = 12 143.43 146.22 1. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium GCC 13.1.1 Clang 16.0.3 7 14 21 28 35 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 27.43 29.65 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Medium GCC 13.1.1 Clang 16.0.3 10 20 30 40 50 SE +/- 0.09, N = 3 SE +/- 0.29, N = 3 39.08 42.49 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 13.1.1 Clang 16.0.3 11 22 33 44 55 SE +/- 0.50, N = 3 SE +/- 0.42, N = 3 46.63 47.40 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast Clang 16.0.3 GCC 13.1.1 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 46.84 46.97 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast GCC 13.1.1 Clang 16.0.3 11 22 33 44 55 SE +/- 0.37, N = 3 SE +/- 0.56, N = 3 46.72 47.56 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Very Fast GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.64, N = 3 112.43 133.05 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Super Fast GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.55, N = 3 136.19 146.20 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 1.08, N = 11 SE +/- 2.25, N = 3 147.37 156.50 1. (CXX) g++ options: -O3 -march=native -flto
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 13.1.1 Clang 16.0.3 6 12 18 24 30 SE +/- 0.22, N = 6 SE +/- 0.34, N = 3 21.59 25.38 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya GCC 13.1.1 Clang 16.0.3 0.9968 1.9936 2.9904 3.9872 4.984 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.15 4.43 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: TopTweet GCC 13.1.1 Clang 16.0.3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.06, N = 15 6.88 7.33 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom Clang 16.0.3 GCC 13.1.1 0.2835 0.567 0.8505 1.134 1.4175 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.22 1.26 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: PartialTweets GCC 13.1.1 Clang 16.0.3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.12, N = 12 6.56 8.22 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID GCC 13.1.1 Clang 16.0.3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.07, N = 15 6.78 8.52 1. (CXX) g++ options: -O3 -march=native -flto
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 Clang 16.0.3 GCC 13.1.1 1500 3000 4500 6000 7500 SE +/- 185.48, N = 15 SE +/- 7.29, N = 3 4285.59 6902.23 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 Clang 16.0.3 GCC 13.1.1 2K 4K 6K 8K 10K SE +/- 80.21, N = 3 SE +/- 43.34, N = 3 8072.49 8251.34 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Swirl Clang 16.0.3 GCC 13.1.1 600 1200 1800 2400 3000 SE +/- 3.84, N = 3 SE +/- 15.88, N = 3 2207 2958 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Rotate Clang 16.0.3 GCC 13.1.1 140 280 420 560 700 SE +/- 0.33, N = 3 SE +/- 3.28, N = 3 659 660 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Sharpen GCC 13.1.1 Clang 16.0.3 400 800 1200 1600 2000 SE +/- 1.00, N = 3 SE +/- 12.49, N = 3 1311 1723 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Enhanced Clang 16.0.3 GCC 13.1.1 500 1000 1500 2000 2500 SE +/- 2.33, N = 3 SE +/- 4.26, N = 3 1901 2216 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Resizing GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 6.54, N = 12 SE +/- 2.70, N = 15 120 128 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: Noise-Gaussian Clang 16.0.3 GCC 13.1.1 200 400 600 800 1000 SE +/- 11.26, N = 3 SE +/- 9.66, N = 6 835 981 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.38 Operation: HWB Color Space Clang 16.0.3 GCC 13.1.1 200 400 600 800 1000 SE +/- 5.51, N = 3 SE +/- 15.72, N = 3 875 1089 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -ltiff -ljpeg -lX11 -lz -lm -lpthread
SecureMark SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS GCC 13.1.1 Clang 16.0.3 70K 140K 210K 280K 350K SE +/- 872.41, N = 3 SE +/- 1404.09, N = 3 288753 312570 1. (CC) gcc options: -pedantic -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed Clang 16.0.3 GCC 13.1.1 4K 8K 12K 16K 20K SE +/- 77.84, N = 3 SE +/- 51.33, N = 3 17350.5 17800.8 1. (CC) gcc options: -O3 -march=native -flto
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 13.1.1 Clang 16.0.3 12 24 36 48 60 SE +/- 0.04, N = 3 SE +/- 0.54, N = 3 46.24 52.31 1. (CC) gcc options: -O3 -march=native -flto
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Clang 16.0.3 GCC 13.1.1 4K 8K 12K 16K 20K SE +/- 103.96, N = 3 SE +/- 141.14, N = 3 16246.7 16409.9 1. (CC) gcc options: -O3 -march=native -flto
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 13.1.1 Clang 16.0.3 11 22 33 44 55 SE +/- 0.49, N = 3 SE +/- 0.53, N = 3 45.84 50.85 1. (CC) gcc options: -O3 -march=native -flto
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 13.1.1 Clang 16.0.3 4K 8K 12K 16K 20K SE +/- 80.69, N = 3 SE +/- 50.74, N = 3 16305.1 16340.0 1. (CC) gcc options: -O3 -march=native -flto
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed Clang 16.0.3 GCC 13.1.1 70 140 210 280 350 SE +/- 3.40, N = 3 SE +/- 4.38, N = 3 309.8 312.6 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed Clang 16.0.3 GCC 13.1.1 400 800 1200 1600 2000 SE +/- 6.60, N = 3 SE +/- 3.39, N = 3 1632.7 1662.6 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed GCC 13.1.1 Clang 16.0.3 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 16.8 16.8 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed Clang 16.0.3 GCC 13.1.1 300 600 900 1200 1500 SE +/- 1.96, N = 3 SE +/- 9.49, N = 3 1387.5 1413.0 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed Clang 16.0.3 GCC 13.1.1 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 8.26 8.31 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed GCC 13.1.1 Clang 16.0.3 300 600 900 1200 1500 SE +/- 4.13, N = 3 SE +/- 4.51, N = 3 1341.4 1343.5 -Qunused-arguments 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz
libjpeg-turbo tjbench tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput Clang 16.0.3 GCC 13.1.1 50 100 150 200 250 SE +/- 0.20, N = 3 SE +/- 0.85, N = 3 217.47 219.39 1. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.30 GCC 13.1.1 Clang 16.0.3 700 1400 2100 2800 3500 SE +/- 2.59, N = 3 SE +/- 7.81, N = 3 3238.8 3391.1 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms Clang 16.0.3 GCC 13.1.1 90 180 270 360 450 SE +/- 0.05, N = 3 SE +/- 0.35, N = 3 396.46 435.81 1. (CXX) g++ options: -O3 -march=native -flto -fPIC -pthread -pipe
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating GCC 13.1.1 150K 300K 450K 600K 750K SE +/- 7447.92, N = 3 691246 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Test: Compression Rating
Clang 16.0.3: The test quit with a non-zero exit status. E: compress-7zip: line 2: ./CPP/7zip/Bundles/Alone2/_o/7zz: No such file or directory
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating GCC 13.1.1 200K 400K 600K 800K 1000K SE +/- 4772.36, N = 3 1155277 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Test: Decompression Rating
Clang 16.0.3: The test quit with a non-zero exit status. E: compress-7zip: line 2: ./CPP/7zip/Bundles/Alone2/_o/7zz: No such file or directory
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100 GCC 13.1.1 Clang 16.0.3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 11.54 11.75 -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless GCC 13.1.1 Clang 16.0.3 0.342 0.684 1.026 1.368 1.71 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.35 1.52 -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression GCC 13.1.1 Clang 16.0.3 0.936 1.872 2.808 3.744 4.68 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.21 4.16 -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless, Highest Compression GCC 13.1.1 Clang 16.0.3 0.1418 0.2836 0.4254 0.5672 0.709 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.55 0.63 -ltiff 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Fast GCC 13.1.1 Clang 16.0.3 200 400 600 800 1000 SE +/- 3.96, N = 3 SE +/- 5.60, N = 3 718.81 780.95 1. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Medium GCC 13.1.1 Clang 16.0.3 120 240 360 480 600 SE +/- 2.85, N = 3 SE +/- 2.38, N = 3 552.03 570.62 1. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Thorough GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.09, N = 3 117.03 117.75 1. (CXX) g++ options: -O3 -march=native -flto -pthread
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Exhaustive Clang 16.0.3 GCC 13.1.1 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 12.77 12.84 1. (CXX) g++ options: -O3 -march=native -flto -pthread
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 13.1.1 Clang 16.0.3 400K 800K 1200K 1600K 2000K SE +/- 2555.19, N = 5 SE +/- 2743.38, N = 5 1661988 1758567 1. (CC) gcc options: -O3 -march=native -flto
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time Clang 16.0.3 GCC 13.1.1 110M 220M 330M 440M 550M SE +/- 8586129.93, N = 15 SE +/- 3102500.18, N = 3 517643140 532213592 -fprofile-use -lgcov -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -mbmi2 -fno-peel-loops -fno-tracer -flto=jobserver 1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -flto -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:5 GCC 13.1.1 400K 800K 1200K 1600K 2000K SE +/- 6222.06, N = 3 2033827.48 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Set To Get Ratio: 1:5
Clang 16.0.3: The test run did not produce a result. E: memcached: line 7: ./memtier_benchmark: No such file or directory
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 GCC 13.1.1 700K 1400K 2100K 2800K 3500K SE +/- 12793.89, N = 3 3302722.11 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Set To Get Ratio: 1:10
Clang 16.0.3: The test run did not produce a result. E: memcached: line 7: ./memtier_benchmark: No such file or directory
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 GCC 13.1.1 900K 1800K 2700K 3600K 4500K SE +/- 15774.50, N = 3 4105286.36 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Set To Get Ratio: 1:100
Clang 16.0.3: The test run did not produce a result. E: memcached: line 7: ./memtier_benchmark: No such file or directory
MariaDB This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 2048 Clang 16.0.3 GCC 13.1.1 80 160 240 320 400 SE +/- 2.37, N = 3 SE +/- 0.65, N = 3 373 385 -lcrypt -lssl -lcrypto -fPIC -pie -shared -lrt -lstdc++ 1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lz -lm -lpthread -ldl
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 4096 Clang 16.0.3 GCC 13.1.1 90 180 270 360 450 SE +/- 3.39, N = 3 SE +/- 2.65, N = 3 379 394 -lcrypt -lssl -lcrypto -fPIC -pie -shared -lrt -lstdc++ 1. (CXX) g++ options: -O3 -march=native -flto -fstack-protector -lz -lm -lpthread -ldl
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK GCC 13.1.1 300K 600K 900K 1200K 1500K SE +/- 4630.81, N = 3 1261333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt
Test: WPA PSK
Clang 16.0.3: The test quit with a non-zero exit status. E: john-the-ripper: line 3: ./john: No such file or directory
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish GCC 13.1.1 70K 140K 210K 280K 350K SE +/- 401.92, N = 3 323256 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt
Test: Blowfish
Clang 16.0.3: The test quit with a non-zero exit status. E: john-the-ripper: line 3: ./john: No such file or directory
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 GCC 13.1.1 60M 120M 180M 240M 300M SE +/- 2757367.12, N = 3 286625667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt
Test: HMAC-SHA512
Clang 16.0.3: The test quit with a non-zero exit status. E: john-the-ripper: line 3: ./john: No such file or directory
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 GCC 13.1.1 6M 12M 18M 24M 30M SE +/- 29475.04, N = 3 28819667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt
Test: MD5
Clang 16.0.3: The test quit with a non-zero exit status. E: john-the-ripper: line 3: ./john: No such file or directory
Stargate Digital Audio Workstation Stargate is an open-source, cross-platform digital audio workstation (DAW) software package with "a unique and carefully curated experience" with scalability from old systems up through modern multi-core systems. Stargate is GPLv3 licensed and makes use of Qt5 (PyQt5) for its user-interface. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 44100 - Buffer Size: 512 GCC 13.1.1 Clang 16.0.3 2 4 6 8 10 SE +/- 0.001560, N = 3 SE +/- 0.001401, N = 3 5.837183 6.691175 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 192000 - Buffer Size: 512 GCC 13.1.1 Clang 16.0.3 0.6481 1.2962 1.9443 2.5924 3.2405 SE +/- 0.007283, N = 3 SE +/- 0.007221, N = 3 2.516856 2.880358 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 44100 - Buffer Size: 1024 GCC 13.1.1 Clang 16.0.3 2 4 6 8 10 SE +/- 0.000734, N = 3 SE +/- 0.005456, N = 3 6.270559 7.161300 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 192000 - Buffer Size: 1024 GCC 13.1.1 Clang 16.0.3 0.7329 1.4658 2.1987 2.9316 3.6645 SE +/- 0.006270, N = 3 SE +/- 0.007486, N = 3 2.855186 3.257372 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 GCC 13.1.1 Clang 16.0.3 1000M 2000M 3000M 4000M 5000M SE +/- 8346922.25, N = 3 SE +/- 1808314.13, N = 3 4641933333 4680200000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 GCC 13.1.1 Clang 16.0.3 2000M 4000M 6000M 8000M 10000M SE +/- 13312441.97, N = 3 SE +/- 913479.31, N = 3 8490666667 8665233333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 256 - Buffer Length: 256 - Filter Length: 57 GCC 13.1.1 Clang 16.0.3 3000M 6000M 9000M 12000M 15000M SE +/- 4096068.58, N = 3 SE +/- 2603416.56, N = 3 11285333333 11762333333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 384 - Buffer Length: 256 - Filter Length: 57 GCC 13.1.1 Clang 16.0.3 3000M 6000M 9000M 12000M 15000M SE +/- 7264831.57, N = 3 SE +/- 9527737.29, N = 3 11647666667 12418333333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 Clang 16.0.3 GCC 13.1.1 15K 30K 45K 60K 75K SE +/- 10.37, N = 3 SE +/- 24.90, N = 3 71901.7 71935.9 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 Clang 16.0.3 GCC 13.1.1 70M 140M 210M 280M 350M SE +/- 4076394.65, N = 15 SE +/- 6307414.85, N = 15 217323727 324078880 -fopenmp=libomp -fopenmp 1. (CXX) g++ options: -O3 -march=native -flto
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only GCC 13.1.1 Clang 16.0.3 700K 1400K 2100K 2800K 3500K SE +/- 57281.48, N = 12 SE +/- 34767.93, N = 12 3362193 3457173 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 GCC 13.1.1 Clang 16.0.3 600K 1200K 1800K 2400K 3000K SE +/- 105.74, N = 3 SE +/- 437.47, N = 3 2921019.6 2931038.9 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU GCC 13.1.1 Clang 16.0.3 500 1000 1500 2000 2500 SE +/- 42.69, N = 15 SE +/- 8.52, N = 15 2191.22 468.27 -fopenmp - MIN: 1875.06 -fopenmp=libomp - MIN: 441.14 1. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU GCC 13.1.1 Clang 16.0.3 500 1000 1500 2000 2500 SE +/- 26.76, N = 15 SE +/- 2.20, N = 3 2406.70 885.03 -fopenmp - MIN: 2191.9 -fopenmp=libomp - MIN: 871.44 1. (CXX) g++ options: -O3 -march=native -flto -msse4.1 -fPIC -pie -ldl -lpthread
OpenJPEG OpenJPEG is an open-source JPEG 2000 codec written in the C programming language. The default input for this test profile is the NASA/JPL-Caltech/MSSS Curiosity panorama 717MB TIFF image file converting to JPEG2000 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenJPEG 2.4 Encode: NASA Curiosity Panorama M34 GCC 13.1.1 Clang 16.0.3 40K 80K 120K 160K 200K SE +/- 812.57, N = 3 SE +/- 571.61, N = 3 179824 162086 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency GCC 13.1.1 Clang 16.0.3 0.0538 0.1076 0.1614 0.2152 0.269 SE +/- 0.004, N = 12 SE +/- 0.002, N = 12 0.239 0.232 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lm
Google Draco Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Lion GCC 13.1.1 Clang 16.0.3 1100 2200 3300 4400 5500 SE +/- 29.18, N = 3 SE +/- 44.34, N = 3 5243 5106 1. (CXX) g++ options: -O3 -march=native -flto
OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Church Facade GCC 13.1.1 Clang 16.0.3 1400 2800 4200 5600 7000 SE +/- 13.35, N = 3 SE +/- 4.84, N = 3 6750 6559 1. (CXX) g++ options: -O3 -march=native -flto
SQLite This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 1 Clang 16.0.3 GCC 13.1.1 0.4601 0.9202 1.3803 1.8404 2.3005 SE +/- 0.015, N = 3 SE +/- 0.002, N = 3 2.045 2.028 1. (CC) gcc options: -O3 -march=native -flto -lz -lm
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 2 GCC 13.1.1 Clang 16.0.3 0.5616 1.1232 1.6848 2.2464 2.808 SE +/- 0.027, N = 3 SE +/- 0.032, N = 3 2.496 2.442 1. (CC) gcc options: -O3 -march=native -flto -lz -lm
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.66, N = 3 113.34 112.11 -mabm 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -flto -lm
Z3 Theorem Prover The Z3 Theorem Prover / SMT solver is developed by Microsoft Research under the MIT license. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Z3 Theorem Prover 4.12.1 SMT File: 1.smt2 GCC 13.1.1 6 12 18 24 30 SE +/- 0.19, N = 3 25.90 1. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC
SMT File: 1.smt2
Clang 16.0.3: The test quit with a non-zero exit status. E: z3: line 2: ./z3-z3-4.12.1/build/z3: No such file or directory
OpenBenchmarking.org Seconds, Fewer Is Better Z3 Theorem Prover 4.12.1 SMT File: 2.smt2 GCC 13.1.1 16 32 48 64 80 SE +/- 0.06, N = 3 72.23 1. (CXX) g++ options: -lpthread -O3 -march=native -flto -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -fPIC
SMT File: 2.smt2
Clang 16.0.3: The test quit with a non-zero exit status. E: z3: line 2: ./z3-z3-4.12.1/build/z3: No such file or directory
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.11 Encoder Speed: 2 GCC 13.1.1 Clang 16.0.3 8 16 24 32 40 SE +/- 0.05, N = 3 SE +/- 0.29, N = 3 33.81 32.66 1. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.11 Encoder Speed: 6 GCC 13.1.1 Clang 16.0.3 0.5675 1.135 1.7025 2.27 2.8375 SE +/- 0.025, N = 5 SE +/- 0.020, N = 3 2.522 2.496 1. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.11 Encoder Speed: 6, Lossless GCC 13.1.1 Clang 16.0.3 1.0458 2.0916 3.1374 4.1832 5.229 SE +/- 0.059, N = 3 SE +/- 0.035, N = 3 4.648 4.220 1. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.11 Encoder Speed: 10, Lossless GCC 13.1.1 Clang 16.0.3 0.8343 1.6686 2.5029 3.3372 4.1715 SE +/- 0.023, N = 3 SE +/- 0.020, N = 3 3.708 3.527 1. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Clang 16.0.3 GCC 13.1.1 0.7207 1.4414 2.1621 2.8828 3.6035 SE +/- 0.042, N = 3 SE +/- 0.058, N = 15 3.203 3.063 1. (CC) gcc options: -lm -lpthread -O3 -march=native -flto
POV-Ray This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 13.1.1 Clang 16.0.3 2 4 6 8 10 SE +/- 0.053, N = 8 SE +/- 0.053, N = 7 6.123 5.654 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
eSpeak-NG Speech Engine This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 1.51 Text-To-Speech Synthesis Clang 16.0.3 GCC 13.1.1 7 14 21 28 35 SE +/- 0.15, N = 4 SE +/- 0.02, N = 4 28.27 27.49 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GPAW 22.1 Input: Carbon Nanotube GCC 13.1.1 Clang 16.0.3 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.24, N = 3 140.86 140.28 1. (CC) gcc options: -shared -O3 -march=native -flto -lxc -lblas -lmpi
Faiss Faiss is developed by Meta/Facebook. Faiss is a library for efficient similarity search and clustering of dense vectors. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Faiss 1.7.4 Test: demo_sift1M GCC 13.1.1 13 26 39 52 65 SE +/- 0.19, N = 2 59.40 1. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc
Test: demo_sift1M
Clang 16.0.3: The test quit with a non-zero exit status.
GCC 13.1.1 Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1004D BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Monitor: VGA HDMI, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Fedora Linux 38, Kernel: 6.2.15-300.fc38.x86_64 (x86_64), Compiler: GCC 13.1.1 20230511, File-System: xfs, Screen Resolution: 640x480
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driverDisk Notes: none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 20 May 2023 15:05 by user .
Clang 16.0.3 Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1004D BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Monitor: VGA HDMI, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Fedora Linux 38, Kernel: 6.2.15-300.fc38.x86_64 (x86_64), Compiler: Clang 16.0.3 + LLVM 16.0.3, File-System: xfs, Screen Resolution: 640x480
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Disk Notes: none / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa101111Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 May 2023 10:51 by user .