Intel Xeon Gold 6226R testing with a Supermicro X11SPL-F v1.02 (3.1 BIOS) and ASPEED on Ubuntu 20.10 via the Phoronix Test Suite.
A Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Java Notes: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10)Python Notes: Python 3.8.10Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
B C D Processor: Intel Xeon Gold 6226R @ 3.90GHz (16 Cores / 32 Threads), Motherboard: Supermicro X11SPL-F v1.02 (3.1 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 192GB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel I210
OS: Ubuntu 20.10, Kernel: 5.11.0-rc4-max-boost-inv-patch (x86_64) 20210121, Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Compiler: GCC 10.3.0, File-System: ext4, Screen Resolution: 1920x1080
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K D C B A 0.7538 1.5076 2.2614 3.0152 3.769 3.32 3.34 3.35 3.34 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K D C B A 3 6 9 12 15 10.15 10.80 10.28 11.34 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K D C B A 1.2218 2.4436 3.6654 4.8872 6.109 5.40 5.43 5.40 5.42 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K D C B A 4 8 12 16 20 18.01 17.95 17.93 18.05 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K D C B A 7 14 21 28 35 28.75 28.52 28.66 28.56 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K D C B A 7 14 21 28 35 28.44 28.43 28.38 28.35 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p D C B A 0.0788 0.1576 0.2364 0.3152 0.394 0.35 0.35 0.34 0.34 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p D C B A 1.1925 2.385 3.5775 4.77 5.9625 5.17 5.26 5.25 5.30 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p D C B A 5 10 15 20 25 18.34 16.20 15.69 15.81 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p D C B A 3 6 9 12 15 11.36 11.38 11.47 11.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p D C B A 11 22 33 44 55 48.92 43.61 44.72 46.57 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p D C B A 14 28 42 56 70 58.10 54.58 61.81 56.66 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.4 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p D C B A 15 30 45 60 75 64.46 65.51 62.03 64.95 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported and HIP for AMD Radeon GPUs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.2 Blend File: BMW27 - Compute: CPU-Only D B A 20 40 60 80 100 107.51 107.67 107.50
Chia Blockchain VDF Chia is a blockchain and smart transaction platform based on proofs of space and time rather than proofs of work with other cryptocurrencies. This test profile is benchmarking the CPU performance for Chia VDF performance using the Chia VDF benchmark. The Chia VDF is for the Chia Verifiable Delay Function (Proof of Time). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.7 Test: Square Plain C++ D C B A 30K 60K 90K 120K 150K 118700 118800 117200 118800 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
ClickHouse ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all queries performed. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 100M Rows Web Analytics Dataset, First Run / Cold Cache D B A 40 80 120 160 200 178.95 172.52 177.76 MIN: 22.51 / MAX: 8571.43 MIN: 21.87 / MAX: 2142.86 MIN: 21.98 / MAX: 2222.22 1. ClickHouse server version 22.7.1.687 (official build).
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 100M Rows Web Analytics Dataset, Second Run D B A 40 80 120 160 200 190.36 180.53 188.33 MIN: 22.43 / MAX: 7500 MIN: 23.03 / MAX: 7500 MIN: 23.09 / MAX: 7500 1. ClickHouse server version 22.7.1.687 (official build).
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 100M Rows Web Analytics Dataset, Third Run D B A 40 80 120 160 200 179.81 185.00 184.16 MIN: 22.6 / MAX: 5454.55 MIN: 22.69 / MAX: 7500 MIN: 22.88 / MAX: 4000 1. ClickHouse server version 22.7.1.687 (official build).
Etcpak Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. The test profile uses a 8K x 8K game texture as a sample input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 1.0 Benchmark: Multi-Threaded - Configuration: ETC2 D C B A 600 1200 1800 2400 3000 2921.72 2920.82 2916.51 2940.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 1.0 Benchmark: Single-Threaded - Configuration: ETC2 D C B A 40 80 120 160 200 191.73 188.56 188.50 191.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Update Random D B A 140K 280K 420K 560K 700K 633869 633293 633380 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read While Writing D B A 600K 1200K 1800K 2400K 3000K 2995064 3015141 3010198 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read Random Write Random D B A 500K 1000K 1500K 2000K 2500K 2147824 2143910 2157960 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Glibc Benchmarks The GNU C Library project provides the core libraries for the GNU system and GNU/Linux systems, as well as many other systems that use Linux as the kernel. These libraries provide critical APIs including ISO C11, POSIX.1-2008, BSD, OS-specific APIs and more. This test profile makes use of Glibc's "benchtests" integrated benchmark suite. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: cos D C B A 20 40 60 80 100 75.37 74.92 74.98 74.71 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: ffs D C B A 1.1895 2.379 3.5685 4.758 5.9475 5.25955 5.27749 5.28646 5.27522 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: ffsll D C B A 1.1458 2.2916 3.4374 4.5832 5.729 5.00725 5.02977 5.03218 5.09231 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: pthread_once D C B A 1.191 2.382 3.573 4.764 5.955 5.24905 5.26201 5.26195 5.29319 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org bfs median_TEPS, More Is Better Graph500 3.0 Scale: 26 D B A 70M 140M 210M 280M 350M 333423000 332860000 334362000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -pthread -lmpi
OpenBenchmarking.org bfs max_TEPS, More Is Better Graph500 3.0 Scale: 26 D B A 70M 140M 210M 280M 350M 336656000 335925000 337688000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -pthread -lmpi
OpenBenchmarking.org sssp median_TEPS, More Is Better Graph500 3.0 Scale: 26 D B A 20M 40M 60M 80M 100M 95831400 96562300 98000500 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -pthread -lmpi
OpenBenchmarking.org sssp max_TEPS, More Is Better Graph500 3.0 Scale: 26 D B A 30M 60M 90M 120M 150M 128219000 129201000 129189000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -pthread -lmpi
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare D B A 0.4199 0.8398 1.2597 1.6796 2.0995 1.859 1.866 1.852 1. (CXX) g++ options: -O3 -pthread
InfluxDB This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 D B A 110K 220K 330K 440K 550K 486128.9 484406.6 491686.5
OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 D B A 200K 400K 600K 800K 1000K 792169.6 782762.9 800956.6
Nettle GNU Nettle is a low-level cryptographic library used by GnuTLS and other software. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: aes256 D C B A 1300 2600 3900 5200 6500 6125.51 6113.96 6099.23 6108.01 MIN: 4141.36 / MAX: 10112.82 MIN: 4137.66 / MAX: 10107.56 MIN: 4118.79 / MAX: 10081.9 MIN: 4146.84 / MAX: 10065.96 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: chacha D C B A 300 600 900 1200 1500 1197.94 1189.61 1193.85 1196.84 MIN: 543.43 / MAX: 3622.61 MIN: 542.95 / MAX: 3584.05 MIN: 542.9 / MAX: 3611.06 MIN: 542.93 / MAX: 3617.62 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: sha512 D C B A 110 220 330 440 550 485.65 481.46 484.98 481.72 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: poly1305-aes D C B A 800 1600 2400 3200 4000 3331.02 3444.22 3625.46 3603.86 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU D B A 0.5174 1.0348 1.5522 2.0696 2.587 2.29593 2.29954 2.29507 MIN: 2.23 MIN: 2.23 MIN: 2.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU D B A 0.6964 1.3928 2.0892 2.7856 3.482 3.06766 3.08617 3.09520 MIN: 3.04 MIN: 3.06 MIN: 3.07 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU D B A 0.1124 0.2248 0.3372 0.4496 0.562 0.499623 0.497500 0.498823 MIN: 0.48 MIN: 0.48 MIN: 0.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU D B A 0.2814 0.5628 0.8442 1.1256 1.407 1.24158 1.25075 1.24593 MIN: 1.21 MIN: 1.22 MIN: 1.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU D B A 1.2575 2.515 3.7725 5.03 6.2875 5.58271 5.58907 5.58490 MIN: 5.54 MIN: 5.54 MIN: 5.56 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU D B A 0.5847 1.1694 1.7541 2.3388 2.9235 2.57547 2.59850 2.59510 MIN: 2.54 MIN: 2.56 MIN: 2.57 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU D B A 0.9756 1.9512 2.9268 3.9024 4.878 4.31526 4.33581 4.27636 MIN: 4.28 MIN: 4.3 MIN: 4.24 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU D B A 2 4 6 8 10 5.71241 7.82105 5.64541 MIN: 4.95 MIN: 5.17 MIN: 5.14 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU D B A 0.7512 1.5024 2.2536 3.0048 3.756 3.33852 3.33286 3.33540 MIN: 3.31 MIN: 3.3 MIN: 3.3 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU D B A 0.9468 1.8936 2.8404 3.7872 4.734 4.14962 4.20788 4.19879 MIN: 4.08 MIN: 4.15 MIN: 4.15 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU D B A 0.122 0.244 0.366 0.488 0.61 0.539423 0.541061 0.542180 MIN: 0.53 MIN: 0.53 MIN: 0.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU D B A 0.194 0.388 0.582 0.776 0.97 0.862283 0.843142 0.857716 MIN: 0.85 MIN: 0.83 MIN: 0.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU D B A 300 600 900 1200 1500 1614.10 1610.87 1613.62 MIN: 1612.13 MIN: 1608.57 MIN: 1610.95 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU D B A 400 800 1200 1600 2000 908.73 1742.77 908.62 MIN: 905.49 MIN: 901.64 MIN: 906.39 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU D B A 300 600 900 1200 1500 1612.51 1608.45 1613.54 MIN: 1610.59 MIN: 1606.49 MIN: 1610.95 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU D B A 3 6 9 12 15 9.40951 9.40000 9.41055 MIN: 9.1 MIN: 9.05 MIN: 9.06 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU D B A 9 18 27 36 45 13.71 37.31 13.72 MIN: 13.6 MIN: 13.61 MIN: 13.6 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU D B A 3 6 9 12 15 12.68 12.69 12.64 MIN: 12.49 MIN: 12.49 MIN: 12.47 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU D B A 200 400 600 800 1000 915.54 911.40 907.92 MIN: 907.78 MIN: 908.73 MIN: 905.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU D B A 0.2186 0.4372 0.6558 0.8744 1.093 0.965417 0.971579 0.965690 MIN: 0.94 MIN: 0.94 MIN: 0.94 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU D B A 300 600 900 1200 1500 1607.97 1610.84 1620.75 MIN: 1605.89 MIN: 1608.65 MIN: 1616.96 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU D B A 200 400 600 800 1000 906.38 910.95 906.69 MIN: 904.3 MIN: 909.05 MIN: 904.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU D B A 0.0696 0.1392 0.2088 0.2784 0.348 0.306607 0.308236 0.309251 MIN: 0.29 MIN: 0.29 MIN: 0.29 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU D B A 0.4741 0.9482 1.4223 1.8964 2.3705 2.10205 2.10124 2.10714 MIN: 2 MIN: 2 MIN: 2.01 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/ao/real_time D C B A 1.1585 2.317 3.4755 4.634 5.7925 4.90723 5.14891 4.95079 5.09652
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/scivis/real_time D C B A 1.1465 2.293 3.4395 4.586 5.7325 5.03878 4.93475 5.09572 5.05624
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/ao/real_time D B A 0.9707 1.9414 2.9121 3.8828 4.8535 4.25724 4.31440 4.22476
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time D B A 0.9443 1.8886 2.8329 3.7772 4.7215 4.19684 4.18303 4.09826
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time D B A 1.1855 2.371 3.5565 4.742 5.9275 5.26908 5.24165 5.26585
OSPray Studio Intel OSPray Studio is an open-source, interactive visualization and ray-tracing software package. OSPray Studio makes use of Intel OSPray, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer D B A 400 800 1200 1600 2000 1764 1764 1765 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer D B A 400 800 1200 1600 2000 1810 1807 1813 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer D B A 500 1000 1500 2000 2500 2158 2158 2154 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer D B A 6K 12K 18K 24K 30K 28119 28170 28164 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer D B A 15K 30K 45K 60K 75K 71554 71515 71609 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer D B A 6K 12K 18K 24K 30K 28965 28975 28943 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer D B A 16K 32K 48K 64K 80K 73139 73143 73188 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer D B A 7K 14K 21K 28K 35K 34554 34458 34458 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer D B A 20K 40K 60K 80K 100K 84177 84292 84266 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
perf-bench This test profile is used for running Linux perf-bench, the benchmark support within the Linux kernel's perf tool. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Epoll Wait D C B A 8K 16K 24K 32K 40K 37961 37937 37525 32940 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Hash D C B A 900K 1800K 2700K 3600K 4500K 4284624 4279202 4275533 4286945 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memcpy 1MB D C B A 4 8 12 16 20 16.69 17.32 16.13 16.31 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memset 1MB D C B A 13 26 39 52 65 58.13 57.25 53.06 54.75 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Sched Pipe D C B A 14K 28K 42K 56K 70K 65731 64802 64584 66578 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Lock-Pi D C B A 60 120 180 240 300 259 255 254 254 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Syscall Basic D C B A 3M 6M 9M 12M 15M 15715420 15762401 15705712 15798883 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Random Forest D C B A 300 600 900 1200 1500 1270.3 1294.0 1306.5 1257.1 MIN: 1095.14 / MAX: 1587.52 MIN: 1103.42 / MAX: 1555.51 MIN: 1145.21 / MAX: 1634.75 MIN: 1100.14 / MAX: 1505.25
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: ALS Movie Lens D C B A 7K 14K 21K 28K 35K 32248.8 32444.9 32723.9 32565.2 MIN: 32248.75 / MAX: 36198.44 MIN: 32444.88 / MAX: 36308.62 MAX: 36687.86 MAX: 36709.57
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Apache Spark ALS D C B A 10K 20K 30K 40K 50K 46755.2 47049.0 47294.8 46525.4 MIN: 46459.2 / MAX: 47228.88 MIN: 46726.96 / MAX: 47445.93 MIN: 47025.31 / MAX: 47506.01 MIN: 46083.17 / MAX: 46911.57
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Apache Spark Bayes D C B A 200 400 600 800 1000 1025.3 1050.0 1007.4 1050.9 MIN: 707.77 / MAX: 1237.47 MIN: 727.77 / MAX: 1083.23 MIN: 711.34 / MAX: 1137.21 MIN: 746.8 / MAX: 1223.38
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Savina Reactors.IO D C B A 3K 6K 9K 12K 15K 13865.9 14494.3 13958.6 13462.3 MIN: 13865.86 / MAX: 21129.28 MAX: 22331.73 MAX: 22309.66 MIN: 13462.26 / MAX: 22411.09
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Apache Spark PageRank D C B A 900 1800 2700 3600 4500 4284.6 4299.5 4209.3 4192.6 MIN: 3918.99 MIN: 3931 / MAX: 4358.46 MIN: 3824.15 / MAX: 4209.33 MIN: 3823.47 / MAX: 4330.98
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Finagle HTTP Requests D C B A 1000 2000 3000 4000 5000 4452.2 4736.2 4582.9 4750.2 MIN: 4141.91 / MAX: 4605.95 MIN: 4436.85 / MAX: 5025.71 MIN: 4288.71 / MAX: 4709.87 MIN: 4482.76 / MAX: 4931.6
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: In-Memory Database Shootout D C B A 1500 3000 4500 6000 7500 6859.4 7005.7 6911.4 7055.2 MIN: 6493.39 / MAX: 7631.94 MIN: 6609.72 / MAX: 7596.98 MIN: 6607.3 / MAX: 7477.88 MIN: 6764.78 / MAX: 7663.6
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Akka Unbalanced Cobwebbed Tree D C B A 3K 6K 9K 12K 15K 13955.3 14243.8 14086.8 14036.1 MIN: 10954.7 / MAX: 13955.34 MIN: 11103.53 MIN: 11040.72 MIN: 11070.54 / MAX: 14036.14
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Genetic Algorithm Using Jenetics + Futures D C B A 1500 3000 4500 6000 7500 6898.9 6929.0 6984.7 6811.4 MIN: 6738.34 / MAX: 7028.82 MIN: 6701.66 / MAX: 7014.21 MIN: 6844.51 / MAX: 7102.31 MIN: 6586.47 / MAX: 6889.5
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya D C B A 0.5108 1.0216 1.5324 2.0432 2.554 2.22 2.22 2.25 2.27 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom D C B A 0.1958 0.3916 0.5874 0.7832 0.979 0.86 0.87 0.86 0.86 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID D C B A 0.8978 1.7956 2.6934 3.5912 4.489 3.98 3.97 3.99 3.99 1. (CXX) g++ options: -O3 -pthread
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time D B A 8M 16M 24M 32M 40M 38758154 37498341 34892064 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: NUMA D B A 70 140 210 280 350 303.27 303.60 304.15 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Futex D B A 300K 600K 900K 1200K 1500K 1353834.89 1426739.55 1268283.31 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: MEMFD D B A 200 400 600 800 1000 1003.05 997.77 996.47 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Atomic D B A 50K 100K 150K 200K 250K 221704.24 218089.40 230497.09 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Crypto D B A 4K 8K 12K 16K 20K 18828.45 18769.73 18827.88 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Malloc D B A 5M 10M 15M 20M 25M 23681966.64 23706403.33 23719707.91 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Forking D B A 11K 22K 33K 44K 55K 53447.82 53675.15 53558.81 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: IO_uring D B A 400K 800K 1200K 1600K 2000K 2041307.09 2029594.83 2029946.20 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: SENDFILE D B A 60K 120K 180K 240K 300K 291150.87 291265.36 291326.94 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: CPU Cache D B A 20 40 60 80 100 99.32 95.52 100.26 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: CPU Stress D B A 7K 14K 21K 28K 35K 33056.59 33094.73 33015.04 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Semaphores D B A 500K 1000K 1500K 2000K 2500K 2396852.36 2396549.19 2396119.52 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Matrix Math D B A 16K 32K 48K 64K 80K 74961.31 74563.38 74771.24 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Vector Math D B A 12K 24K 36K 48K 60K 55809.19 55713.96 55452.72 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: x86_64 RdRand D B A 50K 100K 150K 200K 250K 252382.87 252371.71 252385.40 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Memory Copying D B A 1200 2400 3600 4800 6000 5277.74 5775.62 5550.34 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Socket Activity D B A 2K 4K 6K 8K 10K 10146.64 10443.02 10009.90 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Context Switching D B A 2M 4M 6M 8M 10M 11341067.61 8099474.46 8352208.62 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Glibc C String Functions D B A 200K 400K 600K 800K 1000K 918756.99 929453.95 927638.28 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Glibc Qsort Data Sorting D B A 40 80 120 160 200 196.90 196.57 197.03 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: System V Message Passing D B A 700K 1400K 2100K 2800K 3500K 3444591.20 3397385.88 3422497.38 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lbsd -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 4 - Input: Bosphorus 4K D C B A 0.4628 0.9256 1.3884 1.8512 2.314 2.057 2.038 2.050 2.050 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K D C B A 7 14 21 28 35 30.58 31.17 30.64 30.97 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 4K D C B A 15 30 45 60 75 67.20 67.16 65.05 66.94 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K D C B A 20 40 60 80 100 84.48 83.69 84.49 86.13 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 4 - Input: Bosphorus 1080p D C B A 1.0341 2.0682 3.1023 4.1364 5.1705 4.501 4.596 4.482 4.498 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 1080p D C B A 20 40 60 80 100 81.46 81.20 82.82 83.47 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 1080p D C B A 30 60 90 120 150 148.74 144.18 147.48 150.16 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 1080p D C B A 40 80 120 160 200 173.05 174.95 176.73 176.10 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 4K D C B A 0.5918 1.1836 1.7754 2.3672 2.959 2.63 2.63 2.63 2.63 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K D C B A 12 24 36 48 60 53.58 53.41 53.15 53.34 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K D C B A 20 40 60 80 100 102.69 102.51 102.69 101.61 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K D C B A 20 40 60 80 100 86.36 85.79 85.00 85.59 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K D C B A 20 40 60 80 100 90.02 89.35 89.50 89.67 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K D C B A 15 30 45 60 75 66.62 65.65 65.75 66.48 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Tachyon This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. The sample scene used is the Teapot scene ray-traced to 8K x 8K with 32 samples. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99.2 Total Time D B A 20 40 60 80 100 109.92 108.35 108.56 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20220422 Encode Settings: Default D B A 0.7268 1.4536 2.1804 2.9072 3.634 3.202 3.230 3.180 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 1080p D C B A 20 40 60 80 100 103.04 107.40 105.41 105.62 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -flto
A Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Java Notes: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10)Python Notes: Python 3.8.10Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 26 June 2022 07:25 by user pts.
B Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Java Notes: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10)Python Notes: Python 3.8.10Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 26 June 2022 13:06 by user pts.
C Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Java Notes: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10)Python Notes: Python 3.8.10Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 26 June 2022 18:48 by user pts.
D Processor: Intel Xeon Gold 6226R @ 3.90GHz (16 Cores / 32 Threads), Motherboard: Supermicro X11SPL-F v1.02 (3.1 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 192GB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel I210
OS: Ubuntu 20.10, Kernel: 5.11.0-rc4-max-boost-inv-patch (x86_64) 20210121, Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Compiler: GCC 10.3.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102Java Notes: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10)Python Notes: Python 3.8.10Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 26 June 2022 20:51 by user pts.