Intel Core i7-6800K testing with a MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) and Zotac NVIDIA NV137 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
Run 1 Processor: Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 120GB TOSHIBA TR150, Graphics: Zotac NVIDIA GeForce GTX 1050, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-LM + Intel I210
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: modesetting 1.20.9, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Run 2 Run 3 Processor: Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 120GB TOSHIBA TR150, Graphics: Zotac NVIDIA NV137 2GB , Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-LM + Intel I210
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: modesetting 1.20.9, OpenGL: 4.3 Mesa 20.2.1, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p Run 1 Run 2 Run 3 80 160 240 320 400 SE +/- 0.56, N = 3 SE +/- 1.27, N = 3 SE +/- 1.19, N = 3 361.59 362.52 361.92 MIN: 267 / MAX: 561.79 MIN: 267.49 / MAX: 564.49 MIN: 266.52 / MAX: 567.25 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 102.45 102.74 102.34 MIN: 96.59 / MAX: 115.32 MIN: 96.51 / MAX: 116.33 MIN: 96.39 / MAX: 115.07 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p Run 1 Run 2 Run 3 70 140 210 280 350 SE +/- 0.61, N = 3 SE +/- 0.18, N = 3 SE +/- 0.82, N = 3 322.35 323.32 323.64 MIN: 273.49 / MAX: 351.84 MIN: 283.06 / MAX: 351.54 MIN: 272 / MAX: 354.89 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit Run 1 Run 2 Run 3 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 SE +/- 0.01, N = 3 67.76 67.71 67.60 MIN: 44.17 / MAX: 170.52 MIN: 44.16 / MAX: 166.83 MIN: 44.14 / MAX: 165.37 1. (CC) gcc options: -pthread
yquake2 This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better yquake2 7.45 Renderer: Software CPU - Resolution: 1920 x 1080 Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.78, N = 3 SE +/- 0.93, N = 3 SE +/- 0.26, N = 3 81.7 81.2 81.3 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.0045, N = 3 SE +/- 0.0251, N = 3 SE +/- 0.0111, N = 3 7.3664 7.3146 7.3115 MIN: 7.29 / MAX: 7.5 MIN: 7.22 / MAX: 7.48 MIN: 7.24 / MAX: 7.45
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.0019, N = 3 SE +/- 0.0742, N = 3 SE +/- 0.0113, N = 3 7.5212 7.6137 7.5330 MIN: 7.48 / MAX: 7.63 MIN: 7.43 / MAX: 7.81 MIN: 7.49 / MAX: 7.62
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.0541, N = 3 SE +/- 0.0329, N = 3 SE +/- 0.0422, N = 3 9.2012 9.1127 9.1881 MIN: 9.04 / MAX: 9.42 MIN: 9.01 / MAX: 9.31 MIN: 9.09 / MAX: 9.41
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow Run 1 Run 2 Run 3 0.6053 1.2106 1.8159 2.4212 3.0265 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.69 2.69 2.69 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Run 1 Run 2 Run 3 0.621 1.242 1.863 2.484 3.105 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.75 2.76 2.75 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 11.00 11.04 11.01 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 11.33 11.35 11.33 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 7.71 7.73 7.71 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 14.04 14.04 14.02 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Run 1 Run 2 Run 3 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 29.38 29.35 29.36 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast Run 1 Run 2 Run 3 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 52.74 52.92 52.78 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 5 Run 1 Run 2 Run 3 0.1865 0.373 0.5595 0.746 0.9325 SE +/- 0.011, N = 3 SE +/- 0.006, N = 3 SE +/- 0.003, N = 3 0.828 0.818 0.829
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 6 Run 1 Run 2 Run 3 0.2457 0.4914 0.7371 0.9828 1.2285 SE +/- 0.002, N = 3 SE +/- 0.010, N = 3 SE +/- 0.007, N = 3 1.091 1.085 1.092
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 10 Run 1 Run 2 Run 3 0.5335 1.067 1.6005 2.134 2.6675 SE +/- 0.013, N = 3 SE +/- 0.025, N = 3 SE +/- 0.015, N = 3 2.371 2.369 2.354
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding Run 1 Run 2 Run 3 12 24 36 48 60 SE +/- 0.41, N = 9 SE +/- 0.43, N = 8 SE +/- 0.42, N = 9 52.05 51.97 52.15 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 8.64 8.60 8.74 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 SE +/- 0.11, N = 3 35.22 34.75 35.10 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
HPC Challenge HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ptrans Run 1 Run 2 Run 3 0.5654 1.1308 1.6962 2.2616 2.827 SE +/- 0.00379, N = 3 SE +/- 0.00766, N = 3 SE +/- 0.00646, N = 3 2.50276 2.51283 2.50070 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: EP-STREAM Triad Run 1 Run 2 Run 3 1.2475 2.495 3.7425 4.99 6.2375 SE +/- 0.07070, N = 3 SE +/- 0.09991, N = 3 SE +/- 0.01514, N = 3 5.34605 5.13392 5.54450 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Bandwidth Run 1 Run 2 Run 3 0.565 1.13 1.695 2.26 2.825 SE +/- 0.09513, N = 3 SE +/- 0.01108, N = 3 SE +/- 0.10589, N = 3 2.51129 2.36057 2.44927 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya Run 1 Run 2 Run 3 0.1305 0.261 0.3915 0.522 0.6525 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.58 0.58 0.58 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom Run 1 Run 2 Run 3 0.0878 0.1756 0.2634 0.3512 0.439 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.39 0.39 0.39 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets Run 1 Run 2 Run 3 0.1553 0.3106 0.4659 0.6212 0.7765 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.69 0.69 0.69 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID Run 1 Run 2 Run 3 0.1598 0.3196 0.4794 0.6392 0.799 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.71 0.71 0.71 1. (CXX) g++ options: -O3 -pthread
HPC Challenge HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.94, N = 9 SE +/- 1.20, N = 3 SE +/- 0.89, N = 9 86.19 86.51 86.66 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte Run 1 Run 2 Run 3 0.4692 0.9384 1.4076 1.8768 2.346 SE +/- 0.13748, N = 3 SE +/- 0.05507, N = 3 SE +/- 0.05394, N = 3 2.07387 2.08554 2.07874 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: EP-DGEMM Run 1 Run 2 Run 3 9 18 27 36 45 SE +/- 0.60, N = 3 SE +/- 0.40, N = 3 SE +/- 0.49, N = 3 37.23 37.38 36.87 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Random Access Run 1 Run 2 Run 3 0.0055 0.011 0.0165 0.022 0.0275 SE +/- 0.00028, N = 3 SE +/- 0.00072, N = 3 SE +/- 0.00023, N = 3 0.02410 0.02397 0.02441 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar Run 1 Run 2 Run 3 0.5119 1.0238 1.5357 2.0476 2.5595 SE +/- 0.002, N = 3 SE +/- 0.010, N = 3 SE +/- 0.000, N = 3 2.275 2.254 2.274
HPC Challenge HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Max Ping Pong Bandwidth Run 1 Run 2 Run 3 3K 6K 9K 12K 15K SE +/- 41.78, N = 3 SE +/- 19.51, N = 3 SE +/- 46.85, N = 3 12775.55 12753.06 12872.70 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed Run 1 Run 2 Run 3 1400 2800 4200 5600 7000 SE +/- 1.39, N = 3 SE +/- 3.86, N = 3 SE +/- 0.96, N = 3 6392.0 6384.4 6385.2 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed Run 1 Run 2 Run 3 10 20 30 40 50 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 43.21 43.21 43.24 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Run 1 Run 2 Run 3 1300 2600 3900 5200 6500 SE +/- 1.24, N = 3 SE +/- 2.97, N = 3 SE +/- 3.87, N = 3 6291.6 6285.0 6291.5 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed Run 1 Run 2 Run 3 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 42.29 42.30 42.28 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed Run 1 Run 2 Run 3 1300 2600 3900 5200 6500 SE +/- 2.81, N = 3 SE +/- 4.84, N = 3 SE +/- 3.26, N = 3 6298.2 6286.9 6295.4 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 19 Run 1 Run 2 Run 3 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 31.3 31.2 31.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time Run 1 Run 2 Run 3 2M 4M 6M 8M 10M SE +/- 125432.49, N = 3 SE +/- 55842.23, N = 3 SE +/- 41101.79, N = 3 10414229 10336902 10430427 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark Run 1 Run 2 Run 3 0.1278 0.2556 0.3834 0.5112 0.639 SE +/- 0.005, N = 3 SE +/- 0.006, N = 3 SE +/- 0.004, N = 3 0.568 0.568 0.564 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD Run 1 Run 2 Run 3 400K 800K 1200K 1600K 2000K SE +/- 8709.31, N = 3 SE +/- 5919.78, N = 3 SE +/- 19777.69, N = 4 1786977.08 1750815.54 1746188.19 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH Run 1 Run 2 Run 3 300K 600K 900K 1200K 1500K SE +/- 9622.95, N = 11 SE +/- 15165.81, N = 3 SE +/- 13263.10, N = 3 1314301.94 1311290.87 1323367.41 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET Run 1 Run 2 Run 3 400K 800K 1200K 1600K 2000K SE +/- 12019.14, N = 3 SE +/- 19126.07, N = 3 SE +/- 21525.04, N = 3 2022289.54 1965096.13 1960201.46 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET Run 1 Run 2 Run 3 300K 600K 900K 1200K 1500K SE +/- 8648.91, N = 3 SE +/- 16434.34, N = 3 SE +/- 14536.11, N = 7 1511980.13 1574421.25 1558674.52 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Node.js V8 Web Tooling Benchmark Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 8.90 8.79 8.89 1. Nodejs
v12.18.2
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Run 1 Run 2 130K 260K 390K 520K 650K SE +/- 357.11, N = 3 SE +/- 1039.69, N = 3 622110 619528
CLOMP CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup Run 1 Run 2 Run 3 0.27 0.54 0.81 1.08 1.35 SE +/- 0.01, N = 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 15 1.2 1.2 1.2 1. (CC) gcc options: -fopenmp -O3 -lm
BRL-CAD BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric Run 1 Run 2 15K 30K 45K 60K 75K 70291 70116 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 13.03 12.66 12.71 MIN: 6.76 MIN: 6.05 MIN: 5.94 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.09931, N = 3 SE +/- 0.03591, N = 3 SE +/- 0.04430, N = 3 10.05002 9.80732 10.09580 MIN: 8.56 MIN: 8.52 MIN: 8.61 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.06357, N = 3 SE +/- 0.03003, N = 3 SE +/- 0.04781, N = 3 9.32830 9.16745 9.21052 MIN: 3.98 MIN: 3.97 MIN: 3.97 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 0.8581 1.7162 2.5743 3.4324 4.2905 SE +/- 0.00333, N = 3 SE +/- 0.00200, N = 3 SE +/- 0.01990, N = 3 3.81378 3.77356 3.73038 MIN: 3.12 MIN: 3.18 MIN: 3.21 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 16.95 17.00 16.94 MIN: 14.04 MIN: 13.62 MIN: 13.98 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 19.09 19.06 19.14 MIN: 8.36 MIN: 8.04 MIN: 8.13 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 16.00 15.83 15.75 MIN: 12.18 MIN: 11.82 MIN: 11.87 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 17.17 17.21 17.24 MIN: 12.71 MIN: 13.27 MIN: 12.66 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.01, N = 3 SE +/- 0.18, N = 3 25.93 25.71 25.78 MIN: 9.72 MIN: 9.67 MIN: 9.78 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 12.76 12.67 12.62 MIN: 8.27 MIN: 8.03 MIN: 7.63 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 2K 4K 6K 8K 10K SE +/- 10.33, N = 3 SE +/- 4.29, N = 3 SE +/- 7.07, N = 3 11058.9 11071.2 11111.7 MIN: 10774.1 MIN: 10804.1 MIN: 10816.9 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 1600 3200 4800 6400 8000 SE +/- 20.69, N = 3 SE +/- 11.50, N = 3 SE +/- 6.04, N = 3 7394.07 7385.86 7418.02 MIN: 7102.56 MIN: 7090.33 MIN: 7150.1 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 2K 4K 6K 8K 10K SE +/- 8.16, N = 3 SE +/- 10.87, N = 3 SE +/- 11.56, N = 3 11071.4 11074.2 11063.8 MIN: 10803.3 MIN: 10796.5 MIN: 10764.6 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 1600 3200 4800 6400 8000 SE +/- 12.71, N = 3 SE +/- 0.26, N = 3 SE +/- 4.87, N = 3 7400.87 7404.31 7415.85 MIN: 7138.98 MIN: 7147.81 MIN: 7149.4 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.48864, N = 15 SE +/- 0.02084, N = 3 SE +/- 0.02417, N = 3 8.20994 7.18295 7.21289 MIN: 3.29 MIN: 3.37 MIN: 3.38 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Run 1 Run 2 Run 3 2K 4K 6K 8K 10K SE +/- 10.72, N = 3 SE +/- 26.91, N = 3 SE +/- 17.48, N = 3 11020.0 11062.6 11119.0 MIN: 10766 MIN: 10773.5 MIN: 10848.6 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Run 1 Run 2 Run 3 1600 3200 4800 6400 8000 SE +/- 7.28, N = 3 SE +/- 42.16, N = 3 SE +/- 14.92, N = 3 7391.52 7430.60 7414.46 MIN: 7132.96 MIN: 7131.37 MIN: 7129.34 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.03038, N = 3 SE +/- 0.01988, N = 3 SE +/- 0.03653, N = 3 9.47929 9.51403 9.63747 MIN: 4.26 MIN: 4.29 MIN: 4.31 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.43, N = 3 SE +/- 0.53, N = 3 SE +/- 0.88, N = 4 75.07 74.53 75.19 MIN: 52.17 / MAX: 259.67 MIN: 53.79 / MAX: 180.72 MIN: 52.93 / MAX: 204.62 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 Run 1 Run 2 Run 3 6 12 18 24 30 SE +/- 0.47, N = 3 SE +/- 0.64, N = 3 SE +/- 0.60, N = 4 27.04 27.22 27.56 MIN: 15.02 / MAX: 98.6 MIN: 6.36 / MAX: 114.32 MIN: 7.54 / MAX: 160.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.20, N = 3 SE +/- 0.10, N = 4 17.22 17.06 17.04 MIN: 9.81 / MAX: 65.85 MIN: 9.88 / MAX: 85.95 MIN: 9.51 / MAX: 72.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.29, N = 3 SE +/- 1.24, N = 3 SE +/- 0.25, N = 4 35.12 35.79 35.26 MIN: 20.41 / MAX: 103.78 MIN: 20.38 / MAX: 148.98 MIN: 17.61 / MAX: 123.06 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.75, N = 3 SE +/- 0.24, N = 4 17.03 18.14 17.23 MIN: 13.56 / MAX: 48.96 MIN: 5.59 / MAX: 157.6 MIN: 13.21 / MAX: 64.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.37, N = 3 SE +/- 0.38, N = 3 SE +/- 0.39, N = 4 35.75 35.16 35.71 MIN: 19.97 / MAX: 132.42 MIN: 21.2 / MAX: 117.05 MIN: 21.19 / MAX: 116.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.48, N = 3 SE +/- 0.72, N = 3 SE +/- 0.31, N = 4 9.11 9.01 8.70 MIN: 2.68 / MAX: 132.97 MIN: 2.68 / MAX: 92.41 MIN: 2.68 / MAX: 149.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet Run 1 Run 2 Run 3 16 32 48 64 80 SE +/- 0.42, N = 3 SE +/- 0.96, N = 3 SE +/- 0.44, N = 4 69.77 70.17 69.99 MIN: 40.57 / MAX: 187.95 MIN: 40.33 / MAX: 223.67 MIN: 40.48 / MAX: 183.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 Run 1 Run 2 Run 3 30 60 90 120 150 SE +/- 0.58, N = 3 SE +/- 0.47, N = 3 SE +/- 0.72, N = 4 131.95 131.74 131.81 MIN: 105.72 / MAX: 192.42 MIN: 104.27 / MAX: 191.62 MIN: 98.07 / MAX: 183.14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 Run 1 Run 2 Run 3 13 26 39 52 65 SE +/- 0.53, N = 3 SE +/- 0.38, N = 3 SE +/- 0.93, N = 4 55.65 56.42 57.44 MIN: 29.24 / MAX: 174.71 MIN: 29.6 / MAX: 196.71 MIN: 32.72 / MAX: 234.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.39, N = 3 SE +/- 0.17, N = 3 SE +/- 0.33, N = 4 35.40 35.20 35.43 MIN: 24.51 / MAX: 94.32 MIN: 25.77 / MAX: 83.7 MIN: 24.78 / MAX: 113.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 Run 1 Run 2 Run 3 40 80 120 160 200 SE +/- 0.53, N = 3 SE +/- 2.32, N = 3 SE +/- 1.71, N = 4 173.24 169.88 171.13 MIN: 100.86 / MAX: 302.11 MIN: 98.02 / MAX: 304.64 MIN: 94.76 / MAX: 337.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.35, N = 3 SE +/- 0.30, N = 3 SE +/- 0.35, N = 4 79.51 79.19 78.92 MIN: 57.15 / MAX: 128.6 MIN: 59.35 / MAX: 182.68 MIN: 59.82 / MAX: 157.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd Run 1 Run 2 Run 3 13 26 39 52 65 SE +/- 0.45, N = 3 SE +/- 0.40, N = 3 SE +/- 0.35, N = 4 59.71 60.05 59.78 MIN: 43.82 / MAX: 136.53 MIN: 43.01 / MAX: 117.51 MIN: 44.03 / MAX: 108.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m Run 1 Run 2 Run 3 13 26 39 52 65 SE +/- 0.08, N = 3 SE +/- 0.25, N = 3 SE +/- 0.54, N = 4 58.07 56.97 57.70 MIN: 51.31 / MAX: 195.89 MIN: 51.81 / MAX: 108.77 MIN: 51.85 / MAX: 196.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Build2 This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Run 1 Run 2 Run 3 60 120 180 240 300 SE +/- 0.63, N = 3 SE +/- 0.57, N = 3 SE +/- 0.98, N = 3 264.65 265.50 265.69
YafaRay YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Run 1 Run 2 Run 3 60 120 180 240 300 SE +/- 0.45, N = 3 SE +/- 0.37, N = 3 SE +/- 0.28, N = 3 272.34 271.98 272.40 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
DeepSpeech Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.34, N = 3 SE +/- 0.33, N = 3 SE +/- 0.23, N = 3 109.38 109.66 110.49
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.024, N = 5 SE +/- 0.038, N = 5 SE +/- 0.058, N = 25 9.462 9.511 9.585 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
eSpeak-NG Speech Engine This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis Run 1 Run 2 Run 3 9 18 27 36 45 SE +/- 0.53, N = 20 SE +/- 0.92, N = 20 SE +/- 0.84, N = 20 39.67 40.57 40.87 1. (CC) gcc options: -O2 -std=c99
RNNoise RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Run 1 Run 2 Run 3 6 12 18 24 30 SE +/- 0.21, N = 8 SE +/- 0.10, N = 3 SE +/- 0.28, N = 5 25.15 24.96 25.24 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 7.54 7.54 7.54 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 8.83 8.20 8.31 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough Run 1 Run 2 Run 3 12 24 36 48 60 SE +/- 0.19, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 55.32 52.22 52.17 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive Run 1 Run 2 Run 3 90 180 270 360 450 SE +/- 4.31, N = 6 SE +/- 0.16, N = 3 SE +/- 0.03, N = 3 427.41 422.82 422.91 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Basis Universal Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: ETC1S Run 1 Run 2 Run 3 15 30 45 60 75 SE +/- 0.29, N = 3 SE +/- 0.38, N = 3 SE +/- 0.50, N = 3 68.93 68.68 69.24 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 0 Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.11, N = 15 10.85 10.87 11.05 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 Run 1 Run 2 Run 3 13 26 39 52 65 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 58.98 58.95 58.99 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 3 Run 1 Run 2 Run 3 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 113.01 112.93 113.14 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 + RDO Post-Processing Run 1 Run 2 Run 3 200 400 600 800 1000 SE +/- 4.07, N = 3 SE +/- 1.57, N = 3 SE +/- 11.07, N = 3 957.17 966.40 962.78 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
HPC Challenge HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Latency Run 1 Run 2 Run 3 0.101 0.202 0.303 0.404 0.505 SE +/- 0.01254, N = 3 SE +/- 0.00111, N = 3 SE +/- 0.01324, N = 3 0.43412 0.44873 0.43635 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
Run 1 Processor: Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 120GB TOSHIBA TR150, Graphics: Zotac NVIDIA GeForce GTX 1050, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-LM + Intel I210
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: modesetting 1.20.9, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 3 January 2021 21:03 by user phoronix.
Run 2 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 4 January 2021 09:41 by user phoronix.
Run 3 Processor: Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 120GB TOSHIBA TR150, Graphics: Zotac NVIDIA NV137 2GB, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-LM + Intel I210
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: modesetting 1.20.9, OpenGL: 4.3 Mesa 20.2.1, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 4 January 2021 20:03 by user phoronix.