Intel Core i7-6800K testing with a MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) and Zotac NVIDIA NV137 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
1 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
2 3 4 Processor: Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 120GB TOSHIBA TR150, Graphics: Zotac NVIDIA NV137 2GB, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-LM + Intel I210
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: nouveau, OpenGL: 4.3 Mesa 20.2.1, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1 3 4 2 60M 120M 180M 240M 300M SE +/- 1777861.64, N = 3 SE +/- 258877.10, N = 3 SE +/- 197532.95, N = 3 SE +/- 435487.00, N = 3 270577233 274249833 274539633 274610300 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 2 1 4 80 160 240 320 400 SE +/- 0.96, N = 3 SE +/- 0.96, N = 3 SE +/- 0.26, N = 3 374.53 376.11 377.25 MIN: 277.99 / MAX: 569.54 MIN: 279 / MAX: 582.19 MIN: 279.46 / MAX: 578.3 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 4 2 1 20 40 60 80 100 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 SE +/- 0.13, N = 3 109.01 109.16 109.20 MIN: 103.01 / MAX: 122.86 MIN: 103.11 / MAX: 122.63 MIN: 103.03 / MAX: 122.93 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 1 2 4 70 140 210 280 350 SE +/- 0.56, N = 3 SE +/- 1.35, N = 3 SE +/- 0.77, N = 3 334.36 336.60 336.75 MIN: 282.32 / MAX: 365.25 MIN: 292.11 / MAX: 366.71 MIN: 293.79 / MAX: 367.96 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 2 1 4 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 68.49 68.72 68.73 MIN: 44.63 / MAX: 171.9 MIN: 44.63 / MAX: 173.21 MIN: 44.64 / MAX: 172.53 1. (CC) gcc options: -pthread
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 4 0.2147 0.4294 0.6441 0.8588 1.0735 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 SE +/- 0.003, N = 3 0.946 0.948 0.954
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 4 2 0.281 0.562 0.843 1.124 1.405 SE +/- 0.010, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.237 1.245 1.249
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 4 2 1 0.6044 1.2088 1.8132 2.4176 3.022 SE +/- 0.024, N = 3 SE +/- 0.006, N = 3 SE +/- 0.028, N = 3 2.677 2.683 2.686
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 4 60 120 180 240 300 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 277 277 277 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 4 1 2 110 220 330 440 550 SE +/- 0.33, N = 3 SE +/- 0.44, N = 3 SE +/- 0.29, N = 3 487 488 488 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 4 10 20 30 40 50 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 45 45 45 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 4 2K 4K 6K 8K 10K SE +/- 13.87, N = 3 SE +/- 8.91, N = 3 SE +/- 17.32, N = 3 10369 10372 10382 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 4 2 700 1400 2100 2800 3500 SE +/- 8.93, N = 3 SE +/- 5.22, N = 3 SE +/- 1.32, N = 3 3483 3488 3489 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 4 1 2 40 80 120 160 200 SE +/- 0.23, N = 3 SE +/- 0.14, N = 3 SE +/- 0.36, N = 3 201.21 201.88 202.70 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 4 20 40 60 80 100 SE +/- 0.33, N = 3 105 105 105 106 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 4 100 200 300 400 500 SE +/- 1.20, N = 3 SE +/- 1.00, N = 3 441 444 445 445 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 4 3 1 2 300 600 900 1200 1500 SE +/- 13.53, N = 3 SE +/- 1.15, N = 3 SE +/- 3.21, N = 3 SE +/- 1.76, N = 3 1538 1553 1555 1558 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 4 20 40 60 80 100 75 75 75 75 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 4 2 3 1 300 600 900 1200 1500 SE +/- 2.85, N = 3 SE +/- 0.88, N = 3 SE +/- 2.85, N = 3 1606 1607 1607 1610 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 4 1 2 3 20 40 60 80 100 SE +/- 0.88, N = 3 SE +/- 1.00, N = 3 SE +/- 1.20, N = 3 SE +/- 0.67, N = 3 90 91 91 91 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 4 100 200 300 400 500 483 483 483 483 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 4 2 3 90 180 270 360 450 412 413 414 414 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 2 4 3 1 120 240 360 480 600 SE +/- 4.58, N = 3 SE +/- 1.86, N = 3 SE +/- 0.67, N = 3 572 575 576 577 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 4 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 160 160 160 160 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 4 140 280 420 560 700 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 668 669 669 669 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 4 40 80 120 160 200 199 199 199 199 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 4 3 2 1 400 800 1200 1600 2000 SE +/- 18.39, N = 7 SE +/- 13.38, N = 13 SE +/- 14.42, N = 12 SE +/- 4.27, N = 3 2031.9 2037.4 2040.1 2054.6 1. (CXX) g++ options: -O3 -march=native -rdynamic
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 4 1 2 300 600 900 1200 1500 SE +/- 0.88, N = 3 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 1263.04 1264.54 1265.38 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 4 1 2 400 800 1200 1600 2000 SE +/- 1.00, N = 3 SE +/- 1.21, N = 3 SE +/- 0.96, N = 3 1627.93 1629.03 1630.14 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 2 4 1 300 600 900 1200 1500 SE +/- 9.44, N = 3 SE +/- 15.91, N = 4 SE +/- 4.00, N = 3 1313.90 1348.70 1358.47 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 2 1 4 400 800 1200 1600 2000 SE +/- 4.84, N = 3 SE +/- 5.06, N = 3 SE +/- 0.00, N = 4 1967.43 2012.03 2017.09 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 4 1 2 200 400 600 800 1000 SE +/- 17.94, N = 15 SE +/- 25.87, N = 15 SE +/- 31.02, N = 12 1102.36 1108.00 1128.17 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 4 2 1 300 600 900 1200 1500 SE +/- 40.34, N = 15 SE +/- 47.82, N = 12 SE +/- 45.47, N = 15 1388.83 1417.70 1424.62 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Etcpak Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 4 3 2 200 400 600 800 1000 SE +/- 1.85, N = 3 SE +/- 2.77, N = 3 SE +/- 0.94, N = 3 SE +/- 2.95, N = 3 1118.28 1122.17 1123.19 1123.24 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 3 1 4 2 60 120 180 240 300 SE +/- 0.29, N = 3 SE +/- 0.20, N = 3 SE +/- 0.32, N = 3 SE +/- 0.14, N = 3 267.13 267.30 268.42 268.83 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 3 2 4 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.17, N = 3 SE +/- 0.35, N = 3 SE +/- 0.11, N = 3 150.91 150.95 150.98 151.62 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 4 2 1 3 60 120 180 240 300 SE +/- 0.13, N = 3 SE +/- 0.23, N = 3 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 250.29 250.38 251.45 251.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 4 2 1 500K 1000K 1500K 2000K 2500K SE +/- 7518.77, N = 3 SE +/- 2253.77, N = 3 SE +/- 5765.15, N = 3 1439625.16 1476177.92 2360946.67 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 4 1 2 400K 800K 1200K 1600K 2000K SE +/- 10271.92, N = 3 SE +/- 22404.68, N = 4 SE +/- 15518.13, N = 3 1831314.21 1858095.32 1875200.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 4 2 1 300K 600K 900K 1200K 1500K SE +/- 5529.55, N = 3 SE +/- 14301.64, N = 3 SE +/- 6897.86, N = 3 1374636.71 1375021.75 1391544.79 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 2 4 1 500K 1000K 1500K 2000K 2500K SE +/- 25808.83, N = 3 SE +/- 17441.42, N = 3 SE +/- 33099.62, N = 12 1902703.00 1940969.00 2169857.05 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 4 2 1 300K 600K 900K 1200K 1500K SE +/- 5141.37, N = 3 SE +/- 5724.28, N = 3 SE +/- 20237.27, N = 3 1418826.16 1419988.83 1622230.00 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 4 1 2 8M 16M 24M 32M 40M SE +/- 223603.64, N = 3 SE +/- 109535.42, N = 3 SE +/- 50361.23, N = 3 37965963 37977043 38369580 1. (CXX) g++ options: -O3 -fopenmp
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 3 2 4 140 280 420 560 700 SE +/- 13.91, N = 15 SE +/- 10.09, N = 15 SE +/- 10.35, N = 15 SE +/- 8.17, N = 15 617.96 619.24 627.12 636.84 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 3 4 1 2 150 300 450 600 750 SE +/- 5.42, N = 9 SE +/- 0.75, N = 3 SE +/- 3.26, N = 3 SE +/- 4.51, N = 3 671.75 672.86 679.72 680.36 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 3 4 1 2 4K 8K 12K 16K 20K SE +/- 99.96, N = 3 SE +/- 71.12, N = 3 SE +/- 88.88, N = 3 SE +/- 223.23, N = 3 17412.50 17463.90 17528.76 17588.79 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 2 1 4 130 260 390 520 650 SE +/- 0.24, N = 3 SE +/- 0.38, N = 3 SE +/- 0.82, N = 3 614.04 614.33 615.49 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 2 4 1 3 20 40 60 80 100 SE +/- 0.93, N = 3 SE +/- 0.75, N = 10 SE +/- 0.04, N = 3 SE +/- 1.16, N = 3 97.78 97.29 96.92 93.68 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 4 1 2 14K 28K 42K 56K 70K SE +/- 656.55, N = 3 SE +/- 484.36, N = 10 SE +/- 833.85, N = 3 63499.07 63321.80 62223.02 1. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 2 4 1 20K 40K 60K 80K 100K SE +/- 681.69, N = 3 SE +/- 836.51, N = 6 SE +/- 782.31, N = 8 90365.16 90116.08 89486.11 1. (CXX) g++ options: -O3 -march=native -fopenmp
Mobile Neural Network MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 4 1 2 2 4 6 8 10 SE +/- 0.090, N = 3 SE +/- 0.074, N = 3 SE +/- 0.072, N = 3 7.266 7.245 7.226 MIN: 6.78 / MAX: 26.54 MIN: 6.76 / MAX: 25.21 MIN: 6.79 / MAX: 26.4 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 4 2 11 22 33 44 55 SE +/- 0.48, N = 3 SE +/- 1.13, N = 3 SE +/- 0.24, N = 3 47.85 47.60 46.60 MIN: 37.96 / MAX: 79.71 MIN: 37.37 / MAX: 79.49 MIN: 36.23 / MAX: 79.17 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 4 0.9527 1.9054 2.8581 3.8108 4.7635 SE +/- 0.162, N = 3 SE +/- 0.183, N = 3 SE +/- 0.075, N = 3 4.234 4.166 4.030 MIN: 3.64 / MAX: 22.78 MIN: 3.64 / MAX: 25 MIN: 3.63 / MAX: 18.94 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 2 4 1 1.2447 2.4894 3.7341 4.9788 6.2235 SE +/- 0.446, N = 3 SE +/- 0.059, N = 3 SE +/- 0.027, N = 3 5.532 4.914 4.912 MIN: 4.23 / MAX: 16.41 MIN: 4.26 / MAX: 23.95 MIN: 4.54 / MAX: 22.9 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 2 4 1 12 24 36 48 60 SE +/- 0.99, N = 3 SE +/- 0.52, N = 3 SE +/- 0.36, N = 3 55.07 53.76 52.51 MIN: 46.94 / MAX: 90.47 MIN: 49.79 / MAX: 90.86 MIN: 49.16 / MAX: 82.48 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 4 2 1 70 140 210 280 350 SE +/- 0.88, N = 3 SE +/- 0.46, N = 3 SE +/- 0.21, N = 3 302.31 301.04 300.94 MIN: 299.23 / MAX: 325.81 MIN: 298.88 / MAX: 315.68 MIN: 298.58 / MAX: 308.32 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 4 1 2 60 120 180 240 300 SE +/- 0.42, N = 3 SE +/- 0.11, N = 3 SE +/- 0.60, N = 3 294.82 294.64 294.40 MIN: 292.2 / MAX: 297.29 MIN: 292.53 / MAX: 302.03 MIN: 291.65 / MAX: 305.43 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 3 4 2 30 60 90 120 150 SE +/- 0.20, N = 3 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.13, N = 3 135.83 133.01 132.99 132.89 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OpenFOAM OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 1 3 2 4 70 140 210 280 350 SE +/- 0.57, N = 3 SE +/- 0.48, N = 3 SE +/- 0.52, N = 3 SE +/- 0.32, N = 3 305.71 302.33 301.03 300.80 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M 4 2 1 3 300 600 900 1200 1500 SE +/- 1.99, N = 3 SE +/- 0.58, N = 3 SE +/- 2.04, N = 3 SE +/- 2.24, N = 3 1182.70 1180.39 1180.20 1178.12 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
Quantum ESPRESSO Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.7 Input: AUSURF112 3 2 1 4 500 1000 1500 2000 2500 SE +/- 23.10, N = 5 SE +/- 18.79, N = 3 SE +/- 15.22, N = 3 SE +/- 25.11, N = 4 2244.69 2237.08 2234.35 2231.47 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Gcrypt Library Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 2 4 1 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 SE +/- 0.47, N = 3 242.66 242.26 242.02 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 4 2 4 6 8 10 SE +/- 0.010, N = 3 SE +/- 0.044, N = 3 SE +/- 0.052, N = 3 7.442 7.380 7.352 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 2 1 4 90 180 270 360 450 SE +/- 1.25, N = 3 SE +/- 0.32, N = 3 SE +/- 0.64, N = 3 423.76 420.87 420.40 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 4 1 2 160 320 480 640 800 SE +/- 1.31, N = 3 SE +/- 0.90, N = 3 SE +/- 1.11, N = 3 763.75 763.42 761.96 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 4 2 1 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 21.08 21.07 21.06 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 4 300 600 900 1200 1500 SE +/- 0.15, N = 3 SE +/- 0.70, N = 3 SE +/- 0.37, N = 3 1366.81 1366.26 1364.60 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 2 3 1 4 11 22 33 44 55 SE +/- 0.29, N = 3 SE +/- 0.63, N = 3 SE +/- 0.12, N = 3 SE +/- 0.52, N = 15 50.05 48.76 48.63 47.81 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
1 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 5 February 2021 06:42 by user phoronix.
2 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 5 February 2021 20:46 by user phoronix.
3 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 6 February 2021 11:00 by user phoronix.
4 Processor: Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 120GB TOSHIBA TR150, Graphics: Zotac NVIDIA NV137 2GB, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-LM + Intel I210
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: nouveau, OpenGL: 4.3 Mesa 20.2.1, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 6 February 2021 20:02 by user phoronix.