OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack O3: LTO GRAPHITE 1.0305 2.061 3.0915 4.122 5.1525 SE +/- 0.01, N = 3 4.58 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex O3: LTO GRAPHITE 0.873 1.746 2.619 3.492 4.365 SE +/- 0.01, N = 3 3.88 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls O3: LTO GRAPHITE 0.5445 1.089 1.6335 2.178 2.7225 SE +/- 0.00, N = 3 2.42 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh O3: LTO GRAPHITE 0.1868 0.3736 0.5604 0.7472 0.934 SE +/- 0.00, N = 3 0.83 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -rdynamic -lglut -lGL -lGLU
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Convex Trimesh O3: LTO GRAPHITE 0.2205 0.441 0.6615 0.882 1.1025 SE +/- 0.00, N = 3 0.98 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -rdynamic -lglut -lGL -lGLU
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance O3: LTO GRAPHITE 300K 600K 900K 1200K 1500K SE +/- 753.00, N = 5 1441074 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite O3: LTO GRAPHITE 500 1000 1500 2000 2500 SE +/- 1.01, N = 3 2455.22 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo O3: LTO GRAPHITE 500 1000 1500 2000 2500 SE +/- 0.03, N = 3 2119.91 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform O3: LTO GRAPHITE 130 260 390 520 650 SE +/- 4.94, N = 3 589.86 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply O3: LTO GRAPHITE 600 1200 1800 2400 3000 SE +/- 1.68, N = 3 2699.53 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization O3: LTO GRAPHITE 1000 2000 3000 4000 5000 SE +/- 5.77, N = 3 4644.94 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation O3: LTO GRAPHITE 500 1000 1500 2000 2500 SE +/- 0.69, N = 3 2221.87 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
CLOMP CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Speedup, More Is Better CLOMP 3.3 Static OMP Speedup O3: LTO GRAPHITE 0.378 0.756 1.134 1.512 1.89 SE +/- 0.02, N = 5 1.68 1. (CC) gcc options: --openmp -O3 -lm
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 O3: LTO GRAPHITE 4 8 12 16 20 SE +/- 0.05, N = 3 15.94 1. (CC) gcc options: -pthread -fvisibility=hidden -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype O3: LTO GRAPHITE 8 16 24 32 40 SE +/- 0.00, N = 3 34.28 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library O3: LTO GRAPHITE 60 120 180 240 300 SE +/- 0.11, N = 3 278.14 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 0.06, N = 3 982.96 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 0.01, N = 3 83.15 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.00, N = 3 13.32 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -std=c++11
Compile Bench Compilebench tries to age a filesystem by simulating some of the disk IO common in creating, compiling, patching, stating and reading kernel trees. It indirectly measures how well filesystems can maintain directory locality as the disk fills up and directories age. This current test is setup to use the makej mode with 10 initial directories Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile O3: LTO GRAPHITE 500 1000 1500 2000 2500 SE +/- 2.79, N = 3 2411.28
Compile Bench Compilebench tries to age a filesystem by simulating some of the disk IO common in creating, compiling, patching, stating and reading kernel trees. It indirectly measures how well filesystems can maintain directory locality as the disk fills up and directories age. This current test is setup to use the makej mode with 10 initial directories Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Read Compiled Tree O3: LTO GRAPHITE 600 1200 1800 2400 3000 SE +/- 8.94, N = 3 2664.85
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 512 MiB O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 1.87, N = 3 8794.03 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 1024 MiB O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 1.32, N = 3 8724.87 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 512 MiB O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 5.31, N = 3 8741.78 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 2.68, N = 3 8752.95 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 0.72, N = 3 8877.69 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 1.41, N = 3 8834.70 1. (CC) gcc options: -O3 -march=native
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 O3: LTO GRAPHITE 4 8 12 16 20 SE +/- 0.22, N = 3 16.23 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 O3: LTO GRAPHITE 1.2623 2.5246 3.7869 5.0492 6.3115 SE +/- 0.01, N = 3 5.61 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 2 - Type: Thread O3: LTO GRAPHITE 0.8978 1.7956 2.6934 3.5912 4.489 SE +/- 0.01, N = 3 3.99 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 4 - Type: Thread O3: LTO GRAPHITE 2 4 6 8 10 SE +/- 0.06, N = 3 6.89 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 8 - Type: Thread O3: LTO GRAPHITE 4 8 12 16 20 SE +/- 0.07, N = 3 15.61 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 1 - Type: Process O3: LTO GRAPHITE 0.5513 1.1026 1.6539 2.2052 2.7565 SE +/- 0.04, N = 3 2.45 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 16 - Type: Thread O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.24, N = 3 23.66 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 2 - Type: Process O3: LTO GRAPHITE 0.8595 1.719 2.5785 3.438 4.2975 SE +/- 0.01, N = 3 3.82 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 4 - Type: Process O3: LTO GRAPHITE 2 4 6 8 10 SE +/- 0.03, N = 3 6.55 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 8 - Type: Process O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.08, N = 3 12.29 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 16 - Type: Process O3: LTO GRAPHITE 5 10 15 20 25 SE +/- 0.03, N = 3 22.63 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 32 - Type: Process O3: LTO GRAPHITE 10 20 30 40 50 SE +/- 0.15, N = 3 43.38 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Bsearch O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 17.65, N = 3 14660.17 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Forking O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 390.13, N = 3 107783.35 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Hsearch O3: LTO GRAPHITE 30K 60K 90K 120K 150K SE +/- 290.11, N = 3 133631.06 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Tsearch O3: LTO GRAPHITE 90 180 270 360 450 SE +/- 0.36, N = 3 409.02 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Semaphores O3: LTO GRAPHITE 700K 1400K 2100K 2800K 3500K SE +/- 8727.38, N = 3 3416101.66 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Matrix Math O3: LTO GRAPHITE 40K 80K 120K 160K 200K SE +/- 108.18, N = 3 168551.81 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Vector Math O3: LTO GRAPHITE 12K 24K 36K 48K 60K SE +/- 5.15, N = 3 54561.84 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Memory Copying O3: LTO GRAPHITE 1100 2200 3300 4400 5500 SE +/- 9.53, N = 3 4993.79 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Socket Activity O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 29.37, N = 3 14552.99 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Context Switching O3: LTO GRAPHITE 4M 8M 12M 16M 20M SE +/- 109767.98, N = 3 18295536.34 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Glibc C String Functions O3: LTO GRAPHITE 160K 320K 480K 640K 800K SE +/- 15117.55, N = 3 751978.84 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Glibc Qsort Data Sorting O3: LTO GRAPHITE 80 160 240 320 400 SE +/- 0.12, N = 3 369.10 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: System V Message Passing O3: LTO GRAPHITE 2M 4M 6M 8M 10M SE +/- 6432.69, N = 3 8574802.25 1. (CC) gcc options: -O2 -std=gnu99 -lm -lbsd -lz -lcrypt -lrt -lpthread -laio -lc
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong O3: LTO GRAPHITE 0.7425 1.485 2.2275 2.97 3.7125 SE +/- 0.01, N = 5 3.30 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load O3: LTO GRAPHITE 7 14 21 28 35 SE +/- 0.30, N = 5 27.72 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Minion Minion is an open-source constraint solver that is designed to be very scalable. This test profile uses Minion's integrated benchmarking problems to solve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Minion 1.8 Benchmark: Solitaire O3: LTO GRAPHITE 14 28 42 56 70 SE +/- 0.80, N = 3 63.60 1. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Minion 1.8 Benchmark: Quasigroup O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 0.07, N = 3 100.35 1. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Decompression O3: LTO GRAPHITE 130 260 390 520 650 SE +/- 0.67, N = 3 614 1. (CXX) g++ options: -lrt -static -lpthread
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Compression O3: LTO GRAPHITE 50 100 150 200 250 SE +/- 0.33, N = 3 247 1. (CXX) g++ options: -lrt -static -lpthread
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Decompression O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 0.67, N = 3 1153 1. (CXX) g++ options: -lrt -static -lpthread
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: KASUMI - Decrypt O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 0.03, N = 3 95.40 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: AES-256 - Encrypt O3: LTO GRAPHITE 800 1600 2400 3200 4000 SE +/- 2.10, N = 3 3848.99 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: AES-256 - Decrypt O3: LTO GRAPHITE 800 1600 2400 3200 4000 SE +/- 4.73, N = 3 3856.35 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Twofish - Encrypt O3: LTO GRAPHITE 90 180 270 360 450 SE +/- 0.12, N = 3 401.22 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Twofish - Decrypt O3: LTO GRAPHITE 90 180 270 360 450 SE +/- 0.03, N = 3 397.31 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Blowfish - Encrypt O3: LTO GRAPHITE 60 120 180 240 300 SE +/- 0.02, N = 3 296.03 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Blowfish - Decrypt O3: LTO GRAPHITE 60 120 180 240 300 SE +/- 0.04, N = 3 294.64 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: CAST-256 - Encrypt O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 0.03, N = 3 145.76 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: CAST-256 - Decrypt O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 0.02, N = 3 145.71 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Java SciMark This test runs the Java version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This benchmark is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Composite O3: LTO GRAPHITE 500 1000 1500 2000 2500 SE +/- 35.78, N = 4 2273.87
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write O3: LTO GRAPHITE 11K 22K 33K 44K 55K SE +/- 12.98, N = 3 50799.87 1. (CC) gcc options: -lrt
glibc bench The GNU C Library project provides the core libraries for the GNU system and GNU/Linux systems, as well as many other systems that use Linux as the kernel. These libraries provide critical APIs including ISO C11, POSIX.1-2008, BSD, OS-specific APIs and more.
Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org nanoseconds, Fewer Is Better glibc bench 1.0 Benchmark: cos O3: LTO GRAPHITE 10 20 30 40 50 SE +/- 0.01, N = 3 45.21
BRL-CAD BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.28.0 VGR Performance Metric O3: LTO GRAPHITE 50K 100K 150K 200K 250K 220740 1. (CXX) g++ options: -std=c++98 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus and then to decode the generated Opus file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.0.1 WAV, Opus Encode, Opus Decode O3: LTO GRAPHITE 2 4 6 8 10 SE +/- 0.00, N = 5 8.45 1. (CC) gcc options: -O3 -ffast-math -fPIE -pie -logg -lopus -lm
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Sparse Matrix Multiply O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 0.18, N = 3 1410.39 1. (CC) gcc options: -lm -ldl
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Dense LU Matrix Factorization O3: LTO GRAPHITE 800 1600 2400 3200 4000 SE +/- 10.95, N = 3 3824.34 1. (CC) gcc options: -lm -ldl
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Jacobi Successive Over-Relaxation O3: LTO GRAPHITE 400 800 1200 1600 2000 SE +/- 0.25, N = 3 1654.38 1. (CC) gcc options: -lm -ldl
OpenBenchmarking.org MB/s, More Is Better Dbench 4.0 Client Count: 6 O3: LTO GRAPHITE 900 1800 2700 3600 4500 SE +/- 4.61, N = 3 4430.72 1. (CC) gcc options: -lpopt -O2
OpenBenchmarking.org MB/s, More Is Better Dbench 4.0 Client Count: 12 O3: LTO GRAPHITE 1500 3000 4500 6000 7500 SE +/- 7.60, N = 3 6845.49 1. (CC) gcc options: -lpopt -O2
OpenBenchmarking.org MB/s, More Is Better Dbench 4.0 Client Count: 48 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 6.64, N = 3 11148.73 1. (CC) gcc options: -lpopt -O2
OpenBenchmarking.org MB/s, More Is Better Dbench 4.0 Client Count: 128 O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 8.88, N = 3 12287.30 1. (CC) gcc options: -lpopt -O2
OpenBenchmarking.org MB/s, More Is Better Dbench 4.0 Client Count: 256 O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 25.18, N = 3 12130.43 1. (CC) gcc options: -lpopt -O2
OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 5000 Files, 1MB Size, 4 Threads O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 1.04, N = 3 1583.07 1. (CC) gcc options: -static
OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 4000 Files, 32 Sub Dirs, 1MB Size O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 0.54, N = 3 1017.03 1. (CC) gcc options: -static
OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 1000 Files, 1MB Size, No Sync/FSync O3: LTO GRAPHITE 600 1200 1800 2400 3000 SE +/- 2.77, N = 3 2597.30 1. (CC) gcc options: -static
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 1MB - File Size: 4GB - Disk Test: Read Performance O3: LTO GRAPHITE 1500 3000 4500 6000 7500 SE +/- 7.81, N = 3 6927.44 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 1MB - File Size: 8GB - Disk Test: Read Performance O3: LTO GRAPHITE 1500 3000 4500 6000 7500 SE +/- 111.61, N = 3 7181.03 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 2GB - Disk Test: Read Performance O3: LTO GRAPHITE 900 1800 2700 3600 4500 SE +/- 64.80, N = 5 4367.85 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 4GB - Disk Test: Read Performance O3: LTO GRAPHITE 900 1800 2700 3600 4500 SE +/- 43.06, N = 15 4419.03 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 8GB - Disk Test: Read Performance O3: LTO GRAPHITE 1000 2000 3000 4000 5000 SE +/- 67.79, N = 5 4492.24 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 1MB - File Size: 2GB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 1.38, N = 3 1271.13 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 1MB - File Size: 4GB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 0.93, N = 3 1273.29 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 1MB - File Size: 8GB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 3.01, N = 3 1278.63 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 2GB - Disk Test: Write Performance O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 5.90, N = 3 1123.79 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 4GB - Disk Test: Write Performance O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 1.01, N = 3 1138.48 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 8GB - Disk Test: Write Performance O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 4.28, N = 3 1121.56 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 64Kb - File Size: 2GB - Disk Test: Read Performance O3: LTO GRAPHITE 1600 3200 4800 6400 8000 SE +/- 34.13, N = 3 7387.96 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 64Kb - File Size: 4GB - Disk Test: Read Performance O3: LTO GRAPHITE 1600 3200 4800 6400 8000 SE +/- 26.45, N = 3 7475.49 1. (CC) gcc options: -O3
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 O3: LTO GRAPHITE 11 22 33 44 55 SE +/- 0.04, N = 3 48.45 MIN: 47.21 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8u8s32 O3: LTO GRAPHITE 0.1868 0.3736 0.5604 0.7472 0.934 SE +/- 0.00, N = 3 0.83 MIN: 0.81 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: f32 O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 0.05, N = 3 101.86 MIN: 100.76 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: u8s8u8s32 O3: LTO GRAPHITE 1600 3200 4800 6400 8000 SE +/- 16.91, N = 3 7338.52 MIN: 7300.56 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 64Kb - File Size: 2GB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 1.92, N = 3 1252.12 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 64Kb - File Size: 4GB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 1.54, N = 3 1256.26 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 64Kb - File Size: 8GB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 0.73, N = 3 1255.99 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 1MB - File Size: 512MB - Disk Test: Write Performance O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 6.40, N = 3 1234.07 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 4Kb - File Size: 512MB - Disk Test: Write Performance O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 14.46, N = 3 1086.15 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better IOzone 3.465 Record Size: 64Kb - File Size: 512MB - Disk Test: Read Performance O3: LTO GRAPHITE 1600 3200 4800 6400 8000 SE +/- 28.59, N = 3 7478.54 1. (CC) gcc options: -O3
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: u8s8u8s32 O3: LTO GRAPHITE 800 1600 2400 3200 4000 SE +/- 5.33, N = 3 3870.08 MIN: 3852.84 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: f32 O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.13, N = 3 13.17 MIN: 6.88 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: f32 O3: LTO GRAPHITE 0.324 0.648 0.972 1.296 1.62 SE +/- 0.01, N = 3 1.44 MIN: 1.39 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: f32 O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 0.26, N = 3 80.23 MIN: 43.93 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: f32 O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 0.14, N = 3 851.79 MIN: 840.37 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: f32 O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.01, N = 3 9.09 MIN: 8.8 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: u8s8f32s32 O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.32, N = 3 25.90 MIN: 15.14 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: u8s8u8s32 O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.07, N = 3 26.17 MIN: 15.27 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 187.56, N = 3 10948 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: f32 O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 15.97, N = 3 1178.70 MIN: 1102.64 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: u8s8u8s32 O3: LTO GRAPHITE 0.5265 1.053 1.5795 2.106 2.6325 SE +/- 0.02, N = 3 2.34 MIN: 1.23 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: u8s8f32s32 O3: LTO GRAPHITE 1600 3200 4800 6400 8000 SE +/- 15.80, N = 3 7267.40 MIN: 7236.2 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: u8s8f32s32 O3: LTO GRAPHITE 0.5378 1.0756 1.6134 2.1512 2.689 SE +/- 0.03, N = 6 2.39 MIN: 1.22 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: u8s8f32s32 O3: LTO GRAPHITE 800 1600 2400 3200 4000 SE +/- 0.40, N = 3 3914.75 MIN: 3906.92 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8u8s32 O3: LTO GRAPHITE 900 1800 2700 3600 4500 SE +/- 5.66, N = 3 4220.55 MIN: 4212.16 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 64 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 202.61, N = 15 9919.43 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8u8s32 O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.01, N = 3 27.21 MIN: 26.71 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: u8s8u8s32 O3: LTO GRAPHITE 1500 3000 4500 6000 7500 SE +/- 4.27, N = 3 7189.83 MIN: 7136.31 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32s32 O3: LTO GRAPHITE 15 30 45 60 75 SE +/- 0.04, N = 3 68.88 MIN: 68.06 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32s32 O3: LTO GRAPHITE 900 1800 2700 3600 4500 SE +/- 3.55, N = 3 4251.93 MIN: 4245.69 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32s32 O3: LTO GRAPHITE 0.1868 0.3736 0.5604 0.7472 0.934 SE +/- 0.00, N = 3 0.83 MIN: 0.81 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: u8s8u8s32 O3: LTO GRAPHITE 15 30 45 60 75 SE +/- 0.00, N = 3 67.35 MIN: 66.57 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32s32 O3: LTO GRAPHITE 7 14 21 28 35 SE +/- 0.00, N = 3 28.40 MIN: 27.81 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 55.49, N = 3 11189 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 64 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 35.22, N = 3 9606.77 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 128 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 45.35, N = 3 11360 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 256 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 127.43, N = 15 10681.77 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 512 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 154.32, N = 4 9159.88 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 128 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 131.32, N = 3 10688 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 256 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 71.54, N = 3 9825.87 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 512 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 71.91, N = 3 9794.07 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 248.36, N = 15 10554.23 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 2048 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 360.26, N = 12 9346.09 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 262.73, N = 15 10180.37 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 94.98, N = 3 9865.07 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 2048 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 9.47, N = 3 8036.30 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 O3: LTO GRAPHITE 1500 3000 4500 6000 7500 SE +/- 97.25, N = 3 6963.97 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 314.44, N = 3 18591 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 64 O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 40.70, N = 3 18142 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 O3: LTO GRAPHITE 9K 18K 27K 36K 45K SE +/- 755.86, N = 15 40435 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 64 O3: LTO GRAPHITE 8K 16K 24K 32K 40K SE +/- 788.49, N = 15 38435 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 128 O3: LTO GRAPHITE 5K 10K 15K 20K 25K SE +/- 117.74, N = 3 24074 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 256 O3: LTO GRAPHITE 8K 16K 24K 32K 40K SE +/- 775.41, N = 15 35379 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 512 O3: LTO GRAPHITE 10K 20K 30K 40K 50K SE +/- 450.46, N = 3 48892 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 128 O3: LTO GRAPHITE 7K 14K 21K 28K 35K SE +/- 42.06, N = 3 30799 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 256 O3: LTO GRAPHITE 5K 10K 15K 20K 25K SE +/- 297.45, N = 3 25201 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 512 O3: LTO GRAPHITE 5K 10K 15K 20K 25K SE +/- 397.83, N = 4 24882 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 1024 O3: LTO GRAPHITE 13K 26K 39K 52K 65K SE +/- 716.55, N = 3 62087 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 O3: LTO GRAPHITE 13K 26K 39K 52K 65K SE +/- 431.37, N = 3 60578 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 O3: LTO GRAPHITE 12K 24K 36K 48K 60K SE +/- 513.69, N = 3 55810 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
Mencoder This test uses mplayer's mencoder utility and the libavcodec family for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Mencoder 1.3.0 AVI To LAVC O3: LTO GRAPHITE 4 8 12 16 20 SE +/- 0.01, N = 3 15.75 1. (CC) gcc options: -ffast-math -fpie -pie -lncurses -lrt -lgnutls -lpng -lz -ljpeg -lasound -ldl -lpthread -lfribidi -lbz2 -llzo2 -lvorbisenc -lvorbis -logg -lmpg123 -lfaad -lopus -lvpx -rdynamic -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 1024 O3: LTO GRAPHITE 5K 10K 15K 20K 25K SE +/- 164.02, N = 3 24463 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 184.48, N = 3 20604 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 120.02, N = 3 18438 1. (CC) gcc options: -pthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search O3: LTO GRAPHITE 0.7695 1.539 2.3085 3.078 3.8475 SE +/- 0.02, N = 3 3.42 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lhmmer -lsquid -lm
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics O3: LTO GRAPHITE 0.3713 0.7426 1.1139 1.4852 1.8565 SE +/- 0.02, N = 3 1.65 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL BFS O3: LTO GRAPHITE 0.2003 0.4006 0.6009 0.8012 1.0015 SE +/- 0.01, N = 14 0.89 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL LBM O3: LTO GRAPHITE 5 10 15 20 25 SE +/- 0.03, N = 3 21.91 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM O3: LTO GRAPHITE 11 22 33 44 55 SE +/- 0.06, N = 3 47.75 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL TPACF O3: LTO GRAPHITE 0.333 0.666 0.999 1.332 1.665 SE +/- 0.02, N = 15 1.48 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP O3: LTO GRAPHITE 0.4343 0.8686 1.3029 1.7372 2.1715 SE +/- 0.01, N = 3 1.93 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil O3: LTO GRAPHITE 1.2848 2.5696 3.8544 5.1392 6.424 SE +/- 0.02, N = 3 5.71 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 1.94, N = 3 106.52 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.06, N = 3 13.41 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Myocyte O3: LTO GRAPHITE 11 22 33 44 55 SE +/- 0.14, N = 3 46.89 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Heartwall O3: LTO GRAPHITE 0.5828 1.1656 1.7484 2.3312 2.914 SE +/- 0.04, N = 14 2.59 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.04, N = 3 10.52 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.18, N = 15 11.25 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Particle Filter O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.06, N = 3 12.31 1. (CXX) g++ options: -O2 -lOpenCL
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.13b1 ATPase Simulation - 327,506 Atoms O3: LTO GRAPHITE 0.2018 0.4036 0.6054 0.8072 1.009 SE +/- 0.00063, N = 3 0.89674
FFTE FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better FFTE 6.0 Test: N=256, 1D Complex FFT Routine O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 1.88, N = 3 9086.92 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OSBench OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Files O3: LTO GRAPHITE 4 8 12 16 20 SE +/- 0.05, N = 3 16.87 1. (CC) gcc options: -lm
OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Threads O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.14, N = 3 11.18 1. (CC) gcc options: -lm
OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Launch Programs O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.09, N = 3 26.47 1. (CC) gcc options: -lm
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: f32 O3: LTO GRAPHITE 0.477 0.954 1.431 1.908 2.385 SE +/- 0.02, N = 3 2.12 MIN: 2.06 1. (CXX) g++ options: -march=native -O3 -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
OSBench OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Processes O3: LTO GRAPHITE 5 10 15 20 25 SE +/- 0.22, N = 3 22.16 1. (CC) gcc options: -lm
OpenBenchmarking.org Ns Per Event, Fewer Is Better OSBench Test: Memory Allocations O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 0.07, N = 3 78.69 1. (CC) gcc options: -lm
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 2 - Workers Per Message Thread: 4 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 1.09, N = 15 121 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 2 - Workers Per Message Thread: 6 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 1.68, N = 15 127 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 2 - Workers Per Message Thread: 8 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 2.10, N = 15 130 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 2 O3: LTO GRAPHITE 30 60 90 120 150 112 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 4 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 2.33, N = 3 123 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 6 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 1.33, N = 3 121 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 8 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 2.19, N = 3 124 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 2 O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 1.86, N = 3 106 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 4 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 1.24, N = 10 115 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 6 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 0.67, N = 3 120 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 8 O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 101.75, N = 3 12709 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 2 O3: LTO GRAPHITE 20 40 60 80 100 SE +/- 1.15, N = 3 109 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 4 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 2.08, N = 3 112 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 6 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 163.96, N = 15 11237 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 8 O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 310.22, N = 15 14067 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 2 O3: LTO GRAPHITE 20 40 60 80 100 104 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 4 O3: LTO GRAPHITE 2K 4K 6K 8K 10K SE +/- 32.00, N = 3 10928 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 6 O3: LTO GRAPHITE 5K 10K 15K 20K 25K SE +/- 396.36, N = 15 23315 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 8 O3: LTO GRAPHITE 9K 18K 27K 36K 45K SE +/- 298.67, N = 3 40299 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 2 - Workers Per Message Thread: 16 O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 2.19, N = 3 151 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 2 - Workers Per Message Thread: 24 O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 176.24, N = 3 13200 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 2 - Workers Per Message Thread: 32 O3: LTO GRAPHITE 5K 10K 15K 20K 25K SE +/- 743.32, N = 15 22854 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 2 O3: LTO GRAPHITE 1600 3200 4800 6400 8000 SE +/- 333.24, N = 15 7418 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 4 O3: LTO GRAPHITE 8K 16K 24K 32K 40K SE +/- 213.33, N = 3 37483 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 6 O3: LTO GRAPHITE 11K 22K 33K 44K 55K SE +/- 195.52, N = 3 51264 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 8 O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 1634.75, N = 3 89557 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 16 O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 305.90, N = 13 18720 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 24 O3: LTO GRAPHITE 7K 14K 21K 28K 35K SE +/- 159.29, N = 3 32661 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 4 - Workers Per Message Thread: 32 O3: LTO GRAPHITE 10K 20K 30K 40K 50K SE +/- 153.84, N = 3 48661 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 16 O3: LTO GRAPHITE 7K 14K 21K 28K 35K SE +/- 497.45, N = 5 32019 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 24 O3: LTO GRAPHITE 12K 24K 36K 48K 60K SE +/- 333.24, N = 3 55317 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 6 - Workers Per Message Thread: 32 O3: LTO GRAPHITE 16K 32K 48K 64K 80K SE +/- 597.33, N = 3 74709 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 16 O3: LTO GRAPHITE 10K 20K 30K 40K 50K SE +/- 597.33, N = 3 45419 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 24 O3: LTO GRAPHITE 16K 32K 48K 64K 80K SE +/- 532.91, N = 3 74880 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 8 - Workers Per Message Thread: 32 O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 1119.14, N = 3 100139 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 16 O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 923.02, N = 3 94848 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 24 O3: LTO GRAPHITE 30K 60K 90K 120K 150K SE +/- 1065.81, N = 3 134912 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 16 - Workers Per Message Thread: 32 O3: LTO GRAPHITE 40K 80K 120K 160K 200K SE +/- 1458.18, N = 3 172117 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 16 O3: LTO GRAPHITE 40K 80K 120K 160K 200K SE +/- 451.54, N = 3 169899 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 24 O3: LTO GRAPHITE 50K 100K 150K 200K 250K SE +/- 3256.11, N = 3 218539 1. (CC) gcc options: -O2 -lpthread
OpenBenchmarking.org usec, 99.9th Latency Percentile, Fewer Is Better Schbench Message Threads: 32 - Workers Per Message Thread: 32 O3: LTO GRAPHITE 60K 120K 180K 240K 300K SE +/- 1365.33, N = 3 278357 1. (CC) gcc options: -O2 -lpthread
PostMark This is a test of NetApp's PostMark benchmark designed to simulate small-file testing similar to the tasks endured by web and mail servers. This test profile will set PostMark to perform 25,000 transactions with 500 files simultaneously with the file sizes ranging between 5 and 512 kilobytes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostMark 1.51 Disk Transaction Performance O3: LTO GRAPHITE 1300 2600 3900 5200 6500 SE +/- 51.00, N = 3 6199 1. (CC) gcc options: -O3
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation O3: LTO GRAPHITE 3 6 9 12 15 SE +/- 0.03, N = 3 10.01 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lpthread
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time O3: LTO GRAPHITE 12M 24M 36M 48M 60M SE +/- 658261.42, N = 3 57890802 1. (CXX) g++ options: -m64 -lpthread -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 O3: LTO GRAPHITE 2 4 6 8 10 SE +/- 0.01, N = 3 8.59 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lz -llzma
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: deep O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 54.57, N = 3 19867 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -lcurl -lz -ljansson -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: lbry O3: LTO GRAPHITE 15K 30K 45K 60K 75K SE +/- 638.86, N = 3 68673 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -lcurl -lz -ljansson -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: skein O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 490.00, N = 3 79520 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -lcurl -lz -ljansson -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: myr-gr O3: LTO GRAPHITE 4K 8K 12K 16K 20K SE +/- 169.74, N = 3 17033 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -lcurl -lz -ljansson -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: sha256t O3: LTO GRAPHITE 30K 60K 90K 120K 150K SE +/- 15.28, N = 3 159440 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -lcurl -lz -ljansson -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org Events Per Second, More Is Better Sysbench 2018-07-28 Test: CPU O3: LTO GRAPHITE 10K 20K 30K 40K 50K SE +/- 0.41, N = 3 45516.79 1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=core2 -rdynamic -ldl -laio -lm
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.17, N = 3 27.54 1. (CC) gcc options: -lm -O3 -march=native -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl O3: LTO GRAPHITE 60 120 180 240 300 SE +/- 0.33, N = 3 280 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate O3: LTO GRAPHITE 60 120 180 240 300 281 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen O3: LTO GRAPHITE 50 100 150 200 250 SE +/- 0.67, N = 3 232 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced O3: LTO GRAPHITE 50 100 150 200 250 249 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing O3: LTO GRAPHITE 60 120 180 240 300 SE +/- 0.33, N = 3 297 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian O3: LTO GRAPHITE 50 100 150 200 250 SE +/- 0.67, N = 3 241 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space O3: LTO GRAPHITE 70 140 210 280 350 SE +/- 0.33, N = 3 308 1. (CC) gcc options: -fopenmp -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -pthread -lwebp -lwebpmux -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lgomp -lpthread
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 2019-02-17 1080p 8-bit YUV To VP9 Video Encode O3: LTO GRAPHITE 80 160 240 320 400 SE +/- 6.34, N = 3 379.88 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.5 1080p 8-bit YUV To AV1 Video Encode O3: LTO GRAPHITE 12 24 36 48 60 SE +/- 0.29, N = 3 55.21 1. (CXX) g++ options: -O3 -pie -lpthread -lm
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding O3: LTO GRAPHITE 13 26 39 52 65 SE +/- 0.04, N = 3 56.65 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -rdynamic -lpthread -lrt -ldl -lnuma
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel O3: LTO GRAPHITE 6 12 18 24 30 SE +/- 0.00, N = 3 25.72 1. (CC) gcc options: -lm -lpthread -O3 -march=native -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping O3: LTO GRAPHITE 200 400 600 800 1000 SE +/- 2.58, N = 3 791.41 1. (CXX) g++ options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lpthread -fopenmp -fwhole-program -lstdc++
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 2019-02-03 1080p 8-bit YUV To HEVC Video Encode O3: LTO GRAPHITE 80 160 240 320 400 SE +/- 3.48, N = 3 356.08 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -fPIE -fPIC -O2 -flto -fvisibility=hidden -pie -rdynamic -lpthread -lrt
FFmpeg This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 4.0.2 H.264 HD To NTSC DV O3: LTO GRAPHITE 1.1295 2.259 3.3885 4.518 5.6475 SE +/- 0.02, N = 3 5.02 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shm -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -pthread -ldl -lbz2 -llzma -lvdpau
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples O3: LTO GRAPHITE 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.02, N = 3 4.49 1. (CXX) g++ options: -fopenmp -O3 -march=native -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin
NAMD CUDA NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.13 ATPase Simulation - 327,506 Atoms O3: LTO GRAPHITE 0.075 0.15 0.225 0.3 0.375 SE +/- 0.00082, N = 3 0.33339
OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 5.21, N = 3 1208.96 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float O3: LTO GRAPHITE 900 1800 2700 3600 4500 SE +/- 47.07, N = 15 4058.09 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 0.08, N = 3 143.84 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth O3: LTO GRAPHITE 30 60 90 120 150 SE +/- 0.04, N = 3 144.59 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer O3: LTO GRAPHITE 2 4 6 8 10 SE +/- 0.00, N = 3 6.41 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer O3: LTO GRAPHITE 2 4 6 8 10 SE +/- 0.00, N = 3 6.10 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
BlogBench BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. The behavior is mimicked of that of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of these blogs generated are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read O3: LTO GRAPHITE 800K 1600K 2400K 3200K 4000K SE +/- 31735.66, N = 3 3759392 1. (CC) gcc options: -O2 -pthread
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Get O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 443.70, N = 3 113448.67 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Set O3: LTO GRAPHITE 16K 32K 48K 64K 80K SE +/- 122.11, N = 3 75642.77 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Append O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 167.80, N = 3 78062.17 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Delete O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 198.88, N = 3 114866.63 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Prepend O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 952.49, N = 3 79024.60 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Replace O3: LTO GRAPHITE 20K 40K 60K 80K 100K SE +/- 162.38, N = 3 79504.53 1. (CC) gcc options: -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lm -rdynamic
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving O3: LTO GRAPHITE 8K 16K 24K 32K 40K SE +/- 262.62, N = 3 37969.52 1. (CC) gcc options: -shared -fPIC -O2 -pthread
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance O3: LTO GRAPHITE 1300 2600 3900 5200 6500 SE +/- 3.89, N = 3 6074.87 1. (CC) gcc options: -pthread -m64 -march=native -O3 -pipe -fgraphite-identity -floop-nest-optimize -flto=36 -fuse-linker-plugin -lssl -lcrypto -ldl
OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 10 O3: LTO GRAPHITE 11K 22K 33K 44K 55K SE +/- 479.91, N = 12 49681.40 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 50 O3: LTO GRAPHITE 7K 14K 21K 28K 35K SE +/- 352.90, N = 3 34633.16 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 100 O3: LTO GRAPHITE 8K 16K 24K 32K 40K SE +/- 119.97, N = 3 39391.49 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 250 O3: LTO GRAPHITE 10K 20K 30K 40K 50K SE +/- 1935.27, N = 15 46017.75 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. The number of iterations used is 1,000,000. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite O3: LTO GRAPHITE 160K 320K 480K 640K 800K SE +/- 218.22, N = 3 760424
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD O3: LTO GRAPHITE 400K 800K 1200K 1600K 2000K SE +/- 27487.97, N = 15 1780026.28 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH O3: LTO GRAPHITE 300K 600K 900K 1200K 1500K SE +/- 28235.66, N = 15 1627936.02 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET O3: LTO GRAPHITE 500K 1000K 1500K 2000K 2500K SE +/- 35845.80, N = 3 2145587.21 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET O3: LTO GRAPHITE 400K 800K 1200K 1600K 2000K SE +/- 39433.50, N = 15 1662194.62 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Fhourstones This integer benchmark solves positions in the game of Connect-4, as played on a vertical 7x6 board. By default, it uses a 64Mb transposition table with the twobig replacement strategy. Positions are represented as 64-bit bitboards, and the hash function is computed using a single 64-bit modulo operation, giving 64-bit machines a slight edge. The alpha-beta searcher sorts moves dynamically based on the history heuristic. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Kpos / sec, More Is Better Fhourstones 3.1 Complex Connect-4 Solving O3: LTO GRAPHITE 3K 6K 9K 12K 15K SE +/- 12.43, N = 3 15221.10 1. (CC) gcc options: -O3
Git This test measures the time needed to carry out some sample Git operations on an example, static repository that happens to be a copy of the GNOME GTK tool-kit repository. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Git Time To Complete Common Git Commands O3: LTO GRAPHITE 1.0508 2.1016 3.1524 4.2032 5.254 SE +/- 0.00, N = 3 4.67 1. git version 2.21.0
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times O3: LTO GRAPHITE 300 600 900 1200 1500 SE +/- 1.67, N = 3 1212
O3: LTO GRAPHITE Processor: Intel Core i9-7980XE @ 4.40GHz (18 Cores / 36 Threads), Motherboard: ASRock X299 Taichi XE (P1.60 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 129024MB, Disk: 960GB INTEL SSDPED1D960GAY + 2000GB Western Digital WD20EARS-00S + 2000GB Seagate ST2000DM001-1CH1 + 3 x 2000GB Seagate ST2000DM001-1ER1 + 2000GB Western Digital WD20EARS-00M + 250GB HITACHI HTS54322, Graphics: eVGA NVIDIA GeForce GTX 970 4GB, Audio: Realtek ALC1220, Network: Intel I219-V + Intel I211 + Intel Dual Band-AC 3168NGW
OS: Gentoo/Linux, Kernel: 5.0.17-gentoo (x86_64), Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Vulkan: 1.1.99, Compiler: GCC 9.1.0 + Clang 8.0.0 + LLVM 8.0.0 + CUDA 10.1, File-System: ext4, Screen Resolution: 1920x1080
Environment Notes: FFLAGS=-march=native-O3-pipe-fgraphite-identity-floop-nest-optimize-flto=36 CXXFLAGS=-march=native-O3-pipe-flto=36 LDFLAGS=-march=native-O3-pipe-flto=36-Wl-O1-Wl--as-needed FCFLAGS=-march=native-O3-pipe-fgraphite-identity-floop-nest-optimize-flto=36 CFLAGS=-march=native-O3-pipe-fgraphite-identity-floop-nest-optimize-flto=36Compiler Notes: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/9.1.0 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/9.1.0 --disable-altivec --disable-esp --disable-fixed-point --disable-isl-version-check --disable-libmudflap --disable-libssp --disable-systemtap --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-languages=c,c++,d,fortran --enable-libgomp --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --enable-vtable-verify --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/9.1.0/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/9.1.0/man --with-build-config=bootstrap-lto --with-isl --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/9.1.0/pythonDisk Notes: NONE / noatime,rwProcessor Notes: Scaling Governor: intel_pstate performanceJava Notes: OpenJDK Runtime Environment (IcedTea 3.12.0) (Gentoo icedtea-3.12.0)Python Notes: Python 3.6.6Security Notes: l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
Testing initiated at 30 June 2019 15:50 by user wolfwood.