Intel Core i7-6800K GCC7 compiler benchmarks. Tests by Michael Larabel for a future article on phoronix.
GCC 7.0.0 20170108 Processor: Intel Core i7-6800K @ 3.80GHz (12 Cores), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0, Chipset: Intel Xeon E7 v4/Xeon, Memory: 16384MB, Disk: 120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150, Graphics: NVIDIA GeForce GTX TITAN X 12288MB, Audio: Realtek ALC1150, Network: Intel Connection
OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 7.0.0 20170108, File-System: ext4, Screen Resolution: 2560x1440
Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortranDisk Notes: DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: intel_pstate powersaveOpenCL Notes: GPU Compute Cores: 3072System Notes: Python 2.7.12. GPU Compute Cores: 3072.
GCC 6.3.0 OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 6.3.0, File-System: ext4, Screen Resolution: 2560x1440
GCC 5.4.0 OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 5.4.0, File-System: ext4, Screen Resolution: 2560x1440
GCC 4.9.4 OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 4.9.4, File-System: ext4, Screen Resolution: 2560x1440
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 6K 12K 18K 24K 30K SE +/- 29.98, N = 3 SE +/- 131.18, N = 3 SE +/- 74.95, N = 3 SE +/- 0.87, N = 3 24914.48 24744.45 25731.70 13528.14 1. (CC) gcc options: -lrt
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 11 22 33 44 55 SE +/- 0.25, N = 3 SE +/- 0.20, N = 3 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 48.93 46.23 34.94 36.40 1. (CXX) g++ options: -O2 -lOpenCL
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 700 1400 2100 2800 3500 SE +/- 6.68, N = 4 SE +/- 6.90, N = 4 SE +/- 0.34, N = 4 SE +/- 12.21, N = 4 2791.88 2747.54 2726.62 3349.49 1. (CXX) g++ options: -O3 -march=native
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 600 1200 1800 2400 3000 SE +/- 2.79, N = 4 SE +/- 6.54, N = 4 SE +/- 4.07, N = 4 SE +/- 8.80, N = 4 2565.53 2172.90 2567.75 2587.40 1. (CXX) g++ options: -O3 -march=native
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 30 60 90 120 150 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 133.70 133.74 114.22 121.89 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 15 30 45 60 75 SE +/- 0.12, N = 3 SE +/- 0.18, N = 3 SE +/- 0.13, N = 3 SE +/- 0.12, N = 3 66.28 58.88 57.15 56.64 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 300 600 900 1200 1500 SE +/- 5.44, N = 4 SE +/- 1.39, N = 4 SE +/- 0.76, N = 4 SE +/- 5.61, N = 4 1504.18 1422.54 1497.82 1627.73 1. (CXX) g++ options: -O3 -march=native
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.37 12.28 12.23 13.94 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Caffe This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2016-12-29 Build: CPU AlexNet GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 90K 180K 270K 360K 450K SE +/- 268.06, N = 3 SE +/- 302.65, N = 3 SE +/- 66.71, N = 3 397203 360816 363428 1. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Blur GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 1.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 132 139 140 144 -std=gnu99 -ldl -ldl -ldl 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljasper -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
Timed Linux Kernel Compilation OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 4.9 Time To Compile GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 20 40 60 80 100 SE +/- 1.00, N = 3 SE +/- 0.86, N = 3 SE +/- 0.87, N = 3 SE +/- 0.51, N = 3 87.56 87.46 93.28 95.37
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 10.19 9.71 9.35 9.35 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 6 12 18 24 30 SE +/- 0.49, N = 3 SE +/- 0.39, N = 5 SE +/- 0.26, N = 3 SE +/- 0.10, N = 3 24.35 25.38 23.89 24.66 1. (CC) gcc options: -pthread -O3 -march=native
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3a Phong Rendering With Soft-Shadow Mapping GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 70 140 210 280 350 SE +/- 0.56, N = 3 SE +/- 0.53, N = 3 SE +/- 0.22, N = 3 SE +/- 0.91, N = 3 293.67 299.83 303.71 311.66 1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.4 Build: Float + SSE - Size: 2D FFT Size 2048 GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 3K 6K 9K 12K 15K SE +/- 43.00, N = 5 SE +/- 47.65, N = 5 SE +/- 56.20, N = 5 SE +/- 102.73, N = 5 13368 13835 14111 14021 -std=gnu99 1. (CC) gcc options: -O3 -march=native -lm
Timed MAFFT Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 0.8348 1.6696 2.5044 3.3392 4.174 SE +/- 0.02, N = 3 SE +/- 0.06, N = 6 SE +/- 0.09, N = 6 SE +/- 0.06, N = 5 3.52 3.59 3.65 3.71 1. (CC) gcc options: -O3 -lm -lpthread
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: HWB Color Space GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 40 80 120 160 200 SE +/- 0.88, N = 3 SE +/- 0.00, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 194 200 204 204 -std=gnu99 -ldl -ldl -ldl 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljasper -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
FFmpeg OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 2.8.1 H.264 HD To NTSC DV GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.10, N = 3 17.62 17.58 17.66 16.98 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lxcb -lxcb-shm -lxcb-xfixes -lxcb-render -lxcb-shape -lasound -lSDL -lm -llzma -lbz2 -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize
LAME MP3 Encoding OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 3 6 9 12 15 SE +/- 0.00, N = 5 SE +/- 0.12, N = 5 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 10.29 10.68 10.45 10.60 -fomit-frame-pointer -funroll-loops -funroll-loops -funroll-loops 1. (CC) gcc options: -O3 -ffast-math -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 300K 600K 900K 1200K 1500K SE +/- 682.67, N = 5 SE +/- 550.00, N = 5 SE +/- 0.00, N = 5 SE +/- 0.00, N = 4 1239375 1231615 1269073 1226677 1. (CC) gcc options: -O3 -march=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Resizing GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 180 183 184 186 -std=gnu99 -ldl -ldl -ldl 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljasper -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
Timed PHP Compilation OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 5.2.9 Time To Compile GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 19.46 18.96 19.53 19.57 1. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
Crafty OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 16 32 48 64 80 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 70.12 70.20 68.99 68.14 1. (CC) gcc options: -lstdc++ -lm
Compile Bench Compilebench tries to age a filesystem by simulating some of the disk IO common in creating, compiling, patching, stating and reading kernel trees. It indirectly measures how well filesystems can maintain directory locality as the disk fills up and directories age. This current test is setup to use the makej mode with 10 initial directories Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 160 320 480 640 800 SE +/- 7.38, N = 3 SE +/- 5.84, N = 3 SE +/- 4.90, N = 3 SE +/- 10.53, N = 6 735.72 730.55 738.39 716.88
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Sharpen GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 134 135 138 137 -std=gnu99 -ldl -ldl -ldl 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljasper -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
Multichase Pointer Chaser OpenBenchmarking.org ns, Fewer Is Better Multichase Pointer Chaser Test: 256MB Array, 256 Byte Stride GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 13 26 39 52 65 SE +/- 0.83, N = 3 SE +/- 0.93, N = 3 SE +/- 1.35, N = 6 SE +/- 0.98, N = 3 56.82 58.42 57.38 58.49 1. (CC) gcc options: -O2 -static -pthread -lrt
HPC Challenge OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.4.3 Test / Class: G-Ffte GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 2 4 6 8 10 SE +/- 0.03120, N = 3 SE +/- 0.05490, N = 3 SE +/- 0.01518, N = 3 SE +/- 0.07723, N = 3 6.38503 6.44751 6.56845 6.39207 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.10.2
Minion Minion is an open-source constraint solver that is designed to be very scalable. This test profile uses Minion's integrated benchmarking problems to solve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Minion 1.8 Benchmark: Quasigroup GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 SE +/- 1.92, N = 3 136.80 139.77 138.36 140.72 1. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic
Stockfish OpenBenchmarking.org ms, Fewer Is Better Stockfish 2014-11-26 Total Time GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 700 1400 2100 2800 3500 SE +/- 9.24, N = 3 SE +/- 24.50, N = 3 SE +/- 0.88, N = 3 SE +/- 19.50, N = 3 3386 3479 3462 3419 1. (CXX) g++ options: -lpthread -O3 -march=native -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt -flto
Minion Minion is an open-source constraint solver that is designed to be very scalable. This test profile uses Minion's integrated benchmarking problems to solve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Minion 1.8 Benchmark: Solitaire GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 20 40 60 80 100 SE +/- 0.30, N = 3 SE +/- 0.28, N = 3 SE +/- 0.22, N = 3 SE +/- 0.20, N = 3 85.22 84.99 83.48 85.76 1. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic
Caffe This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2016-12-29 Build: CPU Googlenet GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 200K 400K 600K 800K 1000K SE +/- 606.01, N = 3 SE +/- 248.34, N = 3 SE +/- 634.40, N = 3 764929 780157 784263 1. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 4 8 12 16 20 SE +/- 0.16, N = 3 SE +/- 0.25, N = 3 SE +/- 0.11, N = 3 SE +/- 0.13, N = 3 15.29 15.35 15.10 14.99 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.34, N = 6 18.65 18.79 18.52 18.95 1. (CXX) g++ options: -O2 -lOpenCL
Multichase Pointer Chaser OpenBenchmarking.org ns, Fewer Is Better Multichase Pointer Chaser Test: 1GB Array, 256 Byte Stride, 4 Threads GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 15 30 45 60 75 SE +/- 0.95, N = 3 SE +/- 0.17, N = 3 SE +/- 1.11, N = 4 SE +/- 0.45, N = 3 65.35 66.14 64.79 65.63 1. (CC) gcc options: -O2 -static -pthread -lrt
HPC Challenge OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.4.3 Test / Class: G-Ptrans GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 0.938 1.876 2.814 3.752 4.69 SE +/- 0.03617, N = 3 SE +/- 0.02941, N = 3 SE +/- 0.03550, N = 3 SE +/- 0.02840, N = 3 4.15159 4.08727 4.16873 4.15386 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.10.2
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.7 Static Web Page Serving GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 10K 20K 30K 40K 50K SE +/- 153.19, N = 3 SE +/- 161.65, N = 3 SE +/- 77.65, N = 3 SE +/- 31.18, N = 3 45057.24 44610.32 44730.44 45489.99 1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native
Minion Minion is an open-source constraint solver that is designed to be very scalable. This test profile uses Minion's integrated benchmarking problems to solve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Minion 1.8 Benchmark: Graceful GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 15 30 45 60 75 SE +/- 0.24, N = 3 SE +/- 0.56, N = 3 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 64.66 64.30 65.24 65.51 1. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic
SQLite This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.8.10.2 Test Target: Default Test Directory GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 6 12 18 24 30 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.28, N = 3 SE +/- 0.11, N = 3 26.18 26.17 25.79 25.77 1. (CC) gcc options: -O3 -march=native -ldl -lpthread
libjpeg-turbo tjbench OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.1 Test: Decompression Throughput GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 30 60 90 120 150 SE +/- 0.21, N = 3 SE +/- 0.09, N = 3 SE +/- 0.39, N = 3 SE +/- 1.01, N = 3 156.87 156.14 157.87 158.08 1. (CC) gcc options: -O3 -march=native -lm
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.19 Operation: Local Adaptive Thresholding GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 93 93 94 94 -std=gnu99 -ldl -ldl -ldl 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljasper -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
Timed HMMer Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 7.24 7.24 7.24 7.17 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
HPC Challenge OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.4.3 Test / Class: G-HPL GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 16 32 48 64 80 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 SE +/- 0.26, N = 3 SE +/- 0.14, N = 3 73.20 73.70 73.88 73.72 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.10.2
OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.4.3 Test / Class: EP-DGEMM GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 2 4 6 8 10 SE +/- 0.05234, N = 3 SE +/- 0.01720, N = 3 SE +/- 0.00120, N = 3 SE +/- 0.00215, N = 3 6.73993 6.78137 6.80147 6.79701 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.10.2
GMPbench OpenBenchmarking.org GMPbench Score, More Is Better GMPbench 0.2 Total Time GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 1000 2000 3000 4000 5000 4494.90 4530.60 4496.10 4501.60 1. (CC) gcc options: -O3 -march=native -lm
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 70 140 210 280 350 SE +/- 0.48, N = 4 SE +/- 0.51, N = 4 SE +/- 0.15, N = 4 SE +/- 0.83, N = 4 339.43 340.07 340.01 341.03 1. (CXX) g++ options: -O3 -march=native
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 30 60 90 120 150 SE +/- 0.10, N = 3 SE +/- 0.18, N = 3 SE +/- 0.08, N = 3 SE +/- 0.26, N = 3 129.07 128.68 128.96 128.57 1. (CXX) g++ options: -O2 -lOpenCL
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.1g RSA 4096-bit Performance GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 200 400 600 800 1000 SE +/- 0.75, N = 3 SE +/- 1.26, N = 3 SE +/- 0.55, N = 3 SE +/- 1.10, N = 3 981.57 984.53 983.13 982.70 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
PostgreSQL pgbench OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 9.4.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 2K 4K 6K 8K 10K SE +/- 28.13, N = 3 SE +/- 15.65, N = 3 SE +/- 27.48, N = 3 SE +/- 23.18, N = 3 7986.34 7963.88 7986.01 7969.85 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 300 600 900 1200 1500 SE +/- 0.30, N = 4 SE +/- 0.24, N = 4 SE +/- 0.21, N = 4 SE +/- 3.46, N = 4 1210.68 1210.65 1210.76 1213.54 1. (CXX) g++ options: -O3 -march=native
Multichase Pointer Chaser OpenBenchmarking.org ns, Fewer Is Better Multichase Pointer Chaser Test: 4MB Array, 64 Byte Stride GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 6.98 6.97 6.98 6.97 1. (CC) gcc options: -O2 -static -pthread -lrt
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 700 1400 2100 2800 3500 SE +/- 0.04, N = 3 SE +/- 0.38, N = 3 SE +/- 0.12, N = 3 SE +/- 0.21, N = 3 3053.78 3052.98 3053.62 3052.37 1. (CC) gcc options: -lrt
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: SET GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 400K 800K 1200K 1600K 2000K SE +/- 4150.84, N = 3 SE +/- 7318.36, N = 3 SE +/- 62672.40, N = 6 SE +/- 6181.11, N = 3 1650185.92 1661193.92 1528154.69 1684492.46 -std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl
OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: GET GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 500K 1000K 1500K 2000K 2500K SE +/- 93997.54, N = 6 SE +/- 2889.57, N = 3 SE +/- 17316.52, N = 3 SE +/- 62256.75, N = 6 2054039.04 2237143.92 2198076.58 2158090.65 -std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl
ebizzy This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 Phoronix Test Suite v7.0.0m1 GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 40K 80K 120K 160K 200K SE +/- 5743.30, N = 6 SE +/- 3471.19, N = 3 SE +/- 3146.69, N = 3 SE +/- 4871.23, N = 6 194802 186104 195641 184635 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 140 280 420 560 700 SE +/- 33.75, N = 4 SE +/- 2.50, N = 4 SE +/- 0.03, N = 4 SE +/- 3.03, N = 4 613.36 641.57 643.99 647.15 1. (CXX) g++ options: -O3 -march=native
CLOMP CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Speedup, More Is Better CLOMP 3.3 Static OMP Speedup GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 1.287 2.574 3.861 5.148 6.435 SE +/- 0.10, N = 10 SE +/- 0.15, N = 10 SE +/- 0.08, N = 10 SE +/- 0.05, N = 5 5.69 5.42 5.72 5.66 1. (CC) gcc options: --openmp -O3 -lm
HPC Challenge OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.4.3 Test / Class: EP-STREAM Triad GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 0.7457 1.4914 2.2371 2.9828 3.7285 SE +/- 0.13310, N = 3 SE +/- 0.16749, N = 3 SE +/- 0.10335, N = 3 SE +/- 0.03993, N = 3 3.31431 2.91637 2.75787 2.69651 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.10.2
Compile Bench Compilebench tries to age a filesystem by simulating some of the disk IO common in creating, compiling, patching, stating and reading kernel trees. It indirectly measures how well filesystems can maintain directory locality as the disk fills up and directories age. This current test is setup to use the makej mode with 10 initial directories Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Read Compiled Tree GCC 4.9.4 GCC 5.4.0 GCC 6.3.0 GCC 7.0.0 20170108 600 1200 1800 2400 3000 SE +/- 50.34, N = 3 SE +/- 70.72, N = 3 SE +/- 12.95, N = 3 SE +/- 248.53, N = 3 2535.10 2536.63 2643.76 1820.84
GCC 7.0.0 20170108 Processor: Intel Core i7-6800K @ 3.80GHz (12 Cores), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0, Chipset: Intel Xeon E7 v4/Xeon, Memory: 16384MB, Disk: 120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150, Graphics: NVIDIA GeForce GTX TITAN X 12288MB, Audio: Realtek ALC1150, Network: Intel Connection
OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 7.0.0 20170108, File-System: ext4, Screen Resolution: 2560x1440
Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortranDisk Notes: DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: intel_pstate powersaveOpenCL Notes: GPU Compute Cores: 3072System Notes: Python 2.7.12. GPU Compute Cores: 3072.
Testing initiated at 15 January 2017 16:46 by user pts.
GCC 6.3.0 Processor: Intel Core i7-6800K @ 3.80GHz (12 Cores), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0, Chipset: Intel Xeon E7 v4/Xeon, Memory: 16384MB, Disk: 120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150, Graphics: NVIDIA GeForce GTX TITAN X 12288MB, Audio: Realtek ALC1150, Network: Intel Connection
OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 6.3.0, File-System: ext4, Screen Resolution: 2560x1440
Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortranDisk Notes: DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: intel_pstate powersaveOpenCL Notes: GPU Compute Cores: 3072System Notes: Python 2.7.12. GPU Compute Cores: 3072.
Testing initiated at 15 January 2017 21:15 by user pts.
GCC 5.4.0 Processor: Intel Core i7-6800K @ 3.80GHz (12 Cores), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0, Chipset: Intel Xeon E7 v4/Xeon, Memory: 16384MB, Disk: 120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150, Graphics: NVIDIA GeForce GTX TITAN X 12288MB, Audio: Realtek ALC1150, Network: Intel Connection
OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 5.4.0, File-System: ext4, Screen Resolution: 2560x1440
Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortranDisk Notes: DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: intel_pstate powersaveOpenCL Notes: GPU Compute Cores: 3072System Notes: Python 2.7.12. GPU Compute Cores: 3072.
Testing initiated at 16 January 2017 06:22 by user pts.
GCC 4.9.4 Processor: Intel Core i7-6800K @ 3.80GHz (12 Cores), Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0, Chipset: Intel Xeon E7 v4/Xeon, Memory: 16384MB, Disk: 120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150, Graphics: NVIDIA GeForce GTX TITAN X 12288MB, Audio: Realtek ALC1150, Network: Intel Connection
OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.27.03, OpenGL: 4.5.0, Vulkan: 1.0.8, Compiler: GCC 4.9.4, File-System: ext4, Screen Resolution: 2560x1440
Compiler Notes: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortranDisk Notes: DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Notes: Scaling Governor: intel_pstate powersaveOpenCL Notes: GPU Compute Cores: 3072System Notes: Python 2.7.12. GPU Compute Cores: 3072.
Testing initiated at 16 January 2017 10:48 by user pts.