Intel Core i9-7980XE testing with GCC 9 compiler by Michael Larabel.
-O3 -march=skylake
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"
Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell
Processor Notes: Scaling Governor: intel_pstate performance
Python Notes: Python 3.7.3
Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
-O3 -march=skylake-avx512 -mprefer-vector-width=128
Environment Notes: CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128. All other environment variables and the Compiler, Processor, Python, and Security Notes are identical to those of the -O3 -march=skylake configuration.
-O3 -march=skylake-avx512 -mprefer-vector-width=256
Environment Notes: CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256. All other environment variables and the Compiler, Processor, Python, and Security Notes are identical to those of the -O3 -march=skylake configuration.
-O3 -march=skylake-avx512 -mprefer-vector-width=512
Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)
Motherboard: ASUS PRIME X299-A (1704 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16384MB
Disk: Samsung SSD 970 EVO 500GB
Graphics: NVIDIA NV120 12GB
Audio: Realtek ALC1220
Monitor: ASUS PB278
Network: Intel I219-V
OS: Clear Linux OS 29350
Kernel: 5.0.14-753.native (x86_64)
Desktop: GNOME Shell 3.32.1
Display Server: X Server 1.20.4
Display Driver: nouveau 1.0.16
OpenGL: 4.3 Mesa 19.1.0-devel
Compiler: GCC 9.1.1 20190509 gcc-9-branch@271025 + clang (Clear Linux OS for Intel Architecture) 9.1.1 20190509 gcc-9-branch@271025 + LLVM 8.0.0
File-System: ext4
Screen Resolution: 2560x1440
Environment Notes: CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512. All other environment variables and the Compiler, Processor, Python, and Security Notes are identical to those of the -O3 -march=skylake configuration.
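The four configurations differ only in the CFLAGS and CXXFLAGS values listed in the Environment Notes. As a minimal sketch, and not the harness actually used for these results, the per-configuration environment could be assembled as follows. The helper name build_env is hypothetical, and the flags are written space-separated as in the configuration labels, which is an assumption about how they were originally exported.

```python
import os

# The four flag sets, copied from the configuration labels of this result file.
CONFIGS = [
    "-O3 -march=skylake",
    "-O3 -march=skylake-avx512 -mprefer-vector-width=128",
    "-O3 -march=skylake-avx512 -mprefer-vector-width=256",
    "-O3 -march=skylake-avx512 -mprefer-vector-width=512",
]


def build_env(flags, base=None):
    """Return a copy of an environment with CFLAGS and CXXFLAGS set to `flags`.

    Hypothetical helper for illustration only: `base` defaults to the
    current process environment, and the input mapping is never modified.
    """
    env = dict(os.environ if base is None else base)
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    for flags in CONFIGS:
        env = build_env(flags, base={})
        print("CFLAGS=" + env["CFLAGS"])
```

The resulting dictionary could then be passed as the env argument of subprocess.run when launching a build, so that each configuration is compiled in an otherwise identical environment.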
CppPerformanceBenchmarks
CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks.
CppPerformanceBenchmarks 9 - Test: Random Numbers (Seconds, Fewer Is Better)
  -O3 -march=skylake: 846 (SE +/- 0.29, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 821 (SE +/- 0.19, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 857 (SE +/- 0.00, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 846 (SE +/- 0.10, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -std=c++11
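Each result carries a standard error ("SE +/- x, N = n") over N runs. A minimal sketch of the usual definition, the sample standard deviation divided by the square root of N, is shown below. Whether the Phoronix Test Suite uses exactly this estimator is an assumption, and the run times in the example are made up for illustration, not taken from these results.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


if __name__ == "__main__":
    # Hypothetical run times in seconds; NOT measurements from this result file.
    hypothetical_runs = [845.7, 846.0, 846.6]
    mean = statistics.mean(hypothetical_runs)
    se = standard_error(hypothetical_runs)
    print("mean = %.2f s, SE +/- %.2f, N = %d" % (mean, se, len(hypothetical_runs)))
```

With only three runs per bar, as in most charts here, the standard error is itself a noisy estimate, so small differences between configurations should be read with caution.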
MKL-DNN
This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, using its built-in benchdnn functionality. The result is the total perf time reported.
MKL-DNN 2019-04-16 - Harness: Deconvolution Batch deconv_all - Data Type: f32 (ms, Fewer Is Better)
  -O3 -march=skylake: 10446 (SE +/- 15.77, N = 3; MIN: 10397.8)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 10397 (SE +/- 3.81, N = 3; MIN: 10360.8)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 10405 (SE +/- 5.96, N = 3; MIN: 10370.6)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 10424 (SE +/- 6.33, N = 3; MIN: 10383.1)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
CppPerformanceBenchmarks 9 - Test: Math Library (Seconds, Fewer Is Better)
  -O3 -march=skylake: 260 (SE +/- 0.17, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 262 (SE +/- 0.10, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 264 (SE +/- 1.04, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 263 (SE +/- 0.10, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -std=c++11
FFTW
FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.
FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)
  -O3 -march=skylake: 17183 (SE +/- 39.05, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 17522 (SE +/- 111.64, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 17326 (SE +/- 90.51, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 17334 (SE +/- 71.84, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -pthread -O3 -lm
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
  -O3 -march=skylake: 6327 (SE +/- 55.64, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 6863 (SE +/- 28.38, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 6615 (SE +/- 19.27, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 5673 (SE +/- 31.31, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -pthread -O3 -lm
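To compare configurations it helps to express each result relative to the -O3 -march=skylake baseline. The sketch below does this for the FFTW Stock build, assuming the four values (6327, 6863, 6615, 5673) are listed in the same order as the configurations. The function name relative_change is illustrative.

```python
def relative_change(value, baseline):
    """Signed change of `value` relative to `baseline`, in percent."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (value - baseline) / baseline * 100.0


# FFTW 3.3.6, Build: Stock, 2D FFT Size 4096 (Mflops, more is better).
# Assumption: values appear in the same order as the configuration labels.
FFTW_STOCK = {
    "-O3 -march=skylake": 6327,
    "-O3 -march=skylake-avx512 -mprefer-vector-width=128": 6863,
    "-O3 -march=skylake-avx512 -mprefer-vector-width=256": 6615,
    "-O3 -march=skylake-avx512 -mprefer-vector-width=512": 5673,
}

if __name__ == "__main__":
    baseline = FFTW_STOCK["-O3 -march=skylake"]
    for config, mflops in FFTW_STOCK.items():
        print("%-52s %+6.2f%%" % (config, relative_change(mflops, baseline)))
```

For "More Is Better" charts such as this one a positive change is an improvement; for "Fewer Is Better" charts (seconds, ms) a positive change means the configuration was slower.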
MKL-DNN 2019-04-16 - Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 (ms, Fewer Is Better)
  -O3 -march=skylake: 752 (SE +/- 0.92, N = 3; MIN: 744)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 749 (SE +/- 1.26, N = 3; MIN: 741.69)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 751 (SE +/- 0.50, N = 3; MIN: 744.39)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 751 (SE +/- 0.64, N = 3; MIN: 745)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
Parboil
The Parboil Benchmarks from the IMPACT Research Group at the University of Illinois are a set of throughput computing applications for studying computing architectures and compilers. Parboil test cases support OpenMP, OpenCL, and CUDA multi-processing environments; at this time, however, the test profile uses only the OpenMP and OpenCL workloads.
Parboil 2.5 - Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)
  -O3 -march=skylake: 132 (SE +/- 0.79, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 135 (SE +/- 0.73, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 134 (SE +/- 0.33, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 132 (SE +/- 0.63, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
NGINX Benchmark
This is a test of ab, the Apache Benchmark program, running against nginx. This test profile measures how many requests per second a given system can sustain when carrying out 2,000,000 requests, 500 of them concurrently.
NGINX Benchmark 1.9.9 - Static Web Page Serving (Requests Per Second, More Is Better)
  -O3 -march=skylake: 47570 (SE +/- 643.64, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 46938 (SE +/- 834.73, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 49221 (SE +/- 314.76, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 44072 (SE +/- 88.86, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
CppPerformanceBenchmarks 9 - Test: Stepanov Vector (Seconds, Fewer Is Better)
  -O3 -march=skylake: 82.98 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 82.88 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 83.16 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 98.92 (SE +/- 0.02, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -std=c++11
PostgreSQL pgbench
This is a simple benchmark of PostgreSQL using pgbench.
PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better)
  -O3 -march=skylake: 520100 (SE +/- 1607.28, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 519930 (SE +/- 772.32, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 521901 (SE +/- 2392.99, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 498539 (SE +/- 1960.25, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)
  -O3 -march=skylake: 13311 (SE +/- 128.04, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 13495 (SE +/- 44.82, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 13576 (SE +/- 45.37, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 13424 (SE +/- 82.67, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
Timed Linux Kernel Compilation
This test times how long it takes to build the Linux kernel in a default configuration.
Timed Linux Kernel Compilation 4.18 - Time To Compile (Seconds, Fewer Is Better)
  -O3 -march=skylake: 36.34 (SE +/- 0.38, N = 8)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 36.33 (SE +/- 0.38, N = 8)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 36.93 (SE +/- 0.53, N = 4)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 37.36 (SE +/- 0.53, N = 4)
Parboil 2.5 - Test: OpenMP LBM (Seconds, Fewer Is Better)
  -O3 -march=skylake: 68.54 (SE +/- 0.15, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 68.35 (SE +/- 0.04, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 68.26 (SE +/- 0.05, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 68.18 (SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Stockfish
This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores.
Stockfish 9 - Total Time (Nodes Per Second, More Is Better)
  -O3 -march=skylake: 47112136 (SE +/- 407607.78, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 46810779 (SE +/- 146682.07, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 47046623 (SE +/- 342455.58, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 46997822 (SE +/- 311424.12, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
Timed MrBayes Analysis
This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny.
Timed MrBayes Analysis 3.1.2 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  -O3 -march=skylake: 65.67 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 63.06 (SE +/- 0.08, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 63.20 (SE +/- 0.07, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 65.66 (SE +/- 0.11, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -march=native -O3 -msse -mfpmath=sse -lm -pthread -lmpi
NAS Parallel Benchmarks
NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB.
NAS Parallel Benchmarks 3.3.1 - Test / Class: LU.C (Total Mop/s, More Is Better)
  -O3 -march=skylake: 33669 (SE +/- 62.55, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 34962 (SE +/- 79.15, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 35007 (SE +/- 34.14, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 33676 (SE +/- 86.87, N = 3)
  Chart annotation: -march=skylake
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. 3.2
GraphicsMagick
This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU.
GraphicsMagick 1.3.30 - Operation: Sharpen (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 204
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 207
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 206
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 186
  Reported SE (bars not identified in the source): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 198
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 197
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 198
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 195
  Reported SE (bar not identified in the source): SE +/- 0.67, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30 - Operation: Swirl (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 252
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 251
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 251
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 228
  Reported SE (bars not identified in the source): SE +/- 0.33, N = 3; SE +/- 1.00, N = 3; SE +/- 1.33, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30 - Operation: Enhanced (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 227
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 225
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 224
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 196
  Reported SE (bar not identified in the source): SE +/- 0.33, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30 - Operation: Rotate (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 291
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 290
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 290
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 250
  Reported SE (bar not identified in the source): SE +/- 1.00, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 288
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 289
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 288
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 253
  Reported SE (bars not identified in the source): SE +/- 0.67, N = 3; SE +/- 0.33, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30 - Operation: Resizing (Iterations Per Minute, More Is Better)
  -O3 -march=skylake: 277
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 277
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 275
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 243
  Reported SE (bar not identified in the source): SE +/- 0.58, N = 3
  Chart annotation: -march=skylake
  1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
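The seven GraphicsMagick operations can be summarized with a geometric mean of per-operation ratios, a common way to aggregate benchmark results measured on different scales. The sketch below compares -mprefer-vector-width=512 against the -O3 -march=skylake baseline, assuming the first and fourth value of each chart belong to those two configurations. The function names are illustrative.

```python
import math

# Iterations per minute as (skylake baseline, prefer-vector-width=512) pairs.
# Assumption: each chart lists its values in configuration order.
GRAPHICSMAGICK = {
    "Sharpen": (204, 186),
    "Noise-Gaussian": (198, 195),
    "Swirl": (252, 228),
    "Enhanced": (227, 196),
    "Rotate": (291, 250),
    "HWB Color Space": (288, 253),
    "Resizing": (277, 243),
}


def geometric_mean(values):
    """Geometric mean of strictly positive numbers, computed via logarithms."""
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs a non-empty list of positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def mean_ratio(results):
    """Geometric mean of the width-512 / baseline throughput ratios."""
    return geometric_mean(w512 / base for base, w512 in results.values())


if __name__ == "__main__":
    print("geometric mean ratio (512 vs skylake): %.3f" % mean_ratio(GRAPHICSMAGICK))
```

Under that ordering assumption the ratio comes out at roughly 0.9, that is, about 10% fewer iterations per minute with 512-bit vectors preferred across these operations.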
Memcached mcperf
This is a test of twmperf/mcperf with memcached.
Memcached mcperf 1.5.10 - Method: Get (Operations Per Second, More Is Better)
  -O3 -march=skylake: 117212 (SE +/- 1600.60, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 107497 (SE +/- 876.94, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 117621 (SE +/- 1645.69, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 112844 (SE +/- 1768.29, N = 15)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -O3 -lm -rdynamic
Himeno Benchmark
The Himeno benchmark is a linear solver for the pressure Poisson equation using a point-Jacobi method.
Himeno Benchmark 3.0 - Poisson Pressure Solver (MFLOPS, More Is Better)
  -O3 -march=skylake: 3302 (SE +/- 5.71, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 3310 (SE +/- 2.31, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 3292 (SE +/- 4.67, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2814 (SE +/- 9.47, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -O3 -mavx2
CppPerformanceBenchmarks 9 - Test: Atol (Seconds, Fewer Is Better)
  -O3 -march=skylake: 57.08 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 57.07 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 57.00 (SE +/- 0.04, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 57.06 (SE +/- 0.04, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -std=c++11
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment.
Rodinia 2.4 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
  -O3 -march=skylake: 13.63 (SE +/- 0.13, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 13.72 (SE +/- 0.11, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 13.88 (SE +/- 0.13, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 14.12 (SE +/- 0.20, N = 4)
  1. (CXX) g++ options: -O2 -lOpenCL
Redis
Redis is an open-source data structure server.
Redis 4.0.8 - Test: LPUSH (Requests Per Second, More Is Better)
  -O3 -march=skylake: 2059237 (SE +/- 34258.37, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 1962269 (SE +/- 36055.36, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 2013442 (SE +/- 32556.93, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2026656 (SE +/- 36075.54, N = 15)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
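For noisy tests such as Redis, a gap between two bars is only meaningful if it is large compared with the reported standard errors. A rough heuristic, not a formal significance test, is to divide the difference of the means by the combined standard error. The sketch below applies it to the first two LPUSH results, assuming the SE values are listed in configuration order and the runs are independent. The function name is illustrative.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error.

    The combined standard error is sqrt(se_a**2 + se_b**2), which assumes the
    two sets of runs are independent.
    """
    combined = math.hypot(se_a, se_b)
    if combined == 0:
        raise ValueError("combined standard error is zero")
    return abs(mean_a - mean_b) / combined


if __name__ == "__main__":
    # Redis 4.0.8 LPUSH: -O3 -march=skylake vs. -mprefer-vector-width=128.
    z = difference_in_se_units(2059237, 34258.37, 1962269, 36055.36)
    print("difference = %.2f combined standard errors" % z)
```

A value below about 2, as for this pair, suggests the two configurations are hard to tell apart from run-to-run noise.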
Redis 4.0.8 - Test: GET (Requests Per Second, More Is Better)
  -O3 -march=skylake: 2958563 (SE +/- 45086.74, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 2992962 (SE +/- 54716.38, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 2944968 (SE +/- 56848.49, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 3002750 (SE +/- 58238.45, N = 15)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Timed PHP Compilation
This test times how long it takes to build PHP 7.1.9 with the Zend engine.
Timed PHP Compilation 7.1.9 - Time To Compile (Seconds, Fewer Is Better)
  -O3 -march=skylake: 51.80 (SE +/- 0.08, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 51.84 (SE +/- 0.15, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 51.90 (SE +/- 0.08, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 52.19 (SE +/- 0.05, N = 3)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm
Redis 4.0.8 - Test: LPOP (Requests Per Second, More Is Better)
  -O3 -march=skylake: 3140283 (SE +/- 55290.27, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 3095145 (SE +/- 71217.78, N = 12)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 3139485 (SE +/- 61579.63, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 3059844 (SE +/- 59550.07, N = 15)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
m-queens
A solver for the N-queens problem with multi-threading support via the OpenMP library.
m-queens 1.2 - Time To Solve (Seconds, Fewer Is Better)
  -O3 -march=skylake: 48.42 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 48.42 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 48.40 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 48.41 (SE +/- 0.01, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -fopenmp -O3 -O2 -march=native
Memcached mcperf 1.5.10 - Method: Delete (Operations Per Second, More Is Better)
  -O3 -march=skylake: 120897 (SE +/- 1837.94, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 122131 (SE +/- 1623.43, N = 4)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 114714 (SE +/- 1539.98, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 115719 (SE +/- 1672.82, N = 15)
  Chart annotation: -march=skylake
  1. (CC) gcc options: -O3 -lm -rdynamic
7-Zip Compression
This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build.
7-Zip Compression 16.02 - Compress Speed Test (MIPS, More Is Better)
  -O3 -march=skylake: 100157 (SE +/- 201.98, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 99546 (SE +/- 264.40, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 99960 (SE +/- 414.93, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 100416 (SE +/- 174.10, N = 3)
  1. (CXX) g++ options: -pipe -lpthread
Redis 4.0.8 - Test: SET (Requests Per Second, More Is Better)
  -O3 -march=skylake: 2163293 (SE +/- 42954.48, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 2189741 (SE +/- 38284.96, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 1971745 (SE +/- 25293.75, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2135179 (SE +/- 44027.91, N = 15)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis 4.0.8 - Test: SADD (Requests Per Second, More Is Better)
  -O3 -march=skylake: 2339849 (SE +/- 33796.90, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 2624995 (SE +/- 20507.83, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 2449355 (SE +/- 56601.20, N = 15)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2383273 (SE +/- 50359.83, N = 15)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Cpuminer-Opt
Cpuminer benchmark.
Cpuminer-Opt 3.8.8.1 - Algorithm: m7m (kH/s - Hash Speed, More Is Better)
  -O3 -march=skylake: 499 (SE +/- 1.19, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 503 (SE +/- 1.28, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 506 (SE +/- 0.55, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 479 (SE +/- 1.44, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.8.8.1 - Algorithm: sha256t (kH/s - Hash Speed, More Is Better)
  -O3 -march=skylake: 125653 (SE +/- 3.33, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 124672 (SE +/- 1469.65, N = 6)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 124030 (SE +/- 240.62, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 124567 (SE +/- 661.72, N = 3)
  Chart annotation: -march=skylake
  1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench
lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball.
lzbench 2017-08-08 - Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  -O3 -march=skylake: 108
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 107
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 107
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 108
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08 - Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  -O3 -march=skylake: 39
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 39
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 39
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 39
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.3.1, Test / Class: BT.A (Total Mop/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 4391, 4773, 4784, 4391. SE +/- 10.11, 7.81, 9.00, 4.76 (N = 3 each). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
Cpuminer-Opt 3.8.8.1, Algorithm: deep (kH/s - Hash Speed, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 15323, 15497, 15490, 14490. SE +/- 21.86, 31.80, 15.28, 65.06 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench 2017-08-08, Test: Brotli 0 - Process: Decompression (MB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 620, 620, 620, 621. SE values reported: +/- 1.00, 1.00, 0.58 (N = 3 each). Notes: 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Brotli 0 - Process: Compression (MB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 430, 429, 429, 428. SE +/- 0.58, 0.67, 0.67, 1.67 (N = 3 each). Notes: 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 1137, 1138, 1137, 1138. SE values reported: +/- 0.33, 0.58 (N = 3 each). Notes: 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Zstd 1 - Process: Compression (MB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 451, 451, 452, 452. SE value reported: +/- 0.58 (N = 3). Notes: 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 1189, 1190, 1191, 1191. SE values reported: +/- 0.33, 0.67, 0.67 (N = 3 each). Notes: 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 248, 249, 249, 249. SE value reported: +/- 0.33 (N = 3). Notes: 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
C-Ray This is a test of C-Ray, a simple raytracer designed to test floating-point CPU performance. This test is multi-threaded (16 threads per core), shoots 8 rays per pixel for anti-aliasing, and generates a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 30.87, 33.34, 33.47, 35.86. SE +/- 0.06, 0.00, 0.01, 0.01 (N = 3 each). Notes: 1. (CC) gcc options: -lm -lpthread -O3
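Because some charts are "More Is Better" and others "Fewer Is Better", a relative comparison has to take the direction into account. The sketch below shows one way to do that. `percent_change` is a hypothetical helper, and the inputs are simply the first and last C-Ray values and the first and third NAS BT.A values as listed above; no claim is made here about which configuration each value belongs to.

```python
def percent_change(baseline, candidate, lower_is_better=False):
    """Signed improvement of candidate over baseline, in percent.

    Positive means the candidate is better under the chart's own direction.
    """
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    change = (candidate - baseline) / baseline * 100.0
    # For time-based charts a larger number is a regression, so flip the sign.
    return -change if lower_is_better else change


# C-Ray (seconds, fewer is better): first vs. last listed value.
c_ray = percent_change(30.87, 35.86, lower_is_better=True)
# NAS BT.A (Mop/s, more is better): first vs. third listed value.
bt_a = percent_change(4391, 4784)

print(f"C-Ray: {c_ray:+.1f}%  BT.A: {bt_a:+.1f}%")
```

Normalizing the sign this way lets results from throughput and timing charts be compared side by side.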
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
CppPerformanceBenchmarks 9, Test: Stepanov Abstraction (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 32.49, 32.44, 32.44, 37.27. SE +/- 0.01, 0.02, 0.03, 0.02 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -std=c++11
Cpuminer-Opt 3.8.8.1, Algorithm: myr-gr (kH/s - Hash Speed, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 16360, 15560, 15695, 15310. SE +/- 76.38 (N = 3), 92.92 (N = 3), 219.87 (N = 4), 32.15 (N = 3). Notes: 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.8.8.1, Algorithm: lbry (kH/s - Hash Speed, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 48547, 52793, 53473, 53620. SE +/- 3.33 (N = 3), 41.77 (N = 3), 283.57 (N = 3), 706.79 (N = 4). Notes: 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 710, 738, 729, 669. SE +/- 1.31 (N = 3), 1.58 (N = 3), 7.91 (N = 7), 0.52 (N = 3). Notes: 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lpthread -fopenmp -fwhole-program -lstdc++
LuaJIT This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.
LuaJIT 2.1-git, Test: Composite (Mflops, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 1606, 1605, 1607, 1608. SE +/- 2.67, 0.83, 5.01, 1.35 (N = 3 each). Notes: 1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 31.17, 31.15, 31.11, 31.29. SE +/- 0.01, 0.06, 0.04, 0.15 (N = 3 each). Notes: 1. (CC) gcc options: -lm -O3
Cpuminer-Opt 3.8.8.1, Algorithm: skein (kH/s - Hash Speed, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 56227, 62813, 62560, 62027. SE +/- 3.33, 678.39, 955.00, 1026.67 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
CppPerformanceBenchmarks 9, Test: Ctype (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 29.69, 30.57, 30.89, 30.56. SE +/- 0.01, 0.01, 0.02, 0.02 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -std=c++11
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
Apache Benchmark 2.4.29, Static Web Page Serving (Requests Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 36040, 35982, 35946, 33240. SE +/- 52.04, 144.00, 63.22, 36.99 (N = 3 each). Notes: 1. (CC) gcc options: -shared -fPIC -pthread -O3
Crafty This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.
Crafty 25.2, Elapsed Time (Nodes Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 8852846, 8840395, 8845689, 8840945. SE +/- 4114.60, 5800.58, 7027.53, 7719.64 (N = 3 each). Notes: 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Botan Botan is a cross-platform, open-source C++ crypto library that supports most publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
Botan 2.8.0, Test: AES-256 - Decrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 4212, 3840, 3855, 3839. SE +/- 0.57, 2.47, 1.90, 1.97 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: AES-256 - Encrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 4220, 3847, 3833, 3835. SE +/- 0.41, 0.98, 1.21, 1.37 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
MKL-DNN This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, using its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_1d - Data Type: f32 (ms, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 23.03, 23.01, 22.93, 23.07. SE +/- 0.10, 0.17, 0.15, 0.22 (N = 3 each). MIN: 22.69, 22.68, 22.61, 22.7. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_alexnet - Data Type: f32 (ms, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 1822, 1820, 1819, 1819. SE +/- 0.89, 0.22, 0.28, 0.68 (N = 3 each). MIN: 1818.58, 1817.59, 1816.37, 1815.94. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
NAS Parallel Benchmarks 3.3.1, Test / Class: SP.A (Total Mop/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 4062, 4073, 4185, 4080. SE +/- 41.78, 7.44, 38.60, 17.99 (N = 3 each). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
Botan 2.8.0, Test: Twofish - Decrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 393, 395, 395, 395. SE +/- 1.58, 0.13, 0.21, 0.07 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: Twofish - Encrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 400, 401, 401, 401. SE +/- 1.99, 0.12, 0.34, 0.17 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: Blowfish - Decrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 293, 294, 293, 294. SE +/- 0.34, 0.51, 0.18, 0.03 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: Blowfish - Encrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 295, 295, 295, 295. SE +/- 0.09, 0.22, 0.17, 0.03 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: CAST-256 - Decrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 145, 146, 146, 146. SE +/- 0.07, 0.08, 0.06, 0.04 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: CAST-256 - Encrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 145, 145, 145, 145. SE +/- 0.12, 0.11, 0.18, 0.06 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: KASUMI - Decrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 95.01, 95.26, 95.17, 95.30. SE +/- 0.01, 0.02, 0.02, 0.04 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.8.0, Test: KASUMI - Encrypt (MiB/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 96.02, 96.22, 96.21, 96.25. SE +/- 0.04, 0.04, 0.05, 0.08 (N = 3 each). Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
Memcached mcperf 1.5.10, Method: Set (Operations Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 72585, 73045, 72688, 72940. SE +/- 335.26, 429.23, 384.96, 141.11 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf 1.5.10, Method: Add (Operations Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 72123, 73307, 72947, 73346. SE +/- 582.90, 259.01, 163.23, 622.40 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -lm -rdynamic
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
dav1d 0.3, Video Input: Summer Nature 4K (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 19.15, 19.47, 19.34, 19.67. SE +/- 0.05, 0.04, 0.06, 0.06 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -pthread
Memcached mcperf 1.5.10, Method: Append (Operations Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 78294, 78021, 77703, 77932. SE +/- 495.69, 510.45, 182.38, 290.53 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf 1.5.10, Method: Replace (Operations Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 78460, 78432, 77373, 78798. SE +/- 240.24, 549.37, 642.32, 216.99 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf 1.5.10, Method: Prepend (Operations Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 77484, 77547, 79076, 79234. SE +/- 755.85, 109.89, 428.91, 121.74 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -lm -rdynamic
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 2.4, Test: OpenMP LavaMD (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 16.96, 16.99, 16.95, 16.98. SE +/- 0.01, 0.03, 0.01, 0.01 (N = 3 each). Notes: 1. (CXX) g++ options: -O2 -lOpenCL
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
XZ Compression 5.2.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 15.88, 15.84, 15.80, 15.99. SE +/- 0.06, 0.03, 0.06, 0.02 (N = 3 each). Notes: 1. (CC) gcc options: -pthread -fvisibility=hidden -O3
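The kind of measurement described for XZ Compression, wall-clock time to compress a file, can be approximated with Python's standard `lzma` module. This is only a sketch and not the test profile's actual harness, which times the xz program on the Ubuntu image. The helper `time_xz_compress` and the synthetic payload are hypothetical, and the demo call uses preset 1 rather than the benchmark's level 9 to keep memory use small.

```python
import lzma
import os
import time


def time_xz_compress(data, preset=9):
    """Compress data in the .xz container format; return (seconds, compressed size)."""
    start = time.perf_counter()
    compressed = lzma.compress(data, format=lzma.FORMAT_XZ, preset=preset)
    elapsed = time.perf_counter() - start
    return elapsed, len(compressed)


# Hypothetical stand-in payload: repetitive text plus some incompressible bytes.
payload = (b"benchmark sample line\n" * 50_000) + os.urandom(64 * 1024)

# Preset 1 for the demo; pass preset=9 to mirror the benchmark's compression level.
elapsed, size = time_xz_compress(payload, preset=1)
print(f"{len(payload)} -> {size} bytes in {elapsed:.3f} s")
```

As in the charts above, a single timing is noisy; repeating the call several times and reporting the mean is closer to how the listed results (N = 3) were gathered.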
MKL-DNN 2019-04-16, Harness: IP Batch 1D - Data Type: f32 (ms, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 20.79, 20.84, 20.74, 20.66. SE +/- 0.08, 0.08, 0.05, 0.04 (N = 3 each). MIN: 19.84, 19.89, 19.8, 19.73. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
FLAC Audio Encoding 1.3.2, WAV To FLAC (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 8.77, 8.76, 8.66, 9.14. SE +/- 0.00, 0.00, 0.01, 0.01 (N = 5 each). Notes: 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
Xsbench XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.
Xsbench 2017-07-06 (Lookups/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 5164164, 5171354, 5140078, 5166043. SE +/- 5774.93, 3451.22, 24818.37, 5240.08 (N = 3 each). Notes: 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm
CppPerformanceBenchmarks 9, Test: Function Objects (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 13.62, 13.28, 13.70, 13.39. SE +/- 0.08, 0.01, 0.08, 0.07 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -std=c++11
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
Primesieve 7.4, 1e12 Prime Number Generation (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 12.79, 12.80, 12.78, 12.96. SE +/- 0.01, 0.00, 0.02, 0.04 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -lpthread
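For readers unfamiliar with the algorithm Primesieve is built on, here is a basic sieve of Eratosthenes. It is a plain textbook version for illustration only; Primesieve's own implementation is heavily optimized and is not reproduced here. The function name `primes_up_to` is hypothetical.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a basic sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag per integer in [0, limit]; 1 means "still considered prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross off multiples of p, starting at p*p (smaller ones are already done).
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytearray(count)
        p += 1
    return [i for i, flag in enumerate(is_prime) if flag]


print(primes_up_to(30))
```

The flag array is scanned and written sequentially in strides, which is why a sieve's speed depends so strongly on cache behavior, the property the test description highlights.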
Rodinia 2.4, Test: OpenMP CFD Solver (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 11.52, 11.47, 11.31, 11.71. SE +/- 0.10, 0.10, 0.08, 0.10 (N = 3 each). Notes: 1. (CXX) g++ options: -O2 -lOpenCL
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
x265 3.0, H.265 1080p Video Encoding (Frames Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 53.71, 53.34, 53.97, 50.05. SE +/- 0.25, 0.21, 0.07, 0.05 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.B (Total Mop/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 9376, 9337, 9331, 9387. SE +/- 11.85, 63.29, 39.71, 26.14 (N = 3 each). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
CLOMP CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.
CLOMP 3.3, Static OMP Speedup (Speedup, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 2.12, 2.11, 2.12, 2.12. SE +/- 0.01, 0.01, 0.01, 0.01 (N = 5 each). Notes: 1. (CC) gcc options: --openmp -O3 -lm
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-AV1 2019-03-07, 1080p 8-bit YUV To AV1 Video Encode (Frames Per Second, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 24.44, 24.37, 24.34, 24.43. SE +/- 0.14, 0.05, 0.10, 0.07 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -pie -lpthread -lm
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 9.71, 9.59, 9.69, 9.76. SE +/- 0.02, 0.02, 0.02, 0.02 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -pthread -lz
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 9.32, 9.05, 9.30, 10.19. SE +/- 0.01, 0.01, 0.01, 0.00 (N = 3 each). Notes: 1. (CC) gcc options: -O3 -lncurses -lm
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 7.74, 7.98, 7.77, 7.80. SE +/- 0.08, 0.04, 0.01, 0.08 (N = 3 each). Notes: 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
NAS Parallel Benchmarks 3.3.1, Test / Class: EP.C (Total Mop/s, More Is Better). Configurations: -O3 -march=skylake; -O3 -march=skylake-avx512 with -mprefer-vector-width set to 128, 256, or 512. Results as listed: 2038, 2090, 2109, 2113. SE +/- 71.82 (N = 12), 9.18 (N = 3), 6.91 (N = 3), 14.49 (N = 3). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
dav1d
Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content.
dav1d 0.3, Video Input: Summer Nature 1080p (Seconds, Fewer Is Better)
  -O3 -march=skylake: 6.93 (SE +/- 0.05, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 6.97 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 6.95 (SE +/- 0.04, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 7.05 (SE +/- 0.05, N = 3)
  -march=skylake 1. (CC) gcc options: -O3 -pthread
libjpeg-turbo tjbench
tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo.
libjpeg-turbo tjbench 1.5.3, Test: Decompression Throughput (Megapixels/sec, More Is Better)
  -O3 -march=skylake: 196 (SE +/- 0.06, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 196 (SE +/- 0.64, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 194 (SE +/- 0.33, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 197 (SE +/- 0.42, N = 3)
  -march=skylake 1. (CC) gcc options: -O3 -lm
t-test1
This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome.
t-test1 2017-01-13, Threads: 2 (Seconds, Fewer Is Better)
  -O3 -march=skylake: 5.93 (SE +/- 0.10, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 5.81 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 5.82 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 5.82 (SE +/- 0.02, N = 3)
  -march=skylake 1. (CC) gcc options: -pthread -O3
FFmpeg
This test uses FFmpeg for testing the system's audio/video encoding performance.
FFmpeg 4.0.2, H.264 HD To NTSC DV (Seconds, Fewer Is Better)
  -O3 -march=skylake: 5.45 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 5.50 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 5.47 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 5.45 (SE +/- 0.03, N = 3)
  -march=skylake 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shm -lxcb-shape -lxcb-xfixes -pthread -lSDL2 -lbz2 -O3 -std=c11 -fomit-frame-pointer -fPIC -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
x264
This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file.
x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
  -O3 -march=skylake: 139 (SE +/- 1.19, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 140 (SE +/- 1.54, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 140 (SE +/- 1.37, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 140 (SE +/- 1.04, N = 3)
  -march=skylake 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Timed HMMer Search
This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein.
Timed HMMer Search 2.3.2, Pfam Database Search (Seconds, Fewer Is Better)
  -O3 -march=skylake: 3.49 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 3.40 (SE +/- 0.03, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 3.50 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 3.88 (SE +/- 0.02, N = 3)
  -march=skylake 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
NAS Parallel Benchmarks
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.A (Total Mop/s, More Is Better)
  -O3 -march=skylake: 37394 (SE +/- 40.23, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 39791 (SE +/- 36.60, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 39343 (SE +/- 73.39, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 37305 (SE +/- 98.50, N = 3)
  -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
SVT-VP9
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file.
SVT-VP9 2019-02-17, 1080p 8-bit YUV To VP9 Video Encode (Frames Per Second, More Is Better)
  -O3 -march=skylake: 342 (SE +/- 4.85, N = 4)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 346 (SE +/- 4.39, N = 4)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 347 (SE +/- 5.15, N = 4)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 337 (SE +/- 4.48, N = 5)
  -march=skylake 1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
PolyBench-C
PolyBench-C is a C-language polyhedral benchmark suite made at the Ohio State University.
PolyBench-C 4.2, Test: 3 Matrix Multiplications (Seconds, Fewer Is Better)
  -O3 -march=skylake: 2.64 (SE +/- 0.00, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 2.61 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 2.63 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2.63 (SE +/- 0.01, N = 3)
  -march=skylake 1. (CC) gcc options: -O3 -march=native
Parboil
Parboil 2.5, Test: OpenMP CUTCP (Seconds, Fewer Is Better)
  -O3 -march=skylake: 2.38 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 2.38 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 2.38 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2.41 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
CloverLeaf
CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm8192.in input file.
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  -O3 -march=skylake: 2.32 (SE +/- 0.02, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 2.29 (SE +/- 0.01, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 2.29 (SE +/- 0.00, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 2.30 (SE +/- 0.00, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
SVT-HEVC
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file.
SVT-HEVC 2019-02-03, 1080p 8-bit YUV To HEVC Video Encode (Frames Per Second, More Is Better)
  -O3 -march=skylake: 304 (SE +/- 3.61, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 339 (SE +/- 2.96, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 337 (SE +/- 3.64, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 304 (SE +/- 3.22, N = 3)
  -march=skylake 1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
TSCP
This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark.
TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
  -O3 -march=skylake: 1368057 (SE +/- 1357.03, N = 5)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 1388054 (SE +/- 1307.98, N = 5)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 1380406 (SE +/- 1890.20, N = 5)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 1373514 (SE +/- 1083.24, N = 5)
  -march=skylake 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.A (Total Mop/s, More Is Better)
  -O3 -march=skylake: 8581 (SE +/- 18.45, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 8581 (SE +/- 15.81, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 8544 (SE +/- 16.12, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 8543 (SE +/- 32.16, N = 3)
  -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
BLAKE2
This is a benchmark of BLAKE2 using the blake2s binary. BLAKE2 is a high-performance crypto alternative to MD5 and SHA-2/3.
BLAKE2 20170307 (Cycles Per Byte, Fewer Is Better)
  -O3 -march=skylake: 4.02 (SE +/- 0.00, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=128: 4.02 (SE +/- 0.00, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=256: 4.02 (SE +/- 0.00, N = 3)
  -O3 -march=skylake-avx512 -mprefer-vector-width=512: 4.02 (SE +/- 0.00, N = 3)
  1. (CC) gcc options: -O3 -march=native -lcrypto -lz
-O3 -march=skylake
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"
Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell
Processor Notes: Scaling Governor: intel_pstate performance
Python Notes: Python 3.7.3
Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 10 May 2019 15:30 by user phoronix.
-O3 -march=skylake-avx512 -mprefer-vector-width=128
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"
Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell
Processor Notes: Scaling Governor: intel_pstate performance
Python Notes: Python 3.7.3
Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 11 May 2019 15:58 by user phoronix.
-O3 -march=skylake-avx512 -mprefer-vector-width=256
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"
Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell
Processor Notes: Scaling Governor: intel_pstate performance
Python Notes: Python 3.7.3
Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 11 May 2019 21:26 by user phoronix.
-O3 -march=skylake-avx512 -mprefer-vector-width=512
Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads), Motherboard: ASUS PRIME X299-A (1704 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 16384MB, Disk: Samsung SSD 970 EVO 500GB, Graphics: NVIDIA NV120 12GB, Audio: Realtek ALC1220, Monitor: ASUS PB278, Network: Intel I219-V
OS: Clear Linux OS 29350, Kernel: 5.0.14-753.native (x86_64), Desktop: GNOME Shell 3.32.1, Display Server: X Server 1.20.4, Display Driver: nouveau 1.0.16, OpenGL: 4.3 Mesa 19.1.0-devel, Compiler: GCC 9.1.1 20190509 gcc-9-branch@271025 + clang (Clear Linux OS for Intel Architecture) 9.1.1 20190509 gcc-9-branch@271025 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 2560x1440
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"
Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell
Processor Notes: Scaling Governor: intel_pstate performance
Python Notes: Python 3.7.3
Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 11 May 2019 10:39 by user phoronix.