Intel Core i9-7980XE testing with GCC 9 compiler by Michael Larabel.
-O3 -march=skylake Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
-O3 -march=skylake-avx512 -mprefer-vector-width=128 Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
-O3 -march=skylake-avx512 -mprefer-vector-width=256 Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
-O3 -march=skylake-avx512 -mprefer-vector-width=512 Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads), Motherboard: ASUS PRIME X299-A (1704 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 16384MB, Disk: Samsung SSD 970 EVO 500GB, Graphics: NVIDIA NV120 12GB, Audio: Realtek ALC1220, Monitor: ASUS PB278, Network: Intel I219-V
OS: Clear Linux OS 29350, Kernel: 5.0.14-753.native (x86_64), Desktop: GNOME Shell 3.32.1, Display Server: X Server 1.20.4, Display Driver: nouveau 1.0.16, OpenGL: 4.3 Mesa 19.1.0-devel, Compiler: GCC 9.1.1 20190509 gcc-9-branch@271025 + clang (Clear Linux OS for Intel Architecture) 9.1.1 20190509 gcc-9-branch@271025 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 2560x1440
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 1500 3000 4500 6000 7500 SE +/- 31.31, N = 3 SE +/- 19.27, N = 3 SE +/- 28.38, N = 3 SE +/- 55.64, N = 3 5673 6615 6863 6327 -march=skylake 1. (CC) gcc options: -pthread -O3 -lm
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 98.92 83.16 82.88 82.98 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
Himeno Benchmark The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 700 1400 2100 2800 3500 SE +/- 9.47, N = 3 SE +/- 4.67, N = 3 SE +/- 2.31, N = 3 SE +/- 5.71, N = 3 2814 3292 3310 3302 -march=skylake 1. (CC) gcc options: -O3 -mavx2
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 1.00, N = 3 250 290 290 291 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 35.86 33.47 33.34 30.87 -march=skylake 1. (CC) gcc options: -lm -lpthread -O3
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 50 100 150 200 250 SE +/- 0.33, N = 3 196 224 225 227 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 37.27 32.44 32.44 32.49 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 253 288 289 288 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.873 1.746 2.619 3.492 4.365 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 3.88 3.50 3.40 3.49 -march=skylake 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 0.58, N = 3 243 275 277 277 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 10.19 9.30 9.05 9.32 -march=skylake 1. (CC) gcc options: -O3 -lncurses -lm
Cpuminer-Opt Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: skein -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 13K 26K 39K 52K 65K SE +/- 1026.67, N = 3 SE +/- 955.00, N = 3 SE +/- 678.39, N = 3 SE +/- 3.33, N = 3 62027 62560 62813 56227 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 2019-02-03 1080p 8-bit YUV To HEVC Video Encode -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 70 140 210 280 350 SE +/- 3.22, N = 3 SE +/- 3.64, N = 3 SE +/- 2.96, N = 3 SE +/- 3.61, N = 3 304 337 339 304 -march=skylake 1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 186 206 207 204 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 1.33, N = 3 SE +/- 1.00, N = 3 SE +/- 0.33, N = 3 228 251 251 252 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
Cpuminer-Opt Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: lbry -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 11K 22K 33K 44K 55K SE +/- 706.79, N = 4 SE +/- 283.57, N = 3 SE +/- 41.77, N = 3 SE +/- 3.33, N = 3 53620 53473 52793 48547 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 160 320 480 640 800 SE +/- 0.52, N = 3 SE +/- 7.91, N = 7 SE +/- 1.58, N = 3 SE +/- 1.31, N = 3 669 729 738 710 -march=skylake 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lpthread -fopenmp -fwhole-program -lstdc++
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: AES-256 - Encrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 900 1800 2700 3600 4500 SE +/- 1.37, N = 3 SE +/- 1.21, N = 3 SE +/- 0.98, N = 3 SE +/- 0.41, N = 3 3835 3833 3847 4220 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: AES-256 - Decrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 900 1800 2700 3600 4500 SE +/- 1.97, N = 3 SE +/- 1.90, N = 3 SE +/- 2.47, N = 3 SE +/- 0.57, N = 3 3839 3855 3840 4212 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: BT.A -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 1000 2000 3000 4000 5000 SE +/- 4.76, N = 3 SE +/- 9.00, N = 3 SE +/- 7.81, N = 3 SE +/- 10.11, N = 3 4391 4784 4773 4391 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 8K 16K 24K 32K 40K SE +/- 36.99, N = 3 SE +/- 63.22, N = 3 SE +/- 144.00, N = 3 SE +/- 52.04, N = 3 33240 35946 35982 36040 -march=skylake 1. (CC) gcc options: -shared -fPIC -pthread -O3
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 SE +/- 0.25, N = 3 50.05 53.97 53.34 53.71 -march=skylake 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Cpuminer-Opt Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: deep -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3K 6K 9K 12K 15K SE +/- 65.06, N = 3 SE +/- 15.28, N = 3 SE +/- 31.80, N = 3 SE +/- 21.86, N = 3 14490 15490 15497 15323 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: myr-gr -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4K 8K 12K 16K 20K SE +/- 32.15, N = 3 SE +/- 219.87, N = 4 SE +/- 92.92, N = 3 SE +/- 76.38, N = 3 15310 15695 15560 16360 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: LU.A -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 9K 18K 27K 36K 45K SE +/- 98.50, N = 3 SE +/- 73.39, N = 3 SE +/- 36.60, N = 3 SE +/- 40.23, N = 3 37305 39343 39791 37394 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Delete -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30K 60K 90K 120K 150K SE +/- 1672.82, N = 15 SE +/- 1539.98, N = 15 SE +/- 1623.43, N = 4 SE +/- 1837.94, N = 3 115719 114714 122131 120897 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Cpuminer-Opt Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: m7m -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 110 220 330 440 550 SE +/- 1.44, N = 3 SE +/- 0.55, N = 3 SE +/- 1.28, N = 3 SE +/- 1.19, N = 3 479 506 503 499 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 9.14 8.66 8.76 8.77 -march=skylake 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 110K 220K 330K 440K 550K SE +/- 1960.25, N = 3 SE +/- 2392.99, N = 3 SE +/- 772.32, N = 3 SE +/- 1607.28, N = 3 498539 521901 519930 520100 -march=skylake 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 200 400 600 800 1000 SE +/- 0.10, N = 3 SE +/- 0.00, N = 3 SE +/- 0.19, N = 3 SE +/- 0.29, N = 3 846 857 821 846 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.1.2 Primate Phylogeny Analysis -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 15 30 45 60 75 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 65.66 63.20 63.06 65.67 -march=skylake 1. (CC) gcc options: -march=native -O3 -msse -mfpmath=sse -lm -pthread -lmpi
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 30.56 30.89 30.57 29.69 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: LU.C -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 7K 14K 21K 28K 35K SE +/- 86.87, N = 3 SE +/- 34.14, N = 3 SE +/- 79.15, N = 3 SE +/- 62.55, N = 3 33676 35007 34962 33669 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4 8 12 16 20 SE +/- 0.20, N = 4 SE +/- 0.13, N = 15 SE +/- 0.11, N = 15 SE +/- 0.13, N = 15 14.12 13.88 13.72 13.63 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3 6 9 12 15 SE +/- 0.10, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 11.71 11.31 11.47 11.52 1. (CXX) g++ options: -O2 -lOpenCL
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 13.39 13.70 13.28 13.62 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 2 4 6 8 10 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 7.80 7.77 7.98 7.74 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: SP.A -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 900 1800 2700 3600 4500 SE +/- 17.99, N = 3 SE +/- 38.60, N = 3 SE +/- 7.44, N = 3 SE +/- 41.78, N = 3 4080 4185 4073 4062 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 2019-02-17 1080p 8-bit YUV To VP9 Video Encode -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 80 160 240 320 400 SE +/- 4.48, N = 5 SE +/- 5.15, N = 4 SE +/- 4.39, N = 4 SE +/- 4.85, N = 4 337 347 346 342 -march=skylake 1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 4.18 Time To Compile -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 9 18 27 36 45 SE +/- 0.53, N = 4 SE +/- 0.53, N = 4 SE +/- 0.38, N = 8 SE +/- 0.38, N = 8 37.36 36.93 36.33 36.34
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.3 Video Input: Summer Nature 4K -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 19.67 19.34 19.47 19.15 -march=skylake 1. (CC) gcc options: -O3 -pthread
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30 60 90 120 150 SE +/- 0.63, N = 3 SE +/- 0.33, N = 3 SE +/- 0.73, N = 3 SE +/- 0.79, N = 3 132 134 135 132 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Prepend -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20K 40K 60K 80K 100K SE +/- 121.74, N = 3 SE +/- 428.91, N = 3 SE +/- 109.89, N = 3 SE +/- 755.85, N = 3 79234 79076 77547 77484 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 1.3343 2.6686 4.0029 5.3372 6.6715 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 5.82 5.82 5.81 5.93 -march=skylake 1. (CC) gcc options: -pthread -O3
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3K 6K 9K 12K 15K SE +/- 82.67, N = 3 SE +/- 45.37, N = 3 SE +/- 44.82, N = 3 SE +/- 128.04, N = 3 13424 13576 13495 13311 -march=skylake 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4K 8K 12K 16K 20K SE +/- 71.84, N = 3 SE +/- 90.51, N = 3 SE +/- 111.64, N = 3 SE +/- 39.05, N = 3 17334 17326 17522 17183 -march=skylake 1. (CC) gcc options: -pthread -O3 -lm
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Replace -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20K 40K 60K 80K 100K SE +/- 216.99, N = 3 SE +/- 642.32, N = 3 SE +/- 549.37, N = 3 SE +/- 240.24, N = 3 78798 77373 78432 78460 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 9.76 9.69 9.59 9.71 -march=skylake 1. (CC) gcc options: -O3 -pthread -lz
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.3 Video Input: Summer Nature 1080p -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 7.05 6.95 6.97 6.93 -march=skylake 1. (CC) gcc options: -O3 -pthread
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Add -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 16K 32K 48K 64K 80K SE +/- 622.40, N = 3 SE +/- 163.23, N = 3 SE +/- 259.01, N = 3 SE +/- 582.90, N = 3 73346 72947 73307 72123 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
libjpeg-turbo tjbench tjbench is a JPEG decompression/compression benchmark part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 40 80 120 160 200 SE +/- 0.42, N = 3 SE +/- 0.33, N = 3 SE +/- 0.64, N = 3 SE +/- 0.06, N = 3 197 194 196 196 -march=skylake 1. (CC) gcc options: -O3 -lm
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 0.10, N = 3 SE +/- 1.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 263 264 262 260 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 40 80 120 160 200 SE +/- 0.67, N = 3 195 198 197 198 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 300K 600K 900K 1200K 1500K SE +/- 1083.24, N = 5 SE +/- 1890.20, N = 5 SE +/- 1307.98, N = 5 SE +/- 1357.03, N = 5 1373514 1380406 1388054 1368057 -march=skylake 1. (CC) gcc options: -O3 -march=native
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 12.96 12.78 12.80 12.79 -march=skylake 1. (CXX) g++ options: -O3 -lpthread
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.522 1.044 1.566 2.088 2.61 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 2.30 2.29 2.29 2.32 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Cpuminer-Opt Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: sha256t -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30K 60K 90K 120K 150K SE +/- 661.72, N = 3 SE +/- 240.62, N = 3 SE +/- 1469.65, N = 6 SE +/- 3.33, N = 3 124567 124030 124672 125653 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.5423 1.0846 1.6269 2.1692 2.7115 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.41 2.38 2.38 2.38 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 15.99 15.80 15.84 15.88 -march=skylake 1. (CC) gcc options: -pthread -fvisibility=hidden -O3
PolyBench-C PolyBench-C is a C-language polyhedral benchmark suite made at the Ohio State University. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: 3 Matrix Multiplications -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.594 1.188 1.782 2.376 2.97 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.63 2.63 2.61 2.64 -march=skylake 1. (CC) gcc options: -O3 -march=native
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Decompression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20 40 60 80 100 108 107 107 108 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
FFmpeg This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 4.0.2 H.264 HD To NTSC DV -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 1.2375 2.475 3.7125 4.95 6.1875 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 5.45 5.47 5.50 5.45 -march=skylake 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shm -lxcb-shape -lxcb-xfixes -pthread -lSDL2 -lbz2 -O3 -std=c11 -fomit-frame-pointer -fPIC -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
7-Zip Compression This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20K 40K 60K 80K 100K SE +/- 174.10, N = 3 SE +/- 414.93, N = 3 SE +/- 264.40, N = 3 SE +/- 201.98, N = 3 100416 99960 99546 100157 1. (CXX) g++ options: -pipe -lpthread
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 20.66 20.74 20.84 20.79 MIN: 19.73 MIN: 19.8 MIN: 19.89 -march=skylake - MIN: 19.84 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Append -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20K 40K 60K 80K 100K SE +/- 290.53, N = 3 SE +/- 182.38, N = 3 SE +/- 510.45, N = 3 SE +/- 495.69, N = 3 77932 77703 78021 78294 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Timed PHP Compilation This test times how long it takes to build PHP 5 with the Zend engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.1.9 Time To Compile -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 52.19 51.90 51.84 51.80 -march=skylake 1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30 60 90 120 150 SE +/- 1.04, N = 3 SE +/- 1.37, N = 3 SE +/- 1.54, N = 3 SE +/- 1.19, N = 3 140 140 140 139 -march=skylake 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: CAST-256 - Decrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30 60 90 120 150 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 146 146 146 145 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 10M 20M 30M 40M 50M SE +/- 311424.12, N = 3 SE +/- 342455.58, N = 3 SE +/- 146682.07, N = 3 SE +/- 407607.78, N = 3 46997822 47046623 46810779 47112136 -march=skylake 1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Set -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 16K 32K 48K 64K 80K SE +/- 141.11, N = 3 SE +/- 384.96, N = 3 SE +/- 429.23, N = 3 SE +/- 335.26, N = 3 72940 72688 73045 72585 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 6 12 18 24 30 SE +/- 0.22, N = 3 SE +/- 0.15, N = 3 SE +/- 0.17, N = 3 SE +/- 0.10, N = 3 23.07 22.93 23.01 23.03 MIN: 22.7 MIN: 22.61 MIN: 22.68 -march=skylake - MIN: 22.69 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
Xsbench XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Lookups/s, More Is Better Xsbench 2017-07-06 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 5240.08, N = 3 SE +/- 24818.37, N = 3 SE +/- 3451.22, N = 3 SE +/- 5774.93, N = 3 5166043 5140078 5171354 5164164 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: FT.B -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 2K 4K 6K 8K 10K SE +/- 26.14, N = 3 SE +/- 39.71, N = 3 SE +/- 63.29, N = 3 SE +/- 11.85, N = 3 9387 9331 9337 9376 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 7 14 21 28 35 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 31.29 31.11 31.15 31.17 -march=skylake 1. (CC) gcc options: -lm -O3
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.15, N = 3 68.18 68.26 68.35 68.54 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Twofish - Decrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 90 180 270 360 450 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 SE +/- 0.13, N = 3 SE +/- 1.58, N = 3 395 395 395 393 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
CLOMP CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Speedup, More Is Better CLOMP 3.3 Static OMP Speedup -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.477 0.954 1.431 1.908 2.385 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 2.12 2.12 2.11 2.12 1. (CC) gcc options: --openmp -O3 -lm
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 2K 4K 6K 8K 10K SE +/- 6.33, N = 3 SE +/- 5.96, N = 3 SE +/- 3.81, N = 3 SE +/- 15.77, N = 3 10424 10405 10397 10446 MIN: 10383.1 MIN: 10370.6 MIN: 10360.8 -march=skylake - MIN: 10397.8 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Compression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 90 180 270 360 450 SE +/- 1.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 428 429 429 430 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: FT.A -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 2K 4K 6K 8K 10K SE +/- 32.16, N = 3 SE +/- 16.12, N = 3 SE +/- 15.81, N = 3 SE +/- 18.45, N = 3 8543 8544 8581 8581 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2019-03-07 1080p 8-bit YUV To AV1 Video Encode -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 6 12 18 24 30 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.14, N = 3 24.43 24.34 24.37 24.44 -march=skylake 1. (CXX) g++ options: -O3 -pie -lpthread -lm
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Compression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 50 100 150 200 250 SE +/- 0.33, N = 3 249 249 249 248 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 160 320 480 640 800 SE +/- 0.64, N = 3 SE +/- 0.50, N = 3 SE +/- 1.26, N = 3 SE +/- 0.92, N = 3 751 751 749 752 MIN: 745 MIN: 744.39 MIN: 741.69 -march=skylake - MIN: 744 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Blowfish - Decrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 0.03, N = 3 SE +/- 0.18, N = 3 SE +/- 0.51, N = 3 SE +/- 0.34, N = 3 294 293 294 293 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: KASUMI - Decrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 95.30 95.17 95.26 95.01 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Twofish - Encrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 90 180 270 360 450 SE +/- 0.17, N = 3 SE +/- 0.34, N = 3 SE +/- 0.12, N = 3 SE +/- 1.99, N = 3 401 401 401 400 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: KASUMI - Encrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 96.25 96.21 96.22 96.02 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 16.98 16.95 16.99 16.96 1. (CXX) g++ options: -O2 -lOpenCL
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Compression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 100 200 300 400 500 SE +/- 0.58, N = 3 452 452 451 451 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
LuaJIT This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Composite -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 300 600 900 1200 1500 SE +/- 1.35, N = 3 SE +/- 5.01, N = 3 SE +/- 0.83, N = 3 SE +/- 2.67, N = 3 1608 1607 1605 1606 -march=skylake 1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Decompression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 300 600 900 1200 1500 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 1191 1191 1190 1189 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 400 800 1200 1600 2000 SE +/- 0.68, N = 3 SE +/- 0.28, N = 3 SE +/- 0.22, N = 3 SE +/- 0.89, N = 3 1819 1819 1820 1822 MIN: 1815.94 MIN: 1816.37 MIN: 1817.59 -march=skylake - MIN: 1818.58 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Decompression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 130 260 390 520 650 SE +/- 0.58, N = 3 SE +/- 1.00, N = 3 SE +/- 1.00, N = 3 621 620 620 620 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Crafty This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 2M 4M 6M 8M 10M SE +/- 7719.64, N = 3 SE +/- 7027.53, N = 3 SE +/- 5800.58, N = 3 SE +/- 4114.60, N = 3 8840945 8845689 8840395 8852846 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
CppPerformanceBenchmarks CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Atol -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 57.06 57.00 57.07 57.08 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Decompression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 1138 1137 1138 1137 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
m-queens A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.2 Time To Solve -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 48.41 48.40 48.42 48.42 -march=skylake 1. (CXX) g++ options: -fopenmp -O3 -O2 -march=native
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: CAST-256 - Encrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 0.18, N = 3 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 145 145 145 145 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Blowfish - Encrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 295 295 295 295 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
BLAKE2 This is a benchmark of BLAKE2 using the blake2s binary. BLAKE2 is a high-performance crypto alternative to MD5 and SHA-2/3. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20170307 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.9045 1.809 2.7135 3.618 4.5225 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.02 4.02 4.02 4.02 1. (CC) gcc options: -O3 -march=native -lcrypto -lz
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Compression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 9 18 27 36 45 39 39 39 39 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NGINX Benchmark This is a test of ab, which is the Apache Benchmark program running against nginx. This test profile measures how many requests per second a given system can sustain when carrying out 2,000,000 requests with 500 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.9.9 Static Web Page Serving -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 11K 22K 33K 44K 55K SE +/- 88.86, N = 3 SE +/- 314.76, N = 3 SE +/- 834.73, N = 15 SE +/- 643.64, N = 15 44072 49221 46938 47570 -march=skylake 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Get -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30K 60K 90K 120K 150K SE +/- 1768.29, N = 15 SE +/- 1645.69, N = 15 SE +/- 876.94, N = 3 SE +/- 1600.60, N = 15 112844 117621 107497 117212 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 500K 1000K 1500K 2000K 2500K SE +/- 44027.91, N = 15 SE +/- 25293.75, N = 3 SE +/- 38284.96, N = 15 SE +/- 42954.48, N = 15 2135179 1971745 2189741 2163293 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 600K 1200K 1800K 2400K 3000K SE +/- 58238.45, N = 15 SE +/- 56848.49, N = 15 SE +/- 54716.38, N = 15 SE +/- 45086.74, N = 15 3002750 2944968 2992962 2958563 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 400K 800K 1200K 1600K 2000K SE +/- 36075.54, N = 15 SE +/- 32556.93, N = 15 SE +/- 36055.36, N = 15 SE +/- 34258.37, N = 15 2026656 2013442 1962269 2059237 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 600K 1200K 1800K 2400K 3000K SE +/- 50359.83, N = 15 SE +/- 56601.20, N = 15 SE +/- 20507.83, N = 3 SE +/- 33796.90, N = 15 2383273 2449355 2624995 2339849 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 700K 1400K 2100K 2800K 3500K SE +/- 59550.07, N = 15 SE +/- 61579.63, N = 15 SE +/- 71217.78, N = 12 SE +/- 55290.27, N = 15 3059844 3139485 3095145 3140283 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: EP.C -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 500 1000 1500 2000 2500 SE +/- 14.49, N = 3 SE +/- 6.91, N = 3 SE +/- 9.18, N = 3 SE +/- 71.82, N = 12 2113 2109 2090 2038 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
-O3 -march=skylake Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 10 May 2019 15:30 by user phoronix.
-O3 -march=skylake-avx512 -mprefer-vector-width=128 Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 11 May 2019 15:58 by user phoronix.
-O3 -march=skylake-avx512 -mprefer-vector-width=256 Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 11 May 2019 21:26 by user phoronix.
-O3 -march=skylake-avx512 -mprefer-vector-width=512 Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads), Motherboard: ASUS PRIME X299-A (1704 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 16384MB, Disk: Samsung SSD 970 EVO 500GB, Graphics: NVIDIA NV120 12GB, Audio: Realtek ALC1220, Monitor: ASUS PB278, Network: Intel I219-V
OS: Clear Linux OS 29350, Kernel: 5.0.14-753.native (x86_64), Desktop: GNOME Shell 3.32.1, Display Server: X Server 1.20.4, Display Driver: nouveau 1.0.16, OpenGL: 4.3 Mesa 19.1.0-devel, Compiler: GCC 9.1.1 20190509 gcc-9-branch@271025 + clang (Clear Linux OS for Intel Architecture) 9.1.1 20190509 gcc-9-branch@271025 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 2560x1440
Environment Notes: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"Compiler Notes: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswellProcessor Notes: Scaling Governor: intel_pstate performancePython Notes: Python 3.7.3Security Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Testing initiated at 11 May 2019 10:39 by user phoronix.