Intel Core i7 GCC Icelake Compiler Testing

Intel Core i7-1065G7 GCC compiler tuning benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1910237-HU-ICELAKECO20
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
-O3 -march=skylake
October 22 2019
  2 Hours, 12 Minutes
-O3 -march=skylake-avx512
October 23 2019
  2 Hours, 15 Minutes
-O3 -march=icelake-client
October 22 2019
  2 Hours, 14 Minutes
Invert Behavior (Only Show Selected Data)
  2 Hours, 14 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel Core i7 GCC Icelake Compiler TestingOpenBenchmarking.orgPhoronix Test SuiteIntel Core i7-1065G7 @ 3.90GHz (4 Cores / 8 Threads)Dell 06CDVY (1.0.9 BIOS)Intel Device 34ef16384MBKBG40ZPZ512G NVMe TOSHIBA 512GBIntel Iris Plus 3GB (1100MHz)Realtek ALC289Intel Device 34f0Clear Linux OS 313405.3.6-850.native (x86_64)GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.6 Mesa 19.3.0-devel1.1.102GCC 9.2.1 20191017 gcc-9-branch@277087 + Clang 9.0.0 + LLVM 9.0.0ext41920x1200ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionIntel Core I7 GCC Icelake Compiler Testing BenchmarksSystem Logs- -O3 -march=skylake: CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags" FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags" CXXFLAGS="-O3 -march=skylake" MESA_GLSL_CACHE_DISABLE=0 CFLAGS="-O3 -march=skylake" THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx"" - -O3 -march=skylake-avx512: CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags" FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags" CXXFLAGS="-O3 -march=skylake-avx512" MESA_GLSL_CACHE_DISABLE=0 CFLAGS="-O3 -march=skylake-avx512" THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx"" - -O3 -march=icelake-client: CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags" FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags" CXXFLAGS="-O3 -march=icelake-client" MESA_GLSL_CACHE_DISABLE=0 CFLAGS="-O3 -march=icelake-client" THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx"" - --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell - Scaling Governor: intel_pstate performance- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling

-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-clientResult OverviewPhoronix Test Suite100%103%105%108%110%SmallptSVT-HEVCTimed HMMer SearchGraphicsMagickC-Raydav1dTimed MAFFT AlignmentZstd CompressionOpenSSLSVT-VP9AOBenchTimed MrBayes AnalysisQMCPACKFFTWACES DGEMMRedisSciMarkHimeno BenchmarkSQLite SpeedtestASKAPminiFE

Intel Core i7 GCC Icelake Compiler Testingfftw: Float + SSE - 2D FFT Size 4096mt-dgemm: Sustained Floating-Point Ratec-ray: Total Time - 4K, 16 Rays Per Pixeldav1d: Chimera 1080pqmcpack: mrbayes: Primate Phylogeny Analysisaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingscimark2: Compositeopenssl: RSA 4096-bit Performancedav1d: Summer Nature 1080pminife: Smallgraphics-magick: Swirlgraphics-magick: Sharpengraphics-magick: Noise-Gaussiangraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Rotategraphics-magick: HWB Color Spacehimeno: Poisson Pressure Solversqlite-speedtest: Timed Time - Size 1,000svt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-hevc: 1080p 8-bit YUV To HEVC Video Encodesvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pcompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19smallpt: Global Illumination Renderer; 128 Samplesaobench: 2048 x 2048 - Total Timeredis: GEThmmer: Pfam Database Searchmafft: Multiple Sequence Alignmentaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingfftw: Float + SSE - 1D FFT Size 4096redis: SETfftw: Float + SSE - 1D FFT Size 32scimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carlo-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client192971.03188.30272.24488.62138.31970.381165.702636.43842.50256.983961.7518256837330710826194037.9756.5153.7214.4573.3473.3537.2632.1227.723062487.885.924.371570.371196.17664422111347.87188501984.937064.022804.38418.95909.89185151.01203.08255.22482.29141.75968.081156.482702.45816.19243.833973.0917349746628210426034013.2356.4552.8713.1671.0570.3738.4535.3326.983125565.176.504.621589.141179.93664972077912.88201161983.557464.722746.41420.70924.95190771.01201.58256.32476.93141.39971.541168.612661.03823.83248.913963.1617849746628110456003999.9356.1653.5013.2772.3071.4838.4635.1027.083136507.886.414.461605.951169.76659422127678.92204311985.647398.032760.02416.91944.22OpenBenchmarking.org

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client4K8K12K16K20KSE +/- 40.07, N = 3SE +/- 75.86, N = 3SE +/- 258.01, N = 3192971851519077-march=skylake1. (CC) gcc options: -pthread -O3 -lm

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client0.23180.46360.69540.92721.159SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.031.011.01-march=skylake1. (CC) gcc options: -O3 -march=native -fopenmp

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client4080120160200SE +/- 0.33, N = 3SE +/- 0.50, N = 3SE +/- 0.43, N = 3188.30203.08201.58-march=skylake1. (CC) gcc options: -lm -lpthread -O3

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080p-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client60120180240300SE +/- 4.08, N = 14SE +/- 2.57, N = 8SE +/- 2.35, N = 10272.24255.22256.32-march=skylake - MIN: 171.75 / MAX: 530.02MIN: 160.66 / MAX: 513.75MIN: 161.83 / MAX: 514.821. (CC) gcc options: -O3 -pthread

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.8-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client110220330440550488.62482.29476.93-march=skylake1. (CXX) g++ options: -O3 -fopenmp -fomit-frame-pointer -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -lm

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client306090120150SE +/- 0.47, N = 3SE +/- 0.44, N = 3SE +/- 0.63, N = 3138.31141.75141.39-march=skylake1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - Degridding-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client2004006008001000SE +/- 0.51, N = 13SE +/- 0.41, N = 14SE +/- 0.74, N = 13970.38968.08971.541. (CXX) g++ options: -lpthread

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - Gridding-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client30060090012001500SE +/- 14.23, N = 13SE +/- 13.86, N = 14SE +/- 14.13, N = 131165.701156.481168.611. (CXX) g++ options: -lpthread

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client6001200180024003000SE +/- 14.06, N = 3SE +/- 22.76, N = 15SE +/- 22.77, N = 152636.432702.452661.03-march=skylake1. (CC) gcc options: -O3 -lm

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client2004006008001000SE +/- 8.22, N = 13SE +/- 8.03, N = 13SE +/- 7.87, N = 14842.50816.19823.83-march=skylake1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 1080p-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client60120180240300SE +/- 2.55, N = 15SE +/- 2.30, N = 15SE +/- 2.53, N = 13256.98243.83248.91-march=skylake - MIN: 201.76 / MAX: 366.26MIN: 192.08 / MAX: 352.38MIN: 195.33 / MAX: 354.421. (CC) gcc options: -O3 -pthread

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Small-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client9001800270036004500SE +/- 1.29, N = 3SE +/- 9.63, N = 3SE +/- 13.06, N = 33961.753973.093963.161. (CXX) g++ options: -march=native -O3 -fopenmp -pthread -lmpi

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirl-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client4080120160200SE +/- 2.67, N = 3SE +/- 2.29, N = 4SE +/- 2.19, N = 3182173178-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1326395265SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3564949-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussian-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client20406080100SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.88, N = 3837474-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1632486480SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 3736666-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client70140210280350SE +/- 2.19, N = 3SE +/- 2.33, N = 3SE +/- 2.85, N = 3307282281-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client2004006008001000SE +/- 6.39, N = 3SE +/- 3.84, N = 3SE +/- 3.21, N = 3108210421045-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Space-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client130260390520650SE +/- 5.17, N = 3SE +/- 5.03, N = 3SE +/- 4.48, N = 3619603600-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client9001800270036004500SE +/- 7.49, N = 3SE +/- 8.62, N = 3SE +/- 26.07, N = 34037.974013.233999.93-march=skylake1. (CC) gcc options: -O3 -mavx2

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1326395265SE +/- 0.24, N = 3SE +/- 0.16, N = 3SE +/- 0.21, N = 356.5156.4556.16-march=skylake1. (CC) gcc options: -O3 -ldl -lz -lpthread

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1224364860SE +/- 0.58, N = 13SE +/- 0.57, N = 13SE +/- 0.50, N = 1453.7252.8753.50-march=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.4.11080p 8-bit YUV To HEVC Video Encode-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client48121620SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 314.4513.1613.27-march=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080p-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1632486480SE +/- 1.29, N = 13SE +/- 1.36, N = 15SE +/- 1.52, N = 1373.3471.0572.30-march=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1632486480SE +/- 1.34, N = 12SE +/- 1.23, N = 15SE +/- 1.21, N = 1373.3570.3771.48-march=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client918273645SE +/- 0.35, N = 3SE +/- 0.28, N = 3SE +/- 0.44, N = 337.2638.4538.46-march=skylake1. (CC) gcc options: -O3 -pthread -lz

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client816243240SE +/- 0.42, N = 3SE +/- 0.53, N = 3SE +/- 0.53, N = 332.1235.3335.10-march=skylake1. (CXX) g++ options: -fopenmp -O3

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client714212835SE +/- 0.40, N = 3SE +/- 0.13, N = 3SE +/- 0.32, N = 327.7226.9827.08-march=skylake1. (CC) gcc options: -lm -O3

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: GET-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client700K1400K2100K2800K3500KSE +/- 39424.15, N = 15SE +/- 29603.21, N = 3SE +/- 41757.69, N = 43062487.883125565.173136507.88-march=skylake1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client246810SE +/- 0.12, N = 12SE +/- 0.13, N = 12SE +/- 0.14, N = 125.926.506.41-march=skylake1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence Alignment-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client1.03952.0793.11854.1585.1975SE +/- 0.11, N = 15SE +/- 0.08, N = 15SE +/- 0.09, N = 154.374.624.461. (CC) gcc options: -std=c99 -O3 -lm -lpthread

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - Degridding-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client30060090012001500SE +/- 99.01, N = 3SE +/- 74.96, N = 3SE +/- 58.16, N = 41570.371589.141605.951. (CXX) g++ options: -lpthread

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - Gridding-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client30060090012001500SE +/- 15.55, N = 3SE +/- 6.26, N = 3SE +/- 16.55, N = 41196.171179.931169.761. (CXX) g++ options: -lpthread

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client14K28K42K56K70KSE +/- 243.86, N = 3SE +/- 91.35, N = 3SE +/- 609.76, N = 3664426649765942-march=skylake1. (CC) gcc options: -pthread -O3 -lm

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SET-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client500K1000K1500K2000K2500KSE +/- 12926.88, N = 3SE +/- 19132.53, N = 3SE +/- 4536.67, N = 32111347.872077912.882127678.92-march=skylake1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client4K8K12K16K20KSE +/- 288.49, N = 3SE +/- 214.23, N = 15SE +/- 246.13, N = 3188502011620431-march=skylake1. (CC) gcc options: -pthread -O3 -lm

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client400800120016002000SE +/- 0.66, N = 3SE +/- 1.67, N = 3SE +/- 0.30, N = 31984.931983.551985.64-march=skylake1. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client16003200480064008000SE +/- 43.22, N = 3SE +/- 369.43, N = 3SE +/- 448.41, N = 37064.027464.727398.03-march=skylake1. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client6001200180024003000SE +/- 29.57, N = 3SE +/- 22.31, N = 3SE +/- 12.05, N = 32804.382746.412760.02-march=skylake1. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client90180270360450SE +/- 0.96, N = 3SE +/- 1.43, N = 3SE +/- 2.63, N = 3418.95420.70416.91-march=skylake1. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=icelake-client2004006008001000SE +/- 8.67, N = 3SE +/- 15.75, N = 3SE +/- 0.62, N = 3909.89924.95944.22-march=skylake1. (CC) gcc options: -O3 -lm