Intel Core i7 GCC Icelake Compiler Testing

Intel Core i7-1065G7 GCC compiler tuning benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1910237-HU-ICELAKECO20&grs&sor.

Runs: -O3 -march=skylake, -O3 -march=skylake-avx512, -O3 -march=icelake-client

Processor: Intel Core i7-1065G7 @ 3.90GHz (4 Cores / 8 Threads)
Motherboard: Dell 06CDVY (1.0.9 BIOS)
Chipset: Intel Device 34ef
Memory: 16384MB
Disk: KBG40ZPZ512G NVMe TOSHIBA 512GB
Graphics: Intel Iris Plus 3GB (1100MHz)
Audio: Realtek ALC289
Network: Intel Device 34f0
OS: Clear Linux OS 31340
Kernel: 5.3.6-850.native (x86_64)
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 4.6 Mesa 19.3.0-devel
Vulkan: 1.1.102
Compiler: GCC 9.2.1 20191017 gcc-9-branch@277087 + Clang 9.0.0 + LLVM 9.0.0
File-System: ext4
Screen Resolution: 1920x1200

Environment Details
- -O3 -march=skylake:
  CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
  FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags"
  CXXFLAGS="-O3 -march=skylake"
  MESA_GLSL_CACHE_DISABLE=0
  CFLAGS="-O3 -march=skylake"
  THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""
- -O3 -march=skylake-avx512:
  CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
  FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags"
  CXXFLAGS="-O3 -march=skylake-avx512"
  MESA_GLSL_CACHE_DISABLE=0
  CFLAGS="-O3 -march=skylake-avx512"
  THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""
- -O3 -march=icelake-client:
  CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
  FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags"
  CXXFLAGS="-O3 -march=icelake-client"
  MESA_GLSL_CACHE_DISABLE=0
  CFLAGS="-O3 -march=icelake-client"
  THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""

Compiler Details
- --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell

Processor Details
- Scaling Governor: intel_pstate performance

Security Details
- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling

Results overview. Values are listed as -O3 -march=skylake / -O3 -march=skylake-avx512 / -O3 -march=icelake-client.

graphics-magick: Sharpen: 56 / 49 / 49
graphics-magick: Noise-Gaussian: 83 / 74 / 74
graphics-magick: Enhanced: 73 / 66 / 66
smallpt: Global Illumination Renderer; 128 Samples: 32.12 / 35.33 / 35.10
svt-hevc: 1080p 8-bit YUV To HEVC Video Encode: 14.45 / 13.16 / 13.27
graphics-magick: Resizing: 307 / 282 / 281
fftw: Float + SSE - 1D FFT Size 32: 18850 / 20116 / 20431
c-ray: Total Time - 4K, 16 Rays Per Pixel: 188.30 / 203.08 / 201.58
dav1d: Chimera 1080p: 272.24 / 255.22 / 256.32
dav1d: Summer Nature 1080p: 256.98 / 243.83 / 248.91
graphics-magick: Swirl: 182 / 173 / 178
fftw: Float + SSE - 2D FFT Size 4096: 19297 / 18515 / 19077
graphics-magick: Rotate: 1082 / 1042 / 1045
scimark2: Monte Carlo: 909.89 / 924.95 / 944.22
openssl: RSA 4096-bit Performance: 842.50 / 816.19 / 823.83
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19: 37.26 / 38.45 / 38.46
graphics-magick: HWB Color Space: 619 / 603 / 600
aobench: 2048 x 2048 - Total Time: 27.72 / 26.98 / 27.08
scimark2: Composite: 2636.43 / 2702.45 / 2661.03
mrbayes: Primate Phylogeny Analysis: 138.31 / 141.75 / 141.39
qmcpack: 488.62 / 482.29 / 476.93
redis: GET: 3062487.88 / 3125565.17 / 3136507.88
redis: SET: 2111347.87 / 2077912.88 / 2127678.92
askap: tConvolve OpenMP - Gridding: 1196.17 / 1179.93 / 1169.76
scimark2: Sparse Matrix Multiply: 2804.38 / 2746.41 / 2760.02
mt-dgemm: Sustained Floating-Point Rate: 1.03 / 1.01 / 1.01
svt-vp9: Visual Quality Optimized - Bosphorus 1080p: 53.72 / 52.87 / 53.50
askap: tConvolve MT - Gridding: 1165.70 / 1156.48 / 1168.61
himeno: Poisson Pressure Solver: 4037.97 / 4013.23 / 3999.93
scimark2: Fast Fourier Transform: 418.95 / 420.70 / 416.91
fftw: Float + SSE - 1D FFT Size 4096: 66442 / 66497 / 65942
sqlite-speedtest: Timed Time - Size 1,000: 56.51 / 56.45 / 56.16
askap: tConvolve MT - Degridding: 970.38 / 968.08 / 971.54
minife: Small: 3961.75 / 3973.09 / 3963.16
scimark2: Jacobi Successive Over-Relaxation: 1984.93 / 1983.55 / 1985.64
askap: tConvolve OpenMP - Degridding: 1570.37 / 1589.14 / 1605.95
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p: 73.35 / 70.37 / 71.48
svt-vp9: VMAF Optimized - Bosphorus 1080p: 73.34 / 71.05 / 72.30
scimark2: Dense LU Matrix Factorization: 7064.02 / 7464.72 / 7398.03
mafft: Multiple Sequence Alignment: 4.37 / 4.62 / 4.46
hmmer: Pfam Database Search: 5.92 / 6.50 / 6.41
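To compare the three builds at a glance, each result can be expressed as a percentage change relative to the -O3 -march=skylake run. The sketch below is not part of the exported result; it is a minimal Python illustration using a handful of values copied from the overview above. The helper name `percent_vs_baseline` and its `higher_is_better` flag are assumptions introduced here. The sign is flipped for timed tests so that a positive number always means the build did better than the baseline.

```python
def percent_vs_baseline(value, baseline, higher_is_better=True):
    """Return the percentage improvement of value over baseline.

    Positive means better than the baseline, negative means worse.
    (Hypothetical helper, not part of the Phoronix Test Suite.)
    """
    if higher_is_better:
        return (value - baseline) / baseline * 100.0
    # For "fewer is better" metrics (seconds), a smaller value is an improvement.
    return (baseline - value) / baseline * 100.0


# (test, higher_is_better, skylake, skylake-avx512, icelake-client)
results = [
    ("graphics-magick: Sharpen", True, 56, 49, 49),
    ("fftw: Float + SSE - 1D FFT Size 32", True, 18850, 20116, 20431),
    ("c-ray: Total Time - 4K, 16 Rays Per Pixel", False, 188.30, 203.08, 201.58),
    ("scimark2: Monte Carlo", True, 909.89, 924.95, 944.22),
]

for name, hib, skylake, avx512, icelake in results:
    print(
        f"{name}: "
        f"skylake-avx512 {percent_vs_baseline(avx512, skylake, hib):+.1f}%, "
        f"icelake-client {percent_vs_baseline(icelake, skylake, hib):+.1f}%"
    )
```

Keeping the direction of each metric next to its values avoids misreading a longer run time as a gain.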

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 56 (SE +/- 0.33, N = 3)
-O3 -march=icelake-client: 49 (SE +/- 0.33, N = 3)
-O3 -march=skylake-avx512: 49 (SE +/- 0.33, N = 3)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 83 (SE +/- 0.67, N = 3)
-O3 -march=icelake-client: 74 (SE +/- 0.88, N = 3)
-O3 -march=skylake-avx512: 74 (SE +/- 0.88, N = 3)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 73 (SE +/- 0.67, N = 3)
-O3 -march=icelake-client: 66 (SE +/- 0.67, N = 3)
-O3 -march=skylake-avx512: 66 (SE +/- 0.67, N = 3)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 - Seconds, Fewer Is Better
-O3 -march=skylake: 32.12 (SE +/- 0.42, N = 3)
-O3 -march=icelake-client: 35.10 (SE +/- 0.53, N = 3)
-O3 -march=skylake-avx512: 35.33 (SE +/- 0.53, N = 3)
Note: -march=skylake
1. (CXX) g++ options: -fopenmp -O3

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

SVT-HEVC 1.4.1 - Frames Per Second, More Is Better
-O3 -march=skylake: 14.45 (SE +/- 0.14, N = 3)
-O3 -march=icelake-client: 13.27 (SE +/- 0.10, N = 3)
-O3 -march=skylake-avx512: 13.16 (SE +/- 0.11, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 307 (SE +/- 2.19, N = 3)
-O3 -march=skylake-avx512: 282 (SE +/- 2.33, N = 3)
-O3 -march=icelake-client: 281 (SE +/- 2.85, N = 3)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW 3.3.6 - Mflops, More Is Better
-O3 -march=icelake-client: 20431 (SE +/- 246.13, N = 3)
-O3 -march=skylake-avx512: 20116 (SE +/- 214.23, N = 15)
-O3 -march=skylake: 18850 (SE +/- 288.49, N = 3)
Note: -march=skylake
1. (CC) gcc options: -pthread -O3 -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Seconds, Fewer Is Better
-O3 -march=skylake: 188.30 (SE +/- 0.33, N = 3)
-O3 -march=icelake-client: 201.58 (SE +/- 0.43, N = 3)
-O3 -march=skylake-avx512: 203.08 (SE +/- 0.50, N = 3)
Note: -march=skylake
1. (CC) gcc options: -lm -lpthread -O3

dav1d

Video Input: Chimera 1080p

dav1d 0.5.0 - FPS, More Is Better
-O3 -march=skylake: 272.24 (SE +/- 4.08, N = 14; MIN: 171.75 / MAX: 530.02)
-O3 -march=icelake-client: 256.32 (SE +/- 2.35, N = 10; MIN: 161.83 / MAX: 514.82)
-O3 -march=skylake-avx512: 255.22 (SE +/- 2.57, N = 8; MIN: 160.66 / MAX: 513.75)
Note: -march=skylake
1. (CC) gcc options: -O3 -pthread

dav1d

Video Input: Summer Nature 1080p

dav1d 0.5.0 - FPS, More Is Better
-O3 -march=skylake: 256.98 (SE +/- 2.55, N = 15; MIN: 201.76 / MAX: 366.26)
-O3 -march=icelake-client: 248.91 (SE +/- 2.53, N = 13; MIN: 195.33 / MAX: 354.42)
-O3 -march=skylake-avx512: 243.83 (SE +/- 2.30, N = 15; MIN: 192.08 / MAX: 352.38)
Note: -march=skylake
1. (CC) gcc options: -O3 -pthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 182 (SE +/- 2.67, N = 3)
-O3 -march=icelake-client: 178 (SE +/- 2.19, N = 3)
-O3 -march=skylake-avx512: 173 (SE +/- 2.29, N = 4)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6 - Mflops, More Is Better
-O3 -march=skylake: 19297 (SE +/- 40.07, N = 3)
-O3 -march=icelake-client: 19077 (SE +/- 258.01, N = 3)
-O3 -march=skylake-avx512: 18515 (SE +/- 75.86, N = 3)
Note: -march=skylake
1. (CC) gcc options: -pthread -O3 -lm

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 1082 (SE +/- 6.39, N = 3)
-O3 -march=icelake-client: 1045 (SE +/- 3.21, N = 3)
-O3 -march=skylake-avx512: 1042 (SE +/- 3.84, N = 3)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

SciMark

Computational Test: Monte Carlo

SciMark 2.0 - Mflops, More Is Better
-O3 -march=icelake-client: 944.22 (SE +/- 0.62, N = 3)
-O3 -march=skylake-avx512: 924.95 (SE +/- 15.75, N = 3)
-O3 -march=skylake: 909.89 (SE +/- 8.67, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -lm

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1 - Signs Per Second, More Is Better
-O3 -march=skylake: 842.50 (SE +/- 8.22, N = 13)
-O3 -march=icelake-client: 823.83 (SE +/- 7.87, N = 14)
-O3 -march=skylake-avx512: 816.19 (SE +/- 8.03, N = 13)
Note: -march=skylake
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

Zstd Compression 1.3.4 - Seconds, Fewer Is Better
-O3 -march=skylake: 37.26 (SE +/- 0.35, N = 3)
-O3 -march=skylake-avx512: 38.45 (SE +/- 0.28, N = 3)
-O3 -march=icelake-client: 38.46 (SE +/- 0.44, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -pthread -lz

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 - Iterations Per Minute, More Is Better
-O3 -march=skylake: 619 (SE +/- 5.17, N = 3)
-O3 -march=skylake-avx512: 603 (SE +/- 5.03, N = 3)
-O3 -march=icelake-client: 600 (SE +/- 4.48, N = 3)
Note: -march=skylake
1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

AOBench

Size: 2048 x 2048 - Total Time

AOBench - Seconds, Fewer Is Better
-O3 -march=skylake-avx512: 26.98 (SE +/- 0.13, N = 3)
-O3 -march=icelake-client: 27.08 (SE +/- 0.32, N = 3)
-O3 -march=skylake: 27.72 (SE +/- 0.40, N = 3)
Note: -march=skylake
1. (CC) gcc options: -lm -O3

SciMark

Computational Test: Composite

SciMark 2.0 - Mflops, More Is Better
-O3 -march=skylake-avx512: 2702.45 (SE +/- 22.76, N = 15)
-O3 -march=icelake-client: 2661.03 (SE +/- 22.77, N = 15)
-O3 -march=skylake: 2636.43 (SE +/- 14.06, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 - Seconds, Fewer Is Better
-O3 -march=skylake: 138.31 (SE +/- 0.47, N = 3)
-O3 -march=icelake-client: 141.39 (SE +/- 0.63, N = 3)
-O3 -march=skylake-avx512: 141.75 (SE +/- 0.44, N = 3)
Note: -march=skylake
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

QMCPACK

QMCPACK 3.8 - Total Execution Time - Seconds, Fewer Is Better
-O3 -march=icelake-client: 476.93
-O3 -march=skylake-avx512: 482.29
-O3 -march=skylake: 488.62
Note: -march=skylake
1. (CXX) g++ options: -O3 -fopenmp -fomit-frame-pointer -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -lm

Redis

Test: GET

Redis 5.0.5 - Requests Per Second, More Is Better
-O3 -march=icelake-client: 3136507.88 (SE +/- 41757.69, N = 4)
-O3 -march=skylake-avx512: 3125565.17 (SE +/- 29603.21, N = 3)
-O3 -march=skylake: 3062487.88 (SE +/- 39424.15, N = 15)
Note: -march=skylake
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

Redis 5.0.5 - Requests Per Second, More Is Better
-O3 -march=icelake-client: 2127678.92 (SE +/- 4536.67, N = 3)
-O3 -march=skylake: 2111347.87 (SE +/- 12926.88, N = 3)
-O3 -march=skylake-avx512: 2077912.88 (SE +/- 19132.53, N = 3)
Note: -march=skylake
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASKAP

Test: tConvolve OpenMP - Gridding

ASKAP 2018-11-10 - Million Grid Points Per Second, More Is Better
-O3 -march=skylake: 1196.17 (SE +/- 15.55, N = 3)
-O3 -march=skylake-avx512: 1179.93 (SE +/- 6.26, N = 3)
-O3 -march=icelake-client: 1169.76 (SE +/- 16.55, N = 4)
1. (CXX) g++ options: -lpthread

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 - Mflops, More Is Better
-O3 -march=skylake: 2804.38 (SE +/- 29.57, N = 3)
-O3 -march=icelake-client: 2760.02 (SE +/- 12.05, N = 3)
-O3 -march=skylake-avx512: 2746.41 (SE +/- 22.31, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -lm

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 - GFLOP/s, More Is Better
-O3 -march=skylake: 1.03 (SE +/- 0.01, N = 3)
-O3 -march=icelake-client: 1.01 (SE +/- 0.01, N = 3)
-O3 -march=skylake-avx512: 1.01 (SE +/- 0.01, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -march=native -fopenmp

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 - Frames Per Second, More Is Better
-O3 -march=skylake: 53.72 (SE +/- 0.58, N = 13)
-O3 -march=icelake-client: 53.50 (SE +/- 0.50, N = 14)
-O3 -march=skylake-avx512: 52.87 (SE +/- 0.57, N = 13)
Note: -march=skylake
1. (CC) gcc options: -O3 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

ASKAP

Test: tConvolve MT - Gridding

ASKAP 2018-11-10 - Million Grid Points Per Second, More Is Better
-O3 -march=icelake-client: 1168.61 (SE +/- 14.13, N = 13)
-O3 -march=skylake: 1165.70 (SE +/- 14.23, N = 13)
-O3 -march=skylake-avx512: 1156.48 (SE +/- 13.86, N = 14)
1. (CXX) g++ options: -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 - MFLOPS, More Is Better
-O3 -march=skylake: 4037.97 (SE +/- 7.49, N = 3)
-O3 -march=skylake-avx512: 4013.23 (SE +/- 8.62, N = 3)
-O3 -march=icelake-client: 3999.93 (SE +/- 26.07, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -mavx2

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 - Mflops, More Is Better
-O3 -march=skylake-avx512: 420.70 (SE +/- 1.43, N = 3)
-O3 -march=skylake: 418.95 (SE +/- 0.96, N = 3)
-O3 -march=icelake-client: 416.91 (SE +/- 2.63, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

FFTW 3.3.6 - Mflops, More Is Better
-O3 -march=skylake-avx512: 66497 (SE +/- 91.35, N = 3)
-O3 -march=skylake: 66442 (SE +/- 243.86, N = 3)
-O3 -march=icelake-client: 65942 (SE +/- 609.76, N = 3)
Note: -march=skylake
1. (CC) gcc options: -pthread -O3 -lm

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 - Seconds, Fewer Is Better
-O3 -march=icelake-client: 56.16 (SE +/- 0.21, N = 3)
-O3 -march=skylake-avx512: 56.45 (SE +/- 0.16, N = 3)
-O3 -march=skylake: 56.51 (SE +/- 0.24, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -ldl -lz -lpthread

ASKAP

Test: tConvolve MT - Degridding

ASKAP 2018-11-10 - Million Grid Points Per Second, More Is Better
-O3 -march=icelake-client: 971.54 (SE +/- 0.74, N = 13)
-O3 -march=skylake: 970.38 (SE +/- 0.51, N = 13)
-O3 -march=skylake-avx512: 968.08 (SE +/- 0.41, N = 14)
1. (CXX) g++ options: -lpthread

miniFE

Problem Size: Small

miniFE 2.2 - CG Mflops, More Is Better
-O3 -march=skylake-avx512: 3973.09 (SE +/- 9.63, N = 3)
-O3 -march=icelake-client: 3963.16 (SE +/- 13.06, N = 3)
-O3 -march=skylake: 3961.75 (SE +/- 1.29, N = 3)
1. (CXX) g++ options: -march=native -O3 -fopenmp -pthread -lmpi

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 - Mflops, More Is Better
-O3 -march=icelake-client: 1985.64 (SE +/- 0.30, N = 3)
-O3 -march=skylake: 1984.93 (SE +/- 0.66, N = 3)
-O3 -march=skylake-avx512: 1983.55 (SE +/- 1.67, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -lm

ASKAP

Test: tConvolve OpenMP - Degridding

ASKAP 2018-11-10 - Million Grid Points Per Second, More Is Better
-O3 -march=icelake-client: 1605.95 (SE +/- 58.16, N = 4)
-O3 -march=skylake-avx512: 1589.14 (SE +/- 74.96, N = 3)
-O3 -march=skylake: 1570.37 (SE +/- 99.01, N = 3)
1. (CXX) g++ options: -lpthread

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 - Frames Per Second, More Is Better
-O3 -march=skylake: 73.35 (SE +/- 1.34, N = 12)
-O3 -march=icelake-client: 71.48 (SE +/- 1.21, N = 13)
-O3 -march=skylake-avx512: 70.37 (SE +/- 1.23, N = 15)
Note: -march=skylake
1. (CC) gcc options: -O3 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 - Frames Per Second, More Is Better
-O3 -march=skylake: 73.34 (SE +/- 1.29, N = 13)
-O3 -march=icelake-client: 72.30 (SE +/- 1.52, N = 13)
-O3 -march=skylake-avx512: 71.05 (SE +/- 1.36, N = 15)
Note: -march=skylake
1. (CC) gcc options: -O3 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 - Mflops, More Is Better
-O3 -march=skylake-avx512: 7464.72 (SE +/- 369.43, N = 3)
-O3 -march=icelake-client: 7398.03 (SE +/- 448.41, N = 3)
-O3 -march=skylake: 7064.02 (SE +/- 43.22, N = 3)
Note: -march=skylake
1. (CC) gcc options: -O3 -lm
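The standard errors printed with each result give a rough way to judge whether a gap between two builds is larger than run-to-run noise. As a hedged sketch (not something the exported result itself computes), the Python below combines two standard errors in quadrature and flags a difference only when it exceeds a multiple of that combined value. The function name `clearly_different` and the default factor `k=2.0` are assumptions chosen for illustration, and the check treats the runs as independent.

```python
import math


def clearly_different(mean_a, se_a, mean_b, se_b, k=2.0):
    """Return True if two means differ by more than k combined standard errors.

    Hypothetical helper: the threshold k is an illustrative assumption,
    not a rule used by the Phoronix Test Suite.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) > k * combined_se


# SciMark 2.0 Dense LU: -march=skylake-avx512 vs -march=skylake.
dense_lu = clearly_different(7464.72, 369.43, 7064.02, 43.22)

# GraphicsMagick Sharpen: -march=skylake vs -march=icelake-client.
sharpen = clearly_different(56, 0.33, 49, 0.33)

print("Dense LU gap exceeds noise:", dense_lu)
print("Sharpen gap exceeds noise:", sharpen)
```

With these inputs the Dense LU gap stays inside the noise band while the Sharpen gap does not, which is why large standard errors on a three-run sample deserve a second look before drawing conclusions.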

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 7.392 - Seconds, Fewer Is Better
-O3 -march=skylake: 4.37 (SE +/- 0.11, N = 15)
-O3 -march=icelake-client: 4.46 (SE +/- 0.09, N = 15)
-O3 -march=skylake-avx512: 4.62 (SE +/- 0.08, N = 15)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 - Seconds, Fewer Is Better
-O3 -march=skylake: 5.92 (SE +/- 0.12, N = 12)
-O3 -march=icelake-client: 6.41 (SE +/- 0.14, N = 12)
-O3 -march=skylake-avx512: 6.50 (SE +/- 0.13, N = 12)
Note: -march=skylake
1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm


Phoronix Test Suite v10.8.5