GCC 9 Skylake-AVX512 Tests

Intel Core i9-7980XE testing with a ASUS PRIME X299-A (1602 BIOS) and NVIDIA NV120 12GB on Ubuntu 18.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1902216-PTS-GCC9SKYL42&gru&sor.

GCC 9 Skylake-AVX512 TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O0-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx512Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)ASUS PRIME X299-A (1602 BIOS)Intel Sky Lake-E DMI3 Registers16384MBSamsung SSD 970 EVO 500GBNVIDIA NV120 12GBRealtek ALC1220ASUS PB278Intel I219-VUbuntu 18.104.18.0-15-generic (x86_64)GNOME Shell 3.30.1X Server 1.20.1modesetting 1.20.14.3 Mesa 18.2.2GCC 9.0.1 20190217ext42560x1440OpenBenchmarking.orgEnvironment Details- -O0: CXXFLAGS=-O0 CFLAGS=-O0- -O3 -march=x86-64: CXXFLAGS=-O3-march=x86-64 CFLAGS=-O3-march=x86-64- -O3 -march=skylake: CXXFLAGS=-O3-march=skylake CFLAGS=-O3-march=skylake- -O3 -march=skylake-avx512: CXXFLAGS=-O3-march=skylake-avx512 CFLAGS=-O3-march=skylake-avx512Compiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: intel_pstate powersavePython Details- Python 2.7.15+ + Python 3.6.7Security Details- KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW STIBP + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable

GCC 9 Skylake-AVX512 Testssysbench: Memorysysbench: CPUaom-av1: AV1 Video Encodingsvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodesvt-vp9: 1080p 8-bit YUV To VP9 Video Encodevpxenc: vpxenc VP9 1080p Video Encodex264: H.264 Video Encodingx265: H.265 1080p Video Encodinggraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacexsbench: byte: Dhrystone 2byte: Integer Arithmeticbyte: Register Arithmeticbyte: Floating-Point Arithmetictjbench: Decompression Throughputfftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 64fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 64fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 2D FFT Size 4096scimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationhimeno: Poisson Pressure Solvercompress-7zip: Compress Speed Teststockfish: Total Timehint: FLOAThint: DOUBLEjohn-the-ripper: Blowfishjohn-the-ripper: Traditional DESjohn-the-ripper: MD5ebizzy: redis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETnginx: Static Web Page Servingapache: Static Web Page Servingopenssl: RSA 4096-bit Performancenpb: BT.Anpb: EP.Cnpb: FT.Anpb: FT.Bnpb: LU.Anpb: LU.Cnpb: SP.Apgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writeapache-siege: 250ctx-clock: Context Switch Timemultichase: 4MB Array, 64 Byte Stridemultichase: 1GB Array, 256 Byte Stridemultichase: 256MB Array, 256 Byte Stridemultichase: 1GB Array, 256 Byte Stride, 2 Threadsmultichase: 1GB Array, 256 Byte Stride, 4 Threadst-test1: 1t-test1: 2parboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingcloverleaf: Lagrangian-Eulerian Hydrodynamicsrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterhmmer: Pfam Database Searchmafft: Multiple Sequence Alignmentbuild-apache: Time To Compilebuild-gcc: Time To Compilebuild-imagemagick: Time To Compilebuild-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelcompress-pbzip2: 256MB File Compressionaobench: 2048 x 2048 - Total Timebullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19dav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdcraw: RAW To PPM Image Conversionencode-flac: WAV To FLACencode-mp3: WAV To MP3m-queens: Time To Solvemencoder: AVI To LAVCn-queens: Elapsed Time-O0-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx5127152403253340.228.5330730622.9787.3438.8113015180101141103152450904739431843111151301434243438333521713421325031463640322670315333779798412434689377045894946436365279981051255563999444000893520569980219668716295671417088207079215046613331122853457846541178633370423606833583408930105212257687414008.2763.9059.9766.7670.5452.4818.1173.012.406.931562.4126.1912.3214.099.302.4713.248866.6716.65127.422.4072.372.334.124.544.022.570.891.05119.1219.5849.4723.7239.0458.0927.4548.3832.248.4119443592350110.228.5231130622.9912239.10220252176201245164260450879039354548111185107019034112777431583325169477733416577916623084935924622016243647735198186143128966100829667902217565019231364416260541470671202901115343243438123270469346361153645870513598433535413045795512198425824038.3464.4559.7967.1970.5550.0917.1772.982.396.911552.4226.3112.3814.1614.482.4824.0596027.1163.2942.432.3937.682.344.104.534.012.560.891.0585.4210.3926.9513.8911.5610.068.8248.4016.298.4019327199325960.248.3830931523.1412339.30229256189217254171266450311138166612111194135741231813854109176475167732008138683378211913227529707913495628422193315934084629901443644611898499671228950100709667901612589046221841516450821427923209280315144253444423505469546411192638870493607733551411046006112143378453998.3264.1660.0066.3170.3649.8317.0973.272.386.971572.4125.7412.3314.0714.162.3823.7495227.2263.2631.232.3634.922.273.844.073.822.320.841.0084.9510.6626.3913.6510.839.109.3648.3816.028.4019353234338690.258.4830931325.6012339.29226257189211252170265450027538455237111197152281405715119107866744182421985646170424671944027519717833478630122203316935044631847843720277898170059929031100746333885449567481223297916403181399734206842415113403397224212469346471198650970373619033175414745937012200419733978.3364.1159.5767.0670.4749.9617.0773.002.416.941542.4225.9412.3114.1114.602.4223.9796127.2763.5733.942.4134.922.613.824.063.852.310.841.0083.9510.4926.7513.8210.799.029.4148.3716.238.40OpenBenchmarking.org

Sysbench

Test: Memory

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: Memory-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake-O04M8M12M16M20MSE +/- 25107.25, N = 3SE +/- 65309.84, N = 3SE +/- 50006.95, N = 3SE +/- 11654.91, N = 31944359219353234193271997152403-march=x86-64-march=skylake-O0 -march=core21. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPU-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake-O07K14K21K28K35KSE +/- 6.41, N = 3SE +/- 6.42, N = 3SE +/- 1.99, N = 3SE +/- 6.46, N = 335011338693259625334-march=x86-64-march=skylake-O0 -march=core21. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -rdynamic -ldl -laio -lm

AOM AV1

AV1 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-02-11AV1 Video Encoding-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O00.05630.11260.16890.22520.2815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.250.240.220.22-march=skylake-march=x86-64-O01. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2019-02-151080p 8-bit YUV To AV1 Video Encode-O0-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake246810SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.538.528.488.381. (CC) gcc options: -mavx -fPIE -fPIC -O2 -pie -lpthread -lm

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video Encode-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake-O070140210280350SE +/- 4.38, N = 3SE +/- 2.15, N = 3SE +/- 4.28, N = 3SE +/- 3.49, N = 3311309309307-O3 -march=x86-64-O3-O3 -march=skylake-O01. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video Encode-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O070140210280350SE +/- 5.38, N = 3SE +/- 5.81, N = 3SE +/- 5.43, N = 3SE +/- 5.82, N = 3315313306306-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O0612182430SE +/- 0.38, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 325.6023.1422.9922.97-march=skylake-march=x86-64-O01. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O0306090120150SE +/- 1.22, N = 12SE +/- 1.68, N = 12SE +/- 1.88, N = 5SE +/- 1.68, N = 3123.00123.00122.0087.34-march=skylake-march=x86-64-O01. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O0918273645SE +/- 0.27, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 339.3039.2939.1038.81-march=skylake-march=x86-64-O01. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Swirl-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O050100150200250SE +/- 0.33, N = 3229226220130-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O060120180240300SE +/- 1.53, N = 3SE +/- 0.33, N = 3257256252151-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O04080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 318918917680-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Enhanced-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O050100150200250217211201101-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O060120180240300SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 1.00, N = 3254252245141-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-Gaussian-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O04080120160200SE +/- 1.20, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 3171170164103-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color Space-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O060120180240300SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3266265260152-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Xsbench

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06-O0-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx5121000K2000K3000K4000K5000KSE +/- 440.03, N = 3SE +/- 834.04, N = 3SE +/- 3905.30, N = 3SE +/- 8949.05, N = 345090474508790450311145002751. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2-O0-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake8M16M24M32M40MSE +/- 122855.06, N = 3SE +/- 212845.04, N = 3SE +/- 35381.50, N = 3SE +/- 379820.56, N = 339431843393545483845523738166612-O0-O3 -march=x86-64-O3-O3 -march=skylake1. (CC) gcc options:

BYTE Unix Benchmark

Computational Test: Integer Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Integer Arithmetic-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O00.2250.450.6750.91.1251111-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options:

BYTE Unix Benchmark

Computational Test: Register Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Register Arithmetic-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O00.2250.450.6750.91.1251111-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options:

BYTE Unix Benchmark

Computational Test: Floating-Point Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Floating-Point Arithmetic-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O00.2250.450.6750.91.1251111-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options:

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression Throughput-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O04080120160200SE +/- 0.56, N = 3SE +/- 0.57, N = 3SE +/- 0.68, N = 3SE +/- 0.53, N = 3197194185151-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O03K6K9K12K15KSE +/- 37.99, N = 3SE +/- 135.22, N = 3SE +/- 3.84, N = 3SE +/- 91.03, N = 121522813574107013014-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O03K6K9K12K15KSE +/- 66.43, N = 3SE +/- 19.35, N = 3SE +/- 19.17, N = 3SE +/- 0.84, N = 3140571231890343424-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O03K6K9K12K15KSE +/- 15.21, N = 3SE +/- 43.44, N = 3SE +/- 93.95, N = 3SE +/- 19.32, N = 31511913854112773438-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O02K4K6K8K10KSE +/- 15.98, N = 3SE +/- 334.88, N = 12SE +/- 64.31, N = 3SE +/- 5.98, N = 3109171078674313335-O3 -march=skylake-O3-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O014002800420056007000SE +/- 6.12, N = 3SE +/- 35.50, N = 3SE +/- 43.25, N = 3SE +/- 3.87, N = 36744647558332171-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32-O3 -march=skylake-avx512-O3 -march=skylake-O04K8K12K16K20KSE +/- 304.84, N = 3SE +/- 246.89, N = 5SE +/- 6.16, N = 318242167733421-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64-O3 -march=skylake-O3 -march=skylake-avx512-O04K8K12K16K20KSE +/- 471.72, N = 12SE +/- 164.46, N = 3SE +/- 33.40, N = 320081198563250-O3 -march=skylake-O3-O01. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32-O3 -march=skylake-avx512-O3 -march=skylake-O010K20K30K40K50KSE +/- 873.80, N = 3SE +/- 1140.38, N = 12SE +/- 34.99, N = 346170386833146-O3-O3 -march=skylake1. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64-O3 -march=skylake-avx512-O3 -march=skylake-O09K18K27K36K45KSE +/- 576.52, N = 3SE +/- 537.92, N = 5SE +/- 21.24, N = 342467378213640-O3-O3 -march=skylake1. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=skylake-avx512-O3 -march=skylake-O04K8K12K16K20KSE +/- 79.22, N = 3SE +/- 196.68, N = 3SE +/- 23.64, N = 319440191323226-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O06001200180024003000SE +/- 2.76, N = 3SE +/- 1.00, N = 3SE +/- 0.86, N = 3SE +/- 0.71, N = 3275227512516703-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O02004006008001000SE +/- 0.55, N = 3SE +/- 0.57, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3971970947153-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O02004006008001000SE +/- 1.91, N = 3SE +/- 0.54, N = 3SE +/- 3.92, N = 3SE +/- 0.99, N = 3791783773337-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O07001400210028003500SE +/- 6.16, N = 3SE +/- 5.26, N = 3SE +/- 4.59, N = 3SE +/- 0.24, N = 3349534783416797-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O014002800420056007000SE +/- 0.29, N = 3SE +/- 10.08, N = 3SE +/- 1.44, N = 3SE +/- 0.24, N = 3630162845779984-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O05001000150020002500SE +/- 0.23, N = 3SE +/- 2.53, N = 3SE +/- 0.37, N = 3SE +/- 2.24, N = 32220221916621243-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O07001400210028003500SE +/- 4.94, N = 3SE +/- 3.42, N = 3SE +/- 4.79, N = 3SE +/- 0.30, N = 3331633153084468-march=skylake-march=x86-64-O01. (CC) gcc options: -O3 -mavx2

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test-O0-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake20K40K60K80K100KSE +/- 176.14, N = 3SE +/- 332.73, N = 3SE +/- 164.59, N = 3SE +/- 295.13, N = 3937709359293504934081. (CXX) g++ options: -pipe -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O010M20M30M40M50MSE +/- 382200.06, N = 3SE +/- 393717.83, N = 3SE +/- 180038.36, N = 3SE +/- 181348.45, N = 346318478462990144622016245894946-march=skylake-march=x86-64-O01. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O3 -march=skylake-avx512-O3 -march=x86-64-O3 -march=skylake-O090M180M270M360M450MSE +/- 402091.62, N = 3SE +/- 856766.90, N = 3SE +/- 134069.19, N = 3SE +/- 615391.75, N = 3437202778436477351436446118436365279-march=x86-64-march=skylake-O01. (CC) gcc options: -O3 -march=native -lm

Hierarchical INTegration

Test: DOUBLE

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: DOUBLE-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx512-O0200M400M600M800M1000MSE +/- 788605.63, N = 3SE +/- 286663.14, N = 3SE +/- 2720569.85, N = 3SE +/- 1623168.86, N = 3984996712981861431981700599981051255-march=skylake-march=x86-64-O01. (CC) gcc options: -O3 -march=native -lm

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Blowfish-O3 -march=skylake-avx512-O3 -march=x86-64-O3 -march=skylake-O06K12K18K24K30KSE +/- 41.46, N = 3SE +/- 8.67, N = 3SE +/- 9.82, N = 3SE +/- 1.45, N = 329031289662895056391. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

John The Ripper

Test: Traditional DES

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Traditional DES-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake-O020M40M60M80M100MSE +/- 134375.51, N = 3SE +/- 30123.82, N = 3SE +/- 73121.66, N = 3SE +/- 237363.29, N = 3100829667100746333100709667994440001. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: MD5-O3 -march=x86-64-O3 -march=skylake-O0-O3 -march=skylake-avx512200K400K600K800K1000KSE +/- 1490.19, N = 3SE +/- 1557.07, N = 3SE +/- 1807.15, N = 3SE +/- 15148.30, N = 49022179016128935208854491. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3-O3 -march=skylake-O0-O3 -march=skylake-avx512-O3 -march=x86-64130K260K390K520K650KSE +/- 10152.88, N = 12SE +/- 11752.07, N = 12SE +/- 12545.19, N = 12SE +/- 6642.32, N = 8589046569980567481565019-march=skylake-O0-march=x86-641. (CC) gcc options: -pthread -lpthread -O3 -march=native

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOP-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake-O0500K1000K1500K2000K2500KSE +/- 33940.57, N = 4SE +/- 25522.81, N = 12SE +/- 29907.21, N = 12SE +/- 23378.66, N = 323136442232979221841521966871. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADD-O3 -march=skylake-O3 -march=skylake-avx512-O0-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 16784.08, N = 3SE +/- 7995.97, N = 3SE +/- 3861.43, N = 3SE +/- 5516.38, N = 316450821640318162956716260541. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSH-O3 -march=x86-64-O3 -march=skylake-O0-O3 -march=skylake-avx512300K600K900K1200K1500KSE +/- 7767.29, N = 3SE +/- 4741.67, N = 3SE +/- 21522.01, N = 3SE +/- 27880.00, N = 314706711427923141708813997341. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3 -march=skylake-O0-O3 -march=skylake-avx512-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 27984.39, N = 3SE +/- 20382.13, N = 3SE +/- 22834.56, N = 10SE +/- 25102.63, N = 320928032070792206842420290111. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx512-O0300K600K900K1200K1500KSE +/- 21146.25, N = 3SE +/- 5359.30, N = 3SE +/- 23960.63, N = 3SE +/- 10490.67, N = 315343241514425151134015046611. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page Serving-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx512-O07K14K21K28K35KSE +/- 157.08, N = 3SE +/- 350.27, N = 3SE +/- 323.44, N = 3SE +/- 166.92, N = 334444343813397233311-march=skylake-march=x86-64-O01. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page Serving-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O05K10K15K20K25KSE +/- 78.51, N = 3SE +/- 158.92, N = 3SE +/- 154.99, N = 3SE +/- 86.03, N = 324212235052327022853-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -shared -fPIC -pthread

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O010002000300040005000SE +/- 0.99, N = 3SE +/- 4.89, N = 3SE +/- 3.42, N = 3SE +/- 6.21, N = 34695469346934578-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -pthread -m64 -lssl -lcrypto -ldl

NAS Parallel Benchmarks

Test / Class: BT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.A-O0-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-6410002000300040005000SE +/- 8.67, N = 3SE +/- 19.42, N = 3SE +/- 24.06, N = 3SE +/- 7.25, N = 34654464746414636-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.C-O3 -march=skylake-avx512-O3 -march=skylake-O0-O3 -march=x86-6430060090012001500SE +/- 3.88, N = 3SE +/- 16.10, N = 3SE +/- 23.08, N = 3SE +/- 16.44, N = 61198119211781153-march=skylake-O0-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: FT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.A-O3 -march=skylake-avx512-O3 -march=x86-64-O3 -march=skylake-O014002800420056007000SE +/- 7.63, N = 3SE +/- 14.83, N = 3SE +/- 67.92, N = 12SE +/- 137.64, N = 116509645863886333-march=x86-64-march=skylake-O01. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.B-O3 -march=x86-64-O3 -march=skylake-O0-O3 -march=skylake-avx51215003000450060007500SE +/- 10.07, N = 3SE +/- 6.32, N = 3SE +/- 1.50, N = 3SE +/- 9.38, N = 37051704970427037-march=x86-64-march=skylake-O01. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.A-O3 -march=skylake-avx512-O3 -march=skylake-O0-O3 -march=x86-648K16K24K32K40KSE +/- 50.05, N = 3SE +/- 31.75, N = 3SE +/- 9.20, N = 3SE +/- 49.38, N = 336190360773606835984-march=skylake-O0-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.C-O0-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx5127K14K21K28K35KSE +/- 81.83, N = 3SE +/- 71.59, N = 3SE +/- 29.53, N = 3SE +/- 58.23, N = 333583335513353533175-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: SP.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.A-O3 -march=skylake-avx512-O3 -march=x86-64-O3 -march=skylake-O09001800270036004500SE +/- 3.26, N = 3SE +/- 10.43, N = 3SE +/- 30.65, N = 3SE +/- 26.23, N = 34147413041104089-march=x86-64-march=skylake-O01. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O0100K200K300K400K500KSE +/- 528.95, N = 3SE +/- 671.98, N = 3SE +/- 1546.17, N = 3SE +/- 294.25, N = 3460061459370457955301052-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O0-O3 -march=skylake-avx512-O3 -march=x86-64-O3 -march=skylake3K6K9K12K15KSE +/- 142.41, N = 3SE +/- 71.51, N = 3SE +/- 45.11, N = 3SE +/- 103.82, N = 312257122001219812143-O0-O3-O3 -march=x86-64-O3 -march=skylake1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250-O0-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake15K30K45K60K75KSE +/- 1555.32, N = 12SE +/- 710.52, N = 3SE +/- 3682.76, N = 9SE +/- 523.36, N = 368741425824197337845-O0-O3 -march=x86-64-O3-O3 -march=skylake1. (CC) gcc options: -lpthread -ldl -lssl -lcrypto

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch Time-O3 -march=skylake-avx512-O3 -march=skylake-O0-O3 -march=x86-6490180270360450SE +/- 1.33, N = 3SE +/- 1.33, N = 3SE +/- 0.67, N = 3397399400403-O3-O3 -march=skylake-O0-O3 -march=x86-641. (CC) gcc options:

Multichase Pointer Chaser

Test: 4MB Array, 64 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 4MB Array, 64 Byte Stride-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64246810SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 38.278.328.338.341. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride-O0-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-641428425670SE +/- 0.23, N = 3SE +/- 0.15, N = 3SE +/- 0.41, N = 3SE +/- 0.14, N = 363.9064.1164.1664.451. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 256MB Array, 256 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 256MB Array, 256 Byte Stride-O3 -march=skylake-avx512-O3 -march=x86-64-O0-O3 -march=skylake1326395265SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.07, N = 359.5759.7959.9760.001. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 2 Threads

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride, 2 Threads-O3 -march=skylake-O0-O3 -march=skylake-avx512-O3 -march=x86-641530456075SE +/- 0.38, N = 3SE +/- 0.22, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 366.3166.7667.0667.191. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 4 Threads

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride, 4 Threads-O3 -march=skylake-O3 -march=skylake-avx512-O0-O3 -march=x86-641632486480SE +/- 0.22, N = 3SE +/- 0.24, N = 3SE +/- 0.06, N = 3SE +/- 0.18, N = 370.3670.4770.5470.551. (CC) gcc options: -O2 -static -pthread -lrt

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O01224364860SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 349.8349.9650.0952.48-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -pthread

t-test1

Threads: 2

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 2-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O048121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 317.0717.0917.1718.11-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -pthread

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBM-O3 -march=x86-64-O3 -march=skylake-avx512-O0-O3 -march=skylake1632486480SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.19, N = 372.9873.0073.0173.271. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP-O3 -march=skylake-O3 -march=x86-64-O0-O3 -march=skylake-avx5120.54231.08461.62692.16922.7115SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.382.392.402.411. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil-O3 -march=x86-64-O0-O3 -march=skylake-avx512-O3 -march=skylake246810SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.916.936.946.971. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI Gridding-O3 -march=skylake-avx512-O3 -march=x86-64-O0-O3 -march=skylake306090120150SE +/- 0.18, N = 3SE +/- 1.03, N = 3SE +/- 0.41, N = 3SE +/- 1.16, N = 31541551561571. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics-O0-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx5120.54451.0891.63352.1782.7225SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32.412.412.422.421. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMD-O3 -march=skylake-O3 -march=skylake-avx512-O0-O3 -march=x86-64612182430SE +/- 0.03, N = 3SE +/- 0.30, N = 3SE +/- 0.22, N = 3SE +/- 0.28, N = 325.7425.9426.1926.311. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD Solver-O3 -march=skylake-avx512-O0-O3 -march=skylake-O3 -march=x86-643691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 312.3112.3212.3312.381. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP Streamcluster-O3 -march=skylake-O0-O3 -march=skylake-avx512-O3 -march=x86-6448121620SE +/- 0.04, N = 3SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 314.0714.0914.1114.161. (CXX) g++ options: -O2 -lOpenCL

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O0-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx51248121620SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 39.3014.1614.4814.60-O0-O3 -march=skylake-O3 -march=x86-64-O31. (CC) gcc options: -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence Alignment-O3 -march=skylake-O3 -march=skylake-avx512-O0-O3 -march=x86-640.5581.1161.6742.2322.79SE +/- 0.03, N = 12SE +/- 0.04, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 32.382.422.472.481. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To Compile-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 313.2423.7423.9724.05

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 8.2Time To Compile-O0-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx5122004006008001000886952960961

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O0-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx512612182430SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 36.6727.1127.2227.27

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O0-O3 -march=skylake-O3 -march=x86-64-O3 -march=skylake-avx5121428425670SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 316.6563.2663.2963.57-O0-O3 -march=skylake-O3 -march=x86-64-O31. (CC) gcc options: -pedantic -ldl -lz -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O0306090120150SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 331.2333.9442.43127.42-march=skylake-march=x86-64-O01. (CC) gcc options: -lm -lpthread -O3

Parallel BZIP2 Compression

256MB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.12256MB File Compression-O3 -march=skylake-O3 -march=x86-64-O0-O3 -march=skylake-avx5120.54231.08461.62692.16922.7115SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 32.362.392.402.411. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O01632486480SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 334.9234.9237.6872.37-march=skylake-march=x86-64-O01. (CC) gcc options: -lm -O3

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=skylake-O0-O3 -march=x86-64-O3 -march=skylake-avx5120.58731.17461.76192.34922.9365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.272.332.342.61-march=skylake-O0-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O00.9271.8542.7813.7084.635SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 33.823.844.104.12-march=skylake-march=x86-64-O01. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O01.02152.0433.06454.0865.1075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.064.074.534.54-march=skylake-march=x86-64-O01. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O00.90451.8092.71353.6184.5225SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.823.854.014.02-march=skylake-march=x86-64-O01. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O00.57831.15661.73492.31322.8915SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 32.312.322.562.57-march=skylake-march=x86-64-O01. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O3 -march=skylake-O3 -march=skylake-avx512-O0-O3 -march=x86-640.20030.40060.60090.80121.0015SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.840.840.890.89-march=skylake-O0-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O3 -march=skylake-O3 -march=skylake-avx512-O0-O3 -march=x86-640.23630.47260.70890.94521.1815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.001.001.051.05-march=skylake-O0-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O0306090120150SE +/- 0.29, N = 3SE +/- 0.23, N = 3SE +/- 0.21, N = 3SE +/- 0.04, N = 383.9584.9585.42119.12-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -pthread -fvisibility=hidden

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O3 -march=x86-64-O3 -march=skylake-avx512-O3 -march=skylake-O0510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.14, N = 3SE +/- 0.00, N = 310.3910.4910.6619.58-O3 -march=x86-64-O3-O3 -march=skylake-O01. (CC) gcc options: -pthread -lz -llzma

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 4K-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O01122334455SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 326.3926.7526.9549.47-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 1080p-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O0612182430SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 313.6513.8213.8923.72-O3 -march=skylake-O3-O3 -march=x86-64-O01. (CC) gcc options: -pthread

dcraw

RAW To PPM Image Conversion

OpenBenchmarking.orgSeconds, Fewer Is BetterdcrawRAW To PPM Image Conversion-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O0918273645SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 310.7910.8311.5639.04-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CC) gcc options: -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=skylake-avx512-O3 -march=skylake-O3 -march=x86-64-O01326395265SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.03, N = 59.029.1010.0658.09-O3-O3 -march=skylake-O3 -march=x86-64-O01. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx512-O0612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.829.369.4127.45-O3 -march=x86-64-O3 -march=skylake-O3-O01. (CC) gcc options: -lm

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solve-O3 -march=skylake-avx512-O0-O3 -march=skylake-O3 -march=x86-641122334455SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 348.3748.3848.3848.40-O3-O0-O3 -march=skylake-O3 -march=x86-641. (CXX) g++ options: -fopenmp -O2 -march=native

Mencoder

AVI To LAVC

OpenBenchmarking.orgSeconds, Fewer Is BetterMencoder 1.3.0AVI To LAVC-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64-O0714212835SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.14, N = 316.0216.2316.2932.241. (CC) gcc options: -fpie -pie -lrt -lpng -lz -ljpeg -lasound -ldl -lpthread -lmad -lvorbisenc -lvorbis -logg -rdynamic -lm

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Time-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx512-O0246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 38.408.408.408.41-march=x86-64-march=skylake-O01. (CC) gcc options: -static -fopenmp -O3 -march=native


Phoronix Test Suite v10.8.5