GCC 9 Skylake-AVX512 Tests

Intel Core i9-7980XE testing with a ASUS PRIME X299-A (1602 BIOS) and NVIDIA NV120 12GB on Ubuntu 18.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1902216-PTS-GCC9SKYL42&grs&sro.

GCC 9 Skylake-AVX512 TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O0-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx512Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)ASUS PRIME X299-A (1602 BIOS)Intel Sky Lake-E DMI3 Registers16384MBSamsung SSD 970 EVO 500GBNVIDIA NV120 12GBRealtek ALC1220ASUS PB278Intel I219-VUbuntu 18.104.18.0-15-generic (x86_64)GNOME Shell 3.30.1X Server 1.20.1modesetting 1.20.14.3 Mesa 18.2.2GCC 9.0.1 20190217ext42560x1440OpenBenchmarking.orgEnvironment Details- -O0: CXXFLAGS=-O0 CFLAGS=-O0- -O3 -march=x86-64: CXXFLAGS=-O3-march=x86-64 CFLAGS=-O3-march=x86-64- -O3 -march=skylake: CXXFLAGS=-O3-march=skylake CFLAGS=-O3-march=skylake- -O3 -march=skylake-avx512: CXXFLAGS=-O3-march=skylake-avx512 CFLAGS=-O3-march=skylake-avx512Compiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: intel_pstate powersavePython Details- Python 2.7.15+ + Python 3.6.7Security Details- KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW STIBP + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable

GCC 9 Skylake-AVX512 Testshimeno: Poisson Pressure Solverencode-flac: WAV To FLACscimark2: Dense LU Matrix Factorizationscimark2: Monte Carlofftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32john-the-ripper: Blowfishfftw: Stock - 2D FFT Size 32scimark2: Sparse Matrix Multiplyfftw: Stock - 1D FFT Size 64build-imagemagick: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelscimark2: Compositebuild-php: Time To Compiledcraw: RAW To PPM Image Conversionencode-mp3: WAV To MP3fftw: Stock - 2D FFT Size 4096sysbench: Memorygraphics-magick: Sharpenscimark2: Fast Fourier Transformgraphics-magick: Enhancedaobench: 2048 x 2048 - Total Timemencoder: AVI To LAVCfftw: Float + SSE - 2D FFT Size 64compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19dav1d: Summer Nature 4Kbuild-apache: Time To Compilegraphics-magick: Resizingscimark2: Jacobi Successive Over-Relaxationgraphics-magick: Swirlgraphics-magick: HWB Color Spacedav1d: Summer Nature 1080pgraphics-magick: Rotategraphics-magick: Noise-Gaussianhmmer: Pfam Database Searchpgbench: Buffer Test - Normal Load - Read Onlycompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9x264: H.264 Video Encodingsysbench: CPUtjbench: Decompression Throughputbullet: Raytestsaom-av1: AV1 Video Encodingbullet: 1000 Stackvpxenc: vpxenc VP9 1080p Video Encodebullet: 136 Ragdollsbuild-gcc: Time To Compilebullet: 3000 Fallt-test1: 2bullet: Prim Trimeshapache: Static Web Page Servingredis: LPOPt-test1: 1bullet: 1000 Convexredis: LPUSHbullet: Convex Trimeshmafft: Multiple Sequence Alignmentnpb: EP.Cnginx: Static Web Page Servingbyte: Dhrystone 2redis: GETsvt-vp9: 1080p 8-bit YUV To VP9 Video Encodeopenssl: RSA 4096-bit Performancerodinia: OpenMP LavaMDcompress-pbzip2: 256MB File Compressionredis: SETparboil: OpenMP MRI Griddingjohn-the-ripper: MD5svt-av1: 1080p 8-bit YUV To AV1 Video Encodectx-clock: Context Switch Timenpb: SP.Ajohn-the-ripper: Traditional DESmultichase: 1GB Array, 256 Byte Stride, 2 Threadssvt-hevc: 1080p 8-bit YUV To HEVC Video Encodex265: H.265 1080p Video Encodingparboil: OpenMP CUTCPnpb: LU.Credis: SADDpgbench: Buffer Test - Normal Load - Read Writestockfish: Total Timeparboil: OpenMP Stencilmultichase: 1GB Array, 256 Byte Stridemultichase: 4MB Array, 64 Byte Stridemultichase: 256MB Array, 256 Byte Striderodinia: OpenMP Streamclusternpb: LU.Arodinia: OpenMP CFD Solvercloverleaf: Lagrangian-Eulerian Hydrodynamicshint: DOUBLEparboil: OpenMP LBMnpb: BT.Acompress-7zip: Compress Speed Testmultichase: 1GB Array, 256 Byte Stride, 4 Threadsnpb: FT.Bxsbench: hint: FLOATn-queens: Elapsed Timem-queens: Time To Solvebyte: Floating-Point Arithmeticbyte: Register Arithmeticbyte: Integer Arithmeticapache-siege: 250ebizzy: fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 32npb: FT.A-O0-O3 -march=x86-64-O3 -march=skylake-O3 -march=skylake-avx51246858.09984153322634215639343879734246.67127.4270316.6539.0427.45217171524038033710172.3732.24364019.5849.4713.24141124313015223.721511039.30301052119.1287.34253341512.330.224.5422.972.578864.1218.110.8922853219668752.484.0214170881.052.47117833311394318432070792306457826.192.4015046611568935208.5340040899944400066.7630738.812.4033583162956712257458949466.9363.908.2759.9714.093606812.322.4198105125573.0146549377070.54704245090474363652798.4148.381116874156998031463250333530146333308410.06577994728966112773416903427.1142.43251663.2911.568.8258331944359217677320137.6816.2910.3926.9524.05245166222026013.8925216414.4845795585.42122350111852.340.224.5322.992.569604.1017.170.8923270231364450.094.0114706711.052.48115334381393545482029011306469326.312.3915343241559022178.52403413010082966767.1931139.102.3933535162605412198462201626.9164.458.3459.7914.163598412.382.4298186143172.9846369359270.55705145087904364773518.4048.4011142582565019743110701645833159.1062849701913216773289501385434951231827.2231.23275263.2610.839.3664751932719918979121734.9216.023782110.6626.3923.74254221922926613.6525617114.1646006184.95123325961942.270.244.0723.142.329523.8417.090.8423505221841549.833.8214279231.002.38119234444381666122092803315469525.742.3615144251579016128.38399411010070966766.3130939.302.3833551164508212143462990146.9764.168.3260.0014.073607712.332.4198499671273.2746419340870.36704945031114364461188.4048.381113784558904638683200811091713574638833169.0263019711944018242290311511934781405727.2733.94275163.5710.799.4167441935323418978321134.9216.234246710.4926.7523.97252222022626513.8225717014.6045937083.95123338691972.610.254.0625.602.319613.8217.070.8424212223297949.963.8513997341.002.42119833972384552372068424313469325.942.4115113401548854498.48397414710074633367.0630939.292.4133175164031812200463184786.9464.118.3359.5714.113619012.312.4298170059973.0046479350470.47703745002754372027788.4048.3711141973567481461701985610786152286509OpenBenchmarking.org

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-647001400210028003500SE +/- 0.30, N = 3SE +/- 3.42, N = 3SE +/- 4.94, N = 3SE +/- 4.79, N = 3468331533163084-O0-march=skylake-march=x86-641. (CC) gcc options: -O3 -mavx2

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641326395265SE +/- 0.03, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 558.099.109.0210.06-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CXX) g++ options: -fvisibility=hidden -logg -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6414002800420056007000SE +/- 0.24, N = 3SE +/- 10.08, N = 3SE +/- 0.29, N = 3SE +/- 1.44, N = 3984628463015779-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-642004006008001000SE +/- 0.13, N = 3SE +/- 0.57, N = 3SE +/- 0.55, N = 3SE +/- 0.23, N = 3153970971947-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O0-O3 -march=skylake-O3 -march=skylake-avx5124K8K12K16K20KSE +/- 23.64, N = 3SE +/- 196.68, N = 3SE +/- 79.22, N = 332261913219440-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32-O0-O3 -march=skylake-O3 -march=skylake-avx5124K8K12K16K20KSE +/- 6.16, N = 3SE +/- 246.89, N = 5SE +/- 304.84, N = 334211677318242-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Blowfish-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-646K12K18K24K30KSE +/- 1.45, N = 3SE +/- 9.82, N = 3SE +/- 41.46, N = 3SE +/- 8.67, N = 356392895029031289661. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-643K6K9K12K15KSE +/- 19.32, N = 3SE +/- 43.44, N = 3SE +/- 15.21, N = 3SE +/- 93.95, N = 33438138541511911277-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-647001400210028003500SE +/- 0.24, N = 3SE +/- 6.16, N = 3SE +/- 5.26, N = 3SE +/- 4.59, N = 3797349534783416-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-643K6K9K12K15KSE +/- 0.84, N = 3SE +/- 19.35, N = 3SE +/- 66.43, N = 3SE +/- 19.17, N = 3342412318140579034-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 36.6727.2227.2727.11

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64306090120150SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3127.4231.2333.9442.43-O0-march=skylake-march=x86-641. (CC) gcc options: -lm -lpthread -O3

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-646001200180024003000SE +/- 0.71, N = 3SE +/- 2.76, N = 3SE +/- 1.00, N = 3SE +/- 0.86, N = 3703275227512516-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641428425670SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 316.6563.2663.5763.29-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pedantic -ldl -lz -lm

dcraw

RAW To PPM Image Conversion

OpenBenchmarking.orgSeconds, Fewer Is BetterdcrawRAW To PPM Image Conversion-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64918273645SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 339.0410.8310.7911.56-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 327.459.369.418.82-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6414002800420056007000SE +/- 3.87, N = 3SE +/- 35.50, N = 3SE +/- 6.12, N = 3SE +/- 43.25, N = 32171647567445833-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

Sysbench

Test: Memory

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: Memory-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-644M8M12M16M20MSE +/- 11654.91, N = 3SE +/- 50006.95, N = 3SE +/- 65309.84, N = 3SE +/- 25107.25, N = 37152403193271991935323419443592-O0 -march=core2-march=skylake-march=x86-641. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -rdynamic -ldl -laio -lm

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-644080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 380189189176-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-642004006008001000SE +/- 0.99, N = 3SE +/- 1.91, N = 3SE +/- 0.54, N = 3SE +/- 3.92, N = 3337791783773-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Enhanced-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6450100150200250101217211201-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641632486480SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 372.3734.9234.9237.68-O0-march=skylake-march=x86-641. (CC) gcc options: -lm -O3

Mencoder

AVI To LAVC

OpenBenchmarking.orgSeconds, Fewer Is BetterMencoder 1.3.0AVI To LAVC-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64714212835SE +/- 0.14, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 332.2416.0216.2316.291. (CC) gcc options: -fpie -pie -lrt -lpng -lz -ljpeg -lasound -ldl -lpthread -lmad -lvorbisenc -lvorbis -logg -rdynamic -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64-O0-O3 -march=skylake-O3 -march=skylake-avx5129K18K27K36K45KSE +/- 21.24, N = 3SE +/- 537.92, N = 5SE +/- 576.52, N = 336403782142467-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64510152025SE +/- 0.00, N = 3SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 319.5810.6610.4910.39-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread -lz -llzma

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 4K-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641122334455SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 349.4726.3926.7526.95-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To Compile-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 313.2423.7423.9724.05

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6460120180240300SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 1.00, N = 3141254252245-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-645001000150020002500SE +/- 2.24, N = 3SE +/- 2.53, N = 3SE +/- 0.23, N = 3SE +/- 0.37, N = 31243221922201662-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Swirl-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6450100150200250SE +/- 0.33, N = 3130229226220-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color Space-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6460120180240300SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3152266265260-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 1080p-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 323.7213.6513.8213.89-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6460120180240300SE +/- 1.53, N = 3SE +/- 0.33, N = 3151256257252-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-Gaussian-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-644080120160200SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 1.00, N = 3103171170164-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6448121620SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 39.3014.1614.6014.48-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread -lhmmer -lsquid -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64100K200K300K400K500KSE +/- 294.25, N = 3SE +/- 528.95, N = 3SE +/- 671.98, N = 3SE +/- 1546.17, N = 3301052460061459370457955-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64306090120150SE +/- 0.04, N = 3SE +/- 0.23, N = 3SE +/- 0.29, N = 3SE +/- 0.21, N = 3119.1284.9583.9585.42-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread -fvisibility=hidden

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64306090120150SE +/- 1.68, N = 3SE +/- 1.68, N = 12SE +/- 1.22, N = 12SE +/- 1.88, N = 587.34123.00123.00122.00-O0-march=skylake-march=x86-641. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPU-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-647K14K21K28K35KSE +/- 6.46, N = 3SE +/- 1.99, N = 3SE +/- 6.42, N = 3SE +/- 6.41, N = 325334325963386935011-O0 -march=core2-march=skylake-march=x86-641. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -rdynamic -ldl -laio -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression Throughput-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-644080120160200SE +/- 0.53, N = 3SE +/- 0.57, N = 3SE +/- 0.56, N = 3SE +/- 0.68, N = 3151194197185-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lm

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.58731.17461.76192.34922.9365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.332.272.612.34-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

AOM AV1

AV1 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-02-11AV1 Video Encoding-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.05630.11260.16890.22520.2815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.220.240.250.22-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641.02152.0433.06454.0865.1075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.544.074.064.53-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.38, N = 3SE +/- 0.03, N = 322.9723.1425.6022.99-O0-march=skylake-march=x86-641. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.57831.15661.73492.31322.8915SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 32.572.322.312.56-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 8.2Time To Compile-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-642004006008001000886952961960

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.9271.8542.7813.7084.635SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 34.123.843.824.10-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

t-test1

Threads: 2

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 2-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6448121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 318.1117.0917.0717.17-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.20030.40060.60090.80121.0015SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.890.840.840.89-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page Serving-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-645K10K15K20K25KSE +/- 86.03, N = 3SE +/- 158.92, N = 3SE +/- 78.51, N = 3SE +/- 154.99, N = 322853235052421223270-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -shared -fPIC -pthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOP-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64500K1000K1500K2000K2500KSE +/- 23378.66, N = 3SE +/- 29907.21, N = 12SE +/- 25522.81, N = 12SE +/- 33940.57, N = 421966872218415223297923136441. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641224364860SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 352.4849.8349.9650.09-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.90451.8092.71353.6184.5225SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.023.823.854.01-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSH-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64300K600K900K1200K1500KSE +/- 21522.01, N = 3SE +/- 4741.67, N = 3SE +/- 27880.00, N = 3SE +/- 7767.29, N = 314170881427923139973414706711. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.23630.47260.70890.94521.1815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.051.001.001.05-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence Alignment-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.5581.1161.6742.2322.79SE +/- 0.01, N = 3SE +/- 0.03, N = 12SE +/- 0.04, N = 12SE +/- 0.01, N = 32.472.382.422.481. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.C-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6430060090012001500SE +/- 23.08, N = 3SE +/- 16.10, N = 3SE +/- 3.88, N = 3SE +/- 16.44, N = 61178119211981153-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page Serving-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-647K14K21K28K35KSE +/- 166.92, N = 3SE +/- 157.08, N = 3SE +/- 323.44, N = 3SE +/- 350.27, N = 333311344443397234381-O0-march=skylake-march=x86-641. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-648M16M24M32M40MSE +/- 122855.06, N = 3SE +/- 379820.56, N = 3SE +/- 35381.50, N = 3SE +/- 212845.04, N = 339431843381666123845523739354548-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options:

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 20382.13, N = 3SE +/- 27984.39, N = 3SE +/- 22834.56, N = 10SE +/- 25102.63, N = 320707922092803206842420290111. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video Encode-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6470140210280350SE +/- 5.82, N = 3SE +/- 5.38, N = 3SE +/- 5.81, N = 3SE +/- 5.43, N = 3306315313306-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6410002000300040005000SE +/- 6.21, N = 3SE +/- 0.99, N = 3SE +/- 4.89, N = 3SE +/- 3.42, N = 34578469546934693-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -pthread -m64 -lssl -lcrypto -ldl

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMD-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64612182430SE +/- 0.22, N = 3SE +/- 0.03, N = 3SE +/- 0.30, N = 3SE +/- 0.28, N = 326.1925.7425.9426.311. (CXX) g++ options: -O2 -lOpenCL

Parallel BZIP2 Compression

256MB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.12256MB File Compression-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.54231.08461.62692.16922.7115SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 32.402.362.412.391. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64300K600K900K1200K1500KSE +/- 10490.67, N = 3SE +/- 5359.30, N = 3SE +/- 23960.63, N = 3SE +/- 21146.25, N = 315046611514425151134015343241. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI Gridding-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64306090120150SE +/- 0.41, N = 3SE +/- 1.16, N = 3SE +/- 0.18, N = 3SE +/- 1.03, N = 31561571541551. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: MD5-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64200K400K600K800K1000KSE +/- 1807.15, N = 3SE +/- 1557.07, N = 3SE +/- 15148.30, N = 4SE +/- 1490.19, N = 38935209016128854499022171. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2019-02-151080p 8-bit YUV To AV1 Video Encode-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64246810SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 38.538.388.488.521. (CC) gcc options: -mavx -fPIE -fPIC -O2 -pie -lpthread -lm

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch Time-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6490180270360450SE +/- 1.33, N = 3SE +/- 1.33, N = 3SE +/- 0.67, N = 3400399397403-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options:

NAS Parallel Benchmarks

Test / Class: SP.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.A-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-649001800270036004500SE +/- 26.23, N = 3SE +/- 30.65, N = 3SE +/- 3.26, N = 3SE +/- 10.43, N = 34089411041474130-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

John The Ripper

Test: Traditional DES

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Traditional DES-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6420M40M60M80M100MSE +/- 237363.29, N = 3SE +/- 73121.66, N = 3SE +/- 30123.82, N = 3SE +/- 134375.51, N = 3994440001007096671007463331008296671. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 2 Threads

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride, 2 Threads-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641530456075SE +/- 0.22, N = 3SE +/- 0.38, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 366.7666.3167.0667.191. (CC) gcc options: -O2 -static -pthread -lrt

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video Encode-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6470140210280350SE +/- 3.49, N = 3SE +/- 4.28, N = 3SE +/- 2.15, N = 3SE +/- 4.38, N = 3307309309311-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64918273645SE +/- 0.13, N = 3SE +/- 0.27, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 338.8139.3039.2939.10-O0-march=skylake-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.54231.08461.62692.16922.7115SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32.402.382.412.391. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.C-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-647K14K21K28K35KSE +/- 81.83, N = 3SE +/- 71.59, N = 3SE +/- 58.23, N = 3SE +/- 29.53, N = 333583335513317533535-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADD-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 3861.43, N = 3SE +/- 16784.08, N = 3SE +/- 7995.97, N = 3SE +/- 5516.38, N = 316295671645082164031816260541. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-643K6K9K12K15KSE +/- 142.41, N = 3SE +/- 103.82, N = 3SE +/- 71.51, N = 3SE +/- 45.11, N = 312257121431220012198-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6410M20M30M40M50MSE +/- 181348.45, N = 3SE +/- 393717.83, N = 3SE +/- 382200.06, N = 3SE +/- 180038.36, N = 345894946462990144631847846220162-O0-march=skylake-march=x86-641. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64246810SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 36.936.976.946.911. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641428425670SE +/- 0.23, N = 3SE +/- 0.41, N = 3SE +/- 0.15, N = 3SE +/- 0.14, N = 363.9064.1664.1164.451. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 4MB Array, 64 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 4MB Array, 64 Byte Stride-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64246810SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 38.278.328.338.341. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 256MB Array, 256 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 256MB Array, 256 Byte Stride-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641326395265SE +/- 0.11, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.14, N = 359.9760.0059.5759.791. (CC) gcc options: -O2 -static -pthread -lrt

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP Streamcluster-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6448121620SE +/- 0.16, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 314.0914.0714.1114.161. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.A-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-648K16K24K32K40KSE +/- 9.20, N = 3SE +/- 31.75, N = 3SE +/- 50.05, N = 3SE +/- 49.38, N = 336068360773619035984-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD Solver-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-643691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 312.3212.3312.3112.381. (CXX) g++ options: -O2 -lOpenCL

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.54451.0891.63352.1782.7225SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32.412.412.422.421. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Hierarchical INTegration

Test: DOUBLE

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: DOUBLE-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64200M400M600M800M1000MSE +/- 1623168.86, N = 3SE +/- 788605.63, N = 3SE +/- 2720569.85, N = 3SE +/- 286663.14, N = 3981051255984996712981700599981861431-O0-march=skylake-march=x86-641. (CC) gcc options: -O3 -march=native -lm

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBM-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641632486480SE +/- 0.03, N = 3SE +/- 0.19, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 373.0173.2773.0072.981. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

Test / Class: BT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.A-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6410002000300040005000SE +/- 8.67, N = 3SE +/- 24.06, N = 3SE +/- 19.42, N = 3SE +/- 7.25, N = 34654464146474636-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6420K40K60K80K100KSE +/- 176.14, N = 3SE +/- 295.13, N = 3SE +/- 164.59, N = 3SE +/- 332.73, N = 3937709340893504935921. (CXX) g++ options: -pipe -lpthread

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 4 Threads

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride, 4 Threads-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641632486480SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.24, N = 3SE +/- 0.18, N = 370.5470.3670.4770.551. (CC) gcc options: -O2 -static -pthread -lrt

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.B-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6415003000450060007500SE +/- 1.50, N = 3SE +/- 6.32, N = 3SE +/- 9.38, N = 3SE +/- 10.07, N = 37042704970377051-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xsbench

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641000K2000K3000K4000K5000KSE +/- 440.03, N = 3SE +/- 3905.30, N = 3SE +/- 8949.05, N = 3SE +/- 834.04, N = 345090474503111450027545087901. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6490M180M270M360M450MSE +/- 615391.75, N = 3SE +/- 134069.19, N = 3SE +/- 402091.62, N = 3SE +/- 856766.90, N = 3436365279436446118437202778436477351-O0-march=skylake-march=x86-641. (CC) gcc options: -O3 -march=native -lm

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Time-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 38.418.408.408.40-O0-march=skylake-march=x86-641. (CC) gcc options: -static -fopenmp -O3 -march=native

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solve-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-641122334455SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 348.3848.3848.3748.40-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CXX) g++ options: -fopenmp -O2 -march=native

BYTE Unix Benchmark

Computational Test: Floating-Point Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Floating-Point Arithmetic-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.2250.450.6750.91.1251111-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options:

BYTE Unix Benchmark

Computational Test: Register Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Register Arithmetic-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.2250.450.6750.91.1251111-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options:

BYTE Unix Benchmark

Computational Test: Integer Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Integer Arithmetic-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-640.2250.450.6750.91.1251111-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options:

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6415K30K45K60K75KSE +/- 1555.32, N = 12SE +/- 523.36, N = 3SE +/- 3682.76, N = 9SE +/- 710.52, N = 368741378454197342582-O0-O3 -march=skylake-O3-O3 -march=x86-641. (CC) gcc options: -lpthread -ldl -lssl -lcrypto

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-64130K260K390K520K650KSE +/- 11752.07, N = 12SE +/- 10152.88, N = 12SE +/- 12545.19, N = 12SE +/- 6642.32, N = 8569980589046567481565019-O0-march=skylake-march=x86-641. (CC) gcc options: -pthread -lpthread -O3 -march=native

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32-O0-O3 -march=skylake-O3 -march=skylake-avx51210K20K30K40K50KSE +/- 34.99, N = 3SE +/- 1140.38, N = 12SE +/- 873.80, N = 331463868346170-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64-O0-O3 -march=skylake-O3 -march=skylake-avx5124K8K12K16K20KSE +/- 33.40, N = 3SE +/- 471.72, N = 12SE +/- 164.46, N = 332502008119856-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-642K4K6K8K10KSE +/- 5.98, N = 3SE +/- 15.98, N = 3SE +/- 334.88, N = 12SE +/- 64.31, N = 3333510917107867431-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-643K6K9K12K15KSE +/- 91.03, N = 12SE +/- 135.22, N = 3SE +/- 37.99, N = 3SE +/- 3.84, N = 33014135741522810701-O0-O3 -march=skylake-O31. (CC) gcc options: -pthread -lm

NAS Parallel Benchmarks

Test / Class: FT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.A-O0-O3 -march=skylake-O3 -march=skylake-avx512-O3 -march=x86-6414002800420056007000SE +/- 137.64, N = 11SE +/- 67.92, N = 12SE +/- 7.63, N = 3SE +/- 14.83, N = 36333638865096458-O0-march=skylake-march=x86-641. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi


Phoronix Test Suite v10.8.5