GCC 9 Compiler Benchmarking vs. LLVM Clang

Intel Core i9-7980XE benchmarks of GCC 8 / GCC 9 versus LLVM Clang 7 and LLVM Clang 8 compilers on Ubuntu Linux. Benchmarks by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1811133-SK-GCC9COMPI38&grr&rdt.

Configurations tested: GCC 9.0.0 20181112, GCC 8.2.0, Clang 7.0.0, Clang 8.0.0 20181111

Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)
Motherboard: ASUS PRIME X299-A (1503 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16384MB
Disk: 240GB Force MP510
Graphics: NV120 12GB
Audio: Realtek ALC1220
Monitor: ASUS PB278
Network: Intel Connection
OS: Ubuntu 18.10
Kernel: 4.18.0-10-generic (x86_64)
Desktop: GNOME Shell 3.30.1
Display Server: X Server 1.20.1
Display Driver: modesetting 1.20.1
OpenGL: 4.3 Mesa 18.2.2
File-System: ext4
Screen Resolution: 2560x1440
Compiler (GCC 9.0.0 20181112 configuration): GCC 9.0.0 20181112
Compiler (GCC 8.2.0 configuration): GCC 8.2.0
Compiler (Clang 7.0.0 configuration): Clang 7.0.0-3
Compiler (Clang 8.0.0 20181111 configuration): Clang 8.0.0-svn346617-1~exp1+0~20181111195013.162~1.gbp8d271f

Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Compiler Details - GCC 9.0.0 20181112, GCC 8.2.0: --disable-multilib --enable-checking=release
Processor Details - Scaling Governor: intel_pstate powersave
Python Details - Python 2.7.15+ + Python 3.6.7
Security Details - KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable

Test | GCC 9.0.0 20181112 | GCC 8.2.0 | Clang 7.0.0 | Clang 8.0.0 20181111
pgbench: Buffer Test - Normal Load - Read Write | 21776 | 23830 | 17186 | 10012
parboil: OpenMP MRI Gridding | 147 | 146 | - | -
build-linux-kernel: Time To Compile | 44.18 | 44.26 | - | -
parboil: OpenMP LBM | 81.52 | 79.36 | - | -
pgbench: Buffer Test - Normal Load - Read Only | 500154 | 496730 | 504725 | 505631
stockfish: Total Time | 46908339 | 47827811 | 46179707 | 45725081
build-llvm: Time To Compile | 220 | 223 | 216 | 189
john-the-ripper: Traditional DES | 74356333 | 72707667 | 98537500 | 100083000
fftw: Float + SSE - 2D FFT Size 2048 | 20468 | 20014 | 20258 | 19871
hpcg | 1.22 | 1.25 | 1.33 | 1.34
c-ray: Total Time - 4K, 16 Rays Per Pixel | 33.75 | 33.70 | 66.76 | 67.06
compress-7zip: Compress Speed Test | 93437 | 93576 | - | -
m-queens: Time To Solve | 48.72 | 48.73 | 50.19 | 50.41
himeno: Poisson Pressure Solver | 3093 | 3089 | 2463 | 2455
npb: BT.A | 4557 | 4834 | - | -
ebizzy | 620245 | 607319 | 594997 | 588266
aobench: 2048 x 2048 - Total Time | 31.32 | 31.35 | 32.30 | 32.19
scimark2: Composite | 2613 | 2615 | 2584 | 2577
npb: SP.A | 3865 | 3542 | - | -
crafty: Elapsed Time | 8423397 | 8560641 | - | -
build-apache: Time To Compile | 22.26 | 22.05 | 21.13 | 19.70
john-the-ripper: MD5 | 675993 | 670435 | 799209 | 818976
john-the-ripper: Blowfish | 22173 | 22254 | 27611 | 25614
openssl: RSA 4096-bit Performance | 4673 | 4658 | 4547 | 4544
mcperf: Set | 72449 | 71776 | 73064 | 73009
xsbench | 4664809 | 4662395 | 5156109 | 5022107
mcperf: Get | 110495 | 108730 | 115868 | 115247
npb: FT.B | 7174 | 7220 | - | -
encode-mp3: WAV To MP3 | 9.74 | 10.13 | 12.16 | 11.80
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 | 10.63 | 10.61 | 10.27 | 10.34
hmmer: Pfam Database Search | 10.42 | 10.26 | 8.59 | 8.52
parboil: OpenMP Stencil | 7.19 | 7.13 | - | -
npb: EP.C | 1250 | 925 | - | -
fftw: Float + SSE - 1D FFT Size 2048 | 59299 | 57019 | 56274 | 54433
x264: H.264 Video Encoding | 128 | 128 | 125 | 125
tjbench: Decompression Throughput | 184 | 183 | 204 | 198
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 3.20 | 2.98 | - | -
parboil: OpenMP CUTCP | 2.44 | 2.46 | - | -
npb: FT.A | 6718 | 6756 | - | -
blake2 | 3.51 | 3.66 | 3.02 | 3.10
scimark2: Jacobi Successive Over-Relaxation | 2069 | 2074 | 1662 | 1661
scimark2: Dense LU Matrix Factorization | 6063 | 6071 | 6476 | 6449
scimark2: Sparse Matrix Multiply | 3295 | 3295 | 3300 | 3288
scimark2: Fast Fourier Transform | 732 | 727 | 763 | 768
scimark2: Monte Carlo | 906 | 906 | 717 | 717

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 10.3 - TPS, More Is Better
GCC 9.0.0 20181112: 21776 (SE +/- 1475.07, N = 9)
GCC 8.2.0: 23830 (SE +/- 402.82, N = 4)
Clang 7.0.0: 17186 (SE +/- 1724.11, N = 9)
Clang 8.0.0 20181111: 10012 (SE +/- 421.62, N = 1)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
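The reported means and standard errors (SE) can be read together when judging whether two compilers really differ on a test. The following is a minimal Python sketch, not part of the exported result: it takes the GCC 9.0.0 and GCC 8.2.0 Read Write figures above and expresses their gap both as a percentage and in units of the combined standard error. The function names are hypothetical, and the sketch assumes the runs are independent; treating a gap of less than about two combined standard errors as inconclusive is a common rule of thumb, not something the Phoronix Test Suite itself reports.

```python
import math


def percent_change(baseline, other):
    """Percentage change of `other` relative to `baseline`."""
    return (other - baseline) / baseline * 100.0


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Absolute gap between two means, in units of their combined SE.

    Assumes the two sets of runs are independent, so the standard
    errors combine in quadrature.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# pgbench Read Write figures taken from the result above (TPS, SE).
gcc9_tps, gcc9_se = 21776, 1475.07
gcc8_tps, gcc8_se = 23830, 402.82

change = percent_change(gcc9_tps, gcc8_tps)
ratio = difference_in_standard_errors(gcc9_tps, gcc9_se, gcc8_tps, gcc8_se)

print(f"GCC 8.2.0 vs GCC 9.0.0: {change:+.1f}% TPS")
print(f"Gap is {ratio:.2f} combined standard errors")
```

For these two configurations the gap is roughly 9% but stays under two combined standard errors, so under the stated assumptions the Read Write difference between GCC 8.2.0 and GCC 9.0.0 is best read as within run-to-run noise.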

Parboil

Test: OpenMP MRI Gridding

Parboil 2.5 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 147 (SE +/- 0.40, N = 3)
GCC 8.2.0: 146 (SE +/- 0.83, N = 3)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 4.18 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 44.18 (SE +/- 0.61, N = 6)
GCC 8.2.0: 44.26 (SE +/- 0.50, N = 9)

Parboil

Test: OpenMP LBM

Parboil 2.5 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 81.52 (SE +/- 0.27, N = 3)
GCC 8.2.0: 79.36 (SE +/- 0.33, N = 3)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.3 - TPS, More Is Better
GCC 9.0.0 20181112: 500154 (SE +/- 274.70, N = 3)
GCC 8.2.0: 496730 (SE +/- 210.08, N = 3)
Clang 7.0.0: 504725 (SE +/- 3717.80, N = 3)
Clang 8.0.0 20181111: 505631 (SE +/- 3943.25, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Stockfish

Total Time

Stockfish 9 - Nodes Per Second, More Is Better
GCC 9.0.0 20181112: 46908339 (SE +/- 469942.62, N = 3)
GCC 8.2.0: 47827811 (SE +/- 290155.29, N = 3)
Clang 7.0.0: 46179707 (SE +/- 358695.82, N = 3)
Clang 8.0.0 20181111: 45725081 (SE +/- 77465.59, N = 3)
(CXX) g++ options: -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 6.0.1 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 220
GCC 8.2.0: 223
Clang 7.0.0: 216
Clang 8.0.0 20181111: 189

John The Ripper

Test: Traditional DES

John The Ripper 1.8.0-jumbo-1 - Real C/S, More Is Better
GCC 9.0.0 20181112: 74356333 (SE +/- 132290.51, N = 3)
GCC 8.2.0: 72707667 (SE +/- 1115841.74, N = 3)
Clang 7.0.0: 98537500 (SE +/- 1095165.68, N = 12)
Clang 8.0.0 20181111: 100083000 (SE +/- 185734.58, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

FFTW 3.3.6 - Mflops, More Is Better
GCC 9.0.0 20181112: 20468 (SE +/- 88.29, N = 3)
GCC 8.2.0: 20014 (SE +/- 25.39, N = 3)
Clang 7.0.0: 20258 (SE +/- 260.52, N = 3)
Clang 8.0.0 20181111: 19871 (SE +/- 95.21, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.0 - GFLOP/s, More Is Better
GCC 9.0.0 20181112: 1.22 (SE +/- 0.01, N = 3)
GCC 8.2.0: 1.25 (SE +/- 0.01, N = 3)
Clang 7.0.0: 1.33 (SE +/- 0.01, N = 3)
Clang 8.0.0 20181111: 1.34 (SE +/- 0.01, N = 3)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 33.75 (SE +/- 0.01, N = 3)
GCC 8.2.0: 33.70 (SE +/- 0.01, N = 3)
Clang 7.0.0: 66.76 (SE +/- 0.01, N = 3)
Clang 8.0.0 20181111: 67.06 (SE +/- 0.01, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=native

7-Zip Compression

Compress Speed Test

7-Zip Compression 16.02 - MIPS, More Is Better
GCC 9.0.0 20181112: 93437 (SE +/- 384.58, N = 3)
GCC 8.2.0: 93576 (SE +/- 184.01, N = 3)
(CXX) g++ options: -pipe -lpthread

m-queens

Time To Solve

m-queens 1.2 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 48.72 (SE +/- 0.01, N = 3)
GCC 8.2.0: 48.73 (SE +/- 0.01, N = 3)
Clang 7.0.0: 50.19 (SE +/- 0.01, N = 3)
Clang 8.0.0 20181111: 50.41 (SE +/- 0.02, N = 3)
(CXX) g++ options: -fopenmp -O3 -march=native -O2

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 - MFLOPS, More Is Better
GCC 9.0.0 20181112: 3093 (SE +/- 3.44, N = 3)
GCC 8.2.0: 3089 (SE +/- 4.75, N = 3)
Clang 7.0.0: 2463 (SE +/- 2.40, N = 3)
Clang 8.0.0 20181111: 2455 (SE +/- 4.58, N = 3)
(CC) gcc options: -O3 -march=native -mavx2

NAS Parallel Benchmarks

Test / Class: BT.A

NAS Parallel Benchmarks 3.3 - Total Mop/s, More Is Better
GCC 9.0.0 20181112: 4557 (SE +/- 35.44, N = 3)
GCC 8.2.0: 4834 (SE +/- 31.51, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ebizzy

ebizzy 0.3 - Records/s, More Is Better
GCC 9.0.0 20181112: 620245 (SE +/- 7775.77, N = 3)
GCC 8.2.0: 607319 (SE +/- 5181.95, N = 3)
Clang 7.0.0: 594997 (SE +/- 8494.47, N = 12)
Clang 8.0.0 20181111: 588266 (SE +/- 10943.27, N = 3)
(CC) gcc options: -pthread -lpthread -O3 -march=native

AOBench

Size: 2048 x 2048 - Total Time

AOBench - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 31.32 (SE +/- 0.03, N = 3)
GCC 8.2.0: 31.35 (SE +/- 0.04, N = 3)
Clang 7.0.0: 32.30 (SE +/- 0.07, N = 3)
Clang 8.0.0 20181111: 32.19 (SE +/- 0.01, N = 3)
(CC) gcc options: -lm -O3 -march=native

SciMark

Computational Test: Composite

SciMark 2.0 - Mflops, More Is Better
GCC 9.0.0 20181112: 2613 (SE +/- 2.04, N = 3)
GCC 8.2.0: 2615 (SE +/- 1.63, N = 3)
Clang 7.0.0: 2584 (SE +/- 1.31, N = 3)
Clang 8.0.0 20181111: 2577 (SE +/- 5.03, N = 3)
(CC) gcc options: -O3 -march=native -lm

NAS Parallel Benchmarks

Test / Class: SP.A

NAS Parallel Benchmarks 3.3 - Total Mop/s, More Is Better
GCC 9.0.0 20181112: 3865 (SE +/- 28.57, N = 3)
GCC 8.2.0: 3542 (SE +/- 45.01, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Crafty

Elapsed Time

Crafty 25.2 - Nodes Per Second, More Is Better
GCC 9.0.0 20181112: 8423397 (SE +/- 15686.24, N = 3)
GCC 8.2.0: 8560641 (SE +/- 8409.72, N = 3)
(CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.7 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 22.26 (SE +/- 0.05, N = 3)
GCC 8.2.0: 22.05 (SE +/- 0.07, N = 3)
Clang 7.0.0: 21.13 (SE +/- 0.06, N = 3)
Clang 8.0.0 20181111: 19.70 (SE +/- 0.06, N = 3)

John The Ripper

Test: MD5

John The Ripper 1.8.0-jumbo-1 - Real C/S, More Is Better
GCC 9.0.0 20181112: 675993 (SE +/- 1299.84, N = 3)
GCC 8.2.0: 670435 (SE +/- 925.69, N = 3)
Clang 7.0.0: 799209 (SE +/- 11085.39, N = 3)
Clang 8.0.0 20181111: 818976 (SE +/- 768.00, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

John The Ripper 1.8.0-jumbo-1 - Real C/S, More Is Better
GCC 9.0.0 20181112: 22173 (SE +/- 65.83, N = 3)
GCC 8.2.0: 22254 (SE +/- 70.42, N = 3)
Clang 7.0.0: 27611 (SE +/- 11.67, N = 3)
Clang 8.0.0 20181111: 25614 (SE +/- 9.29, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1 - Signs Per Second, More Is Better
GCC 9.0.0 20181112: 4673 (SE +/- 0.31, N = 3)
GCC 8.2.0: 4658 (SE +/- 5.28, N = 3)
Clang 7.0.0: 4547 (SE +/- 2.68, N = 3)
Clang 8.0.0 20181111: 4544 (SE +/- 8.23, N = 3)
Additional options reported for two of the four configurations: -Qunused-arguments
(CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl

Memcached mcperf

Method: Set

Memcached mcperf 1.5.10 - Operations Per Second, More Is Better
GCC 9.0.0 20181112: 72449 (SE +/- 305.67, N = 3)
GCC 8.2.0: 71776 (SE +/- 259.45, N = 3)
Clang 7.0.0: 73064 (SE +/- 499.66, N = 3)
Clang 8.0.0 20181111: 73009 (SE +/- 408.85, N = 3)
(CC) gcc options: -O3 -march=native -lm -rdynamic

Xsbench

Xsbench 2017-07-06 - Lookups/s, More Is Better
GCC 9.0.0 20181112: 4664809 (SE +/- 1108.05, N = 3)
GCC 8.2.0: 4662395 (SE +/- 806.44, N = 3)
Clang 7.0.0: 5156109 (SE +/- 843.35, N = 3)
Clang 8.0.0 20181111: 5022107 (SE +/- 1402.44, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Memcached mcperf

Method: Get

Memcached mcperf 1.5.10 - Operations Per Second, More Is Better
GCC 9.0.0 20181112: 110495 (SE +/- 666.93, N = 3)
GCC 8.2.0: 108730 (SE +/- 767.29, N = 3)
Clang 7.0.0: 115868 (SE +/- 605.02, N = 3)
Clang 8.0.0 20181111: 115247 (SE +/- 1208.65, N = 3)
(CC) gcc options: -O3 -march=native -lm -rdynamic

NAS Parallel Benchmarks

Test / Class: FT.B

NAS Parallel Benchmarks 3.3 - Total Mop/s, More Is Better
GCC 9.0.0 20181112: 7174 (SE +/- 2.08, N = 3)
GCC 8.2.0: 7220 (SE +/- 8.95, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 9.74 (SE +/- 0.01, N = 3)
GCC 8.2.0: 10.13 (SE +/- 0.04, N = 3)
Clang 7.0.0: 12.16 (SE +/- 0.01, N = 3)
Clang 8.0.0 20181111: 11.80 (SE +/- 0.01, N = 3)
Additional options reported for two of the four configurations: -pipe -lncurses
(CC) gcc options: -O3 -march=native -lm

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

Zstd Compression 1.3.4 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 10.63 (SE +/- 0.03, N = 3)
GCC 8.2.0: 10.61 (SE +/- 0.01, N = 3)
Clang 7.0.0: 10.27 (SE +/- 0.04, N = 3)
Clang 8.0.0 20181111: 10.34 (SE +/- 0.07, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 10.42 (SE +/- 0.10, N = 3)
GCC 8.2.0: 10.26 (SE +/- 0.06, N = 3)
Clang 7.0.0: 8.59 (SE +/- 0.12, N = 3)
Clang 8.0.0 20181111: 8.52 (SE +/- 0.11, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Parboil

Test: OpenMP Stencil

Parboil 2.5 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 7.19 (SE +/- 0.06, N = 3)
GCC 8.2.0: 7.13 (SE +/- 0.05, N = 3)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.3 - Total Mop/s, More Is Better
GCC 9.0.0 20181112: 1250 (SE +/- 12.17, N = 3)
GCC 8.2.0: 925 (SE +/- 4.53, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

FFTW 3.3.6 - Mflops, More Is Better
GCC 9.0.0 20181112: 59299 (SE +/- 196.81, N = 3)
GCC 8.2.0: 57019 (SE +/- 1025.80, N = 3)
Clang 7.0.0: 56274 (SE +/- 463.90, N = 3)
Clang 8.0.0 20181111: 54433 (SE +/- 809.18, N = 5)
(CC) gcc options: -pthread -O3 -march=native -lm

x264

H.264 Video Encoding

x264 2018-09-25 - Frames Per Second, More Is Better
GCC 9.0.0 20181112: 128 (SE +/- 1.58, N = 7)
GCC 8.2.0: 128 (SE +/- 1.97, N = 3)
Clang 7.0.0: 125 (SE +/- 2.08, N = 4)
Clang 8.0.0 20181111: 125 (SE +/- 1.11, N = 3)
Additional options reported for two of the four configurations: -mstack-alignment=64
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 1.5.3 - Megapixels/sec, More Is Better
GCC 9.0.0 20181112: 184 (SE +/- 0.54, N = 3)
GCC 8.2.0: 183 (SE +/- 0.06, N = 3)
Clang 7.0.0: 204 (SE +/- 0.36, N = 3)
Clang 8.0.0 20181111: 198 (SE +/- 1.17, N = 3)
(CC) gcc options: -O3 -march=native -lm

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

CloverLeaf - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 3.20 (SE +/- 0.01, N = 3)
GCC 8.2.0: 2.98 (SE +/- 0.03, N = 3)
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Parboil

Test: OpenMP CUTCP

Parboil 2.5 - Seconds, Fewer Is Better
GCC 9.0.0 20181112: 2.44 (SE +/- 0.02, N = 3)
GCC 8.2.0: 2.46 (SE +/- 0.04, N = 3)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

Test / Class: FT.A

NAS Parallel Benchmarks 3.3 - Total Mop/s, More Is Better
GCC 9.0.0 20181112: 6718 (SE +/- 17.12, N = 3)
GCC 8.2.0: 6756 (SE +/- 12.06, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

BLAKE2

BLAKE2 20170307 - Cycles Per Byte, Fewer Is Better
GCC 9.0.0 20181112: 3.51 (SE +/- 0.11, N = 12)
GCC 8.2.0: 3.66 (SE +/- 0.09, N = 12)
Clang 7.0.0: 3.02 (SE +/- 0.06, N = 3)
Clang 8.0.0 20181111: 3.10 (SE +/- 0.01, N = 3)
(CC) gcc options: -O3 -march=native -lcrypto -lz

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 - Mflops, More Is Better
GCC 9.0.0 20181112: 2069 (SE +/- 3.14, N = 3)
GCC 8.2.0: 2074 (SE +/- 2.18, N = 3)
Clang 7.0.0: 1662 (SE +/- 1.35, N = 3)
Clang 8.0.0 20181111: 1661 (SE +/- 1.35, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 - Mflops, More Is Better
GCC 9.0.0 20181112: 6063 (SE +/- 7.72, N = 3)
GCC 8.2.0: 6071 (SE +/- 10.59, N = 3)
Clang 7.0.0: 6476 (SE +/- 0.69, N = 3)
Clang 8.0.0 20181111: 6449 (SE +/- 18.31, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 - Mflops, More Is Better
GCC 9.0.0 20181112: 3295 (SE +/- 3.31, N = 3)
GCC 8.2.0: 3295 (SE +/- 1.55, N = 3)
Clang 7.0.0: 3300 (SE +/- 3.75, N = 3)
Clang 8.0.0 20181111: 3288 (SE +/- 5.51, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 - Mflops, More Is Better
GCC 9.0.0 20181112: 732 (SE +/- 1.85, N = 3)
GCC 8.2.0: 727 (SE +/- 0.86, N = 3)
Clang 7.0.0: 763 (SE +/- 3.38, N = 3)
Clang 8.0.0 20181111: 768 (SE +/- 1.98, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0 - Mflops, More Is Better
GCC 9.0.0 20181112: 906 (SE +/- 0.99, N = 3)
GCC 8.2.0: 906 (SE +/- 1.15, N = 3)
Clang 7.0.0: 717 (SE +/- 0.32, N = 3)
Clang 8.0.0 20181111: 717 (SE +/- 0.29, N = 3)
(CC) gcc options: -O3 -march=native -lm


Phoronix Test Suite v10.8.4