PGI Compiler 18.10 Benchmarks vs. GCC vs. LLVM Clang

PGI compiler benchmarks for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1812180-SK-PGICOMPIL33&grr&rdt.

PGI Compiler 18.10 Benchmarks vs. GCC vs. LLVM ClangProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.0Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)ASUS PRIME X299-A (1602 BIOS)Intel Sky Lake-E DMI3 Registers16384MB240GB Force MP510NVIDIA NV120 12GBRealtek ALC1220ASUS PB278Intel ConnectionUbuntu 18.104.20.0-999-generic (x86_64) 20181206GNOME Shell 3.30.1X Server 1.20.1modesetting 1.20.14.3 Mesa 18.2.2PGI Compiler 18.10-1ext42560x1440GCC 8.2.0OpenBenchmarking.orgEnvironment Details- CXXFLAGS=-O3 CXXFLAGS_OVERRIDE=-O3 CFLAGS=-O3 CFLAGS_OVERRIDE=-O3Disk Details- NONE / errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: intel_pstate powersavePython Details- PGI Compiler 18.10: Python 2.7.15+ + Python 3.6.7Security Details- __user pointer sanitization + disabled STIBP: disabled + PTE Inversion; VMX: vulnerable Compiler Details- GCC 8.2.0: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v

PGI Compiler 18.10 Benchmarks vs. GCC vs. LLVM Clangblogbench: Readc-ray: Total Time - 4K, 16 Rays Per Pixelhpcg: aobench: 2048 x 2048 - Total Timescimark2: Compositepolybench-c: Covariance Computationhmmer: Pfam Database Searchpolybench-c: Correlation Computationpolybench-c: 3 Matrix Multiplicationstscp: AI Chess Performancescimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte CarloPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.075231039.241.3424.1519327.238.514.813.351199430143846222243768591709382103.101.3432.1825184.869.994.862.86144032116655796340577395096263152.791.3333.8124784.878.284.882.831629136166260693164778717OpenBenchmarking.org

BlogBench

Test: Read

OpenBenchmarking.orgFinal Score, More Is BetterBlogBench 1.1Test: ReadPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.0200K400K600K800K1000KSE +/- 18545.35, N = 9SE +/- 12212.43, N = 3SE +/- 75775.18, N = 9752310709382962631-pthread-pthread1. (CC) gcc options: -O3

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.020406080100SE +/- 0.02, N = 3SE +/- 0.66, N = 3SE +/- 0.00, N = 339.24103.1052.791. (CC) gcc options: -lm -lpthread -O3

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0PGI Compiler 18.10GCC 8.2.0LLVM Clang 7.00.30150.6030.90451.2061.5075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.341.341.33

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimePGI Compiler 18.10GCC 8.2.0LLVM Clang 7.0816243240SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 324.1532.1833.811. (CC) gcc options: -lm -O3

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositePGI Compiler 18.10GCC 8.2.0LLVM Clang 7.05001000150020002500SE +/- 0.39, N = 3SE +/- 0.69, N = 3SE +/- 1.38, N = 31932251824781. (CC) gcc options: -O3 -lm

PolyBench-C

Test: Covariance Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Covariance ComputationPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.0246810SE +/- 0.74, N = 12SE +/- 0.01, N = 3SE +/- 0.00, N = 37.234.864.871. (CC) gcc options: -O3

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.03691215SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.11, N = 38.519.998.28-pthread-pthread1. (CC) gcc options: -O3 -lhmmer -lsquid -lm

PolyBench-C

Test: Correlation Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Correlation ComputationPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.01.0982.1963.2944.3925.49SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.814.864.881. (CC) gcc options: -O3

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: 3 Matrix MultiplicationsPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.00.75381.50762.26143.01523.769SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 33.352.862.831. (CC) gcc options: -O3

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformancePGI Compiler 18.10GCC 8.2.0LLVM Clang 7.0300K600K900K1200K1500KSE +/- 521.80, N = 5SE +/- 922.23, N = 5SE +/- 15151.69, N = 51199430144032116291361. (CC) gcc options: -O3

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.0400800120016002000SE +/- 0.21, N = 3SE +/- 0.37, N = 3SE +/- 0.69, N = 31438166516621. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.013002600390052006500SE +/- 0.84, N = 3SE +/- 0.98, N = 3SE +/- 4.29, N = 34622579660691. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.07001400210028003500SE +/- 0.36, N = 3SE +/- 2.59, N = 3SE +/- 1.62, N = 32243340531641. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.02004006008001000SE +/- 2.80, N = 3SE +/- 1.66, N = 3SE +/- 1.63, N = 37687737781. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloPGI Compiler 18.10GCC 8.2.0LLVM Clang 7.02004006008001000SE +/- 0.53, N = 3SE +/- 0.09, N = 3SE +/- 0.15, N = 35919507171. (CC) gcc options: -O3 -lm


Phoronix Test Suite v10.8.5