POWER8 Compiler Benchmarks

Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1602146-GA-POWER8COM58&sor&grr.

POWER8 Compiler BenchmarksProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionGCC 5.3.1Clang 3.6.2POWER8 @ 3.86GHz (64 Cores)PowerNV 8335-GCA131072MB2000GB HDS722020ALA330ASPEED ASPEED FamilyBroadcom NetXtreme BCM5719 Gigabit PCIeDebian testing4.3.0-1-powerpc64le (ppc64le)modesetting 1.18.0GCC 5.3.1 20160121 + Clang 3.6.2-3ext41024x768Clang 3.6.2-3OpenBenchmarking.orgCompiler Details- GCC 5.3.1: --build=powerpc64le-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-multilib --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-secureplt --enable-shared --enable-targets=powerpcle-linux --enable-threads=posix --host=powerpc64le-linux-gnu --target=powerpc64le-linux-gnu --with-arch-directory=ppc64le --with-cpu=power8 --with-default-libstdcxx-abi=new --with-long-double-128 -v Processor Details- Scaling Governor: powernv-cpufreq ondemand

POWER8 Compiler Benchmarksapache: Static Web Page Servingpgbench: Buffer Test - Normal Load - Read Writeopenssl: RSA 4096-bit Performanceencode-flac: WAV To FLACstockfish: Total Timesmallpt: Global Illumination Renderer; 100 Samplesc-ray: Total Timebuild-apache: Time To Compilecompress-7zip: Compress Speed Testhimeno: Poisson Pressure Solverscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carloscimark2: Compositerodinia: OpenMP Streamclusterrodinia: OpenMP CFD Solverrodinia: OpenMP LavaMDGCC 5.3.1Clang 3.6.216037.3394.63261.6030.3357004610.2932.7247630347.43465.121354.681116.53189.64146.92654.5841.0657.8269.0116057.1888.54243.1744.495545119824.3127.6848557265.91963.501493.681241.75198.26155.51810.54OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingClang 3.6.2GCC 5.3.13K6K9K12K15KSE +/- 76.28, N = 3SE +/- 160.69, N = 316057.1816037.331. (CC) gcc options: -shared -fPIC -O2 -pthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 5.3.1Clang 3.6.220406080100SE +/- 6.85, N = 6SE +/- 5.39, N = 694.6388.54-pthreads1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit PerformanceGCC 5.3.1Clang 3.6.260120180240300SE +/- 0.17, N = 3SE +/- 0.15, N = 3261.60243.171. (CC) gcc options: -O3 -fomit-frame-pointer -lssl -lcrypto -ldl

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLACGCC 5.3.1Clang 3.6.21020304050SE +/- 0.02, N = 5SE +/- 0.01, N = 530.3344.49-fvisibility=hidden1. (CXX) g++ options: -O2 -lm

Stockfish

Total Time

OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total TimeClang 3.6.2GCC 5.3.112002400360048006000SE +/- 1.15, N = 3SE +/- 1.86, N = 355455700-flto1. (CXX) g++ options: -lpthread -fno-exceptions -fno-rtti -ansi -pedantic -O3

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesGCC 5.3.1Clang 3.6.230060090012001500SE +/- 0.33, N = 3SE +/- 0.00, N = 34611981. (CXX) g++ options: -fopenmp

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeGCC 5.3.1Clang 3.6.2612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 310.2924.311. (CC) gcc options: -lm -lpthread -O3

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To CompileClang 3.6.2GCC 5.3.1816243240SE +/- 0.07, N = 3SE +/- 0.04, N = 327.6832.72

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 9.20.1Compress Speed TestClang 3.6.2GCC 5.3.110K20K30K40K50KSE +/- 60.76, N = 3SE +/- 11.89, N = 348557476301. (CXX) g++ options: -pipe -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 5.3.1Clang 3.6.280160240320400SE +/- 1.77, N = 3SE +/- 0.70, N = 3347.43265.911. (CC) gcc options: -O3

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationClang 3.6.2GCC 5.3.12004006008001000SE +/- 0.02, N = 4SE +/- 0.01, N = 4963.50465.12

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationClang 3.6.2GCC 5.3.130060090012001500SE +/- 2.85, N = 4SE +/- 0.96, N = 41493.681354.68

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyClang 3.6.2GCC 5.3.130060090012001500SE +/- 2.23, N = 4SE +/- 1.13, N = 41241.751116.53

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformClang 3.6.2GCC 5.3.14080120160200SE +/- 3.21, N = 4SE +/- 0.23, N = 4198.26189.64

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloClang 3.6.2GCC 5.3.1306090120150SE +/- 0.24, N = 4SE +/- 0.07, N = 4155.51146.92

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeClang 3.6.2GCC 5.3.12004006008001000SE +/- 1.29, N = 4SE +/- 0.33, N = 4810.54654.58

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterGCC 5.3.1918273645SE +/- 0.10, N = 341.061. (CXX) g++ options: -O3 -fopenmp

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverGCC 5.3.11326395265SE +/- 0.09, N = 357.821. (CXX) g++ options: -O3 -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDGCC 5.3.11530456075SE +/- 0.05, N = 369.011. (CXX) g++ options: -O3 -fopenmp


Phoronix Test Suite v10.8.5