Compiler Intel Broadwell Linux Tests

Compiler Broadwell tests by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1502176-LI-1501249DE02&grr&sro.

GCC 4.9.2:
  Processor: Intel Core i7-5600U @ 3.20GHz (4 Cores)
  Motherboard: LENOVO 20BSCTO1WW
  Chipset: Intel Broadwell-U-OPI
  Memory: 8192MB
  Disk: 128GB SAMSUNG MZNTE128
  Graphics: Intel Broadwell-U (950MHz)
  Audio: Intel Broadwell-U Audio
  Network: Intel Connection + Intel Wireless 7265
  OS: Fedora 21
  Kernel: 3.17.8-300.fc21.x86_64 (x86_64)
  Desktop: GNOME Shell 3.14.3
  Display Server: X Server 1.16.2.901 (1.16.3 RC 1)
  Display Driver: intel 2.99.916
  OpenGL: 3.3 Mesa 10.4.1
  Compiler: GCC 4.9.2 20141101
  File-System: ext4
  Screen Resolution: 1920x1080

LLVM Clang 3.5:
  Compiler: Clang 3.5.0 + LLVM 3.5.0

Core 2 Quad:
  Processor: Intel Core 2 Quad Q9400 @ 2.67GHz (4 Cores)
  Motherboard: Gigabyte EP45-DS3LR
  Chipset: Intel 4 DRAM + ICH10R
  Memory: 4096MB
  Disk: 1000GB Seagate ST1000DX001-1CM1
  Graphics: MSI NVIDIA GeForce GTX 580 3072MB (50/135MHz)
  Audio: Intel 82801JI
  Monitor: LG E2260
  Network: Realtek RTL8111/8168/8411
  OS: Arch Linux
  Kernel: 3.18.6-1-ARCH (x86_64)
  Desktop: Cinnamon 2.4.6
  Display Driver: NVIDIA 346.35
  OpenGL: 4.4.0
  Compiler: GCC 4.9.2 20150204 + Clang 3.5.1 + LLVM 3.5.1 + CUDA 6.5

Compiler Details
- GCC 4.9.2: --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,fortran,ada,go,lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-linker-hash-style=gnu --with-tune=generic
- LLVM Clang 3.5: Optimized build; Built Dec 25 2014 (21:22:22); Default target: x86_64-redhat-linux-gnu; Host CPU: x86-64
- Core 2 Quad: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-cloog-backend=isl --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-linker-hash-style=gnu

Processor Details
- GCC 4.9.2: Scaling Governor: intel_pstate powersave
- LLVM Clang 3.5: Scaling Governor: intel_pstate powersave
- Core 2 Quad: Scaling Governor: acpi-cpufreq ondemand

System Details
- GCC 4.9.2, LLVM Clang 3.5: SELinux: Enabled.

Test                                                GCC 4.9.2       LLVM Clang 3.5  Core 2 Quad
fftw: Stock - 2D FFT Size 4096                      4091.20         4070.66         2146.43
fftw: Stock - 2D FFT Size 2048                      4504.06         4260.74         2100.33
fftw: Stock - 1D FFT Size 4096                      6053.00         5447.48         2936.83
fftw: Stock - 1D FFT Size 2048                      6283.32         5443.12         3097.34
apache: Static Web Page Serving                     15429.86        15499.88        12980.64
hint: FLOAT                                         206580965.69    237608504.18    198825231.06
encode-mp3: WAV To MP3                              12.82           12.88           23.95
encode-flac: WAV To FLAC                            6.87            9.12            22.77
bullet: Convex Trimesh                              1.46            1.57
bullet: Prim Trimesh                                1.23            1.31
bullet: 136 Ragdolls                                3.79            3.89
bullet: 1000 Convex                                 5.76            6.59
bullet: 1000 Stack                                  6.65            7.45
bullet: 3000 Fall                                   5.75            6.08
stockfish: Total Time                               3933            4108
smallpt: Global Illumination Renderer; 100 Samples  64              148             218
c-ray: Total Time                                   48.40           73.93           46.98
build-php: Time To Compile                          60.94           44.14           68.83
build-apache: Time To Compile                       55.65           38.61           63.35
ebizzy: Phoronix Test Suite v5.6.0m1                18192           17879           15444
himeno: Poisson Pressure Solver                     1618.11         1459.12         878.01
john-the-ripper: Traditional DES                    4296333         4925333         9445333
john-the-ripper: Blowfish                           2382            926             2611
scimark2: Jacobi Successive Over-Relaxation         1065.17         1579.54         777.04
scimark2: Dense LU Matrix Factorization             2046.08         2953.00         659.65
scimark2: Sparse Matrix Multiply                    1957.09         2070.94         571.91
scimark2: Fast Fourier Transform                    263.97          279.26          67.70
scimark2: Monte Carlo                               564.44          603.62          335.53
scimark2: Composite                                 1179.35         1497.26         473.36
mrbayes: Primate Phylogeny Analysis                 25.44           26.28           46.44
mafft: Multiple Sequence Alignment                  11.39           12.88           13.51
hmmer: Pfam Database Search                         21.77           22.43           26.22
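Because the summary mixes "more is better" and "fewer is better" results, a relative comparison has to flip the ratio depending on the unit. The following is a minimal Python sketch of one way to do that for a few of the results above; the helper name `clang_vs_gcc_percent` and the choice of rows are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Hypothetical helper for illustration; values are copied from the summary above.
def clang_vs_gcc_percent(gcc, clang, more_is_better):
    """Return how much better (+) or worse (-) Clang scored than GCC, in percent."""
    if more_is_better:
        # Throughput-style result: a larger Clang value is an improvement.
        return (clang / gcc - 1.0) * 100.0
    # Time-style result: a smaller Clang value is an improvement.
    return (gcc / clang - 1.0) * 100.0


# (GCC 4.9.2 result, LLVM Clang 3.5 result, more_is_better)
results = {
    "fftw 1D FFT Size 2048 (Mflops)": (6283.32, 5443.12, True),
    "build-php (seconds)": (60.94, 44.14, False),
    "c-ray (seconds)": (48.40, 73.93, False),
    "scimark2 Composite (Mflops)": (1179.35, 1497.26, True),
}

for name, (gcc, clang, more_is_better) in results.items():
    delta = clang_vs_gcc_percent(gcc, clang, more_is_better)
    print(f"{name}: {delta:+.1f}%")
```

A positive number means Clang did better than GCC on that test under this convention, regardless of the unit.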

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.4 - Mflops, More Is Better
  Core 2 Quad: 2146.43 (SE +/- 36.00, N = 10)
  GCC 4.9.2: 4091.20 (SE +/- 73.64, N = 5)
  LLVM Clang 3.5: 4070.66 (SE +/- 18.53, N = 5)
Per-result options:
  Core 2 Quad: -std=gnu99 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  GCC 4.9.2: -std=gnu99 -march=native
  LLVM Clang 3.5: -march=native
(CC) gcc options: -O3 -lm
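Each result is reported with a standard error (SE) over N runs, which gives a rough way to judge whether a gap between two compilers is larger than run-to-run noise. The sketch below is one simple heuristic, not something the result file itself computes: it divides the difference of two means by the combined standard error, assuming the two sets of runs are independent. The function name is hypothetical.

```python
import math


def gap_in_combined_se(mean_a, se_a, mean_b, se_b):
    """Express |mean_a - mean_b| in units of the combined standard error.

    Assumes the two measurements are independent, so their standard
    errors add in quadrature.
    """
    combined_se = math.hypot(se_a, se_b)
    return abs(mean_a - mean_b) / combined_se


# FFTW 2D FFT Size 4096: GCC 4.9.2 vs. LLVM Clang 3.5 (values from this result).
ratio = gap_in_combined_se(4091.20, 73.64, 4070.66, 18.53)
print(f"gap = {ratio:.2f} combined standard errors")
```

A ratio well below 1 suggests the two results are not distinguishable at this sample size; a ratio of several units suggests a real difference.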

FFTW

Build: Stock - Size: 2D FFT Size 2048

FFTW 3.3.4 - Mflops, More Is Better
  Core 2 Quad: 2100.33 (SE +/- 42.89, N = 10)
  GCC 4.9.2: 4504.06 (SE +/- 67.63, N = 5)
  LLVM Clang 3.5: 4260.74 (SE +/- 14.70, N = 5)
Per-result options:
  Core 2 Quad: -std=gnu99 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  GCC 4.9.2: -std=gnu99 -march=native
  LLVM Clang 3.5: -march=native
(CC) gcc options: -O3 -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW 3.3.4 - Mflops, More Is Better
  Core 2 Quad: 2936.83 (SE +/- 134.03, N = 10)
  GCC 4.9.2: 6053.00 (SE +/- 14.77, N = 5)
  LLVM Clang 3.5: 5447.48 (SE +/- 49.20, N = 5)
Per-result options:
  Core 2 Quad: -std=gnu99 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  GCC 4.9.2: -std=gnu99 -march=native
  LLVM Clang 3.5: -march=native
(CC) gcc options: -O3 -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

FFTW 3.3.4 - Mflops, More Is Better
  Core 2 Quad: 3097.34 (SE +/- 114.75, N = 10)
  GCC 4.9.2: 6283.32 (SE +/- 8.17, N = 5)
  LLVM Clang 3.5: 5443.12 (SE +/- 17.21, N = 5)
Per-result options:
  Core 2 Quad: -std=gnu99 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math
  GCC 4.9.2: -std=gnu99 -march=native
  LLVM Clang 3.5: -march=native
(CC) gcc options: -O3 -lm

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.7 - Requests Per Second, More Is Better
  Core 2 Quad: 12980.64 (SE +/- 6.92, N = 3)
  GCC 4.9.2: 15429.86 (SE +/- 175.48, N = 3)
  LLVM Clang 3.5: 15499.88 (SE +/- 191.64, N = 3)
Per-result options:
  Core 2 Quad: -O2
  GCC 4.9.2: -O3 -march=native
  LLVM Clang 3.5: -O3 -march=native
(CC) gcc options: -shared -fPIC -pthread

Hierarchical INTegration

Test: FLOAT

Hierarchical INTegration 1.0 - QUIPs, More Is Better
  Core 2 Quad: 198825231.06 (SE +/- 1986722.47, N = 3)
  GCC 4.9.2: 206580965.69 (SE +/- 8609472.39, N = 6)
  LLVM Clang 3.5: 237608504.18 (SE +/- 245808.00, N = 3)
(CC) gcc options: -O3 -march=native -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3 - Seconds, Fewer Is Better
  Core 2 Quad: 23.95 (SE +/- 0.07, N = 5)
  GCC 4.9.2: 12.82 (SE +/- 0.01, N = 5)
  LLVM Clang 3.5: 12.88 (SE +/- 0.01, N = 5)
Per-result options:
  Core 2 Quad: -fomit-frame-pointer -ffast-math -lncurses
  GCC 4.9.2: -march=native
  LLVM Clang 3.5: -march=native
(CC) gcc options: -O3 -pipe -lm

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.1 - Seconds, Fewer Is Better
  Core 2 Quad: 22.77 (SE +/- 0.58, N = 10)
  GCC 4.9.2: 6.87 (SE +/- 0.05, N = 5)
  LLVM Clang 3.5: 9.12 (SE +/- 0.01, N = 5)
Per-result options:
  Core 2 Quad: -O2 -fvisibility=hidden -logg
  GCC 4.9.2: -O3 -march=native -fvisibility=hidden
  LLVM Clang 3.5: -O3 -march=native
(CXX) g++ options: -lm

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better
  GCC 4.9.2: 1.46 (SE +/- 0.00, N = 3)
  LLVM Clang 3.5: 1.57 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better
  GCC 4.9.2: 1.23 (SE +/- 0.00, N = 3)
  LLVM Clang 3.5: 1.31 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better
  GCC 4.9.2: 3.79 (SE +/- 0.01, N = 3)
  LLVM Clang 3.5: 3.89 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better
  GCC 4.9.2: 5.76 (SE +/- 0.01, N = 3)
  LLVM Clang 3.5: 6.59 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better
  GCC 4.9.2: 6.65 (SE +/- 0.11, N = 3)
  LLVM Clang 3.5: 7.45 (SE +/- 0.07, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better
  GCC 4.9.2: 5.75 (SE +/- 0.08, N = 3)
  LLVM Clang 3.5: 6.08 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Stockfish

Total Time

Stockfish 2014-11-26 - ms, Fewer Is Better
  GCC 4.9.2: 3933 (SE +/- 23.88, N = 3)
  LLVM Clang 3.5: 4108 (SE +/- 10.27, N = 3)
Per-result options listed (1 of 2 results): -flto
(CXX) g++ options: -lpthread -O3 -march=native -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 - Seconds, Fewer Is Better
  Core 2 Quad: 218 (SE +/- 0.58, N = 3)
  GCC 4.9.2: 64 (SE +/- 0.33, N = 3)
  LLVM Clang 3.5: 148 (SE +/- 0.67, N = 3)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: -fopenmp

C-Ray

Total Time

C-Ray 1.1 - Seconds, Fewer Is Better
  Core 2 Quad: 46.98 (SE +/- 0.01, N = 3)
  GCC 4.9.2: 48.40 (SE +/- 0.29, N = 3)
  LLVM Clang 3.5: 73.93 (SE +/- 0.43, N = 3)
Per-result options listed (2 of 3 results): -march=native; -march=native
(CC) gcc options: -lm -lpthread -O3

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9 - Seconds, Fewer Is Better
  Core 2 Quad: 68.83 (SE +/- 0.37, N = 3)
  GCC 4.9.2: 60.94 (SE +/- 0.24, N = 3)
  LLVM Clang 3.5: 44.14 (SE +/- 0.27, N = 3)
Per-result options:
  Core 2 Quad: -O2
  GCC 4.9.2: -O3 -march=native
  LLVM Clang 3.5: -O3 -march=native
(CC) gcc options: -pedantic -ldl -lz -lm

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.7 - Seconds, Fewer Is Better
  Core 2 Quad: 63.35 (SE +/- 0.39, N = 3)
  GCC 4.9.2: 55.65 (SE +/- 0.31, N = 3)
  LLVM Clang 3.5: 38.61 (SE +/- 0.25, N = 3)

ebizzy

Phoronix Test Suite v5.6.0m1

ebizzy 0.3 - Records/s, More Is Better
  Core 2 Quad: 15444 (SE +/- 276.70, N = 6)
  GCC 4.9.2: 18192 (SE +/- 310.69, N = 4)
  LLVM Clang 3.5: 17879 (SE +/- 256.96, N = 5)
Per-result options listed (2 of 3 results): -march=native; -march=native
(CC) gcc options: -pthread -lpthread -O3

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 - MFLOPS, More Is Better
  Core 2 Quad: 878.01 (SE +/- 24.89, N = 6)
  GCC 4.9.2: 1618.11 (SE +/- 0.81, N = 3)
  LLVM Clang 3.5: 1459.12 (SE +/- 43.19, N = 6)
Per-result options listed (2 of 3 results): -march=native; -march=native
(CC) gcc options: -O3

John The Ripper

Test: Traditional DES

John The Ripper 1.8.0 - Real C/S, More Is Better
  Core 2 Quad: 9445333 (SE +/- 371714.02, N = 6)
  GCC 4.9.2: 4296333 (SE +/- 5897.27, N = 3)
  LLVM Clang 3.5: 4925333 (SE +/- 22040.37, N = 3)
(CC) gcc options: -fopenmp -lcrypt

John The Ripper

Test: Blowfish

John The Ripper 1.8.0 - Real C/S, More Is Better
  Core 2 Quad: 2611 (SE +/- 5.49, N = 3)
  GCC 4.9.2: 2382 (SE +/- 1.67, N = 3)
  LLVM Clang 3.5: 926 (SE +/- 0.00, N = 3)
(CC) gcc options: -fopenmp -lcrypt

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 - Mflops, More Is Better
  Core 2 Quad: 777.04 (SE +/- 1.00, N = 4)
  GCC 4.9.2: 1065.17 (SE +/- 1.02, N = 4)
  LLVM Clang 3.5: 1579.54 (SE +/- 1.47, N = 4)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: none listed

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 - Mflops, More Is Better
  Core 2 Quad: 659.65 (SE +/- 3.16, N = 3)
  GCC 4.9.2: 2046.08 (SE +/- 8.95, N = 4)
  LLVM Clang 3.5: 2953.00 (SE +/- 20.80, N = 4)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: none listed

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 - Mflops, More Is Better
  Core 2 Quad: 571.91 (SE +/- 31.07, N = 4)
  GCC 4.9.2: 1957.09 (SE +/- 2.10, N = 4)
  LLVM Clang 3.5: 2070.94 (SE +/- 50.78, N = 4)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: none listed

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 - Mflops, More Is Better
  Core 2 Quad: 67.70 (SE +/- 0.72, N = 4)
  GCC 4.9.2: 263.97 (SE +/- 3.04, N = 4)
  LLVM Clang 3.5: 279.26 (SE +/- 0.45, N = 3)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: none listed

SciMark

Computational Test: Monte Carlo

SciMark 2.0 - Mflops, More Is Better
  Core 2 Quad: 335.53 (SE +/- 12.62, N = 4)
  GCC 4.9.2: 564.44 (SE +/- 6.45, N = 4)
  LLVM Clang 3.5: 603.62 (SE +/- 24.59, N = 4)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: none listed

SciMark

Computational Test: Composite

SciMark 2.0 - Mflops, More Is Better
  Core 2 Quad: 473.36 (SE +/- 8.53, N = 8)
  GCC 4.9.2: 1179.35 (SE +/- 1.92, N = 4)
  LLVM Clang 3.5: 1497.26 (SE +/- 13.74, N = 4)
Per-result options listed (2 of 3 results): -O3 -march=native; -O3 -march=native
(CXX) g++ options: none listed
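SciMark 2.0 defines its composite score as the arithmetic mean of its five kernel scores. The Python sketch below recomputes that mean from the per-kernel averages reported in this result file and prints it next to the reported composite. The helper name is hypothetical. Because each figure here is itself an average over several runs, and the run counts differ between charts (N = 8 for the Core 2 Quad composite, for example), the recomputed value should be expected to land close to the reported composite rather than match it exactly.

```python
def kernel_mean(scores):
    """Arithmetic mean of a list of per-kernel Mflops scores."""
    return sum(scores) / len(scores)


# Kernel order: SOR, Dense LU, Sparse Matrix Multiply, FFT, Monte Carlo.
kernels = {
    "GCC 4.9.2": [1065.17, 2046.08, 1957.09, 263.97, 564.44],
    "LLVM Clang 3.5": [1579.54, 2953.00, 2070.94, 279.26, 603.62],
    "Core 2 Quad": [777.04, 659.65, 571.91, 67.70, 335.53],
}

# Composite scores as reported in this result file.
reported = {
    "GCC 4.9.2": 1179.35,
    "LLVM Clang 3.5": 1497.26,
    "Core 2 Quad": 473.36,
}

for name, scores in kernels.items():
    mean = kernel_mean(scores)
    print(f"{name}: mean of kernels {mean:.2f}, reported composite {reported[name]:.2f}")
```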

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.1.2 - Seconds, Fewer Is Better
  Core 2 Quad: 46.44 (SE +/- 0.77, N = 6)
  GCC 4.9.2: 25.44 (SE +/- 0.27, N = 3)
  LLVM Clang 3.5: 26.28 (SE +/- 0.25, N = 3)

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864 - Seconds, Fewer Is Better
  Core 2 Quad: 13.51 (SE +/- 0.25, N = 3)
  GCC 4.9.2: 11.39 (SE +/- 0.33, N = 6)
  LLVM Clang 3.5: 12.88 (SE +/- 0.26, N = 6)
(CC) gcc options: -O3 -lm -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 - Seconds, Fewer Is Better
  Core 2 Quad: 26.22 (SE +/- 0.17, N = 3)
  GCC 4.9.2: 21.77 (SE +/- 0.76, N = 6)
  LLVM Clang 3.5: 22.43 (SE +/- 0.66, N = 6)
Per-result options:
  Core 2 Quad: -O2
  GCC 4.9.2: -O3 -march=native
  LLVM Clang 3.5: -O3 -march=native
(CC) gcc options: -pthread -lhmmer -lsquid -lm


Phoronix Test Suite v10.8.4