LLVM Clang 3.8 Compiler Tuning

Intel Xeon E5-2687W v3 testing with a MSI X99S SLI PLUS (MS-7885) v1.0 and AMD FirePro V7900 2048MB on Ubuntu 16.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1602049-GA-LLVMCLANG61&grr.

LLVM Clang 3.8 Compiler TuningProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O0-O1-O2-Oz-O3-O3 -march=nativeIntel Xeon E5-2687W v3 @ 3.50GHz (20 Cores)MSI X99S SLI PLUS (MS-7885) v1.0Intel Xeon E7 v3/Xeon16384MBPNY CS1211 120GB + 80GB INTEL SSDSCKGW08AMD FirePro V7900 2048MBRealtek ALC892ASUS PB278Intel ConnectionUbuntu 16.044.5.0-040500rc1-generic (x86_64) 20160124Unity 7.4.0X Server 1.17.3radeon 7.6.13.3 Mesa 11.0.8 Gallium 0.4Clang 3.8.0 (SVN 259676) + LLVM 3.8.0ext42560x1440OpenBenchmarking.orgCompiler Details- Optimized build; Built Feb 3 2016 (13:57:10); Default target: x86_64-unknown-linux-gnu; Host CPU: haswellProcessor Details- Scaling Governor: intel_pstate powersave

LLVM Clang 3.8 Compiler Tuningapache: Static Web Page Servinghint: FLOATredis: SETredis: GETredis: LPUSHredis: SADDredis: LPOPpgbench: Buffer Test - Heavy Contention - Read Writepgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Normal Load - Read Writeencode-mp3: WAV To MP3encode-flac: WAV To FLACsmallpt: Global Illumination Renderer; 100 Samplesc-ray: Total Timebuild-php: Time To Compilebuild-apache: Time To Compilehimeno: Poisson Pressure Solvergraphics-magick: Local Adaptive Thresholdinggraphics-magick: HWB Color Spacegraphics-magick: Sharpengraphics-magick: Blurscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carloscimark2: Compositehmmer: Pfam Database Search-O0-O1-O2-Oz-O3-O3 -march=native23021.42112554681.82468101.81527734.53456864.10481943.16541334.665176.52284.515007.7537.1057.4427.584.249.25284.58188361671477.574133.952807.96358.28238.141803.1812.1823380.23266823791.59575054.87625334.40585266.59587106.65642965.385209.08348.865049.3415.689.1216.308.9317.831334.47701501111161476.744187.612776.50357.19237.831807.1815.9923360.85322378022.36579653.25624757.29570314.27599041.21633016.315050.83379.004968.1013.958.651219.8111.8221.271359.01811501131191478.304137.162798.01363.59234.761802.3614.0223342.16314508487.41585438.60632178.77589078.56581864.31631316.904911.10357.324736.9116.8812.271319.709.5519.221002.24711441081131477.784181.892806.62357.50237.111812.1815.0923283.07321885788.84582801.37649056.62587392.23592835.98642800.774630.05351.754597.6514.388.661213.2115.7921.671354.29801501131191478.194127.102766.53358.05237.631793.5015.1323355.46268576816.54580208.84642443.67575188.23587747.73624901.335342.15362.585092.2015.227.111212.7815.9321.871342.94841501071171478.724901.042613.77362.39235.581918.3014.80OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page Serving-O0-O1-O2-Oz-O3-O3 -march=native5K10K15K20K25KSE +/- 50.43, N = 3SE +/- 55.39, N = 3SE +/- 64.64, N = 3SE +/- 21.97, N = 3SE +/- 43.31, N = 3SE +/- 103.00, N = 323021.4223380.2323360.8523342.1623283.0723355.46-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -shared -fPIC -pthread

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O0-O1-O2-Oz-O3-O3 -march=native70M140M210M280M350MSE +/- 392885.93, N = 3SE +/- 452959.10, N = 3SE +/- 1309178.89, N = 3SE +/- 1422860.99, N = 3SE +/- 328417.31, N = 3SE +/- 865143.17, N = 3112554681.82266823791.59322378022.36314508487.41321885788.84268576816.54-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SET-O0-O1-O2-Oz-O3-O3 -march=native130K260K390K520K650KSE +/- 8333.77, N = 3SE +/- 9986.25, N = 3SE +/- 8932.93, N = 3SE +/- 4629.62, N = 3SE +/- 8917.10, N = 3SE +/- 9778.12, N = 4468101.81575054.87579653.25585438.60582801.37580208.841. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GET-O0-O1-O2-Oz-O3-O3 -march=native140K280K420K560K700KSE +/- 2815.08, N = 3SE +/- 9198.37, N = 6SE +/- 6780.00, N = 3SE +/- 12112.33, N = 3SE +/- 11447.92, N = 3SE +/- 10103.06, N = 3527734.53625334.40624757.29632178.77649056.62642443.671. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: LPUSH-O0-O1-O2-Oz-O3-O3 -march=native130K260K390K520K650KSE +/- 7778.94, N = 4SE +/- 8816.15, N = 5SE +/- 7327.04, N = 3SE +/- 10683.69, N = 3SE +/- 9549.58, N = 3SE +/- 8620.97, N = 3456864.10585266.59570314.27589078.56587392.23575188.231. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SADD-O0-O1-O2-Oz-O3-O3 -march=native130K260K390K520K650KSE +/- 4708.54, N = 3SE +/- 6376.53, N = 3SE +/- 6002.71, N = 3SE +/- 8371.51, N = 3SE +/- 4485.63, N = 3SE +/- 7782.07, N = 3481943.16587106.65599041.21581864.31592835.98587747.731. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: LPOP-O0-O1-O2-Oz-O3-O3 -march=native140K280K420K560K700KSE +/- 5474.55, N = 3SE +/- 2297.03, N = 3SE +/- 8627.19, N = 3SE +/- 6559.78, N = 3SE +/- 6373.30, N = 3SE +/- 7141.78, N = 3541334.66642965.38633016.31631316.90642800.77624901.331. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write-O0-O1-O2-Oz-O3-O3 -march=native11002200330044005500SE +/- 54.54, N = 3SE +/- 22.85, N = 3SE +/- 5.93, N = 3SE +/- 79.23, N = 4SE +/- 68.69, N = 3SE +/- 97.00, N = 65176.525209.085050.834911.104630.055342.15-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -pthread -pthreads -mthreads -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Single Thread - Mode: Read Write-O0-O1-O2-Oz-O3-O3 -march=native80160240320400SE +/- 4.11, N = 3SE +/- 6.43, N = 6SE +/- 3.72, N = 3SE +/- 5.12, N = 5SE +/- 6.37, N = 3SE +/- 4.92, N = 5284.51348.86379.00357.32351.75362.58-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -pthread -pthreads -mthreads -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O0-O1-O2-Oz-O3-O3 -march=native11002200330044005500SE +/- 68.30, N = 3SE +/- 39.17, N = 3SE +/- 90.64, N = 3SE +/- 34.48, N = 3SE +/- 93.61, N = 6SE +/- 139.97, N = 65007.755049.344968.104736.914597.655092.20-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -pthread -pthreads -mthreads -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3-O0-O1-O2-Oz-O3-O3 -march=native918273645SE +/- 0.11, N = 5SE +/- 0.19, N = 5SE +/- 0.12, N = 5SE +/- 0.23, N = 5SE +/- 0.21, N = 5SE +/- 0.07, N = 537.1015.6813.9516.8814.3815.22-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -pipe -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLAC-O0-O1-O2-Oz-O3-O3 -march=native1326395265SE +/- 0.15, N = 5SE +/- 0.10, N = 5SE +/- 0.10, N = 5SE +/- 0.14, N = 5SE +/- 0.11, N = 5SE +/- 0.03, N = 557.449.128.6512.278.667.11-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options: -logg -lm

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samples-O2-Oz-O3-O3 -march=native3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 312131212-O2-Oz-O3-O3 -march=native1. (CXX) g++ options: -fopenmp

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time-O0-O1-O2-Oz-O3-O3 -march=native612182430SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 327.5816.3019.8119.7013.2112.78-O0-O1-O2-Oz-march=native1. (CC) gcc options: -lm -lpthread -O3

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To Compile-O0-O1-O2-Oz-O3-O3 -march=native48121620SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 34.248.9311.829.5515.7915.93-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -pedantic -ldl -lz -lm

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To Compile-O0-O1-O2-Oz-O3-O3 -march=native510152025SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.20, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.16, N = 39.2517.8321.2719.2221.6721.87

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O0-O1-O2-Oz-O3-O3 -march=native30060090012001500SE +/- 1.80, N = 3SE +/- 8.91, N = 3SE +/- 6.85, N = 3SE +/- 2.77, N = 3SE +/- 1.53, N = 3SE +/- 0.76, N = 3284.581334.471359.011002.241354.291342.94-O0-O1-O2-Oz-march=native1. (CC) gcc options: -O3 -mavx2

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Local Adaptive Thresholding-O0-O1-O2-Oz-O3-O3 -march=native20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3187081718084-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: HWB Color Space-O0-O1-O2-Oz-O3-O3 -march=native306090120150SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 383150150144150150-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Sharpen-O0-O1-O2-Oz-O3-O3 -march=native306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 361111113108113107-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Blur-O0-O1-O2-Oz-O3-O3 -march=native306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.88, N = 367116119113119117-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lgomp -lpthread

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O0-O1-O2-Oz-O3-O3 -march=native30060090012001500SE +/- 31.80, N = 4SE +/- 33.02, N = 4SE +/- 32.10, N = 4SE +/- 31.89, N = 4SE +/- 32.07, N = 4SE +/- 32.04, N = 41477.571476.741478.301477.781478.191478.72-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O0-O1-O2-Oz-O3-O3 -march=native11002200330044005500SE +/- 72.62, N = 4SE +/- 50.41, N = 4SE +/- 52.48, N = 4SE +/- 52.55, N = 4SE +/- 42.70, N = 4SE +/- 23.42, N = 44133.954187.614137.164181.894127.104901.04-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O0-O1-O2-Oz-O3-O3 -march=native6001200180024003000SE +/- 55.30, N = 4SE +/- 51.42, N = 4SE +/- 47.39, N = 4SE +/- 46.34, N = 4SE +/- 47.81, N = 4SE +/- 44.72, N = 42807.962776.502798.012806.622766.532613.77-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O0-O1-O2-Oz-O3-O3 -march=native80160240320400SE +/- 5.39, N = 4SE +/- 6.22, N = 4SE +/- 1.41, N = 4SE +/- 6.36, N = 4SE +/- 6.29, N = 4SE +/- 6.39, N = 4358.28357.19363.59357.50358.05362.39-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O0-O1-O2-Oz-O3-O3 -march=native50100150200250SE +/- 4.62, N = 4SE +/- 5.20, N = 4SE +/- 4.95, N = 4SE +/- 5.01, N = 4SE +/- 5.13, N = 4SE +/- 4.92, N = 4238.14237.83234.76237.11237.63235.58-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O0-O1-O2-Oz-O3-O3 -march=native400800120016002000SE +/- 20.39, N = 4SE +/- 14.83, N = 4SE +/- 11.55, N = 4SE +/- 12.05, N = 4SE +/- 16.52, N = 4SE +/- 9.31, N = 41803.181807.181802.361812.181793.501918.30-O0-O1-O2-Oz-O3-O3 -march=native1. (CXX) g++ options:

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O0-O1-O2-Oz-O3-O3 -march=native48121620SE +/- 0.05, N = 3SE +/- 0.19, N = 3SE +/- 0.60, N = 6SE +/- 0.44, N = 6SE +/- 0.29, N = 6SE +/- 0.46, N = 612.1815.9914.0215.0915.1314.80-O0-O1-O2-Oz-O3-O3 -march=native1. (CC) gcc options: -pthread -lhmmer -lsquid -lm


Phoronix Test Suite v10.8.5