Clang 6.0 AMD EPYC Tuning Comparison

vm-other Xen 4.9.0 Hypervisor testing on Ubuntu 17.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1801029-AL-CLANG60AM37&sro&grt.

Clang 6.0 AMD EPYC Tuning ComparisonProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay DriverOpenCLCompilerFile-SystemScreen ResolutionSystem Layer-march=znver1-march=haswell-march=x86-64AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores)TYAN B8026T70AE24HRAMD Device 1450126976MB280GB INTEL SSDPE21D280GAASPEED ASPEED FamilyVE228Broadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 17.104.13.0-21-generic (x86_64)GNOME Shell 3.26.1modesetting 1.19.5OpenCL 1.2 pocl 1.0 LLVM 5.0.0Clang 6.0.0 (SVN 321623) + LLVM 6.0.0svnext41920x1080vm-other Xen 4.9.0 HypervisorOpenBenchmarking.orgCompiler Details- Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver1 Disk Details- NONE / data=ordered,errors=remount-ro,relatime,rw

Clang 6.0 AMD EPYC Tuning Comparisonapache: Static Web Page Servingbullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshc-ray: Total Timeebizzy: fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 4096encode-flac: WAV To FLACgraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverencode-mp3: WAV To MP3polybench-c: 3 Matrix Multiplicationsscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationsqlite: Default Test Directoryhmmer: Pfam Database Searchtscp: AI Chess Performance-march=znver1-march=haswell-march=x86-649663.933.185.346.085.313.231.091.324.4811454055031.60124816.63104136155981052.4712.8162.751699.32552.19226.682258.644034.891424.217.4811.099182699410.043.085.386.055.013.231.091.264.4911205704839.53123936.69102135155941031.6812.8362.331739.34555.76226.732207.314285.181421.727.2813.839179639531.433.225.486.305.433.281.101.334.5310766484660.83136497.94101131150971032.7111.3362.981479.53531.38179.292190.103190.431110.657.5312.85917658OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page Serving-march=haswell-march=x86-64-march=znver12K4K6K8K10KSE +/- 12.51, N = 3SE +/- 161.49, N = 3SE +/- 121.20, N = 39410.049531.439663.93-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -shared -fPIC -pthread -O3

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-march=haswell-march=x86-64-march=znver10.72451.4492.17352.8983.6225SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.083.223.18-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-march=haswell-march=x86-64-march=znver11.2332.4663.6994.9326.165SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.385.485.34-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-march=haswell-march=x86-64-march=znver1246810SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 36.056.306.08-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-march=haswell-march=x86-64-march=znver11.22182.44363.66544.88726.109SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.015.435.31-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-march=haswell-march=x86-64-march=znver10.7381.4762.2142.9523.69SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.233.283.23-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-march=haswell-march=x86-64-march=znver10.24750.4950.74250.991.2375SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.091.101.09-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-march=haswell-march=x86-64-march=znver10.29930.59860.89791.19721.4965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.261.331.32-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time-march=haswell-march=x86-64-march=znver11.01932.03863.05794.07725.0965SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 34.494.534.48-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -lm -lpthread -O3

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3-march=haswell-march=x86-64-march=znver1200K400K600K800K1000KSE +/- 17039.32, N = 5SE +/- 12350.97, N = 3SE +/- 17141.58, N = 6112057010766481145405-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -pthread -lpthread -O3 -march=native

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-march=haswell-march=x86-64-march=znver111002200330044005500SE +/- 19.26, N = 3SE +/- 57.25, N = 3SE +/- 17.93, N = 34839.534660.835031.60-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-march=haswell-march=x86-64-march=znver13K6K9K12K15KSE +/- 34.04, N = 3SE +/- 100.95, N = 3SE +/- 76.70, N = 3123931364912481-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -pthread -O3 -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLAC-march=haswell-march=x86-64-march=znver1246810SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 56.697.946.63-march=haswell-march=x86-64-march=znver11. (CXX) g++ options: -O3 -logg -lm

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Blur-march=haswell-march=x86-64-march=znver120406080100SE +/- 0.33, N = 3SE +/- 0.33, N = 3102101104-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Sharpen-march=haswell-march=x86-64-march=znver1306090120150SE +/- 0.67, N = 3SE +/- 0.67, N = 3135131136-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: HWB Color Space-march=haswell-march=x86-64-march=znver1306090120150SE +/- 0.67, N = 3SE +/- 1.20, N = 3SE +/- 0.88, N = 3155150155-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Local Adaptive Thresholding-march=haswell-march=x86-64-march=znver120406080100SE +/- 0.33, N = 3949798-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-march=haswell-march=x86-64-march=znver12004006008001000SE +/- 0.02, N = 3SE +/- 1.46, N = 3SE +/- 1.50, N = 31031.681032.711052.47-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -mavx2

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.5WAV To MP3-march=haswell-march=x86-64-march=znver13691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.01, N = 512.8311.3312.81-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 3.2Test: 3 Matrix Multiplications-march=haswell-march=x86-64-march=znver11428425670SE +/- 0.29, N = 3SE +/- 0.24, N = 3SE +/- 0.26, N = 362.3362.9862.75-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -march=native

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-march=haswell-march=x86-64-march=znver1400800120016002000SE +/- 19.46, N = 4SE +/- 36.18, N = 8SE +/- 25.95, N = 41739.341479.531699.32-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-march=haswell-march=x86-64-march=znver1120240360480600SE +/- 0.02, N = 4SE +/- 0.05, N = 4SE +/- 0.05, N = 4555.76531.38552.19-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-march=haswell-march=x86-64-march=znver150100150200250SE +/- 0.19, N = 4SE +/- 45.12, N = 4SE +/- 0.38, N = 4226.73179.29226.68-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-march=haswell-march=x86-64-march=znver15001000150020002500SE +/- 10.64, N = 4SE +/- 11.89, N = 4SE +/- 29.07, N = 42207.312190.102258.64-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-march=haswell-march=x86-64-march=znver19001800270036004500SE +/- 87.85, N = 4SE +/- 17.92, N = 4SE +/- 135.63, N = 44285.183190.434034.89-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-march=haswell-march=x86-64-march=znver130060090012001500SE +/- 0.59, N = 4SE +/- 310.16, N = 4SE +/- 0.49, N = 41421.721110.651424.21-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SQLite

Test Target: Default Test Directory

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.8.10.2Test Target: Default Test Directory-march=haswell-march=x86-64-march=znver1246810SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 67.287.537.48-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -ldl -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-march=haswell-march=x86-64-march=znver148121620SE +/- 1.36, N = 6SE +/- 1.28, N = 6SE +/- 0.12, N = 313.8312.8511.09-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-march=haswell-march=x86-64-march=znver1200K400K600K800K1000KSE +/- 572.37, N = 5SE +/- 306.40, N = 5917963917658918269-march=haswell-march=x86-64-march=znver11. (CC) gcc options: -O3 -march=native


Phoronix Test Suite v10.8.4