GCC AMD Ryzen Zen znver1 Compiler Optimizations

AMD Ryzen 7 1800X Eight-Core testing for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1703031-RI-GCCAMDRYZ39

Run Management

Result Identifier       Date Run
-O3                     March 02 2017
-O3 -march=k8-sse3      March 03 2017
-O3 -march=bdver1       March 02 2017
-O3 -march=bdver4       March 02 2017
-O3 -march=znver1       March 02 2017

Processor: AMD Ryzen 7 1800X Eight-Core @ 3.60GHz (16 Cores)
Motherboard: MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0
Chipset: AMD Device 1450
Memory: 16384MB
Disk: 256GB INTEL SSDPEKKW256G7
Graphics: Sapphire AMD Radeon R9 FURY / NANO 4096MB
Audio: AMD Fiji HDMI/DP
Monitor: DELL P2415Q
Network: Intel I211 Gigabit Connection
OS: Ubuntu 17.04
Kernel: 4.10.0-9-generic (x86_64)
Desktop: Unity 7.5.0
Display Server: X Server 1.18.4
Display Driver: modesetting 1.18.4
OpenGL: 4.5 Mesa 17.0.0 - padoka PPA Gallium 0.4 (LLVM 4.0.0)
Vulkan: 1.0.39
Compiler: GCC 6.3.0 20161229
File-System: ext4
Screen Resolution: 3840x2160

System Logs

- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v
- Scaling Governor: acpi-cpufreq performance

[Chart: Result Overview (Phoronix Test Suite). Relative performance on a 100% to 109% scale for -O3, -O3 -march=k8-sse3, -O3 -march=bdver1, -O3 -march=bdver4 and -O3 -march=znver1 across Timed Apache Compilation, Himeno Benchmark, SciMark, John The Ripper and Timed ImageMagick Compilation.]

GCC AMD Ryzen Zen znver1 Compiler Optimizations (a "-" marks a test that has no result for that run)

Test                                                        -O3            -O3 -march=k8-sse3  -O3 -march=bdver1  -O3 -march=bdver4  -O3 -march=znver1
fftw: Float + SSE - 2D FFT Size 1024                        21035          20628               -                  -                  22209
hmmer: Pfam Database Search                                 7.11           7.20                -                  -                  7.12
scimark2: Composite                                         1585.02        1533.07             1605.68            1602.85            1588.29
scimark2: Monte Carlo                                       738.39         727.86              728.45             727.88             738.77
scimark2: Fast Fourier Transform                            157.64         168.82              150.42             149.47             148.46
scimark2: Sparse Matrix Multiply                            2421.08        2524.93             2518.05            2519.53            2568.32
scimark2: Dense LU Matrix Factorization                     3436.15        3071.16             3460.15            3446.64            3310.83
scimark2: Jacobi Successive Over-Relaxation                 1171.83        1172.56             1171.31            1170.73            1175.08
john-the-ripper: Blowfish                                   12798          12829               12878              12887              12881
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping   342.43         343.39              355.79             -                  355.26
graphics-magick: Blur                                       186            173                 -                  -                  192
graphics-magick: Sharpen                                    187            157                 -                  -                  204
graphics-magick: Resizing                                   241            241                 -                  -                  252
graphics-magick: HWB Color Space                            254            254                 -                  -                  261
graphics-magick: Local Adaptive Thresholding                140            140                 -                  -                  143
himeno: Poisson Pressure Solver                             1195.38        1190.23             1194.42            1186.02            1135.46
build-apache: Time To Compile                               24.33          24.51               26.48              26.38              26.32
build-imagemagick: Time To Compile                          177.31         177.93              177.46             178.17             177.73
c-ray: Total Time                                           8.17           12.36               -                  -                  7.64
stockfish: Total Time                                       3615           3608                3623               -                  3611
encode-flac: WAV To FLAC                                    5.22           5.71                -                  -                  5.70
encode-mp3: WAV To MP3                                      9.00           9.99                -                  -                  8.73
tjbench: Decompression Throughput                           178.19         178.13              -                  -                  180.57
hint: FLOAT                                                 333762114.34   332568933.81        -                  -                  333251163.50
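
The raw numbers in this result file mix "more is better" and "fewer is better" units, so it can help to express each run as a percentage change against the plain -O3 run. The following is a minimal sketch of that arithmetic, not something the Phoronix Test Suite itself provides; the function name `relative_gain` is hypothetical, and the sample values are the C-Ray and FFTW results copied from this file.

```python
def relative_gain(baseline, value, higher_is_better):
    """Return the percent improvement of `value` over `baseline`.

    Positive means the run beat the baseline, negative means it lost to it.
    """
    if higher_is_better:
        ratio = value / baseline
    else:
        # For time-based results a smaller value is the better result.
        ratio = baseline / value
    return (ratio - 1.0) * 100.0


# Values copied from this result file; the -O3 run is used as the baseline.
c_ray_seconds = {"-O3": 8.17, "-O3 -march=k8-sse3": 12.36, "-O3 -march=znver1": 7.64}
fftw_mflops = {"-O3": 21035, "-O3 -march=k8-sse3": 20628, "-O3 -march=znver1": 22209}

for name, seconds in c_ray_seconds.items():
    gain = relative_gain(c_ray_seconds["-O3"], seconds, higher_is_better=False)
    print(f"C-Ray {name}: {gain:+.1f}%")

for name, mflops in fftw_mflops.items():
    gain = relative_gain(fftw_mflops["-O3"], mflops, higher_is_better=True)
    print(f"FFTW  {name}: {gain:+.1f}%")
```

Inverting the ratio for time-based tests keeps the sign convention the same for every test: a positive percentage always means the run was better than -O3.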

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.4, Build: Float + SSE - Size: 2D FFT Size 1024 (Mflops, More Is Better)

-O3                   21035   SE +/- 94.45, N = 5
-O3 -march=k8-sse3    20628   SE +/- 117.99, N = 5
-O3 -march=znver1     22209   SE +/- 80.85, N = 5

1. (CC) gcc options: -O3 -lm

Timed HMMer Search

Timed HMMer Search 2.3.2, Pfam Database Search (Seconds, Fewer Is Better)

-O3                   7.11   SE +/- 0.04, N = 3
-O3 -march=k8-sse3    7.20   SE +/- 0.02, N = 3
-O3 -march=znver1     7.12   SE +/- 0.02, N = 3

1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm

SciMark

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)

-O3                   1585.02   SE +/- 7.65, N = 4
-O3 -march=k8-sse3    1533.07   SE +/- 5.25, N = 4
-O3 -march=bdver1     1605.68   SE +/- 7.57, N = 4
-O3 -march=bdver4     1602.85   SE +/- 6.07, N = 4
-O3 -march=znver1     1588.29   SE +/- 8.26, N = 4

SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)

-O3                   738.39   SE +/- 0.06, N = 4
-O3 -march=k8-sse3    727.86   SE +/- 0.14, N = 4
-O3 -march=bdver1     728.45   SE +/- 0.13, N = 4
-O3 -march=bdver4     727.88   SE +/- 0.29, N = 4
-O3 -march=znver1     738.77   SE +/- 0.14, N = 4

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)

-O3                   157.64   SE +/- 0.99, N = 4
-O3 -march=k8-sse3    168.82   SE +/- 0.44, N = 4
-O3 -march=bdver1     150.42   SE +/- 0.28, N = 4
-O3 -march=bdver4     149.47   SE +/- 0.41, N = 4
-O3 -march=znver1     148.46   SE +/- 0.21, N = 4

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)

-O3                   2421.08   SE +/- 7.04, N = 4
-O3 -march=k8-sse3    2524.93   SE +/- 14.04, N = 4
-O3 -march=bdver1     2518.05   SE +/- 25.26, N = 4
-O3 -march=bdver4     2519.53   SE +/- 14.04, N = 4
-O3 -march=znver1     2568.32   SE +/- 22.07, N = 4

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)

-O3                   3436.15   SE +/- 36.57, N = 4
-O3 -march=k8-sse3    3071.16   SE +/- 20.09, N = 4
-O3 -march=bdver1     3460.15   SE +/- 18.20, N = 4
-O3 -march=bdver4     3446.64   SE +/- 21.23, N = 4
-O3 -march=znver1     3310.83   SE +/- 47.79, N = 4

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)

-O3                   1171.83   SE +/- 1.51, N = 4
-O3 -march=k8-sse3    1172.56   SE +/- 1.23, N = 4
-O3 -march=bdver1     1171.31   SE +/- 1.02, N = 4
-O3 -march=bdver4     1170.73   SE +/- 0.71, N = 4
-O3 -march=znver1     1175.08   SE +/- 0.14, N = 4

1. (CC) gcc options: -O3 -lm

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.8.0, Test: Blowfish (Real C/S, More Is Better)

-O3                   12798   SE +/- 104.33, N = 3
-O3 -march=k8-sse3    12829   SE +/- 23.33, N = 3
-O3 -march=bdver1     12878   SE +/- 13.04, N = 3
-O3 -march=bdver4     12887   SE +/- 7.77, N = 3
-O3 -march=znver1     12881   SE +/- 13.04, N = 3

1. (CC) gcc options: -fopenmp -lcrypt

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

TTSIOD 3D Renderer 2.3a, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)

-O3                   342.43   SE +/- 0.71, N = 3
-O3 -march=k8-sse3    343.39   SE +/- 0.35, N = 3
-O3 -march=bdver1     355.79   SE +/- 1.48, N = 3
-O3 -march=znver1     355.26   SE +/- 0.60, N = 3

1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.19, Operation: Blur (Iterations Per Minute, More Is Better)

-O3                   186   SE +/- 0.67, N = 3
-O3 -march=k8-sse3    173   SE +/- 0.33, N = 3
-O3 -march=znver1     192   SE +/- 0.58, N = 3

GraphicsMagick 1.3.19, Operation: Sharpen (Iterations Per Minute, More Is Better)

-O3                   187
-O3 -march=k8-sse3    157
-O3 -march=znver1     204

GraphicsMagick 1.3.19, Operation: Resizing (Iterations Per Minute, More Is Better)

-O3                   241
-O3 -march=k8-sse3    241
-O3 -march=znver1     252

GraphicsMagick 1.3.19, Operation: HWB Color Space (Iterations Per Minute, More Is Better)

-O3                   254
-O3 -march=k8-sse3    254
-O3 -march=znver1     261

GraphicsMagick 1.3.19, Operation: Local Adaptive Thresholding (Iterations Per Minute, More Is Better)

-O3                   140
-O3 -march=k8-sse3    140
-O3 -march=znver1     143

1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)

-O3                   1195.38   SE +/- 0.54, N = 3
-O3 -march=k8-sse3    1190.23   SE +/- 0.85, N = 3
-O3 -march=bdver1     1194.42   SE +/- 0.95, N = 3
-O3 -march=bdver4     1186.02   SE +/- 0.53, N = 3
-O3 -march=znver1     1135.46   SE +/- 0.39, N = 3

1. (CC) gcc options: -O3 -mavx2

Timed Apache Compilation

This test times how long it takes to build the Apache HTTP Server. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation 2.4.7, Time To Compile (Seconds, Fewer Is Better)

-O3                   24.33   SE +/- 0.15, N = 3
-O3 -march=k8-sse3    24.51   SE +/- 0.20, N = 3
-O3 -march=bdver1     26.48   SE +/- 0.21, N = 3
-O3 -march=bdver4     26.38   SE +/- 0.03, N = 3
-O3 -march=znver1     26.32   SE +/- 0.03, N = 3

Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.

Timed ImageMagick Compilation 6.9.0, Time To Compile (Seconds, Fewer Is Better)

-O3                   177.31   SE +/- 0.26, N = 3
-O3 -march=k8-sse3    177.93   SE +/- 0.33, N = 3
-O3 -march=bdver1     177.46   SE +/- 0.18, N = 3
-O3 -march=bdver4     178.17   SE +/- 0.28, N = 3
-O3 -march=znver1     177.73   SE +/- 0.93, N = 3
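
The standard errors (SE) reported with each result give a rough way to judge whether two runs really differ or are within run-to-run noise. The sketch below is one simple heuristic, not the method the Phoronix Test Suite uses for its own noise filtering: it treats the two runs as independent and asks whether the gap exceeds `k` combined standard errors. The function name `differs_beyond_noise` and the threshold `k=2.0` are assumptions chosen for illustration; the sample values are the two compilation tests from this file.

```python
import math


def differs_beyond_noise(a, se_a, b, se_b, k=2.0):
    """Return True when |a - b| exceeds k combined standard errors.

    The combined standard error assumes the two runs are independent.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(a - b) > k * combined_se


# Values copied from this result file (seconds, fewer is better).
# Timed ImageMagick Compilation: -O3 vs -O3 -march=znver1
imagemagick = differs_beyond_noise(177.31, 0.26, 177.73, 0.93)
# Timed Apache Compilation: -O3 vs -O3 -march=znver1
apache = differs_beyond_noise(24.33, 0.15, 26.32, 0.03)

print("ImageMagick build differs beyond noise:", imagemagick)
print("Apache build differs beyond noise:", apache)
```

With these inputs the ImageMagick build-time gap stays inside the threshold, while the Apache build-time gap is well outside it. With only three samples per run, this should be read as a coarse screen rather than a significance test.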

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1, Total Time (Seconds, Fewer Is Better)

-O3                   8.17    SE +/- 0.00, N = 3
-O3 -march=k8-sse3    12.36   SE +/- 0.01, N = 3
-O3 -march=znver1     7.64    SE +/- 0.00, N = 3

1. (CC) gcc options: -lm -lpthread -O3

Stockfish

Stockfish 2014-11-26, Total Time (ms, Fewer Is Better)

-O3                   3615   SE +/- 0.88, N = 3
-O3 -march=k8-sse3    3608   SE +/- 5.46, N = 3
-O3 -march=bdver1     3623   SE +/- 1.45, N = 3
-O3 -march=znver1     3611   SE +/- 1.00, N = 3

1. (CXX) g++ options: -lpthread -O3 -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt -flto

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format three times. Learn more via the OpenBenchmarking.org test page.

FLAC Audio Encoding 1.3.1, WAV To FLAC (Seconds, Fewer Is Better)

-O3                   5.22   SE +/- 0.00, N = 5
-O3 -march=k8-sse3    5.71   SE +/- 0.00, N = 5
-O3 -march=znver1     5.70   SE +/- 0.00, N = 5

1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

LAME MP3 Encoding

LAME MP3 Encoding 3.99.3, WAV To MP3 (Seconds, Fewer Is Better)

-O3                   9.00   SE +/- 0.02, N = 5
-O3 -march=k8-sse3    9.99   SE +/- 0.01, N = 5
-O3 -march=znver1     8.73   SE +/- 0.01, N = 5

1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

libjpeg-turbo tjbench

libjpeg-turbo tjbench 1.5.1, Test: Decompression Throughput (Megapixels/sec, More Is Better)

-O3                   178.19   SE +/- 0.22, N = 3
-O3 -march=k8-sse3    178.13   SE +/- 0.19, N = 3
-O3 -march=znver1     180.57   SE +/- 1.16, N = 3

1. (CC) gcc options: -O3 -lm

Hierarchical INTegration

Hierarchical INTegration 1.0, Test: FLOAT (QUIPs, More Is Better)

-O3                   333762114.34   SE +/- 1715194.48, N = 3
-O3 -march=k8-sse3    332568933.81   SE +/- 120531.27, N = 3
-O3 -march=znver1     333251163.50   SE +/- 3555529.86, N = 3

1. (CC) gcc options: -O3 -lm