AOCC 1.0 Compiler Tuning

AMD Ryzen 7 1700 Eight-Core testing with an MSI B350 TOMAHAWK (MS-7A34) v1.0 and HIS AMD Radeon HD 7750/8740 / R7 250E 1024MB on Ubuntu 17.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1705218-TR-AOCC10COM39&grr&sor.

AOCC 1.0 Compiler Tuning: System Details
Processor: AMD Ryzen 7 1700 Eight-Core @ 3.00GHz (16 Cores)
Motherboard: MSI B350 TOMAHAWK (MS-7A34) v1.0
Chipset: AMD Device 1450
Memory: 16384MB
Disk: 120GB Samsung SSD 840
Graphics: HIS AMD Radeon HD 7750/8740 / R7 250E 1024MB
Audio: AMD Cape Verde/Pitcairn
Monitor: DELL S2409W
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 17.04
Kernel: 4.12.0-999-generic (x86_64) 20170518
Desktop: Unity 7.5.0
Display Driver: modesetting 1.19.3
Compiler: Clang 4.0.0
File-System: ext4
Screen Resolution: 1920x1080

Tested configurations (identical system details for all):
-O0
-O2
-O3
-O3 -march=znver1
-O3 -march=znver1 -mllvm -enable-strided-vectorization
-Ofast -march=znver1

Compiler Details: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver1
Processor Details: Scaling Governor: acpi-cpufreq ondemand

AOCC 1.0 Compiler Tuning: Result Overview
Test | -O0 | -O2 | -O3 | -O3 -march=znver1 | -O3 -march=znver1 -mllvm -enable-strided-vectorization | -Ofast -march=znver1
redis: SET | 890829.00 | 1377516.08 | 1399320.00 | 1379585.37 | 1406611.75 | 1386323.96
redis: GET | 1204370.92 | 1983766.97 | 1971325.87 | 1945705.79 | 2008848.23 | 1953486.67
pgbench: Buffer Test - Heavy Contention - Read Write | 1985.80 | 1903.00 | 2037.20 | 1952.70 | 1932.86 |
pgbench: Buffer Test - Single Thread - Read Write | 201.67 | 225.06 | 226.94 | 225.61 | 226.38 |
pgbench: Buffer Test - Normal Load - Read Write | 1865.82 | 1906.40 | 1942.42 | 1930.21 | 1886.70 |
tjbench: Decompression Throughput | 71.84 | 161.68 | 162.59 | 168.45 | 168.66 | 165.24
openssl: RSA 4096-bit Performance | 986.70 | 987.67 | 986.93 | 987.43 | 986.73 | 986.43
encode-wavpack: WAV To WavPack | 7.76 | 6.52 | 6.51 | 6.43 | | 6.46
encode-mp3: WAV To MP3 | 36.48 | 9.40 | 9.42 | 10.61 | | 10.61
encode-flac: WAV To FLAC | 62.12 | 6.80 | 6.78 | 5.64 | | 6.26
stockfish: Total Time | 3711 | 3712 | 3703 | 3643 | 3644 | 3618
c-ray: Total Time | 32.30 | 14.01 | 14.00 | 13.49 | 13.49 | 13.41
himeno: Poisson Pressure Solver | 325.64 | 1157.38 | 1150.22 | 1133.27 | 1133.24 | 1036.50
graphics-magick: Local Adaptive Thresholding | 28 | 133 | 135 | 135 | 141 | 135
graphics-magick: HWB Color Space | 72 | 163 | 166 | 149 | 157 | 161
graphics-magick: Resizing | 49 | 132 | 128 | 137 | 143 | 138
graphics-magick: Sharpen | 21 | 57 | 58 | 59 | 60 | 64
graphics-magick: Blur | 46 | 107 | 106 | 102 | 106 | 100
tscp: AI Chess Performance | 475710 | 1016572 | 1053919 | 1021094 | 1021094 | 1029891
scimark2: Jacobi Successive Over-Relaxation | 1673.77 | 1675.22 | 1676.24 | 1682.07 | 1681.87 | 1680.49
scimark2: Dense LU Matrix Factorization | 5637.05 | 5554.97 | 5859.93 | 5670.42 | 5605.43 | 5644.15
scimark2: Sparse Matrix Multiply | 2648.16 | 2646.12 | 2667.90 | 2636.32 | 2616.74 | 2619.99
scimark2: Fast Fourier Transform | 134.67 | 134.06 | 134.23 | 131.50 | 134.46 | 135.39
scimark2: Monte Carlo | 643.45 | 642.37 | 643.43 | 660.30 | 659.88 | 659.67
scimark2: Composite | 2147.42 | 2130.55 | 2196.34 | 2156.12 | 2139.67 | 2147.94
mafft: Multiple Sequence Alignment | 3.87 | 3.63 | 3.77 | 3.66 | 3.78 | 3.82
fftw: Float + SSE - 2D FFT Size 1024 | 2434.70 | 20190 | 20297 | 20171 | 20433 | 20676

Redis

Test: SET

OpenBenchmarking.org: Redis 3.0.1, Test: SET (Requests Per Second, More Is Better)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 1406611.75 (SE +/- 10024.40, N = 3)
-O3: 1399320.00 (SE +/- 6804.89, N = 3)
-Ofast -march=znver1: 1386323.96 (SE +/- 1282.46, N = 3)
-O3 -march=znver1: 1379585.37 (SE +/- 13825.40, N = 3)
-O2: 1377516.08 (SE +/- 8504.02, N = 3)
-O0: 890829.00 (SE +/- 12621.48, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl
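
The gap between -O0 and the optimized builds is easier to read as a ratio. The following is a minimal Python sketch, assuming the Redis SET figures above are copied by hand into a dictionary; the names `redis_set` and `relative_to_baseline` are illustrative and not part of the Phoronix Test Suite.

```python
# Requests per second for Redis SET, copied from the result above.
redis_set = {
    "-O0": 890829.00,
    "-O2": 1377516.08,
    "-O3": 1399320.00,
    "-O3 -march=znver1": 1379585.37,
    "-O3 -march=znver1 -mllvm -enable-strided-vectorization": 1406611.75,
    "-Ofast -march=znver1": 1386323.96,
}


def relative_to_baseline(results, baseline="-O0"):
    """Divide every result by the baseline result (for a more-is-better metric)."""
    base = results[baseline]
    return {flags: value / base for flags, value in results.items()}


ratios = relative_to_baseline(redis_set)

# Print the configurations from fastest to slowest.
for flags, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{ratio:.2f}x  {flags}")
```

For a fewer-is-better metric such as seconds, the ratio would need to be inverted (baseline divided by result) to keep "larger means faster".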

Redis

Test: GET

OpenBenchmarking.org: Redis 3.0.1, Test: GET (Requests Per Second, More Is Better)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 2008848.23 (SE +/- 34111.46, N = 6)
-O2: 1983766.97 (SE +/- 32907.71, N = 4)
-O3: 1971325.87 (SE +/- 15178.56, N = 3)
-Ofast -march=znver1: 1953486.67 (SE +/- 18769.12, N = 3)
-O3 -march=znver1: 1945705.79 (SE +/- 13206.56, N = 3)
-O0: 1204370.92 (SE +/- 4609.53, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write

OpenBenchmarking.org: PostgreSQL pgbench 9.6.3, Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write (TPS, More Is Better)
-O3: 2037.20 (SE +/- 8.58, N = 3)
-O0: 1985.80 (SE +/- 31.25, N = 3)
-O3 -march=znver1: 1952.70 (SE +/- 32.80, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 1932.86 (SE +/- 27.83, N = 5)
-O2: 1903.00 (SE +/- 24.85, N = 3)
Per-configuration options as listed: -O3 -lpgcommon -lpgport -lrt -lcrypt -ldl -lm; -O0 -shared; -O3 -march=znver1 -shared; -march=znver1 -O3 -mllvm -shared; -O2 -shared
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -fpic
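
The pgbench results sit close together relative to their standard errors, so it can help to express a difference in units of its own standard error. The sketch below is a rough heuristic rather than anything the Phoronix Test Suite reports: it assumes the two sets of runs are independent and combines their standard errors in quadrature. The names `pgbench_heavy` and `difference_in_standard_errors` are illustrative.

```python
import math

# (mean TPS, standard error) for pgbench Heavy Contention, copied from the result above.
pgbench_heavy = {
    "-O3": (2037.20, 8.58),
    "-O0": (1985.80, 31.25),
    "-O3 -march=znver1": (1952.70, 32.80),
    "-O3 -march=znver1 -mllvm -enable-strided-vectorization": (1932.86, 27.83),
    "-O2": (1903.00, 24.85),
}


def difference_in_standard_errors(a, b):
    """Difference of two means divided by the standard error of that difference.

    Assumes the two measurements are independent, so their standard
    errors add in quadrature.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    return (mean_a - mean_b) / math.hypot(se_a, se_b)


z = difference_in_standard_errors(pgbench_heavy["-O3"], pgbench_heavy["-O0"])
print(f"-O3 vs -O0: {z:.2f} standard errors apart")
```

With only three to five runs per configuration the standard errors themselves are coarse estimates, so a value below roughly 2 is better read as "not clearly separated" than as proof that two configurations perform the same.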

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.org: PostgreSQL pgbench 9.6.3, Scaling: Buffer Test - Test: Single Thread - Mode: Read Write (TPS, More Is Better)
-O3: 226.94 (SE +/- 0.29, N = 3)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 226.38 (SE +/- 0.93, N = 3)
-O3 -march=znver1: 225.61 (SE +/- 0.74, N = 3)
-O2: 225.06 (SE +/- 0.26, N = 3)
-O0: 201.67 (SE +/- 2.36, N = 3)
Per-configuration options as listed: -O3 -lpgcommon -lpgport -lrt -lcrypt -ldl -lm; -march=znver1 -O3 -mllvm -shared; -O3 -march=znver1 -shared; -O2 -shared; -O0 -shared
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -fpic

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.org: PostgreSQL pgbench 9.6.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)
-O3: 1942.42 (SE +/- 27.77, N = 6)
-O3 -march=znver1: 1930.21 (SE +/- 31.64, N = 6)
-O2: 1906.40 (SE +/- 32.61, N = 6)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 1886.70 (SE +/- 36.05, N = 6)
-O0: 1865.82 (SE +/- 42.77, N = 6)
Per-configuration options as listed: -O3 -lpgcommon -lpgport -lrt -lcrypt -ldl -lm; -O3 -march=znver1 -shared; -O2 -shared; -march=znver1 -O3 -mllvm -shared; -O0 -shared
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -fpic

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.org: libjpeg-turbo tjbench 1.5.1, Test: Decompression Throughput (Megapixels/sec, More Is Better)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 168.66 (SE +/- 0.01, N = 3)
-O3 -march=znver1: 168.45 (SE +/- 0.04, N = 3)
-Ofast -march=znver1: 165.24 (SE +/- 1.75, N = 3)
-O3: 162.59 (SE +/- 0.07, N = 3)
-O2: 161.68 (SE +/- 0.15, N = 3)
-O0: 71.84 (SE +/- 0.02, N = 3)
Per-configuration (CC) gcc options as listed: -march=znver1 -O3 -lm; -O3 -march=znver1 -lm; -Ofast -march=znver1 -lm; -O3 -lm; -O2; -O0 -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.org: OpenSSL 1.0.1g, RSA 4096-bit Performance (Signs Per Second, More Is Better)
-O2: 987.67 (SE +/- 0.49, N = 3)
-O3 -march=znver1: 987.43 (SE +/- 0.57, N = 3)
-O3: 986.93 (SE +/- 0.58, N = 3)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 986.73 (SE +/- 0.85, N = 3)
-O0: 986.70 (SE +/- 0.85, N = 3)
-Ofast -march=znver1: 986.43 (SE +/- 0.77, N = 3)
1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.org: WavPack Audio Encoding 5.1, WAV To WavPack (Seconds, Fewer Is Better)
-O3 -march=znver1: 6.43 (SE +/- 0.00, N = 5)
-Ofast -march=znver1: 6.46 (SE +/- 0.01, N = 5)
-O3: 6.51 (SE +/- 0.00, N = 5)
-O2: 6.52 (SE +/- 0.00, N = 5)
-O0: 7.76 (SE +/- 0.00, N = 5)
Per-configuration options as listed: -O3 -march=znver1; -Ofast -march=znver1; -O3; -O2; -O0
1. (CC) gcc options: -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.org: LAME MP3 Encoding 3.99.3, WAV To MP3 (Seconds, Fewer Is Better)
-O2: 9.40 (SE +/- 0.01, N = 5)
-O3: 9.42 (SE +/- 0.00, N = 5)
-O3 -march=znver1: 10.61 (SE +/- 0.00, N = 5)
-Ofast -march=znver1: 10.61 (SE +/- 0.00, N = 5)
-O0: 36.48 (SE +/- 0.01, N = 5)
Per-configuration options as listed: -O2; -march=znver1; -Ofast -march=znver1; -O0
1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.org: FLAC Audio Encoding 1.3.1, WAV To FLAC (Seconds, Fewer Is Better)
-O3 -march=znver1: 5.64 (SE +/- 0.00, N = 5)
-Ofast -march=znver1: 6.26 (SE +/- 0.00, N = 5)
-O3: 6.78 (SE +/- 0.00, N = 5)
-O2: 6.80 (SE +/- 0.01, N = 5)
-O0: 62.12 (SE +/- 0.01, N = 5)
Per-configuration options as listed: -Ofast -march=znver1; -O3; -O2; -O0
1. (CXX) g++ options: -logg -lm

Stockfish

Total Time

OpenBenchmarking.org: Stockfish 2014-11-26, Total Time (ms, Fewer Is Better)
-Ofast -march=znver1: 3618 (SE +/- 4.98, N = 3)
-O3 -march=znver1: 3643 (SE +/- 5.00, N = 3)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 3644 (SE +/- 8.08, N = 3)
-O3: 3703 (SE +/- 9.53, N = 3)
-O0: 3711 (SE +/- 7.88, N = 3)
-O2: 3712 (SE +/- 7.75, N = 3)
Per-configuration options as listed: -Ofast -march=znver1; -march=znver1; -march=znver1 -mllvm; -O0; -O2
1. (CXX) g++ options: -lpthread -fno-exceptions -fno-rtti -ansi -pedantic -O3 -msse -msse3 -mpopcnt

C-Ray

Total Time

OpenBenchmarking.org: C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
-Ofast -march=znver1: 13.41 (SE +/- 0.00, N = 3)
-O3 -march=znver1: 13.49 (SE +/- 0.02, N = 3)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 13.49 (SE +/- 0.00, N = 3)
-O3: 14.00 (SE +/- 0.01, N = 3)
-O2: 14.01 (SE +/- 0.00, N = 3)
-O0: 32.30 (SE +/- 0.01, N = 3)
Per-configuration options as listed: -Ofast -march=znver1; -march=znver1; -march=znver1 -mllvm; -O2; -O0
1. (CC) gcc options: -lm -lpthread -O3

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.org: Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
-O2: 1157.38 (SE +/- 1.11, N = 3)
-O3: 1150.22 (SE +/- 0.19, N = 3)
-O3 -march=znver1: 1133.27 (SE +/- 0.82, N = 3)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 1133.24 (SE +/- 0.76, N = 3)
-Ofast -march=znver1: 1036.50 (SE +/- 0.70, N = 3)
-O0: 325.64 (SE +/- 0.62, N = 3)
Per-configuration options as listed: -O2; -march=znver1; -march=znver1 -mllvm; -Ofast -march=znver1; -O0
1. (CC) gcc options: -O3 -mavx2

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.org: GraphicsMagick 1.3.19, Operation: Local Adaptive Thresholding (Iterations Per Minute, More Is Better)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 141
-Ofast -march=znver1: 135
-O3 -march=znver1: 135
-O3: 135
-O2: 133
-O0: 28
Per-configuration options as listed: -march=znver1 -O3 -mllvm -lpng16; -Ofast -march=znver1; -O3 -march=znver1; -O3; -O2; -O0
1. (CC) gcc options: -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.org: GraphicsMagick 1.3.19, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
-O3: 166
-O2: 163
-Ofast -march=znver1: 161
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 157
-O3 -march=znver1: 149
-O0: 72
Per-configuration options as listed: -O3; -O2; -Ofast -march=znver1; -march=znver1 -O3 -mllvm -lpng16; -O3 -march=znver1; -O0
1. (CC) gcc options: -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.org: GraphicsMagick 1.3.19, Operation: Resizing (Iterations Per Minute, More Is Better)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 143
-Ofast -march=znver1: 138
-O3 -march=znver1: 137
-O2: 132
-O3: 128
-O0: 49
Standard errors shown (configurations not indicated): SE +/- 0.67, N = 3; SE +/- 6.17, N = 6
Per-configuration options as listed: -march=znver1 -O3 -mllvm -lpng16; -Ofast -march=znver1; -O3 -march=znver1; -O2; -O3; -O0
1. (CC) gcc options: -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.org: GraphicsMagick 1.3.19, Operation: Sharpen (Iterations Per Minute, More Is Better)
-Ofast -march=znver1: 64
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 60
-O3 -march=znver1: 59
-O3: 58
-O2: 57
-O0: 21
Standard error shown (configuration not indicated): SE +/- 0.33, N = 3
Per-configuration options as listed: -Ofast -march=znver1; -march=znver1 -O3 -mllvm -lpng16; -O3 -march=znver1; -O3; -O2; -O0
1. (CC) gcc options: -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Blur

OpenBenchmarking.org: GraphicsMagick 1.3.19, Operation: Blur (Iterations Per Minute, More Is Better)
-O2: 107
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 106
-O3: 106
-O3 -march=znver1: 102
-Ofast -march=znver1: 100
-O0: 46
Per-configuration options as listed: -O2; -march=znver1 -O3 -mllvm -lpng16; -O3; -O3 -march=znver1; -Ofast -march=znver1; -O0
1. (CC) gcc options: -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

TSCP

AI Chess Performance

OpenBenchmarking.org: TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
-O3: 1053919 (SE +/- 494.31, N = 5)
-Ofast -march=znver1: 1029891 (SE +/- 1971.42, N = 5)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 1021094 (SE +/- 463.44, N = 5)
-O3 -march=znver1: 1021094 (SE +/- 463.44, N = 5)
-O2: 1016572 (SE +/- 701.56, N = 5)
-O0: 475710 (SE +/- 82.20, N = 5)
Per-configuration (CC) gcc options as listed: -O3; -Ofast -march=znver1; -march=znver1 -O3 -mllvm; -O3 -march=znver1; -O2; -O0

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.org: SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
-O3 -march=znver1: 1682.07 (SE +/- 0.29, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 1681.87 (SE +/- 0.52, N = 4)
-Ofast -march=znver1: 1680.49 (SE +/- 1.12, N = 4)
-O3: 1676.24 (SE +/- 0.43, N = 4)
-O2: 1675.22 (SE +/- 0.50, N = 4)
-O0: 1673.77 (SE +/- 2.17, N = 4)
Per-configuration options as listed: -O3 -march=znver1; -march=znver1 -O3 -mllvm; -Ofast -march=znver1; -O3; -O2; -O0
1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.org: SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
-O3: 5859.93 (SE +/- 103.73, N = 4)
-O3 -march=znver1: 5670.42 (SE +/- 16.49, N = 4)
-Ofast -march=znver1: 5644.15 (SE +/- 25.89, N = 4)
-O0: 5637.05 (SE +/- 21.28, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 5605.43 (SE +/- 27.73, N = 4)
-O2: 5554.97 (SE +/- 40.80, N = 4)
Per-configuration options as listed: -O3; -O3 -march=znver1; -Ofast -march=znver1; -O0; -march=znver1 -O3 -mllvm; -O2
1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.org: SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
-O3: 2667.90 (SE +/- 39.95, N = 4)
-O0: 2648.16 (SE +/- 7.78, N = 4)
-O2: 2646.12 (SE +/- 4.64, N = 4)
-O3 -march=znver1: 2636.32 (SE +/- 10.92, N = 4)
-Ofast -march=znver1: 2619.99 (SE +/- 5.95, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 2616.74 (SE +/- 6.22, N = 4)
Per-configuration options as listed: -O3; -O0; -O2; -O3 -march=znver1; -Ofast -march=znver1; -march=znver1 -O3 -mllvm
1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.org: SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
-Ofast -march=znver1: 135.39 (SE +/- 0.42, N = 4)
-O0: 134.67 (SE +/- 0.39, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 134.46 (SE +/- 0.16, N = 4)
-O3: 134.23 (SE +/- 0.25, N = 4)
-O2: 134.06 (SE +/- 0.36, N = 4)
-O3 -march=znver1: 131.50 (SE +/- 3.77, N = 4)
Per-configuration options as listed: -Ofast -march=znver1; -O0; -march=znver1 -O3 -mllvm; -O3; -O2; -O3 -march=znver1
1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.org: SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
-O3 -march=znver1: 660.30 (SE +/- 0.05, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 659.88 (SE +/- 0.16, N = 4)
-Ofast -march=znver1: 659.67 (SE +/- 0.11, N = 4)
-O0: 643.45 (SE +/- 0.13, N = 4)
-O3: 643.43 (SE +/- 0.30, N = 4)
-O2: 642.37 (SE +/- 1.18, N = 4)
Per-configuration options as listed: -O3 -march=znver1; -march=znver1 -O3 -mllvm; -Ofast -march=znver1; -O0; -O3; -O2
1. (CC) gcc options: -lm

SciMark

Computational Test: Composite

OpenBenchmarking.org: SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
-O3: 2196.34 (SE +/- 28.83, N = 4)
-O3 -march=znver1: 2156.12 (SE +/- 4.05, N = 4)
-Ofast -march=znver1: 2147.94 (SE +/- 5.09, N = 4)
-O0: 2147.42 (SE +/- 2.89, N = 4)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 2139.67 (SE +/- 5.52, N = 4)
-O2: 2130.55 (SE +/- 8.24, N = 4)
Per-configuration options as listed: -O3; -O3 -march=znver1; -Ofast -march=znver1; -O0; -march=znver1 -O3 -mllvm; -O2
1. (CC) gcc options: -lm
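
The composite figures are consistent with a plain arithmetic mean of the five SciMark kernel scores. The following Python sketch checks that against the numbers reported above; the variable names (`kernel_scores`, `reported_composite`) are illustrative and the values are copied by hand from this result file.

```python
STRIDED = "-O3 -march=znver1 -mllvm -enable-strided-vectorization"

# SciMark 2.0 kernel scores in Mflops, in the order:
# SOR, Dense LU, Sparse Matrix Multiply, FFT, Monte Carlo.
kernel_scores = {
    "-O0": [1673.77, 5637.05, 2648.16, 134.67, 643.45],
    "-O2": [1675.22, 5554.97, 2646.12, 134.06, 642.37],
    "-O3": [1676.24, 5859.93, 2667.90, 134.23, 643.43],
    "-O3 -march=znver1": [1682.07, 5670.42, 2636.32, 131.50, 660.30],
    STRIDED: [1681.87, 5605.43, 2616.74, 134.46, 659.88],
    "-Ofast -march=znver1": [1680.49, 5644.15, 2619.99, 135.39, 659.67],
}

# Composite scores as reported in the result above.
reported_composite = {
    "-O0": 2147.42,
    "-O2": 2130.55,
    "-O3": 2196.34,
    "-O3 -march=znver1": 2156.12,
    STRIDED: 2139.67,
    "-Ofast -march=znver1": 2147.94,
}


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


for flags, scores in kernel_scores.items():
    computed = mean(scores)
    print(f"{flags}: computed {computed:.2f}, reported {reported_composite[flags]:.2f}")
```

Because the composite is an unweighted mean, the Dense LU score, which is by far the largest of the five, dominates it: the -O3 lead in the composite mostly reflects its Dense LU result, which also carries the largest standard error.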

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.org: Timed MAFFT Alignment 6.864, Multiple Sequence Alignment (Seconds, Fewer Is Better)
-O2: 3.63 (SE +/- 0.11, N = 6)
-O3 -march=znver1: 3.66 (SE +/- 0.08, N = 6)
-O3: 3.77 (SE +/- 0.02, N = 3)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 3.78 (SE +/- 0.10, N = 6)
-Ofast -march=znver1: 3.82 (SE +/- 0.09, N = 6)
-O0: 3.87 (SE +/- 0.06, N = 4)
1. (CC) gcc options: -O3 -lm -lpthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.org: FFTW 3.3.4, Build: Float + SSE - Size: 2D FFT Size 1024 (Mflops, More Is Better)
-Ofast -march=znver1: 20676.00 (SE +/- 51.64, N = 5)
-O3 -march=znver1 -mllvm -enable-strided-vectorization: 20433.00 (SE +/- 81.73, N = 5)
-O3: 20297.00 (SE +/- 89.96, N = 5)
-O2: 20190.00 (SE +/- 110.89, N = 5)
-O3 -march=znver1: 20171.00 (SE +/- 65.74, N = 5)
-O0: 2434.70 (SE +/- 2.81, N = 5)
Per-configuration options as listed: -Ofast -march=znver1; -march=znver1 -O3 -mllvm; -O3; -O2; -O3 -march=znver1
1. (CC) gcc options: -lm


Phoronix Test Suite v10.8.4