AMD EPYC Compiler Testing July 2018

AMD EPYC compiler benchmarks.

HTML result view exported from: https://openbenchmarking.org/result/1807210-RA-AMDEPYCCO34&gru&sor&rro.

AMD EPYC Compiler Testing July 2018ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HR (V1.02.B10 BIOS)AMD Family 17h129024MB280GB INTEL SSDPE21D280GAASPEED ASPEED FamilyVE228Broadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 18.104.16.0-041600-generic (x86_64)GCC 8.1.1 20180720ext41920x1080GCC 9.0.0 20180720OpenBenchmarking.orgEnvironment Details- GCC 8.2 RC1: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 8.2 RC1: -O3 -march=native - PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 8.2 RC1: -O3: CXXFLAGS=-O3 CFLAGS=-O3- GCC 9.0 SVN: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Details- --disable-multilib --enable-checking=releaseDisk Details- NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: acpi-cpufreq ondemandSecurity Details- __user pointer sanitization + Full AMD retpoline IBPB Protection

AMD EPYC Compiler Testing July 2018ttsiod-renderer: Phong Rendering With Soft-Shadow Mappingvpxenc: vpxencgraphics-magick: HWB Color Spacegraphics-magick: Blurgraphics-magick: Local Adaptive Thresholdinggraphics-magick: Resizinggraphics-magick: Sharpentinymembench: Standard Memcpytjbench: Decompression Throughputscimark2: Compositescimark2: Fast Fourier Transformscimark2: Jacobi Successive Over-Relaxationscimark2: Monte Carloscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationfftw: Stock - 1D FFT Size 1024fftw: Stock - 2D FFT Size 1024himeno: Poisson Pressure Solvercompress-7zip: Compress Speed Testcrafty: Elapsed Timestockfish: Total Timetscp: AI Chess Performanceebizzy: redis: SETredis: GETredis: LPUSHredis: LPOPredis: SADDapache: Static Web Page Servingopenssl: RSA 4096-bit Performancepgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Single Thread - Read Onlypgbench: Buffer Test - Normal Load - Read Writepgbench: Buffer Test - Normal Load - Read Onlypolybench-c: 3 Matrix Multiplicationspolybench-c: Correlation Computationpolybench-c: Covariance Computationsqlite: Timed SQLite Insertionsaobench: 2048 x 2048 - Total Timecompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19m-queens: Time To Solvec-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplesencode-mp3: WAV To MP3encode-flac: WAV To FLAChmmer: Pfam Database Searchbullet: 3000 Fallbullet: 1000 Stackbullet: 136 Ragdollsbullet: 1000 Convexbullet: Prim Trimeshbullet: Convex Trimeshbullet: Raytestssockperf: Latency Under Loadsockperf: Latency Ping PongGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core439.5412.3820915411018518812497.20142.181943.30219.381687.64556.372421.384831.758518.676557.83982.6710068460148665973029085101911470091375161.961691908.001108304.541108268.081430638.4516205.324610.632071.7814374.1336226.69408873.253.585.345.342.8950.80155.6432.022.52410.5112.637.655.025.793.095.321.131.333.1119.756.423.0446182329612463.50106.031766.38214.571676.49195.982442.874301.998289.106351.50972.079872659987591372510729507113169615210.474656.401427.389213.4528411.49242221.353.665.395.373.1357.85260.022694.47566.2253315.4012.87192.2631.296.8812.2912453.201877.54219.091676.15213.982413.244865.24852.639976260266556108086991919311124484617.233.355.335.4249.0532.442.513441.5812.3619914710817517712555.70141.741874.09221.301429.81561.852473.984683.557176.775835.90998.899791759892075983751285128311133561332541.961813896.311187656.291143259.501517826.7715891.724609.732044.3013985.7036248.65408599.133.595.405.392.8852.60166.7032.092.84411.1312.277.715.066.023.145.521.121.363.1819.676.7612.3720114910418018212377.60145.661940.65219.091688.09556.372458.234781.458506.836531.371007.749738558762975920154585657611116621284845.131586367.581133637.461152018.461423169.7915856.034616.472015.1814307.5436115.94401369.613.665.385.412.8950.72156.5831.562.52410.5012.447.635.045.783.085.291.131.333.1219.8846182330612474.20106.901869.33216.861678.45190.062391.264869.998345.206448.401006.19993625878800756563114230215244.024647.801348.778870.9428285.32238819.603.615.385.453.0757.00261.523263.94561.5056015.4912.87192.1418.066.79OpenBenchmarking.org

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow MappingAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3100200300400500SE +/- 1.94, N = 3SE +/- 1.61, N = 33.04439.54441.58-march=native1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

VP9 libvpx Encoding

vpxenc

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.7.0vpxencPGO-O3 -march=native-O3-O3 -march=native3691215SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 312.2912.3612.3812.371. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: HWB Color SpaceAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native50100150200250SE +/- 0.58, N = 34646199201209-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: BlurAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native3060901201501818147149154-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: Local Adaptive ThresholdingAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native20406080100SE +/- 0.33, N = 32323104108110-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: ResizingAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native4080120160200SE +/- 1.00, N = 3SE +/- 1.00, N = 3SE +/- 1.20, N = 32930175180185-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: SharpenAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native408012016020066177182188-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Tinymembench

Standard Memcpy

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard MemcpyGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O33K6K9K12K15K12377.6012453.2012463.5012474.2012497.2012555.70-march=native-march=native -fprofile-correction-march=native-march=native-march=native1. (CC) gcc options: -O2 -O3 -lm

Tinymembench

Standard Memset

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard MemsetGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native2K4K6K8K10K9975.209982.8010015.5010034.0010037.7010063.10-march=native -fprofile-correction-march=native-march=native-march=native-march=native1. (CC) gcc options: -O2 -O3 -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression ThroughputAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=native306090120150SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3106.03106.90141.74142.18145.66-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native400800120016002000SE +/- 5.53, N = 3SE +/- 3.46, N = 3SE +/- 4.00, N = 3SE +/- 5.96, N = 31766.381869.331874.091877.541940.651943.30-march=native-march=native-march=native -fprofile-correction-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O350100150200250SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3214.57216.86219.09219.09219.38221.30-march=native-march=native-march=native -fprofile-correction-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native - PGOAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=native400800120016002000SE +/- 0.16, N = 3SE +/- 0.26, N = 3SE +/- 0.06, N = 3SE +/- 0.72, N = 31429.811676.151676.491678.451687.641688.09-march=native -fprofile-correction-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3120240360480600SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3190.06195.98213.98556.37556.37561.85-march=native-march=native-march=native -fprofile-correction-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O35001000150020002500SE +/- 10.35, N = 3SE +/- 11.03, N = 3SE +/- 13.99, N = 3SE +/- 1.01, N = 32391.262413.242421.382442.872458.232473.98-march=native-march=native -fprofile-correction-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core10002000300040005000SE +/- 27.07, N = 3SE +/- 25.52, N = 3SE +/- 20.82, N = 3SE +/- 17.02, N = 34301.994683.554781.454831.754865.244869.99-march=native-march=native-march=native-march=native -fprofile-correction-march=native1. (CC) gcc options: -O3 -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024GCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native2K4K6K8K10KSE +/- 8.90, N = 3SE +/- 1.83, N = 3SE +/- 18.71, N = 37176.778289.108345.208506.838518.67-march=native-march=native-march=native-march=native1. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024GCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native14002800420056007000SE +/- 3.18, N = 3SE +/- 16.29, N = 3SE +/- 73.40, N = 35835.906351.506448.406531.376557.83-march=native-march=native-march=native-march=native1. (CC) gcc options: -pthread -O3 -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 8.2 RC1: -O3 -march=native - PGOAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=native2004006008001000SE +/- 0.73, N = 3SE +/- 0.78, N = 3SE +/- 2.17, N = 3SE +/- 4.57, N = 3852.63972.07982.67998.891006.191007.74-march=native -fprofile-correction-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -mavx2

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=native20K40K60K80K100KSE +/- 465.88, N = 3SE +/- 586.49, N = 3SE +/- 555.15, N = 3SE +/- 715.28, N = 397385979179872699362997621006841. (CXX) g++ options: -pipe -lpthread

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeGCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO1.3M2.6M3.9M5.2M6.5MSE +/- 13470.58, N = 3SE +/- 13110.39, N = 3SE +/- 5146.22, N = 3SE +/- 11021.16, N = 35876297587880059892075998759601486660266551. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native - PGO13M26M39M52M65MSE +/- 756265.41, N = 3SE +/- 576746.80, N = 3SE +/- 551889.61, N = 3SE +/- 512120.63, N = 3137251059201545597302905983751261080869-march=native-march=native-march=native -fprofile-correction1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO200K400K600K800K1000KSE +/- 766.17, N = 5SE +/- 833.02, N = 5SE +/- 421.22, N = 5SE +/- 1018.49, N = 5729507756563851019851283856576919193-fprofile-correction1. (CC) gcc options: -O3 -march=native

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native200K400K600K800K1000KSE +/- 23788.74, N = 6SE +/- 20234.15, N = 6SE +/- 18183.13, N = 4SE +/- 28582.82, N = 6111166211124481113356113169611423021147009-fprofile-correction1. (CC) gcc options: -pthread -lpthread -O3 -march=native

Redis

Test: SET

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 -march=native-O3300K600K900K1200K1500KSE +/- 31380.33, N = 6SE +/- 23587.86, N = 6SE +/- 13759.71, N = 31284845.131332541.961375161.961. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3 -march=native-O3400K800K1200K1600K2000KSE +/- 25099.79, N = 6SE +/- 37527.46, N = 6SE +/- 34086.99, N = 61586367.581691908.001813896.311. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPUSH

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSH-O3 -march=native-O3300K600K900K1200K1500KSE +/- 18347.41, N = 6SE +/- 2157.90, N = 3SE +/- 12639.17, N = 31108304.541187656.291133637.461. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPOP

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOP-O3 -march=native-O3200K400K600K800K1000KSE +/- 4079.04, N = 3SE +/- 15264.52, N = 3SE +/- 21693.88, N = 31108268.081143259.501152018.461. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADD-O3 -march=native-O3300K600K900K1200K1500KSE +/- 3767.82, N = 3SE +/- 4081.67, N = 3SE +/- 35661.81, N = 61423169.791430638.451517826.771. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native3K6K9K12K15KSE +/- 89.48, N = 3SE +/- 188.72, N = 3SE +/- 193.23, N = 315210.4715244.0215856.0315891.7216205.32-march=native-march=native-march=native-march=native1. (CC) gcc options: -shared -fPIC -pthread -O3

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit PerformanceGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core10002000300040005000SE +/- 22.71, N = 3SE +/- 21.56, N = 3SE +/- 23.13, N = 3SE +/- 23.93, N = 34609.734610.634616.474617.234647.804656.40-lssl-lssl-lssl1. (CC) gcc options: -O3 -pthread -m64 -lcrypto -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read WriteGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native400800120016002000SE +/- 37.36, N = 3SE +/- 30.23, N = 5SE +/- 12.52, N = 31348.771427.382015.182044.302071.78-march=native-march=native-march=native -lpq-lpq-march=native -lpq1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read OnlyGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native3K6K9K12K15KSE +/- 266.50, N = 3SE +/- 152.49, N = 3SE +/- 113.33, N = 38870.949213.4513985.7014307.5414374.13-march=native-march=native-lpq-march=native -lpq-march=native -lpq1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O38K16K24K32K40KSE +/- 66.03, N = 3SE +/- 133.09, N = 3SE +/- 81.46, N = 328285.3228411.4936115.9436226.6936248.65-march=native-march=native-march=native -lpq-march=native -lpq-lpq1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native90K180K270K360K450KSE +/- 3319.69, N = 3SE +/- 1716.32, N = 3SE +/- 1289.79, N = 3238819.60242221.35401369.61408599.13408873.25-march=native-march=native-march=native -lpq-lpq-march=native -lpq1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: 3 Matrix MultiplicationsGCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO0.82351.6472.47053.2944.1175SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 33.663.663.613.593.583.35-fprofile-correction1. (CC) gcc options: -O3 -march=native

PolyBench-C

Test: Correlation Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Correlation ComputationGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO1.2152.433.6454.866.075SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 35.405.395.385.385.345.33-fprofile-correction1. (CC) gcc options: -O3 -march=native

PolyBench-C

Test: Covariance Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Covariance ComputationGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native1.22632.45263.67894.90526.1315SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 35.455.425.415.395.375.34-fprofile-correction1. (CC) gcc options: -O3 -march=native

SQLite

Timed SQLite Insertions

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.22Timed SQLite InsertionsAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O30.70431.40862.11292.81723.5215SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 33.133.072.892.892.88-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lz -ldl -lpthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO1326395265SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.20, N = 357.8557.0052.6050.8050.7249.05-march=native-march=native-march=native-march=native-march=native -fprofile-correction1. (CC) gcc options: -lm -O3

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native60120180240300SE +/- 0.68, N = 3SE +/- 1.43, N = 3SE +/- 0.54, N = 3261.52260.02166.70156.58155.64-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -pthread -lz -llzma

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.1Time To SolveGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=native7001400210028003500SE +/- 0.19, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 33263.942694.4732.4432.0932.0231.561. (CXX) g++ options: -fopenmp -O3 -march=native -O2

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO120240360480600SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3566.22561.502.842.522.522.51-march=native-march=native1. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO120240360480600SE +/- 0.21, N = 65605334443-march=native-march=native1. (CXX) g++ options: -fopenmp -O3

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=native48121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 315.4915.4011.1310.5110.50-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O33691215SE +/- 0.19, N = 5SE +/- 0.01, N = 5SE +/- 0.12, N = 512.8712.8712.6312.4412.27-march=native-march=native-march=native-march=native1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=native4080120160200SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3192.26192.147.717.657.63-march=native-march=native1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm

Bullet Physics Engine

Test: 3000 Fall

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3-O3 -march=native1.13852.2773.41554.5545.6925SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 35.065.025.041. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3-O3 -march=native246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.025.795.781. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3-O3 -march=native0.70651.4132.11952.8263.5325SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 33.143.093.081. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3-O3 -march=native1.2422.4843.7264.9686.21SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 35.525.325.291. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O3 -march=native-O30.25430.50860.76291.01721.2715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.131.131.121. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O3-O3 -march=native0.3060.6120.9181.2241.53SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.361.331.331. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Raytests

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3-O3 -march=native0.71551.4312.14652.8623.5775SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.183.113.121. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Sockperf

Test: Latency Under Load

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under LoadAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core714212835SE +/- 0.24, N = 8SE +/- 0.30, N = 10SE +/- 0.31, N = 1031.2919.8819.7519.6718.06-march=native-march=native-march=native-march=native1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Sockperf

Test: Latency Ping Pong

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping PongAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native246810SE +/- 0.07, N = 5SE +/- 0.11, N = 106.886.796.766.42-march=native-march=native-march=native1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread


Phoronix Test Suite v10.8.5