AMD EPYC Compiler Testing July 2018

AMD EPYC compiler benchmarks.

HTML result view exported from: https://openbenchmarking.org/result/1807210-RA-AMDEPYCCO34&grr&sor.

AMD EPYC Compiler Testing July 2018ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HR (V1.02.B10 BIOS)AMD Family 17h129024MB280GB INTEL SSDPE21D280GAASPEED ASPEED FamilyVE228Broadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 18.104.16.0-041600-generic (x86_64)GCC 8.1.1 20180720ext41920x1080GCC 9.0.0 20180720OpenBenchmarking.orgEnvironment Details- GCC 8.2 RC1: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 8.2 RC1: -O3 -march=native - PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 8.2 RC1: -O3: CXXFLAGS=-O3 CFLAGS=-O3- GCC 9.0 SVN: -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 9.0.0 20180720 - AMD EPYC 7601 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Details- --disable-multilib --enable-checking=releaseDisk Details- NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: acpi-cpufreq ondemandSecurity Details- __user pointer sanitization + Full AMD retpoline IBPB Protection

AMD EPYC Compiler Testing July 2018vpxenc: vpxencm-queens: Time To Solvestockfish: Total Timettsiod-renderer: Phong Rendering With Soft-Shadow Mappingcompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19pgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Single Thread - Read Onlypgbench: Buffer Test - Normal Load - Read Writesmallpt: Global Illumination Renderer; 100 Samplesc-ray: Total Timetinymembench: Standard Memcpycompress-7zip: Compress Speed Testhimeno: Poisson Pressure Solverapache: Static Web Page Servinggraphics-magick: Sharpengraphics-magick: Local Adaptive Thresholdinggraphics-magick: Resizinggraphics-magick: Blurgraphics-magick: HWB Color Spaceaobench: 2048 x 2048 - Total Timehmmer: Pfam Database Searchebizzy: crafty: Elapsed Timescimark2: Compositeredis: GETredis: SETsockperf: Latency Under Loadopenssl: RSA 4096-bit Performanceredis: LPUSHredis: SADDencode-flac: WAV To FLACsockperf: Latency Ping Pongredis: LPOPfftw: Stock - 2D FFT Size 1024encode-mp3: WAV To MP3bullet: 3000 Falltjbench: Decompression Throughputpolybench-c: Covariance Computationpolybench-c: Correlation Computationfftw: Stock - 1D FFT Size 1024polybench-c: 3 Matrix Multiplicationstscp: AI Chess Performancesqlite: Timed SQLite Insertionsbullet: Raytestsbullet: Convex Trimeshbullet: Prim Trimeshbullet: 1000 Convexbullet: 136 Ragdollsbullet: 1000 Stackscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Monte Carloscimark2: Jacobi Successive Over-Relaxationscimark2: Fast Fourier TransformGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core12.3832.0259730290439.54155.642071.78408873.2514374.1336226.6942.5212497.20100684982.6716205.3218811018515420950.807.65114700960148661943.301691908.001375161.9619.754610.631108304.541430638.4512.636.421108268.086557.8310.515.02142.185.345.348518.673.588510192.893.111.331.135.323.095.794831.752421.38556.371687.64219.382694.4713725103.04260.021427.38242221.359213.4528411.49533566.2212463.5098726972.0715210.4762329184657.85192.26113169659987591766.3831.294656.4012.876.886351.5015.40106.035.375.398289.103.667295073.134301.992442.87195.981676.49214.5712.2932.446108086932.5112453.2099762852.6349.05111244860266551877.544617.235.425.333.359191934865.242413.24213.981676.15219.0912.3632.0959837512441.58166.702044.30408599.1313985.7036248.6542.8412555.7097917998.8915891.7217710817514719952.607.71111335659892071874.091813896.311332541.9619.674609.731187656.291517826.7712.276.761143259.505835.9011.135.06141.745.395.407176.773.598512832.883.181.361.125.523.146.024683.552473.98561.851429.81221.3012.3731.5659201545156.582015.18401369.6114307.5436115.9442.5212377.60973851007.7415856.0318210418014920150.727.63111166258762971940.651586367.581284845.1319.884616.471133637.461423169.7912.441152018.466531.3710.505.04145.665.415.388506.833.668565762.893.121.331.135.293.085.784781.452458.23556.371688.09219.093263.94261.521348.77238819.608870.9428285.32560561.5012474.20993621006.1915244.0262330184657.00192.14114230258788001869.3318.064647.8012.876.796448.4015.49106.905.455.388345.203.617565633.074869.992391.26190.061678.45216.86OpenBenchmarking.org

VP9 libvpx Encoding

vpxenc

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.7.0vpxenc-O3 -march=native-O3PGO-O3 -march=native3691215SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 312.3812.3612.2912.371. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.1Time To SolveGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native - PGOAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core7001400210028003500SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.19, N = 331.5632.0232.0932.442694.473263.941. (CXX) g++ options: -fopenmp -O3 -march=native -O2

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-Core13M26M39M52M65MSE +/- 512120.63, N = 3SE +/- 551889.61, N = 3SE +/- 576746.80, N = 3SE +/- 756265.41, N = 3610808695983751259730290592015451372510-march=native -fprofile-correction-march=native-march=native1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow MappingGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-Core100200300400500SE +/- 1.61, N = 3SE +/- 1.94, N = 3441.58439.543.04-march=native1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core60120180240300SE +/- 0.54, N = 3SE +/- 1.43, N = 3SE +/- 0.68, N = 3155.64156.58166.70260.02261.52-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -pthread -lz -llzma

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read WriteGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core400800120016002000SE +/- 12.52, N = 3SE +/- 30.23, N = 5SE +/- 37.36, N = 32071.782044.302015.181427.381348.77-march=native -lpq-lpq-march=native -lpq-march=native-march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core90K180K270K360K450KSE +/- 1289.79, N = 3SE +/- 1716.32, N = 3SE +/- 3319.69, N = 3408873.25408599.13401369.61242221.35238819.60-march=native -lpq-lpq-march=native -lpq-march=native-march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read OnlyGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core3K6K9K12K15KSE +/- 113.33, N = 3SE +/- 152.49, N = 3SE +/- 266.50, N = 314374.1314307.5413985.709213.458870.94-march=native -lpq-march=native -lpq-lpq-march=native-march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core8K16K24K32K40KSE +/- 81.46, N = 3SE +/- 133.09, N = 3SE +/- 66.03, N = 336248.6536226.6936115.9428411.4928285.32-lpq-march=native -lpq-march=native -lpq-march=native-march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core120240360480600SE +/- 0.21, N = 63444533560-march=native-march=native1. (CXX) g++ options: -fopenmp -O3

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core120240360480600SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32.512.522.522.84561.50566.22-march=native-march=native1. (CC) gcc options: -lm -lpthread -O3

Tinymembench

Standard Memset

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard MemsetGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGO2K4K6K8K10K10063.1010037.7010034.0010015.509982.809975.20-march=native-march=native-march=native-march=native-march=native -fprofile-correction1. (CC) gcc options: -O2 -O3 -lm

Tinymembench

Standard Memcpy

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard MemcpyGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=native3K6K9K12K15K12555.7012497.2012474.2012463.5012453.2012377.60-march=native-march=native-march=native-march=native -fprofile-correction-march=native1. (CC) gcc options: -O2 -O3 -lm

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=native20K40K60K80K100KSE +/- 715.28, N = 3SE +/- 555.15, N = 3SE +/- 586.49, N = 3SE +/- 465.88, N = 310068499762993629872697917973851. (CXX) g++ options: -pipe -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGO2004006008001000SE +/- 4.57, N = 3SE +/- 2.17, N = 3SE +/- 0.78, N = 3SE +/- 0.73, N = 31007.741006.19998.89982.67972.07852.63-march=native-march=native-march=native-march=native-march=native -fprofile-correction1. (CC) gcc options: -O3 -mavx2

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core3K6K9K12K15KSE +/- 193.23, N = 3SE +/- 188.72, N = 3SE +/- 89.48, N = 316205.3215891.7215856.0315244.0215210.47-march=native-march=native-march=native-march=native1. (CC) gcc options: -shared -fPIC -pthread -O3

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: SharpenGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core408012016020018818217766-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: Local Adaptive ThresholdingGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core20406080100SE +/- 0.33, N = 31101081042323-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: ResizingGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core4080120160200SE +/- 1.20, N = 3SE +/- 1.00, N = 3SE +/- 1.00, N = 31851801753029-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: BlurGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core3060901201501541491471818-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: HWB Color SpaceGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core50100150200250SE +/- 0.58, N = 32092011994646-march=native-march=native-march=native-march=native1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core1326395265SE +/- 0.20, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 349.0550.7250.8052.6057.0057.85-march=native -fprofile-correction-march=native-march=native-march=native-march=native1. (CC) gcc options: -lm -O3

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core4080120160200SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 37.637.657.71192.14192.26-march=native-march=native1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=native200K400K600K800K1000KSE +/- 28582.82, N = 6SE +/- 18183.13, N = 4SE +/- 20234.15, N = 6SE +/- 23788.74, N = 6114700911423021131696111335611124481111662-fprofile-correction1. (CC) gcc options: -pthread -lpthread -O3 -march=native

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=native1.3M2.6M3.9M5.2M6.5MSE +/- 11021.16, N = 3SE +/- 5146.22, N = 3SE +/- 13110.39, N = 3SE +/- 13470.58, N = 36026655601486659987595989207587880058762971. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core400800120016002000SE +/- 5.96, N = 3SE +/- 4.00, N = 3SE +/- 3.46, N = 3SE +/- 5.53, N = 31943.301940.651877.541874.091869.331766.38-march=native-march=native-march=native -fprofile-correction-march=native-march=native1. (CC) gcc options: -O3 -lm

Redis

Test: GET

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3-O3 -march=native400K800K1200K1600K2000KSE +/- 34086.99, N = 6SE +/- 37527.46, N = 6SE +/- 25099.79, N = 61813896.311691908.001586367.581. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 -march=native-O3300K600K900K1200K1500KSE +/- 13759.71, N = 3SE +/- 23587.86, N = 6SE +/- 31380.33, N = 61375161.961332541.961284845.131. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Sockperf

Test: Latency Under Load

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under LoadGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-Core714212835SE +/- 0.31, N = 10SE +/- 0.30, N = 10SE +/- 0.24, N = 818.0619.6719.7519.8831.29-march=native-march=native-march=native-march=native1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit PerformanceAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O310002000300040005000SE +/- 23.93, N = 3SE +/- 23.13, N = 3SE +/- 21.56, N = 3SE +/- 22.71, N = 34656.404647.804617.234616.474610.634609.73-lssl-lssl-lssl1. (CC) gcc options: -O3 -pthread -m64 -lcrypto -ldl

Redis

Test: LPUSH

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSH-O3-O3 -march=native300K600K900K1200K1500KSE +/- 2157.90, N = 3SE +/- 18347.41, N = 6SE +/- 12639.17, N = 31187656.291108304.541133637.461. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADD-O3-O3 -march=native300K600K900K1200K1500KSE +/- 35661.81, N = 6SE +/- 4081.67, N = 3SE +/- 3767.82, N = 31517826.771430638.451423169.791. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core3691215SE +/- 0.12, N = 5SE +/- 0.01, N = 5SE +/- 0.19, N = 512.2712.4412.6312.8712.87-march=native-march=native-march=native-march=native1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

Sockperf

Test: Latency Ping Pong

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping PongGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core246810SE +/- 0.11, N = 10SE +/- 0.07, N = 56.426.766.796.88-march=native-march=native-march=native1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Redis

Test: LPOP

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOP-O3 -march=native-O3200K400K600K800K1000KSE +/- 21693.88, N = 3SE +/- 15264.52, N = 3SE +/- 4079.04, N = 31152018.461143259.501108268.081. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O314002800420056007000SE +/- 73.40, N = 3SE +/- 16.29, N = 3SE +/- 3.18, N = 36557.836531.376448.406351.505835.90-march=native-march=native-march=native-march=native1. (CC) gcc options: -pthread -O3 -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3AMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core48121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.5010.5111.1315.4015.49-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

Bullet Physics Engine

Test: 3000 Fall

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=native-O31.13852.2773.41554.5545.6925SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 35.025.065.041. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression ThroughputGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core306090120150SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3145.66142.18141.74106.90106.03-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

PolyBench-C

Test: Covariance Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Covariance ComputationGCC 8.2 RC1: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core1.22632.45263.67894.90526.1315SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.345.375.395.415.425.45-fprofile-correction1. (CC) gcc options: -O3 -march=native

PolyBench-C

Test: Correlation Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Correlation ComputationGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O31.2152.433.6454.866.075SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.335.345.385.385.395.40-fprofile-correction1. (CC) gcc options: -O3 -march=native

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O32K4K6K8K10KSE +/- 18.71, N = 3SE +/- 1.83, N = 3SE +/- 8.90, N = 38518.678506.838345.208289.107176.77-march=native-march=native-march=native-march=native1. (CC) gcc options: -pthread -O3 -lm

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: 3 Matrix MultiplicationsGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 9.0 SVN: -O3 -march=native0.82351.6472.47053.2944.1175SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 33.353.583.593.613.663.66-fprofile-correction1. (CC) gcc options: -O3 -march=native

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core200K400K600K800K1000KSE +/- 1018.49, N = 5SE +/- 421.22, N = 5SE +/- 833.02, N = 5SE +/- 766.17, N = 5919193856576851283851019756563729507-fprofile-correction1. (CC) gcc options: -O3 -march=native

SQLite

Timed SQLite Insertions

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.22Timed SQLite InsertionsGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core0.70431.40862.11292.81723.5215SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 32.882.892.893.073.13-march=native-march=native-march=native-march=native1. (CC) gcc options: -O3 -lz -ldl -lpthread

Bullet Physics Engine

Test: Raytests

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=native-O30.71551.4312.14652.8623.5775SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.113.183.121. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O3 -march=native-O30.3060.6120.9181.2241.53SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.331.361.331. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O3-O3 -march=native0.25430.50860.76291.01721.2715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.121.131.131. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=native-O31.2422.4843.7264.9686.21SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.295.325.521. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=native-O30.70651.4132.11952.8263.5325SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.083.093.141. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=native-O3246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.785.796.021. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3AMD EPYC 7601 32-Core10002000300040005000SE +/- 17.02, N = 3SE +/- 20.82, N = 3SE +/- 25.52, N = 3SE +/- 27.07, N = 34869.994865.244831.754781.454683.554301.99-march=native-march=native -fprofile-correction-march=native-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core5001000150020002500SE +/- 1.01, N = 3SE +/- 13.99, N = 3SE +/- 11.03, N = 3SE +/- 10.35, N = 32473.982458.232442.872421.382413.242391.26-march=native-march=native-march=native-march=native -fprofile-correction-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 8.2 RC1: -O3GCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOAMD EPYC 7601 32-CoreGCC 9.0.0 20180720 - AMD EPYC 7601 32-Core120240360480600SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.25, N = 3561.85556.37556.37213.98195.98190.06-march=native-march=native-march=native -fprofile-correction-march=native-march=native1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=nativeGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-CoreGCC 8.2 RC1: -O3 -march=native - PGOGCC 8.2 RC1: -O3400800120016002000SE +/- 0.72, N = 3SE +/- 0.06, N = 3SE +/- 0.26, N = 3SE +/- 0.16, N = 31688.091687.641678.451676.491676.151429.81-march=native-march=native-march=native-march=native-march=native -fprofile-correction1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 8.2 RC1: -O3GCC 8.2 RC1: -O3 -march=nativeGCC 9.0 SVN: -O3 -march=nativeGCC 8.2 RC1: -O3 -march=native - PGOGCC 9.0.0 20180720 - AMD EPYC 7601 32-CoreAMD EPYC 7601 32-Core50100150200250SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3221.30219.38219.09219.09216.86214.57-march=native-march=native-march=native -fprofile-correction-march=native-march=native1. (CC) gcc options: -O3 -lm


Phoronix Test Suite v10.8.4