AMD EPYC Compiler Testing GCC 9.0 SVN With PGO

AMD EPYC compiler benchmarks for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1807235-RA-GCCPGOEPY35&sor&grr.

ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 8.2 RC1GCC 9.0 SVN -O2 -O3 -O3 native -O3 native + PGO -O2 -O3 -O3 native -O3 native + PGOAMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HR (V1.02.B10 BIOS)AMD Family 17h129024MB280GB INTEL SSDPE21D280GAASPEED ASPEED FamilyVE228Broadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 18.104.16.0-041600-generic (x86_64)GCC 8.1.1 20180720ext41920x1080GCC 9.0.0 20180720OpenBenchmarking.orgEnvironment Details- GCC 8.2 RC1: -O2: CXXFLAGS=-O2 CFLAGS=-O2- GCC 8.2 RC1: -O3: CXXFLAGS=-O3 CFLAGS=-O3- GCC 8.2 RC1: -O3 native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 8.2 RC1: -O3 native + PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 9.0 SVN: -O2: CXXFLAGS=-O2 CFLAGS=-O2- GCC 9.0 SVN: -O3: CXXFLAGS=-O3 CFLAGS=-O3- GCC 9.0 SVN: -O3 native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- GCC 9.0 SVN: -O3 native + PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Details- --disable-multilib --enable-checking=releaseDisk Details- NONE / data=ordered,errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: acpi-cpufreq ondemandSecurity Details- __user pointer sanitization + Full AMD retpoline IBPB Protection

vpxenc: vpxenccompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19pgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writepgbench: Buffer Test - Single Thread - Read Onlystockfish: Total Timecompress-7zip: Compress Speed Testapache: Static Web Page Servinghimeno: Poisson Pressure Solvergraphics-magick: Local Adaptive Thresholdinggraphics-magick: Resizinggraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: HWB Color Spacetinymembench: Standard Memcpyaobench: 2048 x 2048 - Total Timecrafty: Elapsed Timescimark2: Compositem-queens: Time To Solveebizzy: sockperf: Latency Under Loadsockperf: Latency Ping Pongencode-flac: WAV To FLACopenssl: RSA 4096-bit Performanceredis: GETfftw: Stock - 2D FFT Size 1024ttsiod-renderer: Phong Rendering With Soft-Shadow Mappingredis: SETredis: LPUSHredis: SADDencode-mp3: WAV To MP3redis: LPOPhmmer: Pfam Database Searchbullet: 3000 Falltjbench: Decompression Throughputsmallpt: Global Illumination Renderer; 100 Samplespolybench-c: Covariance Computationpolybench-c: Correlation Computationfftw: Stock - 1D FFT Size 1024c-ray: Total Timepolybench-c: 3 Matrix Multiplicationstscp: AI Chess Performancesqlite: Timed SQLite Insertionsbullet: Raytestsbullet: Convex Trimeshbullet: Prim Trimeshbullet: 1000 Convexbullet: 136 Ragdollsbullet: 1000 Stackscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Monte Carloscimark2: Jacobi Successive Over-Relaxationscimark2: Fast Fourier TransformGCC 8.2 RC1GCC 9.0 SVN -O2 -O3 -O3 native -O3 native + PGO -O2 -O3 -O3 native -O3 native + PGO12.43166.121991.71396925.9136523.7814037.15590073149848015972.591014.6610918114818120212359.3055.5260047001464.0632.22110639119.596.6612.154609.301612387.565796.33400.361278916.541135840.671500041.0414.051117326.837.565.06141.4345.535.557194.376.003.628525972.873.171.361.125.513.146.003462.392156.61561.79919.02220.5212.36166.702044.30408599.1336248.6513985.70598375129791715891.72998.8910817514717719912555.7052.6059892071874.0932.09111335619.676.7612.274609.731813896.315835.90441.581332541.961187656.291517826.7711.131143259.507.715.06141.7445.395.407176.772.843.598512832.883.181.361.125.523.146.024683.552473.98561.851429.81221.3012.38155.642071.78408873.2536226.6914374.135973029010068416205.32982.6711018515418820912497.2050.8060148661943.3032.02114700919.756.4212.634610.631691908.006557.83439.541375161.961108304.541430638.4510.511108268.087.655.02142.1845.345.348518.672.523.588510192.893.111.331.135.323.095.794831.752421.38556.371687.64219.3812.296108086999762852.6312453.2049.0560266551877.5432.4411124484617.2335.425.332.513.359191934865.242413.24213.981676.15219.0912.31168.141973.07397574.7436232.4414101.79583720899842315943.491014.3810217614617619712422.9055.5257808241499.9631.59112422219.756.5312.224612.201580690.665775.231319138.271130545.221542587.2514.281078502.417.405.06141.1845.505.506852.406.023.688565762.863.171.361.125.493.135.993432.882365.87561.29918.96220.8012.21168.202093.81395454.6136081.5714004.08584129759766316065.26993.221041791481792011249053.0358928251877.0331.64110914822.436.5712.134602.301682310.235601.001291788.211135535.001477048.9411.001103652.217.745.16137.8235.495.467168.632.833.638565772.853.171.361.125.533.226.014723.982458.50552.081429.31221.2812.37156.582015.18401369.6136115.9414307.54592015459738515856.031007.7410418014918220112377.6050.7258762971940.6531.56111166219.8812.444616.471586367.586531.371284845.131133637.461423169.7910.501152018.467.635.04145.6645.415.388506.832.523.668565762.893.121.331.135.293.085.784781.452458.23556.371688.09219.0912.07147.785614955096774869.0712300.6048.5158414231870.8332.3111669984623.6045.375.402.503.309161364845.492400.46213.951674.99219.24OpenBenchmarking.org

VP9 libvpx Encoding

vpxenc

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.7.0vpxenc-O2-O3 native-O3-O3 native + PGO3691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 312.4312.3812.3612.2912.3712.3112.2112.071. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O3 native + PGO-O3 native-O2-O34080120160200SE +/- 0.24, N = 3SE +/- 1.43, N = 3SE +/- 0.46, N = 3SE +/- 0.51, N = 3SE +/- 0.54, N = 3SE +/- 0.62, N = 3SE +/- 0.68, N = 3147.78156.58168.14168.20155.64166.12166.701. (CC) gcc options: -pthread -lz -llzma

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read Write-O3-O3 native-O2400800120016002000SE +/- 39.87, N = 3SE +/- 37.36, N = 3SE +/- 7.20, N = 3SE +/- 12.52, N = 3SE +/- 30.23, N = 5SE +/- 14.24, N = 32093.812015.181973.072071.782044.301991.711. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 native-O3-O290K180K270K360K450KSE +/- 1289.79, N = 3SE +/- 1716.32, N = 3SE +/- 2225.51, N = 3SE +/- 3319.69, N = 3SE +/- 1546.59, N = 3SE +/- 3629.20, N = 3408873.25408599.13396925.91401369.61397574.74395454.611. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O2-O3-O3 native8K16K24K32K40KSE +/- 62.00, N = 3SE +/- 81.46, N = 3SE +/- 133.09, N = 3SE +/- 96.17, N = 3SE +/- 66.03, N = 3SE +/- 193.85, N = 336523.7836248.6536226.6936232.4436115.9436081.571. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Only

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read Only-O3 native-O2-O33K6K9K12K15KSE +/- 113.33, N = 3SE +/- 215.85, N = 3SE +/- 266.50, N = 3SE +/- 152.49, N = 3SE +/- 72.75, N = 3SE +/- 140.09, N = 314374.1314037.1513985.7014307.5414101.7914004.081. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Stockfish

Total Time

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 native + PGO-O3-O3 native-O213M26M39M52M65MSE +/- 512120.63, N = 3SE +/- 551889.61, N = 3SE +/- 576746.80, N = 3SE +/- 219692.26, N = 3SE +/- 756265.41, N = 3SE +/- 236832.99, N = 3SE +/- 346634.55, N = 3SE +/- 419233.35, N = 361080869598375125973029059007314592015455841297558372089561495501. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

7-Zip Compression

Compress Speed Test

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test-O3 native-O3 native + PGO-O2-O320K40K60K80K100KSE +/- 715.28, N = 3SE +/- 555.15, N = 3SE +/- 771.81, N = 3SE +/- 586.49, N = 3SE +/- 926.79, N = 3SE +/- 935.09, N = 3SE +/- 465.88, N = 3SE +/- 1357.08, N = 3100684997629848097917984239766397385967741. (CXX) g++ options: -pipe -lpthread

Apache Benchmark

Static Web Page Serving

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page Serving-O3 native-O3-O23K6K9K12K15KSE +/- 193.23, N = 3SE +/- 132.55, N = 3SE +/- 188.72, N = 3SE +/- 74.41, N = 3SE +/- 80.25, N = 3SE +/- 89.48, N = 316205.3215972.5915891.7216065.2615943.4915856.031. (CC) gcc options: -shared -fPIC -pthread

Himeno Benchmark

Poisson Pressure Solver

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O2-O3 native-O3-O3 native + PGO2004006008001000SE +/- 1.94, N = 3SE +/- 2.17, N = 3SE +/- 0.78, N = 3SE +/- 0.73, N = 3SE +/- 0.47, N = 3SE +/- 4.57, N = 3SE +/- 5.36, N = 3SE +/- 0.64, N = 31014.66998.89982.67852.631014.381007.74993.22869.071. (CC) gcc options: -O3 -mavx2

GraphicsMagick

Operation: Local Adaptive Thresholding

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: Local Adaptive Thresholding-O3 native-O2-O320406080100SE +/- 0.33, N = 3SE +/- 1.00, N = 31101091081041041021. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: Resizing-O3 native-O2-O34080120160200SE +/- 1.20, N = 3SE +/- 0.88, N = 3SE +/- 1.00, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 31851811751801791761. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Blur

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: Blur-O3 native-O3-O2306090120150SE +/- 0.67, N = 31541481471491481461. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: Sharpen-O3 native-O2-O340801201602001881811771821791761. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.28Operation: HWB Color Space-O3 native-O2-O350100150200250SE +/- 0.58, N = 32092021992012011971. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Tinymembench

Standard Memset

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memset-O3 native-O3-O2-O3 native + PGO2K4K6K8K10K10063.1010037.709978.609975.2010039.709989.409982.909982.801. (CC) gcc options: -O2 -lm

Tinymembench

Standard Memcpy

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memcpy-O3-O3 native-O3 native + PGO-O23K6K9K12K15K12555.7012497.2012453.2012359.3012490.0012422.9012377.6012300.601. (CC) gcc options: -O2 -lm

AOBench

Size: 2048 x 2048 - Total Time

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 native + PGO-O3 native-O3-O21224364860SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.20, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 348.5150.7253.0355.5249.0550.8052.6055.521. (CC) gcc options: -lm -O3

Crafty

Elapsed Time

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time-O3 native + PGO-O3 native-O2-O31.3M2.6M3.9M5.2M6.5MSE +/- 11021.16, N = 3SE +/- 5146.22, N = 3SE +/- 8016.22, N = 3SE +/- 13110.39, N = 3SE +/- 2607.25, N = 3SE +/- 13470.58, N = 3SE +/- 2577.32, N = 3SE +/- 9192.37, N = 3602665560148666004700598920758928255876297584142357808241. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

SciMark

Computational Test: Composite

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 native-O3 native + PGO-O3-O2400800120016002000SE +/- 5.96, N = 3SE +/- 3.46, N = 3SE +/- 5.53, N = 3SE +/- 3.37, N = 3SE +/- 4.00, N = 3SE +/- 4.50, N = 3SE +/- 5.95, N = 3SE +/- 2.46, N = 31943.301877.541874.091464.061940.651877.031870.831499.961. (CC) gcc options: -lm

m-queens

Time To Solve

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.1Time To Solve-O3 native-O2-O3-O3 native + PGO816243240SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 331.5631.5931.6432.3132.0232.0932.2232.441. (CXX) g++ options: -fopenmp -march=native -O2

ebizzy

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3-O3 native + PGO-O3 native-O2-O3200K400K600K800K1000KSE +/- 4974.04, N = 3SE +/- 17876.02, N = 3SE +/- 23788.74, N = 6SE +/- 14780.83, N = 3SE +/- 28582.82, N = 6SE +/- 18183.13, N = 4SE +/- 20234.15, N = 6SE +/- 16592.93, N = 4116699811242221111662110914811470091113356111244811063911. (CC) gcc options: -pthread -lpthread -O3 -march=native

Sockperf

Test: Latency Under Load

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under Load-O2-O3-O3 native510152025SE +/- 0.09, N = 5SE +/- 0.31, N = 10SE +/- 0.30, N = 10SE +/- 0.42, N = 10SE +/- 0.24, N = 8SE +/- 2.62, N = 1019.5919.6719.7519.7519.8822.431. (CXX) g++ options: --param -rdynamic -ldl -lpthread

Sockperf

Test: Latency Ping Pong

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping Pong-O3 native-O2-O3246810SE +/- 0.11, N = 10SE +/- 0.08, N = 10SE +/- 0.07, N = 5SE +/- 0.12, N = 10SE +/- 0.09, N = 66.426.666.766.536.571. (CXX) g++ options: --param -rdynamic -ldl -lpthread

FLAC Audio Encoding

WAV To FLAC

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3-O2-O3 native3691215SE +/- 0.02, N = 5SE +/- 0.09, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.12, N = 5SE +/- 0.19, N = 512.1312.2212.4412.1512.2712.631. (CXX) g++ options: -fvisibility=hidden -logg -lm

OpenSSL

RSA 4096-bit Performance

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit Performance-O3 native + PGO-O3 native-O2-O310002000300040005000SE +/- 19.76, N = 3SE +/- 23.13, N = 3SE +/- 22.12, N = 3SE +/- 23.64, N = 3SE +/- 23.93, N = 3SE +/- 21.56, N = 3SE +/- 22.71, N = 3SE +/- 24.30, N = 34623.604616.474612.204602.304617.234610.634609.734609.301. (CC) gcc options: -O3 -pthread -m64 -lcrypto -ldl

Redis

Test: GET

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3-O3 native-O2400K800K1200K1600K2000KSE +/- 34086.99, N = 6SE +/- 37527.46, N = 6SE +/- 40550.88, N = 6SE +/- 40793.79, N = 6SE +/- 25099.79, N = 6SE +/- 7906.34, N = 31813896.311691908.001612387.561682310.231586367.581580690.661. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

FFTW

Build: Stock - Size: 2D FFT Size 1024

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024-O3 native-O3-O214002800420056007000SE +/- 73.40, N = 3SE +/- 3.18, N = 3SE +/- 4.95, N = 3SE +/- 16.29, N = 3SE +/- 57.48, N = 3SE +/- 173.00, N = 66557.835835.905796.336531.375775.235601.001. (CC) gcc options: -pthread -lm

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow Mapping-O3-O3 native-O2100200300400500SE +/- 1.61, N = 3SE +/- 1.94, N = 3SE +/- 1.81, N = 3441.58439.54400.36-O3-O3 -march=native-O21. (CXX) g++ options: -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

Redis

Test: SET

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 native-O3-O2300K600K900K1200K1500KSE +/- 13759.71, N = 3SE +/- 23587.86, N = 6SE +/- 9618.90, N = 3SE +/- 22403.96, N = 6SE +/- 24024.35, N = 3SE +/- 31380.33, N = 61375161.961332541.961278916.541319138.271291788.211284845.131. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPUSH

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSH-O3-O2-O3 native300K600K900K1200K1500KSE +/- 2157.90, N = 3SE +/- 21056.82, N = 3SE +/- 18347.41, N = 6SE +/- 17323.31, N = 6SE +/- 12639.17, N = 3SE +/- 18588.25, N = 41187656.291135840.671108304.541135535.001133637.461130545.221. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADD-O2-O3-O3 native300K600K900K1200K1500KSE +/- 11436.45, N = 3SE +/- 22572.62, N = 4SE +/- 3767.82, N = 3SE +/- 35661.81, N = 6SE +/- 32509.56, N = 6SE +/- 4081.67, N = 31542587.251477048.941423169.791517826.771500041.041430638.451. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

LAME MP3 Encoding

WAV To MP3

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 native-O3-O248121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.5011.0014.2810.5111.1314.051. (CC) gcc options: -lm

Redis

Test: LPOP

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOP-O3 native-O3-O2200K400K600K800K1000KSE +/- 21693.88, N = 3SE +/- 19911.75, N = 3SE +/- 8709.47, N = 3SE +/- 15264.52, N = 3SE +/- 2162.33, N = 3SE +/- 4079.04, N = 31152018.461103652.211078502.411143259.501117326.831108268.081. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Timed HMMer Search

Pfam Database Search

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O2-O3 native-O3246810SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 37.407.637.747.567.657.711. (CC) gcc options: -pthread -lhmmer -lsquid -lm

Bullet Physics Engine

Test: 3000 Fall

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 native-O2-O31.1612.3223.4834.6445.805SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 65.025.065.065.045.065.161. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

libjpeg-turbo tjbench

Test: Decompression Throughput

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression Throughput-O3 native-O3-O2306090120150SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 1.71, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3145.66141.18137.82142.18141.74141.431. (CC) gcc options: -lm

Smallpt

Global Illumination Renderer; 100 Samples

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samples-O3 native + PGO-O3-O2-O3 native0.91.82.73.64.5SE +/- 0.21, N = 6344434441. (CXX) g++ options: -fopenmp

PolyBench-C

Test: Covariance Computation

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Covariance Computation-O3 native-O3 native + PGO-O3-O21.24432.48863.73294.97726.2215SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 35.345.395.425.535.375.415.495.501. (CC) gcc options: -O3 -march=native

PolyBench-C

Test: Correlation Computation

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Correlation Computation-O3 native + PGO-O3 native-O3-O21.24882.49763.74644.99526.244SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 35.335.345.405.555.385.405.465.501. (CC) gcc options: -O3 -march=native

FFTW

Build: Stock - Size: 1D FFT Size 1024

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024-O3 native-O2-O32K4K6K8K10KSE +/- 18.71, N = 3SE +/- 1.72, N = 3SE +/- 8.90, N = 3SE +/- 1.83, N = 3SE +/- 10.71, N = 3SE +/- 176.56, N = 68518.677194.377176.778506.837168.636852.401. (CC) gcc options: -pthread -lm

C-Ray

Total Time

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time-O3 native + PGO-O3 native-O3-O2246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32.502.522.836.022.512.522.846.001. (CC) gcc options: -lm -lpthread -O3

PolyBench-C

Test: 3 Matrix Multiplications

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: 3 Matrix Multiplications-O3 native + PGO-O3 native-O3-O20.8281.6562.4843.3124.14SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 33.303.633.663.683.353.583.593.621. (CC) gcc options: -O3 -march=native

TSCP

AI Chess Performance

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 native + PGO-O3-O3 native-O2200K400K600K800K1000KSE +/- 1018.49, N = 5SE +/- 833.02, N = 5SE +/- 766.17, N = 5SE +/- 1116.56, N = 5SE +/- 595.69, N = 5SE +/- 421.22, N = 5SE +/- 421.22, N = 59191938525978512838510199161368565778565768565761. (CC) gcc options: -O3 -march=native

SQLite

Timed SQLite Insertions

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.22Timed SQLite Insertions-O3-O2-O3 native0.65031.30061.95092.60123.2515SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 32.852.862.892.872.882.891. (CC) gcc options: -lz -ldl -lpthread

Bullet Physics Engine

Test: Raytests

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 native-O2-O30.71551.4312.14652.8623.5775SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.113.173.183.123.173.171. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O3 native-O2-O30.3060.6120.9181.2241.53SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.331.361.361.331.361.361. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O2-O3-O3 native0.25430.50860.76291.01721.2715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.121.121.131.121.121.131. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 native-O2-O31.24432.48863.73294.97726.2215SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.295.495.535.325.515.521. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 native-O2-O30.72451.4492.17352.8983.6225SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.083.133.223.093.143.141. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 native-O2-O3246810SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.785.996.015.796.006.021. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Dense LU Matrix Factorization

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 native + PGO-O3 native-O3-O210002000300040005000SE +/- 17.02, N = 3SE +/- 20.82, N = 3SE +/- 27.07, N = 3SE +/- 14.02, N = 3SE +/- 24.64, N = 3SE +/- 25.52, N = 3SE +/- 19.27, N = 3SE +/- 3.21, N = 34865.244831.754683.553462.394845.494781.454723.983432.881. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3-O3 native-O3 native + PGO-O25001000150020002500SE +/- 1.01, N = 3SE +/- 11.03, N = 3SE +/- 10.35, N = 3SE +/- 2.57, N = 3SE +/- 11.52, N = 3SE +/- 13.99, N = 3SE +/- 7.59, N = 3SE +/- 14.14, N = 32473.982421.382413.242156.612458.502458.232400.462365.871. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3-O2-O3 native-O3 native + PGO120240360480600SE +/- 0.01, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.25, N = 3SE +/- 0.53, N = 3SE +/- 0.02, N = 3SE +/- 9.75, N = 3SE +/- 0.33, N = 3561.85561.79556.37213.98561.29556.37552.08213.951. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

GCC 9.0 SVNGCC 8.2 RC1OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 native-O3 native + PGO-O3-O2400800120016002000SE +/- 0.72, N = 3SE +/- 0.23, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.26, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 31688.091674.991429.31918.961687.641676.151429.81919.021. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

GCC 8.2 RC1GCC 9.0 SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3-O2-O3 native-O3 native + PGO50100150200250SE +/- 0.09, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 3221.30220.52219.38219.09221.28220.80219.24219.091. (CC) gcc options: -lm


Phoronix Test Suite v10.8.4