GCC 7.2 vs. GCC 8 Halloween Znver1 EPYC

AMD EPYC 7601 32-Core testing with a TYAN B8026T70AE24HR and ASPEED ASPEED Family on Ubuntu 17.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1710310-AL-GCC72VSGC26&sor&grt.

Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)
Motherboard: TYAN B8026T70AE24HR
Chipset: AMD Device 1450
Memory: 129024MB
Disk: 120GB Force MP500
Graphics: ASPEED ASPEED Family
Monitor: Acer P243W
Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe
OS: Ubuntu 17.10
Kernel: 4.13.0-16-generic (x86_64)
Desktop: GNOME Shell 3.26.1
Compiler: GCC 7.2.0 (configuration "GCC 7.2.0"), GCC 8.0.0 20171030 (configuration "GCC 8.0.0 20171030")
File-System: ext4
Screen Resolution: 1920x1200

Compiler Details: --disable-multilib --enable-checking=release --enable-languages=c,c++
Processor Details: Scaling Governor: acpi-cpufreq ondemand

Test                                                        GCC 7.2.0    GCC 8.0.0 20171030
c-ray: Total Time                                           3.08         2.74
ffmpeg: H.264 HD To NTSC DV                                 10.69        10.66
fftw: Stock - 1D FFT Size 1024                              8539.10      8416.23
fftw: Stock - 2D FFT Size 1024                              6420.23      6622.90
fftw: Float + SSE - 1D FFT Size 1024                        26763        26876
fftw: Float + SSE - 2D FFT Size 1024                        19834        21131
gmpbench: Total Time                                        3918.80      3926.20
graphics-magick: Blur                                       148          148
graphics-magick: Sharpen                                    177          179
graphics-magick: Resizing                                   174          176
graphics-magick: HWB Color Space                            198          199
graphics-magick: Local Adaptive Thresholding                109          110
hpcg                                                        0.73         0.72
himeno: Poisson Pressure Solver                             936.77       942.44
encode-mp3: WAV To MP3                                      11.20        10.71
tjbench: Decompression Throughput                           141.55       143.79
pgbench: Buffer Test - Normal Load - Read Only              308134.57    303543.17
pgbench: Buffer Test - Single Thread - Read Only            10784.92     10709.13
primesieve: 1e12 Prime Number Generation                    11.91        11.94
redis: LPOP                                                 202065.84    195542.12
redis: SADD                                                 189478.65    193163.62
redis: LPUSH                                                193867.82    192219.37
redis: GET                                                  198580.74    198883.33
redis: SET                                                  192063.14    193210.49
scimark2: Composite                                         1925.72      1933.09
scimark2: Monte Carlo                                       194.45       554.99
scimark2: Fast Fourier Transform                            223.69       223.06
scimark2: Sparse Matrix Multiply                            2376.61      2444.39
scimark2: Dense LU Matrix Factorization                     5150.44      4757.53
scimark2: Jacobi Successive Over-Relaxation                 1683.40      1685.48
smallpt: Global Illumination Renderer; 100 Samples          4            4
stockfish: Total Time                                       4485         4500
tscp: AI Chess Performance                                  861129       853655
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping   407.27       407.58
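As a reading aid, the relative change between the two compilers can be worked out from the overview values. The following is a minimal sketch and not part of the exported result: the helper name `percent_change` and the choice of four tests are illustrative assumptions, and the numbers are copied from the overview above.

```python
# Illustrative sketch: relative change of GCC 8.0.0 20171030 vs. GCC 7.2.0.
# The helper name and the selected subset of tests are assumptions;
# the values themselves are copied from the results overview.

def percent_change(baseline: float, candidate: float) -> float:
    """Return the change from baseline to candidate, in percent."""
    return (candidate - baseline) / baseline * 100.0

# (test, GCC 7.2.0 result, GCC 8.0.0 20171030 result, higher_is_better)
results = [
    ("c-ray: Total Time (s)", 3.08, 2.74, False),
    ("encode-mp3: WAV To MP3 (s)", 11.20, 10.71, False),
    ("scimark2: Monte Carlo (Mflops)", 194.45, 554.99, True),
    ("scimark2: Dense LU Matrix Factorization (Mflops)", 5150.44, 4757.53, True),
]

for name, gcc7, gcc8, higher_is_better in results:
    change = percent_change(gcc7, gcc8)
    # A positive change is an improvement only when higher is better.
    improved = (change > 0) == higher_is_better
    verdict = "better" if improved else "worse"
    print(f"{name}: {change:+.1f}% ({verdict} with GCC 8)")
```

The `higher_is_better` flag matters because the result set mixes time-based tests (fewer is better) with throughput-based ones (more is better).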

C-Ray

Total Time

C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
GCC 8.0.0 20171030: 2.74 (SE +/- 0.07, N = 6)
GCC 7.2.0: 3.08 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=znver1
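One hedged way to judge whether a gap such as the C-Ray one exceeds run-to-run noise is to divide the difference of the two means by their combined standard error. The sketch below assumes the reported SE figures are standard errors of the mean and that the two sets of runs are independent; the export states neither, and the helper name is hypothetical.

```python
import math

def gap_in_standard_errors(mean_a: float, se_a: float,
                           mean_b: float, se_b: float) -> float:
    """Absolute difference of two means divided by their combined standard error.

    Assumes independent measurements, so the variances add.
    """
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)

# C-Ray 1.1, Total Time: GCC 8.0.0 20171030 = 2.74 (SE 0.07),
# GCC 7.2.0 = 3.08 (SE 0.06), as reported in the result above.
ratio = gap_in_standard_errors(2.74, 0.07, 3.08, 0.06)
print(f"C-Ray gap: {ratio:.1f} combined standard errors")
```

With only 3 to 6 runs per result, a ratio like this is a rough indicator rather than a formal significance test.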

FFmpeg

H.264 HD To NTSC DV

FFmpeg 3.3.3, H.264 HD To NTSC DV (Seconds, Fewer Is Better)
GCC 8.0.0 20171030: 10.66 (SE +/- 0.07, N = 3)
GCC 7.2.0: 10.69 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lxcb -lxcb-xfixes -lxcb-shape -lasound -lm -llzma -pthread -O3 -march=znver1 -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FFTW

Build: Stock - Size: 1D FFT Size 1024

FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 1024 (Mflops, More Is Better)
GCC 7.2.0: 8539.10 (SE +/- 4.79, N = 3)
GCC 8.0.0 20171030: 8416.23 (SE +/- 5.10, N = 3)
1. (CC) gcc options: -pthread -O3 -march=znver1 -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 1024 (Mflops, More Is Better)
GCC 8.0.0 20171030: 6622.90 (SE +/- 33.11, N = 3)
GCC 7.2.0: 6420.23 (SE +/- 104.31, N = 4)
1. (CC) gcc options: -pthread -O3 -march=znver1 -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

FFTW 3.3.6, Build: Float + SSE - Size: 1D FFT Size 1024 (Mflops, More Is Better)
GCC 8.0.0 20171030: 26876 (SE +/- 120.67, N = 3)
GCC 7.2.0: 26763 (SE +/- 26.61, N = 3)
1. (CC) gcc options: -pthread -O3 -march=znver1 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 1024 (Mflops, More Is Better)
GCC 8.0.0 20171030: 21131 (SE +/- 57.10, N = 3)
GCC 7.2.0: 19834 (SE +/- 270.69, N = 3)
1. (CC) gcc options: -pthread -O3 -march=znver1 -lm

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.1.2, Total Time (GMPbench Score, More Is Better)
GCC 8.0.0 20171030: 3926.20
GCC 7.2.0: 3918.80
1. (CC) gcc options: -O3 -march=znver1 -lm

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.19, Operation: Blur (Iterations Per Minute, More Is Better)
GCC 8.0.0 20171030: 148
GCC 7.2.0: 148
1. (CC) gcc options: -fopenmp -O3 -march=znver1 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.19, Operation: Sharpen (Iterations Per Minute, More Is Better)
GCC 8.0.0 20171030: 179
GCC 7.2.0: 177
SE +/- 0.33, N = 3 (reported for only one of the two results; the export does not indicate which)
1. (CC) gcc options: -fopenmp -O3 -march=znver1 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.19, Operation: Resizing (Iterations Per Minute, More Is Better)
GCC 8.0.0 20171030: 176 (SE +/- 0.88, N = 3)
GCC 7.2.0: 174 (SE +/- 0.67, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=znver1 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.19, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
GCC 8.0.0 20171030: 199
GCC 7.2.0: 198
1. (CC) gcc options: -fopenmp -O3 -march=znver1 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.19, Operation: Local Adaptive Thresholding (Iterations Per Minute, More Is Better)
GCC 8.0.0 20171030: 110
GCC 7.2.0: 109
1. (CC) gcc options: -fopenmp -O3 -march=znver1 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lX11 -llzma -lz -lm -ldl -lpthread

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.0 (GFLOP/s, More Is Better)
GCC 7.2.0: 0.73 (SE +/- 0.01, N = 3)
GCC 8.0.0 20171030: 0.72 (SE +/- 0.00, N = 3)

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
GCC 8.0.0 20171030: 942.44 (SE +/- 1.81, N = 3)
GCC 7.2.0: 936.77 (SE +/- 21.77, N = 6)
1. (CC) gcc options: -O3 -march=znver1 -mavx2

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.5, WAV To MP3 (Seconds, Fewer Is Better)
GCC 8.0.0 20171030: 10.71 (SE +/- 0.00, N = 5)
GCC 7.2.0: 11.20 (SE +/- 0.01, N = 5)
1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=znver1 -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 1.5.1, Test: Decompression Throughput (Megapixels/sec, More Is Better)
GCC 8.0.0 20171030: 143.79 (SE +/- 0.02, N = 3)
GCC 7.2.0: 141.55 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -O3 -march=znver1 -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.0, Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better)
GCC 7.2.0: 308134.57 (SE +/- 1439.36, N = 3)
GCC 8.0.0 20171030: 303543.17 (SE +/- 3027.80, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=znver1 -fPIC -shared

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Only

PostgreSQL pgbench 10.0, Scaling: Buffer Test - Test: Single Thread - Mode: Read Only (TPS, More Is Better)
GCC 7.2.0: 10784.92 (SE +/- 140.64, N = 3)
GCC 8.0.0 20171030: 10709.13 (SE +/- 47.06, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=znver1 -fPIC -shared

Primesieve

1e12 Prime Number Generation

Primesieve 6.2, 1e12 Prime Number Generation (Seconds, Fewer Is Better)
GCC 7.2.0: 11.91 (SE +/- 0.03, N = 3)
GCC 8.0.0 20171030: 11.94 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -march=znver1 -rdynamic -lpthread

Redis

Test: LPOP

Redis 3.0.1, Test: LPOP (Requests Per Second, More Is Better)
GCC 7.2.0: 202065.84 (SE +/- 3741.75, N = 3)
GCC 8.0.0 20171030: 195542.12 (SE +/- 3274.37, N = 6)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread

Redis

Test: SADD

Redis 3.0.1, Test: SADD (Requests Per Second, More Is Better)
GCC 8.0.0 20171030: 193163.62 (SE +/- 2485.81, N = 3)
GCC 7.2.0: 189478.65 (SE +/- 303.66, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread

Redis

Test: LPUSH

Redis 3.0.1, Test: LPUSH (Requests Per Second, More Is Better)
GCC 7.2.0: 193867.82 (SE +/- 2952.51, N = 6)
GCC 8.0.0 20171030: 192219.37 (SE +/- 3068.09, N = 4)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread

Redis

Test: GET

Redis 3.0.1, Test: GET (Requests Per Second, More Is Better)
GCC 8.0.0 20171030: 198883.33 (SE +/- 2615.42, N = 6)
GCC 7.2.0: 198580.74 (SE +/- 3940.07, N = 6)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread

Redis

Test: SET

Redis 3.0.1, Test: SET (Requests Per Second, More Is Better)
GCC 8.0.0 20171030: 193210.49 (SE +/- 3816.59, N = 3)
GCC 7.2.0: 192063.14 (SE +/- 1582.98, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
GCC 8.0.0 20171030: 1933.09 (SE +/- 7.80, N = 4)
GCC 7.2.0: 1925.72 (SE +/- 8.62, N = 4)
1. (CC) gcc options: -O3 -march=znver1 -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
GCC 8.0.0 20171030: 554.99 (SE +/- 0.07, N = 4)
GCC 7.2.0: 194.45 (SE +/- 0.01, N = 4)
1. (CC) gcc options: -O3 -march=znver1 -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
GCC 7.2.0: 223.69 (SE +/- 0.12, N = 4)
GCC 8.0.0 20171030: 223.06 (SE +/- 0.07, N = 4)
1. (CC) gcc options: -O3 -march=znver1 -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
GCC 8.0.0 20171030: 2444.39 (SE +/- 9.19, N = 4)
GCC 7.2.0: 2376.61 (SE +/- 13.62, N = 4)
1. (CC) gcc options: -O3 -march=znver1 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
GCC 7.2.0: 5150.44 (SE +/- 37.84, N = 4)
GCC 8.0.0 20171030: 4757.53 (SE +/- 31.33, N = 4)
1. (CC) gcc options: -O3 -march=znver1 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
GCC 8.0.0 20171030: 1685.48 (SE +/- 0.66, N = 4)
GCC 7.2.0: 1683.40 (SE +/- 0.70, N = 4)
1. (CC) gcc options: -O3 -march=znver1 -lm

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)
GCC 7.2.0: 4 (SE +/- 0.17, N = 6)
GCC 8.0.0 20171030: 4 (SE +/- 0.17, N = 6)
1. (CXX) g++ options: -fopenmp -O3 -march=znver1

Stockfish

Total Time

Stockfish 2014-11-26, Total Time (ms, Fewer Is Better)
GCC 7.2.0: 4485 (SE +/- 56.11, N = 3)
GCC 8.0.0 20171030: 4500 (SE +/- 55.17, N = 3)
1. (CXX) g++ options: -lpthread -O3 -march=znver1 -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt -flto

TSCP

AI Chess Performance

TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
GCC 7.2.0: 861129 (SE +/- 329.95, N = 5)
GCC 8.0.0 20171030: 853655 (SE +/- 264.40, N = 5)
1. (CC) gcc options: -O3 -march=znver1 -march=native

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.3a, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)
GCC 8.0.0 20171030: 407.58 (SE +/- 6.79, N = 4)
GCC 7.2.0: 407.27 (SE +/- 6.30, N = 6)
1. (CXX) g++ options: -O3 -march=znver1 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++


Phoronix Test Suite v10.8.5