Clang znver1 Scheduler Model Tests

AMD Ryzen 7 1700 Eight-Core testing with the new Zen znver1 scheduler model. Tests by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1707198-TR-CLANGZNVE46&sor.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay DriverCompilerFile-SystemScreen ResolutionClang 4.0Clang SVN -O3 -march=x86-64 -O3 -march=znver1 -O3 -march=x86-64 -O3 -march=znver1AMD Ryzen 7 1700 Eight-Core @ 3.00GHz (16 Cores)MSI B350 TOMAHAWK (MS-7A34) v1.0AMD Device 145016384MB120GB Samsung SSD 840HIS AMD Radeon HD 7750/8740 / R7 250E 1024MBAMD Cape Verde/PitcairnRealtek RTL8111/8168/8411Ubuntu 17.044.12.0-999-generic (x86_64) 20170622Unity 7.5.0modesetting 1.19.3Clang 4.0.0-1ubuntu1ext41024x768Clang 6.0.0 (SVN 308494)OpenBenchmarking.orgProcessor Details- Scaling Governor: acpi-cpufreq ondemandCompiler Details- Clang SVN: -O3 -march=x86-64, Clang SVN: -O3 -march=znver1: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver1

fftw: Float + SSE - 2D FFT Size 4096scimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationtscp: AI Chess Performancex264: H.264 Video Encodinggraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverebizzy: c-ray: Total Timestockfish: Total Timebullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshencode-flac: WAV To FLACencode-mp3: WAV To MP3encode-wavpack: WAV To WavPacktachyon: Total Timetjbench: Decompression Throughputpgbench: Buffer Test - Normal Load - Read Writepgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Heavy Contention - Read Writeredis: LPOPredis: GETredis: SEThint: FLOATapache: Static Web Page ServingClang 4.0Clang SVN -O3 -march=x86-64 -O3 -march=znver1 -O3 -march=x86-64 -O3 -march=znver1129752124.54623.66134.392654.485534.541675.631054730252.21105571321631331114.0340827814.7237042.854.625.334.892.970.991.196.779.626.505.23163.711899.66226.162021.121890860.671900142.801415121.33301996595.0927221.41130722174.86651.73135.842616.815792.781677.131019964252.73101601351481341115.3640680214.6236712.794.535.144.692.930.991.175.7210.746.445.25166.391938.55226.212061.891908429.461896426.081400655.09301686748.9226808.65136902129.48642.55133.362623.755572.721675.031103331252.331361371731331110.9240850913.6837102.864.625.274.842.961.001.206.779.756.525.29161.781865.09226.041938.931968693.381953368.381385134.04301795000.8526994.80133982152.36657.96134.242597.545692.831679.211096303252.521401391781361022.6538296013.6236702.784.535.104.642.930.991.165.6910.926.455.29165.301813.34224.891875.271960971.501973143.291413713.29300351844.0527700.93OpenBenchmarking.org

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

Clang SVNClang 4.0OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=x86-64-O3 -march=znver13K6K9K12K15KSE +/- 12.52, N = 5SE +/- 55.71, N = 5SE +/- 75.88, N = 5SE +/- 25.06, N = 5136901339813072129751. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Composite

Clang 4.0Clang SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=znver1-O3 -march=x86-645001000150020002500SE +/- 6.84, N = 4SE +/- 10.27, N = 4SE +/- 2.54, N = 4SE +/- 2.34, N = 42174.862124.542152.362129.481. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

Clang SVNClang 4.0OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=znver1-O3 -march=x86-64140280420560700SE +/- 0.67, N = 4SE +/- 0.08, N = 4SE +/- 0.07, N = 4SE +/- 0.39, N = 4657.96642.55651.73623.661. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

Clang 4.0Clang SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=znver1-O3 -march=x86-64306090120150SE +/- 0.22, N = 4SE +/- 0.28, N = 4SE +/- 0.63, N = 4SE +/- 0.55, N = 4135.84134.39134.24133.361. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

Clang 4.0Clang SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=x86-64-O3 -march=znver16001200180024003000SE +/- 3.12, N = 4SE +/- 3.22, N = 4SE +/- 4.33, N = 4SE +/- 5.05, N = 42654.482616.812623.752597.541. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

Clang 4.0Clang SVNOpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=znver1-O3 -march=x86-6412002400360048006000SE +/- 36.57, N = 4SE +/- 49.61, N = 4SE +/- 16.27, N = 4SE +/- 14.40, N = 45792.785534.545692.835572.721. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

Clang SVNClang 4.0OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=znver1-O3 -march=x86-64400800120016002000SE +/- 0.50, N = 4SE +/- 0.35, N = 4SE +/- 0.75, N = 4SE +/- 0.35, N = 41679.211675.031677.131675.631. (CC) gcc options: -O3 -lm

TSCP

AI Chess Performance

Clang SVNClang 4.0OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 -march=x86-64-O3 -march=znver1200K400K600K800K1000KSE +/- 1625.76, N = 5SE +/- 1629.46, N = 5SE +/- 987.82, N = 5SE +/- 1196.29, N = 511033311096303105473010199641. (CC) gcc options: -O3

x264

H.264 Video Encoding

Clang 4.0Clang SVNOpenBenchmarking.orgFrames Per Second, More Is Betterx264 2017-05-19H.264 Video Encoding-O3 -march=znver1-O3 -march=x86-6460120180240300SE +/- 1.15, N = 5SE +/- 0.72, N = 5SE +/- 0.68, N = 5SE +/- 0.34, N = 5252.73252.21252.52252.331. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

GraphicsMagick

Operation: Blur

Clang SVNClang 4.0OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Blur-O3 -march=znver1-O3 -march=x86-643060901201501401361051011. (CC) gcc options: -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

Clang SVNClang 4.0OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Sharpen-O3 -march=znver1-O3 -march=x86-64306090120150SE +/- 0.33, N = 313913760571. (CC) gcc options: -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Resizing-O3 -march=znver1-O3 -march=x86-64306090120150SE +/- 0.33, N = 3135132-march=znver1-march=x86-641. (CC) gcc options: -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

Clang SVNClang 4.0OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: HWB Color Space-O3 -march=znver1-O3 -march=x86-644080120160200SE +/- 0.33, N = 31781731631481. (CC) gcc options: -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

Clang SVNClang 4.0OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Local Adaptive Thresholding-O3 -march=znver1-O3 -march=x86-643060901201501361331341331. (CC) gcc options: -O3 -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

Clang 4.0Clang SVNOpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=znver1-O3 -march=x86-642004006008001000SE +/- 6.87, N = 3SE +/- 0.52, N = 3SE +/- 1.19, N = 3SE +/- 5.42, N = 31115.361114.031110.921022.651. (CC) gcc options: -O3 -mavx2

ebizzy

Clang SVNClang 4.0OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3-O3 -march=x86-64-O3 -march=znver190K180K270K360K450KSE +/- 4432.38, N = 3SE +/- 5536.66, N = 6SE +/- 6544.76, N = 4SE +/- 4344.35, N = 34085093829604082784068021. (CC) gcc options: -pthread -lpthread -O3

C-Ray

Total Time

Clang SVNClang 4.0OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time-O3 -march=znver1-O3 -march=x86-6448121620SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 313.6213.6814.6214.721. (CC) gcc options: -lm -lpthread -O3

Stockfish

Total Time

Clang SVNClang 4.0OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total Time-O3 -march=znver1-O3 -march=x86-648001600240032004000SE +/- 5.03, N = 3SE +/- 1.76, N = 3SE +/- 12.02, N = 3SE +/- 2.96, N = 336703710367137041. (CXX) g++ options: -lpthread -O3 -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt

Bullet Physics Engine

Test: Raytests

Clang SVNClang 4.0OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=znver1-O3 -march=x86-640.64351.2871.93052.5743.2175SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.782.862.792.851. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Clang 4.0Clang SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=znver1-O3 -march=x86-641.03952.0793.11854.1585.1975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.534.624.534.621. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Clang SVNClang 4.0OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=znver1-O3 -march=x86-641.19932.39863.59794.79725.9965SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.105.275.145.331. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Clang SVNClang 4.0OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=znver1-O3 -march=x86-641.10032.20063.30094.40125.5015SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.644.844.694.891. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Clang 4.0Clang SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=znver1-O3 -march=x86-640.66831.33662.00492.67323.3415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.932.972.932.961. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Clang 4.0Clang SVNOpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O3 -march=x86-64-O3 -march=znver10.2250.450.6750.91.125SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.990.990.991.001. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

Clang SVNClang 4.0OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O3 -march=znver1-O3 -march=x86-640.270.540.811.081.35SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.161.201.171.191. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

FLAC Audio Encoding

WAV To FLAC

Clang SVNClang 4.0OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLAC-O3 -march=znver1-O3 -march=x86-64246810SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.06, N = 5SE +/- 0.00, N = 55.696.775.726.771. (CXX) g++ options: -O3 -logg -lm

LAME MP3 Encoding

WAV To MP3

Clang 4.0Clang SVNOpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3-O3 -march=x86-64-O3 -march=znver13691215SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 59.6210.749.7510.921. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

WavPack Audio Encoding

WAV To WavPack

Clang 4.0Clang SVNOpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.1WAV To WavPack-O3 -march=znver1-O3 -march=x86-64246810SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.01, N = 56.446.506.456.521. (CC) gcc options: -O3 -lm

Tachyon

Total Time

Clang 4.0Clang SVNOpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.98.9Total Time-O3 -march=x86-64-O3 -march=znver11.19032.38063.57094.76125.9515SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.235.255.295.291. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

libjpeg-turbo tjbench

Test: Decompression Throughput

Clang 4.0Clang SVNOpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.1Test: Decompression Throughput-O3 -march=znver1-O3 -march=x86-644080120160200SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 3166.39163.71165.30161.781. (CC) gcc options: -O3 -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

Clang 4.0Clang SVNOpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.6.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O3 -march=znver1-O3 -march=x86-64400800120016002000SE +/- 30.13, N = 3SE +/- 33.75, N = 6SE +/- 47.88, N = 6SE +/- 52.04, N = 61938.551899.661865.091813.341. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -fpic -shared

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

Clang 4.0Clang SVNOpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.6.3Scaling: Buffer Test - Test: Single Thread - Mode: Read Write-O3 -march=znver1-O3 -march=x86-6450100150200250SE +/- 0.63, N = 3SE +/- 0.18, N = 3SE +/- 0.39, N = 3SE +/- 1.25, N = 3226.21226.16226.04224.891. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -fpic -shared

PostgreSQL pgbench

Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write

Clang 4.0Clang SVNOpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.6.3Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write-O3 -march=znver1-O3 -march=x86-64400800120016002000SE +/- 36.47, N = 6SE +/- 7.18, N = 3SE +/- 30.24, N = 3SE +/- 30.52, N = 42061.892021.121938.931875.271. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -fpic -shared

Redis

Test: LPOP

Clang SVNClang 4.0OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: LPOP-O3 -march=x86-64-O3 -march=znver1400K800K1200K1600K2000KSE +/- 13701.81, N = 3SE +/- 13594.16, N = 3SE +/- 5577.00, N = 3SE +/- 21807.56, N = 31968693.381960971.501908429.461890860.671. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: GET

Clang SVNClang 4.0OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GET-O3 -march=znver1-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 27406.80, N = 3SE +/- 15420.28, N = 3SE +/- 29446.39, N = 5SE +/- 9354.06, N = 31973143.291953368.381900142.801896426.081. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: SET

Clang 4.0Clang SVNOpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SET-O3 -march=x86-64-O3 -march=znver1300K600K900K1200K1500KSE +/- 4364.72, N = 3SE +/- 8135.91, N = 3SE +/- 23768.12, N = 6SE +/- 8018.20, N = 31415121.331400655.091413713.291385134.041. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Hierarchical INTegration

Test: FLOAT

Clang 4.0Clang SVNOpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O3 -march=x86-64-O3 -march=znver160M120M180M240M300MSE +/- 176504.67, N = 3SE +/- 359206.87, N = 3SE +/- 120507.74, N = 3SE +/- 20484.23, N = 3301996595.09301686748.92301795000.85300351844.051. (CC) gcc options: -O3 -lm

Apache Benchmark

Static Web Page Serving

Clang SVNClang 4.0OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page Serving-O3 -march=znver1-O3 -march=x86-646K12K18K24K30KSE +/- 44.40, N = 3SE +/- 179.13, N = 3SE +/- 320.32, N = 3SE +/- 228.50, N = 327700.9326994.8027221.4126808.651. (CC) gcc options: -shared -fPIC -pthread -O3


Phoronix Test Suite v10.8.4