Ryzen 9 3900X Znver2 Compiler Tuning

AMD Ryzen 9 3900X 12-Core testing of GCC 9 and GCC 10 development with Znver2 tuning following recent cost table updates, etc. Benchmarks by Michael Larabel for a future article..

HTML result view exported from: https://openbenchmarking.org/result/1907290-HV-RYZEN939034&obr_sgm=y&sro&grt.

Ryzen 9 3900X Znver2 Compiler TuningProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0GCC 10.0.0 znver2AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 9.1.0ext43840x2160GCC 10.0.0 20190727OpenBenchmarking.orgEnvironment Details- GCC 9.1.0: CXXFLAGS=-O3 CFLAGS=-O3- GCC 9.1.0 znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2- GCC 10.0.0: CXXFLAGS=-O3 CFLAGS=-O3- GCC 10.0.0 znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2Compiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: acpi-cpufreq ondemandPython Details- Python 2.7.15+ + Python 3.6.8Security Details- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling

Ryzen 9 3900X Znver2 Compiler Tuningaobench: 2048 x 2048 - Total Timeaom-av1: AV1 Video Encodingapache: Static Web Page Servingapache-siege: 200apache-siege: 250bullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshc-ray: Total Time - 4K, 16 Rays Per Pixelcoremark: CoreMark Size 666 - Iterations Per Secondcpp-perf-bench: Atolcpp-perf-bench: Ctypecpp-perf-bench: Math Librarycpp-perf-bench: Rand Numberscpp-perf-bench: Stepanov Vectorcpp-perf-bench: Function Objectscpp-perf-bench: Stepanov Abstractioncpuminer-opt: m7mcpuminer-opt: deepcpuminer-opt: lbrycpuminer-opt: skeincpuminer-opt: myr-grcpuminer-opt: sha256tffmpeg: H.264 HD To NTSC DVfftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 512fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 32encode-flac: WAV To FLACmpcbench: Multi-Precision Benchmarkgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacegromacs: Water Benchmarkhpcg: himeno: Poisson Pressure Solverhpcc: G-HPLhpcc: G-Fftehpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthjohn-the-ripper: Blowfishencode-mp3: WAV To MP3tjbench: Decompression Throughputlzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Libdeflate 1 - Compressionlzbench: Libdeflate 1 - Decompressionm-queens: Time To Solvemcperf: Getmcperf: Setmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32nginx: Static Web Page Servingencode-ogg: WAV To Oggopenssl: RSA 4096-bit Performancepgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writeredis: GETredis: SETscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationsmallpt: Global Illumination Renderer; 128 Samplessockperf: Throughputsockperf: Latency Ping Pongstockfish: Total Timesvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodesvt-vp9: 1080p 8-bit YUV To VP9 Video Encodebuild-llvm: Time To Compilebuild-php: Time To Compiletscp: AI Chess Performancex264: H.264 Video Encodingx265: H.265 1080p Video Encodingcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0GCC 10.0.0 znver234.600.2738392.2960835.7998050.911.983.203.883.512.050.750.8943.09567987.3459.3131.52309.36751.1576.4514.4027.60593.8911190345833939714127879516.8612958119099028.107063.03452537.7095972512621812092751702870.981.091322.9070.969008.598038.5980332.830832.732551.708200.097570.325964.8916124227.253203357.25218.09391164681269494239111947.1292376.4052914.10152.511523.52117.6019613.70221.2359.002527.5051813.051147.6239734.855.133516.27300353.0929178.233297713.332122162.942768.16761.38295.183767.636891.372125.267.785145513.123927896446.45246.0189.99280.2752.711305781139.5952.9425.2333.200.3138022.7999824.4996842.132.043.223.773.572.040.770.9039.42555154.6060.3831.43302.82750.6674.0814.1528.93590.6610230.34344203979714023872386.831182814314108147920.17449517.9995772592631952212801712930.991.081378.4671.776638.605148.6051432.602632.952251.716680.097980.326984.9883223832.608202536.94225.64401164681268515257118347.2793850.5959232.07155.341599.68118.4719696.57217.0257.972520.0152238.801153.4639602.495.053481.50297539.8929149.203066070.282169531.003686.60800.23273.493580.7311370.272408.267.675170953.153956165546.39247.3396.54284.1053.911337188138.4152.5325.2535.980.2738490.9882293.1462725.242.113.444.113.732.200.810.9542.63568329.0059.9731.51307.23799.8874.2615.1028.19591.3211137352883972014130864176.8812748129029583.737071.30453617.7295802542621812082741702880.971.091385.2371.070108.817488.8174832.863432.729741.722050.097780.331864.9494723885.438204267.28220.33401134671287507250114747.1495710.6057193.25154.091582.78116.6219803.57212.8358.162507.1650039.131145.0139525.705.053492.53300244.8129372.393042507.472051361.333127.49777.17301.173856.638526.662175.857.845147483.033963199346.22248.8589.84292.5354.431366017138.7453.0025.2633.050.3238009.2583275.06102423.072.063.273.853.602.050.770.9139.36567096.6563.3431.30306.02787.7777.2214.9028.30590.8011123346303984314137864406.781411314119105317823.27463058.1193572642771962232861733020.981.081385.8871.049308.637948.6379432.843632.947301.730550.097710.325215.0460323993.043204267.45225.44371084531250499248115947.2197228.2752910.87157.721556.91118.0219694.33218.6656.872543.9350679.531145.9539346.915.363487.10298969.7529148.603031706.222084989.883553.67759.97261.103675.9410777.882293.467.535296573.043954032846.49247.9992.35300.3153.761408752139.8252.4025.39OpenBenchmarking.org

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2816243240SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 335.9833.0534.6033.20-march=znver2-march=znver21. (CC) gcc options: -lm -O3

AOM AV1

AV1 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-02-11AV1 Video EncodingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.0720.1440.2160.2880.36SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.270.320.270.31-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28K16K24K32K40KSE +/- 65.39, N = 3SE +/- 79.10, N = 3SE +/- 57.64, N = 3SE +/- 139.15, N = 338490.9838009.2538392.2938022.79-march=znver2-march=znver21. (CC) gcc options: -shared -fPIC -pthread -O3

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 200GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 3302.37, N = 12SE +/- 1288.23, N = 15SE +/- 798.56, N = 3SE +/- 3575.15, N = 1582293.1483275.0660835.7999824.49-march=znver2-march=znver21. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 122.71, N = 3SE +/- 1636.75, N = 12SE +/- 3755.13, N = 15SE +/- 4063.46, N = 1262725.24102423.0798050.9196842.13-march=znver2-march=znver21. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.47480.94961.42441.89922.374SE +/- 0.00, N = 6SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.112.061.982.04-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.7741.5482.3223.0963.87SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 33.443.273.203.22-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.92481.84962.77443.69924.624SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 34.113.853.883.77-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.83931.67862.51793.35724.1965SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 33.733.603.513.57-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.4950.991.4851.982.475SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.202.052.052.04-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.18230.36460.54690.72920.9115SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.810.770.750.77-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.21380.42760.64140.85521.069SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.950.910.890.90-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21020304050SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 342.6339.3643.0939.42-march=znver2-march=znver21. (CC) gcc options: -lm -lpthread -O3

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2120K240K360K480K600KSE +/- 1210.22, N = 3SE +/- 1036.74, N = 3SE +/- 1430.19, N = 3SE +/- 2761.64, N = 3568329.00567096.65567987.34555154.60-march=znver2-march=znver21. (CC) gcc options: -O2 -O3 -lrt" -lrt

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21428425670SE +/- 0.17, N = 3SE +/- 0.06, N = 3SE +/- 0.30, N = 3SE +/- 0.53, N = 1159.9763.3459.3160.38-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2714212835SE +/- 0.38, N = 5SE +/- 0.03, N = 3SE +/- 0.28, N = 3SE +/- 0.14, N = 331.5131.3031.5231.43-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350SE +/- 4.29, N = 3SE +/- 2.37, N = 3SE +/- 0.26, N = 3SE +/- 3.91, N = 3307.23306.02309.36302.82-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Random Numbers

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22004006008001000SE +/- 4.15, N = 3SE +/- 10.35, N = 5SE +/- 2.69, N = 3SE +/- 0.27, N = 3799.88787.77751.15750.66-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220406080100SE +/- 0.88, N = 3SE +/- 0.04, N = 3SE +/- 0.35, N = 3SE +/- 0.12, N = 374.2677.2276.4574.08-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver248121620SE +/- 0.17, N = 3SE +/- 0.20, N = 4SE +/- 0.08, N = 3SE +/- 0.03, N = 315.1014.9014.4014.15-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2714212835SE +/- 0.08, N = 3SE +/- 0.45, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 328.1928.3027.6028.93-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

Cpuminer-Opt

Algorithm: m7m

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: m7mGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2130260390520650SE +/- 0.27, N = 3SE +/- 0.35, N = 3SE +/- 0.29, N = 3SE +/- 0.15, N = 3591.32590.80593.89590.66-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: deep

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: deepGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 3.33, N = 3SE +/- 8.82, N = 3SE +/- 926.03, N = 1211137.0011123.0011190.0010230.34-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: lbry

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: lbryGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28K16K24K32K40KSE +/- 460.86, N = 5SE +/- 20.82, N = 3SE +/- 550.28, N = 3SE +/- 5.77, N = 335288346303458334420-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: skein

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: skeinGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver29K18K27K36K45KSE +/- 5.77, N = 3SE +/- 133.46, N = 3SE +/- 602.50, N = 3SE +/- 21.86, N = 339720398433939739797-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: myr-gr

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: myr-grGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver23K6K9K12K15KSE +/- 40.00, N = 3SE +/- 6.67, N = 3SE +/- 49.78, N = 3SE +/- 26.03, N = 314130141371412714023-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: sha256t

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: sha256tGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 116.81, N = 3SE +/- 180.83, N = 3SE +/- 990.16, N = 7SE +/- 1027.26, N = 686417864408795187238-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 36.886.786.866.83-march=znver2-march=znver21. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -O3 -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver23K6K9K12K15KSE +/- 110.06, N = 3SE +/- 5.51, N = 3SE +/- 1.76, N = 3SE +/- 15.90, N = 312748141131295811828-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver23K6K9K12K15KSE +/- 2.19, N = 3SE +/- 155.95, N = 3SE +/- 141.66, N = 312902141191190914314-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 19.17, N = 3SE +/- 10.67, N = 3SE +/- 30.08, N = 3SE +/- 148.34, N = 49583.7310531.009028.1010814.00-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 40.30, N = 3SE +/- 95.02, N = 3SE +/- 73.85, N = 3SE +/- 67.62, N = 37071.307823.277063.037920.17-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver210K20K30K40K50KSE +/- 54.85, N = 3SE +/- 28.47, N = 3SE +/- 663.38, N = 4SE +/- 105.51, N = 345361463054525344951-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.03, N = 5SE +/- 0.04, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 57.728.117.707.99-march=znver2-march=znver21. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

GNU MPC

Multi-Precision Benchmark

OpenBenchmarking.orgGlobal Score, More Is BetterGNU MPC 1.1.0Multi-Precision BenchmarkGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 26.46, N = 3SE +/- 102.03, N = 3SE +/- 31.80, N = 3SE +/- 50.44, N = 39580935795979577-march=znver2-march=znver21. (CC) gcc options: -lm -O3 -MT -MD -MP -MF

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 0.88, N = 3SE +/- 1.86, N = 3SE +/- 1.20, N = 3254264251259-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 4.33, N = 3SE +/- 1.86, N = 3SE +/- 1.20, N = 3262277262263-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24080120160200SE +/- 0.33, N = 3181196181195-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 1.20, N = 3208223209221-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 2.65, N = 3SE +/- 1.53, N = 3SE +/- 2.19, N = 3SE +/- 1.15, N = 3274286275280-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24080120160200SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 3170173170171-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350SE +/- 2.19, N = 3SE +/- 0.33, N = 3SE +/- 2.60, N = 3SE +/- 2.19, N = 3288302287293-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water BenchmarkGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.970.980.980.99-march=znver2-march=znver21. (CXX) g++ options: -march=core-avx2 -O3 -std=c++11 -funroll-all-loops -fopenmp -lrt -lpthread -lm

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.24530.49060.73590.98121.2265SE +/- 0.01, N = 4SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.091.081.091.08

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 2.93, N = 3SE +/- 0.48, N = 3SE +/- 11.21, N = 3SE +/- 6.19, N = 31385.231385.881322.901378.46-march=znver2-march=znver21. (CC) gcc options: -O3 -mavx2

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21632486480SE +/- 0.08, N = 3SE +/- 0.22, N = 3SE +/- 0.23, N = 3SE +/- 0.37, N = 371.0771.0570.9771.78-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.18198, N = 3SE +/- 0.02559, N = 3SE +/- 0.02013, N = 3SE +/- 0.06300, N = 38.817488.637948.598038.60514-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.18198, N = 3SE +/- 0.02559, N = 3SE +/- 0.02013, N = 3SE +/- 0.06300, N = 38.817488.637948.598038.60514-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2816243240SE +/- 0.11, N = 3SE +/- 0.22, N = 3SE +/- 0.19, N = 3SE +/- 0.42, N = 332.8632.8432.8332.60-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.66431.32861.99292.65723.3215SE +/- 0.00151, N = 3SE +/- 0.00082, N = 3SE +/- 0.00047, N = 3SE +/- 0.00095, N = 32.729742.947302.732552.95225-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.38940.77881.16821.55761.947SE +/- 0.00098, N = 3SE +/- 0.00091, N = 3SE +/- 0.00015, N = 3SE +/- 0.00081, N = 31.722051.730551.708201.71668-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.0220.0440.0660.0880.11SE +/- 0.00041, N = 3SE +/- 0.00044, N = 3SE +/- 0.00036, N = 3SE +/- 0.00042, N = 30.097780.097710.097570.09798-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.07470.14940.22410.29880.3735SE +/- 0.00125, N = 3SE +/- 0.00071, N = 3SE +/- 0.00047, N = 3SE +/- 0.00042, N = 30.331860.325210.325960.32698-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21.13542.27083.40624.54165.677SE +/- 0.05697, N = 3SE +/- 0.04322, N = 3SE +/- 0.02698, N = 3SE +/- 0.07571, N = 34.949475.046034.891614.98832-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver25K10K15K20K25KSE +/- 62.37, N = 3SE +/- 195.64, N = 3SE +/- 159.70, N = 3SE +/- 119.42, N = 323885.4423993.0424227.2523832.61-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24K8K12K16K20KSE +/- 63.74, N = 3SE +/- 64.22, N = 3SE +/- 64.93, N = 3SE +/- 63.01, N = 3204262042620335202531. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 37.287.457.256.94-march=znver2-march=znver21. (CC) gcc options: -O3 -lncurses -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 2.32, N = 3SE +/- 0.30, N = 3SE +/- 0.44, N = 3SE +/- 0.31, N = 3220.33225.44218.09225.64-march=znver2-march=znver21. (CC) gcc options: -O3 -rdynamic

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2918273645SE +/- 0.33, N = 3403739401. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: DecompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 0.33, N = 31131081161161. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2100200300400500SE +/- 0.33, N = 3SE +/- 3.18, N = 3SE +/- 4.91, N = 84674534684681. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: DecompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 9.50, N = 3SE +/- 12.79, N = 812871250126912681. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2110220330440550SE +/- 0.88, N = 3SE +/- 4.47, N = 11SE +/- 0.67, N = 3SE +/- 4.10, N = 35074994945151. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 0.67, N = 3SE +/- 1.86, N = 32502482392571. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: DecompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 0.33, N = 3SE +/- 10.00, N = 311471159111911831. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21122334455SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 347.1447.2147.1247.27-march=znver2-march=znver21. (CXX) g++ options: -fopenmp -O3 -O2 -march=native

Memcached mcperf

Method: Get

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: GetGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 1025.20, N = 15SE +/- 1267.59, N = 3SE +/- 1551.16, N = 3SE +/- 937.65, N = 1595710.6097228.2792376.4093850.59-march=znver2-march=znver21. (CC) gcc options: -O3 -lm -rdynamic

Memcached mcperf

Method: Set

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: SetGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver213K26K39K52K65KSE +/- 2058.38, N = 15SE +/- 393.33, N = 3SE +/- 293.82, N = 3SE +/- 3850.96, N = 1557193.2552910.8752914.1059232.07-march=znver2-march=znver21. (CC) gcc options: -O3 -lm -rdynamic

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 3.18, N = 12SE +/- 3.21, N = 14SE +/- 3.35, N = 15SE +/- 2.27, N = 15154.09157.72152.51155.34MIN: 129-march=znver2 - MIN: 127MIN: 111.42-march=znver2 - MIN: 127.991. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 5.99, N = 3SE +/- 25.50, N = 3SE +/- 7.48, N = 3SE +/- 17.21, N = 31582.781556.911523.521599.68MIN: 1385.56-march=znver2 - MIN: 1368.2MIN: 1357.02-march=znver2 - MIN: 1393.731. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 0.79, N = 3SE +/- 1.48, N = 4SE +/- 0.16, N = 3SE +/- 0.45, N = 3116.62118.02117.60118.47MIN: 102.39-march=znver2 - MIN: 102.11MIN: 103.13-march=znver2 - MIN: 103.471. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24K8K12K16K20KSE +/- 87.03, N = 3SE +/- 41.11, N = 3SE +/- 42.61, N = 3SE +/- 22.35, N = 319803.5719694.3319613.7019696.57MIN: 19014.9-march=znver2 - MIN: 18995.6MIN: 18961.5-march=znver2 - MIN: 19033.51. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 0.29, N = 3SE +/- 1.85, N = 15SE +/- 2.00, N = 15SE +/- 1.79, N = 13212.83218.66221.23217.02MIN: 201.7-march=znver2 - MIN: 203.42MIN: 202.07-march=znver2 - MIN: 203.651. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21326395265SE +/- 0.69, N = 15SE +/- 0.58, N = 8SE +/- 0.66, N = 7SE +/- 0.49, N = 1558.1656.8759.0057.97MIN: 50.91-march=znver2 - MIN: 50.96MIN: 50.8-march=znver2 - MIN: 51.571. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver25001000150020002500SE +/- 6.13, N = 3SE +/- 17.20, N = 3SE +/- 9.57, N = 3SE +/- 9.61, N = 32507.162543.932527.502520.01MIN: 2461.57-march=znver2 - MIN: 2467.76MIN: 2462.11-march=znver2 - MIN: 2467.071. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver211K22K33K44K55KSE +/- 224.88, N = 3SE +/- 390.75, N = 3SE +/- 589.20, N = 6SE +/- 668.40, N = 350039.1350679.5351813.0552238.80MIN: 46883.1-march=znver2 - MIN: 48056.6MIN: 48543.1-march=znver2 - MIN: 49224.91. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22004006008001000SE +/- 6.39, N = 3SE +/- 5.61, N = 3SE +/- 6.51, N = 3SE +/- 6.23, N = 31145.011145.951147.621153.46MIN: 1050.58-march=znver2 - MIN: 1052.71MIN: 1052.13-march=znver2 - MIN: 1057.541. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page ServingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver29K18K27K36K45KSE +/- 158.42, N = 3SE +/- 23.74, N = 3SE +/- 102.83, N = 3SE +/- 112.05, N = 339525.7039346.9139734.8539602.49-march=znver2-march=znver21. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Ogg Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Encoding 1.3.3WAV To OggGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21.2062.4123.6184.8246.03SE +/- 0.00, N = 3SE +/- 0.00, N = 4SE +/- 0.00, N = 3SE +/- 0.01, N = 35.055.365.135.05-march=znver2-march=znver21. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -logg

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28001600240032004000SE +/- 1.89, N = 3SE +/- 0.70, N = 3SE +/- 7.07, N = 3SE +/- 1.42, N = 33492.533487.103516.273481.50-march=znver2-march=znver21. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260K120K180K240K300KSE +/- 102.78, N = 3SE +/- 237.85, N = 3SE +/- 513.53, N = 3SE +/- 235.79, N = 3300244.81298969.75300353.09297539.89-march=znver2-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver26K12K18K24K30KSE +/- 31.16, N = 3SE +/- 124.84, N = 3SE +/- 55.36, N = 3SE +/- 40.41, N = 329372.3929148.6029178.2329149.20-march=znver2-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2700K1400K2100K2800K3500KSE +/- 47460.73, N = 15SE +/- 51486.64, N = 15SE +/- 40781.06, N = 3SE +/- 61029.58, N = 153042507.473031706.223297713.333066070.281. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2500K1000K1500K2000K2500KSE +/- 28123.08, N = 3SE +/- 14796.01, N = 3SE +/- 30290.32, N = 4SE +/- 19021.82, N = 32051361.332084989.882122162.942169531.001. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28001600240032004000SE +/- 5.96, N = 3SE +/- 13.97, N = 3SE +/- 25.64, N = 3SE +/- 5.91, N = 33127.493553.672768.163686.60-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22004006008001000SE +/- 0.24, N = 3SE +/- 0.29, N = 3SE +/- 7.16, N = 3SE +/- 0.74, N = 3777.17759.97761.38800.23-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350SE +/- 0.54, N = 3SE +/- 0.24, N = 3SE +/- 2.85, N = 3SE +/- 0.21, N = 3301.17261.10295.18273.49-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28001600240032004000SE +/- 15.78, N = 3SE +/- 58.43, N = 3SE +/- 37.65, N = 3SE +/- 13.73, N = 33856.633675.943767.633580.73-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 19.91, N = 3SE +/- 12.24, N = 3SE +/- 60.72, N = 3SE +/- 15.98, N = 38526.6610777.886891.3711370.27-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver25001000150020002500SE +/- 0.16, N = 3SE +/- 0.89, N = 3SE +/- 20.16, N = 3SE +/- 0.53, N = 32175.852293.462125.262408.26-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 37.847.537.787.67-march=znver2-march=znver21. (CXX) g++ options: -fopenmp -O3

Sockperf

Test: Throughput

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: ThroughputGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2110K220K330K440K550KSE +/- 5409.10, N = 5SE +/- 3715.76, N = 18SE +/- 4767.11, N = 5SE +/- 4175.03, N = 5514748529657514551517095-march=znver2-march=znver21. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Sockperf

Test: Latency Ping Pong

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping PongGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.70881.41762.12642.83523.544SE +/- 0.02, N = 25SE +/- 0.02, N = 25SE +/- 0.04, N = 5SE +/- 0.04, N = 63.033.043.123.15-march=znver2-march=znver21. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28M16M24M32M40MSE +/- 237875.03, N = 3SE +/- 131167.27, N = 3SE +/- 210046.69, N = 3SE +/- 164232.11, N = 339631993395403283927896439561655-march=znver2-march=znver21. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21122334455SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.19, N = 346.2246.4946.4546.39-march=znver2-march=znver21. (CXX) g++ options: -O3 -pie -lpthread -lm

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 1.73, N = 3SE +/- 1.78, N = 3SE +/- 3.72, N = 3SE +/- 0.72, N = 3248.85247.99246.01247.33-march=znver2-march=znver21. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video EncodeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220406080100SE +/- 0.15, N = 3SE +/- 0.19, N = 3SE +/- 0.28, N = 3SE +/- 0.08, N = 389.8492.3589.9996.54-march=znver2-march=znver21. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350292.53300.31280.27284.10

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21224364860SE +/- 0.09, N = 3SE +/- 0.51, N = 3SE +/- 0.15, N = 3SE +/- 0.36, N = 354.4353.7652.7153.91-march=znver2-march=znver21. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2300K600K900K1200K1500KSE +/- 676.60, N = 5SE +/- 6261.48, N = 5SE +/- 620.00, N = 5SE +/- 10688.23, N = 51366017140875213057811337188-march=znver2-march=znver21. (CC) gcc options: -O3 -march=native

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 2.27, N = 3SE +/- 2.09, N = 4SE +/- 1.55, N = 7SE +/- 2.03, N = 3138.74139.82139.59138.41-march=znver2-march=znver21. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21224364860SE +/- 0.20, N = 3SE +/- 0.19, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 353.0052.4052.9452.53-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2612182430SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 325.2625.3925.2325.25-march=znver2-march=znver21. (CC) gcc options: -pthread -fvisibility=hidden -O3

Geometric Mean Of All Test Results

Result Composite - Ryzen 9 3900X Znver2 Compiler Tuning

OpenBenchmarking.orgGeometric Mean, More Is BetterGeometric Mean Of All Test ResultsResult Composite - Ryzen 9 3900X Znver2 Compiler TuningGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24080120160200162.54165.41162.44166.33


Phoronix Test Suite v10.8.5