Ryzen 9 3900X Znver2 Compiler Tuning

AMD Ryzen 9 3900X 12-Core testing of GCC 9 and GCC 10 development with Znver2 tuning following recent cost table updates, etc. Benchmarks by Michael Larabel for a future article..

HTML result view exported from: https://openbenchmarking.org/result/1907290-HV-RYZEN939034&sro.

Ryzen 9 3900X Znver2 Compiler TuningProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0GCC 10.0.0 znver2AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 9.1.0ext43840x2160GCC 10.0.0 20190727OpenBenchmarking.orgEnvironment Details- GCC 9.1.0: CXXFLAGS=-O3 CFLAGS=-O3- GCC 9.1.0 znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2- GCC 10.0.0: CXXFLAGS=-O3 CFLAGS=-O3- GCC 10.0.0 znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2Compiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: acpi-cpufreq ondemandPython Details- Python 2.7.15+ + Python 3.6.8Security Details- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling

Ryzen 9 3900X Znver2 Compiler Tuningsockperf: Throughputsockperf: Latency Ping Pongmpcbench: Multi-Precision Benchmarkhpcc: G-HPLhpcc: G-Fftehpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthhpcg: lzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Libdeflate 1 - Compressionlzbench: Libdeflate 1 - Decompressionfftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 512fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 32scimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationtscp: AI Chess Performancejohn-the-ripper: Blowfishmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32aom-av1: AV1 Video Encodingsvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodesvt-vp9: 1080p 8-bit YUV To VP9 Video Encodex264: H.264 Video Encodingx265: H.265 1080p Video Encodinggraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacehimeno: Poisson Pressure Solverstockfish: Total Timebuild-llvm: Time To Compilebuild-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelsmallpt: Global Illumination Renderer; 128 Samplesaobench: 2048 x 2048 - Total Timebullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9encode-flac: WAV To FLACencode-mp3: WAV To MP3encode-ogg: WAV To Oggffmpeg: H.264 HD To NTSC DVm-queens: Time To Solveopenssl: RSA 4096-bit Performancecpuminer-opt: m7mcpuminer-opt: deepcpuminer-opt: lbrycpuminer-opt: skeincpuminer-opt: myr-grcpuminer-opt: sha256ttjbench: Decompression Throughputgromacs: Water Benchmarkpgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writecpp-perf-bench: Atolcpp-perf-bench: Ctypecpp-perf-bench: Math Librarycpp-perf-bench: Rand Numberscpp-perf-bench: Stepanov Vectorcpp-perf-bench: Function Objectscpp-perf-bench: Stepanov Abstractionredis: GETredis: SETmcperf: Getmcperf: Setnginx: Static Web Page Servingapache: Static Web Page Servingapache-siege: 200apache-siege: 250coremark: CoreMark Size 666 - Iterations Per SecondGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0GCC 10.0.0 znver25145513.12959770.969008.598038.5980332.830832.732551.708200.097570.325964.8916124227.2531.09391164681269494239111912958119099028.107063.03452532768.16761.38295.183767.636891.372125.26130578120335152.511523.52117.6019613.70221.2359.002527.5051813.051147.620.2746.45246.0189.99139.5952.942512621812092751702871322.9039278964280.2752.7143.097.7834.601.983.203.883.512.050.750.8925.237.707.255.136.8647.123516.27593.891119034583393971412787951218.090.98300353.0929178.2359.3131.52309.36751.1576.4514.4027.603297713.332122162.9492376.4052914.1039734.8538392.2960835.7998050.91567987.345170953.15957771.776638.605148.6051432.602632.952251.716680.097980.326984.9883223832.6081.0840116468126851525711831182814314108147920.17449513686.60800.23273.493580.7311370.272408.26133718820253155.341599.68118.4719696.57217.0257.972520.0152238.801153.460.3146.39247.3396.54138.4152.532592631952212801712931378.4639561655284.1053.9139.427.6733.202.043.223.773.572.040.770.9025.257.996.945.056.8347.273481.50590.6610230.3434420397971402387238225.640.99297539.8929149.2060.3831.43302.82750.6674.0814.1528.933066070.282169531.0093850.5959232.0739602.4938022.7999824.4996842.13555154.605147483.03958071.070108.817488.8174832.863432.729741.722050.097780.331864.9494723885.4381.09401134671287507250114712748129029583.737071.30453613127.49777.17301.173856.638526.662175.85136601720426154.091582.78116.6219803.57212.8358.162507.1650039.131145.010.2746.22248.8589.84138.7453.002542621812082741702881385.2339631993292.5354.4342.637.8435.982.113.444.113.732.200.810.9525.267.727.285.056.8847.143492.53591.321113735288397201413086417220.330.97300244.8129372.3959.9731.51307.23799.8874.2615.1028.193042507.472051361.3395710.6057193.2539525.7038490.9882293.1462725.24568329.005296573.04935771.049308.637948.6379432.843632.947301.730550.097710.325215.0460323993.0431.0837108453125049924811591411314119105317823.27463053553.67759.97261.103675.9410777.882293.46140875220426157.721556.91118.0219694.33218.6656.872543.9350679.531145.950.3246.49247.9992.35139.8252.402642771962232861733021385.8839540328300.3153.7639.367.5333.052.063.273.853.602.050.770.9125.398.117.455.366.7847.213487.10590.801112334630398431413786440225.440.98298969.7529148.6063.3431.30306.02787.7777.2214.9028.303031706.222084989.8897228.2752910.8739346.9138009.2583275.06102423.07567096.65OpenBenchmarking.org

Sockperf

Test: Throughput

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: ThroughputGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2110K220K330K440K550KSE +/- 5409.10, N = 5SE +/- 3715.76, N = 18SE +/- 4767.11, N = 5SE +/- 4175.03, N = 5514748529657514551517095-march=znver2-march=znver21. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Sockperf

Test: Latency Ping Pong

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping PongGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.70881.41762.12642.83523.544SE +/- 0.02, N = 25SE +/- 0.02, N = 25SE +/- 0.04, N = 5SE +/- 0.04, N = 63.033.043.123.15-march=znver2-march=znver21. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

GNU MPC

Multi-Precision Benchmark

OpenBenchmarking.orgGlobal Score, More Is BetterGNU MPC 1.1.0Multi-Precision BenchmarkGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 26.46, N = 3SE +/- 102.03, N = 3SE +/- 31.80, N = 3SE +/- 50.44, N = 39580935795979577-march=znver2-march=znver21. (CC) gcc options: -lm -O3 -MT -MD -MP -MF

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21632486480SE +/- 0.08, N = 3SE +/- 0.22, N = 3SE +/- 0.23, N = 3SE +/- 0.37, N = 371.0771.0570.9771.78-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.18198, N = 3SE +/- 0.02559, N = 3SE +/- 0.02013, N = 3SE +/- 0.06300, N = 38.817488.637948.598038.60514-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.18198, N = 3SE +/- 0.02559, N = 3SE +/- 0.02013, N = 3SE +/- 0.06300, N = 38.817488.637948.598038.60514-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2816243240SE +/- 0.11, N = 3SE +/- 0.22, N = 3SE +/- 0.19, N = 3SE +/- 0.42, N = 332.8632.8432.8332.60-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.66431.32861.99292.65723.3215SE +/- 0.00151, N = 3SE +/- 0.00082, N = 3SE +/- 0.00047, N = 3SE +/- 0.00095, N = 32.729742.947302.732552.95225-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.38940.77881.16821.55761.947SE +/- 0.00098, N = 3SE +/- 0.00091, N = 3SE +/- 0.00015, N = 3SE +/- 0.00081, N = 31.722051.730551.708201.71668-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.0220.0440.0660.0880.11SE +/- 0.00041, N = 3SE +/- 0.00044, N = 3SE +/- 0.00036, N = 3SE +/- 0.00042, N = 30.097780.097710.097570.09798-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.07470.14940.22410.29880.3735SE +/- 0.00125, N = 3SE +/- 0.00071, N = 3SE +/- 0.00047, N = 3SE +/- 0.00042, N = 30.331860.325210.325960.32698-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21.13542.27083.40624.54165.677SE +/- 0.05697, N = 3SE +/- 0.04322, N = 3SE +/- 0.02698, N = 3SE +/- 0.07571, N = 34.949475.046034.891614.98832-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver25K10K15K20K25KSE +/- 62.37, N = 3SE +/- 195.64, N = 3SE +/- 159.70, N = 3SE +/- 119.42, N = 323885.4423993.0424227.2523832.61-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.24530.49060.73590.98121.2265SE +/- 0.01, N = 4SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.091.081.091.08

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2918273645SE +/- 0.33, N = 3403739401. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: DecompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 0.33, N = 31131081161161. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2100200300400500SE +/- 0.33, N = 3SE +/- 3.18, N = 3SE +/- 4.91, N = 84674534684681. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: DecompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 9.50, N = 3SE +/- 12.79, N = 812871250126912681. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2110220330440550SE +/- 0.88, N = 3SE +/- 4.47, N = 11SE +/- 0.67, N = 3SE +/- 4.10, N = 35074994945151. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 0.67, N = 3SE +/- 1.86, N = 32502482392571. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: DecompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 0.33, N = 3SE +/- 10.00, N = 311471159111911831. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver23K6K9K12K15KSE +/- 110.06, N = 3SE +/- 5.51, N = 3SE +/- 1.76, N = 3SE +/- 15.90, N = 312748141131295811828-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver23K6K9K12K15KSE +/- 2.19, N = 3SE +/- 155.95, N = 3SE +/- 141.66, N = 312902141191190914314-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 19.17, N = 3SE +/- 10.67, N = 3SE +/- 30.08, N = 3SE +/- 148.34, N = 49583.7310531.009028.1010814.00-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 40.30, N = 3SE +/- 95.02, N = 3SE +/- 73.85, N = 3SE +/- 67.62, N = 37071.307823.277063.037920.17-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver210K20K30K40K50KSE +/- 54.85, N = 3SE +/- 28.47, N = 3SE +/- 663.38, N = 4SE +/- 105.51, N = 345361463054525344951-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28001600240032004000SE +/- 5.96, N = 3SE +/- 13.97, N = 3SE +/- 25.64, N = 3SE +/- 5.91, N = 33127.493553.672768.163686.60-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22004006008001000SE +/- 0.24, N = 3SE +/- 0.29, N = 3SE +/- 7.16, N = 3SE +/- 0.74, N = 3777.17759.97761.38800.23-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350SE +/- 0.54, N = 3SE +/- 0.24, N = 3SE +/- 2.85, N = 3SE +/- 0.21, N = 3301.17261.10295.18273.49-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28001600240032004000SE +/- 15.78, N = 3SE +/- 58.43, N = 3SE +/- 37.65, N = 3SE +/- 13.73, N = 33856.633675.943767.633580.73-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 19.91, N = 3SE +/- 12.24, N = 3SE +/- 60.72, N = 3SE +/- 15.98, N = 38526.6610777.886891.3711370.27-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver25001000150020002500SE +/- 0.16, N = 3SE +/- 0.89, N = 3SE +/- 20.16, N = 3SE +/- 0.53, N = 32175.852293.462125.262408.26-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2300K600K900K1200K1500KSE +/- 676.60, N = 5SE +/- 6261.48, N = 5SE +/- 620.00, N = 5SE +/- 10688.23, N = 51366017140875213057811337188-march=znver2-march=znver21. (CC) gcc options: -O3 -march=native

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24K8K12K16K20KSE +/- 63.74, N = 3SE +/- 64.22, N = 3SE +/- 64.93, N = 3SE +/- 63.01, N = 3204262042620335202531. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 3.18, N = 12SE +/- 3.21, N = 14SE +/- 3.35, N = 15SE +/- 2.27, N = 15154.09157.72152.51155.34MIN: 129-march=znver2 - MIN: 127MIN: 111.42-march=znver2 - MIN: 127.991. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 5.99, N = 3SE +/- 25.50, N = 3SE +/- 7.48, N = 3SE +/- 17.21, N = 31582.781556.911523.521599.68MIN: 1385.56-march=znver2 - MIN: 1368.2MIN: 1357.02-march=znver2 - MIN: 1393.731. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 0.79, N = 3SE +/- 1.48, N = 4SE +/- 0.16, N = 3SE +/- 0.45, N = 3116.62118.02117.60118.47MIN: 102.39-march=znver2 - MIN: 102.11MIN: 103.13-march=znver2 - MIN: 103.471. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24K8K12K16K20KSE +/- 87.03, N = 3SE +/- 41.11, N = 3SE +/- 42.61, N = 3SE +/- 22.35, N = 319803.5719694.3319613.7019696.57MIN: 19014.9-march=znver2 - MIN: 18995.6MIN: 18961.5-march=znver2 - MIN: 19033.51. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 0.29, N = 3SE +/- 1.85, N = 15SE +/- 2.00, N = 15SE +/- 1.79, N = 13212.83218.66221.23217.02MIN: 201.7-march=znver2 - MIN: 203.42MIN: 202.07-march=znver2 - MIN: 203.651. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21326395265SE +/- 0.69, N = 15SE +/- 0.58, N = 8SE +/- 0.66, N = 7SE +/- 0.49, N = 1558.1656.8759.0057.97MIN: 50.91-march=znver2 - MIN: 50.96MIN: 50.8-march=znver2 - MIN: 51.571. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver25001000150020002500SE +/- 6.13, N = 3SE +/- 17.20, N = 3SE +/- 9.57, N = 3SE +/- 9.61, N = 32507.162543.932527.502520.01MIN: 2461.57-march=znver2 - MIN: 2467.76MIN: 2462.11-march=znver2 - MIN: 2467.071. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver211K22K33K44K55KSE +/- 224.88, N = 3SE +/- 390.75, N = 3SE +/- 589.20, N = 6SE +/- 668.40, N = 350039.1350679.5351813.0552238.80MIN: 46883.1-march=znver2 - MIN: 48056.6MIN: 48543.1-march=znver2 - MIN: 49224.91. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22004006008001000SE +/- 6.39, N = 3SE +/- 5.61, N = 3SE +/- 6.51, N = 3SE +/- 6.23, N = 31145.011145.951147.621153.46MIN: 1050.58-march=znver2 - MIN: 1052.71MIN: 1052.13-march=znver2 - MIN: 1057.541. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

AOM AV1

AV1 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-02-11AV1 Video EncodingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.0720.1440.2160.2880.36SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.270.320.270.31-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21122334455SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.19, N = 346.2246.4946.4546.39-march=znver2-march=znver21. (CXX) g++ options: -O3 -pie -lpthread -lm

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 1.73, N = 3SE +/- 1.78, N = 3SE +/- 3.72, N = 3SE +/- 0.72, N = 3248.85247.99246.01247.33-march=znver2-march=znver21. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video EncodeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220406080100SE +/- 0.15, N = 3SE +/- 0.19, N = 3SE +/- 0.28, N = 3SE +/- 0.08, N = 389.8492.3589.9996.54-march=znver2-march=znver21. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2306090120150SE +/- 2.27, N = 3SE +/- 2.09, N = 4SE +/- 1.55, N = 7SE +/- 2.03, N = 3138.74139.82139.59138.41-march=znver2-march=znver21. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21224364860SE +/- 0.20, N = 3SE +/- 0.19, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 353.0052.4052.9452.53-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 0.88, N = 3SE +/- 1.86, N = 3SE +/- 1.20, N = 3254264251259-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 4.33, N = 3SE +/- 1.86, N = 3SE +/- 1.20, N = 3262277262263-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24080120160200SE +/- 0.33, N = 3181196181195-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 1.20, N = 3208223209221-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260120180240300SE +/- 2.65, N = 3SE +/- 1.53, N = 3SE +/- 2.19, N = 3SE +/- 1.15, N = 3274286275280-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver24080120160200SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 3170173170171-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350SE +/- 2.19, N = 3SE +/- 0.33, N = 3SE +/- 2.60, N = 3SE +/- 2.19, N = 3288302287293-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver230060090012001500SE +/- 2.93, N = 3SE +/- 0.48, N = 3SE +/- 11.21, N = 3SE +/- 6.19, N = 31385.231385.881322.901378.46-march=znver2-march=znver21. (CC) gcc options: -O3 -mavx2

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28M16M24M32M40MSE +/- 237875.03, N = 3SE +/- 131167.27, N = 3SE +/- 210046.69, N = 3SE +/- 164232.11, N = 339631993395403283927896439561655-march=znver2-march=znver21. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350292.53300.31280.27284.10

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21224364860SE +/- 0.09, N = 3SE +/- 0.51, N = 3SE +/- 0.15, N = 3SE +/- 0.36, N = 354.4353.7652.7153.91-march=znver2-march=znver21. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21020304050SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 342.6339.3643.0939.42-march=znver2-march=znver21. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 37.847.537.787.67-march=znver2-march=znver21. (CXX) g++ options: -fopenmp -O3

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2816243240SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 335.9833.0534.6033.20-march=znver2-march=znver21. (CC) gcc options: -lm -O3

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.47480.94961.42441.89922.374SE +/- 0.00, N = 6SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.112.061.982.04-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.7741.5482.3223.0963.87SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 33.443.273.203.22-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.92481.84962.77443.69924.624SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 34.113.853.883.77-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.83931.67862.51793.35724.1965SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 33.733.603.513.57-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.4950.991.4851.982.475SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.202.052.052.04-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.18230.36460.54690.72920.9115SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.810.770.750.77-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.21380.42760.64140.85521.069SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.950.910.890.90-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2612182430SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 325.2625.3925.2325.25-march=znver2-march=znver21. (CC) gcc options: -pthread -fvisibility=hidden -O3

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.03, N = 5SE +/- 0.04, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 57.728.117.707.99-march=znver2-march=znver21. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 37.287.457.256.94-march=znver2-march=znver21. (CC) gcc options: -O3 -lncurses -lm

Ogg Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Encoding 1.3.3WAV To OggGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21.2062.4123.6184.8246.03SE +/- 0.00, N = 3SE +/- 0.00, N = 4SE +/- 0.00, N = 3SE +/- 0.01, N = 35.055.365.135.05-march=znver2-march=znver21. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -logg

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 36.886.786.866.83-march=znver2-march=znver21. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -O3 -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21122334455SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 347.1447.2147.1247.27-march=znver2-march=znver21. (CXX) g++ options: -fopenmp -O3 -O2 -march=native

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28001600240032004000SE +/- 1.89, N = 3SE +/- 0.70, N = 3SE +/- 7.07, N = 3SE +/- 1.42, N = 33492.533487.103516.273481.50-march=znver2-march=znver21. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Cpuminer-Opt

Algorithm: m7m

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: m7mGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2130260390520650SE +/- 0.27, N = 3SE +/- 0.35, N = 3SE +/- 0.29, N = 3SE +/- 0.15, N = 3591.32590.80593.89590.66-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: deep

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: deepGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22K4K6K8K10KSE +/- 3.33, N = 3SE +/- 8.82, N = 3SE +/- 926.03, N = 1211137.0011123.0011190.0010230.34-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: lbry

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: lbryGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28K16K24K32K40KSE +/- 460.86, N = 5SE +/- 20.82, N = 3SE +/- 550.28, N = 3SE +/- 5.77, N = 335288346303458334420-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: skein

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: skeinGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver29K18K27K36K45KSE +/- 5.77, N = 3SE +/- 133.46, N = 3SE +/- 602.50, N = 3SE +/- 21.86, N = 339720398433939739797-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: myr-gr

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: myr-grGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver23K6K9K12K15KSE +/- 40.00, N = 3SE +/- 6.67, N = 3SE +/- 49.78, N = 3SE +/- 26.03, N = 314130141371412714023-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: sha256t

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: sha256tGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 116.81, N = 3SE +/- 180.83, N = 3SE +/- 990.16, N = 7SE +/- 1027.26, N = 686417864408795187238-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver250100150200250SE +/- 2.32, N = 3SE +/- 0.30, N = 3SE +/- 0.44, N = 3SE +/- 0.31, N = 3220.33225.44218.09225.64-march=znver2-march=znver21. (CC) gcc options: -O3 -rdynamic

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water BenchmarkGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver20.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.970.980.980.99-march=znver2-march=znver21. (CXX) g++ options: -march=core-avx2 -O3 -std=c++11 -funroll-all-loops -fopenmp -lrt -lpthread -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver260K120K180K240K300KSE +/- 102.78, N = 3SE +/- 237.85, N = 3SE +/- 513.53, N = 3SE +/- 235.79, N = 3300244.81298969.75300353.09297539.89-march=znver2-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver26K12K18K24K30KSE +/- 31.16, N = 3SE +/- 124.84, N = 3SE +/- 55.36, N = 3SE +/- 40.41, N = 329372.3929148.6029178.2329149.20-march=znver2-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver21428425670SE +/- 0.17, N = 3SE +/- 0.06, N = 3SE +/- 0.30, N = 3SE +/- 0.53, N = 1159.9763.3459.3160.38-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2714212835SE +/- 0.38, N = 5SE +/- 0.03, N = 3SE +/- 0.28, N = 3SE +/- 0.14, N = 331.5131.3031.5231.43-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver270140210280350SE +/- 4.29, N = 3SE +/- 2.37, N = 3SE +/- 0.26, N = 3SE +/- 3.91, N = 3307.23306.02309.36302.82-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Random Numbers

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver22004006008001000SE +/- 4.15, N = 3SE +/- 10.35, N = 5SE +/- 2.69, N = 3SE +/- 0.27, N = 3799.88787.77751.15750.66-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220406080100SE +/- 0.88, N = 3SE +/- 0.04, N = 3SE +/- 0.35, N = 3SE +/- 0.12, N = 374.2677.2276.4574.08-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver248121620SE +/- 0.17, N = 3SE +/- 0.20, N = 4SE +/- 0.08, N = 3SE +/- 0.03, N = 315.1014.9014.4014.15-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2714212835SE +/- 0.08, N = 3SE +/- 0.45, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 328.1928.3027.6028.93-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2700K1400K2100K2800K3500KSE +/- 47460.73, N = 15SE +/- 51486.64, N = 15SE +/- 40781.06, N = 3SE +/- 61029.58, N = 153042507.473031706.223297713.333066070.281. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2500K1000K1500K2000K2500KSE +/- 28123.08, N = 3SE +/- 14796.01, N = 3SE +/- 30290.32, N = 4SE +/- 19021.82, N = 32051361.332084989.882122162.942169531.001. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Memcached mcperf

Method: Get

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: GetGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 1025.20, N = 15SE +/- 1267.59, N = 3SE +/- 1551.16, N = 3SE +/- 937.65, N = 1595710.6097228.2792376.4093850.59-march=znver2-march=znver21. (CC) gcc options: -O3 -lm -rdynamic

Memcached mcperf

Method: Set

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: SetGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver213K26K39K52K65KSE +/- 2058.38, N = 15SE +/- 393.33, N = 3SE +/- 293.82, N = 3SE +/- 3850.96, N = 1557193.2552910.8752914.1059232.07-march=znver2-march=znver21. (CC) gcc options: -O3 -lm -rdynamic

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page ServingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver29K18K27K36K45KSE +/- 158.42, N = 3SE +/- 23.74, N = 3SE +/- 102.83, N = 3SE +/- 112.05, N = 339525.7039346.9139734.8539602.49-march=znver2-march=znver21. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver28K16K24K32K40KSE +/- 65.39, N = 3SE +/- 79.10, N = 3SE +/- 57.64, N = 3SE +/- 139.15, N = 338490.9838009.2538392.2938022.79-march=znver2-march=znver21. (CC) gcc options: -shared -fPIC -pthread -O3

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 200GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 3302.37, N = 12SE +/- 1288.23, N = 15SE +/- 798.56, N = 3SE +/- 3575.15, N = 1582293.1483275.0660835.7999824.49-march=znver2-march=znver21. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250GCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver220K40K60K80K100KSE +/- 122.71, N = 3SE +/- 1636.75, N = 12SE +/- 3755.13, N = 15SE +/- 4063.46, N = 1262725.24102423.0798050.9196842.13-march=znver2-march=znver21. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2120K240K360K480K600KSE +/- 1210.22, N = 3SE +/- 1036.74, N = 3SE +/- 1430.19, N = 3SE +/- 2761.64, N = 3568329.00567096.65567987.34555154.60-march=znver2-march=znver21. (CC) gcc options: -O2 -O3 -lrt" -lrt


Phoronix Test Suite v10.8.5