Ryzen 9 3900X Znver2 Compiler Tuning

AMD Ryzen 9 3900X 12-Core testing of GCC 9 and GCC 10 development, with and without Znver2 tuning, following recent cost table updates. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1907290-HV-RYZEN939034&gru&sro.

Ryzen 9 3900X Znver2 Compiler Tuning: System Details

Configurations: GCC 9.1.0, GCC 9.1.0 znver2, GCC 10.0.0, GCC 10.0.0 znver2

Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)
Chipset: AMD Device 1480
Memory: 16384MB
Disk: 2000GB Force MP600
Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)
Audio: AMD Device aae0
Monitor: ASUS VP28U
Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04
Kernel: 5.3.0-999-generic (x86_64) 20190725
Desktop: GNOME Shell 3.28.4
Display Server: X Server 1.20.4
Display Driver: modesetting 1.20.4
OpenGL: 4.5 Mesa 19.0.2 (LLVM 8.0.0)
Compiler: GCC 9.1.0 (GCC 9.1.0 configurations); GCC 10.0.0 20190727 (GCC 10.0.0 configurations)
File-System: ext4
Screen Resolution: 3840x2160

Environment Details
- GCC 9.1.0: CXXFLAGS=-O3 CFLAGS=-O3
- GCC 9.1.0 znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"
- GCC 10.0.0: CXXFLAGS=-O3 CFLAGS=-O3
- GCC 10.0.0 znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"

Compiler Details
- --disable-multilib --enable-checking=release

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

Python Details
- Python 2.7.15+ + Python 3.6.8

Security Details
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling
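
The four configurations differ only in the compiler used and in whether -march=znver2 is appended to the optimization flags. As a minimal sketch of how those flag sets could be applied to a build environment, the Python below maps each configuration name to its CFLAGS/CXXFLAGS value. The helper name build_env is hypothetical, the sketch assumes the znver2 configurations pass -O3 and -march=znver2 as two separate flags, and it is not necessarily how the benchmark harness itself applied them.

```python
import os

# Flag sets per configuration, taken from the Environment Details above.
# Assumption: the znver2 runs pass "-O3" and "-march=znver2" as two separate flags.
CONFIG_FLAGS = {
    "GCC 9.1.0": "-O3",
    "GCC 9.1.0 znver2": "-O3 -march=znver2",
    "GCC 10.0.0": "-O3",
    "GCC 10.0.0 znver2": "-O3 -march=znver2",
}


def build_env(config, base_env=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set for `config`.

    `build_env` is a hypothetical helper; selecting the compiler binary itself
    (GCC 9.1.0 vs. GCC 10.0.0) is outside the scope of this sketch.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CFLAGS"] = CONFIG_FLAGS[config]
    env["CXXFLAGS"] = CONFIG_FLAGS[config]
    return env


if __name__ == "__main__":
    for name in CONFIG_FLAGS:
        env = build_env(name, base_env={})
        print(f"{name}: CFLAGS={env['CFLAGS']!r} CXXFLAGS={env['CXXFLAGS']!r}")
```

The returned dictionary could be handed to a build step (for example through the env argument of subprocess.run), which keeps the flag choice in one place per configuration.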

Ryzen 9 3900X Znver2 Compiler Tuning: Results Overview

Tests in this result file, each run under GCC 9.1.0, GCC 9.1.0 znver2, GCC 10.0.0 and GCC 10.0.0 znver2 (the values for each test are listed in the per-test result sections):

aom-av1: AV1 Video Encoding
svt-av1: 1080p 8-bit YUV To AV1 Video Encode
svt-hevc: 1080p 8-bit YUV To HEVC Video Encode
svt-vp9: 1080p 8-bit YUV To VP9 Video Encode
x264: H.264 Video Encoding
x265: H.265 1080p Video Encoding
hpcc: G-Ptrans
hpcc: EP-STREAM Triad
hpcc: Rand Ring Bandwidth
hpcc: G-Ffte
hpcg
hpcc: G-HPL
hpcc: EP-DGEMM
mpcbench: Multi-Precision Benchmark
hpcc: G-Rand Access
graphics-magick: Swirl
graphics-magick: Rotate
graphics-magick: Sharpen
graphics-magick: Enhanced
graphics-magick: Resizing
graphics-magick: Noise-Gaussian
graphics-magick: HWB Color Space
coremark: CoreMark Size 666 - Iterations Per Second
cpuminer-opt: m7m
cpuminer-opt: deep
cpuminer-opt: lbry
cpuminer-opt: skein
cpuminer-opt: myr-gr
cpuminer-opt: sha256t
hpcc: Max Ping Pong Bandwidth
lzbench: XZ 0 - Compression
lzbench: XZ 0 - Decompression
lzbench: Zstd 1 - Compression
lzbench: Zstd 1 - Decompression
lzbench: Brotli 0 - Compression
lzbench: Libdeflate 1 - Compression
lzbench: Libdeflate 1 - Decompression
tjbench: Decompression Throughput
sockperf: Throughput
fftw: Stock - 1D FFT Size 32
fftw: Stock - 2D FFT Size 32
fftw: Stock - 2D FFT Size 512
fftw: Stock - 2D FFT Size 4096
fftw: Float + SSE - 2D FFT Size 32
scimark2: Composite
scimark2: Monte Carlo
scimark2: Fast Fourier Transform
scimark2: Sparse Matrix Multiply
scimark2: Dense LU Matrix Factorization
scimark2: Jacobi Successive Over-Relaxation
himeno: Poisson Pressure Solver
tscp: AI Chess Performance
stockfish: Total Time
gromacs: Water Benchmark
mcperf: Get
mcperf: Set
john-the-ripper: Blowfish
redis: GET
redis: SET
nginx: Static Web Page Serving
apache: Static Web Page Serving
openssl: RSA 4096-bit Performance
pgbench: Buffer Test - Normal Load - Read Only
pgbench: Buffer Test - Normal Load - Read Write
apache-siege: 200
apache-siege: 250
mkl-dnn: IP Batch 1D - f32
mkl-dnn: IP Batch All - f32
mkl-dnn: Convolution Batch conv_3d - f32
mkl-dnn: Convolution Batch conv_all - f32
mkl-dnn: Deconvolution Batch deconv_1d - f32
mkl-dnn: Deconvolution Batch deconv_3d - f32
mkl-dnn: Convolution Batch conv_alexnet - f32
mkl-dnn: Deconvolution Batch deconv_all - f32
mkl-dnn: Convolution Batch conv_googlenet_v3 - f32
build-llvm: Time To Compile
build-php: Time To Compile
c-ray: Total Time - 4K, 16 Rays Per Pixel
smallpt: Global Illumination Renderer; 128 Samples
aobench: 2048 x 2048 - Total Time
bullet: Raytests
bullet: 3000 Fall
bullet: 1000 Stack
bullet: 1000 Convex
bullet: 136 Ragdolls
bullet: Prim Trimesh
bullet: Convex Trimesh
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
encode-flac: WAV To FLAC
encode-mp3: WAV To MP3
encode-ogg: WAV To Ogg
ffmpeg: H.264 HD To NTSC DV
m-queens: Time To Solve
cpp-perf-bench: Atol
cpp-perf-bench: Ctype
cpp-perf-bench: Math Library
cpp-perf-bench: Rand Numbers
cpp-perf-bench: Stepanov Vector
cpp-perf-bench: Function Objects
cpp-perf-bench: Stepanov Abstraction
sockperf: Latency Ping Pong
hpcc: Rand Ring Latency

AOM AV1

AV1 Video Encoding

AOM AV1 2019-02-11: Frames Per Second, More Is Better
GCC 10.0.0: 0.27 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 0.32 (SE +/- 0.00, N = 3; -march=znver2)
GCC 9.1.0: 0.27 (SE +/- 0.00, N = 3)
GCC 9.1.0 znver2: 0.31 (SE +/- 0.00, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
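
To put the AOM AV1 numbers in relative terms, the short Python sketch below computes the percentage change from each baseline build to its znver2 counterpart. The function name percent_change is illustrative, and because the frame rates are published to only two decimal places the percentages are coarse.

```python
def percent_change(baseline, tuned):
    """Relative change of `tuned` over `baseline`, in percent."""
    return (tuned - baseline) / baseline * 100.0


# AOM AV1 frames per second, copied from the result above (more is better).
aom_av1_fps = {
    "GCC 9.1.0": 0.27,
    "GCC 9.1.0 znver2": 0.31,
    "GCC 10.0.0": 0.27,
    "GCC 10.0.0 znver2": 0.32,
}

for compiler in ("GCC 9.1.0", "GCC 10.0.0"):
    gain = percent_change(aom_av1_fps[compiler], aom_av1_fps[compiler + " znver2"])
    print(f"{compiler}: {gain:+.1f}% with -march=znver2")
```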

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

SVT-AV1 0.5: Frames Per Second, More Is Better
GCC 10.0.0: 46.22 (SE +/- 0.27, N = 3)
GCC 10.0.0 znver2: 46.49 (SE +/- 0.13, N = 3; -march=znver2)
GCC 9.1.0: 46.45 (SE +/- 0.15, N = 3)
GCC 9.1.0 znver2: 46.39 (SE +/- 0.19, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -pie -lpthread -lm

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

SVT-HEVC 2019-02-03: Frames Per Second, More Is Better
GCC 10.0.0: 248.85 (SE +/- 1.73, N = 3)
GCC 10.0.0 znver2: 247.99 (SE +/- 1.78, N = 3; -march=znver2)
GCC 9.1.0: 246.01 (SE +/- 3.72, N = 3)
GCC 9.1.0 znver2: 247.33 (SE +/- 0.72, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

SVT-VP9 2019-02-17: Frames Per Second, More Is Better
GCC 10.0.0: 89.84 (SE +/- 0.15, N = 3)
GCC 10.0.0 znver2: 92.35 (SE +/- 0.19, N = 3; -march=znver2)
GCC 9.1.0: 89.99 (SE +/- 0.28, N = 3)
GCC 9.1.0 znver2: 96.54 (SE +/- 0.08, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

x264

H.264 Video Encoding

x264 2018-09-25: Frames Per Second, More Is Better
GCC 10.0.0: 138.74 (SE +/- 2.27, N = 3)
GCC 10.0.0 znver2: 139.82 (SE +/- 2.09, N = 4; -march=znver2)
GCC 9.1.0: 139.59 (SE +/- 1.55, N = 7)
GCC 9.1.0 znver2: 138.41 (SE +/- 2.03, N = 3; -march=znver2)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
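
Each result is reported as a mean with "SE +/- x, N = n". Assuming SE denotes the standard error of the mean (sample standard deviation divided by the square root of the number of runs), which is the usual definition but is not spelled out in this export, it can be reproduced from raw run data as sketched below. The per-run values are hypothetical, since the export publishes only the mean, SE and N.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run frame rates, for illustration only.
runs = [136.2, 139.0, 141.1]
print(
    f"mean = {statistics.mean(runs):.2f}, "
    f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}"
)
```

Under that definition, a result with a larger N and a similar spread between runs yields a smaller SE, so the SE values of results with different N are not directly comparable as measures of run-to-run noise.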

x265

H.265 1080p Video Encoding

x265 3.0: Frames Per Second, More Is Better
GCC 10.0.0: 53.00 (SE +/- 0.20, N = 3)
GCC 10.0.0 znver2: 52.40 (SE +/- 0.19, N = 3; -march=znver2)
GCC 9.1.0: 52.94 (SE +/- 0.28, N = 3)
GCC 9.1.0 znver2: 52.53 (SE +/- 0.06, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

HPC Challenge

Test / Class: G-Ptrans

HPC Challenge 1.5.0: GB/s, More Is Better
GCC 10.0.0: 2.72974 (SE +/- 0.00151, N = 3)
GCC 10.0.0 znver2: 2.94730 (SE +/- 0.00082, N = 3; -march=znver2)
GCC 9.1.0: 2.73255 (SE +/- 0.00047, N = 3)
GCC 9.1.0 znver2: 2.95225 (SE +/- 0.00095, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: EP-STREAM Triad

HPC Challenge 1.5.0: GB/s, More Is Better
GCC 10.0.0: 1.72205 (SE +/- 0.00098, N = 3)
GCC 10.0.0 znver2: 1.73055 (SE +/- 0.00091, N = 3; -march=znver2)
GCC 9.1.0: 1.70820 (SE +/- 0.00015, N = 3)
GCC 9.1.0 znver2: 1.71668 (SE +/- 0.00081, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: Random Ring Bandwidth

HPC Challenge 1.5.0: GB/s, More Is Better
GCC 10.0.0: 4.94947 (SE +/- 0.05697, N = 3)
GCC 10.0.0 znver2: 5.04603 (SE +/- 0.04322, N = 3; -march=znver2)
GCC 9.1.0: 4.89161 (SE +/- 0.02698, N = 3)
GCC 9.1.0 znver2: 4.98832 (SE +/- 0.07571, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: G-Ffte

HPC Challenge 1.5.0: GFLOP/s, More Is Better
GCC 10.0.0: 8.81748 (SE +/- 0.18198, N = 3)
GCC 10.0.0 znver2: 8.63794 (SE +/- 0.02559, N = 3; -march=znver2)
GCC 9.1.0: 8.59803 (SE +/- 0.02013, N = 3)
GCC 9.1.0 znver2: 8.60514 (SE +/- 0.06300, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.0: GFLOP/s, More Is Better
GCC 10.0.0: 1.09 (SE +/- 0.01, N = 4)
GCC 10.0.0 znver2: 1.08 (SE +/- 0.00, N = 3)
GCC 9.1.0: 1.09 (SE +/- 0.01, N = 3)
GCC 9.1.0 znver2: 1.08 (SE +/- 0.01, N = 3)

HPC Challenge

Test / Class: G-HPL

HPC Challenge 1.5.0: GFLOPS, More Is Better
GCC 10.0.0: 71.07 (SE +/- 0.08, N = 3)
GCC 10.0.0 znver2: 71.05 (SE +/- 0.22, N = 3; -march=znver2)
GCC 9.1.0: 70.97 (SE +/- 0.23, N = 3)
GCC 9.1.0 znver2: 71.78 (SE +/- 0.37, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

HPC Challenge

Test / Class: EP-DGEMM

HPC Challenge 1.5.0: GFLOPS, More Is Better
GCC 10.0.0: 32.86 (SE +/- 0.11, N = 3)
GCC 10.0.0 znver2: 32.84 (SE +/- 0.22, N = 3; -march=znver2)
GCC 9.1.0: 32.83 (SE +/- 0.19, N = 3)
GCC 9.1.0 znver2: 32.60 (SE +/- 0.42, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

GNU MPC

Multi-Precision Benchmark

GNU MPC 1.1.0: Global Score, More Is Better
GCC 10.0.0: 9580 (SE +/- 26.46, N = 3)
GCC 10.0.0 znver2: 9357 (SE +/- 102.03, N = 3; -march=znver2)
GCC 9.1.0: 9597 (SE +/- 31.80, N = 3)
GCC 9.1.0 znver2: 9577 (SE +/- 50.44, N = 3; -march=znver2)
1. (CC) gcc options: -lm -O3 -MT -MD -MP -MF

HPC Challenge

Test / Class: G-Random Access

HPC Challenge 1.5.0: GUP/s, More Is Better
GCC 10.0.0: 0.09778 (SE +/- 0.00041, N = 3)
GCC 10.0.0 znver2: 0.09771 (SE +/- 0.00044, N = 3; -march=znver2)
GCC 9.1.0: 0.09757 (SE +/- 0.00036, N = 3)
GCC 9.1.0 znver2: 0.09798 (SE +/- 0.00042, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 254
GCC 10.0.0 znver2: 264 (-march=znver2)
GCC 9.1.0: 251
GCC 9.1.0 znver2: 259 (-march=znver2)
SE entries reported (three only; the export does not identify which configuration lacks one): SE +/- 0.88, N = 3; SE +/- 1.86, N = 3; SE +/- 1.20, N = 3
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 262
GCC 10.0.0 znver2: 277 (-march=znver2)
GCC 9.1.0: 262
GCC 9.1.0 znver2: 263 (-march=znver2)
SE entries reported (three only; the export does not identify which configuration lacks one): SE +/- 4.33, N = 3; SE +/- 1.86, N = 3; SE +/- 1.20, N = 3
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 181
GCC 10.0.0 znver2: 196 (-march=znver2)
GCC 9.1.0: 181
GCC 9.1.0 znver2: 195 (-march=znver2)
SE entries reported (one only; the export does not identify its configuration): SE +/- 0.33, N = 3
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 208
GCC 10.0.0 znver2: 223 (-march=znver2)
GCC 9.1.0: 209
GCC 9.1.0 znver2: 221 (-march=znver2)
SE entries reported (one only; the export does not identify its configuration): SE +/- 1.20, N = 3
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 274 (SE +/- 2.65, N = 3)
GCC 10.0.0 znver2: 286 (SE +/- 1.53, N = 3; -march=znver2)
GCC 9.1.0: 275 (SE +/- 2.19, N = 3)
GCC 9.1.0 znver2: 280 (SE +/- 1.15, N = 3; -march=znver2)
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 170
GCC 10.0.0 znver2: 173 (-march=znver2)
GCC 9.1.0: 170
GCC 9.1.0 znver2: 171 (-march=znver2)
SE entries reported (three only; the export does not identify which configuration lacks one): SE +/- 0.33, N = 3; SE +/- 0.67, N = 3; SE +/- 0.58, N = 3
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.30: Iterations Per Minute, More Is Better
GCC 10.0.0: 288 (SE +/- 2.19, N = 3)
GCC 10.0.0 znver2: 302 (SE +/- 0.33, N = 3; -march=znver2)
GCC 9.1.0: 287 (SE +/- 2.60, N = 3)
GCC 9.1.0 znver2: 293 (SE +/- 2.19, N = 3; -march=znver2)
1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0: Iterations/Sec, More Is Better
GCC 10.0.0: 568329.00 (SE +/- 1210.22, N = 3)
GCC 10.0.0 znver2: 567096.65 (SE +/- 1036.74, N = 3; -march=znver2)
GCC 9.1.0: 567987.34 (SE +/- 1430.19, N = 3)
GCC 9.1.0 znver2: 555154.60 (SE +/- 2761.64, N = 3; -march=znver2)
1. (CC) gcc options: -O2 -O3 -lrt" -lrt

Cpuminer-Opt

Algorithm: m7m

Cpuminer-Opt 3.8.8.1: kH/s - Hash Speed, More Is Better
GCC 10.0.0: 591.32 (SE +/- 0.27, N = 3)
GCC 10.0.0 znver2: 590.80 (SE +/- 0.35, N = 3; -march=znver2)
GCC 9.1.0: 593.89 (SE +/- 0.29, N = 3)
GCC 9.1.0 znver2: 590.66 (SE +/- 0.15, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: deep

Cpuminer-Opt 3.8.8.1: kH/s - Hash Speed, More Is Better
GCC 10.0.0: 11137.00
GCC 10.0.0 znver2: 11123.00 (-march=znver2)
GCC 9.1.0: 11190.00
GCC 9.1.0 znver2: 10230.34 (-march=znver2)
SE entries reported (three only; the export does not identify which configuration lacks one): SE +/- 3.33, N = 3; SE +/- 8.82, N = 3; SE +/- 926.03, N = 12
1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: lbry

Cpuminer-Opt 3.8.8.1: kH/s - Hash Speed, More Is Better
GCC 10.0.0: 35288 (SE +/- 460.86, N = 5)
GCC 10.0.0 znver2: 34630 (SE +/- 20.82, N = 3; -march=znver2)
GCC 9.1.0: 34583 (SE +/- 550.28, N = 3)
GCC 9.1.0 znver2: 34420 (SE +/- 5.77, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: skein

Cpuminer-Opt 3.8.8.1: kH/s - Hash Speed, More Is Better
GCC 10.0.0: 39720 (SE +/- 5.77, N = 3)
GCC 10.0.0 znver2: 39843 (SE +/- 133.46, N = 3; -march=znver2)
GCC 9.1.0: 39397 (SE +/- 602.50, N = 3)
GCC 9.1.0 znver2: 39797 (SE +/- 21.86, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: myr-gr

Cpuminer-Opt 3.8.8.1: kH/s - Hash Speed, More Is Better
GCC 10.0.0: 14130 (SE +/- 40.00, N = 3)
GCC 10.0.0 znver2: 14137 (SE +/- 6.67, N = 3; -march=znver2)
GCC 9.1.0: 14127 (SE +/- 49.78, N = 3)
GCC 9.1.0 znver2: 14023 (SE +/- 26.03, N = 3; -march=znver2)
1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: sha256t

Cpuminer-Opt 3.8.8.1: kH/s - Hash Speed, More Is Better
GCC 10.0.0: 86417 (SE +/- 116.81, N = 3)
GCC 10.0.0 znver2: 86440 (SE +/- 180.83, N = 3; -march=znver2)
GCC 9.1.0: 87951 (SE +/- 990.16, N = 7)
GCC 9.1.0 znver2: 87238 (SE +/- 1027.26, N = 6; -march=znver2)
1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

HPC Challenge 1.5.0: MB/s, More Is Better
GCC 10.0.0: 23885.44 (SE +/- 62.37, N = 3)
GCC 10.0.0 znver2: 23993.04 (SE +/- 195.64, N = 3; -march=znver2)
GCC 9.1.0: 24227.25 (SE +/- 159.70, N = 3)
GCC 9.1.0 znver2: 23832.61 (SE +/- 119.42, N = 3; -march=znver2)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1

lzbench

Test: XZ 0 - Process: Compression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 40
GCC 10.0.0 znver2: 37
GCC 9.1.0: 39
GCC 9.1.0 znver2: 40
SE entries reported (one only; the export does not identify its configuration): SE +/- 0.33, N = 3
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 113
GCC 10.0.0 znver2: 108
GCC 9.1.0: 116
GCC 9.1.0 znver2: 116
SE entries reported (one only; the export does not identify its configuration): SE +/- 0.33, N = 3
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 467
GCC 10.0.0 znver2: 453
GCC 9.1.0: 468
GCC 9.1.0 znver2: 468
SE entries reported (three only; the export does not identify which configuration lacks one): SE +/- 0.33, N = 3; SE +/- 3.18, N = 3; SE +/- 4.91, N = 8
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 1287 (SE +/- 0.58, N = 3)
GCC 10.0.0 znver2: 1250 (SE +/- 0.33, N = 3)
GCC 9.1.0: 1269 (SE +/- 9.50, N = 3)
GCC 9.1.0 znver2: 1268 (SE +/- 12.79, N = 8)
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 507 (SE +/- 0.88, N = 3)
GCC 10.0.0 znver2: 499 (SE +/- 4.47, N = 11)
GCC 9.1.0: 494 (SE +/- 0.67, N = 3)
GCC 9.1.0 znver2: 515 (SE +/- 4.10, N = 3)
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 250
GCC 10.0.0 znver2: 248
GCC 9.1.0: 239
GCC 9.1.0 znver2: 257
SE entries reported (two only; the export does not identify their configurations): SE +/- 0.67, N = 3; SE +/- 1.86, N = 3
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

lzbench 2017-08-08: MB/s, More Is Better
GCC 10.0.0: 1147
GCC 10.0.0 znver2: 1159
GCC 9.1.0: 1119
GCC 9.1.0 znver2: 1183
SE entries reported (two only; the export does not identify their configurations): SE +/- 0.33, N = 3; SE +/- 10.00, N = 3
1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 2.0.2: Megapixels/sec, More Is Better
GCC 10.0.0: 220.33 (SE +/- 2.32, N = 3)
GCC 10.0.0 znver2: 225.44 (SE +/- 0.30, N = 3; -march=znver2)
GCC 9.1.0: 218.09 (SE +/- 0.44, N = 3)
GCC 9.1.0 znver2: 225.64 (SE +/- 0.31, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -rdynamic

Sockperf

Test: Throughput

Sockperf 3.4: Messages Per Second, More Is Better
GCC 10.0.0: 514748 (SE +/- 5409.10, N = 5)
GCC 10.0.0 znver2: 529657 (SE +/- 3715.76, N = 18; -march=znver2)
GCC 9.1.0: 514551 (SE +/- 4767.11, N = 5)
GCC 9.1.0 znver2: 517095 (SE +/- 4175.03, N = 5; -march=znver2)
1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

FFTW

Build: Stock - Size: 1D FFT Size 32

FFTW 3.3.6: Mflops, More Is Better
GCC 10.0.0: 12748 (SE +/- 110.06, N = 3)
GCC 10.0.0 znver2: 14113 (SE +/- 5.51, N = 3; -march=znver2)
GCC 9.1.0: 12958 (SE +/- 1.76, N = 3)
GCC 9.1.0 znver2: 11828 (SE +/- 15.90, N = 3; -march=znver2)
1. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

FFTW 3.3.6: Mflops, More Is Better
GCC 10.0.0: 12902
GCC 10.0.0 znver2: 14119 (-march=znver2)
GCC 9.1.0: 11909
GCC 9.1.0 znver2: 14314 (-march=znver2)
SE entries reported (three only; the export does not identify which configuration lacks one): SE +/- 2.19, N = 3; SE +/- 155.95, N = 3; SE +/- 141.66, N = 3
1. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

FFTW 3.3.6: Mflops, More Is Better
GCC 10.0.0: 9583.73 (SE +/- 19.17, N = 3)
GCC 10.0.0 znver2: 10531.00 (SE +/- 10.67, N = 3; -march=znver2)
GCC 9.1.0: 9028.10 (SE +/- 30.08, N = 3)
GCC 9.1.0 znver2: 10814.00 (SE +/- 148.34, N = 4; -march=znver2)
1. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6: Mflops, More Is Better
GCC 10.0.0: 7071.30 (SE +/- 40.30, N = 3)
GCC 10.0.0 znver2: 7823.27 (SE +/- 95.02, N = 3; -march=znver2)
GCC 9.1.0: 7063.03 (SE +/- 73.85, N = 3)
GCC 9.1.0 znver2: 7920.17 (SE +/- 67.62, N = 3; -march=znver2)
1. (CC) gcc options: -pthread -O3 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

FFTW 3.3.6: Mflops, More Is Better
GCC 10.0.0: 45361 (SE +/- 54.85, N = 3)
GCC 10.0.0 znver2: 46305 (SE +/- 28.47, N = 3; -march=znver2)
GCC 9.1.0: 45253 (SE +/- 663.38, N = 4)
GCC 9.1.0 znver2: 44951 (SE +/- 105.51, N = 3; -march=znver2)
1. (CC) gcc options: -pthread -O3 -lm

SciMark

Computational Test: Composite

SciMark 2.0: Mflops, More Is Better
GCC 10.0.0: 3127.49 (SE +/- 5.96, N = 3)
GCC 10.0.0 znver2: 3553.67 (SE +/- 13.97, N = 3; -march=znver2)
GCC 9.1.0: 2768.16 (SE +/- 25.64, N = 3)
GCC 9.1.0 znver2: 3686.60 (SE +/- 5.91, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0: Mflops, More Is Better
GCC 10.0.0: 777.17 (SE +/- 0.24, N = 3)
GCC 10.0.0 znver2: 759.97 (SE +/- 0.29, N = 3; -march=znver2)
GCC 9.1.0: 761.38 (SE +/- 7.16, N = 3)
GCC 9.1.0 znver2: 800.23 (SE +/- 0.74, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0: Mflops, More Is Better
GCC 10.0.0: 301.17 (SE +/- 0.54, N = 3)
GCC 10.0.0 znver2: 261.10 (SE +/- 0.24, N = 3; -march=znver2)
GCC 9.1.0: 295.18 (SE +/- 2.85, N = 3)
GCC 9.1.0 znver2: 273.49 (SE +/- 0.21, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0: Mflops, More Is Better
GCC 10.0.0: 3856.63 (SE +/- 15.78, N = 3)
GCC 10.0.0 znver2: 3675.94 (SE +/- 58.43, N = 3; -march=znver2)
GCC 9.1.0: 3767.63 (SE +/- 37.65, N = 3)
GCC 9.1.0 znver2: 3580.73 (SE +/- 13.73, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0: Mflops, More Is Better
GCC 10.0.0: 8526.66 (SE +/- 19.91, N = 3)
GCC 10.0.0 znver2: 10777.88 (SE +/- 12.24, N = 3; -march=znver2)
GCC 9.1.0: 6891.37 (SE +/- 60.72, N = 3)
GCC 9.1.0 znver2: 11370.27 (SE +/- 15.98, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0: Mflops, More Is Better
GCC 10.0.0: 2175.85 (SE +/- 0.16, N = 3)
GCC 10.0.0 znver2: 2293.46 (SE +/- 0.89, N = 3; -march=znver2)
GCC 9.1.0: 2125.26 (SE +/- 20.16, N = 3)
GCC 9.1.0 znver2: 2408.26 (SE +/- 0.53, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -lm
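
The SciMark 2.0 Composite score is the arithmetic mean of the five kernel scores (Monte Carlo, Fast Fourier Transform, Sparse Matrix Multiply, Dense LU Matrix Factorization, Jacobi Successive Over-Relaxation). As a quick consistency check, the Python sketch below recomputes the composite from the per-kernel Mflops listed in the SciMark sections of this result file and compares it with the reported Composite value; the variable and function names are illustrative only.

```python
# Per-kernel Mflops from the SciMark 2.0 results in this file, in the order:
# Monte Carlo, FFT, Sparse Matrix Multiply, Dense LU, Jacobi SOR.
kernel_mflops = {
    "GCC 10.0.0": [777.17, 301.17, 3856.63, 8526.66, 2175.85],
    "GCC 10.0.0 znver2": [759.97, 261.10, 3675.94, 10777.88, 2293.46],
    "GCC 9.1.0": [761.38, 295.18, 3767.63, 6891.37, 2125.26],
    "GCC 9.1.0 znver2": [800.23, 273.49, 3580.73, 11370.27, 2408.26],
}

# Composite scores as reported in the Composite section.
reported_composite = {
    "GCC 10.0.0": 3127.49,
    "GCC 10.0.0 znver2": 3553.67,
    "GCC 9.1.0": 2768.16,
    "GCC 9.1.0 znver2": 3686.60,
}


def composite(scores):
    """Arithmetic mean of the kernel scores."""
    return sum(scores) / len(scores)


for config, scores in kernel_mflops.items():
    diff = composite(scores) - reported_composite[config]
    print(f"{config}: recomputed {composite(scores):.2f}, "
          f"reported {reported_composite[config]:.2f}, difference {diff:+.3f}")
```

Because the mean is unweighted, the composite is dominated by the kernels with the largest absolute Mflops, so the Dense LU gains with -march=znver2 account for most of the composite improvement even though the Fast Fourier Transform and Sparse Matrix Multiply scores drop.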

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0: MFLOPS, More Is Better
GCC 10.0.0: 1385.23 (SE +/- 2.93, N = 3)
GCC 10.0.0 znver2: 1385.88 (SE +/- 0.48, N = 3; -march=znver2)
GCC 9.1.0: 1322.90 (SE +/- 11.21, N = 3)
GCC 9.1.0 znver2: 1378.46 (SE +/- 6.19, N = 3; -march=znver2)
1. (CC) gcc options: -O3 -mavx2

TSCP

AI Chess Performance

TSCP 1.81: Nodes Per Second, More Is Better
GCC 10.0.0: 1366017 (SE +/- 676.60, N = 5)
GCC 10.0.0 znver2: 1408752 (SE +/- 6261.48, N = 5; -march=znver2)
GCC 9.1.0: 1305781 (SE +/- 620.00, N = 5)
GCC 9.1.0 znver2: 1337188 (SE +/- 10688.23, N = 5; -march=znver2)
1. (CC) gcc options: -O3 -march=native

Stockfish

Total Time

Stockfish 9: Nodes Per Second, More Is Better
GCC 10.0.0: 39631993 (SE +/- 237875.03, N = 3)
GCC 10.0.0 znver2: 39540328 (SE +/- 131167.27, N = 3; -march=znver2)
GCC 9.1.0: 39278964 (SE +/- 210046.69, N = 3)
GCC 9.1.0 znver2: 39561655 (SE +/- 164232.11, N = 3; -march=znver2)
1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

GROMACS

Water Benchmark

GROMACS 2018.3: Ns Per Day, More Is Better
GCC 10.0.0: 0.97 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 0.98 (SE +/- 0.00, N = 3; -march=znver2)
GCC 9.1.0: 0.98 (SE +/- 0.00, N = 3)
GCC 9.1.0 znver2: 0.99 (SE +/- 0.00, N = 3; -march=znver2)
1. (CXX) g++ options: -march=core-avx2 -O3 -std=c++11 -funroll-all-loops -fopenmp -lrt -lpthread -lm

Memcached mcperf

Method: Get

Memcached mcperf 1.5.10: Operations Per Second, More Is Better
GCC 10.0.0: 95710.60 (SE +/- 1025.20, N = 15)
GCC 10.0.0 znver2: 97228.27 (SE +/- 1267.59, N = 3; -march=znver2)
GCC 9.1.0: 92376.40 (SE +/- 1551.16, N = 3)
GCC 9.1.0 znver2: 93850.59 (SE +/- 937.65, N = 15; -march=znver2)
1. (CC) gcc options: -O3 -lm -rdynamic

Memcached mcperf

Method: Set

Memcached mcperf 1.5.10: Operations Per Second, More Is Better
GCC 10.0.0: 57193.25 (SE +/- 2058.38, N = 15)
GCC 10.0.0 znver2: 52910.87 (SE +/- 393.33, N = 3; -march=znver2)
GCC 9.1.0: 52914.10 (SE +/- 293.82, N = 3)
GCC 9.1.0 znver2: 59232.07 (SE +/- 3850.96, N = 15; -march=znver2)
1. (CC) gcc options: -O3 -lm -rdynamic

John The Ripper

Test: Blowfish

John The Ripper 1.9.0-jumbo-1: Real C/S, More Is Better
GCC 10.0.0: 20426 (SE +/- 63.74, N = 3)
GCC 10.0.0 znver2: 20426 (SE +/- 64.22, N = 3)
GCC 9.1.0: 20335 (SE +/- 64.93, N = 3)
GCC 9.1.0 znver2: 20253 (SE +/- 63.01, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Redis

Test: GET

Redis 4.0.8: Requests Per Second, More Is Better
GCC 10.0.0: 3042507.47 (SE +/- 47460.73, N = 15)
GCC 10.0.0 znver2: 3031706.22 (SE +/- 51486.64, N = 15)
GCC 9.1.0: 3297713.33 (SE +/- 40781.06, N = 3)
GCC 9.1.0 znver2: 3066070.28 (SE +/- 61029.58, N = 15)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

Redis 4.0.8: Requests Per Second, More Is Better
GCC 10.0.0: 2051361.33 (SE +/- 28123.08, N = 3)
GCC 10.0.0 znver2: 2084989.88 (SE +/- 14796.01, N = 3)
GCC 9.1.0: 2122162.94 (SE +/- 30290.32, N = 4)
GCC 9.1.0 znver2: 2169531.00 (SE +/- 19021.82, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

NGINX Benchmark

Static Web Page Serving

NGINX Benchmark 1.9.9: Requests Per Second, More Is Better
GCC 10.0.0: 39525.70 (SE +/- 158.42, N = 3)
GCC 10.0.0 znver2: 39346.91 (SE +/- 23.74, N = 3; -march=znver2)
GCC 9.1.0: 39734.85 (SE +/- 102.83, N = 3)
GCC 9.1.0 znver2: 39602.49 (SE +/- 112.05, N = 3; -march=znver2)
1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.29: Requests Per Second, More Is Better
GCC 10.0.0: 38490.98 (SE +/- 65.39, N = 3)
GCC 10.0.0 znver2: 38009.25 (SE +/- 79.10, N = 3; -march=znver2)
GCC 9.1.0: 38392.29 (SE +/- 57.64, N = 3)
GCC 9.1.0 znver2: 38022.79 (SE +/- 139.15, N = 3; -march=znver2)
1. (CC) gcc options: -shared -fPIC -pthread -O3

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1: Signs Per Second, More Is Better
GCC 10.0.0: 3492.53 (SE +/- 1.89, N = 3)
GCC 10.0.0 znver2: 3487.10 (SE +/- 0.70, N = 3; -march=znver2)
GCC 9.1.0: 3516.27 (SE +/- 7.07, N = 3)
GCC 9.1.0 znver2: 3481.50 (SE +/- 1.42, N = 3; -march=znver2)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.3: TPS, More Is Better
GCC 10.0.0: 300244.81 (SE +/- 102.78, N = 3)
GCC 10.0.0 znver2: 298969.75 (SE +/- 237.85, N = 3; -march=znver2)
GCC 9.1.0: 300353.09 (SE +/- 513.53, N = 3)
GCC 9.1.0 znver2: 297539.89 (SE +/- 235.79, N = 3; -march=znver2)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write
TPS, More Is Better
GCC 10.0.0: 29372.39 (SE +/- 31.16, N = 3)
GCC 10.0.0 znver2: 29148.60 (SE +/- 124.84, N = 3) -march=znver2
GCC 9.1.0: 29178.23 (SE +/- 55.36, N = 3)
GCC 9.1.0 znver2: 29149.20 (SE +/- 40.41, N = 3) -march=znver2
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Apache Siege

Concurrent Users: 200

Apache Siege 2.4.29 - Concurrent Users: 200
Transactions Per Second, More Is Better
GCC 10.0.0: 82293.14 (SE +/- 3302.37, N = 12)
GCC 10.0.0 znver2: 83275.06 (SE +/- 1288.23, N = 15) -march=znver2
GCC 9.1.0: 60835.79 (SE +/- 798.56, N = 3)
GCC 9.1.0 znver2: 99824.49 (SE +/- 3575.15, N = 15) -march=znver2
1. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 250

Apache Siege 2.4.29 - Concurrent Users: 250
Transactions Per Second, More Is Better
GCC 10.0.0: 62725.24 (SE +/- 122.71, N = 3)
GCC 10.0.0 znver2: 102423.07 (SE +/- 1636.75, N = 12) -march=znver2
GCC 9.1.0: 98050.91 (SE +/- 3755.13, N = 15)
GCC 9.1.0 znver2: 96842.13 (SE +/- 4063.46, N = 12) -march=znver2
1. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

MKL-DNN 2019-04-16 - Harness: IP Batch 1D - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 154.09 (SE +/- 3.18, N = 12; MIN: 129)
GCC 10.0.0 znver2: 157.72 (SE +/- 3.21, N = 14; MIN: 127) -march=znver2
GCC 9.1.0: 152.51 (SE +/- 3.35, N = 15; MIN: 111.42)
GCC 9.1.0 znver2: 155.34 (SE +/- 2.27, N = 15; MIN: 127.99) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: IP Batch All - Data Type: f32

MKL-DNN 2019-04-16 - Harness: IP Batch All - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 1582.78 (SE +/- 5.99, N = 3; MIN: 1385.56)
GCC 10.0.0 znver2: 1556.91 (SE +/- 25.50, N = 3; MIN: 1368.2) -march=znver2
GCC 9.1.0: 1523.52 (SE +/- 7.48, N = 3; MIN: 1357.02)
GCC 9.1.0 znver2: 1599.68 (SE +/- 17.21, N = 3; MIN: 1393.73) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_3d - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Convolution Batch conv_3d - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 116.62 (SE +/- 0.79, N = 3; MIN: 102.39)
GCC 10.0.0 znver2: 118.02 (SE +/- 1.48, N = 4; MIN: 102.11) -march=znver2
GCC 9.1.0: 117.60 (SE +/- 0.16, N = 3; MIN: 103.13)
GCC 9.1.0 znver2: 118.47 (SE +/- 0.45, N = 3; MIN: 103.47) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_all - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Convolution Batch conv_all - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 19803.57 (SE +/- 87.03, N = 3; MIN: 19014.9)
GCC 10.0.0 znver2: 19694.33 (SE +/- 41.11, N = 3; MIN: 18995.6) -march=znver2
GCC 9.1.0: 19613.70 (SE +/- 42.61, N = 3; MIN: 18961.5)
GCC 9.1.0 znver2: 19696.57 (SE +/- 22.35, N = 3; MIN: 19033.5) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Deconvolution Batch deconv_1d - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 212.83 (SE +/- 0.29, N = 3; MIN: 201.7)
GCC 10.0.0 znver2: 218.66 (SE +/- 1.85, N = 15; MIN: 203.42) -march=znver2
GCC 9.1.0: 221.23 (SE +/- 2.00, N = 15; MIN: 202.07)
GCC 9.1.0 znver2: 217.02 (SE +/- 1.79, N = 13; MIN: 203.65) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Deconvolution Batch deconv_3d - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 58.16 (SE +/- 0.69, N = 15; MIN: 50.91)
GCC 10.0.0 znver2: 56.87 (SE +/- 0.58, N = 8; MIN: 50.96) -march=znver2
GCC 9.1.0: 59.00 (SE +/- 0.66, N = 7; MIN: 50.8)
GCC 9.1.0 znver2: 57.97 (SE +/- 0.49, N = 15; MIN: 51.57) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Convolution Batch conv_alexnet - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 2507.16 (SE +/- 6.13, N = 3; MIN: 2461.57)
GCC 10.0.0 znver2: 2543.93 (SE +/- 17.20, N = 3; MIN: 2467.76) -march=znver2
GCC 9.1.0: 2527.50 (SE +/- 9.57, N = 3; MIN: 2462.11)
GCC 9.1.0 znver2: 2520.01 (SE +/- 9.61, N = 3; MIN: 2467.07) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_all - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Deconvolution Batch deconv_all - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 50039.13 (SE +/- 224.88, N = 3; MIN: 46883.1)
GCC 10.0.0 znver2: 50679.53 (SE +/- 390.75, N = 3; MIN: 48056.6) -march=znver2
GCC 9.1.0: 51813.05 (SE +/- 589.20, N = 6; MIN: 48543.1)
GCC 9.1.0 znver2: 52238.80 (SE +/- 668.40, N = 3; MIN: 49224.9) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

MKL-DNN 2019-04-16 - Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32
ms, Fewer Is Better
GCC 10.0.0: 1145.01 (SE +/- 6.39, N = 3; MIN: 1050.58)
GCC 10.0.0 znver2: 1145.95 (SE +/- 5.61, N = 3; MIN: 1052.71) -march=znver2
GCC 9.1.0: 1147.62 (SE +/- 6.51, N = 3; MIN: 1052.13)
GCC 9.1.0 znver2: 1153.46 (SE +/- 6.23, N = 3; MIN: 1057.54) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 6.0.1 - Time To Compile
Seconds, Fewer Is Better
GCC 10.0.0: 292.53
GCC 10.0.0 znver2: 300.31
GCC 9.1.0: 280.27
GCC 9.1.0 znver2: 284.10

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.1.9 - Time To Compile
Seconds, Fewer Is Better
GCC 10.0.0: 54.43 (SE +/- 0.09, N = 3)
GCC 10.0.0 znver2: 53.76 (SE +/- 0.51, N = 3) -march=znver2
GCC 9.1.0: 52.71 (SE +/- 0.15, N = 3)
GCC 9.1.0 znver2: 53.91 (SE +/- 0.36, N = 3) -march=znver2
1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better
GCC 10.0.0: 42.63 (SE +/- 0.06, N = 3)
GCC 10.0.0 znver2: 39.36 (SE +/- 0.05, N = 3) -march=znver2
GCC 9.1.0: 43.09 (SE +/- 0.02, N = 3)
GCC 9.1.0 znver2: 39.42 (SE +/- 0.04, N = 3) -march=znver2
1. (CC) gcc options: -lm -lpthread -O3
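To put the C-Ray times above in relative terms, the following sketch computes the percentage change from adding -march=znver2 on a fewer-is-better metric. It assumes the four results follow the order of the chart legend (GCC 10.0.0, GCC 10.0.0 znver2, GCC 9.1.0, GCC 9.1.0 znver2); the helper name `percent_change` is illustrative and not part of the Phoronix Test Suite.

```python
def percent_change(baseline, tuned):
    """Return the change of `tuned` relative to `baseline`, in percent.

    For a fewer-is-better metric, a negative result means the tuned
    build finished faster.
    """
    return (tuned - baseline) / baseline * 100.0


# C-Ray 1.1 total time in seconds (assumption: values follow legend order).
c_ray = {
    "GCC 10.0.0": 42.63,
    "GCC 10.0.0 znver2": 39.36,
    "GCC 9.1.0": 43.09,
    "GCC 9.1.0 znver2": 39.42,
}

for compiler in ("GCC 10.0.0", "GCC 9.1.0"):
    delta = percent_change(c_ray[compiler], c_ray[compiler + " znver2"])
    print(f"{compiler}: {delta:+.1f}% with -march=znver2")
```

For more-is-better metrics such as requests per second, the same function applies but a positive result is the improvement.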

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 - Global Illumination Renderer; 128 Samples
Seconds, Fewer Is Better
GCC 10.0.0: 7.84 (SE +/- 0.01, N = 3)
GCC 10.0.0 znver2: 7.53 (SE +/- 0.01, N = 3) -march=znver2
GCC 9.1.0: 7.78 (SE +/- 0.00, N = 3)
GCC 9.1.0 znver2: 7.67 (SE +/- 0.01, N = 3) -march=znver2
1. (CXX) g++ options: -fopenmp -O3

AOBench

Size: 2048 x 2048 - Total Time

AOBench - Size: 2048 x 2048 - Total Time
Seconds, Fewer Is Better
GCC 10.0.0: 35.98 (SE +/- 0.06, N = 3)
GCC 10.0.0 znver2: 33.05 (SE +/- 0.02, N = 3) -march=znver2
GCC 9.1.0: 34.60 (SE +/- 0.05, N = 3)
GCC 9.1.0 znver2: 33.20 (SE +/- 0.10, N = 3) -march=znver2
1. (CC) gcc options: -lm -O3

Bullet Physics Engine

Test: Raytests

Bullet Physics Engine 2.81 - Test: Raytests
Seconds, Fewer Is Better
GCC 10.0.0: 2.11 (SE +/- 0.00, N = 6)
GCC 10.0.0 znver2: 2.06 (SE +/- 0.02, N = 3) -march=znver2
GCC 9.1.0: 1.98 (SE +/- 0.01, N = 3)
GCC 9.1.0 znver2: 2.04 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81 - Test: 3000 Fall
Seconds, Fewer Is Better
GCC 10.0.0: 3.44 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 3.27 (SE +/- 0.03, N = 3) -march=znver2
GCC 9.1.0: 3.20 (SE +/- 0.02, N = 3)
GCC 9.1.0 znver2: 3.22 (SE +/- 0.01, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81 - Test: 1000 Stack
Seconds, Fewer Is Better
GCC 10.0.0: 4.11 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 3.85 (SE +/- 0.04, N = 3) -march=znver2
GCC 9.1.0: 3.88 (SE +/- 0.02, N = 3)
GCC 9.1.0 znver2: 3.77 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81 - Test: 1000 Convex
Seconds, Fewer Is Better
GCC 10.0.0: 3.73 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 3.60 (SE +/- 0.04, N = 3) -march=znver2
GCC 9.1.0: 3.51 (SE +/- 0.02, N = 3)
GCC 9.1.0 znver2: 3.57 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Bullet Physics Engine 2.81 - Test: 136 Ragdolls
Seconds, Fewer Is Better
GCC 10.0.0: 2.20 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 2.05 (SE +/- 0.02, N = 3) -march=znver2
GCC 9.1.0: 2.05 (SE +/- 0.01, N = 3)
GCC 9.1.0 znver2: 2.04 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81 - Test: Prim Trimesh
Seconds, Fewer Is Better
GCC 10.0.0: 0.81 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 0.77 (SE +/- 0.01, N = 3) -march=znver2
GCC 9.1.0: 0.75 (SE +/- 0.00, N = 3)
GCC 9.1.0 znver2: 0.77 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81 - Test: Convex Trimesh
Seconds, Fewer Is Better
GCC 10.0.0: 0.95 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 0.91 (SE +/- 0.01, N = 3) -march=znver2
GCC 9.1.0: 0.89 (SE +/- 0.00, N = 3)
GCC 9.1.0 znver2: 0.90 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

XZ Compression 5.2.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
Seconds, Fewer Is Better
GCC 10.0.0: 25.26 (SE +/- 0.12, N = 3)
GCC 10.0.0 znver2: 25.39 (SE +/- 0.11, N = 3) -march=znver2
GCC 9.1.0: 25.23 (SE +/- 0.10, N = 3)
GCC 9.1.0 znver2: 25.25 (SE +/- 0.13, N = 3) -march=znver2
1. (CC) gcc options: -pthread -fvisibility=hidden -O3

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2 - WAV To FLAC
Seconds, Fewer Is Better
GCC 10.0.0: 7.72 (SE +/- 0.03, N = 5)
GCC 10.0.0 znver2: 8.11 (SE +/- 0.04, N = 5) -march=znver2
GCC 9.1.0: 7.70 (SE +/- 0.02, N = 5)
GCC 9.1.0 znver2: 7.99 (SE +/- 0.01, N = 5) -march=znver2
1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 - WAV To MP3
Seconds, Fewer Is Better
GCC 10.0.0: 7.28 (SE +/- 0.02, N = 3)
GCC 10.0.0 znver2: 7.45 (SE +/- 0.01, N = 3) -march=znver2
GCC 9.1.0: 7.25 (SE +/- 0.04, N = 3)
GCC 9.1.0 znver2: 6.94 (SE +/- 0.01, N = 3) -march=znver2
1. (CC) gcc options: -O3 -lncurses -lm

Ogg Encoding

WAV To Ogg

Ogg Encoding 1.3.3 - WAV To Ogg
Seconds, Fewer Is Better
GCC 10.0.0: 5.05 (SE +/- 0.00, N = 3)
GCC 10.0.0 znver2: 5.36 (SE +/- 0.00, N = 4) -march=znver2
GCC 9.1.0: 5.13 (SE +/- 0.00, N = 3)
GCC 9.1.0 znver2: 5.05 (SE +/- 0.01, N = 3) -march=znver2
1. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -logg

FFmpeg

H.264 HD To NTSC DV

FFmpeg 4.0.2 - H.264 HD To NTSC DV
Seconds, Fewer Is Better
GCC 10.0.0: 6.88 (SE +/- 0.01, N = 3)
GCC 10.0.0 znver2: 6.78 (SE +/- 0.02, N = 3) -march=znver2
GCC 9.1.0: 6.86 (SE +/- 0.01, N = 3)
GCC 9.1.0 znver2: 6.83 (SE +/- 0.03, N = 3) -march=znver2
1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -O3 -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

m-queens

Time To Solve

m-queens 1.2 - Time To Solve
Seconds, Fewer Is Better
GCC 10.0.0: 47.14 (SE +/- 0.11, N = 3)
GCC 10.0.0 znver2: 47.21 (SE +/- 0.10, N = 3) -march=znver2
GCC 9.1.0: 47.12 (SE +/- 0.12, N = 3)
GCC 9.1.0 znver2: 47.27 (SE +/- 0.08, N = 3) -march=znver2
1. (CXX) g++ options: -fopenmp -O3 -O2 -march=native

CppPerformanceBenchmarks

Test: Atol

CppPerformanceBenchmarks 9 - Test: Atol
Seconds, Fewer Is Better
GCC 10.0.0: 59.97 (SE +/- 0.17, N = 3)
GCC 10.0.0 znver2: 63.34 (SE +/- 0.06, N = 3) -march=znver2
GCC 9.1.0: 59.31 (SE +/- 0.30, N = 3)
GCC 9.1.0 znver2: 60.38 (SE +/- 0.53, N = 11) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Ctype

CppPerformanceBenchmarks 9 - Test: Ctype
Seconds, Fewer Is Better
GCC 10.0.0: 31.51 (SE +/- 0.38, N = 5)
GCC 10.0.0 znver2: 31.30 (SE +/- 0.03, N = 3) -march=znver2
GCC 9.1.0: 31.52 (SE +/- 0.28, N = 3)
GCC 9.1.0 znver2: 31.43 (SE +/- 0.14, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Math Library

CppPerformanceBenchmarks 9 - Test: Math Library
Seconds, Fewer Is Better
GCC 10.0.0: 307.23 (SE +/- 4.29, N = 3)
GCC 10.0.0 znver2: 306.02 (SE +/- 2.37, N = 3) -march=znver2
GCC 9.1.0: 309.36 (SE +/- 0.26, N = 3)
GCC 9.1.0 znver2: 302.82 (SE +/- 3.91, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Random Numbers

CppPerformanceBenchmarks 9 - Test: Random Numbers
Seconds, Fewer Is Better
GCC 10.0.0: 799.88 (SE +/- 4.15, N = 3)
GCC 10.0.0 znver2: 787.77 (SE +/- 10.35, N = 5) -march=znver2
GCC 9.1.0: 751.15 (SE +/- 2.69, N = 3)
GCC 9.1.0 znver2: 750.66 (SE +/- 0.27, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Vector

CppPerformanceBenchmarks 9 - Test: Stepanov Vector
Seconds, Fewer Is Better
GCC 10.0.0: 74.26 (SE +/- 0.88, N = 3)
GCC 10.0.0 znver2: 77.22 (SE +/- 0.04, N = 3) -march=znver2
GCC 9.1.0: 76.45 (SE +/- 0.35, N = 3)
GCC 9.1.0 znver2: 74.08 (SE +/- 0.12, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Function Objects

CppPerformanceBenchmarks 9 - Test: Function Objects
Seconds, Fewer Is Better
GCC 10.0.0: 15.10 (SE +/- 0.17, N = 3)
GCC 10.0.0 znver2: 14.90 (SE +/- 0.20, N = 4) -march=znver2
GCC 9.1.0: 14.40 (SE +/- 0.08, N = 3)
GCC 9.1.0 znver2: 14.15 (SE +/- 0.03, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Abstraction

CppPerformanceBenchmarks 9 - Test: Stepanov Abstraction
Seconds, Fewer Is Better
GCC 10.0.0: 28.19 (SE +/- 0.08, N = 3)
GCC 10.0.0 znver2: 28.30 (SE +/- 0.45, N = 3) -march=znver2
GCC 9.1.0: 27.60 (SE +/- 0.03, N = 3)
GCC 9.1.0 znver2: 28.93 (SE +/- 0.00, N = 3) -march=znver2
1. (CXX) g++ options: -O3 -std=c++11

Sockperf

Test: Latency Ping Pong

Sockperf 3.4 - Test: Latency Ping Pong
usec, Fewer Is Better
GCC 10.0.0: 3.03 (SE +/- 0.02, N = 25)
GCC 10.0.0 znver2: 3.04 (SE +/- 0.02, N = 25) -march=znver2
GCC 9.1.0: 3.12 (SE +/- 0.04, N = 5)
GCC 9.1.0 znver2: 3.15 (SE +/- 0.04, N = 6) -march=znver2
1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

HPC Challenge

Test / Class: Random Ring Latency

HPC Challenge 1.5.0 - Test / Class: Random Ring Latency
usecs, Fewer Is Better
GCC 10.0.0: 0.33186 (SE +/- 0.00125, N = 3)
GCC 10.0.0 znver2: 0.32521 (SE +/- 0.00071, N = 3) -march=znver2
GCC 9.1.0: 0.32596 (SE +/- 0.00047, N = 3)
GCC 9.1.0 znver2: 0.32698 (SE +/- 0.00042, N = 3) -march=znver2
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops
2. OpenBLAS + Open MPI 2.1.1


Phoronix Test Suite v10.8.5