Ryzen 9 3900X Znver2 Compiler Tuning

AMD Ryzen 9 3900X 12-Core testing of GCC 9 and GCC 10 development with Znver2 tuning following recent cost table updates, etc. Benchmarks by Michael Larabel for a future article..

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1907290-HV-RYZEN939034
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 9.1.0
July 27 2019
  5 Hours, 45 Minutes
GCC 9.1.0 znver2
July 27 2019
  5 Hours, 43 Minutes
GCC 10.0.0
July 28 2019
  5 Hours, 25 Minutes
GCC 10.0.0 znver2
July 28 2019
  5 Hours, 52 Minutes
Invert Behavior (Only Show Selected Data)
  5 Hours, 41 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Ryzen 9 3900X Znver2 Compiler TuningOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 9.1.0GCC 10.0.0 20190727ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilersFile-SystemScreen ResolutionRyzen 9 3900X Znver2 Compiler Tuning BenchmarksSystem Logs- GCC 9.1.0: CXXFLAGS=-O3 CFLAGS=-O3- GCC 9.1.0 znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2- GCC 10.0.0 znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2- GCC 10.0.0: CXXFLAGS=-O3 CFLAGS=-O3- --disable-multilib --enable-checking=release- Scaling Governor: acpi-cpufreq ondemand- Python 2.7.15+ + Python 3.6.8- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling

GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0Result OverviewPhoronix Test Suite100%109%118%128%137%Apache SiegeAOM AV1SciMarkFFTWC-RayAOBenchTSCPSVT-VP9LAME MP3 EncodingTimed LLVM CompilationBullet Physics EngineMemcached mcperfOgg EncodingRedisFLAC Audio EncodingGraphicsMagickHimeno BenchmarkSmallptlzbenchlibjpeg-turbo tjbenchTimed PHP CompilationSockperfGNU MPCCppPerformanceBenchmarksCoremarkGROMACSCpuminer-OptFFmpegApache BenchmarkHPC ChallengeSVT-HEVCx265x264MKL-DNNOpenSSLNGINX BenchmarkHigh Performance Conjugate GradientStockfishJohn The RipperPostgreSQL pgbenchXZ CompressionSVT-AV1m-queens

Ryzen 9 3900X Znver2 Compiler Tuningcpp-perf-bench: Rand Numbershpcc: G-HPLmkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_all - f32cpp-perf-bench: Math Librarygromacs: Water Benchmarkfftw: Stock - 2D FFT Size 4096mkl-dnn: Convolution Batch conv_googlenet_v3 - f32cpp-perf-bench: Atolbuild-llvm: Time To Compileapache-siege: 250apache-siege: 200mkl-dnn: Deconvolution Batch deconv_1d - f32pgbench: Buffer Test - Normal Load - Read Onlycpp-perf-bench: Stepanov Vectorpgbench: Buffer Test - Normal Load - Read Writemkl-dnn: IP Batch 1D - f32mcperf: Setaom-av1: AV1 Video Encodinghpcg: stockfish: Total Timegraphics-magick: Noise-Gaussiangraphics-magick: Swirlgraphics-magick: Enhancedgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: Rotategraphics-magick: HWB Color Spacecpuminer-opt: deephimeno: Poisson Pressure Solverbuild-php: Time To Compilenginx: Static Web Page Servingmkl-dnn: IP Batch All - f32mpcbench: Multi-Precision Benchmarkmcperf: Getcpuminer-opt: sha256tlzbench: Zstd 1 - Decompressionlzbench: Zstd 1 - Compressionm-queens: Time To Solvemkl-dnn: Convolution Batch conv_3d - f32sockperf: Latency Ping Pongredis: GETc-ray: Total Time - 4K, 16 Rays Per Pixellzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressioncpp-perf-bench: Ctypecpuminer-opt: lbrycpuminer-opt: m7maobench: 2048 x 2048 - Total Timelzbench: Libdeflate 1 - Decompressionlzbench: Libdeflate 1 - Compressioncpuminer-opt: skeinlzbench: Brotli 0 - Compressioncpuminer-opt: myr-grjohn-the-ripper: Blowfishcoremark: CoreMark Size 666 - Iterations Per Secondcpp-perf-bench: Stepanov Abstractionscimark2: Compositeapache: Static Web Page Servingcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9sockperf: Throughputmkl-dnn: Convolution Batch conv_alexnet - f32openssl: RSA 4096-bit Performancecpp-perf-bench: Function Objectsencode-flac: WAV To FLACmkl-dnn: Deconvolution Batch deconv_3d - f32redis: SETx265: H.265 1080p Video Encodingsmallpt: Global Illumination Renderer; 128 Samplesencode-mp3: WAV To MP3ffmpeg: H.264 HD To NTSC DVsvt-vp9: 1080p 8-bit YUV To VP9 Video Encodex264: H.264 Video Encodingtjbench: Decompression Throughputencode-ogg: WAV To Oggsvt-av1: 1080p 8-bit YUV To AV1 Video Encodefftw: Stock - 2D FFT Size 512bullet: Raytestsfftw: Float + SSE - 2D FFT Size 32fftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 32svt-hevc: 1080p 8-bit YUV To HEVC Video Encodetscp: AI Chess Performancebullet: Convex Trimeshbullet: Prim Trimeshbullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carlohpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Fftehpcc: G-FfteGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0751.1570.9690051813.0519613.70309.360.987063.031147.6259.31280.2798050.9160835.79221.23300353.0976.4529178.23152.5152914.100.271.0939278964170251209181275262287111901322.9052.7139734.851523.52959792376.4087951126946847.12117.603.123297713.3343.091163931.5234583593.8934.601119239393974941412720335567987.3427.602768.1638392.2925.235145512527.503516.2714.407.7059.002122162.9452.947.787.256.8689.99139.59218.095.1346.459028.101.98452531295811909246.0113057810.890.752.053.513.883.202125.266891.373767.63295.18761.3824227.2534.891610.325960.097571.708202.7325532.830838.598038.59803750.6671.7766352238.8019696.57302.820.997920.171153.4660.38284.1096842.1399824.49217.02297539.8974.0829149.20155.3459232.070.311.083956165517125922119528026329310230.341378.4653.9139602.491599.68957793850.5987238126846847.27118.473.153066070.2839.421164031.4334420590.6633.201183257397975151402320253555154.6028.933686.6038022.7925.255170952520.013481.5014.157.9957.972169531.0052.537.676.946.8396.54138.41225.645.0546.39108142.04449511182814314247.3313371880.900.772.043.573.773.222408.2611370.273580.73273.49800.2323832.6084.988320.326980.097981.716682.9522532.602638.605148.60514787.7771.0493050679.5319694.33306.020.987823.271145.9563.34300.31102423.0783275.06218.66298969.7577.2229148.60157.7252910.870.321.0839540328173264223196286277302111231385.8853.7639346.911556.91935797228.2786440125045347.21118.023.043031706.2239.361083731.3034630590.8033.051159248398435051413720426567096.6528.303553.6738009.2525.395296572543.933487.1014.908.1156.872084989.8852.407.537.456.7892.35139.82225.445.3646.49105312.06463051411314119247.9914087520.910.772.053.603.853.272293.4610777.883675.94261.10759.9723993.0435.046030.325210.097711.730552.9473032.843638.637948.63794799.8871.0701050039.1319803.57307.230.977071.301145.0159.97292.5362725.2482293.14212.83300244.8174.2629372.39154.0957193.250.271.0939631993170254208181274262288111371385.2354.4339525.701582.78958095710.6086417128746747.14116.623.033042507.4742.631134031.5135288591.3235.981147250397205221413020426568329.0028.193127.4938490.9825.265147482507.163492.5315.107.7258.162051361.3353.007.847.286.8889.84138.74220.335.0546.229583.732.11453611274812902248.8513660170.950.812.203.734.113.442175.858526.663856.63301.17777.1723885.4384.949470.331860.097781.722052.7297432.863438.817488.81748OpenBenchmarking.org

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02004006008001000SE +/- 2.69, N = 3SE +/- 0.27, N = 3SE +/- 10.35, N = 5SE +/- 4.15, N = 3751.15750.66787.77799.88-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01632486480SE +/- 0.23, N = 3SE +/- 0.37, N = 3SE +/- 0.22, N = 3SE +/- 0.08, N = 370.9771.7871.0571.07-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.011K22K33K44K55KSE +/- 589.20, N = 6SE +/- 668.40, N = 3SE +/- 390.75, N = 3SE +/- 224.88, N = 351813.0552238.8050679.5350039.13MIN: 48543.1-march=znver2 - MIN: 49224.9-march=znver2 - MIN: 48056.6MIN: 46883.11. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.04K8K12K16K20KSE +/- 42.61, N = 3SE +/- 22.35, N = 3SE +/- 41.11, N = 3SE +/- 87.03, N = 319613.7019696.5719694.3319803.57MIN: 18961.5-march=znver2 - MIN: 19033.5-march=znver2 - MIN: 18995.6MIN: 19014.91. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.070140210280350SE +/- 0.26, N = 3SE +/- 3.91, N = 3SE +/- 2.37, N = 3SE +/- 4.29, N = 3309.36302.82306.02307.23-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

GROMACS

The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water BenchmarkGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.980.990.980.97-march=znver2-march=znver21. (CXX) g++ options: -march=core-avx2 -O3 -std=c++11 -funroll-all-loops -fopenmp -lrt -lpthread -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02K4K6K8K10KSE +/- 73.85, N = 3SE +/- 67.62, N = 3SE +/- 95.02, N = 3SE +/- 40.30, N = 37063.037920.177823.277071.30-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02004006008001000SE +/- 6.51, N = 3SE +/- 6.23, N = 3SE +/- 5.61, N = 3SE +/- 6.39, N = 31147.621153.461145.951145.01MIN: 1052.13-march=znver2 - MIN: 1057.54-march=znver2 - MIN: 1052.71MIN: 1050.581. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01428425670SE +/- 0.30, N = 3SE +/- 0.53, N = 11SE +/- 0.06, N = 3SE +/- 0.17, N = 359.3160.3863.3459.97-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.070140210280350280.27284.10300.31292.53

Apache Siege

This is a test of the Apache web server performance being facilitated by the Siege web serverb enchmark program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.020K40K60K80K100KSE +/- 3755.13, N = 15SE +/- 4063.46, N = 12SE +/- 1636.75, N = 12SE +/- 122.71, N = 398050.9196842.13102423.0762725.24-march=znver2-march=znver21. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 200GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.020K40K60K80K100KSE +/- 798.56, N = 3SE +/- 3575.15, N = 15SE +/- 1288.23, N = 15SE +/- 3302.37, N = 1260835.7999824.4983275.0682293.14-march=znver2-march=znver21. (CC) gcc options: -O3 -lpthread -ldl -lssl -lcrypto

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.050100150200250SE +/- 2.00, N = 15SE +/- 1.79, N = 13SE +/- 1.85, N = 15SE +/- 0.29, N = 3221.23217.02218.66212.83MIN: 202.07-march=znver2 - MIN: 203.65-march=znver2 - MIN: 203.42MIN: 201.71. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.060K120K180K240K300KSE +/- 513.53, N = 3SE +/- 235.79, N = 3SE +/- 237.85, N = 3SE +/- 102.78, N = 3300353.09297539.89298969.75300244.81-march=znver2-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.020406080100SE +/- 0.35, N = 3SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.88, N = 376.4574.0877.2274.26-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.06K12K18K24K30KSE +/- 55.36, N = 3SE +/- 40.41, N = 3SE +/- 124.84, N = 3SE +/- 31.16, N = 329178.2329149.2029148.6029372.39-march=znver2-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0306090120150SE +/- 3.35, N = 15SE +/- 2.27, N = 15SE +/- 3.21, N = 14SE +/- 3.18, N = 12152.51155.34157.72154.09MIN: 111.42-march=znver2 - MIN: 127.99-march=znver2 - MIN: 127MIN: 1291. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Memcached mcperf

This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: SetGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.013K26K39K52K65KSE +/- 293.82, N = 3SE +/- 3850.96, N = 15SE +/- 393.33, N = 3SE +/- 2058.38, N = 1552914.1059232.0752910.8757193.25-march=znver2-march=znver21. (CC) gcc options: -O3 -lm -rdynamic

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-02-11AV1 Video EncodingGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.0720.1440.2160.2880.36SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.270.310.320.27-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.24530.49060.73590.98121.2265SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 41.091.081.081.09

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.08M16M24M32M40MSE +/- 210046.69, N = 3SE +/- 164232.11, N = 3SE +/- 131167.27, N = 3SE +/- 237875.03, N = 339278964395616553954032839631993-march=znver2-march=znver21. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.04080120160200SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3170171173170-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.060120180240300SE +/- 1.86, N = 3SE +/- 1.20, N = 3SE +/- 0.88, N = 3251259264254-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.050100150200250SE +/- 1.20, N = 3209221223208-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.04080120160200SE +/- 0.33, N = 3181195196181-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.060120180240300SE +/- 2.19, N = 3SE +/- 1.15, N = 3SE +/- 1.53, N = 3SE +/- 2.65, N = 3275280286274-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.060120180240300SE +/- 1.20, N = 3SE +/- 1.86, N = 3SE +/- 4.33, N = 3262263277262-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.070140210280350SE +/- 2.60, N = 3SE +/- 2.19, N = 3SE +/- 0.33, N = 3SE +/- 2.19, N = 3287293302288-march=znver2-march=znver21. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: deepGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02K4K6K8K10KSE +/- 926.03, N = 12SE +/- 8.82, N = 3SE +/- 3.33, N = 311190.0010230.3411123.0011137.00-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.030060090012001500SE +/- 11.21, N = 3SE +/- 6.19, N = 3SE +/- 0.48, N = 3SE +/- 2.93, N = 31322.901378.461385.881385.23-march=znver2-march=znver21. (CC) gcc options: -O3 -mavx2

Timed PHP Compilation

This test times how long it takes to build PHP 5 with the Zend engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01224364860SE +/- 0.15, N = 3SE +/- 0.36, N = 3SE +/- 0.51, N = 3SE +/- 0.09, N = 352.7153.9153.7654.43-march=znver2-march=znver21. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

NGINX Benchmark

This is a test of ab, which is the Apache Benchmark program running against nginx. This test profile measures how many requests per second a given system can sustain when carrying out 2,000,000 requests with 500 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page ServingGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.09K18K27K36K45KSE +/- 102.83, N = 3SE +/- 112.05, N = 3SE +/- 23.74, N = 3SE +/- 158.42, N = 339734.8539602.4939346.9139525.70-march=znver2-march=znver21. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.030060090012001500SE +/- 7.48, N = 3SE +/- 17.21, N = 3SE +/- 25.50, N = 3SE +/- 5.99, N = 31523.521599.681556.911582.78MIN: 1357.02-march=znver2 - MIN: 1393.73-march=znver2 - MIN: 1368.2MIN: 1385.561. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

GNU MPC

GNU MPC is a C library for the arithmetic of complex numbers. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGlobal Score, More Is BetterGNU MPC 1.1.0Multi-Precision BenchmarkGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02K4K6K8K10KSE +/- 31.80, N = 3SE +/- 50.44, N = 3SE +/- 102.03, N = 3SE +/- 26.46, N = 39597957793579580-march=znver2-march=znver21. (CC) gcc options: -lm -O3 -MT -MD -MP -MF

Memcached mcperf

This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: GetGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.020K40K60K80K100KSE +/- 1551.16, N = 3SE +/- 937.65, N = 15SE +/- 1267.59, N = 3SE +/- 1025.20, N = 1592376.4093850.5997228.2795710.60-march=znver2-march=znver21. (CC) gcc options: -O3 -lm -rdynamic

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: sha256tGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.020K40K60K80K100KSE +/- 990.16, N = 7SE +/- 1027.26, N = 6SE +/- 180.83, N = 3SE +/- 116.81, N = 387951872388644086417-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: DecompressionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.030060090012001500SE +/- 9.50, N = 3SE +/- 12.79, N = 8SE +/- 0.33, N = 3SE +/- 0.58, N = 312691268125012871. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: CompressionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0100200300400500SE +/- 3.18, N = 3SE +/- 4.91, N = 8SE +/- 0.33, N = 34684684534671. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01122334455SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 347.1247.2747.2147.14-march=znver2-march=znver21. (CXX) g++ options: -fopenmp -O3 -O2 -march=native

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0306090120150SE +/- 0.16, N = 3SE +/- 0.45, N = 3SE +/- 1.48, N = 4SE +/- 0.79, N = 3117.60118.47118.02116.62MIN: 103.13-march=znver2 - MIN: 103.47-march=znver2 - MIN: 102.11MIN: 102.391. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Sockperf

This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping PongGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.70881.41762.12642.83523.544SE +/- 0.04, N = 5SE +/- 0.04, N = 6SE +/- 0.02, N = 25SE +/- 0.02, N = 253.123.153.043.03-march=znver2-march=znver21. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0700K1400K2100K2800K3500KSE +/- 40781.06, N = 3SE +/- 61029.58, N = 15SE +/- 51486.64, N = 15SE +/- 47460.73, N = 153297713.333066070.283031706.223042507.471. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01020304050SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 343.0939.4239.3642.63-march=znver2-march=znver21. (CC) gcc options: -lm -lpthread -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: DecompressionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0306090120150SE +/- 0.33, N = 31161161081131. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: CompressionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0918273645SE +/- 0.33, N = 3394037401. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0714212835SE +/- 0.28, N = 3SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.38, N = 531.5231.4331.3031.51-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: lbryGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.08K16K24K32K40KSE +/- 550.28, N = 3SE +/- 5.77, N = 3SE +/- 20.82, N = 3SE +/- 460.86, N = 534583344203463035288-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: m7mGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0130260390520650SE +/- 0.29, N = 3SE +/- 0.15, N = 3SE +/- 0.35, N = 3SE +/- 0.27, N = 3593.89590.66590.80591.32-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0816243240SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 334.6033.2033.0535.98-march=znver2-march=znver21. (CC) gcc options: -lm -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: DecompressionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.030060090012001500SE +/- 10.00, N = 3SE +/- 0.33, N = 311191183115911471. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: CompressionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.060120180240300SE +/- 1.86, N = 3SE +/- 0.67, N = 32392572482501. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: skeinGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.09K18K27K36K45KSE +/- 602.50, N = 3SE +/- 21.86, N = 3SE +/- 133.46, N = 3SE +/- 5.77, N = 339397397973984339720-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: CompressionGCC 10.0.0GCC 10.0.0 znver2GCC 9.1.0GCC 9.1.0 znver2110220330440550SE +/- 0.88, N = 3SE +/- 4.47, N = 11SE +/- 0.67, N = 3SE +/- 4.10, N = 35074994945151. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: myr-grGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.03K6K9K12K15KSE +/- 49.78, N = 3SE +/- 26.03, N = 3SE +/- 6.67, N = 3SE +/- 40.00, N = 314127140231413714130-march=znver2-march=znver21. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.04K8K12K16K20KSE +/- 64.93, N = 3SE +/- 63.01, N = 3SE +/- 64.22, N = 3SE +/- 63.74, N = 3203352025320426204261. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0120K240K360K480K600KSE +/- 1430.19, N = 3SE +/- 2761.64, N = 3SE +/- 1036.74, N = 3SE +/- 1210.22, N = 3567987.34555154.60567096.65568329.00-march=znver2-march=znver21. (CC) gcc options: -O2 -O3 -lrt" -lrt

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0714212835SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.45, N = 3SE +/- 0.08, N = 327.6028.9328.3028.19-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.08001600240032004000SE +/- 25.64, N = 3SE +/- 5.91, N = 3SE +/- 13.97, N = 3SE +/- 5.96, N = 32768.163686.603553.673127.49-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

Apache Benchmark

This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.08K16K24K32K40KSE +/- 57.64, N = 3SE +/- 139.15, N = 3SE +/- 79.10, N = 3SE +/- 65.39, N = 338392.2938022.7938009.2538490.98-march=znver2-march=znver21. (CC) gcc options: -shared -fPIC -pthread -O3

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0612182430SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 325.2325.2525.3925.26-march=znver2-march=znver21. (CC) gcc options: -pthread -fvisibility=hidden -O3

Sockperf

This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: ThroughputGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0110K220K330K440K550KSE +/- 4767.11, N = 5SE +/- 4175.03, N = 5SE +/- 3715.76, N = 18SE +/- 5409.10, N = 5514551517095529657514748-march=znver2-march=znver21. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.05001000150020002500SE +/- 9.57, N = 3SE +/- 9.61, N = 3SE +/- 17.20, N = 3SE +/- 6.13, N = 32527.502520.012543.932507.16MIN: 2462.11-march=znver2 - MIN: 2467.07-march=znver2 - MIN: 2467.76MIN: 2461.571. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.08001600240032004000SE +/- 7.07, N = 3SE +/- 1.42, N = 3SE +/- 0.70, N = 3SE +/- 1.89, N = 33516.273481.503487.103492.53-march=znver2-march=znver21. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.048121620SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.20, N = 4SE +/- 0.17, N = 314.4014.1514.9015.10-march=znver2-march=znver21. (CXX) g++ options: -O3 -std=c++11

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0246810SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.04, N = 5SE +/- 0.03, N = 57.707.998.117.72-march=znver2-march=znver21. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01326395265SE +/- 0.66, N = 7SE +/- 0.49, N = 15SE +/- 0.58, N = 8SE +/- 0.69, N = 1559.0057.9756.8758.16MIN: 50.8-march=znver2 - MIN: 51.57-march=znver2 - MIN: 50.96MIN: 50.911. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0500K1000K1500K2000K2500KSE +/- 30290.32, N = 4SE +/- 19021.82, N = 3SE +/- 14796.01, N = 3SE +/- 28123.08, N = 32122162.942169531.002084989.882051361.331. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01224364860SE +/- 0.28, N = 3SE +/- 0.06, N = 3SE +/- 0.19, N = 3SE +/- 0.20, N = 352.9452.5352.4053.00-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.787.677.537.84-march=znver2-march=znver21. (CXX) g++ options: -fopenmp -O3

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0246810SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 37.256.947.457.28-march=znver2-march=znver21. (CC) gcc options: -O3 -lncurses -lm

FFmpeg

This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0246810SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.866.836.786.88-march=znver2-march=znver21. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -O3 -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video EncodeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.020406080100SE +/- 0.28, N = 3SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.15, N = 389.9996.5492.3589.84-march=znver2-march=znver21. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0306090120150SE +/- 1.55, N = 7SE +/- 2.03, N = 3SE +/- 2.09, N = 4SE +/- 2.27, N = 3139.59138.41139.82138.74-march=znver2-march=znver21. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.050100150200250SE +/- 0.44, N = 3SE +/- 0.31, N = 3SE +/- 0.30, N = 3SE +/- 2.32, N = 3218.09225.64225.44220.33-march=znver2-march=znver21. (CC) gcc options: -O3 -rdynamic

Ogg Encoding

This test times how long it takes to encode a sample WAV file to Ogg format using vorbis-tools, libvorbis, and libogg. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Encoding 1.3.3WAV To OggGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01.2062.4123.6184.8246.03SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 4SE +/- 0.00, N = 35.135.055.365.05-march=znver2-march=znver21. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -logg

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01122334455SE +/- 0.15, N = 3SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.27, N = 346.4546.3946.4946.22-march=znver2-march=znver21. (CXX) g++ options: -O3 -pie -lpthread -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02K4K6K8K10KSE +/- 30.08, N = 3SE +/- 148.34, N = 4SE +/- 10.67, N = 3SE +/- 19.17, N = 39028.1010814.0010531.009583.73-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

Bullet Physics Engine

This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.47480.94961.42441.89922.374SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 61.982.042.062.11-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.010K20K30K40K50KSE +/- 663.38, N = 4SE +/- 105.51, N = 3SE +/- 28.47, N = 3SE +/- 54.85, N = 345253449514630545361-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.03K6K9K12K15KSE +/- 1.76, N = 3SE +/- 15.90, N = 3SE +/- 5.51, N = 3SE +/- 110.06, N = 312958118281411312748-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32GCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.03K6K9K12K15KSE +/- 155.95, N = 3SE +/- 141.66, N = 3SE +/- 2.19, N = 311909143141411912902-march=znver2-march=znver21. (CC) gcc options: -pthread -O3 -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.050100150200250SE +/- 3.72, N = 3SE +/- 0.72, N = 3SE +/- 1.78, N = 3SE +/- 1.73, N = 3246.01247.33247.99248.85-march=znver2-march=znver21. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0300K600K900K1200K1500KSE +/- 620.00, N = 5SE +/- 10688.23, N = 5SE +/- 6261.48, N = 5SE +/- 676.60, N = 51305781133718814087521366017-march=znver2-march=znver21. (CC) gcc options: -O3 -march=native

Bullet Physics Engine

This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.21380.42760.64140.85521.069SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.890.900.910.95-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.18230.36460.54690.72920.9115SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.750.770.770.81-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.4950.991.4851.982.475SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 32.052.042.052.20-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.83931.67862.51793.35724.1965SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 33.513.573.603.73-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.92481.84962.77443.69924.624SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 33.883.773.854.11-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.7741.5482.3223.0963.87SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 33.203.223.273.44-march=znver2-march=znver21. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.05001000150020002500SE +/- 20.16, N = 3SE +/- 0.53, N = 3SE +/- 0.89, N = 3SE +/- 0.16, N = 32125.262408.262293.462175.85-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02K4K6K8K10KSE +/- 60.72, N = 3SE +/- 15.98, N = 3SE +/- 12.24, N = 3SE +/- 19.91, N = 36891.3711370.2710777.888526.66-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.08001600240032004000SE +/- 37.65, N = 3SE +/- 13.73, N = 3SE +/- 58.43, N = 3SE +/- 15.78, N = 33767.633580.733675.943856.63-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.070140210280350SE +/- 2.85, N = 3SE +/- 0.21, N = 3SE +/- 0.24, N = 3SE +/- 0.54, N = 3295.18273.49261.10301.17-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.02004006008001000SE +/- 7.16, N = 3SE +/- 0.74, N = 3SE +/- 0.29, N = 3SE +/- 0.24, N = 3761.38800.23759.97777.17-march=znver2-march=znver21. (CC) gcc options: -O3 -lm

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.05K10K15K20K25KSE +/- 159.70, N = 3SE +/- 119.42, N = 3SE +/- 195.64, N = 3SE +/- 62.37, N = 324227.2523832.6123993.0423885.44-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.01.13542.27083.40624.54165.677SE +/- 0.02698, N = 3SE +/- 0.07571, N = 3SE +/- 0.04322, N = 3SE +/- 0.05697, N = 34.891614.988325.046034.94947-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.07470.14940.22410.29880.3735SE +/- 0.00047, N = 3SE +/- 0.00042, N = 3SE +/- 0.00071, N = 3SE +/- 0.00125, N = 30.325960.326980.325210.33186-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.0220.0440.0660.0880.11SE +/- 0.00036, N = 3SE +/- 0.00042, N = 3SE +/- 0.00044, N = 3SE +/- 0.00041, N = 30.097570.097980.097710.09778-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.38940.77881.16821.55761.947SE +/- 0.00015, N = 3SE +/- 0.00081, N = 3SE +/- 0.00091, N = 3SE +/- 0.00098, N = 31.708201.716681.730551.72205-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.00.66431.32861.99292.65723.3215SE +/- 0.00047, N = 3SE +/- 0.00095, N = 3SE +/- 0.00082, N = 3SE +/- 0.00151, N = 32.732552.952252.947302.72974-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0816243240SE +/- 0.19, N = 3SE +/- 0.42, N = 3SE +/- 0.22, N = 3SE +/- 0.11, N = 332.8332.6032.8432.86-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0246810SE +/- 0.02013, N = 3SE +/- 0.06300, N = 3SE +/- 0.02559, N = 3SE +/- 0.18198, N = 38.598038.605148.637948.81748-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteGCC 9.1.0GCC 9.1.0 znver2GCC 10.0.0 znver2GCC 10.0.0246810SE +/- 0.02013, N = 3SE +/- 0.06300, N = 3SE +/- 0.02559, N = 3SE +/- 0.18198, N = 38.598038.605148.637948.81748-march=znver2-march=znver21. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -funroll-loops2. OpenBLAS + Open MPI 2.1.1

103 Results Shown

CppPerformanceBenchmarks
HPC Challenge
MKL-DNN:
  Deconvolution Batch deconv_all - f32
  Convolution Batch conv_all - f32
CppPerformanceBenchmarks
GROMACS
FFTW
MKL-DNN
CppPerformanceBenchmarks
Timed LLVM Compilation
Apache Siege:
  250
  200
MKL-DNN
PostgreSQL pgbench
CppPerformanceBenchmarks
PostgreSQL pgbench
MKL-DNN
Memcached mcperf
AOM AV1
High Performance Conjugate Gradient
Stockfish
GraphicsMagick:
  Noise-Gaussian
  Swirl
  Enhanced
  Sharpen
  Resizing
  Rotate
  HWB Color Space
Cpuminer-Opt
Himeno Benchmark
Timed PHP Compilation
NGINX Benchmark
MKL-DNN
GNU MPC
Memcached mcperf
Cpuminer-Opt
lzbench:
  Zstd 1 - Decompression
  Zstd 1 - Compression
m-queens
MKL-DNN
Sockperf
Redis
C-Ray
lzbench:
  XZ 0 - Decompression
  XZ 0 - Compression
CppPerformanceBenchmarks
Cpuminer-Opt:
  lbry
  m7m
AOBench
lzbench:
  Libdeflate 1 - Decompression
  Libdeflate 1 - Compression
Cpuminer-Opt
lzbench
Cpuminer-Opt
John The Ripper
Coremark
CppPerformanceBenchmarks
SciMark
Apache Benchmark
XZ Compression
Sockperf
MKL-DNN
OpenSSL
CppPerformanceBenchmarks
FLAC Audio Encoding
MKL-DNN
Redis
x265
Smallpt
LAME MP3 Encoding
FFmpeg
SVT-VP9
x264
libjpeg-turbo tjbench
Ogg Encoding
SVT-AV1
FFTW
Bullet Physics Engine
FFTW:
  Float + SSE - 2D FFT Size 32
  Stock - 1D FFT Size 32
  Stock - 2D FFT Size 32
SVT-HEVC
TSCP
Bullet Physics Engine:
  Convex Trimesh
  Prim Trimesh
  136 Ragdolls
  1000 Convex
  1000 Stack
  3000 Fall
SciMark:
  Jacobi Successive Over-Relaxation
  Dense LU Matrix Factorization
  Sparse Matrix Multiply
  Fast Fourier Transform
  Monte Carlo
HPC Challenge:
  Max Ping Pong Bandwidth
  Rand Ring Bandwidth
  Rand Ring Latency
  G-Rand Access
  EP-STREAM Triad
  G-Ptrans
  EP-DGEMM
  G-Ffte
  G-Ffte