AMD EPYC 7F72 2P Linux 5.11 Perf Governor

2 x AMD EPYC 7F72 24-Core testing looking at CPU freq invariance on 5.11 with patch. CPU power consumption monitoring via AMD_Energy interface at 1 second polling. Additional data with CPUFreq performance governor included.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101253-HA-AMDEPYC7F96
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Linux 5.11 Git
January 22 2021
  15 Hours
Linux 5.11 Patched
January 23 2021
  15 Hours, 14 Minutes
CPUFreq Performance
January 24 2021
  16 Hours, 38 Minutes
Invert Behavior (Only Show Selected Data)
  15 Hours, 38 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 7F72 2P Linux 5.11 Perf GovernorProcessorMotherboardChipsetMemoryDiskGraphicsNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2 x AMD EPYC 7F72 24-Core @ 3.20GHz (48 Cores / 96 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN1000GB Western Digital WD_BLACK SN850 1TBASPEED2 x Intel 10G X550TUbuntu 20.105.11.0-051100rc4daily20210122-generic (x86_64) 20210121GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.9GCC 10.2.0ext41920x1080VE2285.11.0-rc4-max-boost-inv-patch (x86_64) 20210121OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Linux 5.11 Git: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- Linux 5.11 Patched: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- CPUFreq Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034Java Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Linux 5.11 GitLinux 5.11 PatchedCPUFreq PerformanceResult OverviewPhoronix Test Suite100%107%114%121%128%dav1dCpuminer-Optx265DaCapo BenchmarkTimed GDB GNU Debugger CompilationInfluxDBNebular Empirical Analysis ToolCLOMPFFTWLAMMPS Molecular Dynamics SimulatorQMCPACKZstd CompressiononeDNNQuantum ESPRESSOIORTTSIOD 3D RendererAI Benchmark Alpharav1eOSPrayRodiniaRedisSVT-VP9Himeno BenchmarkTimed Godot Game Engine CompilationFFTEYafaRayKeyDBJohn The RipperChaos Group V-RAYLeelaChessZeroTNNHigh Performance Conjugate GradientLULESHNAMDNAS Parallel BenchmarksASKAPTimed Linux Kernel CompilationStockfishBlogBenchOpenFOAMPOV-RayGPAWTachyonCython BenchmarkBlenderTimed LLVM CompilationBYTE Unix BenchmarkBuild2PrimesieveIntel Open Image DenoiseSVT-AV1SQLite SpeedtestONNX RuntimePlaidMLAlgebraic Multi-Grid BenchmarkTungsten RendererLuxCoreRenderTimed MrBayes AnalysissimdjsonASTC EncoderBRL-CADLZ4 CompressionGcrypt LibraryNumpy BenchmarkDolfynGROMACSEtcpakGoogle SynthMarkRELIONQuantLibGnuPGasmFishSwetTSCPHierarchical INTegrationTensorFlow LiteFinanceBench

Linux 5.11 GitLinux 5.11 PatchedCPUFreq PerformancePer Watt Result OverviewPhoronix Test Suite100%105%110%116%121%ASKAPdav1dCpuminer-OptHigh Performance Conjugate GradientAI Benchmark AlphaKeyDBZstd CompressionFFTWLAMMPS Molecular Dynamics SimulatorCLOMPTTSIOD 3D RendererIORRedisOSPrayFFTESVT-VP9BlogBenchx265asmFishInfluxDBNAS Parallel BenchmarksHimeno BenchmarkLULESHChaos Group V-RAYQuantLibEtcpakJohn The RipperStockfishLeelaChessZeroBYTE Unix BenchmarkNumpy BenchmarkAlgebraic Multi-Grid BenchmarkLZ4 CompressionSwetONNX RuntimeGoogle SynthMarkHierarchical INTegrationBRL-CADTSCPrav1eGROMACSLuxCoreRenderIntel Open Image DenoisePlaidMLSVT-AV1P.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

AMD EPYC 7F72 2P Linux 5.11 Perf Governortscp: AI Chess Performancecompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedclomp: Static OMP Speedupdacapobench: H2dacapobench: Jythondacapobench: Tradebeansdacapobench: Tradesoapcython-bench: N-Queenshint: FLOATbrl-cad: VGR Performance Metricastcenc: Thoroughastcenc: Exhaustiveetcpak: ETC1etcpak: ETC1 + Ditheringetcpak: ETC2synthmark: VoiceMark_100gcrypt: quantlib: relion: Basic - CPUfftw: Float + SSE - 2D FFT Size 4096mrbayes: Primate Phylogeny Analysishimeno: Poisson Pressure Solverplaidml: No - Inference - ResNet 50 - CPUplaidml: No - Inference - VGG16 - CPUplaidml: No - Inference - VGG19 - CPUlczero: BLASlczero: Eigennumpy: dolfyn: Computational Fluid Dynamicsai-benchmark: Device Training Scoreai-benchmark: Device AI Scoreior: 2MB - Default Test Directoryai-benchmark: Device Inference Scoretensorflow-lite: Mobilenet Quantior: 8MB - Default Test Directoryonnx: yolov4 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUtensorflow-lite: Mobilenet Floattensorflow-lite: NASNet Mobiletensorflow-lite: SqueezeNettensorflow-lite: Inception ResNet V2tensorflow-lite: Inception V4tnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1gromacs: Water Benchmarklammps: Rhodopsin Proteinlammps: 20k Atomshpcg: npb: EP.Cnpb: EP.Dnpb: LU.Crodinia: OpenMP CFD Solverrodinia: OpenMP LavaMDrodinia: OpenMP Leukocyterodinia: OpenMP Streamclusterrodinia: OpenMP HotSpot3Dnamd: ATPase Simulation - 327,506 Atomsonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingamg: ffte: N=256, 3D Complex FFT Routineneat: lulesh: openfoam: Motorbike 30Mopenfoam: Motorbike 60Mqmcpack: simple-H2Ogpaw: Carbon Nanotubeqe: AUSURF112build-gdb: Time To Compileprimesieve: 1e12 Prime Number Generationstockfish: Total Timejohn-the-ripper: MD5john-the-ripper: Blowfishbuild-llvm: Time To Compilecompress-zstd: 3asmfish: 1024 Hash Memory, 26 Depthcpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: x25xcpuminer-opt: Garlicoincpuminer-opt: Skeincoincpuminer-opt: LBC, LBRY Creditsbuild-linux-kernel: Time To Compileswet: Averagetungsten: Hairtungsten: Water Caustictungsten: Volumetric Caustictachyon: Total Timesvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080pdav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitsvt-av1: Enc Mode 8 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 0 - 1080px265: Bosphorus 1080px265: Bosphorus 4Kttsiod-renderer: Phong Rendering With Soft-Shadow Mappingv-ray: CPUblender: Barbershop - CPU-Onlypovray: Trace Timerav1e: 10rav1e: 6rav1e: 5rav1e: 1build-godot: Time To Compileoidn: Memorialluxcorerender: DLSCluxcorerender: Rainbow Colors and Prismospray: Magnetic Reconnection - SciVisospray: Magnetic Reconnection - Path Tracerospray: XFrog Forest - SciVisospray: XFrog Forest - Path Tracerospray: NASA Streamlines - SciVisospray: NASA Streamlines - Path Tracerospray: San Miguel - SciVisospray: San Miguel - Path Traceryafaray: Total Time For Sample Scenebuild2: Time To Compilefinancebench: Bonds OpenMPfinancebench: Repo OpenMPblogbench: Readinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000keydb: redis: SETredis: GETredis: LPUSHredis: SADDsqlite-speedtest: Timed Time - Size 1,000simdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyasimdjson: DistinctUserIDbyte: Dhrystone 2gnupg: 2.7GB Sample File EncryptionLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance11151339622.9911334.049.4310404.547.6310488.043.9531048975954517026.950322702844.182046389715.6941.17266.315244.793155.182712.252233.8252149.8349.8521846882.6504233.5115834.6525.1222.0941064284323.9318.71810592756505.19169745083.9531.09175439346659.218977165193.0765726894640303.449274.9445.23921.12924.99330.24153788.883854.60147443.869.25552.79453.86211.20997.9420.454510.9141982.405491.620880.8813480.5476741317.407426.0811870.31437771333174206.1300038727.06719576.12218.71129.6731.17760.7021217.4997.6414.54097945089455033370824210.3478205.21173109852975021524.219937.3136378413247725.9236858888636.6005421.32535.3010517.9375311.85381.08369.01308.29130.6168.2297.6630.09247.6618.63627.20854803158.6411.4852.9021.3701.0450.36860.85828.097.818.7232.2625011.115.9171.4316.3952.634.3084.73968.14157601.36328140124.3736981084405807463.11231991.2302893.561380890.221689203.101220973.711539146.2170.1600.620.370.570.6538181643.677.30211145629757.1711305.048.9510666.047.7610489.847.8521747785591514826.604323144417.171976365215.6540.97267.587245.595155.798714.914232.5422157.2348.2941701582.0424286.6283094.6325.4222.4940614433323.0018.65210672787475.25172041034.0520.72181421039523.513404462195.4736285810750289.764274.8695.26123.78725.07730.82623841.483863.45154376.768.88252.09252.68410.33896.6030.444720.8637822.332901.554470.8492480.5219681123.327453.5111944.21448718333178738.1249709424.63319771.22318.30128.2829.28159.8511171.0392.9164.53597042601461230872636208.7868270.51176329552969951541.769949.8836401713903725.7526874802626.6903621.33295.2623518.0565323.81371.48364.81317.45133.3768.2317.6480.09149.4519.74655.22553460156.8311.3053.0541.4081.0680.37259.17728.397.808.7632.6225011.195.9571.4316.3954.974.3287.14367.32256769.45312539406.7578121103118812193.61256112.1294214.371427348.101711621.521217218.751611164.3470.5400.630.360.570.6538319339.877.17911162559651.6111150.649.2910410.348.0210555.147.4457047404671462126.895322775283.801256357255.6741.07267.136245.453155.783714.181232.7602156.3348.3381733582.2614135.5191544.6625.2222.4341474450324.4818.63511332908517.33177541180.1539.63185419039981.913284461347.2737993818887297.132274.0975.25524.63925.11630.92603857.333867.30153770.578.51052.36051.29010.43296.8850.444690.8675452.304671.579630.8136280.5146041247.487441.9511492.91444174667180037.9535858625.03719334.86218.29128.8128.97159.7571249.0685.2774.48996082114476200071757208.0437770.31173708714202311537.529964.3552294819408725.4146861321096.6950521.46525.3118318.2004309.61363.18346.71363.33181.9568.4657.4880.09162.2920.75665.44855014158.0711.4273.1771.4461.0950.37558.74628.337.898.7632.80333.3311.195.9571.4316.3955.564.3387.33167.33456094.48828139948.3567711086347956189.41360163.0303171.331454741.421782755.971251006.001610484.5069.8830.630.370.570.6538656226.377.022OpenBenchmarking.org

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceLinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance200K400K600K800K1000KSE +/- 609.97, N = 12SE +/- 1015.01, N = 12SE +/- 517.61, N = 121114562111513311162551. (CC) gcc options: -O3 -march=native

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched2K4K6K8K10KSE +/- 96.44, N = 3SE +/- 71.91, N = 3SE +/- 19.43, N = 39622.999651.619757.171. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git2K4K6K8K10KSE +/- 110.98, N = 3SE +/- 25.21, N = 3SE +/- 46.17, N = 311150.611305.011334.01. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git1122334455SE +/- 0.12, N = 3SE +/- 0.52, N = 5SE +/- 0.56, N = 448.9549.2949.431. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched2K4K6K8K10KSE +/- 38.32, N = 4SE +/- 61.51, N = 5SE +/- 60.61, N = 310404.510410.310666.01. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1122334455SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 347.6347.7648.021. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 172.84, N = 3SE +/- 31.24, N = 3SE +/- 96.09, N = 310488.010489.810555.11. (CC) gcc options: -O3

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched1122334455SE +/- 0.60, N = 3SE +/- 0.55, N = 3SE +/- 0.47, N = 343.947.447.81. (CC) gcc options: -fopenmp -O3 -lm

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance11002200330044005500SE +/- 36.45, N = 20SE +/- 73.65, N = 20SE +/- 70.06, N = 20531052174570

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance10002000300040005000SE +/- 28.66, N = 18SE +/- 43.93, N = 6SE +/- 20.83, N = 6489747784740

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradebeansLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance13002600390052006500SE +/- 50.83, N = 20SE +/- 66.39, N = 20SE +/- 52.34, N = 20595455914671

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradesoapLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance11002200330044005500SE +/- 44.82, N = 4SE +/- 61.21, N = 4SE +/- 42.72, N = 5517051484621

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-QueensLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched612182430SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.22, N = 326.9526.9026.60

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched70M140M210M280M350MSE +/- 121683.80, N = 3SE +/- 201175.83, N = 3SE +/- 122621.60, N = 3322702844.18322775283.80323144417.171. (CC) gcc options: -O3 -march=native -lm

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git140K280K420K560K700K6357256365216389711. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ThoroughLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched1.28032.56063.84095.12126.4015SE +/- 0.00, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 55.695.675.651. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ExhaustiveLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched918273645SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 341.1741.0740.971. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched60120180240300SE +/- 0.24, N = 3SE +/- 0.25, N = 3SE +/- 0.23, N = 3266.32267.14267.591. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched50100150200250SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3244.79245.45245.601. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched306090120150SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3155.18155.78155.801. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched150300450600750SE +/- 1.12, N = 3SE +/- 1.18, N = 3SE +/- 0.04, N = 3712.25714.18714.911. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched50100150200250SE +/- 0.85, N = 3SE +/- 0.62, N = 3SE +/- 0.81, N = 3233.83232.76232.541. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched5001000150020002500SE +/- 13.13, N = 3SE +/- 16.02, N = 3SE +/- 8.27, N = 32149.82156.32157.21. (CXX) g++ options: -O3 -march=native -rdynamic

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPULinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched80160240320400SE +/- 2.94, N = 9SE +/- 3.10, N = 9SE +/- 2.97, N = 9349.85348.34348.291. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git4K8K12K16K20KSE +/- 213.45, N = 3SE +/- 199.64, N = 9SE +/- 24.98, N = 31701517335184681. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched20406080100SE +/- 0.08, N = 3SE +/- 0.29, N = 3SE +/- 0.29, N = 382.6582.2682.041. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched9001800270036004500SE +/- 27.09, N = 3SE +/- 35.16, N = 8SE +/- 25.26, N = 34135.524233.514286.631. (CC) gcc options: -O3 -mavx2

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPULinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance1.04852.0973.14554.1945.2425SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 34.634.654.66

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPULinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched612182430SE +/- 0.22, N = 15SE +/- 0.24, N = 15SE +/- 0.30, N = 1525.1225.2225.42

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPULinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched510152025SE +/- 0.20, N = 15SE +/- 0.20, N = 15SE +/- 0.16, N = 1522.0922.4322.49

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLASLinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance9001800270036004500SE +/- 49.90, N = 9SE +/- 50.84, N = 3SE +/- 17.79, N = 34061410641471. (CXX) g++ options: -flto -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: EigenLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance10002000300040005000SE +/- 49.20, N = 4SE +/- 36.23, N = 3SE +/- 26.71, N = 34284443344501. (CXX) g++ options: -flto -pthread

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkLinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance70140210280350SE +/- 0.23, N = 3SE +/- 1.74, N = 3SE +/- 0.25, N = 3323.00323.93324.48

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 318.7218.6518.64

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2004006008001000105910671133

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance6001200180024003000275627872908

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectoryLinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance110220330440550SE +/- 2.06, N = 3SE +/- 1.77, N = 3SE +/- 5.48, N = 3475.25505.19517.33MIN: 400.96 / MAX: 971.55MIN: 457.62 / MAX: 951.11MIN: 463.44 / MAX: 1007.521. (CC) gcc options: -O2 -lm -pthread -lmpi

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance400800120016002000169717201775

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched10K20K30K40K50KSE +/- 759.19, N = 15SE +/- 211.03, N = 3SE +/- 400.94, N = 645083.941180.141034.0

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectoryLinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance120240360480600SE +/- 2.63, N = 3SE +/- 2.21, N = 3SE +/- 5.78, N = 3520.72531.09539.63MIN: 176.53 / MAX: 1089.46MIN: 489.6 / MAX: 1034.89MIN: 280.44 / MAX: 1002.781. (CC) gcc options: -O2 -lm -pthread -lmpi

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4080120160200SE +/- 1.60, N = 12SE +/- 1.86, N = 3SE +/- 2.62, N = 31751811851. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git9001800270036004500SE +/- 68.07, N = 12SE +/- 44.10, N = 3SE +/- 78.33, N = 94190421043931. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched10K20K30K40K50KSE +/- 1144.94, N = 15SE +/- 473.02, N = 3SE +/- 395.37, N = 346659.239981.939523.5

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance40K80K120K160K200KSE +/- 7366.43, N = 15SE +/- 2393.85, N = 15SE +/- 2146.20, N = 15189771134044132844

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance14K28K42K56K70KSE +/- 690.93, N = 3SE +/- 412.91, N = 15SE +/- 715.70, N = 465193.062195.461347.2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched160K320K480K640K800KSE +/- 4257.59, N = 3SE +/- 2132.19, N = 3SE +/- 5824.36, N = 9765726737993736285

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched200K400K600K800K1000KSE +/- 2435.29, N = 3SE +/- 4685.69, N = 3SE +/- 1163.43, N = 3894640818887810750

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched70140210280350SE +/- 3.80, N = 3SE +/- 0.07, N = 3SE +/- 2.83, N = 3303.45297.13289.76MIN: 284.51 / MAX: 461.21MIN: 295.49 / MAX: 320.4MIN: 283.65 / MAX: 458.791. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance60120180240300SE +/- 0.39, N = 3SE +/- 0.68, N = 3SE +/- 0.08, N = 3274.94274.87274.10MIN: 273.57 / MAX: 276.21MIN: 273.07 / MAX: 276.74MIN: 273.16 / MAX: 274.861. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched1.18372.36743.55114.73485.9185SE +/- 0.039, N = 3SE +/- 0.021, N = 3SE +/- 0.022, N = 35.2395.2555.2611. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.23, N = 15SE +/- 0.17, N = 12SE +/- 0.19, N = 1521.1323.7924.641. (CXX) g++ options: -O3 -pthread -lm

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 324.9925.0825.121. (CXX) g++ options: -O3 -pthread -lm

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance714212835SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 330.2430.8330.931. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance8001600240032004000SE +/- 10.26, N = 10SE +/- 5.01, N = 10SE +/- 4.01, N = 103788.883841.483857.331. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance8001600240032004000SE +/- 8.23, N = 3SE +/- 2.97, N = 3SE +/- 2.74, N = 33854.603863.453867.301. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C