AMD EPYC 7F72 2P Linux 5.11 Perf Governor

2 x AMD EPYC 7F72 24-Core testing looking at CPU freq invariance on 5.11 with patch. CPU power consumption monitoring via AMD_Energy interface at 1 second polling. Additional data with CPUFreq performance governor included.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101253-HA-AMDEPYC7F96
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Linux 5.11 Git
January 22 2021
  15 Hours
Linux 5.11 Patched
January 23 2021
  15 Hours, 14 Minutes
CPUFreq Performance
January 24 2021
  16 Hours, 38 Minutes
Invert Behavior (Only Show Selected Data)
  15 Hours, 38 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 7F72 2P Linux 5.11 Perf GovernorProcessorMotherboardChipsetMemoryDiskGraphicsNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2 x AMD EPYC 7F72 24-Core @ 3.20GHz (48 Cores / 96 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN1000GB Western Digital WD_BLACK SN850 1TBASPEED2 x Intel 10G X550TUbuntu 20.105.11.0-051100rc4daily20210122-generic (x86_64) 20210121GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.9GCC 10.2.0ext41920x1080VE2285.11.0-rc4-max-boost-inv-patch (x86_64) 20210121OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Linux 5.11 Git: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- Linux 5.11 Patched: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- CPUFreq Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034Java Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Linux 5.11 GitLinux 5.11 PatchedCPUFreq PerformanceResult OverviewPhoronix Test Suite100%107%114%121%128%dav1dCpuminer-Optx265DaCapo BenchmarkTimed GDB GNU Debugger CompilationInfluxDBNebular Empirical Analysis ToolCLOMPFFTWLAMMPS Molecular Dynamics SimulatorQMCPACKZstd CompressiononeDNNQuantum ESPRESSOIORTTSIOD 3D RendererAI Benchmark Alpharav1eOSPrayRodiniaRedisSVT-VP9Himeno BenchmarkTimed Godot Game Engine CompilationFFTEYafaRayKeyDBJohn The RipperChaos Group V-RAYLeelaChessZeroTNNHigh Performance Conjugate GradientLULESHNAMDNAS Parallel BenchmarksASKAPTimed Linux Kernel CompilationStockfishBlogBenchOpenFOAMPOV-RayGPAWTachyonCython BenchmarkBlenderTimed LLVM CompilationBYTE Unix BenchmarkBuild2PrimesieveIntel Open Image DenoiseSVT-AV1SQLite SpeedtestONNX RuntimePlaidMLAlgebraic Multi-Grid BenchmarkTungsten RendererLuxCoreRenderTimed MrBayes AnalysissimdjsonASTC EncoderBRL-CADLZ4 CompressionGcrypt LibraryNumpy BenchmarkDolfynGROMACSEtcpakGoogle SynthMarkRELIONQuantLibGnuPGasmFishSwetTSCPHierarchical INTegrationTensorFlow LiteFinanceBench

Linux 5.11 GitLinux 5.11 PatchedCPUFreq PerformancePer Watt Result OverviewPhoronix Test Suite100%105%110%116%121%ASKAPdav1dCpuminer-OptHigh Performance Conjugate GradientAI Benchmark AlphaKeyDBZstd CompressionFFTWLAMMPS Molecular Dynamics SimulatorCLOMPTTSIOD 3D RendererIORRedisOSPrayFFTESVT-VP9BlogBenchx265asmFishInfluxDBNAS Parallel BenchmarksHimeno BenchmarkLULESHChaos Group V-RAYQuantLibEtcpakJohn The RipperStockfishLeelaChessZeroBYTE Unix BenchmarkNumpy BenchmarkAlgebraic Multi-Grid BenchmarkLZ4 CompressionSwetONNX RuntimeGoogle SynthMarkHierarchical INTegrationBRL-CADTSCPrav1eGROMACSLuxCoreRenderIntel Open Image DenoisePlaidMLSVT-AV1P.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

AMD EPYC 7F72 2P Linux 5.11 Perf Governortscp: AI Chess Performancecompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedclomp: Static OMP Speedupdacapobench: H2dacapobench: Jythondacapobench: Tradebeansdacapobench: Tradesoapcython-bench: N-Queenshint: FLOATbrl-cad: VGR Performance Metricastcenc: Thoroughastcenc: Exhaustiveetcpak: ETC1etcpak: ETC1 + Ditheringetcpak: ETC2synthmark: VoiceMark_100gcrypt: quantlib: relion: Basic - CPUfftw: Float + SSE - 2D FFT Size 4096mrbayes: Primate Phylogeny Analysishimeno: Poisson Pressure Solverplaidml: No - Inference - ResNet 50 - CPUplaidml: No - Inference - VGG16 - CPUplaidml: No - Inference - VGG19 - CPUlczero: BLASlczero: Eigennumpy: dolfyn: Computational Fluid Dynamicsai-benchmark: Device Training Scoreai-benchmark: Device AI Scoreior: 2MB - Default Test Directoryai-benchmark: Device Inference Scoretensorflow-lite: Mobilenet Quantior: 8MB - Default Test Directoryonnx: yolov4 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUtensorflow-lite: Mobilenet Floattensorflow-lite: NASNet Mobiletensorflow-lite: SqueezeNettensorflow-lite: Inception ResNet V2tensorflow-lite: Inception V4tnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1gromacs: Water Benchmarklammps: Rhodopsin Proteinlammps: 20k Atomshpcg: npb: EP.Cnpb: EP.Dnpb: LU.Crodinia: OpenMP CFD Solverrodinia: OpenMP LavaMDrodinia: OpenMP Leukocyterodinia: OpenMP Streamclusterrodinia: OpenMP HotSpot3Dnamd: ATPase Simulation - 327,506 Atomsonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingamg: ffte: N=256, 3D Complex FFT Routineneat: lulesh: openfoam: Motorbike 30Mopenfoam: Motorbike 60Mqmcpack: simple-H2Ogpaw: Carbon Nanotubeqe: AUSURF112build-gdb: Time To Compileprimesieve: 1e12 Prime Number Generationstockfish: Total Timejohn-the-ripper: MD5john-the-ripper: Blowfishbuild-llvm: Time To Compilecompress-zstd: 3asmfish: 1024 Hash Memory, 26 Depthcpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: x25xcpuminer-opt: Garlicoincpuminer-opt: Skeincoincpuminer-opt: LBC, LBRY Creditsbuild-linux-kernel: Time To Compileswet: Averagetungsten: Hairtungsten: Water Caustictungsten: Volumetric Caustictachyon: Total Timesvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080pdav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitsvt-av1: Enc Mode 8 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 0 - 1080px265: Bosphorus 1080px265: Bosphorus 4Kttsiod-renderer: Phong Rendering With Soft-Shadow Mappingv-ray: CPUblender: Barbershop - CPU-Onlypovray: Trace Timerav1e: 10rav1e: 6rav1e: 5rav1e: 1build-godot: Time To Compileoidn: Memorialluxcorerender: DLSCluxcorerender: Rainbow Colors and Prismospray: Magnetic Reconnection - SciVisospray: Magnetic Reconnection - Path Tracerospray: XFrog Forest - SciVisospray: XFrog Forest - Path Tracerospray: NASA Streamlines - SciVisospray: NASA Streamlines - Path Tracerospray: San Miguel - SciVisospray: San Miguel - Path Traceryafaray: Total Time For Sample Scenebuild2: Time To Compilefinancebench: Bonds OpenMPfinancebench: Repo OpenMPblogbench: Readinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000keydb: redis: SETredis: GETredis: LPUSHredis: SADDsqlite-speedtest: Timed Time - Size 1,000simdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyasimdjson: DistinctUserIDbyte: Dhrystone 2gnupg: 2.7GB Sample File EncryptionLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance11151339622.9911334.049.4310404.547.6310488.043.9531048975954517026.950322702844.182046389715.6941.17266.315244.793155.182712.252233.8252149.8349.8521846882.6504233.5115834.6525.1222.0941064284323.9318.71810592756505.19169745083.9531.09175439346659.218977165193.0765726894640303.449274.9445.23921.12924.99330.24153788.883854.60147443.869.25552.79453.86211.20997.9420.454510.9141982.405491.620880.8813480.5476741317.407426.0811870.31437771333174206.1300038727.06719576.12218.71129.6731.17760.7021217.4997.6414.54097945089455033370824210.3478205.21173109852975021524.219937.3136378413247725.9236858888636.6005421.32535.3010517.9375311.85381.08369.01308.29130.6168.2297.6630.09247.6618.63627.20854803158.6411.4852.9021.3701.0450.36860.85828.097.818.7232.2625011.115.9171.4316.3952.634.3084.73968.14157601.36328140124.3736981084405807463.11231991.2302893.561380890.221689203.101220973.711539146.2170.1600.620.370.570.6538181643.677.30211145629757.1711305.048.9510666.047.7610489.847.8521747785591514826.604323144417.171976365215.6540.97267.587245.595155.798714.914232.5422157.2348.2941701582.0424286.6283094.6325.4222.4940614433323.0018.65210672787475.25172041034.0520.72181421039523.513404462195.4736285810750289.764274.8695.26123.78725.07730.82623841.483863.45154376.768.88252.09252.68410.33896.6030.444720.8637822.332901.554470.8492480.5219681123.327453.5111944.21448718333178738.1249709424.63319771.22318.30128.2829.28159.8511171.0392.9164.53597042601461230872636208.7868270.51176329552969951541.769949.8836401713903725.7526874802626.6903621.33295.2623518.0565323.81371.48364.81317.45133.3768.2317.6480.09149.4519.74655.22553460156.8311.3053.0541.4081.0680.37259.17728.397.808.7632.6225011.195.9571.4316.3954.974.3287.14367.32256769.45312539406.7578121103118812193.61256112.1294214.371427348.101711621.521217218.751611164.3470.5400.630.360.570.6538319339.877.17911162559651.6111150.649.2910410.348.0210555.147.4457047404671462126.895322775283.801256357255.6741.07267.136245.453155.783714.181232.7602156.3348.3381733582.2614135.5191544.6625.2222.4341474450324.4818.63511332908517.33177541180.1539.63185419039981.913284461347.2737993818887297.132274.0975.25524.63925.11630.92603857.333867.30153770.578.51052.36051.29010.43296.8850.444690.8675452.304671.579630.8136280.5146041247.487441.9511492.91444174667180037.9535858625.03719334.86218.29128.8128.97159.7571249.0685.2774.48996082114476200071757208.0437770.31173708714202311537.529964.3552294819408725.4146861321096.6950521.46525.3118318.2004309.61363.18346.71363.33181.9568.4657.4880.09162.2920.75665.44855014158.0711.4273.1771.4461.0950.37558.74628.337.898.7632.80333.3311.195.9571.4316.3955.564.3387.33167.33456094.48828139948.3567711086347956189.41360163.0303171.331454741.421782755.971251006.001610484.5069.8830.630.370.570.6538656226.377.022OpenBenchmarking.org

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched200K400K600K800K1000KSE +/- 517.61, N = 12SE +/- 1015.01, N = 12SE +/- 609.97, N = 121116255111513311145621. (CC) gcc options: -O3 -march=native

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git2K4K6K8K10KSE +/- 19.43, N = 3SE +/- 71.91, N = 3SE +/- 96.44, N = 39757.179651.619622.991. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 46.17, N = 3SE +/- 25.21, N = 3SE +/- 110.98, N = 311334.011305.011150.61. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedLinux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched1122334455SE +/- 0.56, N = 4SE +/- 0.52, N = 5SE +/- 0.12, N = 349.4349.2948.951. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git2K4K6K8K10KSE +/- 60.61, N = 3SE +/- 61.51, N = 5SE +/- 38.32, N = 410666.010410.310404.51. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git1122334455SE +/- 0.24, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 348.0247.7647.631. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git2K4K6K8K10KSE +/- 96.09, N = 3SE +/- 31.24, N = 3SE +/- 172.84, N = 310555.110489.810488.01. (CC) gcc options: -O3

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git1122334455SE +/- 0.47, N = 3SE +/- 0.55, N = 3SE +/- 0.60, N = 347.847.443.91. (CC) gcc options: -fopenmp -O3 -lm

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2CPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git11002200330044005500SE +/- 70.06, N = 20SE +/- 73.65, N = 20SE +/- 36.45, N = 20457052175310

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git10002000300040005000SE +/- 20.83, N = 6SE +/- 43.93, N = 6SE +/- 28.66, N = 18474047784897

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradebeansCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git13002600390052006500SE +/- 52.34, N = 20SE +/- 66.39, N = 20SE +/- 50.83, N = 20467155915954

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradesoapCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git11002200330044005500SE +/- 42.72, N = 5SE +/- 61.21, N = 4SE +/- 44.82, N = 4462151485170

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-QueensLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git612182430SE +/- 0.22, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 326.6026.9026.95

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git70M140M210M280M350MSE +/- 122621.60, N = 3SE +/- 201175.83, N = 3SE +/- 121683.80, N = 3323144417.17322775283.80322702844.181. (CC) gcc options: -O3 -march=native -lm

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance140K280K420K560K700K6389716365216357251. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ThoroughLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git1.28032.56063.84095.12126.4015SE +/- 0.00, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 55.655.675.691. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ExhaustiveLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git918273645SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 340.9741.0741.171. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git60120180240300SE +/- 0.23, N = 3SE +/- 0.25, N = 3SE +/- 0.24, N = 3267.59267.14266.321. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git50100150200250SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 3245.60245.45244.791. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git306090120150SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3155.80155.78155.181. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git150300450600750SE +/- 0.04, N = 3SE +/- 1.18, N = 3SE +/- 1.12, N = 3714.91714.18712.251. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git50100150200250SE +/- 0.81, N = 3SE +/- 0.62, N = 3SE +/- 0.85, N = 3232.54232.76233.831. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git5001000150020002500SE +/- 8.27, N = 3SE +/- 16.02, N = 3SE +/- 13.13, N = 32157.22156.32149.81. (CXX) g++ options: -O3 -march=native -rdynamic

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPULinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git80160240320400SE +/- 2.97, N = 9SE +/- 3.10, N = 9SE +/- 2.94, N = 9348.29348.34349.851. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Linux 5.11 GitCPUFreq PerformanceLinux 5.11 Patched4K8K12K16K20KSE +/- 24.98, N = 3SE +/- 199.64, N = 9SE +/- 213.45, N = 31846817335170151. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git20406080100SE +/- 0.29, N = 3SE +/- 0.29, N = 3SE +/- 0.08, N = 382.0482.2682.651. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverLinux 5.11 PatchedLinux 5.11 GitCPUFreq Performance9001800270036004500SE +/- 25.26, N = 3SE +/- 35.16, N = 8SE +/- 27.09, N = 34286.634233.514135.521. (CC) gcc options: -O3 -mavx2

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched1.04852.0973.14554.1945.2425SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 34.664.654.63

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPULinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git612182430SE +/- 0.30, N = 15SE +/- 0.24, N = 15SE +/- 0.22, N = 1525.4225.2225.12

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPULinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git510152025SE +/- 0.16, N = 15SE +/- 0.20, N = 15SE +/- 0.20, N = 1522.4922.4322.09

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLASCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched9001800270036004500SE +/- 17.79, N = 3SE +/- 50.84, N = 3SE +/- 49.90, N = 94147410640611. (CXX) g++ options: -flto -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: EigenCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git10002000300040005000SE +/- 26.71, N = 3SE +/- 36.23, N = 3SE +/- 49.20, N = 44450443342841. (CXX) g++ options: -flto -pthread

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched70140210280350SE +/- 0.25, N = 3SE +/- 1.74, N = 3SE +/- 0.23, N = 3324.48323.93323.00

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git510152025SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 318.6418.6518.72

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git2004006008001000113310671059

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git6001200180024003000290827872756

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectoryCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched110220330440550SE +/- 5.48, N = 3SE +/- 1.77, N = 3SE +/- 2.06, N = 3517.33505.19475.25MIN: 463.44 / MAX: 1007.52MIN: 457.62 / MAX: 951.11MIN: 400.96 / MAX: 971.551. (CC) gcc options: -O2 -lm -pthread -lmpi

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git400800120016002000177517201697

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git10K20K30K40K50KSE +/- 400.94, N = 6SE +/- 211.03, N = 3SE +/- 759.19, N = 1541034.041180.145083.9

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectoryCPUFreq PerformanceLinux 5.11 GitLinux 5.11 Patched120240360480600SE +/- 5.78, N = 3SE +/- 2.21, N = 3SE +/- 2.63, N = 3539.63531.09520.72MIN: 280.44 / MAX: 1002.78MIN: 489.6 / MAX: 1034.89MIN: 176.53 / MAX: 1089.461. (CC) gcc options: -O2 -lm -pthread -lmpi

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git4080120160200SE +/- 2.62, N = 3SE +/- 1.86, N = 3SE +/- 1.60, N = 121851811751. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance9001800270036004500SE +/- 78.33, N = 9SE +/- 44.10, N = 3SE +/- 68.07, N = 124393421041901. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git10K20K30K40K50KSE +/- 395.37, N = 3SE +/- 473.02, N = 3SE +/- 1144.94, N = 1539523.539981.946659.2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git40K80K120K160K200KSE +/- 2146.20, N = 15SE +/- 2393.85, N = 15SE +/- 7366.43, N = 15132844134044189771

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git14K28K42K56K70KSE +/- 715.70, N = 4SE +/- 412.91, N = 15SE +/- 690.93, N = 361347.262195.465193.0

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git160K320K480K640K800KSE +/- 5824.36, N = 9SE +/- 2132.19, N = 3SE +/- 4257.59, N = 3736285737993765726

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git200K400K600K800K1000KSE +/- 1163.43, N = 3SE +/- 4685.69, N = 3SE +/- 2435.29, N = 3810750818887894640

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Linux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git70140210280350SE +/- 2.83, N = 3SE +/- 0.07, N = 3SE +/- 3.80, N = 3289.76297.13303.45MIN: 283.65 / MAX: 458.79MIN: 295.49 / MAX: 320.4MIN: 284.51 / MAX: 461.211. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1CPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git60120180240300SE +/- 0.08, N = 3SE +/- 0.68, N = 3SE +/- 0.39, N = 3274.10274.87274.94MIN: 273.16 / MAX: 274.86MIN: 273.07 / MAX: 276.74MIN: 273.57 / MAX: 276.211. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git1.18372.36743.55114.73485.9185SE +/- 0.022, N = 3SE +/- 0.021, N = 3SE +/- 0.039, N = 35.2615.2555.2391. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git612182430SE +/- 0.19, N = 15SE +/- 0.17, N = 12SE +/- 0.23, N = 1524.6423.7921.131. (CXX) g++ options: -O3 -pthread -lm

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git612182430SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 325.1225.0824.991. (CXX) g++ options: -O3 -pthread -lm

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1CPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git714212835SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 330.9330.8330.241. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git8001600240032004000SE +/- 4.01, N = 10SE +/- 5.01, N = 10SE +/- 10.26, N = 103857.333841.483788.881. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git8001600240032004000SE +/- 2.74, N = 3SE +/- 2.97, N = 3SE +/- 8.23, N = 33867.303863.453854.601. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git30K60K90K120K150KSE +/- 509.59, N = 4SE +/- 121.33, N = 4SE +/- 1780.52, N = 15154376.76153770.57147443.861. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git3691215SE +/- 0.053, N = 5SE +/- 0.141, N = 15SE +/- 0.148, N = 158.5108.8829.2551. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git1224364860SE +/- 0.13, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 352.0952.3652.791. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git1224364860SE +/- 0.19, N = 3SE +/- 0.69, N = 3SE +/- 0.35, N = 351.2952.6853.861. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git3691215SE +/- 0.03, N = 5SE +/- 0.04, N = 5SE +/- 0.22, N = 1510.3410.4311.211. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DLinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git20406080100SE +/- 0.59, N = 3SE +/- 0.55, N = 3SE +/- 0.20, N = 396.6096.8997.941. (CXX) g++ options: -O2 -lOpenCL

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git0.10230.20460.30690.40920.5115SE +/- 0.00075, N = 3SE +/- 0.00005, N = 3SE +/- 0.00311, N = 30.444690.444720.45451

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPULinux 5.11 PatchedCPUFreq PerformanceLinux 5.11 Git0.20570.41140.61710.82281.0285SE +/- 0.001510, N = 7SE +/- 0.001697, N = 7SE +/- 0.006064, N = 70.8637820.8675450.914198MIN: 0.79MIN: 0.78MIN: 0.781. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUCPUFreq PerformanceLinux 5.11 PatchedLinux 5.11 Git