AMD EPYC 7F72 2P Linux 5.11 Perf Governor

2 x AMD EPYC 7F72 24-Core testing looking at CPU freq invariance on 5.11 with patch. CPU power consumption monitoring via AMD_Energy interface at 1 second polling. Additional data with CPUFreq performance governor included.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101253-HA-AMDEPYC7F96
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Linux 5.11 Git
January 22 2021
  15 Hours
Linux 5.11 Patched
January 23 2021
  15 Hours, 14 Minutes
CPUFreq Performance
January 24 2021
  16 Hours, 38 Minutes
Invert Behavior (Only Show Selected Data)
  15 Hours, 38 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 7F72 2P Linux 5.11 Perf GovernorProcessorMotherboardChipsetMemoryDiskGraphicsNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2 x AMD EPYC 7F72 24-Core @ 3.20GHz (48 Cores / 96 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN1000GB Western Digital WD_BLACK SN850 1TBASPEED2 x Intel 10G X550TUbuntu 20.105.11.0-051100rc4daily20210122-generic (x86_64) 20210121GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.9GCC 10.2.0ext41920x1080VE2285.11.0-rc4-max-boost-inv-patch (x86_64) 20210121OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Linux 5.11 Git: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- Linux 5.11 Patched: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- CPUFreq Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034Java Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Linux 5.11 GitLinux 5.11 PatchedCPUFreq PerformanceResult OverviewPhoronix Test Suite100%107%114%121%128%dav1dCpuminer-Optx265DaCapo BenchmarkTimed GDB GNU Debugger CompilationInfluxDBNebular Empirical Analysis ToolCLOMPFFTWLAMMPS Molecular Dynamics SimulatorQMCPACKZstd CompressiononeDNNQuantum ESPRESSOIORTTSIOD 3D RendererAI Benchmark Alpharav1eOSPrayRodiniaRedisSVT-VP9Himeno BenchmarkTimed Godot Game Engine CompilationFFTEYafaRayKeyDBJohn The RipperChaos Group V-RAYLeelaChessZeroTNNHigh Performance Conjugate GradientLULESHNAMDNAS Parallel BenchmarksASKAPTimed Linux Kernel CompilationStockfishBlogBenchOpenFOAMPOV-RayGPAWTachyonCython BenchmarkBlenderTimed LLVM CompilationBYTE Unix BenchmarkBuild2PrimesieveIntel Open Image DenoiseSVT-AV1SQLite SpeedtestONNX RuntimePlaidMLAlgebraic Multi-Grid BenchmarkTungsten RendererLuxCoreRenderTimed MrBayes AnalysissimdjsonASTC EncoderBRL-CADLZ4 CompressionGcrypt LibraryNumpy BenchmarkDolfynGROMACSEtcpakGoogle SynthMarkRELIONQuantLibGnuPGasmFishSwetTSCPHierarchical INTegrationTensorFlow LiteFinanceBench

Linux 5.11 GitLinux 5.11 PatchedCPUFreq PerformancePer Watt Result OverviewPhoronix Test Suite100%105%110%116%121%ASKAPdav1dCpuminer-OptHigh Performance Conjugate GradientAI Benchmark AlphaKeyDBZstd CompressionFFTWLAMMPS Molecular Dynamics SimulatorCLOMPTTSIOD 3D RendererIORRedisOSPrayFFTESVT-VP9BlogBenchx265asmFishInfluxDBNAS Parallel BenchmarksHimeno BenchmarkLULESHChaos Group V-RAYQuantLibEtcpakJohn The RipperStockfishLeelaChessZeroBYTE Unix BenchmarkNumpy BenchmarkAlgebraic Multi-Grid BenchmarkLZ4 CompressionSwetONNX RuntimeGoogle SynthMarkHierarchical INTegrationBRL-CADTSCPSVT-AV1PlaidMLIntel Open Image DenoiseLuxCoreRenderGROMACSrav1eP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

AMD EPYC 7F72 2P Linux 5.11 Perf Governorcpuminer-opt: LBC, LBRY Creditsdav1d: Chimera 1080p 10-bitospray: Magnetic Reconnection - Path Tracerx265: Bosphorus 1080pdacapobench: Tradebeansinfluxdb: 4 - 10000 - 2,5000,1 - 10000dav1d: Summer Nature 4Klammps: Rhodopsin Proteinbuild-gdb: Time To Compiledacapobench: Tradesoapx265: Bosphorus 4Kinfluxdb: 64 - 10000 - 2,5000,1 - 10000tensorflow-lite: Inception V4rav1e: 10clomp: Static OMP Speedupior: 2MB - Default Test Directoryfftw: Float + SSE - 2D FFT Size 4096onednn: IP Shapes 3D - f32 - CPUai-benchmark: Device Training Scoreqe: AUSURF112svt-vp9: VMAF Optimized - Bosphorus 1080ponednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUtensorflow-lite: SqueezeNetttsiod-renderer: Phong Rendering With Soft-Shadow Mappingonednn: Convolution Batch Shapes Auto - f32 - CPUonnx: yolov4 - OpenMP CPUospray: San Miguel - SciVisrav1e: 6ai-benchmark: Device AI Scoreredis: SETrodinia: OpenMP Leukocytesvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ponnx: super-resolution-10 - OpenMP CPUrav1e: 5tnn: CPU - MobileNet v2npb: LU.Credis: SADDjohn-the-ripper: MD5ai-benchmark: Device Inference Scoreonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: IP Shapes 1D - f32 - CPUtensorflow-lite: Inception ResNet V2askap: tConvolve MPI - Degriddinglczero: Eigenhimeno: Poisson Pressure Solverior: 8MB - Default Test Directorybuild-godot: Time To Compileffte: N=256, 3D Complex FFT Routinedacapobench: Jythonyafaray: Total Time For Sample Scenesimdjson: LargeRandredis: LPUSHfinancebench: Bonds OpenMPjohn-the-ripper: Blowfishcompress-lz4: 3 - Decompression Speedsvt-av1: Enc Mode 4 - 1080popenfoam: Motorbike 30Mhpcg: lulesh: namd: ATPase Simulation - 327,506 Atomslczero: BLASbuild-linux-kernel: Time To Compilestockfish: Total Timerav1e: 1financebench: Repo OpenMPplaidml: No - Inference - VGG19 - CPUnpb: EP.Cblogbench: Readospray: Magnetic Reconnection - SciViscompress-lz4: 1 - Decompression Speedsimdjson: PartialTweetspovray: Trace Timegpaw: Carbon Nanotubetachyon: Total Timetungsten: Haircompress-lz4: 1 - Compression Speedrodinia: OpenMP HotSpot3Drodinia: OpenMP LavaMDcython-bench: N-Queensbyte: Dhrystone 2build2: Time To Compileplaidml: No - Inference - VGG16 - CPUblender: Barbershop - CPU-Onlyluxcorerender: DLSCcpuminer-opt: x25xprimesieve: 1e12 Prime Number Generationbuild-llvm: Time To Compilesvt-av1: Enc Mode 0 - 1080popenfoam: Motorbike 60Moidn: Memorialcompress-lz4: 3 - Compression Speedtungsten: Volumetric Causticsqlite-speedtest: Timed Time - Size 1,000compress-lz4: 9 - Compression Speedamg: mrbayes: Primate Phylogeny Analysisospray: XFrog Forest - SciVisastcenc: Thoroughospray: San Miguel - Path Tracerospray: XFrog Forest - Path Tracertungsten: Water Causticplaidml: No - Inference - ResNet 50 - CPUcompress-lz4: 9 - Decompression Speedgcrypt: brl-cad: VGR Performance Metriclammps: 20k Atomsastcenc: Exhaustiveetcpak: ETC1luxcorerender: Rainbow Colors and Prismnumpy: relion: Basic - CPUdolfyn: Computational Fluid Dynamicsgromacs: Water Benchmarketcpak: ETC2synthmark: VoiceMark_100askap: tConvolve MPI - Griddinggnupg: 2.7GB Sample File Encryptionquantlib: npb: EP.Detcpak: ETC1 + Ditheringtnn: CPU - SqueezeNet v1.1asmfish: 1024 Hash Memory, 26 Depthcpuminer-opt: Garlicoinswet: Averagetscp: AI Chess Performancehint: FLOATsimdjson: DistinctUserIDsimdjson: Kostyaospray: NASA Streamlines - Path Tracerospray: NASA Streamlines - SciVisv-ray: CPUtensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Quanttensorflow-lite: Mobilenet Floatsvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-av1: Enc Mode 8 - 1080prodinia: OpenMP Streamclusterrodinia: OpenMP CFD Solverneat: onednn: Recurrent Neural Network Training - f32 - CPUkeydb: dacapobench: H2compress-zstd: 3redis: GETqmcpack: simple-H2Ocpuminer-opt: Skeincoincpuminer-opt: Quad SHA-256, PyriteLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance132477130.6125047.665954807463.1308.2921.12997.641517018.631231991.28946402.90243.9505.19184680.88134810591217.49369.010.54767465193.0627.2080.91419817552.631.37027561380890.2253.862381.0843931.045303.449147443.861539146.21455033316972.405491.6208876572611870.342844233.511583531.0960.858174206.13000387489784.7390.371220973.7157601.3632817082410404.57.66318.7130.241519576.1220.45451410625.923979450890.36840124.37369822.093788.88108440532.2611334.00.6211.48560.70217.93756.600549622.9997.94252.79426.95038181643.668.14125.12158.647.811524.214.540210.3470.092129.6728.0949.435.3010570.16047.63143777133382.65011.115.694.305.9121.32534.6510488.0233.82563897124.99341.17266.3158.72323.93349.85218.7185.239155.182712.2527426.0877.3022149.83854.60244.793274.9441173109859937.316858888631115133322702844.182040.650.5716.3971.435480318977145083.946659.2311.8568.22911.2099.25527.0671317.40302893.5653108205.21689203.1031.177363784297502139037133.3725049.455591812193.6317.4523.78792.916514819.741256112.18107503.05447.8475.25170150.84924810671171.03364.810.52196862195.4655.2250.86378218154.971.40827871427348.1052.684371.4842101.068289.764154376.761611164.34461230817202.332901.5544773628511944.244334286.628309520.7259.177178738.12497094477887.1430.361217218.7556769.4531257263610666.07.64818.3030.826219771.2230.44472406125.752970426010.37239406.75781222.493841.48110311832.6211305.00.6311.30559.85118.05656.690369757.1796.60352.09226.60438319339.867.32225.42156.837.801541.764.535208.7860.091128.2828.3948.955.2623570.54047.76144871833382.04211.195.654.325.9521.33294.6310489.8232.54263652125.07740.97267.5878.76323.00348.29418.6525.261155.798714.9147453.5177.1792157.23863.45245.595274.8691176329559949.886874802621114562323144417.171970.650.5716.3971.435346013404441034.039523.5323.8168.23110.3388.88224.6331123.32294214.3752178270.51711621.5229.281364017296995194087181.95333.3362.294671956189.4363.3324.63985.277462120.751360163.08188873.17747.4517.33173350.81362811331249.06346.710.51460461347.2665.4480.86754518555.561.44629081454741.4251.290363.1841901.095297.132153770.571610484.50476200017752.304671.5796373799311492.944504135.519154539.6358.746180037.95358586474087.3310.371251006.0056094.4882817175710410.37.48818.2930.926019334.8620.44469414725.414960821140.37539948.35677122.433857.33108634732.8011150.60.6311.42759.75718.20046.695059651.6196.88552.36026.89538656226.367.33425.22158.077.891537.524.489208.0430.091128.8128.3349.295.3118369.88348.02144417466782.26111.195.674.335.9521.46524.6610555.1232.76063572525.11641.07267.1368.76324.48348.33818.6355.255155.783714.1817441.9577.0222156.33867.30245.453274.0971173708719964.356861321091116255322775283.801250.650.5716.3971.435501413284441180.139981.9309.6168.46510.4328.51025.0371247.48303171.3345707770.31782755.9728.971522948420231OpenBenchmarking.org

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY CreditsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance40K80K120K160K200KSE +/- 1036.73, N = 3SE +/- 1380.06, N = 3SE +/- 861.90, N = 31324771390371940871. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4080120160200SE +/- 0.23, N = 3SE +/- 0.14, N = 3SE +/- 0.23, N = 3130.61133.37181.95MIN: 90.23 / MAX: 199.74MIN: 92.59 / MAX: 205.11MIN: 125.32 / MAX: 275.361. (CC) gcc options: -pthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance70140210280350SE +/- 0.00, N = 11250.00250.00333.33MIN: 90.91 / MAX: 500MIN: 90.91 / MAX: 333.33MIN: 100 / MAX: 500

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1428425670SE +/- 0.42, N = 7SE +/- 0.52, N = 4SE +/- 0.73, N = 1547.6649.4562.291. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradebeansLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance13002600390052006500SE +/- 50.83, N = 20SE +/- 66.39, N = 20SE +/- 52.34, N = 20595455914671

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance200K400K600K800K1000KSE +/- 2183.04, N = 3SE +/- 1525.09, N = 3SE +/- 2401.68, N = 3807463.1812193.6956189.4

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance80160240320400SE +/- 1.86, N = 3SE +/- 0.53, N = 3SE +/- 3.54, N = 15308.29317.45363.33MIN: 163.13 / MAX: 334.13MIN: 173.69 / MAX: 340.43MIN: 186.32 / MAX: 403.051. (CC) gcc options: -pthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.23, N = 15SE +/- 0.17, N = 12SE +/- 0.19, N = 1521.1323.7924.641. (CXX) g++ options: -O3 -pthread -lm

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance20406080100SE +/- 0.40, N = 3SE +/- 0.43, N = 3SE +/- 0.14, N = 397.6492.9285.28

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradesoapLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance11002200330044005500SE +/- 44.82, N = 4SE +/- 61.21, N = 4SE +/- 42.72, N = 5517051484621

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance510152025SE +/- 0.10, N = 3SE +/- 0.14, N = 3SE +/- 0.09, N = 318.6319.7420.751. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance300K600K900K1200K1500KSE +/- 6204.63, N = 3SE +/- 2545.78, N = 3SE +/- 9433.94, N = 31231991.21256112.11360163.0

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance200K400K600K800K1000KSE +/- 2435.29, N = 3SE +/- 1163.43, N = 3SE +/- 4685.69, N = 3894640810750818887

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.71481.42962.14442.85923.574SE +/- 0.016, N = 3SE +/- 0.008, N = 3SE +/- 0.018, N = 32.9023.0543.177

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1122334455SE +/- 0.60, N = 3SE +/- 0.47, N = 3SE +/- 0.55, N = 343.947.847.41. (CC) gcc options: -fopenmp -O3 -lm

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectoryLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance110220330440550SE +/- 1.77, N = 3SE +/- 2.06, N = 3SE +/- 5.48, N = 3505.19475.25517.33MIN: 457.62 / MAX: 951.11MIN: 400.96 / MAX: 971.55MIN: 463.44 / MAX: 1007.521. (CC) gcc options: -O2 -lm -pthread -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4K8K12K16K20KSE +/- 24.98, N = 3SE +/- 213.45, N = 3SE +/- 199.64, N = 91846817015173351. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.19830.39660.59490.79320.9915SE +/- 0.005127, N = 5SE +/- 0.004000, N = 5SE +/- 0.005456, N = 50.8813480.8492480.813628MIN: 0.71MIN: 0.73MIN: 0.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2004006008001000105910671133

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance30060090012001500SE +/- 11.28, N = 3SE +/- 12.21, N = 4SE +/- 19.05, N = 91217.491171.031249.061. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance80160240320400SE +/- 1.11, N = 10SE +/- 0.91, N = 10SE +/- 2.58, N = 15369.01364.81346.711. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.12320.24640.36960.49280.616SE +/- 0.005010, N = 4SE +/- 0.004601, N = 4SE +/- 0.005906, N = 40.5476740.5219680.514604MIN: 0.43MIN: 0.43MIN: 0.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance14K28K42K56K70KSE +/- 690.93, N = 3SE +/- 412.91, N = 15SE +/- 715.70, N = 465193.062195.461347.2

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance140280420560700SE +/- 9.04, N = 15SE +/- 3.22, N = 3SE +/- 5.92, N = 15627.21655.23665.451. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.20570.41140.61710.82281.0285SE +/- 0.006064, N = 7SE +/- 0.001510, N = 7SE +/- 0.001697, N = 70.9141980.8637820.867545MIN: 0.78MIN: 0.79MIN: 0.781. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4080120160200SE +/- 1.60, N = 12SE +/- 1.86, N = 3SE +/- 2.62, N = 31751811851. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1224364860SE +/- 0.00, N = 3SE +/- 0.58, N = 5SE +/- 0.00, N = 352.6354.9755.56MIN: 27.03 / MAX: 58.82MIN: 31.25 / MAX: 58.82MIN: 33.33 / MAX: 58.82

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.32540.65080.97621.30161.627SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 31.3701.4081.446

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance6001200180024003000275627872908

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance300K600K900K1200K1500KSE +/- 10410.66, N = 15SE +/- 13176.39, N = 15SE +/- 10017.79, N = 131380890.221427348.101454741.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1224364860SE +/- 0.35, N = 3SE +/- 0.69, N = 3SE +/- 0.19, N = 353.8652.6851.291. (CXX) g++ options: -O2 -lOpenCL

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance80160240320400SE +/- 2.00, N = 10SE +/- 1.70, N = 9SE +/- 1.84, N = 9381.08371.48363.181. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance9001800270036004500SE +/- 78.33, N = 9SE +/- 44.10, N = 3SE +/- 68.07, N = 124393421041901. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.24640.49280.73920.98561.232SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 31.0451.0681.095

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance70140210280350SE +/- 3.80, N = 3SE +/- 2.83, N = 3SE +/- 0.07, N = 3303.45289.76297.13MIN: 284.51 / MAX: 461.21MIN: 283.65 / MAX: 458.79MIN: 295.49 / MAX: 320.41. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance30K60K90K120K150KSE +/- 1780.52, N = 15SE +/- 509.59, N = 4SE +/- 121.33, N = 4147443.86154376.76153770.571. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance300K600K900K1200K1500KSE +/- 16361.41, N = 3SE +/- 15585.71, N = 4SE +/- 17675.25, N = 151539146.211611164.341610484.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1000K2000K3000K4000K5000KSE +/- 49184.46, N = 3SE +/- 54344.04, N = 13SE +/- 7371.11, N = 34550333461230847620001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance400800120016002000169717201775

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.54121.08241.62362.16482.706SE +/- 0.03372, N = 3SE +/- 0.01587, N = 3SE +/- 0.02560, N = 152.405492.332902.30467MIN: 1.92MIN: 2MIN: 1.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.36470.72941.09411.45881.8235SE +/- 0.01518, N = 4SE +/- 0.01340, N = 4SE +/- 0.01359, N = 41.620881.554471.57963MIN: 1.31MIN: 1.29MIN: 1.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance160K320K480K640K800KSE +/- 4257.59, N = 3SE +/- 5824.36, N = 9SE +/- 2132.19, N = 3765726736285737993

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MPI - DegriddingLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance3K6K9K12K15KSE +/- 7.33, N = 3SE +/- 6.47, N = 3SE +/- 137.09, N = 311870.311944.211492.91. (CXX) g++ options: -lpthread

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: EigenLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance10002000300040005000SE +/- 49.20, N = 4SE +/- 36.23, N = 3SE +/- 26.71, N = 34284443344501. (CXX) g++ options: -flto -pthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance9001800270036004500SE +/- 35.16, N = 8SE +/- 25.26, N = 3SE +/- 27.09, N = 34233.514286.634135.521. (CC) gcc options: -O3 -mavx2

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectoryLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance120240360480600SE +/- 2.21, N = 3SE +/- 2.63, N = 3SE +/- 5.78, N = 3531.09520.72539.63MIN: 489.6 / MAX: 1034.89MIN: 176.53 / MAX: 1089.46MIN: 280.44 / MAX: 1002.781. (CC) gcc options: -O2 -lm -pthread -lmpi

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1428425670SE +/- 0.10, N = 3SE +/- 0.17, N = 3SE +/- 0.27, N = 360.8659.1858.75

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance40K80K120K160K200KSE +/- 1640.30, N = 15SE +/- 1760.31, N = 15SE +/- 1616.93, N = 15174206.13178738.12180037.951. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance10002000300040005000SE +/- 28.66, N = 18SE +/- 43.93, N = 6SE +/- 20.83, N = 6489747784740

YafaRay

YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance20406080100SE +/- 0.38, N = 3SE +/- 0.85, N = 15SE +/- 0.69, N = 1584.7487.1487.331. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandomLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.360.371. (CXX) g++ options: -O3 -pthread

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance300K600K900K1200K1500KSE +/- 11246.95, N = 3SE +/- 13782.67, N = 3SE +/- 11397.44, N = 31220973.711217218.751251006.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance12K24K36K48K60KSE +/- 721.39, N = 3SE +/- 598.50, N = 3SE +/- 365.12, N = 357601.3656769.4556094.491. (CXX) g++ options: -O3 -march=native -fopenmp

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance16K32K48K64K80KSE +/- 327.40, N = 3SE +/- 73.45, N = 3SE +/- 507.10, N = 37082472636717571. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 38.32, N = 4SE +/- 60.61, N = 3SE +/- 61.51, N = 510404.510666.010410.31. (CC) gcc options: -O3

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance246810SE +/- 0.049, N = 4SE +/- 0.029, N = 4SE +/- 0.031, N = 47.6637.6487.4881. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance510152025SE +/- 0.14, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 318.7118.3018.291. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance714212835SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 330.2430.8330.931. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4K8K12K16K20KSE +/- 67.78, N = 5SE +/- 171.84, N = 5SE +/- 182.76, N = 519576.1219771.2219334.861. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.10230.20460.30690.40920.5115SE +/- 0.00311, N = 3SE +/- 0.00005, N = 3SE +/- 0.00075, N = 30.454510.444720.44469

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLASLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance9001800270036004500SE +/- 50.84, N = 3SE +/- 49.90, N = 9SE +/- 17.79, N = 34106406141471. (CXX) g++ options: -flto -pthread

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.17, N = 12SE +/- 0.20, N = 9SE +/- 0.19, N = 1025.9225.7525.41

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance20M40M60M80M100MSE +/- 1123146.37, N = 3SE +/- 769788.53, N = 3SE +/- 1344259.09, N = 39794508997042601960821141. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.08440.16880.25320.33760.422SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3680.3720.375

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance9K18K27K36K45KSE +/- 319.03, N = 3SE +/- 393.10, N = 3SE +/- 456.10, N = 340124.3739406.7639948.361. (CXX) g++ options: -O3 -march=native -fopenmp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance510152025SE +/- 0.20, N = 15SE +/- 0.16, N = 15SE +/- 0.20, N = 1522.0922.4922.43

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance8001600240032004000SE +/- 10.26, N = 10SE +/- 5.01, N = 10SE +/- 4.01, N = 103788.883841.483857.331. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

BlogBench

BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. The behavior is mimicked of that of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of these blogs generated are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFinal Score, More Is BetterBlogBench 1.1Test: ReadLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance200K400K600K800K1000KSE +/- 10984.18, N = 9SE +/- 1738.41, N = 3SE +/- 13508.02, N = 31084405110311810863471. (CC) gcc options: -O2 -pthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVisLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance816243240SE +/- 0.00, N = 6SE +/- 0.23, N = 6SE +/- 0.24, N = 632.2632.6232.80MIN: 12.66 / MAX: 33.33MIN: 12.82 / MAX: 33.33MIN: 13.16 / MAX: 34.48

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 46.17, N = 3SE +/- 25.21, N = 3SE +/- 110.98, N = 311334.011305.011150.61. (CC) gcc options: -O3

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweetsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.630.631. (CXX) g++ options: -O3 -pthread

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance3691215SE +/- 0.05, N = 4SE +/- 0.05, N = 4SE +/- 0.01, N = 411.4911.3111.431. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 20.1Input: Carbon NanotubeLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1428425670SE +/- 0.18, N = 3SE +/- 0.39, N = 3SE +/- 0.31, N = 360.7059.8559.761. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance48121620SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 317.9418.0618.201. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance246810SE +/- 0.05969, N = 6SE +/- 0.01480, N = 6SE +/- 0.06150, N = 66.600546.690366.695051. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 96.44, N = 3SE +/- 19.43, N = 3SE +/- 71.91, N = 39622.999757.179651.611. (CC) gcc options: -O3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance20406080100SE +/- 0.20, N = 3SE +/- 0.59, N = 3SE +/- 0.55, N = 397.9496.6096.891. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1224364860SE +/- 0.06, N = 3SE +/- 0.13, N = 3SE +/- 0.00, N = 352.7952.0952.361. (CXX) g++ options: -O2 -lOpenCL

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-QueensLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.16, N = 3SE +/- 0.22, N = 3SE +/- 0.07, N = 326.9526.6026.90

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance8M16M24M32M40MSE +/- 411547.50, N = 3SE +/- 341040.65, N = 3SE +/- 498492.56, N = 338181643.638319339.838656226.3

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1530456075SE +/- 0.67, N = 3SE +/- 0.52, N = 3SE +/- 0.32, N = 368.1467.3267.33

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.22, N = 15SE +/- 0.30, N = 15SE +/- 0.24, N = 1525.1225.4225.22

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-OnlyLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4080120160200SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.95, N = 3158.64156.83158.07

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance246810SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 37.817.807.89MIN: 7.68 / MAX: 8.29MIN: 7.61 / MAX: 8.59MIN: 7.67 / MAX: 8.57

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25xLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance30060090012001500SE +/- 21.93, N = 14SE +/- 17.73, N = 15SE +/- 11.04, N = 121524.211541.761537.521. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1.02152.0433.06454.0865.1075SE +/- 0.012, N = 8SE +/- 0.015, N = 8SE +/- 0.008, N = 84.5404.5354.4891. (CXX) g++ options: -O3 -lpthread

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance50100150200250SE +/- 1.36, N = 3SE +/- 0.79, N = 3SE +/- 2.78, N = 3210.35208.79208.04

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.02070.04140.06210.08280.1035SE +/- 0.001, N = 3SE +/- 0.001, N = 12SE +/- 0.000, N = 30.0920.0910.0911. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance306090120150SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3129.67128.28128.811. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance714212835SE +/- 0.09, N = 6SE +/- 0.04, N = 6SE +/- 0.17, N = 628.0928.3928.33

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1122334455SE +/- 0.56, N = 4SE +/- 0.12, N = 3SE +/- 0.52, N = 549.4348.9549.291. (CC) gcc options: -O3

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1.19522.39043.58564.78085.976SE +/- 0.01185, N = 7SE +/- 0.03676, N = 7SE +/- 0.03612, N = 135.301055.262355.311831. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1632486480SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.21, N = 370.1670.5469.881. (CC) gcc options: -O2 -ldl -lz -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1122334455SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 347.6347.7648.021. (CC) gcc options: -O3

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance300M600M900M1200M1500MSE +/- 2689640.27, N = 3SE +/- 750486.58, N = 3SE +/- 4018322.42, N = 31437771333144871833314441746671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance20406080100SE +/- 0.08, N = 3SE +/- 0.29, N = 3SE +/- 0.29, N = 382.6582.0482.261. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVisLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance3691215SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 311.1111.1911.19MIN: 9.62 / MAX: 11.24MIN: 8.2 / MAX: 11.36MIN: 7.87 / MAX: 11.36

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ThoroughLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1.28032.56063.84095.12126.4015SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.01, N = 55.695.655.671. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.97431.94862.92293.89724.8715SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.304.324.33MIN: 3.38 / MAX: 4.35MIN: 3.76 / MAX: 4.37MIN: 3.65 / MAX: 4.37

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1.33882.67764.01645.35526.694SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.915.955.95MIN: 5.18 / MAX: 5.99MIN: 5.35 / MAX: 6.02MIN: 5.46 / MAX: 6.02

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance510152025SE +/- 0.25, N = 3SE +/- 0.21, N = 15SE +/- 0.26, N = 321.3321.3321.471. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1.04852.0973.14554.1945.2425SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 34.654.634.66

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 172.84, N = 3SE +/- 31.24, N = 3SE +/- 96.09, N = 310488.010489.810555.11. (CC) gcc options: -O3

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance50100150200250SE +/- 0.85, N = 3SE +/- 0.81, N = 3SE +/- 0.62, N = 3233.83232.54232.761. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance140K280K420K560K700K6389716365216357251. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 324.9925.0825.121. (CXX) g++ options: -O3 -pthread -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ExhaustiveLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance918273645SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 341.1740.9741.071. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance60120180240300SE +/- 0.24, N = 3SE +/- 0.23, N = 3SE +/- 0.25, N = 3266.32267.59267.141. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance246810SE +/- 0.09, N = 5SE +/- 0.09, N = 3SE +/- 0.04, N = 38.728.768.76MIN: 8.07 / MAX: 9.01MIN: 8.31 / MAX: 8.97MIN: 8.22 / MAX: 8.86

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance70140210280350SE +/- 1.74, N = 3SE +/- 0.23, N = 3SE +/- 0.25, N = 3323.93323.00324.48

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance80160240320400SE +/- 2.94, N = 9SE +/- 2.97, N = 9SE +/- 3.10, N = 9349.85348.29348.341. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 318.7218.6518.64

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1.18372.36743.55114.73485.9185SE +/- 0.039, N = 3SE +/- 0.022, N = 3SE +/- 0.021, N = 35.2395.2615.2551. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance306090120150SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3155.18155.80155.781. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance150300450600750SE +/- 1.12, N = 3SE +/- 0.04, N = 3SE +/- 1.18, N = 3712.25714.91714.181. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MPI - GriddingLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance16003200480064008000SE +/- 2.49, N = 3SE +/- 2.90, N = 3SE +/- 7.65, N = 37426.087453.517441.951. (CXX) g++ options: -lpthread

GnuPG

This test times how long it takes to encrypt a sample file using GnuPG. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File EncryptionLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance20406080100SE +/- 0.20, N = 3SE +/- 0.20, N = 3SE +/- 0.45, N = 377.3077.1877.021. (CC) gcc options: -O2

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance5001000150020002500SE +/- 13.13, N = 3SE +/- 8.27, N = 3SE +/- 16.02, N = 32149.82157.22156.31. (CXX) g++ options: -O3 -march=native -rdynamic

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance8001600240032004000SE +/- 8.23, N = 3SE +/- 2.97, N = 3SE +/- 2.74, N = 33854.603863.453867.301. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance50100150200250SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 3244.79245.60245.451. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance60120180240300SE +/- 0.39, N = 3SE +/- 0.68, N = 3SE +/- 0.08, N = 3274.94274.87274.10MIN: 273.57 / MAX: 276.21MIN: 273.07 / MAX: 276.74MIN: 273.16 / MAX: 274.861. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance30M60M90M120M150MSE +/- 865236.83, N = 3SE +/- 358515.89, N = 3SE +/- 1411556.16, N = 4117310985117632955117370871

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: GarlicoinLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 88.01, N = 14SE +/- 99.38, N = 15SE +/- 155.17, N = 149937.319949.889964.351. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Swet

Swet is a synthetic CPU/RAM benchmark, includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance150M300M450M600M750MSE +/- 5135867.19, N = 3SE +/- 1732499.41, N = 3SE +/- 3454923.88, N = 36858888636874802626861321091. (CC) gcc options: -lm -lpthread -lcurses -lrt

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance200K400K600K800K1000KSE +/- 1015.01, N = 12SE +/- 609.97, N = 12SE +/- 517.61, N = 121115133111456211162551. (CC) gcc options: -O3 -march=native

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance70M140M210M280M350MSE +/- 121683.80, N = 3SE +/- 122621.60, N = 3SE +/- 201175.83, N = 3322702844.18323144417.17322775283.801. (CC) gcc options: -O3 -march=native -lm

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserIDLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.14630.29260.43890.58520.7315SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.650.650.651. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: KostyaLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.570.570.571. (CXX) g++ options: -O3 -pthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance48121620SE +/- 0.00, N = 4SE +/- 0.00, N = 4SE +/- 0.00, N = 416.3916.3916.39MIN: 10.31 / MAX: 16.67MIN: 10.99 / MAX: 16.95MIN: 10.31 / MAX: 16.95

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1632486480SE +/- 0.00, N = 7SE +/- 0.00, N = 7SE +/- 0.00, N = 771.4371.4371.43MIN: 21.28 / MAX: 76.92MIN: 19.61 / MAX: 76.92MIN: 19.61 / MAX: 76.92

CPU Power Consumption Monitor

OpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance100200300400500Min: 60.44 / Avg: 279.2 / Max: 548.33Min: 59.88 / Avg: 280.92 / Max: 530.69Min: 59.74 / Avg: 285.23 / Max: 514.05

Chaos Group V-RAY

MinAvgMaxLinux 5.11 Git121.5421.3492.0Linux 5.11 Patched120.3419.0492.2CPUFreq Performance120.8421.1492.1OpenBenchmarking.orgWatts, Fewer Is BetterChaos Group V-RAY 4.10.07CPU Power Consumption Monitor130260390520650

OpenBenchmarking.orgKsamples Per Watt, More Is BetterChaos Group V-RAY 4.10.07Mode: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance306090120150130.07127.59130.65

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.07Mode: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance12K24K36K48K60KSE +/- 504.01, N = 3SE +/- 1018.74, N = 13SE +/- 503.58, N = 3548035346055014

TensorFlow Lite

MinAvgMaxLinux 5.11 Git123.2385.7435.9Linux 5.11 Patched121.5403.7446.4CPUFreq Performance121.8401.7439.7OpenBenchmarking.orgWatts, Fewer Is BetterTensorFlow Lite 2020-08-23CPU Power Consumption Monitor120240360480600

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance40K80K120K160K200KSE +/- 7366.43, N = 15SE +/- 2393.85, N = 15SE +/- 2146.20, N = 15189771134044132844

MinAvgMaxLinux 5.11 Git123.7447.4480.2Linux 5.11 Patched121.2453.6483.1CPUFreq Performance121.6453.0481.2OpenBenchmarking.orgWatts, Fewer Is BetterTensorFlow Lite 2020-08-23CPU Power Consumption Monitor120240360480600

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance10K20K30K40K50KSE +/- 759.19, N = 15SE +/- 400.94, N = 6SE +/- 211.03, N = 345083.941034.041180.1

MinAvgMaxLinux 5.11 Git122.2441.5477.4Linux 5.11 Patched120.6451.2480.3CPUFreq Performance120.8451.8481.0OpenBenchmarking.orgWatts, Fewer Is BetterTensorFlow Lite 2020-08-23CPU Power Consumption Monitor120240360480600

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance10K20K30K40K50KSE +/- 1144.94, N = 15SE +/- 395.37, N = 3SE +/- 473.02, N = 346659.239523.539981.9

SVT-VP9

MinAvgMaxLinux 5.11 Git120.1183.5382.5Linux 5.11 Patched119.4185.8379.1CPUFreq Performance119.5184.0372.3OpenBenchmarking.orgWatts, Fewer Is BetterSVT-VP9 0.1CPU Power Consumption Monitor100200300400500

OpenBenchmarking.orgFrames Per Second Per Watt, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.39150.7831.17451.5661.95751.701.741.68

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance70140210280350SE +/- 16.16, N = 15SE +/- 4.21, N = 15SE +/- 15.29, N = 15311.85323.81309.611. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-AV1

MinAvgMaxLinux 5.11 Git121.4207.9382.6Linux 5.11 Patched120.4209.8388.0CPUFreq Performance120.4207.6386.1OpenBenchmarking.orgWatts, Fewer Is BetterSVT-AV1 0.8CPU Power Consumption Monitor100200300400500

OpenBenchmarking.orgFrames Per Second Per Watt, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance0.07430.14860.22290.29720.37150.330.330.33

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance1530456075SE +/- 1.18, N = 15SE +/- 1.05, N = 15SE +/- 0.65, N = 1568.2368.2368.471. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Rodinia

MinAvgMaxLinux 5.11 Git121.2220.5291.1Linux 5.11 Patched120.3215.0284.6CPUFreq Performance120.7216.9281.4OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 3.1CPU Power Consumption Monitor70140210280350

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance3691215SE +/- 0.22, N = 15SE +/- 0.03, N = 5SE +/- 0.04, N = 511.2110.3410.431. (CXX) g++ options: -O2 -lOpenCL

MinAvgMaxLinux 5.11 Git121.8253.0370.4Linux 5.11 Patched121.0253.4373.0CPUFreq Performance120.9253.1370.6OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 3.1CPU Power Consumption Monitor100200300400500

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance3691215SE +/- 0.148, N = 15SE +/- 0.141, N = 15SE +/- 0.053, N = 59.2558.8828.5101. (CXX) g++ options: -O2 -lOpenCL

Nebular Empirical Analysis Tool

MinAvgMaxLinux 5.11 Git120.7177.9286.2Linux 5.11 Patched120.1179.4280.2CPUFreq Performance120.5178.5280.4OpenBenchmarking.orgWatts, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29CPU Power Consumption Monitor70140210280350

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance612182430SE +/- 0.26, N = 15SE +/- 0.56, N = 12SE +/- 0.62, N = 1427.0724.6325.041. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace

oneDNN

OpenBenchmarking.orgWatts, Fewer Is BetteroneDNN 2.0CPU Power Consumption MonitorLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance90180270360450Min: 122.37 / Avg: 333.59 / Max: 492.01Min: 120.62 / Avg: 341.54 / Max: 492.22Min: 120.77 / Avg: 339.16 / Max: 492.27

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPULinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance30060090012001500SE +/- 26.23, N = 15SE +/- 3.00, N = 3SE +/- 35.45, N = 151317.401123.321247.48MIN: 1147.91MIN: 1077.56MIN: 1069.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

KeyDB

MinAvgMaxLinux 5.11 Git120.8157.0179.4Linux 5.11 Patched64.4157.7178.6CPUFreq Performance60.8172.1190.1OpenBenchmarking.orgWatts, Fewer Is BetterKeyDB 6.0.16CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgOps/sec Per Watt, More Is BetterKeyDB 6.0.16Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance4008001200160020001929.561865.161761.51

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance60K120K180K240K300KSE +/- 4239.68, N = 15SE +/- 3012.50, N = 15SE +/- 5131.54, N = 15302893.56294214.37303171.331. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

DaCapo Benchmark

MinAvgMaxLinux 5.11 Git120.7134.9160.4Linux 5.11 Patched60.6134.4163.7CPUFreq Performance119.8139.4180.3OpenBenchmarking.orgWatts, Fewer Is BetterDaCapo Benchmark 9.12-MR1CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance11002200330044005500SE +/- 36.45, N = 20SE +/- 73.65, N = 20SE +/- 70.06, N = 20531052174570

Zstd Compression

MinAvgMaxLinux 5.11 Git121.4153.5205.1Linux 5.11 Patched120.7150.7214.6CPUFreq Performance120.0153.7222.8OpenBenchmarking.orgWatts, Fewer Is BetterZstd Compression 1.4.5CPU Power Consumption Monitor60120180240300

OpenBenchmarking.orgMB/s Per Watt, More Is BetterZstd Compression 1.4.5Compression Level: 3Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance122436486053.4554.8850.56

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3Linux 5.11 GitLinux 5.11 PatchedCPUFreq Performance2K4K6K8K10KSE +/- 30.36, N = 3SE +/- 69.07, N = 3SE +/- 648.67, N = 128205.28270.57770.31. (CC) gcc options: -O3 -pthread -lz -llzma

Redis

MinAvgMaxLinux 5.11 Git120.6125.0135.5Linux 5.11 Patched119.7123.6133.0CPUFreq Performance119.8124.0135.9OpenBenchmarking.orgWatts, Fewer Is BetterRedis 6.0.9CPU Power Consumption Monitor4080120160200

OpenBenchmarking.orgRequests Per Second Per Watt, More Is BetterRedis 6.0.9Test: GETLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance3K6K9K12K15K13515.1713846.0414375.86

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance400K800K1200K1600K2000KSE +/- 6519.22, N = 4SE +/- 12716.89, N = 11SE +/- 32510.57, N = 121689203.101711621.521782755.971. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

QMCPACK

MinAvgMaxLinux 5.11 Git121.2433.2491.7Linux 5.11 Patched120.3437.6491.3CPUFreq Performance120.6438.1491.9OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.10CPU Power Consumption Monitor130260390520650

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2OLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance714212835SE +/- 0.53, N = 15SE +/- 0.08, N = 3SE +/- 0.06, N = 331.1829.2828.971. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

Cpuminer-Opt

MinAvgMaxLinux 5.11 Git120.4125.5148.6Linux 5.11 Patched120.0125.6144.4CPUFreq Performance72.0144.2193.5OpenBenchmarking.orgWatts, Fewer Is BetterCpuminer-Opt 3.15.5CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgkH/s Per Watt, More Is BetterCpuminer-Opt 3.15.5Algorithm: SkeincoinLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance80016002400320040002899.042897.783625.65

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: SkeincoinLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance110K220K330K440K550KSE +/- 3597.13, N = 15SE +/- 5604.68, N = 12SE +/- 9614.65, N = 123637843640175229481. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

MinAvgMaxLinux 5.11 Git120.6125.4134.4Linux 5.11 Patched120.1125.9138.6CPUFreq Performance120.9145.0174.3OpenBenchmarking.orgWatts, Fewer Is BetterCpuminer-Opt 3.15.5CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgkH/s Per Watt, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, PyriteLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance60012001800240030002373.232358.082898.85

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, PyriteLinux 5.11 GitLinux 5.11 PatchedCPUFreq Performance90K180K270K360K450KSE +/- 2892.28, N = 6SE +/- 5314.19, N = 12SE +/- 19086.02, N = 122975022969954202311. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

173 Results Shown

Cpuminer-Opt
dav1d
OSPray
x265
DaCapo Benchmark
InfluxDB
dav1d
LAMMPS Molecular Dynamics Simulator
Timed GDB GNU Debugger Compilation
DaCapo Benchmark
x265
InfluxDB
TensorFlow Lite
rav1e
CLOMP
IOR
FFTW
oneDNN
AI Benchmark Alpha
Quantum ESPRESSO
SVT-VP9
oneDNN
TensorFlow Lite
TTSIOD 3D Renderer
oneDNN
ONNX Runtime
OSPray
rav1e
AI Benchmark Alpha
Redis
Rodinia
SVT-VP9
ONNX Runtime
rav1e
TNN
NAS Parallel Benchmarks
Redis
John The Ripper
AI Benchmark Alpha
oneDNN:
  Deconvolution Batch shapes_1d - f32 - CPU
  IP Shapes 1D - f32 - CPU
TensorFlow Lite
ASKAP
LeelaChessZero
Himeno Benchmark
IOR
Timed Godot Game Engine Compilation
FFTE
DaCapo Benchmark
YafaRay
simdjson
Redis
FinanceBench
John The Ripper
LZ4 Compression
SVT-AV1
OpenFOAM
High Performance Conjugate Gradient
LULESH
NAMD
LeelaChessZero
Timed Linux Kernel Compilation
Stockfish
rav1e
FinanceBench
PlaidML
NAS Parallel Benchmarks
BlogBench
OSPray
LZ4 Compression
simdjson
POV-Ray
GPAW
Tachyon
Tungsten Renderer
LZ4 Compression
Rodinia:
  OpenMP HotSpot3D
  OpenMP LavaMD
Cython Benchmark
BYTE Unix Benchmark
Build2
PlaidML
Blender
LuxCoreRender
Cpuminer-Opt
Primesieve
Timed LLVM Compilation
SVT-AV1
OpenFOAM
Intel Open Image Denoise
LZ4 Compression
Tungsten Renderer
SQLite Speedtest
LZ4 Compression
Algebraic Multi-Grid Benchmark
Timed MrBayes Analysis
OSPray
ASTC Encoder
OSPray:
  San Miguel - Path Tracer
  XFrog Forest - Path Tracer
Tungsten Renderer
PlaidML
LZ4 Compression
Gcrypt Library
BRL-CAD
LAMMPS Molecular Dynamics Simulator
ASTC Encoder
Etcpak
LuxCoreRender
Numpy Benchmark
RELION
Dolfyn
GROMACS
Etcpak
Google SynthMark
ASKAP
GnuPG
QuantLib
NAS Parallel Benchmarks
Etcpak
TNN
asmFish
Cpuminer-Opt
Swet
TSCP
Hierarchical INTegration
simdjson:
  DistinctUserID
  Kostya
OSPray:
  NASA Streamlines - Path Tracer
  NASA Streamlines - SciVis
CPU Power Consumption Monitor:
  Phoronix Test Suite System Monitoring
  CPU Power Consumption Monitor
  CPU
Chaos Group V-RAY
TensorFlow Lite
TensorFlow Lite
TensorFlow Lite
TensorFlow Lite
TensorFlow Lite
TensorFlow Lite
SVT-VP9:
  CPU Power Consumption Monitor
  Visual Quality Optimized - Bosphorus 1080p
SVT-VP9
SVT-AV1:
  CPU Power Consumption Monitor
  Enc Mode 8 - 1080p
SVT-AV1
Rodinia
Rodinia
Rodinia
Rodinia
Nebular Empirical Analysis Tool
Nebular Empirical Analysis Tool
oneDNN
oneDNN
KeyDB:
  CPU Power Consumption Monitor
 
KeyDB
DaCapo Benchmark
DaCapo Benchmark
Zstd Compression:
  CPU Power Consumption Monitor
  3
Zstd Compression
Redis:
  CPU Power Consumption Monitor
  GET
Redis
QMCPACK
QMCPACK
Cpuminer-Opt:
  CPU Power Consumption Monitor
  Skeincoin
Cpuminer-Opt
Cpuminer-Opt:
  CPU Power Consumption Monitor
  Quad SHA-256, Pyrite
Cpuminer-Opt