AMD EPYC 7F72 2P Linux 5.11 - Perf Governor

2 x AMD EPYC 7F72 24-Core testing looking at CPU freq invariance on 5.11 with patch. CPU power consumption monitoring via AMD_Energy interface at 1 second polling. Additional data with CPUFreq performance governor included.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101273-SYST-210125309
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 3 Tests
Bioinformatics 3 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 4 Tests
C++ Boost Tests 5 Tests
Chess Test Suite 4 Tests
Timed Code Compilation 5 Tests
C/C++ Compiler Tests 20 Tests
Compression Tests 2 Tests
CPU Massive 38 Tests
Creator Workloads 21 Tests
Cryptography 4 Tests
Database Test Suite 4 Tests
Encoding 5 Tests
Finance 2 Tests
Fortran Tests 7 Tests
Game Development 5 Tests
HPC - High Performance Computing 28 Tests
LAPACK (Linear Algebra Pack) Tests 2 Tests
Machine Learning 8 Tests
Molecular Dynamics 6 Tests
MPI Benchmarks 8 Tests
Multi-Core 35 Tests
NVIDIA GPU Compute 8 Tests
Intel oneAPI 4 Tests
OpenMPI Tests 15 Tests
Programmer / Developer System Benchmarks 9 Tests
Python 2 Tests
Quantum Mechanics 2 Tests
Raytracing 4 Tests
Renderers 9 Tests
Scientific Computing 15 Tests
Server 6 Tests
Server CPU Tests 22 Tests
Single-Threaded 6 Tests
Texture Compression 2 Tests
Video Encoding 5 Tests
Common Workstation Benchmarks 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
CPUFreq Schedutil
January 23 2021
  15 Hours, 14 Minutes
CPUFreq Performance
January 24 2021
  16 Hours, 38 Minutes
Invert Hiding All Results Option
  15 Hours, 56 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 7F72 2P Linux 5.11 - Perf GovernorOpenBenchmarking.orgPhoronix Test Suite2 x AMD EPYC 7F72 24-Core @ 3.20GHz (48 Cores / 96 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN1000GB Western Digital WD_BLACK SN850 1TBASPEEDVE2282 x Intel 10G X550TUbuntu 20.105.11.0-rc4-max-boost-inv-patch (x86_64) 20210121GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.9GCC 10.2.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionAMD EPYC 7F72 2P Linux 5.11 - Perf Governor BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096- CPUFreq Schedutil: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034 - CPUFreq Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 - OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)- Python 3.8.6- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

CPUFreq Schedutil vs. CPUFreq Performance ComparisonPhoronix Test SuiteBaseline+10.9%+10.9%+21.8%+21.8%+32.7%+32.7%+43.6%+43.6%43.7%41.5%39.6%36.4%33.3%26%19.7%17.7%14.5%14.2%11.4%9%8.9%8.3%6.2%5.1%4.4%4.4%4.3%4.2%4%3.6%3.6%3.2%3.2%3%2.9%2.8%2.8%2.7%2.7%2.5%2.2%2.1%SkeincoinQ.S.2.PLBC, LBRY CreditsC.1.1.bM.R - Path TracerBosphorus 1080pTradebeans4 - 10000 - 2,5000,1 - 10000Summer Nature 4KH2TradesoapR.N.N.T - f32 - CPU11.1%Time To Compile2MB64 - 10000 - 2,5000,1 - 10000AUSURF1126.7%36.4%D.T.SVMAF Optimized - Bosphorus 1080p5.2%Bosphorus 4KV.Q.O - Bosphorus 1080p4.6%IP Shapes 3D - f32 - CPUOpenMP CFD SolverDevice AI ScoreGET10tConvolve MPI - Degridding3.9%P.P.S3.7%8MBRhodopsin ProteinMD5D.I.SCPULargeRandLPUSHOpenMP Leukocyte6CPU - MobileNet v22.5%53 - D.S2.5%P.S.O - Bosphorus 1080p2.3%2.3%yolov4 - OpenMP CPUEnc Mode 4 - 1080p2.1%BLASCpuminer-OptCpuminer-OptCpuminer-Optdav1dOSPrayx265DaCapo BenchmarkInfluxDBdav1dDaCapo BenchmarkDaCapo BenchmarkoneDNNTimed GDB GNU Debugger CompilationIORInfluxDBQuantum ESPRESSOZstd CompressionAI Benchmark AlphaSVT-VP9x265SVT-VP9oneDNNRodiniaAI Benchmark AlphaRedisrav1eASKAPHimeno BenchmarkIORLAMMPS Molecular Dynamics SimulatorJohn The RipperAI Benchmark AlphaKeyDBChaos Group V-RAYsimdjsonRedisRodiniarav1eTNNrav1eLZ4 CompressionSVT-VP9LULESHONNX RuntimeSVT-AV1LeelaChessZeroCPUFreq SchedutilCPUFreq Performance

AMD EPYC 7F72 2P Linux 5.11 - Perf Governorcpuminer-opt: LBC, LBRY Creditsdav1d: Chimera 1080p 10-bitospray: Magnetic Reconnection - Path Tracerx265: Bosphorus 1080pdacapobench: Tradebeansinfluxdb: 4 - 10000 - 2,5000,1 - 10000dav1d: Summer Nature 4Kdacapobench: Tradesoapbuild-gdb: Time To Compileior: 2MB - Default Test Directoryinfluxdb: 64 - 10000 - 2,5000,1 - 10000qe: AUSURF112ai-benchmark: Device Training Scoresvt-vp9: VMAF Optimized - Bosphorus 1080px265: Bosphorus 4Konednn: IP Shapes 3D - f32 - CPUai-benchmark: Device AI Scorerav1e: 10askap: tConvolve MPI - Degriddinghimeno: Poisson Pressure Solverior: 8MB - Default Test Directorylammps: Rhodopsin Proteinjohn-the-ripper: MD5ai-benchmark: Device Inference Scoresimdjson: LargeRandredis: LPUSHrodinia: OpenMP Leukocyterav1e: 6tnn: CPU - MobileNet v2rav1e: 5compress-lz4: 3 - Decompression Speedsvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080plulesh: onnx: yolov4 - OpenMP CPUsvt-av1: Enc Mode 4 - 1080plczero: BLASredis: SETfftw: Float + SSE - 2D FFT Size 4096onednn: IP Shapes 1D - f32 - CPUttsiod-renderer: Phong Rendering With Soft-Shadow Mappingblogbench: Readonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUcompress-lz4: 1 - Decompression Speedtensorflow-lite: SqueezeNetfinancebench: Repo OpenMPbuild-linux-kernel: Time To Compilejohn-the-ripper: Blowfishonednn: Deconvolution Batch shapes_1d - f32 - CPUfinancebench: Bonds OpenMPtensorflow-lite: Mobilenet Floatluxcorerender: DLSCcython-bench: N-Queenscompress-lz4: 1 - Compression Speedpovray: Trace Timeospray: San Miguel - SciVisqmcpack: simple-H2Oprimesieve: 1e12 Prime Number Generationtensorflow-lite: Inception V4stockfish: Total Timetungsten: Volumetric Causticsqlite-speedtest: Timed Time - Size 1,000rodinia: OpenMP Streamclusterbyte: Dhrystone 2clomp: Static OMP Speeduprav1e: 1dacapobench: Jythontachyon: Total Timeplaidml: No - Inference - VGG16 - CPUblender: Barbershop - CPU-Onlybuild-godot: Time To Compileffte: N=256, 3D Complex FFT Routinecompress-lz4: 3 - Compression Speedplaidml: No - Inference - ResNet 50 - CPUcompress-lz4: 9 - Decompression Speedtungsten: Water Causticospray: Magnetic Reconnection - SciViscompress-lz4: 9 - Compression Speedrodinia: OpenMP LavaMDonnx: super-resolution-10 - OpenMP CPUnumpy: onednn: Convolution Batch Shapes Auto - f32 - CPUopenfoam: Motorbike 60Mnpb: EP.Cnpb: LU.Clczero: Eigenbuild-llvm: Time To Compiletensorflow-lite: Mobilenet Quantastcenc: Thoroughsvt-av1: Enc Mode 8 - 1080phpcg: amg: rodinia: OpenMP HotSpot3Dtnn: CPU - SqueezeNet v1.1cpuminer-opt: x25xplaidml: No - Inference - VGG19 - CPUmrbayes: Primate Phylogeny Analysisastcenc: Exhaustivetensorflow-lite: Inception ResNet V2ospray: San Miguel - Path Tracerasmfish: 1024 Hash Memory, 26 Depthyafaray: Total Time For Sample Sceneoidn: Memorialgnupg: 2.7GB Sample File Encryptionswet: Averageetcpak: ETC1gpaw: Carbon Nanotubelammps: 20k Atomsaskap: tConvolve MPI - Griddingtscp: AI Chess Performancecpuminer-opt: Garlicoinbrl-cad: VGR Performance Metrichint: FLOATgromacs: Water Benchmarksynthmark: VoiceMark_100npb: EP.Dgcrypt: dolfyn: Computational Fluid Dynamicstungsten: Hairetcpak: ETC1 + Ditheringopenfoam: Motorbike 30Mredis: SADDquantlib: build2: Time To Compilerelion: Basic - CPUetcpak: ETC2namd: ATPase Simulation - 327,506 Atomssvt-av1: Enc Mode 0 - 1080psimdjson: DistinctUserIDsimdjson: Kostyasimdjson: PartialTweetsluxcorerender: Rainbow Colors and Prismospray: NASA Streamlines - Path Tracerospray: NASA Streamlines - SciVisospray: XFrog Forest - Path Tracerospray: XFrog Forest - SciVisv-ray: CPUtensorflow-lite: NASNet Mobilesvt-vp9: Visual Quality Optimized - Bosphorus 1080prodinia: OpenMP CFD Solverneat: onednn: Recurrent Neural Network Training - f32 - CPUkeydb: dacapobench: H2compress-zstd: 3redis: GETcpuminer-opt: Skeincoincpuminer-opt: Quad SHA-256, PyriteCPUFreq SchedutilCPUFreq Performance139037133.3725049.455591812193.6317.45514892.916475.251256112.11171.031067364.8119.740.84924827873.05411944.24286.628309520.7223.787461230817200.361217218.7552.6841.408289.7641.06810666.0371.4819771.2231817.64840611427348.10170151.55447655.22511031180.52196811305.062195.439406.75781225.752726362.3329056769.45312539523.57.8026.6049757.1711.30554.9729.2814.535810750970426015.2623570.54010.33838319339.847.80.372477818.056525.42156.8359.177178738.1249709448.954.6310489.821.332932.6247.7652.0924210323.000.863782128.283841.48154376.764433208.78641034.05.6568.23130.8262144871833396.603274.8691541.7622.4982.04240.977362854.3211763295587.14328.3977.179687480262267.58759.85125.0777453.5111145629949.88636521323144417.171975.261714.9143863.45232.54218.6526.69036245.59518.301611164.342157.267.322348.294155.7980.444720.0910.650.570.638.7616.3971.435.9511.1953460134044323.818.88224.6331123.32294214.3752178270.51711621.52364017296995194087181.95333.3362.294671956189.4363.33462185.277517.331360163.01249.061133346.7120.750.81362829083.17711492.94135.519154539.6324.639476200017750.371251006.0051.2901.446297.1321.09510410.3363.1819334.8621857.48841471454741.42173351.57963665.44810863470.51460411150.661347.239948.35677125.414717572.3046756094.48828139981.97.8926.8959651.6111.42755.5628.9714.489818887960821145.3118369.88310.43238656226.347.40.375474018.200425.22158.0758.746180037.9535858649.294.6610555.121.465232.8048.0252.3604190324.480.867545128.813857.33153770.574450208.04341180.15.6768.46530.9260144417466796.885274.0971537.5222.4382.26141.077379934.3311737087187.33128.3377.022686132109267.13659.75725.1167441.9511162559964.35635725322775283.801255.255714.1813867.30232.76018.6356.69505245.45318.291610484.502156.367.334348.338155.7830.444690.0910.650.570.638.7616.3971.435.9511.1955014132844309.618.51025.0371247.48303171.3345707770.31782755.97522948420231OpenBenchmarking.org

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY CreditsCPUFreq PerformanceCPUFreq Schedutil40K80K120K160K200KSE +/- 861.90, N = 3SE +/- 1380.06, N = 31940871390371. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY CreditsCPUFreq PerformanceCPUFreq Schedutil30K60K90K120K150KMin: 192690 / Avg: 194086.67 / Max: 195660Min: 136670 / Avg: 139036.67 / Max: 1414501. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitCPUFreq PerformanceCPUFreq Schedutil4080120160200SE +/- 0.23, N = 3SE +/- 0.14, N = 3181.95133.37MIN: 125.32 / MAX: 275.36MIN: 92.59 / MAX: 205.111. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitCPUFreq PerformanceCPUFreq Schedutil306090120150Min: 181.5 / Avg: 181.95 / Max: 182.28Min: 133.23 / Avg: 133.37 / Max: 133.651. (CC) gcc options: -pthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil70140210280350SE +/- 0.00, N = 11333.33250.00MIN: 100 / MAX: 500MIN: 90.91 / MAX: 333.33
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil60120180240300Min: 333.33 / Avg: 333.33 / Max: 333.33

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil1428425670SE +/- 0.73, N = 15SE +/- 0.52, N = 462.2949.451. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil1224364860Min: 58.9 / Avg: 62.29 / Max: 69.02Min: 48.01 / Avg: 49.45 / Max: 50.311. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradebeansCPUFreq PerformanceCPUFreq Schedutil12002400360048006000SE +/- 52.34, N = 20SE +/- 66.39, N = 2046715591
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradebeansCPUFreq PerformanceCPUFreq Schedutil10002000300040005000Min: 4214 / Avg: 4670.95 / Max: 5110Min: 5113 / Avg: 5590.8 / Max: 6277

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000CPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KSE +/- 2401.68, N = 3SE +/- 1525.09, N = 3956189.4812193.6
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000CPUFreq PerformanceCPUFreq Schedutil170K340K510K680K850KMin: 952399.9 / Avg: 956189.37 / Max: 960640.3Min: 810252.3 / Avg: 812193.6 / Max: 815201.7

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KCPUFreq PerformanceCPUFreq Schedutil80160240320400SE +/- 3.54, N = 15SE +/- 0.53, N = 3363.33317.45MIN: 186.32 / MAX: 403.05MIN: 173.69 / MAX: 340.431. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KCPUFreq PerformanceCPUFreq Schedutil60120180240300Min: 333.89 / Avg: 363.33 / Max: 377.56Min: 316.41 / Avg: 317.45 / Max: 318.171. (CC) gcc options: -pthread

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradesoapCPUFreq PerformanceCPUFreq Schedutil11002200330044005500SE +/- 42.72, N = 5SE +/- 61.21, N = 446215148
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradesoapCPUFreq PerformanceCPUFreq Schedutil9001800270036004500Min: 4484 / Avg: 4621.4 / Max: 4730Min: 4978 / Avg: 5147.75 / Max: 5264

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileCPUFreq PerformanceCPUFreq Schedutil20406080100SE +/- 0.14, N = 3SE +/- 0.43, N = 385.2892.92
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileCPUFreq PerformanceCPUFreq Schedutil20406080100Min: 85.13 / Avg: 85.28 / Max: 85.56Min: 92.05 / Avg: 92.92 / Max: 93.37

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectoryCPUFreq PerformanceCPUFreq Schedutil110220330440550SE +/- 5.48, N = 3SE +/- 2.06, N = 3517.33475.25MIN: 463.44 / MAX: 1007.52MIN: 400.96 / MAX: 971.551. (CC) gcc options: -O2 -lm -pthread -lmpi
OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectoryCPUFreq PerformanceCPUFreq Schedutil90180270360450Min: 507.13 / Avg: 517.33 / Max: 525.92Min: 473.03 / Avg: 475.25 / Max: 479.371. (CC) gcc options: -O2 -lm -pthread -lmpi

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000CPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KSE +/- 9433.94, N = 3SE +/- 2545.78, N = 31360163.01256112.1
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000CPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KMin: 1341751.8 / Avg: 1360163.03 / Max: 1372941.8Min: 1251965 / Avg: 1256112.13 / Max: 1260743.8

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112CPUFreq PerformanceCPUFreq Schedutil30060090012001500SE +/- 19.05, N = 9SE +/- 12.21, N = 41249.061171.031. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112CPUFreq PerformanceCPUFreq Schedutil2004006008001000Min: 1163.48 / Avg: 1249.06 / Max: 1316.52Min: 1148.76 / Avg: 1171.03 / Max: 1205.511. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreCPUFreq PerformanceCPUFreq Schedutil200400600800100011331067

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil80160240320400SE +/- 2.58, N = 15SE +/- 0.91, N = 10346.71364.811. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil70140210280350Min: 338.22 / Avg: 346.71 / Max: 380.95Min: 361.66 / Avg: 364.81 / Max: 369.911. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KCPUFreq PerformanceCPUFreq Schedutil510152025SE +/- 0.09, N = 3SE +/- 0.14, N = 320.7519.741. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KCPUFreq PerformanceCPUFreq Schedutil510152025Min: 20.59 / Avg: 20.75 / Max: 20.9Min: 19.52 / Avg: 19.74 / Max: 19.991. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil0.19110.38220.57330.76440.9555SE +/- 0.005456, N = 5SE +/- 0.004000, N = 50.8136280.849248MIN: 0.69MIN: 0.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil246810Min: 0.8 / Avg: 0.81 / Max: 0.83Min: 0.84 / Avg: 0.85 / Max: 0.861. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreCPUFreq PerformanceCPUFreq Schedutil600120018002400300029082787

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10CPUFreq PerformanceCPUFreq Schedutil0.71481.42962.14442.85923.574SE +/- 0.018, N = 3SE +/- 0.008, N = 33.1773.054
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10CPUFreq PerformanceCPUFreq Schedutil246810Min: 3.15 / Avg: 3.18 / Max: 3.21Min: 3.04 / Avg: 3.05 / Max: 3.07

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MPI - DegriddingCPUFreq PerformanceCPUFreq Schedutil3K6K9K12K15KSE +/- 137.09, N = 3SE +/- 6.47, N = 311492.911944.21. (CXX) g++ options: -lpthread
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MPI - DegriddingCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KMin: 11230.5 / Avg: 11492.93 / Max: 11692.9Min: 11933 / Avg: 11944.2 / Max: 11955.41. (CXX) g++ options: -lpthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverCPUFreq PerformanceCPUFreq Schedutil9001800270036004500SE +/- 27.09, N = 3SE +/- 25.26, N = 34135.524286.631. (CC) gcc options: -O3 -mavx2
OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverCPUFreq PerformanceCPUFreq Schedutil7001400210028003500Min: 4105.93 / Avg: 4135.52 / Max: 4189.62Min: 4237.67 / Avg: 4286.63 / Max: 4321.931. (CC) gcc options: -O3 -mavx2

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectoryCPUFreq PerformanceCPUFreq Schedutil120240360480600SE +/- 5.78, N = 3SE +/- 2.63, N = 3539.63520.72MIN: 280.44 / MAX: 1002.78MIN: 176.53 / MAX: 1089.461. (CC) gcc options: -O2 -lm -pthread -lmpi
OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectoryCPUFreq PerformanceCPUFreq Schedutil100200300400500Min: 528.99 / Avg: 539.63 / Max: 548.85Min: 517.86 / Avg: 520.72 / Max: 525.961. (CC) gcc options: -O2 -lm -pthread -lmpi

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinCPUFreq PerformanceCPUFreq Schedutil612182430SE +/- 0.19, N = 15SE +/- 0.17, N = 1224.6423.791. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinCPUFreq PerformanceCPUFreq Schedutil612182430Min: 22.96 / Avg: 24.64 / Max: 25.7Min: 23 / Avg: 23.79 / Max: 25.061. (CXX) g++ options: -O3 -pthread -lm

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5CPUFreq PerformanceCPUFreq Schedutil1000K2000K3000K4000K5000KSE +/- 7371.11, N = 3SE +/- 54344.04, N = 13476200046123081. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5CPUFreq PerformanceCPUFreq Schedutil800K1600K2400K3200K4000KMin: 4751000 / Avg: 4762000 / Max: 4776000Min: 4069000 / Avg: 4612307.69 / Max: 47710001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreCPUFreq PerformanceCPUFreq Schedutil40080012001600200017751720

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandomCPUFreq PerformanceCPUFreq Schedutil0.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.361. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandomCPUFreq PerformanceCPUFreq Schedutil12345Min: 0.37 / Avg: 0.37 / Max: 0.37Min: 0.36 / Avg: 0.36 / Max: 0.371. (CXX) g++ options: -O3 -pthread

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHCPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KSE +/- 11397.44, N = 3SE +/- 13782.67, N = 31251006.001217218.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHCPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KMin: 1228211.25 / Avg: 1251006 / Max: 1262466.88Min: 1192471.25 / Avg: 1217218.75 / Max: 1240107.121. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteCPUFreq PerformanceCPUFreq Schedutil1224364860SE +/- 0.19, N = 3SE +/- 0.69, N = 351.2952.681. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteCPUFreq PerformanceCPUFreq Schedutil1122334455Min: 50.95 / Avg: 51.29 / Max: 51.6Min: 51.76 / Avg: 52.68 / Max: 54.031. (CXX) g++ options: -O2 -lOpenCL

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6CPUFreq PerformanceCPUFreq Schedutil0.32540.65080.97621.30161.627SE +/- 0.001, N = 3SE +/- 0.003, N = 31.4461.408
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6CPUFreq PerformanceCPUFreq Schedutil246810Min: 1.45 / Avg: 1.45 / Max: 1.45Min: 1.4 / Avg: 1.41 / Max: 1.41

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2CPUFreq PerformanceCPUFreq Schedutil60120180240300SE +/- 0.07, N = 3SE +/- 2.83, N = 3297.13289.76MIN: 295.49 / MAX: 320.4MIN: 283.65 / MAX: 458.791. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2CPUFreq PerformanceCPUFreq Schedutil50100150200250Min: 296.99 / Avg: 297.13 / Max: 297.23Min: 285.31 / Avg: 289.76 / Max: 2951. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5CPUFreq PerformanceCPUFreq Schedutil0.24640.49280.73920.98561.232SE +/- 0.003, N = 3SE +/- 0.001, N = 31.0951.068
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5CPUFreq PerformanceCPUFreq Schedutil246810Min: 1.09 / Avg: 1.1 / Max: 1.1Min: 1.07 / Avg: 1.07 / Max: 1.07

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KSE +/- 61.51, N = 5SE +/- 60.61, N = 310410.310666.01. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KMin: 10225.2 / Avg: 10410.26 / Max: 10560.5Min: 10559.6 / Avg: 10665.97 / Max: 10769.51. (CC) gcc options: -O3

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil80160240320400SE +/- 1.84, N = 9SE +/- 1.70, N = 9363.18371.481. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil70140210280350Min: 350.88 / Avg: 363.18 / Max: 369.69Min: 363.2 / Avg: 371.48 / Max: 378.551. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3CPUFreq PerformanceCPUFreq Schedutil4K8K12K16K20KSE +/- 182.76, N = 5SE +/- 171.84, N = 519334.8619771.221. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3CPUFreq PerformanceCPUFreq Schedutil3K6K9K12K15KMin: 18747.2 / Avg: 19334.86 / Max: 19750.99Min: 19188.04 / Avg: 19771.22 / Max: 20140.781. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUCPUFreq PerformanceCPUFreq Schedutil4080120160200SE +/- 2.62, N = 3SE +/- 1.86, N = 31851811. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUCPUFreq PerformanceCPUFreq Schedutil306090120150Min: 181 / Avg: 185.17 / Max: 190Min: 177.5 / Avg: 181.17 / Max: 183.51. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pCPUFreq PerformanceCPUFreq Schedutil246810SE +/- 0.031, N = 4SE +/- 0.029, N = 47.4887.6481. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pCPUFreq PerformanceCPUFreq Schedutil3691215Min: 7.44 / Avg: 7.49 / Max: 7.57Min: 7.58 / Avg: 7.65 / Max: 7.71. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLASCPUFreq PerformanceCPUFreq Schedutil9001800270036004500SE +/- 17.79, N = 3SE +/- 49.90, N = 9414740611. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLASCPUFreq PerformanceCPUFreq Schedutil7001400210028003500Min: 4119 / Avg: 4147 / Max: 4180Min: 3811 / Avg: 4060.89 / Max: 43591. (CXX) g++ options: -flto -pthread

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETCPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KSE +/- 10017.79, N = 13SE +/- 13176.39, N = 151454741.421427348.101. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETCPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KMin: 1381801.25 / Avg: 1454741.42 / Max: 1521606.75Min: 1355752.75 / Avg: 1427348.1 / Max: 15195261. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096CPUFreq PerformanceCPUFreq Schedutil4K8K12K16K20KSE +/- 199.64, N = 9SE +/- 213.45, N = 317335170151. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096CPUFreq PerformanceCPUFreq Schedutil3K6K9K12K15KMin: 16583 / Avg: 17334.89 / Max: 18665Min: 16653 / Avg: 17015.33 / Max: 173921. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil0.35540.71081.06621.42161.777SE +/- 0.01359, N = 4SE +/- 0.01340, N = 41.579631.55447MIN: 1.31MIN: 1.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil246810Min: 1.55 / Avg: 1.58 / Max: 1.61Min: 1.53 / Avg: 1.55 / Max: 1.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingCPUFreq PerformanceCPUFreq Schedutil140280420560700SE +/- 5.92, N = 15SE +/- 3.22, N = 3665.45655.231. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingCPUFreq PerformanceCPUFreq Schedutil120240360480600Min: 632.51 / Avg: 665.45 / Max: 717.26Min: 651.25 / Avg: 655.23 / Max: 661.591. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

BlogBench

BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. The behavior is mimicked of that of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of these blogs generated are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFinal Score, More Is BetterBlogBench 1.1Test: ReadCPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KSE +/- 13508.02, N = 3SE +/- 1738.41, N = 3108634711031181. (CC) gcc options: -O2 -pthread
OpenBenchmarking.orgFinal Score, More Is BetterBlogBench 1.1Test: ReadCPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KMin: 1066904 / Avg: 1086347.33 / Max: 1112313Min: 1099845 / Avg: 1103118.33 / Max: 11057701. (CC) gcc options: -O2 -pthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil0.11740.23480.35220.46960.587SE +/- 0.005906, N = 4SE +/- 0.004601, N = 40.5146040.521968MIN: 0.43MIN: 0.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil246810Min: 0.51 / Avg: 0.51 / Max: 0.53Min: 0.51 / Avg: 0.52 / Max: 0.541. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KSE +/- 110.98, N = 3SE +/- 25.21, N = 311150.611305.01. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KMin: 10929 / Avg: 11150.57 / Max: 11272.7Min: 11265.7 / Avg: 11304.97 / Max: 113521. (CC) gcc options: -O3

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetCPUFreq PerformanceCPUFreq Schedutil13K26K39K52K65KSE +/- 715.70, N = 4SE +/- 412.91, N = 1561347.262195.4
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetCPUFreq PerformanceCPUFreq Schedutil11K22K33K44K55KMin: 60282.3 / Avg: 61347.15 / Max: 63426.1Min: 59468.9 / Avg: 62195.4 / Max: 65210.3

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPCPUFreq PerformanceCPUFreq Schedutil9K18K27K36K45KSE +/- 456.10, N = 3SE +/- 393.10, N = 339948.3639406.761. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPCPUFreq PerformanceCPUFreq Schedutil7K14K21K28K35KMin: 39249.63 / Avg: 39948.36 / Max: 40805.59Min: 38632.56 / Avg: 39406.76 / Max: 39912.351. (CXX) g++ options: -O3 -march=native -fopenmp

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileCPUFreq PerformanceCPUFreq Schedutil612182430SE +/- 0.19, N = 10SE +/- 0.20, N = 925.4125.75
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileCPUFreq PerformanceCPUFreq Schedutil612182430Min: 24.99 / Avg: 25.41 / Max: 27.14Min: 25.49 / Avg: 25.75 / Max: 27.38

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishCPUFreq PerformanceCPUFreq Schedutil16K32K48K64K80KSE +/- 507.10, N = 3SE +/- 73.45, N = 371757726361. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishCPUFreq PerformanceCPUFreq Schedutil13K26K39K52K65KMin: 70743 / Avg: 71756.67 / Max: 72292Min: 72547 / Avg: 72636.33 / Max: 727821. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil0.52491.04981.57472.09962.6245SE +/- 0.02560, N = 15SE +/- 0.01587, N = 32.304672.33290MIN: 1.88MIN: 21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil246810Min: 2.11 / Avg: 2.3 / Max: 2.43Min: 2.31 / Avg: 2.33 / Max: 2.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPCPUFreq PerformanceCPUFreq Schedutil12K24K36K48K60KSE +/- 365.12, N = 3SE +/- 598.50, N = 356094.4956769.451. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPCPUFreq PerformanceCPUFreq Schedutil10K20K30K40K50KMin: 55460.11 / Avg: 56094.49 / Max: 56724.89Min: 56062.76 / Avg: 56769.45 / Max: 57959.481. (CXX) g++ options: -O3 -march=native -fopenmp

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatCPUFreq PerformanceCPUFreq Schedutil9K18K27K36K45KSE +/- 473.02, N = 3SE +/- 395.37, N = 339981.939523.5
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatCPUFreq PerformanceCPUFreq Schedutil7K14K21K28K35KMin: 39408.4 / Avg: 39981.87 / Max: 40920.2Min: 38800.9 / Avg: 39523.47 / Max: 40162.9

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCCPUFreq PerformanceCPUFreq Schedutil246810SE +/- 0.09, N = 3SE +/- 0.10, N = 37.897.80MIN: 7.67 / MAX: 8.57MIN: 7.61 / MAX: 8.59
OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCCPUFreq PerformanceCPUFreq Schedutil3691215Min: 7.75 / Avg: 7.89 / Max: 8.06Min: 7.69 / Avg: 7.8 / Max: 8

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-QueensCPUFreq PerformanceCPUFreq Schedutil612182430SE +/- 0.07, N = 3SE +/- 0.22, N = 326.9026.60
OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-QueensCPUFreq PerformanceCPUFreq Schedutil612182430Min: 26.76 / Avg: 26.9 / Max: 26.99Min: 26.38 / Avg: 26.6 / Max: 27.04

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KSE +/- 71.91, N = 3SE +/- 19.43, N = 39651.619757.171. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KMin: 9570.08 / Avg: 9651.61 / Max: 9794.98Min: 9734.46 / Avg: 9757.17 / Max: 9795.841. (CC) gcc options: -O3

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeCPUFreq PerformanceCPUFreq Schedutil3691215SE +/- 0.01, N = 4SE +/- 0.05, N = 411.4311.311. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeCPUFreq PerformanceCPUFreq Schedutil3691215Min: 11.39 / Avg: 11.43 / Max: 11.45Min: 11.16 / Avg: 11.31 / Max: 11.421. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil1224364860SE +/- 0.00, N = 3SE +/- 0.58, N = 555.5654.97MIN: 33.33 / MAX: 58.82MIN: 31.25 / MAX: 58.82
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil1122334455Min: 55.56 / Avg: 55.56 / Max: 55.56Min: 52.63 / Avg: 54.97 / Max: 55.56

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2OCPUFreq PerformanceCPUFreq Schedutil714212835SE +/- 0.06, N = 3SE +/- 0.08, N = 328.9729.281. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2OCPUFreq PerformanceCPUFreq Schedutil612182430Min: 28.9 / Avg: 28.97 / Max: 29.09Min: 29.11 / Avg: 29.28 / Max: 29.371. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationCPUFreq PerformanceCPUFreq Schedutil1.02042.04083.06124.08165.102SE +/- 0.008, N = 8SE +/- 0.015, N = 84.4894.5351. (CXX) g++ options: -O3 -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationCPUFreq PerformanceCPUFreq Schedutil246810Min: 4.46 / Avg: 4.49 / Max: 4.53Min: 4.43 / Avg: 4.54 / Max: 4.571. (CXX) g++ options: -O3 -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4CPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KSE +/- 4685.69, N = 3SE +/- 1163.43, N = 3818887810750
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4CPUFreq PerformanceCPUFreq Schedutil140K280K420K560K700KMin: 809516 / Avg: 818887.33 / Max: 823600Min: 808813 / Avg: 810749.67 / Max: 812835

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeCPUFreq PerformanceCPUFreq Schedutil20M40M60M80M100MSE +/- 1344259.09, N = 3SE +/- 769788.53, N = 396082114970426011. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeCPUFreq PerformanceCPUFreq Schedutil20M40M60M80M100MMin: 93413248 / Avg: 96082114.33 / Max: 97697549Min: 96198105 / Avg: 97042601.33 / Max: 985796781. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticCPUFreq PerformanceCPUFreq Schedutil1.19522.39043.58564.78085.976SE +/- 0.03612, N = 13SE +/- 0.03676, N = 75.311835.262351. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticCPUFreq PerformanceCPUFreq Schedutil246810Min: 5.11 / Avg: 5.31 / Max: 5.57Min: 5.12 / Avg: 5.26 / Max: 5.351. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000CPUFreq PerformanceCPUFreq Schedutil1632486480SE +/- 0.21, N = 3SE +/- 0.13, N = 369.8870.541. (CC) gcc options: -O2 -ldl -lz -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000CPUFreq PerformanceCPUFreq Schedutil1428425670Min: 69.48 / Avg: 69.88 / Max: 70.18Min: 70.34 / Avg: 70.54 / Max: 70.771. (CC) gcc options: -O2 -ldl -lz -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterCPUFreq PerformanceCPUFreq Schedutil3691215SE +/- 0.04, N = 5SE +/- 0.03, N = 510.4310.341. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterCPUFreq PerformanceCPUFreq Schedutil3691215Min: 10.33 / Avg: 10.43 / Max: 10.53Min: 10.27 / Avg: 10.34 / Max: 10.451. (CXX) g++ options: -O2 -lOpenCL

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2CPUFreq PerformanceCPUFreq Schedutil8M16M24M32M40MSE +/- 498492.56, N = 3SE +/- 341040.65, N = 338656226.338319339.8
OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2CPUFreq PerformanceCPUFreq Schedutil7M14M21M28M35MMin: 37961548.8 / Avg: 38656226.3 / Max: 39622880.7Min: 37857878.7 / Avg: 38319339.83 / Max: 38985060.5

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupCPUFreq PerformanceCPUFreq Schedutil1122334455SE +/- 0.55, N = 3SE +/- 0.47, N = 347.447.81. (CC) gcc options: -fopenmp -O3 -lm
OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupCPUFreq PerformanceCPUFreq Schedutil1020304050Min: 46.4 / Avg: 47.37 / Max: 48.3Min: 47.2 / Avg: 47.77 / Max: 48.71. (CC) gcc options: -fopenmp -O3 -lm

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1CPUFreq PerformanceCPUFreq Schedutil0.08440.16880.25320.33760.422SE +/- 0.000, N = 3SE +/- 0.001, N = 30.3750.372
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1CPUFreq PerformanceCPUFreq Schedutil12345Min: 0.38 / Avg: 0.38 / Max: 0.38Min: 0.37 / Avg: 0.37 / Max: 0.37

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonCPUFreq PerformanceCPUFreq Schedutil10002000300040005000SE +/- 20.83, N = 6SE +/- 43.93, N = 647404778
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonCPUFreq PerformanceCPUFreq Schedutil8001600240032004000Min: 4645 / Avg: 4740.33 / Max: 4787Min: 4629 / Avg: 4777.83 / Max: 4946

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeCPUFreq PerformanceCPUFreq Schedutil48121620SE +/- 0.15, N = 3SE +/- 0.06, N = 318.2018.061. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeCPUFreq PerformanceCPUFreq Schedutil510152025Min: 17.91 / Avg: 18.2 / Max: 18.38Min: 17.94 / Avg: 18.06 / Max: 18.151. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUCPUFreq PerformanceCPUFreq Schedutil612182430SE +/- 0.24, N = 15SE +/- 0.30, N = 1525.2225.42
OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUCPUFreq PerformanceCPUFreq Schedutil612182430Min: 23.35 / Avg: 25.22 / Max: 26.66Min: 23.51 / Avg: 25.42 / Max: 27.36

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-OnlyCPUFreq PerformanceCPUFreq Schedutil306090120150SE +/- 0.95, N = 3SE +/- 0.13, N = 3158.07156.83
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-OnlyCPUFreq PerformanceCPUFreq Schedutil306090120150Min: 156.44 / Avg: 158.07 / Max: 159.73Min: 156.6 / Avg: 156.83 / Max: 157.05

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileCPUFreq PerformanceCPUFreq Schedutil1326395265SE +/- 0.27, N = 3SE +/- 0.17, N = 358.7559.18
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileCPUFreq PerformanceCPUFreq Schedutil1224364860Min: 58.39 / Avg: 58.75 / Max: 59.28Min: 58.84 / Avg: 59.18 / Max: 59.4

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineCPUFreq PerformanceCPUFreq Schedutil40K80K120K160K200KSE +/- 1616.93, N = 15SE +/- 1760.31, N = 15180037.95178738.121. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineCPUFreq PerformanceCPUFreq Schedutil30K60K90K120K150KMin: 168937.66 / Avg: 180037.95 / Max: 186561.98Min: 165715.79 / Avg: 178738.12 / Max: 190995.741. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedCPUFreq PerformanceCPUFreq Schedutil1122334455SE +/- 0.52, N = 5SE +/- 0.12, N = 349.2948.951. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedCPUFreq PerformanceCPUFreq Schedutil1020304050Min: 48.66 / Avg: 49.29 / Max: 51.35Min: 48.8 / Avg: 48.95 / Max: 49.191. (CC) gcc options: -O3

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUCPUFreq PerformanceCPUFreq Schedutil1.04852.0973.14554.1945.2425SE +/- 0.04, N = 3SE +/- 0.03, N = 34.664.63
OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUCPUFreq PerformanceCPUFreq Schedutil246810Min: 4.58 / Avg: 4.66 / Max: 4.71Min: 4.58 / Avg: 4.63 / Max: 4.67

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KSE +/- 96.09, N = 3SE +/- 31.24, N = 310555.110489.81. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KMin: 10374.4 / Avg: 10555.13 / Max: 10702.1Min: 10427.4 / Avg: 10489.83 / Max: 10523.11. (CC) gcc options: -O3

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticCPUFreq PerformanceCPUFreq Schedutil510152025SE +/- 0.26, N = 3SE +/- 0.21, N = 1521.4721.331. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticCPUFreq PerformanceCPUFreq Schedutil510152025Min: 21 / Avg: 21.47 / Max: 21.88Min: 20.56 / Avg: 21.33 / Max: 23.081. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil816243240SE +/- 0.24, N = 6SE +/- 0.23, N = 632.8032.62MIN: 13.16 / MAX: 34.48MIN: 12.82 / MAX: 33.33
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil714212835Min: 32.26 / Avg: 32.8 / Max: 33.33Min: 32.26 / Avg: 32.62 / Max: 33.33

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedCPUFreq PerformanceCPUFreq Schedutil1122334455SE +/- 0.24, N = 3SE +/- 0.15, N = 348.0247.761. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedCPUFreq PerformanceCPUFreq Schedutil1020304050Min: 47.55 / Avg: 48.02 / Max: 48.35Min: 47.47 / Avg: 47.76 / Max: 47.921. (CC) gcc options: -O3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDCPUFreq PerformanceCPUFreq Schedutil1224364860SE +/- 0.00, N = 3SE +/- 0.13, N = 352.3652.091. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDCPUFreq PerformanceCPUFreq Schedutil1020304050Min: 52.35 / Avg: 52.36 / Max: 52.37Min: 51.82 / Avg: 52.09 / Max: 52.231. (CXX) g++ options: -O2 -lOpenCL

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUCPUFreq PerformanceCPUFreq Schedutil9001800270036004500SE +/- 68.07, N = 12SE +/- 44.10, N = 3419042101. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUCPUFreq PerformanceCPUFreq Schedutil7001400210028003500Min: 3763 / Avg: 4189.54 / Max: 4500Min: 4127 / Avg: 4210.33 / Max: 42771. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkCPUFreq PerformanceCPUFreq Schedutil70140210280350SE +/- 0.25, N = 3SE +/- 0.23, N = 3324.48323.00
OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkCPUFreq PerformanceCPUFreq Schedutil60120180240300Min: 324.07 / Avg: 324.48 / Max: 324.94Min: 322.57 / Avg: 323 / Max: 323.37

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil0.19520.39040.58560.78080.976SE +/- 0.001697, N = 7SE +/- 0.001510, N = 70.8675450.863782MIN: 0.78MIN: 0.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil246810Min: 0.86 / Avg: 0.87 / Max: 0.88Min: 0.86 / Avg: 0.86 / Max: 0.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MCPUFreq PerformanceCPUFreq Schedutil306090120150SE +/- 0.09, N = 3SE +/- 0.07, N = 3128.81128.281. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MCPUFreq PerformanceCPUFreq Schedutil20406080100Min: 128.69 / Avg: 128.81 / Max: 128.99Min: 128.2 / Avg: 128.28 / Max: 128.421. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CCPUFreq PerformanceCPUFreq Schedutil8001600240032004000SE +/- 4.01, N = 10SE +/- 5.01, N = 103857.333841.481. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CCPUFreq PerformanceCPUFreq Schedutil7001400210028003500Min: 3838.02 / Avg: 3857.33 / Max: 3877.8Min: 3814.52 / Avg: 3841.48 / Max: 3867.31. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CCPUFreq PerformanceCPUFreq Schedutil30K60K90K120K150KSE +/- 121.33, N = 4SE +/- 509.59, N = 4153770.57154376.761. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CCPUFreq PerformanceCPUFreq Schedutil30K60K90K120K150KMin: 153434.98 / Avg: 153770.57 / Max: 153998.55Min: 153161.9 / Avg: 154376.76 / Max: 155556.231. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: EigenCPUFreq PerformanceCPUFreq Schedutil10002000300040005000SE +/- 26.71, N = 3SE +/- 36.23, N = 3445044331. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: EigenCPUFreq PerformanceCPUFreq Schedutil8001600240032004000Min: 4409 / Avg: 4449.67 / Max: 4500Min: 4385 / Avg: 4433 / Max: 45041. (CXX) g++ options: -flto -pthread

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileCPUFreq PerformanceCPUFreq Schedutil50100150200250SE +/- 2.78, N = 3SE +/- 0.79, N = 3208.04208.79
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileCPUFreq PerformanceCPUFreq Schedutil4080120160200Min: 203.35 / Avg: 208.04 / Max: 212.98Min: 207.68 / Avg: 208.79 / Max: 210.33

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantCPUFreq PerformanceCPUFreq Schedutil9K18K27K36K45KSE +/- 211.03, N = 3SE +/- 400.94, N = 641180.141034.0
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantCPUFreq PerformanceCPUFreq Schedutil7K14K21K28K35KMin: 40933.4 / Avg: 41180.07 / Max: 41600Min: 40070.1 / Avg: 41033.95 / Max: 42901.4

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ThoroughCPUFreq PerformanceCPUFreq Schedutil1.27582.55163.82745.10326.379SE +/- 0.01, N = 5SE +/- 0.00, N = 55.675.651. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ThoroughCPUFreq PerformanceCPUFreq Schedutil246810Min: 5.65 / Avg: 5.67 / Max: 5.68Min: 5.64 / Avg: 5.65 / Max: 5.661. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pCPUFreq PerformanceCPUFreq Schedutil1530456075SE +/- 0.65, N = 15SE +/- 1.05, N = 1568.4768.231. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pCPUFreq PerformanceCPUFreq Schedutil1326395265Min: 59.38 / Avg: 68.46 / Max: 69.67Min: 53.64 / Avg: 68.23 / Max: 70.241. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1CPUFreq PerformanceCPUFreq Schedutil714212835SE +/- 0.01, N = 3SE +/- 0.13, N = 330.9330.831. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1CPUFreq PerformanceCPUFreq Schedutil714212835Min: 30.91 / Avg: 30.93 / Max: 30.94Min: 30.57 / Avg: 30.83 / Max: 30.951. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2CPUFreq PerformanceCPUFreq Schedutil300M600M900M1200M1500MSE +/- 4018322.42, N = 3SE +/- 750486.58, N = 3144417466714487183331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2CPUFreq PerformanceCPUFreq Schedutil300M600M900M1200M1500MMin: 1436204000 / Avg: 1444174666.67 / Max: 1449050000Min: 1447228000 / Avg: 1448718333.33 / Max: 14496180001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DCPUFreq PerformanceCPUFreq Schedutil20406080100SE +/- 0.55, N = 3SE +/- 0.59, N = 396.8996.601. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DCPUFreq PerformanceCPUFreq Schedutil20406080100Min: 96.34 / Avg: 96.89 / Max: 97.98Min: 95.76 / Avg: 96.6 / Max: 97.731. (CXX) g++ options: -O2 -lOpenCL

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1CPUFreq PerformanceCPUFreq Schedutil60120180240300SE +/- 0.08, N = 3SE +/- 0.68, N = 3274.10274.87MIN: 273.16 / MAX: 274.86MIN: 273.07 / MAX: 276.741. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1CPUFreq PerformanceCPUFreq Schedutil50100150200250Min: 273.95 / Avg: 274.1 / Max: 274.2Min: 274.14 / Avg: 274.87 / Max: 276.241. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25xCPUFreq PerformanceCPUFreq Schedutil30060090012001500SE +/- 11.04, N = 12SE +/- 17.73, N = 151537.521541.761. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25xCPUFreq PerformanceCPUFreq Schedutil30060090012001500Min: 1510.17 / Avg: 1537.52 / Max: 1644.91Min: 1487.43 / Avg: 1541.76 / Max: 1780.851. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUCPUFreq PerformanceCPUFreq Schedutil510152025SE +/- 0.20, N = 15SE +/- 0.16, N = 1522.4322.49
OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUCPUFreq PerformanceCPUFreq Schedutil510152025Min: 20.74 / Avg: 22.43 / Max: 23.39Min: 21.2 / Avg: 22.49 / Max: 23.35

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisCPUFreq PerformanceCPUFreq Schedutil20406080100SE +/- 0.29, N = 3SE +/- 0.29, N = 382.2682.041. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisCPUFreq PerformanceCPUFreq Schedutil1632486480Min: 81.75 / Avg: 82.26 / Max: 82.76Min: 81.48 / Avg: 82.04 / Max: 82.431. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ExhaustiveCPUFreq PerformanceCPUFreq Schedutil918273645SE +/- 0.04, N = 3SE +/- 0.03, N = 341.0740.971. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ExhaustiveCPUFreq PerformanceCPUFreq Schedutil918273645Min: 40.98 / Avg: 41.07 / Max: 41.11Min: 40.91 / Avg: 40.97 / Max: 41.031. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2CPUFreq PerformanceCPUFreq Schedutil160K320K480K640K800KSE +/- 2132.19, N = 3SE +/- 5824.36, N = 9737993736285
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2CPUFreq PerformanceCPUFreq Schedutil130K260K390K520K650KMin: 733830 / Avg: 737993 / Max: 740875Min: 718390 / Avg: 736285 / Max: 779525

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil0.97431.94862.92293.89724.8715SE +/- 0.00, N = 3SE +/- 0.01, N = 34.334.32MIN: 3.65 / MAX: 4.37MIN: 3.76 / MAX: 4.37
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil246810Min: 4.33 / Avg: 4.33 / Max: 4.33Min: 4.31 / Avg: 4.32 / Max: 4.33

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthCPUFreq PerformanceCPUFreq Schedutil30M60M90M120M150MSE +/- 1411556.16, N = 4SE +/- 358515.89, N = 3117370871117632955
OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthCPUFreq PerformanceCPUFreq Schedutil20M40M60M80M100MMin: 114810239 / Avg: 117370871.25 / Max: 121062060Min: 117005181 / Avg: 117632954.67 / Max: 118246885

YafaRay

YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneCPUFreq PerformanceCPUFreq Schedutil20406080100SE +/- 0.69, N = 15SE +/- 0.85, N = 1587.3387.141. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneCPUFreq PerformanceCPUFreq Schedutil20406080100Min: 83.68 / Avg: 87.33 / Max: 90.79Min: 82.11 / Avg: 87.14 / Max: 91.421. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialCPUFreq PerformanceCPUFreq Schedutil714212835SE +/- 0.17, N = 6SE +/- 0.04, N = 628.3328.39
OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialCPUFreq PerformanceCPUFreq Schedutil612182430Min: 27.68 / Avg: 28.33 / Max: 28.87Min: 28.22 / Avg: 28.39 / Max: 28.47

GnuPG

This test times how long it takes to encrypt a sample file using GnuPG. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File EncryptionCPUFreq PerformanceCPUFreq Schedutil20406080100SE +/- 0.45, N = 3SE +/- 0.20, N = 377.0277.181. (CC) gcc options: -O2
OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File EncryptionCPUFreq PerformanceCPUFreq Schedutil1530456075Min: 76.4 / Avg: 77.02 / Max: 77.89Min: 76.84 / Avg: 77.18 / Max: 77.541. (CC) gcc options: -O2

Swet

Swet is a synthetic CPU/RAM benchmark, includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageCPUFreq PerformanceCPUFreq Schedutil150M300M450M600M750MSE +/- 3454923.88, N = 3SE +/- 1732499.41, N = 36861321096874802621. (CC) gcc options: -lm -lpthread -lcurses -lrt
OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageCPUFreq PerformanceCPUFreq Schedutil120M240M360M480M600MMin: 682276149 / Avg: 686132109 / Max: 693025789Min: 684449025 / Avg: 687480261.67 / Max: 6904496221. (CC) gcc options: -lm -lpthread -lcurses -lrt

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1CPUFreq PerformanceCPUFreq Schedutil60120180240300SE +/- 0.25, N = 3SE +/- 0.23, N = 3267.14267.591. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1CPUFreq PerformanceCPUFreq Schedutil50100150200250Min: 266.88 / Avg: 267.14 / Max: 267.63Min: 267.12 / Avg: 267.59 / Max: 267.851. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 20.1Input: Carbon NanotubeCPUFreq PerformanceCPUFreq Schedutil1326395265SE +/- 0.31, N = 3SE +/- 0.39, N = 359.7659.851. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi
OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 20.1Input: Carbon NanotubeCPUFreq PerformanceCPUFreq Schedutil1224364860Min: 59.37 / Avg: 59.76 / Max: 60.38Min: 59.07 / Avg: 59.85 / Max: 60.311. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsCPUFreq PerformanceCPUFreq Schedutil612182430SE +/- 0.04, N = 3SE +/- 0.05, N = 325.1225.081. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsCPUFreq PerformanceCPUFreq Schedutil612182430Min: 25.05 / Avg: 25.12 / Max: 25.15Min: 25.01 / Avg: 25.08 / Max: 25.161. (CXX) g++ options: -O3 -pthread -lm

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MPI - GriddingCPUFreq PerformanceCPUFreq Schedutil16003200480064008000SE +/- 7.65, N = 3SE +/- 2.90, N = 37441.957453.511. (CXX) g++ options: -lpthread
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MPI - GriddingCPUFreq PerformanceCPUFreq Schedutil13002600390052006500Min: 7430.4 / Avg: 7441.95 / Max: 7456.41Min: 7447.72 / Avg: 7453.51 / Max: 7456.411. (CXX) g++ options: -lpthread

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceCPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KSE +/- 517.61, N = 12SE +/- 609.97, N = 12111625511145621. (CC) gcc options: -O3 -march=native
OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceCPUFreq PerformanceCPUFreq Schedutil200K400K600K800K1000KMin: 1112682 / Avg: 1116255.42 / Max: 1119467Min: 1110439 / Avg: 1114562.33 / Max: 11171961. (CC) gcc options: -O3 -march=native

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: GarlicoinCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KSE +/- 155.17, N = 14SE +/- 99.38, N = 159964.359949.881. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: GarlicoinCPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KMin: 9775.82 / Avg: 9964.35 / Max: 11980Min: 9761.89 / Avg: 9949.88 / Max: 113101. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricCPUFreq PerformanceCPUFreq Schedutil140K280K420K560K700K6357256365211. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATCPUFreq PerformanceCPUFreq Schedutil70M140M210M280M350MSE +/- 201175.83, N = 3SE +/- 122621.60, N = 3322775283.80323144417.171. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATCPUFreq PerformanceCPUFreq Schedutil60M120M180M240M300MMin: 322378619.67 / Avg: 322775283.8 / Max: 323031996.68Min: 322905080.69 / Avg: 323144417.17 / Max: 323310417.961. (CC) gcc options: -O3 -march=native -lm

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkCPUFreq PerformanceCPUFreq Schedutil1.18372.36743.55114.73485.9185SE +/- 0.021, N = 3SE +/- 0.022, N = 35.2555.2611. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkCPUFreq PerformanceCPUFreq Schedutil246810Min: 5.22 / Avg: 5.26 / Max: 5.29Min: 5.22 / Avg: 5.26 / Max: 5.291. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100CPUFreq PerformanceCPUFreq Schedutil150300450600750SE +/- 1.18, N = 3SE +/- 0.04, N = 3714.18714.911. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100CPUFreq PerformanceCPUFreq Schedutil130260390520650Min: 711.87 / Avg: 714.18 / Max: 715.7Min: 714.87 / Avg: 714.91 / Max: 7151. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DCPUFreq PerformanceCPUFreq Schedutil8001600240032004000SE +/- 2.74, N = 3SE +/- 2.97, N = 33867.303863.451. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DCPUFreq PerformanceCPUFreq Schedutil7001400210028003500Min: 3861.81 / Avg: 3867.3 / Max: 3870.16Min: 3858.03 / Avg: 3863.45 / Max: 3868.271. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9CPUFreq PerformanceCPUFreq Schedutil50100150200250SE +/- 0.62, N = 3SE +/- 0.81, N = 3232.76232.541. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9CPUFreq PerformanceCPUFreq Schedutil4080120160200Min: 232.1 / Avg: 232.76 / Max: 233.99Min: 231.18 / Avg: 232.54 / Max: 233.991. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsCPUFreq PerformanceCPUFreq Schedutil510152025SE +/- 0.03, N = 3SE +/- 0.03, N = 318.6418.65
OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsCPUFreq PerformanceCPUFreq Schedutil510152025Min: 18.59 / Avg: 18.64 / Max: 18.7Min: 18.62 / Avg: 18.65 / Max: 18.71

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairCPUFreq PerformanceCPUFreq Schedutil246810SE +/- 0.06150, N = 6SE +/- 0.01480, N = 66.695056.690361. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairCPUFreq PerformanceCPUFreq Schedutil3691215Min: 6.49 / Avg: 6.7 / Max: 6.88Min: 6.63 / Avg: 6.69 / Max: 6.731. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringCPUFreq PerformanceCPUFreq Schedutil50100150200250SE +/- 0.10, N = 3SE +/- 0.09, N = 3245.45245.601. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringCPUFreq PerformanceCPUFreq Schedutil4080120160200Min: 245.35 / Avg: 245.45 / Max: 245.65Min: 245.48 / Avg: 245.6 / Max: 245.781. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MCPUFreq PerformanceCPUFreq Schedutil510152025SE +/- 0.10, N = 3SE +/- 0.08, N = 318.2918.301. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MCPUFreq PerformanceCPUFreq Schedutil510152025Min: 18.16 / Avg: 18.29 / Max: 18.48Min: 18.15 / Avg: 18.3 / Max: 18.421. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDCPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KSE +/- 17675.25, N = 15SE +/- 15585.71, N = 41610484.501611164.341. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDCPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KMin: 1516070.25 / Avg: 1610484.5 / Max: 1736747.5Min: 1584073.5 / Avg: 1611164.34 / Max: 1649898.121. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21CPUFreq PerformanceCPUFreq Schedutil5001000150020002500SE +/- 16.02, N = 3SE +/- 8.27, N = 32156.32157.21. (CXX) g++ options: -O3 -march=native -rdynamic
OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21CPUFreq PerformanceCPUFreq Schedutil400800120016002000Min: 2124.7 / Avg: 2156.3 / Max: 2176.7Min: 2141.7 / Avg: 2157.17 / Max: 21701. (CXX) g++ options: -O3 -march=native -rdynamic

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileCPUFreq PerformanceCPUFreq Schedutil1530456075SE +/- 0.32, N = 3SE +/- 0.52, N = 367.3367.32
OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileCPUFreq PerformanceCPUFreq Schedutil1326395265Min: 66.7 / Avg: 67.33 / Max: 67.74Min: 66.38 / Avg: 67.32 / Max: 68.17

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPUCPUFreq PerformanceCPUFreq Schedutil80160240320400SE +/- 3.10, N = 9SE +/- 2.97, N = 9348.34348.291. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPUCPUFreq PerformanceCPUFreq Schedutil60120180240300Min: 344.31 / Avg: 348.34 / Max: 373.09Min: 344.39 / Avg: 348.29 / Max: 371.971. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2CPUFreq PerformanceCPUFreq Schedutil306090120150SE +/- 0.03, N = 3SE +/- 0.03, N = 3155.78155.801. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2CPUFreq PerformanceCPUFreq Schedutil306090120150Min: 155.75 / Avg: 155.78 / Max: 155.83Min: 155.75 / Avg: 155.8 / Max: 155.831. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsCPUFreq PerformanceCPUFreq Schedutil0.10010.20020.30030.40040.5005SE +/- 0.00075, N = 3SE +/- 0.00005, N = 30.444690.44472
OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsCPUFreq PerformanceCPUFreq Schedutil12345Min: 0.44 / Avg: 0.44 / Max: 0.45Min: 0.44 / Avg: 0.44 / Max: 0.44

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pCPUFreq PerformanceCPUFreq Schedutil0.02050.0410.06150.0820.1025SE +/- 0.000, N = 3SE +/- 0.001, N = 120.0910.0911. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pCPUFreq PerformanceCPUFreq Schedutil12345Min: 0.09 / Avg: 0.09 / Max: 0.09Min: 0.09 / Avg: 0.09 / Max: 0.091. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserIDCPUFreq PerformanceCPUFreq Schedutil0.14630.29260.43890.58520.7315SE +/- 0.00, N = 3SE +/- 0.00, N = 30.650.651. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserIDCPUFreq PerformanceCPUFreq Schedutil246810Min: 0.64 / Avg: 0.65 / Max: 0.65Min: 0.65 / Avg: 0.65 / Max: 0.651. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: KostyaCPUFreq PerformanceCPUFreq Schedutil0.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 30.570.571. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: KostyaCPUFreq PerformanceCPUFreq Schedutil246810Min: 0.56 / Avg: 0.57 / Max: 0.57Min: 0.57 / Avg: 0.57 / Max: 0.571. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweetsCPUFreq PerformanceCPUFreq Schedutil0.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 30.630.631. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweetsCPUFreq PerformanceCPUFreq Schedutil246810Min: 0.63 / Avg: 0.63 / Max: 0.63Min: 0.63 / Avg: 0.63 / Max: 0.631. (CXX) g++ options: -O3 -pthread

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismCPUFreq PerformanceCPUFreq Schedutil246810SE +/- 0.04, N = 3SE +/- 0.09, N = 38.768.76MIN: 8.22 / MAX: 8.86MIN: 8.31 / MAX: 8.97
OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismCPUFreq PerformanceCPUFreq Schedutil3691215Min: 8.68 / Avg: 8.76 / Max: 8.83Min: 8.59 / Avg: 8.76 / Max: 8.91

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil48121620SE +/- 0.00, N = 4SE +/- 0.00, N = 416.3916.39MIN: 10.31 / MAX: 16.95MIN: 10.99 / MAX: 16.95
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil48121620Min: 16.39 / Avg: 16.39 / Max: 16.39Min: 16.39 / Avg: 16.39 / Max: 16.39

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil1632486480SE +/- 0.00, N = 7SE +/- 0.00, N = 771.4371.43MIN: 19.61 / MAX: 76.92MIN: 19.61 / MAX: 76.92
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil1428425670Min: 71.43 / Avg: 71.43 / Max: 71.43Min: 71.43 / Avg: 71.43 / Max: 71.43

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil1.33882.67764.01645.35526.694SE +/- 0.00, N = 3SE +/- 0.00, N = 35.955.95MIN: 5.46 / MAX: 6.02MIN: 5.35 / MAX: 6.02
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerCPUFreq PerformanceCPUFreq Schedutil246810Min: 5.95 / Avg: 5.95 / Max: 5.95Min: 5.95 / Avg: 5.95 / Max: 5.95

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil3691215SE +/- 0.04, N = 3SE +/- 0.04, N = 311.1911.19MIN: 7.87 / MAX: 11.36MIN: 8.2 / MAX: 11.36
OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVisCPUFreq PerformanceCPUFreq Schedutil3691215Min: 11.11 / Avg: 11.19 / Max: 11.24Min: 11.11 / Avg: 11.19 / Max: 11.24

CPU Power Consumption Monitor

OpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringCPUFreq PerformanceCPUFreq Schedutil90180270360450Min: 59.74 / Avg: 285.23 / Max: 514.05Min: 59.88 / Avg: 280.92 / Max: 530.69

Chaos Group V-RAY

MinAvgMaxCPUFreq Performance120.8421.1492.1CPUFreq Schedutil120.3419.0492.2OpenBenchmarking.orgWatts, Fewer Is BetterChaos Group V-RAY 4.10.07CPU Power Consumption Monitor130260390520650

OpenBenchmarking.orgKsamples Per Watt, More Is BetterChaos Group V-RAY 4.10.07Mode: CPUCPUFreq PerformanceCPUFreq Schedutil306090120150130.65127.59

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.07Mode: CPUCPUFreq PerformanceCPUFreq Schedutil12K24K36K48K60KSE +/- 503.58, N = 3SE +/- 1018.74, N = 135501453460
OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.07Mode: CPUCPUFreq PerformanceCPUFreq Schedutil10K20K30K40K50KMin: 54316 / Avg: 55014.33 / Max: 55992Min: 42072 / Avg: 53460.15 / Max: 55444

TensorFlow Lite

MinAvgMaxCPUFreq Performance121.8401.7439.7CPUFreq Schedutil121.5403.7446.4OpenBenchmarking.orgWatts, Fewer Is BetterTensorFlow Lite 2020-08-23CPU Power Consumption Monitor120240360480600

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileCPUFreq PerformanceCPUFreq Schedutil30K60K90K120K150KSE +/- 2146.20, N = 15SE +/- 2393.85, N = 15132844134044
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileCPUFreq PerformanceCPUFreq Schedutil20K40K60K80K100KMin: 119861 / Avg: 132844.47 / Max: 152016Min: 106838 / Avg: 134043.8 / Max: 146394

SVT-VP9

MinAvgMaxCPUFreq Performance119.5184.0372.3CPUFreq Schedutil119.4185.8379.1OpenBenchmarking.orgWatts, Fewer Is BetterSVT-VP9 0.1CPU Power Consumption Monitor100200300400500

OpenBenchmarking.orgFrames Per Second Per Watt, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil0.39150.7831.17451.5661.95751.681.74

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil70140210280350SE +/- 15.29, N = 15SE +/- 4.21, N = 15309.61323.811. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pCPUFreq PerformanceCPUFreq Schedutil60120180240300Min: 95.95 / Avg: 309.61 / Max: 331.67Min: 266.08 / Avg: 323.81 / Max: 334.261. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Rodinia

MinAvgMaxCPUFreq Performance120.9253.1370.6CPUFreq Schedutil121.0253.4373.0OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 3.1CPU Power Consumption Monitor100200300400500

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverCPUFreq PerformanceCPUFreq Schedutil246810SE +/- 0.053, N = 5SE +/- 0.141, N = 158.5108.8821. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverCPUFreq PerformanceCPUFreq Schedutil3691215Min: 8.4 / Avg: 8.51 / Max: 8.65Min: 8.4 / Avg: 8.88 / Max: 10.281. (CXX) g++ options: -O2 -lOpenCL

Nebular Empirical Analysis Tool

MinAvgMaxCPUFreq Performance120.5178.5280.4CPUFreq Schedutil120.1179.4280.2OpenBenchmarking.orgWatts, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29CPU Power Consumption Monitor70140210280350

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29CPUFreq PerformanceCPUFreq Schedutil612182430SE +/- 0.62, N = 14SE +/- 0.56, N = 1225.0424.631. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace
OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29CPUFreq PerformanceCPUFreq Schedutil612182430Min: 23.13 / Avg: 25.04 / Max: 32.57Min: 22.43 / Avg: 24.63 / Max: 29.21. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace

oneDNN

OpenBenchmarking.orgWatts, Fewer Is BetteroneDNN 2.0CPU Power Consumption MonitorCPUFreq PerformanceCPUFreq Schedutil90180270360450Min: 120.77 / Avg: 339.16 / Max: 492.27Min: 120.62 / Avg: 341.54 / Max: 492.22

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil30060090012001500SE +/- 35.45, N = 15SE +/- 3.00, N = 31247.481123.32MIN: 1069.01MIN: 1077.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUCPUFreq PerformanceCPUFreq Schedutil2004006008001000Min: 1102.05 / Avg: 1247.48 / Max: 1519.52Min: 1118.58 / Avg: 1123.32 / Max: 1128.891. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

KeyDB

MinAvgMaxCPUFreq Performance60.8172.1190.1CPUFreq Schedutil64.4157.7178.6OpenBenchmarking.orgWatts, Fewer Is BetterKeyDB 6.0.16CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgOps/sec Per Watt, More Is BetterKeyDB 6.0.16CPUFreq PerformanceCPUFreq Schedutil4008001200160020001761.511865.16

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16CPUFreq PerformanceCPUFreq Schedutil60K120K180K240K300KSE +/- 5131.54, N = 15SE +/- 3012.50, N = 15303171.33294214.371. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16CPUFreq PerformanceCPUFreq Schedutil50K100K150K200K250KMin: 272657.95 / Avg: 303171.33 / Max: 333275.99Min: 273233.06 / Avg: 294214.37 / Max: 314258.831. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

DaCapo Benchmark

MinAvgMaxCPUFreq Performance119.8139.4180.3CPUFreq Schedutil60.6134.4163.7OpenBenchmarking.orgWatts, Fewer Is BetterDaCapo Benchmark 9.12-MR1CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2CPUFreq PerformanceCPUFreq Schedutil11002200330044005500SE +/- 70.06, N = 20SE +/- 73.65, N = 2045705217
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2CPUFreq PerformanceCPUFreq Schedutil9001800270036004500Min: 4124 / Avg: 4570 / Max: 5359Min: 4742 / Avg: 5217.25 / Max: 5913

Zstd Compression

MinAvgMaxCPUFreq Performance120.0153.7222.8CPUFreq Schedutil120.7150.7214.6OpenBenchmarking.orgWatts, Fewer Is BetterZstd Compression 1.4.5CPU Power Consumption Monitor60120180240300

OpenBenchmarking.orgMB/s Per Watt, More Is BetterZstd Compression 1.4.5Compression Level: 3CPUFreq PerformanceCPUFreq Schedutil122436486050.5654.88

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3CPUFreq PerformanceCPUFreq Schedutil2K4K6K8K10KSE +/- 648.67, N = 12SE +/- 69.07, N = 37770.38270.51. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3CPUFreq PerformanceCPUFreq Schedutil14002800420056007000Min: 647.9 / Avg: 7770.25 / Max: 8631.9Min: 8143.5 / Avg: 8270.47 / Max: 8381.11. (CC) gcc options: -O3 -pthread -lz -llzma

Redis

MinAvgMaxCPUFreq Performance119.8124.0135.9CPUFreq Schedutil119.7123.6133.0OpenBenchmarking.orgWatts, Fewer Is BetterRedis 6.0.9CPU Power Consumption Monitor4080120160200

OpenBenchmarking.orgRequests Per Second Per Watt, More Is BetterRedis 6.0.9Test: GETCPUFreq PerformanceCPUFreq Schedutil3K6K9K12K15K14375.8613846.04

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETCPUFreq PerformanceCPUFreq Schedutil400K800K1200K1600K2000KSE +/- 32510.57, N = 12SE +/- 12716.89, N = 111782755.971711621.521. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETCPUFreq PerformanceCPUFreq Schedutil300K600K900K1200K1500KMin: 1676727 / Avg: 1782755.97 / Max: 2107939.25Min: 1649625.88 / Avg: 1711621.52 / Max: 1804728.381. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Cpuminer-Opt

MinAvgMaxCPUFreq Performance72.0144.2193.5CPUFreq Schedutil120.0125.6144.4OpenBenchmarking.orgWatts, Fewer Is BetterCpuminer-Opt 3.15.5CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgkH/s Per Watt, More Is BetterCpuminer-Opt 3.15.5Algorithm: SkeincoinCPUFreq PerformanceCPUFreq Schedutil80016002400320040003625.652897.78

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: SkeincoinCPUFreq PerformanceCPUFreq Schedutil110K220K330K440K550KSE +/- 9614.65, N = 12SE +/- 5604.68, N = 125229483640171. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: SkeincoinCPUFreq PerformanceCPUFreq Schedutil90K180K270K360K450KMin: 419180 / Avg: 522948.33 / Max: 540350Min: 303220 / Avg: 364016.67 / Max: 3775401. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

MinAvgMaxCPUFreq Performance120.9145.0174.3CPUFreq Schedutil120.1125.9138.6OpenBenchmarking.orgWatts, Fewer Is BetterCpuminer-Opt 3.15.5CPU Power Consumption Monitor50100150200250

OpenBenchmarking.orgkH/s Per Watt, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, PyriteCPUFreq PerformanceCPUFreq Schedutil60012001800240030002898.852358.08

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, PyriteCPUFreq PerformanceCPUFreq Schedutil90K180K270K360K450KSE +/- 19086.02, N = 12SE +/- 5314.19, N = 124202312969951. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, PyriteCPUFreq PerformanceCPUFreq Schedutil70K140K210K280K350KMin: 210390 / Avg: 420230.83 / Max: 442270Min: 238900 / Avg: 296995 / Max: 3062901. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

167 Results Shown

Cpuminer-Opt
dav1d
OSPray
x265
DaCapo Benchmark
InfluxDB
dav1d
DaCapo Benchmark
Timed GDB GNU Debugger Compilation
IOR
InfluxDB
Quantum ESPRESSO
AI Benchmark Alpha
SVT-VP9
x265
oneDNN
AI Benchmark Alpha
rav1e
ASKAP
Himeno Benchmark
IOR
LAMMPS Molecular Dynamics Simulator
John The Ripper
AI Benchmark Alpha
simdjson
Redis
Rodinia
rav1e
TNN
rav1e
LZ4 Compression
SVT-VP9
LULESH
ONNX Runtime
SVT-AV1
LeelaChessZero
Redis
FFTW
oneDNN
TTSIOD 3D Renderer
BlogBench
oneDNN
LZ4 Compression
TensorFlow Lite
FinanceBench
Timed Linux Kernel Compilation
John The Ripper
oneDNN
FinanceBench
TensorFlow Lite
LuxCoreRender
Cython Benchmark
LZ4 Compression
POV-Ray
OSPray
QMCPACK
Primesieve
TensorFlow Lite
Stockfish
Tungsten Renderer
SQLite Speedtest
Rodinia
BYTE Unix Benchmark
CLOMP
rav1e
DaCapo Benchmark
Tachyon
PlaidML
Blender
Timed Godot Game Engine Compilation
FFTE
LZ4 Compression
PlaidML
LZ4 Compression
Tungsten Renderer
OSPray
LZ4 Compression
Rodinia
ONNX Runtime
Numpy Benchmark
oneDNN
OpenFOAM
NAS Parallel Benchmarks:
  EP.C
  LU.C
LeelaChessZero
Timed LLVM Compilation
TensorFlow Lite
ASTC Encoder
SVT-AV1
High Performance Conjugate Gradient
Algebraic Multi-Grid Benchmark
Rodinia
TNN
Cpuminer-Opt
PlaidML
Timed MrBayes Analysis
ASTC Encoder
TensorFlow Lite
OSPray
asmFish
YafaRay
Intel Open Image Denoise
GnuPG
Swet
Etcpak
GPAW
LAMMPS Molecular Dynamics Simulator
ASKAP
TSCP
Cpuminer-Opt
BRL-CAD
Hierarchical INTegration
GROMACS
Google SynthMark
NAS Parallel Benchmarks
Gcrypt Library
Dolfyn
Tungsten Renderer
Etcpak
OpenFOAM
Redis
QuantLib
Build2
RELION
Etcpak
NAMD
SVT-AV1
simdjson:
  DistinctUserID
  Kostya
  PartialTweets
LuxCoreRender
OSPray:
  NASA Streamlines - Path Tracer
  NASA Streamlines - SciVis
  XFrog Forest - Path Tracer
  XFrog Forest - SciVis
CPU Power Consumption Monitor:
  Phoronix Test Suite System Monitoring
  CPU Power Consumption Monitor
  CPU
Chaos Group V-RAY
TensorFlow Lite
TensorFlow Lite
SVT-VP9:
  CPU Power Consumption Monitor
  Visual Quality Optimized - Bosphorus 1080p
SVT-VP9
Rodinia
Rodinia
Nebular Empirical Analysis Tool
Nebular Empirical Analysis Tool
oneDNN
oneDNN
KeyDB:
  CPU Power Consumption Monitor
 
KeyDB
DaCapo Benchmark
DaCapo Benchmark
Zstd Compression:
  CPU Power Consumption Monitor
  3
Zstd Compression
Redis:
  CPU Power Consumption Monitor
  GET
Redis
Cpuminer-Opt:
  CPU Power Consumption Monitor
  Skeincoin
Cpuminer-Opt
Cpuminer-Opt:
  CPU Power Consumption Monitor
  Quad SHA-256, Pyrite
Cpuminer-Opt