AMD EPYC 7601 2P

Tests for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012011-HA-AMDEPYC7693
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Bioinformatics 2 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
Chess Test Suite 4 Tests
C/C++ Compiler Tests 11 Tests
CPU Massive 16 Tests
Creator Workloads 13 Tests
Database Test Suite 5 Tests
Encoding 3 Tests
Fortran Tests 3 Tests
Game Development 2 Tests
HPC - High Performance Computing 16 Tests
Imaging 2 Tests
Common Kernel Benchmarks 3 Tests
Machine Learning 10 Tests
Molecular Dynamics 3 Tests
MPI Benchmarks 2 Tests
Multi-Core 11 Tests
NVIDIA GPU Compute 5 Tests
Intel oneAPI 2 Tests
OpenMPI Tests 2 Tests
Python 2 Tests
Scientific Computing 6 Tests
Server 6 Tests
Server CPU Tests 6 Tests
Single-Threaded 6 Tests
Speech 2 Tests
Telephony 2 Tests
Texture Compression 2 Tests
Video Encoding 3 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
November 30 2020
  22 Hours, 51 Minutes
2
December 01 2020
  4 Hours, 13 Minutes
Invert Hiding All Results Option
  13 Hours, 32 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 7601 2PProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution12AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HR (V1.02.B10 BIOS)AMD 17h126GB280GB INTEL SSDPE21D280GAllvmpipeVE2282 x Broadcom NetXtreme BCM5720 2-port PCIeUbuntu 20.045.4.0-47-generic (x86_64)GNOME Shell 3.36.3X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.8 (LLVM 10.0.0 128 bits)GCC 9.3.0ext41920x1080GNOME Shell 3.36.4OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8001250Java Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.04)Python Details- Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

1 vs. 2 ComparisonPhoronix Test SuiteBaseline+6.2%+6.2%+12.4%+12.4%+18.6%+18.6%24.7%21.3%7.5%5.8%5.8%2.6%BLASEigenN.2.3.C.F.RRhodopsin Protein20k Atoms1 - Compression Speed4.2%Throughput4%Rand3.1%Latency Under LoadMemory Allocations2.1%LeelaChessZeroLeelaChessZeroFFTELAMMPS Molecular Dynamics SimulatorLAMMPS Molecular Dynamics SimulatorLZ4 CompressionSockperfLeelaChessZeroSockperfOSBench12

AMD EPYC 7601 2Pcaffe: GoogleNet - CPU - 1000lammps: 20k Atomscaffe: AlexNet - CPU - 1000opencv: Features 2Dbasis: UASTC Level 2 + RDO Post-Processinglczero: Eigenbuild-clash: Time To Compilecaffe: GoogleNet - CPU - 200ncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: CPU - squeezenetai-benchmark: Device AI Scoreai-benchmark: Device Training Scoreai-benchmark: Device Inference Scoremlpack: scikit_qdalczero: BLASnumpy: leveldb: Seq Fillleveldb: Seq Filllczero: Randleveldb: Rand Deletecaffe: GoogleNet - CPU - 100brl-cad: VGR Performance Metriccaffe: AlexNet - CPU - 200hmmer: Pfam Database Searchcouchdb: 100 - 1000 - 24asmfish: 1024 Hash Memory, 26 Depthhint: FLOATopencv: Object Detectioncompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedbyte: Dhrystone 2caffe: AlexNet - CPU - 100leveldb: Seek Randastcenc: Exhaustivegromacs: Water Benchmarkmlpack: scikit_icacompress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedrav1e: 1rav1e: 5openvino: Person Detection 0106 FP16 - CPUopenvino: Person Detection 0106 FP16 - CPUopenvino: Person Detection 0106 FP32 - CPUopenvino: Person Detection 0106 FP32 - CPUmlpack: scikit_linearridgeregressionkvazaar: Bosphorus 4K - Slowopenvino: Face Detection 0106 FP16 - CPUopenvino: Face Detection 0106 FP16 - CPUopenvino: Face Detection 0106 FP32 - CPUopenvino: Face Detection 0106 FP32 - CPUkvazaar: Bosphorus 4K - Mediumkeydb: leveldb: Rand Fillleveldb: Rand Fillleveldb: Overwriteleveldb: Overwritesockperf: Latency Ping Pongleveldb: Rand Readbasis: ETC1Sleveldb: Hot Readindigobench: CPU - Bedroomindigobench: CPU - Supercarrav1e: 6openvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP32 - CPUopenvino: Age Gender Recognition Retail 0013 FP32 - CPUstockfish: Total Timeespeak: Text-To-Speech Synthesislibraw: Post-Processing Benchmarkwebp: Quality 100, Lossless, Highest Compressionsunflow: Global Illumination + Image Synthesisembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer - Asian Dragon Objphpbench: PHP Benchmark Suiterav1e: 10x265: Bosphorus 4Kpgbench: 100 - 1 - Read Only - Average Latencypgbench: 100 - 1 - Read Onlypgbench: 100 - 50 - Read Write - Average Latencypgbench: 100 - 50 - Read Writepgbench: 100 - 1 - Read Write - Average Latencypgbench: 100 - 1 - Read Writepgbench: 100 - 50 - Read Only - Average Latencypgbench: 100 - 50 - Read Onlycompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedcrafty: Elapsed Timekvazaar: Bosphorus 4K - Very Fastbasis: UASTC Level 3mlpack: scikit_svmembree: Pathtracer ISPC - Crownembree: Pathtracer - Crownkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumembree: Pathtracer ISPC - Asian Dragonredis: LPUSHtnn: CPU - MobileNet v2rnnoise: embree: Pathtracer - Asian Dragonpgbench: 1 - 50 - Read Write - Average Latencypgbench: 1 - 50 - Read Writepgbench: 1 - 1 - Read Only - Average Latencypgbench: 1 - 1 - Read Onlypgbench: 1 - 50 - Read Only - Average Latencypgbench: 1 - 50 - Read Onlypgbench: 1 - 1 - Read Write - Average Latencypgbench: 1 - 1 - Read Writewebp: Quality 100, Losslesstnn: CPU - SqueezeNet v1.1dolfyn: Computational Fluid Dynamicsosbench: Create Processesopencv: DNN - Deep Neural Networkkvazaar: Bosphorus 4K - Ultra Fastbasis: UASTC Level 2astcenc: Thoroughx265: Bosphorus 1080pffte: N=256, 3D Complex FFT Routineosbench: Launch Programsmafft: Multiple Sequence Alignment - LSU RNAkvazaar: Bosphorus 1080p - Very Fastsockperf: Latency Under Loadsockperf: Throughputredis: SETredis: SADDredis: GETredis: LPOPwebp: Quality 100, Highest Compressionbasis: UASTC Level 0astcenc: Mediumkvazaar: Bosphorus 1080p - Ultra Fastastcenc: Fastosbench: Create Filesosbench: Memory Allocationsosbench: Create Threadswebp: Quality 100webp: Defaultlammps: Rhodopsin Proteinleveldb: Fill Syncleveldb: Fill Sync12265868013.3861059057342894831.821616620.84453675861.6664.3334.5249.72103.9256.137.1522.5316.2118.1418.0418.6643.7340.552029884114572.21603249.53694.98010.2116500622.234269964236961213239199.070184.26161592327264552963.558951450527428.235.9031054170.8107071106.98792.102.03092.657414.736.600.2430.7377579.142.087593.742.073.028.125041.573.155038.663.168.27264990.18698.59810.1701.78310.16.44963.83063.53363.0584.0428.5830.9812.117453.002.057688.944263227640.08923.6751.0260.95120.767722.27904455102.20815.080.069144921.571318430.43722910.1323800457950.27150.67568045517.6032.69025.9420.114921.376721.5222.0223.7669889153.70368.41325.78724.212321.19623600.058173740.1174294970.382261724.414341.16923.35347.6280851571328.8920.42412.1733.1379402.43712221967.53603614.96343.2322.6503487421033603.521162742.421396994.711576478.5810.1149.8558.3579.146.9019.40274897.13236526.5296303.2032.11013.4651048.7026.714.157747752695.17910.2113018631.806200.0727440.535.6330748131.4107.1857384.636.820.2430.7338.118.29709.72110.0705.52410.06.48964.63062.5430.98423.8750.19320.891022.63037971.56859.42568743617.6219.983121.305421.4922.0423.575224.471024.33523.28747.52715428.9285377.77048903466.89055814.88243.2922.08133524610.13978.9819.64216699.19190426.1123983.2092.12414.2441044.6626.7OpenBenchmarking.org

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10001600K1200K1800K2400K3000KSE +/- 4667.04, N = 326586801. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1248121620SE +/- 0.19, N = 9SE +/- 0.04, N = 313.3914.161. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1248121620Min: 11.87 / Avg: 13.39 / Max: 13.69Min: 14.11 / Avg: 14.16 / Max: 14.241. (CXX) g++ options: -O3 -pthread -lm

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 10001200K400K600K800K1000KSE +/- 2384.89, N = 310590571. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Features 2D170K140K210K280K350KSE +/- 9636.57, N = 93428941. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing12004006008001000SE +/- 0.85, N = 3831.821. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Eigen12160320480640800SE +/- 6.31, N = 8SE +/- 7.84, N = 36167471. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Eigen12130260390520650Min: 587 / Avg: 616.25 / Max: 643Min: 739 / Avg: 747.33 / Max: 7631. (CXX) g++ options: -flto -pthread

Timed Clash Compilation

Build the clash-lang Haskell to VHDL/Verilog/SystemVerilog compiler with GHC 8.10.1 Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To Compile1130260390520650SE +/- 2.03, N = 3620.84

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 2001110K220K330K440K550KSE +/- 517.20, N = 35367581. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: yolov4-tiny11428425670SE +/- 1.07, N = 961.66MIN: 50.03 / MAX: 263.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet5011428425670SE +/- 3.47, N = 964.33MIN: 41.73 / MAX: 705.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: alexnet1816243240SE +/- 3.59, N = 934.52MIN: 17.35 / MAX: 153.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet1811122334455SE +/- 6.14, N = 949.72MIN: 23.26 / MAX: 267.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: vgg16120406080100SE +/- 4.25, N = 9103.92MIN: 72.99 / MAX: 322.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: googlenet11326395265SE +/- 2.83, N = 956.13MIN: 33.95 / MAX: 618.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: blazeface1246810SE +/- 0.08, N = 97.15MIN: 6.73 / MAX: 82.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: efficientnet-b01510152025SE +/- 0.34, N = 922.53MIN: 20.42 / MAX: 355.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mnasnet148121620SE +/- 0.24, N = 916.21MIN: 14.8 / MAX: 390.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: shufflenet-v2148121620SE +/- 0.53, N = 918.14MIN: 16.7 / MAX: 497.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v3-v3 - Model: mobilenet-v3148121620SE +/- 0.54, N = 918.04MIN: 15.98 / MAX: 509.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v2-v2 - Model: mobilenet-v21510152025SE +/- 0.79, N = 918.66MIN: 15.75 / MAX: 447.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet11020304050SE +/- 3.04, N = 943.73MIN: 35.16 / MAX: 504.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: squeezenet1918273645SE +/- 2.20, N = 940.55MIN: 32.73 / MAX: 521.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI Score14008001200160020002029

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training Score12004006008001000884

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference Score120040060080010001145

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qda11632486480SE +/- 1.72, N = 972.21

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS12160320480640800SE +/- 8.87, N = 4SE +/- 5.29, N = 36037521. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS12130260390520650Min: 589 / Avg: 603 / Max: 629Min: 742 / Avg: 752 / Max: 7601. (CXX) g++ options: -flto -pthread

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark150100150200250SE +/- 0.56, N = 3249.53

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential Fill12150300450600750SE +/- 3.04, N = 3SE +/- 0.79, N = 3694.98695.181. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential Fill12120240360480600Min: 689.01 / Avg: 694.98 / Max: 699Min: 693.97 / Avg: 695.18 / Max: 696.671. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential Fill123691215SE +/- 0.06, N = 3SE +/- 0.00, N = 310.210.21. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential Fill123691215Min: 10.1 / Avg: 10.2 / Max: 10.3Min: 10.2 / Avg: 10.2 / Max: 10.21. (CXX) g++ options: -O3 -lsnappy -lpthread

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Random1220K40K60K80K100KSE +/- 1996.86, N = 3SE +/- 1143.77, N = 31165001130181. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Random1220K40K60K80K100KMin: 112511 / Avg: 116499.67 / Max: 118668Min: 111588 / Avg: 113017.67 / Max: 1152791. (CXX) g++ options: -flto -pthread

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Delete12140280420560700SE +/- 0.79, N = 3SE +/- 4.77, N = 3622.23631.811. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Delete12110220330440550Min: 620.76 / Avg: 622.23 / Max: 623.45Min: 624.06 / Avg: 631.81 / Max: 640.51. (CXX) g++ options: -O3 -lsnappy -lpthread

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 100160K120K180K240K300KSE +/- 943.06, N = 32699641. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric150K100K150K200K250K2369611. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 200150K100K150K200K250KSE +/- 250.59, N = 32132391. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search124080120160200SE +/- 1.11, N = 3SE +/- 0.13, N = 3199.07200.071. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search124080120160200Min: 197.78 / Avg: 199.07 / Max: 201.29Min: 199.84 / Avg: 200.07 / Max: 200.281. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Apache CouchDB

This is a bulk insertion benchmark of Apache CouchDB. CouchDB is a document-oriented NoSQL database implemented in Erlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.1.1Bulk Size: 100 - Inserts: 1000 - Rounds: 2414080120160200SE +/- 0.47, N = 3184.261. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth113M26M39M52M65MSE +/- 945564.40, N = 361592327

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT160M120M180M240M300MSE +/- 212571.61, N = 3264552963.561. (CC) gcc options: -O3 -march=native -lm

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Object Detection130K60K90K120K150KSE +/- 2167.63, N = 31450521. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1216003200480064008000SE +/- 11.24, N = 6SE +/- 6.80, N = 37428.27440.51. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1213002600390052006500Min: 7404.7 / Avg: 7428.23 / Max: 7469.4Min: 7427.5 / Avg: 7440.53 / Max: 7450.41. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed12816243240SE +/- 0.43, N = 6SE +/- 0.34, N = 335.9035.631. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed12816243240Min: 35.1 / Avg: 35.9 / Max: 37.83Min: 34.95 / Avg: 35.63 / Max: 36.051. (CC) gcc options: -O3

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2127M14M21M28M35MSE +/- 59342.21, N = 3SE +/- 239652.72, N = 331054170.830748131.4
OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2125M10M15M20M25MMin: 30936228.7 / Avg: 31054170.77 / Max: 31124619.9Min: 30269717.1 / Avg: 30748131.43 / Max: 31012638.3

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 100120K40K60K80K100KSE +/- 610.04, N = 31070711. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek Random1220406080100SE +/- 0.25, N = 3SE +/- 0.68, N = 3106.99107.191. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek Random1220406080100Min: 106.52 / Avg: 106.99 / Max: 107.39Min: 106.01 / Avg: 107.18 / Max: 108.361. (CXX) g++ options: -O3 -lsnappy -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive120406080100SE +/- 0.32, N = 392.101. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark10.45680.91361.37041.82722.284SE +/- 0.010, N = 32.0301. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_ica120406080100SE +/- 0.39, N = 392.65

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1216003200480064008000SE +/- 13.54, N = 3SE +/- 4.73, N = 37414.77384.61. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1213002600390052006500Min: 7398.8 / Avg: 7414.67 / Max: 7441.6Min: 7379.2 / Avg: 7384.57 / Max: 73941. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed12816243240SE +/- 0.45, N = 3SE +/- 0.54, N = 336.6036.821. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed12816243240Min: 35.99 / Avg: 36.6 / Max: 37.47Min: 35.75 / Avg: 36.82 / Max: 37.391. (CC) gcc options: -O3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 1120.05470.10940.16410.21880.2735SE +/- 0.000, N = 3SE +/- 0.000, N = 30.2430.243
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11212345Min: 0.24 / Avg: 0.24 / Max: 0.24Min: 0.24 / Avg: 0.24 / Max: 0.24

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 5120.16580.33160.49740.66320.829SE +/- 0.001, N = 3SE +/- 0.001, N = 30.7370.733
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 512246810Min: 0.73 / Avg: 0.74 / Max: 0.74Min: 0.73 / Avg: 0.73 / Max: 0.74

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP16 - Device: CPU116003200480064008000SE +/- 24.10, N = 37579.14

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP16 - Device: CPU10.4680.9361.4041.8722.34SE +/- 0.01, N = 32.08

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP32 - Device: CPU116003200480064008000SE +/- 32.68, N = 37593.74

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP32 - Device: CPU10.46580.93161.39741.86322.329SE +/- 0.01, N = 32.07

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregression10.67951.3592.03852.7183.3975SE +/- 0.02, N = 33.02

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow12246810SE +/- 0.00, N = 3SE +/- 0.01, N = 38.128.111. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow123691215Min: 8.12 / Avg: 8.12 / Max: 8.12Min: 8.1 / Avg: 8.11 / Max: 8.131. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP16 - Device: CPU111002200330044005500SE +/- 4.71, N = 35041.57

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP16 - Device: CPU10.70881.41762.12642.83523.544SE +/- 0.01, N = 33.15

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP32 - Device: CPU111002200330044005500SE +/- 6.72, N = 35038.66

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP32 - Device: CPU10.7111.4222.1332.8443.555SE +/- 0.01, N = 33.16

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium12246810SE +/- 0.01, N = 3SE +/- 0.01, N = 38.278.291. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium123691215Min: 8.26 / Avg: 8.27 / Max: 8.28Min: 8.27 / Avg: 8.29 / Max: 8.311. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

KeyDB

A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16160K120K180K240K300KSE +/- 3258.77, N = 3264990.181. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Fill12150300450600750SE +/- 1.68, N = 3SE +/- 3.15, N = 3698.60709.721. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Fill12120240360480600Min: 695.28 / Avg: 698.6 / Max: 700.67Min: 705.82 / Avg: 709.72 / Max: 715.951. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Random Fill123691215SE +/- 0.03, N = 3SE +/- 0.03, N = 310.110.01. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Random Fill123691215Min: 10.1 / Avg: 10.13 / Max: 10.2Min: 9.9 / Avg: 9.97 / Max: 101. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Overwrite12150300450600750SE +/- 1.59, N = 3SE +/- 2.82, N = 3701.78705.521. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Overwrite12120240360480600Min: 699.44 / Avg: 701.78 / Max: 704.81Min: 700.85 / Avg: 705.52 / Max: 710.611. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Overwrite123691215SE +/- 0.03, N = 3SE +/- 0.03, N = 310.110.01. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Overwrite123691215Min: 10 / Avg: 10.07 / Max: 10.1Min: 10 / Avg: 10.03 / Max: 10.11. (CXX) g++ options: -O3 -lsnappy -lpthread

Sockperf

This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping Pong12246810SE +/- 0.108, N = 25SE +/- 0.126, N = 256.4496.4891. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping Pong123691215Min: 5.28 / Avg: 6.45 / Max: 7.69Min: 5.46 / Avg: 6.49 / Max: 7.891. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Read121428425670SE +/- 0.35, N = 3SE +/- 0.24, N = 363.8364.631. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Read121326395265Min: 63.12 / Avg: 63.83 / Max: 64.22Min: 64.39 / Avg: 64.63 / Max: 65.111. (CXX) g++ options: -O3 -lsnappy -lpthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S11428425670SE +/- 0.19, N = 363.531. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot Read121428425670SE +/- 0.49, N = 3SE +/- 0.58, N = 363.0662.541. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot Read121224364860Min: 62.08 / Avg: 63.06 / Max: 63.64Min: 61.58 / Avg: 62.54 / Max: 63.581. (CXX) g++ options: -O3 -lsnappy -lpthread

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom10.90951.8192.72853.6384.5475SE +/- 0.014, N = 34.042

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1246810SE +/- 0.146, N = 38.583

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 6120.22140.44280.66420.88561.107SE +/- 0.001, N = 3SE +/- 0.004, N = 30.9810.984
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 612246810Min: 0.98 / Avg: 0.98 / Max: 0.98Min: 0.98 / Avg: 0.98 / Max: 0.99

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU10.47480.94961.42441.89922.374SE +/- 0.03, N = 32.11

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU116003200480064008000SE +/- 113.02, N = 37453.00

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU10.46130.92261.38391.84522.3065SE +/- 0.03, N = 32.05

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU116003200480064008000SE +/- 100.52, N = 37688.94

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time19M18M27M36M45MSE +/- 592012.77, N = 3426322761. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1918273645SE +/- 0.12, N = 440.091. (CC) gcc options: -O2 -std=c99

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmark12612182430SE +/- 0.10, N = 3SE +/- 0.14, N = 323.6723.871. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmark12612182430Min: 23.47 / Avg: 23.67 / Max: 23.82Min: 23.59 / Avg: 23.87 / Max: 24.051. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression121224364860SE +/- 0.46, N = 3SE +/- 0.15, N = 351.0350.191. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression121020304050Min: 50.16 / Avg: 51.03 / Max: 51.72Min: 49.9 / Avg: 50.19 / Max: 50.411. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis10.2140.4280.6420.8561.07SE +/- 0.018, N = 150.951MIN: 0.6 / MAX: 2.12

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj12510152025SE +/- 0.21, N = 3SE +/- 0.11, N = 320.7720.89MIN: 20.32 / MAX: 21.4MIN: 20.52 / MAX: 21.37
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj12510152025Min: 20.55 / Avg: 20.77 / Max: 21.18Min: 20.71 / Avg: 20.89 / Max: 21.08

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj12510152025SE +/- 0.14, N = 3SE +/- 0.12, N = 322.2822.63MIN: 21.88 / MAX: 22.9MIN: 22.28 / MAX: 23.1
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj12510152025Min: 22.03 / Avg: 22.28 / Max: 22.5Min: 22.45 / Avg: 22.63 / Max: 22.86

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite1100K200K300K400K500KSE +/- 855.79, N = 3445510

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 1010.49680.99361.49041.98722.484SE +/- 0.006, N = 32.208

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K148121620SE +/- 0.02, N = 315.081. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency10.01550.0310.04650.0620.0775SE +/- 0.001, N = 30.0691. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only13K6K9K12K15KSE +/- 90.87, N = 3144921. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average Latency10.35350.7071.06051.4141.7675SE +/- 0.004, N = 31.5711. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read Write17K14K21K28K35KSE +/- 86.16, N = 3318431. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency10.09830.19660.29490.39320.4915SE +/- 0.007, N = 30.4371. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write15001000150020002500SE +/- 38.48, N = 322911. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency10.02970.05940.08910.11880.1485SE +/- 0.000, N = 30.1321. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read Only180K160K240K320K400KSE +/- 1057.80, N = 33800451. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed122K4K6K8K10KSE +/- 53.81, N = 3SE +/- 3.66, N = 37950.27971.51. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1214002800420056007000Min: 7842.6 / Avg: 7950.2 / Max: 8006Min: 7966.8 / Avg: 7971.5 / Max: 7978.71. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1215003000450060007500SE +/- 107.66, N = 3SE +/- 11.30, N = 37150.676859.421. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1212002400360048006000Min: 6996.96 / Avg: 7150.67 / Max: 7358.11Min: 6837.17 / Avg: 6859.42 / Max: 6874.031. (CC) gcc options: -O3

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time121.2M2.4M3.6M4.8M6MSE +/- 10735.68, N = 3SE +/- 12624.04, N = 3568045556874361. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time121000K2000K3000K4000K5000KMin: 5664561 / Avg: 5680455 / Max: 5700904Min: 5666764 / Avg: 5687436.33 / Max: 57103261. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1248121620SE +/- 0.03, N = 3SE +/- 0.03, N = 317.6017.621. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1248121620Min: 17.54 / Avg: 17.6 / Max: 17.66Min: 17.56 / Avg: 17.62 / Max: 17.661. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 31816243240SE +/- 0.06, N = 332.691. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svm1612182430SE +/- 0.10, N = 325.94

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown12510152025SE +/- 0.13, N = 3SE +/- 0.17, N = 320.1119.98MIN: 19.49 / MAX: 20.65MIN: 19.4 / MAX: 20.67
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown12510152025Min: 19.91 / Avg: 20.11 / Max: 20.36Min: 19.78 / Avg: 19.98 / Max: 20.32

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown12510152025SE +/- 0.09, N = 3SE +/- 0.22, N = 321.3821.31MIN: 20.72 / MAX: 22.05MIN: 20.6 / MAX: 22.06
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown12510152025Min: 21.21 / Avg: 21.38 / Max: 21.54Min: 21 / Avg: 21.31 / Max: 21.73

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow12510152025SE +/- 0.06, N = 3SE +/- 0.07, N = 321.5221.491. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow12510152025Min: 21.44 / Avg: 21.52 / Max: 21.64Min: 21.35 / Avg: 21.49 / Max: 21.571. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium12510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 322.0222.041. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium12510152025Min: 21.93 / Avg: 22.02 / Max: 22.09Min: 21.99 / Avg: 22.04 / Max: 22.111. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon12612182430SE +/- 0.27, N = 3SE +/- 0.21, N = 323.7723.58MIN: 23.24 / MAX: 24.53MIN: 23.02 / MAX: 24.08
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon12612182430Min: 23.38 / Avg: 23.77 / Max: 24.28Min: 23.17 / Avg: 23.58 / Max: 23.87

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH1200K400K600K800K1000KSE +/- 9242.05, N = 7889153.701. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2180160240320400SE +/- 0.72, N = 3368.41MIN: 357.34 / MAX: 406.291. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-281612182430SE +/- 0.03, N = 325.791. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon12612182430SE +/- 0.04, N = 3SE +/- 0.16, N = 324.2124.47MIN: 23.99 / MAX: 24.59MIN: 24.08 / MAX: 25
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon12612182430Min: 24.14 / Avg: 24.21 / Max: 24.26Min: 24.24 / Avg: 24.47 / Max: 24.79

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency1510152025SE +/- 0.02, N = 321.201. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write15001000150020002500SE +/- 2.48, N = 323601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency10.01310.02620.03930.05240.0655SE +/- 0.000, N = 30.0581. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only14K8K12K16K20KSE +/- 88.62, N = 3173741. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency10.02630.05260.07890.10520.1315SE +/- 0.000, N = 30.1171. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only190K180K270K360K450KSE +/- 1316.54, N = 34294971. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency10.0860.1720.2580.3440.43SE +/- 0.004, N = 30.3821. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write16001200180024003000SE +/- 23.96, N = 326171. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12612182430SE +/- 0.07, N = 3SE +/- 0.08, N = 324.4124.341. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12612182430Min: 24.28 / Avg: 24.41 / Max: 24.49Min: 24.2 / Avg: 24.34 / Max: 24.491. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1170140210280350SE +/- 0.54, N = 3341.17MIN: 338.47 / MAX: 342.851. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12612182430SE +/- 0.04, N = 3SE +/- 0.01, N = 323.3523.29
OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12510152025Min: 23.27 / Avg: 23.35 / Max: 23.41Min: 23.27 / Avg: 23.29 / Max: 23.3

OSBench

OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Processes121122334455SE +/- 0.76, N = 12SE +/- 0.57, N = 1547.6347.531. (CC) gcc options: -lm
OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Processes121020304050Min: 40.35 / Avg: 47.63 / Max: 49.85Min: 44.02 / Avg: 47.53 / Max: 51.431. (CC) gcc options: -lm

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: DNN - Deep Neural Network13K6K9K12K15KSE +/- 231.61, N = 4157131. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast12714212835SE +/- 0.11, N = 3SE +/- 0.06, N = 328.8928.921. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast12612182430Min: 28.77 / Avg: 28.89 / Max: 29.11Min: 28.85 / Avg: 28.92 / Max: 29.031. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 21510152025SE +/- 0.02, N = 320.421. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough13691215SE +/- 0.04, N = 312.171. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p1816243240SE +/- 0.19, N = 333.131. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1220K40K60K80K100KSE +/- 1406.21, N = 15SE +/- 1716.89, N = 1579402.4485377.771. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1215K30K45K60K75KMin: 74219.45 / Avg: 79402.44 / Max: 94585.1Min: 73274.91 / Avg: 85377.77 / Max: 95791.281. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

OSBench

OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Launch Programs121530456075SE +/- 1.03, N = 3SE +/- 0.73, N = 1567.5466.891. (CC) gcc options: -lm
OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Launch Programs121326395265Min: 65.64 / Avg: 67.54 / Max: 69.18Min: 61.1 / Avg: 66.89 / Max: 73.081. (CC) gcc options: -lm

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1248121620SE +/- 0.06, N = 3SE +/- 0.08, N = 314.9614.881. (CC) gcc options: -std=c99 -O3 -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1248121620Min: 14.84 / Avg: 14.96 / Max: 15.04Min: 14.77 / Avg: 14.88 / Max: 15.031. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast121020304050SE +/- 0.10, N = 3SE +/- 0.13, N = 343.2343.291. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast12918273645Min: 43.08 / Avg: 43.23 / Max: 43.43Min: 43.08 / Avg: 43.29 / Max: 43.521. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Sockperf

This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under Load12510152025SE +/- 0.24, N = 5SE +/- 0.16, N = 522.6522.081. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under Load12510152025Min: 22.17 / Avg: 22.65 / Max: 23.42Min: 21.67 / Avg: 22.08 / Max: 22.631. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: Throughput1270K140K210K280K350KSE +/- 3170.60, N = 5SE +/- 3895.07, N = 53487423352461. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: Throughput1260K120K180K240K300KMin: 340539 / Avg: 348742.2 / Max: 359137Min: 323471 / Avg: 335246 / Max: 3468151. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET1200K400K600K800K1000KSE +/- 8313.33, N = 31033603.521. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD1200K400K600K800K1000KSE +/- 10563.93, N = 31162742.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET1300K600K900K1200K1500KSE +/- 13033.36, N = 31396994.711. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP1300K600K900K1200K1500KSE +/- 22334.81, N = 31576478.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression123691215SE +/- 0.00, N = 3SE +/- 0.00, N = 310.1110.141. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression123691215Min: 10.11 / Avg: 10.11 / Max: 10.12Min: 10.13 / Avg: 10.14 / Max: 10.141. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 013691215SE +/- 0.023, N = 39.8551. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1246810SE +/- 0.02, N = 38.351. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast1220406080100SE +/- 0.34, N = 3SE +/- 0.25, N = 379.1478.981. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast121530456075Min: 78.46 / Avg: 79.14 / Max: 79.51Min: 78.6 / Avg: 78.98 / Max: 79.441. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1246810SE +/- 0.01, N = 36.901. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OSBench

OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Files12510152025SE +/- 0.02, N = 3SE +/- 0.05, N = 319.4019.641. (CC) gcc options: -lm
OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Files12510152025Min: 19.37 / Avg: 19.4 / Max: 19.43Min: 19.55 / Avg: 19.64 / Max: 19.731. (CC) gcc options: -lm

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory Allocations1220406080100SE +/- 0.25, N = 3SE +/- 0.28, N = 397.1399.191. (CC) gcc options: -lm
OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory Allocations1220406080100Min: 96.81 / Avg: 97.13 / Max: 97.63Min: 98.64 / Avg: 99.19 / Max: 99.491. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Threads12612182430SE +/- 0.05, N = 3SE +/- 0.40, N = 326.5326.111. (CC) gcc options: -lm
OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Threads12612182430Min: 26.44 / Avg: 26.53 / Max: 26.58Min: 25.37 / Avg: 26.11 / Max: 26.731. (CC) gcc options: -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100120.7221.4442.1662.8883.61SE +/- 0.002, N = 3SE +/- 0.001, N = 33.2033.2091. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 10012246810Min: 3.2 / Avg: 3.2 / Max: 3.21Min: 3.21 / Avg: 3.21 / Max: 3.211. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default120.47790.95581.43371.91162.3895SE +/- 0.002, N = 3SE +/- 0.016, N = 32.1102.1241. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default12246810Min: 2.11 / Avg: 2.11 / Max: 2.11Min: 2.11 / Avg: 2.12 / Max: 2.161. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1248121620SE +/- 0.05, N = 3SE +/- 0.04, N = 313.4714.241. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1248121620Min: 13.41 / Avg: 13.46 / Max: 13.56Min: 14.19 / Avg: 14.24 / Max: 14.311. (CXX) g++ options: -O3 -pthread -lm

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Fill Sync122004006008001000SE +/- 15.86, N = 3SE +/- 10.40, N = 31048.701044.661. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Fill Sync122004006008001000Min: 1023.44 / Avg: 1048.7 / Max: 1077.94Min: 1025.17 / Avg: 1044.66 / Max: 1060.71. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Fill Sync12246810SE +/- 0.09, N = 3SE +/- 0.06, N = 36.76.71. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Fill Sync123691215Min: 6.5 / Avg: 6.67 / Max: 6.8Min: 6.6 / Avg: 6.7 / Max: 6.81. (CXX) g++ options: -O3 -lsnappy -lpthread

152 Results Shown

Caffe
LAMMPS Molecular Dynamics Simulator
Caffe
OpenCV
Basis Universal
LeelaChessZero
Timed Clash Compilation
Caffe
NCNN:
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
  CPU - squeezenet
AI Benchmark Alpha:
  Device AI Score
  Device Training Score
  Device Inference Score
Mlpack Benchmark
LeelaChessZero
Numpy Benchmark
LevelDB:
  Seq Fill:
    Microseconds Per Op
    MB/s
LeelaChessZero
LevelDB
Caffe
BRL-CAD
Caffe
Timed HMMer Search
Apache CouchDB
asmFish
Hierarchical INTegration
OpenCV
LZ4 Compression:
  9 - Decompression Speed
  9 - Compression Speed
BYTE Unix Benchmark
Caffe
LevelDB
ASTC Encoder
GROMACS
Mlpack Benchmark
LZ4 Compression:
  3 - Decompression Speed
  3 - Compression Speed
rav1e:
  1
  5
OpenVINO:
  Person Detection 0106 FP16 - CPU:
    ms
    FPS
  Person Detection 0106 FP32 - CPU:
    ms
    FPS
Mlpack Benchmark
Kvazaar
OpenVINO:
  Face Detection 0106 FP16 - CPU:
    ms
    FPS
  Face Detection 0106 FP32 - CPU:
    ms
    FPS
Kvazaar
KeyDB
LevelDB:
  Rand Fill:
    Microseconds Per Op
    MB/s
  Overwrite:
    Microseconds Per Op
    MB/s
Sockperf
LevelDB
Basis Universal
LevelDB
IndigoBench:
  CPU - Bedroom
  CPU - Supercar
rav1e
OpenVINO:
  Age Gender Recognition Retail 0013 FP16 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP32 - CPU:
    ms
    FPS
Stockfish
eSpeak-NG Speech Engine
LibRaw
WebP Image Encode
Sunflow Rendering System
Embree:
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer - Asian Dragon Obj
PHPBench
rav1e
x265
PostgreSQL pgbench:
  100 - 1 - Read Only - Average Latency
  100 - 1 - Read Only
  100 - 50 - Read Write - Average Latency
  100 - 50 - Read Write
  100 - 1 - Read Write - Average Latency
  100 - 1 - Read Write
  100 - 50 - Read Only - Average Latency
  100 - 50 - Read Only
LZ4 Compression:
  1 - Decompression Speed
  1 - Compression Speed
Crafty
Kvazaar
Basis Universal
Mlpack Benchmark
Embree:
  Pathtracer ISPC - Crown
  Pathtracer - Crown
Kvazaar:
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
Embree
Redis
TNN
RNNoise
Embree
PostgreSQL pgbench:
  1 - 50 - Read Write - Average Latency
  1 - 50 - Read Write
  1 - 1 - Read Only - Average Latency
  1 - 1 - Read Only
  1 - 50 - Read Only - Average Latency
  1 - 50 - Read Only
  1 - 1 - Read Write - Average Latency
  1 - 1 - Read Write
WebP Image Encode
TNN
Dolfyn
OSBench
OpenCV
Kvazaar
Basis Universal
ASTC Encoder
x265
FFTE
OSBench
Timed MAFFT Alignment
Kvazaar
Sockperf:
  Latency Under Load
  Throughput
Redis:
  SET
  SADD
  GET
  LPOP
WebP Image Encode
Basis Universal
ASTC Encoder
Kvazaar
ASTC Encoder
OSBench:
  Create Files
  Memory Allocations
  Create Threads
WebP Image Encode:
  Quality 100
  Default
LAMMPS Molecular Dynamics Simulator
LevelDB:
  Fill Sync:
    Microseconds Per Op
    MB/s