CompuLab Airtop 3

Intel Xeon E-2288G testing with a Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2011040-FI-COMPULABA81
The tests in this result file fall within the following categories:

  AV1: 3 tests
  BLAS (Basic Linear Algebra Subprograms): 2 tests
  Timed Code Compilation: 2 tests
  C/C++ Compiler: 11 tests
  CPU Massive: 19 tests
  Creator Workloads: 16 tests
  Database Test Suite: 3 tests
  Encoding: 5 tests
  Fortran: 4 tests
  Game Development: 2 tests
  HPC - High Performance Computing: 15 tests
  Imaging: 4 tests
  Java: 3 tests
  Common Kernel Benchmarks: 3 tests
  Machine Learning: 7 tests
  Molecular Dynamics: 5 tests
  MPI Benchmarks: 3 tests
  Multi-Core: 15 tests
  NVIDIA GPU Compute: 7 tests
  OCR: 2 tests
  OpenMPI: 4 tests
  Programmer / Developer System Benchmarks: 3 tests
  Python: 4 tests
  Scientific Computing: 7 tests
  Server: 4 tests
  Server CPU: 11 tests
  Single-Threaded: 3 tests
  Video Encoding: 5 tests
  Common Workstation Benchmarks: 4 tests

Test runs:

  Run 1: October 31 2020 (test duration: 13 hours, 46 minutes)
  Run 2: November 02 2020 (test duration: 13 hours, 25 minutes)
  Run 3: November 03 2020 (test duration: 14 hours, 45 minutes)
  Average test duration: 13 hours, 59 minutes



System details (runs 1-3):

  Processor: Intel Xeon E-2288G @ 5.00GHz (8 Cores / 16 Threads)
  Motherboard: Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS)
  Chipset: Intel Cannon Lake PCH
  Memory: 64GB
  Disk: Samsung SSD 970 EVO Plus 250GB
  Graphics: NVIDIA Quadro RTX 4000 8GB (1005/6500MHz; also detected at 300/405MHz on at least one run)
  Audio: Intel Cannon Lake PCH cAVS
  Monitor: VE228
  Network: Intel I219-LM + Intel I210
  OS: Ubuntu 20.10
  Kernel: 5.8.0-26-generic (x86_64)
  Desktop: GNOME Shell 3.38.1
  Display Server: X Server 1.20.9
  Display Driver: NVIDIA 455.28
  OpenGL: 4.6.0
  OpenCL: OpenCL 1.2 CUDA 11.1.96
  Vulkan: 1.2.142
  Compiler: GCC 10.2.0
  File-System: ext4
  Screen Resolution: 1920x1080

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate powersave; CPU Microcode: 0xd6; Thermald 2.3
OpenCL Details: GPU Compute Cores: 2304
Java Details: OpenJDK Runtime Environment (build 11.0.9+11-Ubuntu-0ubuntu1)
Python Details: Python 3.8.6
Security Details: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

[Result overview chart: relative performance of runs 1-3 (scale 100% to 236%) across Rodinia, FFTE, Incompact3D, NAMD, LeelaChessZero, OCRMyPDF, LAMMPS Molecular Dynamics Simulator, DaCapo Benchmark, PostgreSQL pgbench, Kvazaar, oneDNN, NeatBench, Dolfyn, x265, OpenSSL, LibRaw, NCNN, dav1d, BYTE Unix Benchmark, WireGuard + Linux Networking Stack Stress Test, Java Gradle Build, InfluxDB, Sunflow Rendering System, Blender, Tesseract OCR, GROMACS, ASTC Encoder, Timed HMMer Search, TensorFlow Lite, RawTherapee, TNN, Hierarchical INTegration, KeyDB, libavif avifenc, PyPerformance, AOM AV1, BRL-CAD, WebP Image Encode, RNNoise, and Caffe.]

[Condensed results table omitted: a machine-formatted dump of every per-test value for runs 1-3. The per-test results below present these figures in readable form.]

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2-, and 3-dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.
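
FFTE's supported lengths are exactly those that factor entirely into 2s, 3s, and 5s. A minimal Python check of that condition (purely illustrative; not part of FFTE itself):

    # Check whether a transform length fits FFTE's (2^p)*(3^q)*(5^r) form.
    def is_ffte_length(n: int) -> bool:
        """Return True if n factors entirely into 2s, 3s and 5s."""
        if n < 1:
            return False
        for base in (2, 3, 5):
            while n % base == 0:
                n //= base
        return n == 1

    # The benchmark's N=256 case uses 256 = 2^8 along each dimension.
    assert is_ffte_length(256)
    assert not is_ffte_length(7 * 256)
    print([n for n in range(1, 65) if is_ffte_length(n)])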

FFTE 7.0, N=256, 3D Complex FFT Routine (MFLOPS, more is better):
  Run 1: 31760.02 (SE +/- 39.81, N = 3; range 31694.29 - 31831.8)
  Run 2: 31509.63 (SE +/- 21.47, N = 3; range 31484.3 - 31552.32)
  Run 3: 19217.72 (SE +/- 122.08, N = 3; range 18974.27 - 19355.58)
  Compiler notes: (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

Incompact3D

Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
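
As a generic illustration of the finite-difference idea (this is not Incompact3d's compact, MPI-parallel scheme), a single explicit step for 1D diffusion in Python:

    # One forward-Euler finite-difference step of du/dt = nu * d2u/dx2,
    # using a periodic three-point stencil; illustrative only.
    import numpy as np

    def diffuse_step(u: np.ndarray, nu: float, dx: float, dt: float) -> np.ndarray:
        d2u = (np.roll(u, -1) - 2.0 * u + np.roll(u, 1)) / dx**2
        return u + dt * nu * d2u

    u = np.sin(np.linspace(0.0, 2.0 * np.pi, 64, endpoint=False))
    for _ in range(100):
        u = diffuse_step(u, nu=0.01, dx=2.0 * np.pi / 64, dt=0.01)
    print(round(float(u.max()), 4))  # amplitude decays as the field diffuses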

Incompact3D 2020-09-17, Input: Cylinder (Seconds, fewer is better):
  Run 1: 378.48 (SE +/- 1.16, N = 3; range 376.21 - 380.06)
  Run 2: 377.56 (SE +/- 1.24, N = 3; range 375.1 - 379)
  Run 3: 605.89 (SE +/- 3.73, N = 3; range 600.85 - 613.17)
  Compiler notes: (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, fewer is better):
  Run 1: 140.31 (SE +/- 1.02, N = 3; range 138.61 - 142.13)
  Run 2: 138.39 (SE +/- 0.74, N = 3; range 136.96 - 139.41)
  Run 3: 214.31 (SE +/- 0.84, N = 3; range 213.19 - 215.96)
  Compiler notes: (CXX) g++ options: -O2 -lOpenCL

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, fewer is better):
  Run 1: 1.94453 (SE +/- 0.00650, N = 3; range 1.93 - 1.95)
  Run 2: 1.93847 (SE +/- 0.00297, N = 3; range 1.94 - 1.94)
  Run 3: 2.62968 (SE +/- 0.04166, N = 3; range 2.55 - 2.7)
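
NAMD's days/ns metric is the wall-clock days required to simulate one nanosecond, so its reciprocal gives the more familiar ns/day. A quick conversion of the figures above:

    # Convert NAMD's days/ns (fewer is better) into ns/day (more is better).
    days_per_ns = {"1": 1.94453, "2": 1.93847, "3": 2.62968}
    for run, d in days_per_ns.items():
        print(f"run {run}: {1.0 / d:.3f} ns/day")
    # run 1: 0.514, run 2: 0.516, run 3: 0.380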

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better):
  Run 1: 3.58701 (SE +/- 0.03967, N = 3; range 3.52 - 3.66; MIN: 3.47)
  Run 2: 3.24189 (SE +/- 0.05394, N = 3; range 3.19 - 3.35; MIN: 3.1)
  Run 3: 3.30402 (SE +/- 0.03320, N = 15; range 3.19 - 3.59; MIN: 3.1)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, fewer is better):
  Run 1: 149.62 (SE +/- 1.81, N = 3; range 146.36 - 152.63; MIN: 143.31)
  Run 2: 139.87 (SE +/- 0.87, N = 3; range 138.13 - 140.79; MIN: 136.06)
  Run 3: 136.15 (SE +/- 1.35, N = 3; range 133.47 - 137.77; MIN: 130.37)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: IP Batch 1D - Data Type: f32 - Engine: CPU (ms, fewer is better):
  Run 1: 4.49627 (SE +/- 0.04422, N = 15; range 3.99 - 4.77; MIN: 3.83)
  Run 2: 4.13153 (SE +/- 0.03723, N = 13; range 3.73 - 4.29; MIN: 3.59)
  Run 3: 4.13129 (SE +/- 0.04076, N = 12; range 3.75 - 4.36; MIN: 3.59)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, fewer is better):
  Run 1: 276.22 (SE +/- 1.21, N = 3; range 273.93 - 278.07)
  Run 2: 273.16 (SE +/- 0.19, N = 3; range 272.86 - 273.52)
  Run 3: 296.27 (SE +/- 3.61, N = 3; range 291.76 - 303.41)
  Compiler notes: (CXX) g++ options: -O2 -lOpenCL

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: mnasnet (ms, fewer is better):
  Run 1: 3.92 (SE +/- 0.01, N = 3; range 3.9 - 3.93; MIN: 3.87 / MAX: 4.28)
  Run 2: 3.91 (SE +/- 0.01, N = 3; range 3.89 - 3.93; MIN: 3.86 / MAX: 4.25)
  Run 3: 4.17 (SE +/- 0.13, N = 3; range 3.91 - 4.34; MIN: 3.88 / MAX: 4.56)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, more is better):
  Run 1: 87.16 (SE +/- 1.07, N = 3; range 85.39 - 89.09)
  Run 2: 92.53 (SE +/- 0.90, N = 9; range 88.35 - 97.71)
  Run 3: 87.18 (SE +/- 1.01, N = 3; range 86.15 - 89.2)
  Compiler notes: (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

DaCapo Benchmark

This test runs the DaCapo Benchmarks, which are written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, fewer is better):
  Run 1: 2690 (SE +/- 35.35, N = 5; range 2577 - 2793)
  Run 2: 2645 (SE +/- 27.27, N = 4; range 2572 - 2697)
  Run 3: 2801 (SE +/- 21.94, N = 4; range 2754 - 2860)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, fewer is better):
  Run 1: 3.99911 (SE +/- 0.00919, N = 3; range 3.98 - 4.01; MIN: 3.93)
  Run 2: 4.06970 (SE +/- 0.04507, N = 3; range 3.98 - 4.13; MIN: 3.86)
  Run 3: 3.87614 (SE +/- 0.00529, N = 3; range 3.87 - 3.89; MIN: 3.82)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OCRMyPDF

OCRMyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs whose text is selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.
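
For reference, the workload being timed can be reproduced from Python roughly as follows (a minimal sketch, assuming the ocrmypdf package's ocr() entry point; the file names are placeholders):

    # Add a selectable text layer to a scanned PDF via OCRmyPDF's Python API.
    import ocrmypdf

    # Input/output names are hypothetical; the benchmark processes a 60-page PDF.
    ocrmypdf.ocr("scanned_input.pdf", "searchable_output.pdf")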

OCRMyPDF 10.3.1+dfsg, Processing 60 Page PDF Document (Seconds, fewer is better):
  Run 1: 22.02 (SE +/- 0.12, N = 3; range 21.8 - 22.2)
  Run 2: 21.07 (SE +/- 0.22, N = 3; range 20.63 - 21.32)
  Run 3: 22.09 (SE +/- 0.21, N = 3; range 21.67 - 22.36)

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90, Blend File: Barbershop - Compute: NVIDIA OptiX (Seconds, fewer is better):
  Run 1: 1312.28 (SE +/- 19.68, N = 4; range 1253.45 - 1334.9)
  Run 2: 1264.86 (SE +/- 20.32, N = 3; range 1224.32 - 1287.66)
  Run 3: 1325.38 (SE +/- 22.81, N = 3; range 1279.77 - 1348.74)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU (ms, fewer is better):
  Run 1: 4.89651 (SE +/- 0.04290, N = 15; range 4.75 - 5.36; MIN: 3.92)
  Run 2: 4.67392 (SE +/- 0.05834, N = 12; range 4.04 - 4.83; MIN: 3.78)
  Run 3: 4.70756 (SE +/- 0.04582, N = 3; range 4.62 - 4.77; MIN: 4.21)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

DaCapo Benchmark

This test runs the DaCapo Benchmarks, which are written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Tradesoap (msec, fewer is better):
  Run 1: 3652 (SE +/- 18.80, N = 4; range 3596 - 3675)
  Run 2: 3550 (SE +/- 25.69, N = 4; range 3504 - 3612)
  Run 3: 3704 (SE +/- 42.18, N = 4; range 3589 - 3792)

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.26, Backend: BLAS (Nodes Per Second, more is better):
  Run 1: 844 (SE +/- 10.82, N = 3; range 823 - 859)
  Run 2: 837
  Run 3: 809
  Compiler notes: (CXX) g++ options: -flto -pthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: Vulkan GPU - Model: mobilenet (ms, fewer is better):
  Run 1: 4.63 (SE +/- 0.02, N = 3; range 4.61 - 4.67; MIN: 4.56 / MAX: 5.02)
  Run 2: 4.58 (SE +/- 0.00, N = 3; range 4.57 - 4.58; MIN: 4.54 / MAX: 4.65)
  Run 3: 4.77 (SE +/- 0.12, N = 3; range 4.64 - 5.01; MIN: 4.56 / MAX: 34.32)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, fewer is better):
  Run 1: 5.11 (SE +/- 0.01, N = 3; range 5.1 - 5.13; MIN: 5.01 / MAX: 5.37)
  Run 2: 5.13 (SE +/- 0.00, N = 3; range 5.12 - 5.13; MIN: 5.01 / MAX: 5.35)
  Run 3: 5.31 (SE +/- 0.14, N = 3; range 5.14 - 5.59; MIN: 5.02 / MAX: 8.3)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better):
  Run 1: 1.80103 (SE +/- 0.01882, N = 8; range 1.67 - 1.82; MIN: 1.51)
  Run 2: 1.80571 (SE +/- 0.01584, N = 15; range 1.61 - 1.9; MIN: 1.49)
  Run 3: 1.86706 (SE +/- 0.03006, N = 3; range 1.81 - 1.9; MIN: 1.53)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: Vulkan GPU - Model: resnet18 (ms, fewer is better):
  Run 1: 1.68 (SE +/- 0.03, N = 3; range 1.65 - 1.74; MIN: 1.64 / MAX: 17.75)
  Run 2: 1.72 (SE +/- 0.04, N = 3; range 1.65 - 1.77; MIN: 1.64 / MAX: 23.58)
  Run 3: 1.66 (SE +/- 0.00, N = 3; range 1.66 - 1.66; MIN: 1.64 / MAX: 1.7)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenCL Myocyte (Seconds, fewer is better):
  Run 1: 34.79 (SE +/- 0.13, N = 3; range 34.65 - 35.05)
  Run 2: 34.51 (SE +/- 0.05, N = 3; range 34.43 - 34.6)
  Run 3: 35.76 (SE +/- 0.36, N = 8; range 34.76 - 37.7)
  Compiler notes: (CXX) g++ options: -O2 -lOpenCL

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: blazeface (ms, fewer is better):
  Run 1: 1.46 (SE +/- 0.03, N = 3; range 1.4 - 1.49; MIN: 1.38 / MAX: 1.57)
  Run 2: 1.45 (SE +/- 0.04, N = 3; range 1.38 - 1.5; MIN: 1.34 / MAX: 1.66)
  Run 3: 1.50 (SE +/- 0.00, N = 3; range 1.5 - 1.51; MIN: 1.42 / MAX: 1.63)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916, Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, fewer is better):
  Run 1: 4.06 (SE +/- 0.01, N = 3; range 4.05 - 4.07; MIN: 4.01 / MAX: 4.32)
  Run 2: 4.07 (SE +/- 0.01, N = 3; range 4.05 - 4.08; MIN: 4.03 / MAX: 4.44)
  Run 3: 4.20 (SE +/- 0.15, N = 3; range 4.04 - 4.49; MIN: 4.01 / MAX: 4.64)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench to facilitate the database benchmarking. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency (ms, fewer is better):
  Run 1: 0.030 (SE +/- 0.000, N = 3; range 0.03 - 0.03)
  Run 2: 0.030 (SE +/- 0.000, N = 3; range 0.03 - 0.03)
  Run 3: 0.031 (SE +/- 0.000, N = 3; range 0.03 - 0.03)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: Vulkan GPU - Model: blazeface (ms, fewer is better):
  Run 1: 0.61 (SE +/- 0.00, N = 3; range 0.61 - 0.62; MIN: 0.6 / MAX: 0.63)
  Run 2: 0.62 (SE +/- 0.00, N = 3; range 0.61 - 0.62; MIN: 0.6 / MAX: 0.66)
  Run 3: 0.63 (SE +/- 0.01, N = 3; range 0.62 - 0.64; MIN: 0.61 / MAX: 2.14)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, more is better):
  Run 1: 46.73 (SE +/- 0.52, N = 3; range 45.69 - 47.27)
  Run 2: 48.24 (SE +/- 0.64, N = 5; range 47.55 - 50.78)
  Run 3: 46.79 (SE +/- 0.02, N = 3; range 46.75 - 46.82)
  Compiler notes: (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x265

This is a simple test of the x265 encoder run on the CPU, measuring H.265 video encode performance with 1080p and 4K inputs. Learn more via the OpenBenchmarking.org test page.

x265 3.4, Video Input: Bosphorus 1080p (Frames Per Second, more is better):
  Run 1: 54.49 (SE +/- 0.06, N = 3; range 54.37 - 54.56)
  Run 2: 56.14 (SE +/- 0.96, N = 3; range 54.66 - 57.94)
  Run 3: 54.42 (SE +/- 0.78, N = 3; range 52.87 - 55.24)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: yolov4-tiny (ms, fewer is better):
  Run 1: 28.70 (SE +/- 0.04, N = 3; range 28.65 - 28.78; MIN: 28.5 / MAX: 29.09)
  Run 2: 29.58 (SE +/- 0.35, N = 3; range 28.88 - 29.96; MIN: 28.17 / MAX: 140.34)
  Run 3: 29.07 (SE +/- 0.12, N = 3; range 28.89 - 29.29; MIN: 28.8 / MAX: 29.52)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916, Target: CPU - Model: shufflenet-v2 (ms, fewer is better):
  Run 1: 3.21 (SE +/- 0.04, N = 3; range 3.13 - 3.26; MIN: 2.98 / MAX: 3.47)
  Run 2: 3.17 (SE +/- 0.03, N = 3; range 3.12 - 3.21; MIN: 2.99 / MAX: 3.47)
  Run 3: 3.26 (SE +/- 0.06, N = 3; range 3.19 - 3.39; MIN: 2.99 / MAX: 3.84)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916, Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, fewer is better):
  Run 1: 1.45 (SE +/- 0.03, N = 3; range 1.42 - 1.52; MIN: 1.41 / MAX: 20.49)
  Run 2: 1.42 (SE +/- 0.00, N = 3; range 1.42 - 1.42; MIN: 1.41 / MAX: 1.44)
  Run 3: 1.46 (SE +/- 0.03, N = 3; range 1.42 - 1.52; MIN: 1.41 / MAX: 20.24)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916, Target: CPU - Model: alexnet (ms, fewer is better):
  Run 1: 14.38 (SE +/- 0.11, N = 3; range 14.19 - 14.58; MIN: 14.11 / MAX: 17.2)
  Run 2: 14.33 (SE +/- 0.01, N = 3; range 14.31 - 14.35; MIN: 14.26 / MAX: 14.55)
  Run 3: 14.72 (SE +/- 0.16, N = 3; range 14.41 - 14.92; MIN: 14.22 / MAX: 122.36)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916, Target: CPU - Model: resnet18 (ms, fewer is better):
  Run 1: 15.20 (SE +/- 0.20, N = 3; range 14.8 - 15.4; MIN: 14.69 / MAX: 15.68)
  Run 2: 15.49 (SE +/- 0.04, N = 3; range 15.43 - 15.58; MIN: 14.84 / MAX: 15.88)
  Run 3: 15.61 (SE +/- 0.25, N = 3; range 15.36 - 16.1; MIN: 15.01 / MAX: 51.06)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, more is better):
  Run 1: 21.87 (SE +/- 0.10, N = 3; range 21.69 - 22.03)
  Run 2: 22.45 (SE +/- 0.23, N = 3; range 22.19 - 22.9)
  Run 3: 21.92 (SE +/- 0.11, N = 3; range 21.73 - 22.12)
  Compiler notes: (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU (ms, fewer is better):
  Run 1: 6.64261 (SE +/- 0.02212, N = 3; range 6.6 - 6.67; MIN: 6.44)
  Run 2: 6.47349 (SE +/- 0.00067, N = 3; range 6.47 - 6.47; MIN: 6.31)
  Run 3: 6.61172 (SE +/- 0.09853, N = 4; range 6.5 - 6.91; MIN: 6.32)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.7.0, Video Input: Summer Nature 4K (FPS, more is better):
  Run 1: 156.40 (SE +/- 1.17, N = 3; range 154.17 - 158.09; MIN: 125.77 / MAX: 171.79)
  Run 2: 160.47 (SE +/- 1.07, N = 3; range 159.02 - 162.57; MIN: 149.07 / MAX: 177.15)
  Run 3: 156.84 (SE +/- 0.52, N = 3; range 156.01 - 157.79; MIN: 148.02 / MAX: 171.09)
  Compiler notes: (CC) gcc options: -pthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: googlenet (ms, fewer is better):
  Run 1: 14.99 (SE +/- 0.31, N = 3; range 14.37 - 15.31; MIN: 14.17 / MAX: 16.1)
  Run 2: 15.09 (SE +/- 0.26, N = 3; range 14.57 - 15.35; MIN: 14.25 / MAX: 15.64)
  Run 3: 15.38 (SE +/- 0.01, N = 3; range 15.37 - 15.39; MIN: 15.04 / MAX: 15.73)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.7.0, Video Input: Chimera 1080p 10-bit (FPS, more is better):
  Run 1: 114.55 (SE +/- 0.12, N = 3; range 114.36 - 114.77; MIN: 72.75 / MAX: 275.84)
  Run 2: 116.36 (SE +/- 0.22, N = 3; range 116.01 - 116.78; MIN: 73.44 / MAX: 275.45)
  Run 3: 113.41 (SE +/- 0.06, N = 3; range 113.29 - 113.48; MIN: 72.47 / MAX: 269.27)
  Compiler notes: (CC) gcc options: -pthread

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, more is better):
  Run 1: 18.24 (SE +/- 0.01, N = 3; range 18.22 - 18.27)
  Run 2: 18.58 (SE +/- 0.15, N = 3; range 18.42 - 18.88)
  Run 3: 18.11 (SE +/- 0.02, N = 3; range 18.08 - 18.14)
  Compiler notes: (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench to facilitate the database benchmarking. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 1 - Mode: Read Only (TPS, more is better):
  Run 1: 33414 (SE +/- 77.57, N = 3; range 33282.52 - 33551.01)
  Run 2: 33267 (SE +/- 373.98, N = 3; range 32547.57 - 33803.71)
  Run 3: 32579 (SE +/- 92.20, N = 3; range 32398.65 - 32702.93)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
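
These single-client TPS figures line up with the 0.030/0.030/0.031 ms average latencies reported earlier: in pgbench's closed loop, average latency is roughly clients / TPS. A quick check:

    # Sanity-check pgbench's average latency against TPS (latency ~ clients/TPS).
    clients = 1
    tps = {"1": 33414, "2": 33267, "3": 32579}
    for run, t in tps.items():
        print(f"run {run}: {clients / t * 1000:.3f} ms")  # ~0.030-0.031 ms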

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: efficientnet-b0 (ms, fewer is better):
  Run 1: 6.69 (SE +/- 0.14, N = 3; range 6.4 - 6.83; MIN: 6.36 / MAX: 7.32)
  Run 2: 6.68 (SE +/- 0.15, N = 3; range 6.38 - 6.83; MIN: 6.34 / MAX: 7.11)
  Run 3: 6.85 (SE +/- 0.01, N = 3; range 6.84 - 6.86; MIN: 6.72 / MAX: 7.36)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench to facilitate the database benchmarking. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average Latency (ms, fewer is better):
  Run 1: 0.365 (SE +/- 0.003, N = 3; range 0.36 - 0.37)
  Run 2: 0.356 (SE +/- 0.004, N = 3; range 0.35 - 0.36)
  Run 3: 0.360 (SE +/- 0.001, N = 3; range 0.36 - 0.36)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 50 - Mode: Read Write (TPS, more is better):
  Run 1: 611 (SE +/- 8.58, N = 3; range 595.47 - 625.07)
  Run 2: 596 (SE +/- 0.22, N = 3; range 595.68 - 596.38)
  Run 3: 604 (SE +/- 4.93, N = 3; range 597.67 - 614.08)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 100 - Mode: Read Only (TPS, more is better):
  Run 1: 274463 (SE +/- 1999.95, N = 3; range 270821.15 - 277716.25)
  Run 2: 281291 (SE +/- 3007.11, N = 3; range 278167.59 - 287303.58)
  Run 3: 278019 (SE +/- 753.18, N = 3; range 276740.71 - 279348.29)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency (ms, fewer is better):
  Run 1: 81.87 (SE +/- 1.15, N = 3; range 80 - 83.98)
  Run 2: 83.89 (SE +/- 0.03, N = 3; range 83.85 - 83.95)
  Run 3: 82.73 (SE +/- 0.67, N = 3; range 81.43 - 83.67)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency (ms, fewer is better):
  Run 1: 0.173 (SE +/- 0.001, N = 3; range 0.17 - 0.17)
  Run 2: 0.169 (SE +/- 0.003, N = 3; range 0.17 - 0.17)
  Run 3: 0.171 (SE +/- 0.002, N = 3; range 0.17 - 0.18)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 50 - Mode: Read Only (TPS, more is better):
  Run 1: 289401 (SE +/- 1001.63, N = 3; range 287488.27 - 290872.65)
  Run 2: 296088 (SE +/- 4333.65, N = 3; range 287556.43 - 301678.34)
  Run 3: 291995 (SE +/- 3723.51, N = 3; range 284577.31 - 296273.91)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 250 - Mode: Read Only - Average Latency (ms, fewer is better):
  Run 1: 0.982 (SE +/- 0.003, N = 3; range 0.98 - 0.99)
  Run 2: 0.995 (SE +/- 0.013, N = 5; range 0.96 - 1.04)
  Run 3: 1.004 (SE +/- 0.013, N = 5; range 0.97 - 1.04)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 250 - Mode: Read Only (TPS, more is better):
  Run 1: 254678 (SE +/- 853.38, N = 3; range 253004.72 - 255804.8)
  Run 2: 251429 (SE +/- 3217.19, N = 5; range 241661.57 - 260764.33)
  Run 3: 249348 (SE +/- 3216.57, N = 5; range 240318.41 - 259272)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better):
  Run 1: 5.70594 (SE +/- 0.03901, N = 3; range 5.64 - 5.78; MIN: 4.96)
  Run 2: 5.59357 (SE +/- 0.01208, N = 3; range 5.58 - 5.62; MIN: 4.91)
  Run 3: 5.70690 (SE +/- 0.05691, N = 3; range 5.63 - 5.82; MIN: 4.91)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the computational fluid dynamics demos bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

Dolfyn 0.527, Computational Fluid Dynamics (Seconds, fewer is better):
  Run 1: 16.83 (SE +/- 0.12, N = 3; range 16.66 - 17.06)
  Run 2: 16.71 (SE +/- 0.03, N = 3; range 16.66 - 16.76)
  Run 3: 17.03 (SE +/- 0.08, N = 3; range 16.87 - 17.12)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better):
  Run 1: 16.44 (SE +/- 0.01, N = 3; range 16.42 - 16.46; MIN: 16.35)
  Run 2: 16.74 (SE +/- 0.27, N = 3; range 16.46 - 17.28; MIN: 16.38)
  Run 3: 16.60 (SE +/- 0.16, N = 3; range 16.44 - 16.93; MIN: 16.35)
  Compiler notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: vgg16 (ms, fewer is better):
  Run 1: 67.22 (SE +/- 0.58, N = 3; range 66.13 - 68.09; MIN: 65.96 / MAX: 186.53)
  Run 2: 66.60 (SE +/- 0.00, N = 3; range 66.6 - 66.61; MIN: 66.46 / MAX: 67.69)
  Run 3: 67.82 (SE +/- 0.37, N = 3; range 67.37 - 68.56; MIN: 66.24 / MAX: 187.16)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench to facilitate the database benchmarking. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 100 - Mode: Read Write (TPS, more is better):
  Run 1: 568 (SE +/- 1.34, N = 3; range 565.61 - 570.21)
  Run 2: 578 (SE +/- 5.30, N = 3; range 571.02 - 588.49)
  Run 3: 578 (SE +/- 3.40, N = 3; range 570.89 - 582.08)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, more is better):
  Run 1: 2431.8 (SE +/- 27.67, N = 3; range 2376.6 - 2462.7)
  Run 2: 2467.9 (SE +/- 18.02, N = 3; range 2439 - 2501)
  Run 3: 2474.6 (SE +/- 14.41, N = 3; range 2459.3 - 2503.4)
  Compiler notes: (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
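
Inverting the signs-per-second rates gives the average time per 4096-bit RSA signature, a quick sanity check on what these figures mean:

    # Convert OpenSSL signs/sec into milliseconds per RSA-4096 signature.
    signs_per_sec = {"1": 2431.8, "2": 2467.9, "3": 2474.6}
    for run, s in signs_per_sec.items():
        print(f"run {run}: {1000.0 / s:.3f} ms per sign")
    # roughly 0.40 - 0.41 ms per signature on all three runs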

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916, Target: CPU - Model: squeezenet (ms, fewer is better):
  Run 1: 16.10 (SE +/- 0.03, N = 3; range 16.05 - 16.14; MIN: 15.74 / MAX: 17.79)
  Run 2: 16.10 (SE +/- 0.03, N = 3; range 16.06 - 16.16; MIN: 15.94 / MAX: 17.11)
  Run 3: 16.38 (SE +/- 0.16, N = 3; range 16.12 - 16.68; MIN: 15.97 / MAX: 24.3)
  Compiler notes: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench to facilitate the database benchmarking. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 13.0, Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average Latency (ms, fewer is better):
  Run 1: 176.13 (SE +/- 0.42, N = 3; range 175.49 - 176.92)
  Run 2: 173.12 (SE +/- 1.60, N = 3; range 169.99 - 175.26)
  Run 3: 173.24 (SE +/- 1.01, N = 3; range 171.91 - 175.22)
  Compiler notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

LibRaw 0.20, Post-Processing Benchmark (Mpix/sec, more is better):
  Run 1: 33.31 (SE +/- 0.13, N = 3; range 33.06 - 33.49)
  Run 2: 33.66 (SE +/- 0.13, N = 3; range 33.44 - 33.88)
  Run 3: 33.87 (SE +/- 0.08, N = 3; range 33.71 - 33.99)
  Compiler notes: (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12370140210280350SE +/- 3.32, N = 3SE +/- 2.20, N = 3SE +/- 1.53, N = 3321.80326.96324.34MIN: 304.56MIN: 315.72MIN: 307.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12360120180240300Min: 315.6 / Avg: 321.8 / Max: 326.96Min: 323.11 / Avg: 326.96 / Max: 330.73Min: 321.71 / Avg: 324.34 / Max: 327.021. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.
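
Individual benchmarks such as django_template can be reproduced with the upstream CLI. A minimal sketch, assuming pyperformance is installed via pip and with a placeholder output file name:

```python
# Run one PyPerformance benchmark and write JSON results.
import subprocess

subprocess.run(["pyperformance", "run",
                "-b", "django_template",   # benchmark selection
                "-o", "result.json"],      # output file (placeholder name)
               check=True)
```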

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_template123918273645SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 338.638.939.2
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_template123816243240Min: 38.5 / Avg: 38.63 / Max: 38.8Min: 38.8 / Avg: 38.9 / Max: 39Min: 39 / Avg: 39.17 / Max: 39.3

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: regex_compile123306090120150138137139

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.
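
The configuration strings below map directly onto inch's flags. A hypothetical invocation mirroring the 1024-stream result, with flag names taken from the influxdata/inch README (treat the exact spelling as an assumption):

```python
import subprocess

subprocess.run([
    "inch",
    "-c", "1024",      # concurrent write streams
    "-b", "10000",     # batch size
    "-t", "2,5000,1",  # tag cardinality
    "-p", "10000",     # points per series
], check=True)
```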

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 4676.23, N = 3SE +/- 6187.13, N = 3SE +/- 1033.19, N = 31535930.81557794.21539617.5
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KMin: 1530612.2 / Avg: 1535930.83 / Max: 1545252.4Min: 1545905.9 / Avg: 1557794.23 / Max: 1566712Min: 1537861.5 / Avg: 1539617.47 / Max: 1541438.8

BYTE Unix Benchmark

This is a test of the BYTE Unix Benchmark; its Dhrystone 2 computational test measures integer performance in loops per second (LPS). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 212310M20M30M40M50MSE +/- 543289.97, N = 3SE +/- 22614.50, N = 3SE +/- 80326.86, N = 348134384.748808319.148679363.3
OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 21238M16M24M32M40MMin: 47057724.3 / Avg: 48134384.73 / Max: 48799575.9Min: 48770683.9 / Avg: 48808319.07 / Max: 48848860.9Min: 48550770.6 / Avg: 48679363.3 / Max: 48827057.2

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlib12348121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.09, N = 314.414.314.5
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlib12348121620Min: 14.4 / Avg: 14.4 / Max: 14.4Min: 14.3 / Avg: 14.3 / Max: 14.3Min: 14.3 / Avg: 14.47 / Max: 14.6

Caffe

This is a benchmark of the Caffe deep learning framework that currently supports the AlexNet and GoogleNet models, with execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.
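
Caffe's own timing mode underlies results like these. A minimal sketch with a hypothetical model path and the 100-iteration count from the result below:

```python
import subprocess

subprocess.run([
    "caffe", "time",
    "--model=models/bvlc_alexnet/deploy.prototxt",  # placeholder path
    "--iterations=100",
], check=True)
```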

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001239K18K27K36K45KSE +/- 134.25, N = 3SE +/- 121.90, N = 3SE +/- 115.70, N = 34009539759403151. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001237K14K21K28K35KMin: 39879 / Avg: 40094.67 / Max: 40341Min: 39518 / Avg: 39759.33 / Max: 39910Min: 40090 / Avg: 40314.67 / Max: 404751. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU1231530456075SE +/- 0.38, N = 3SE +/- 0.04, N = 3SE +/- 0.54, N = 368.0569.0068.59MIN: 64.46MIN: 63.06MIN: 62.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU1231326395265Min: 67.53 / Avg: 68.05 / Max: 68.79Min: 68.94 / Avg: 69 / Max: 69.07Min: 67.59 / Avg: 68.59 / Max: 69.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet50123714212835SE +/- 0.17, N = 3SE +/- 0.43, N = 3SE +/- 0.53, N = 328.8028.9129.20MIN: 27.99 / MAX: 145.54MIN: 27.49 / MAX: 158.03MIN: 27.61 / MAX: 140.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet50123612182430Min: 28.63 / Avg: 28.8 / Max: 29.13Min: 28.39 / Avg: 28.91 / Max: 29.76Min: 28.52 / Avg: 29.2 / Max: 30.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WireGuard + Linux Networking Stack Stress Test

This is a benchmark of the WireGuard secure VPN tunnel and a stress test of the Linux networking stack. The test runs on the local host but does require root permissions. It creates three network namespaces: ns0 has a loopback device, while ns1 and ns2 each have WireGuard devices that send their traffic through the loopback device of ns0. The end result is that the test exercises encryption and decryption at the same time, a decidedly CPU- and scheduler-heavy workload. Learn more via the OpenBenchmarking.org test page.
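
The topology described above can be sketched with standard iproute2 and wireguard-tools commands. This is an illustrative reconstruction, not the test's actual script, and it must run as root:

```python
import subprocess

cmds = [
    "ip netns add ns0",                        # loopback-only namespace
    "ip netns add ns1",
    "ip netns add ns2",
    "ip -n ns1 link add wg1 type wireguard",   # WireGuard device in ns1
    "ip -n ns2 link add wg2 type wireguard",   # WireGuard device in ns2
    # key generation, peer configuration, and routing through ns0's
    # loopback are omitted here for brevity
]
for cmd in cmds:
    subprocess.run(cmd.split(), check=True)
```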

OpenBenchmarking.orgSeconds, Fewer Is BetterWireGuard + Linux Networking Stack Stress Test1234080120160200SE +/- 0.83, N = 3SE +/- 0.95, N = 3SE +/- 0.79, N = 3160.51158.36159.75
OpenBenchmarking.orgSeconds, Fewer Is BetterWireGuard + Linux Networking Stack Stress Test123306090120150Min: 159.13 / Avg: 160.51 / Max: 162Min: 156.69 / Avg: 158.36 / Max: 159.99Min: 158.37 / Avg: 159.75 / Max: 161.09

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet1230.33750.6751.01251.351.6875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.481.481.50MIN: 1.47 / MAX: 1.72MIN: 1.47 / MAX: 1.53MIN: 1.47 / MAX: 6.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet123246810Min: 1.48 / Avg: 1.48 / Max: 1.48Min: 1.48 / Avg: 1.48 / Max: 1.48Min: 1.49 / Avg: 1.5 / Max: 1.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile performs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.
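
The Fast/Medium/Thorough/Exhaustive presets below correspond to astcenc's command-line quality switches. A hypothetical astcenc 2.0 invocation with placeholder file names:

```python
import subprocess

subprocess.run([
    "astcenc", "-cl",           # compress an LDR image
    "input.png", "output.astc", # placeholder file names
    "6x6",                      # block footprint
    "-medium",                  # quality preset, as in the Medium result below
], check=True)
```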

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.488.448.551. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215Min: 8.47 / Avg: 8.48 / Max: 8.49Min: 8.41 / Avg: 8.44 / Max: 8.47Min: 8.54 / Avg: 8.55 / Max: 8.561. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212360120180240300SE +/- 1.30, N = 3SE +/- 0.68, N = 3SE +/- 0.57, N = 3286.04287.56284.03MIN: 283.63 / MAX: 356.82MIN: 285.93 / MAX: 307.5MIN: 282.82 / MAX: 325.111. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212350100150200250Min: 284.04 / Avg: 286.03 / Max: 288.48Min: 286.5 / Avg: 287.56 / Max: 288.84Min: 283.39 / Avg: 284.03 / Max: 285.161. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 2205.46, N = 3SE +/- 2194.00, N = 3SE +/- 2673.51, N = 31507808.81526391.31508093.5
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KMin: 1503409.4 / Avg: 1507808.83 / Max: 1510284.1Min: 1523213.5 / Avg: 1526391.27 / Max: 1530600.7Min: 1505179.9 / Avg: 1508093.5 / Max: 1513433.1

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
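
A hypothetical Kvazaar run matching the 1080p Medium configuration below; the raw YUV input name is a placeholder:

```python
import subprocess

subprocess.run([
    "kvazaar",
    "-i", "Bosphorus_1080p.yuv",
    "--input-res", "1920x1080",
    "--preset", "medium",
    "-o", "out.hevc",
], check=True)
```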

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123510152025SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 318.6918.9218.691. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123510152025Min: 18.58 / Avg: 18.69 / Max: 18.84Min: 18.89 / Avg: 18.92 / Max: 18.95Min: 18.57 / Avg: 18.69 / Max: 18.761. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.
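
The CPU-Only and NVIDIA OptiX results in this report come from headless Cycles renders. A minimal sketch of such a render, assuming a local copy of one of the benchmark .blend files:

```python
import subprocess

subprocess.run([
    "blender", "-b", "classroom.blend",   # placeholder scene file
    "-f", "1",                            # render frame 1
    "--", "--cycles-device", "OPTIX",     # or CUDA; omit for CPU rendering
], check=True)
```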

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiX12320406080100SE +/- 0.79, N = 3SE +/- 0.75, N = 3SE +/- 0.86, N = 3104.11103.42104.69
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiX12320406080100Min: 102.69 / Avg: 104.11 / Max: 105.44Min: 102.09 / Avg: 103.42 / Max: 104.67Min: 103.1 / Avg: 104.69 / Max: 106.07

Java Gradle Build

This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: Reactor1234080120160200SE +/- 2.21, N = 12SE +/- 2.22, N = 12SE +/- 2.36, N = 12197.76199.45197.05
OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: Reactor1234080120160200Min: 188.13 / Avg: 197.76 / Max: 216Min: 192.34 / Avg: 199.45 / Max: 220.25Min: 181.51 / Avg: 197.05 / Max: 210.63

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
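
dav1d's CLI can replay the same kind of decode-only workload. A minimal sketch with a placeholder input clip; the null muxer discards decoded frames so only decode speed is measured:

```python
import subprocess
import time

t0 = time.perf_counter()
subprocess.run(["dav1d", "-i", "Chimera_1080p.ivf", "--muxer", "null"],
               check=True)
print(f"decoded in {time.perf_counter() - t0:.2f}s")
```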

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p123140280420560700SE +/- 2.78, N = 3SE +/- 1.12, N = 3SE +/- 3.61, N = 3639.97647.74641.98MIN: 466.44 / MAX: 934.79MIN: 471.76 / MAX: 969.53MIN: 460.65 / MAX: 940.691. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p123110220330440550Min: 634.42 / Avg: 639.97 / Max: 642.83Min: 645.5 / Avg: 647.74 / Max: 648.95Min: 634.76 / Avg: 641.98 / Max: 645.691. (CC) gcc options: -pthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg16123246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 38.368.328.42MIN: 7.77 / MAX: 22.44MIN: 7.77 / MAX: 22.07MIN: 7.88 / MAX: 18.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg161233691215Min: 8.34 / Avg: 8.36 / Max: 8.38Min: 8.27 / Avg: 8.32 / Max: 8.39Min: 8.41 / Avg: 8.42 / Max: 8.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1233691215SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 311.9512.0511.911. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast12348121620Min: 11.9 / Avg: 11.95 / Max: 12Min: 12.01 / Avg: 12.05 / Max: 12.13Min: 11.83 / Avg: 11.91 / Max: 11.981. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: 2to312360120180240300264261261

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile12170340510680850SE +/- 2.53, N = 3SE +/- 1.50, N = 3771.21762.88
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile12140280420560700Min: 767.83 / Avg: 771.21 / Max: 776.17Min: 760.16 / Avg: 762.88 / Max: 765.34

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU123714212835SE +/- 0.07, N = 3SE +/- 0.24, N = 3SE +/- 0.04, N = 327.6427.3727.35MIN: 25.19MIN: 24.77MIN: 25.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU123612182430Min: 27.55 / Avg: 27.64 / Max: 27.77Min: 27.12 / Avg: 27.37 / Max: 27.86Min: 27.28 / Avg: 27.35 / Max: 27.431. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 Realtime1231020304050SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 0.23, N = 345.9246.2645.781. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 Realtime123918273645Min: 45.69 / Avg: 45.92 / Max: 46.13Min: 46.13 / Avg: 46.26 / Max: 46.43Min: 45.34 / Avg: 45.78 / Max: 46.141. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1231.32952.6593.98855.3186.6475SE +/- 0.013, N = 3SE +/- 0.005, N = 3SE +/- 0.046, N = 35.8515.9095.9051. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms123246810Min: 5.83 / Avg: 5.85 / Max: 5.88Min: 5.9 / Avg: 5.91 / Max: 5.92Min: 5.84 / Avg: 5.91 / Max: 5.991. (CXX) g++ options: -O3 -pthread -lm

Sunflow Rendering System

This test benchmarks the Sunflow Rendering System, an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.25610.51220.76831.02441.2805SE +/- 0.010, N = 15SE +/- 0.011, N = 15SE +/- 0.011, N = 151.1381.1321.127MIN: 0.97 / MAX: 1.48MIN: 0.94 / MAX: 1.51MIN: 0.95 / MAX: 1.48
OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis123246810Min: 1.03 / Avg: 1.14 / Max: 1.19Min: 1.02 / Avg: 1.13 / Max: 1.17Min: 1.01 / Avg: 1.13 / Max: 1.17

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet1230.47250.9451.41751.892.3625SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.102.082.09MIN: 1.85 / MAX: 2.59MIN: 2.04 / MAX: 2.57MIN: 1.82 / MAX: 2.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet123246810Min: 2.08 / Avg: 2.1 / Max: 2.11Min: 2.08 / Avg: 2.08 / Max: 2.09Min: 2.09 / Avg: 2.09 / Max: 2.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiX1231224364860SE +/- 0.14, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 354.9354.7155.23
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiX1231122334455Min: 54.71 / Avg: 54.93 / Max: 55.2Min: 54.45 / Avg: 54.71 / Max: 55.02Min: 55 / Avg: 55.23 / Max: 55.42

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.
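
Average inference time of a .tflite model can also be measured directly with the TensorFlow Lite Python interpreter. A minimal sketch with a placeholder model path and zero-filled input:

```python
import time

import numpy as np
import tensorflow as tf

interp = tf.lite.Interpreter(model_path="mobilenet_quant.tflite")
interp.allocate_tensors()
inp = interp.get_input_details()[0]
interp.set_tensor(inp["index"], np.zeros(inp["shape"], dtype=inp["dtype"]))

runs = 50
t0 = time.perf_counter()
for _ in range(runs):
    interp.invoke()
avg_us = (time.perf_counter() - t0) / runs * 1e6
print(f"average inference time: {avg_us:.0f} us")
```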

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quant12330K60K90K120K150KSE +/- 112.51, N = 3SE +/- 261.09, N = 3SE +/- 197.44, N = 3157250156299157719
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quant12330K60K90K120K150KMin: 157085 / Avg: 157250 / Max: 157465Min: 156026 / Avg: 156299 / Max: 156821Min: 157339 / Avg: 157719 / Max: 158002

Tesseract OCR

Tesseract-OCR is the open-source optical character recognition (OCR) engine for the conversion of text within images to raw text output. This test profile relies upon a system-supplied Tesseract installation. Learn more via the OpenBenchmarking.org test page.
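
The time-to-OCR idea is easy to approximate against the system tesseract binary; the image names here are placeholders:

```python
import subprocess
import time

images = [f"sample_{i}.png" for i in range(7)]
t0 = time.perf_counter()
for img in images:
    subprocess.run(["tesseract", img, "stdout"],
                   check=True, capture_output=True)
print(f"OCR of {len(images)} images: {time.perf_counter() - t0:.2f}s")
```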

OpenBenchmarking.orgSeconds, Fewer Is BetterTesseract OCR 4.1.1Time To OCR 7 Images123510152025SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 320.1320.3220.22
OpenBenchmarking.orgSeconds, Fewer Is BetterTesseract OCR 4.1.1Time To OCR 7 Images123510152025Min: 20.12 / Avg: 20.13 / Max: 20.14Min: 20.27 / Avg: 20.31 / Max: 20.39Min: 20.18 / Avg: 20.22 / Max: 20.26

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startup123246810SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.806.746.75
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startup1233691215Min: 6.73 / Avg: 6.8 / Max: 6.85Min: 6.73 / Avg: 6.74 / Max: 6.74Min: 6.75 / Avg: 6.75 / Max: 6.76

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobile12340K80K120K160K200KSE +/- 363.56, N = 3SE +/- 120.29, N = 3SE +/- 1000.76, N = 3201015199504201265
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobile12330K60K90K120K150KMin: 200409 / Avg: 201015 / Max: 201666Min: 199271 / Avg: 199503.67 / Max: 199673Min: 199347 / Avg: 201265.33 / Max: 202719

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 5130.21, N = 3SE +/- 4759.58, N = 3SE +/- 4617.21, N = 31534293.61543312.51530241.0
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KMin: 1525190.4 / Avg: 1534293.57 / Max: 1542944.7Min: 1538259.6 / Avg: 1543312.5 / Max: 1552825.5Min: 1522977.7 / Avg: 1530241 / Max: 1538811.2

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: raytrace12380160240320400374375377

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet501230.84831.69662.54493.39324.2415SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.773.743.76MIN: 3.72 / MAX: 13.18MIN: 3.72 / MAX: 3.76MIN: 3.74 / MAX: 3.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet50123246810Min: 3.76 / Avg: 3.77 / Max: 3.79Min: 3.73 / Avg: 3.74 / Max: 3.75Min: 3.75 / Avg: 3.76 / Max: 3.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile performs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive12360120180240300SE +/- 0.80, N = 3SE +/- 0.56, N = 3SE +/- 0.84, N = 3274.72273.46275.591. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive12350100150200250Min: 273.3 / Avg: 274.72 / Max: 276.07Min: 272.37 / Avg: 273.46 / Max: 274.26Min: 274.09 / Avg: 275.59 / Max: 276.991. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Realtime123612182430SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 323.5423.4423.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Realtime123510152025Min: 23.38 / Avg: 23.54 / Max: 23.68Min: 23.27 / Avg: 23.44 / Max: 23.62Min: 23.22 / Avg: 23.36 / Max: 23.521. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v21230.29480.58960.88441.17921.474SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.301.311.31MIN: 1.29 / MAX: 1.32MIN: 1.29 / MAX: 1.35MIN: 1.3 / MAX: 1.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v2123246810Min: 1.3 / Avg: 1.3 / Max: 1.31Min: 1.31 / Avg: 1.31 / Max: 1.31Min: 1.31 / Avg: 1.31 / Max: 1.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b01230.60081.20161.80242.40323.004SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.672.652.65MIN: 2.64 / MAX: 12.58MIN: 2.63 / MAX: 3.85MIN: 2.63 / MAX: 2.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b0123246810Min: 2.65 / Avg: 2.67 / Max: 2.7Min: 2.64 / Avg: 2.65 / Max: 2.65Min: 2.65 / Avg: 2.65 / Max: 2.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Float12330K60K90K120K150KSE +/- 387.33, N = 3SE +/- 330.84, N = 3SE +/- 188.88, N = 3153919153221154362
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Float12330K60K90K120K150KMin: 153221 / Avg: 153919 / Max: 154559Min: 152834 / Avg: 153220.67 / Max: 153879Min: 154000 / Avg: 154361.67 / Max: 154637

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, run on the CPU with the water_GMX50 data set. Learn more via the OpenBenchmarking.org test page.
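
The ns/day figure comes from gmx mdrun's performance summary. A hypothetical run with a placeholder .tpr input standing in for the prepared water_GMX50 system:

```python
import subprocess

subprocess.run([
    "gmx", "mdrun",
    "-s", "water_GMX50.tpr",  # prepared run input (placeholder name)
    "-nsteps", "1000",        # short run for illustration
    "-ntomp", "16",           # OpenMP threads
], check=True)
# ns/day is printed in the performance summary at the end of the log.
```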

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.18470.36940.55410.73880.9235SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 30.8200.8150.8211. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark123246810Min: 0.82 / Avg: 0.82 / Max: 0.82Min: 0.81 / Avg: 0.82 / Max: 0.82Min: 0.82 / Avg: 0.82 / Max: 0.821. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Caffe

This is a benchmark of the Caffe deep learning framework that currently supports the AlexNet and GoogleNet models, with execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012320K40K60K80K100KSE +/- 78.77, N = 3SE +/- 143.26, N = 3SE +/- 112.10, N = 38027379922805071. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012314K28K42K56K70KMin: 80131 / Avg: 80273.33 / Max: 80403Min: 79648 / Avg: 79921.67 / Max: 80132Min: 80318 / Avg: 80507.33 / Max: 807061. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.95181.90362.85543.80724.759SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.214.234.201. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium123246810Min: 4.2 / Avg: 4.21 / Max: 4.22Min: 4.22 / Avg: 4.23 / Max: 4.25Min: 4.19 / Avg: 4.2 / Max: 4.21. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNet12350K100K150K200K250KSE +/- 345.32, N = 3SE +/- 496.71, N = 3SE +/- 627.42, N = 3227852226239227599
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNet12340K80K120K160K200KMin: 227164 / Avg: 227851.67 / Max: 228251Min: 225661 / Avg: 226239.33 / Max: 227228Min: 226382 / Avg: 227598.67 / Max: 228473

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.30, N = 3100.12100.24100.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100Min: 100.06 / Avg: 100.12 / Max: 100.22Min: 100.03 / Avg: 100.24 / Max: 100.35Min: 100.31 / Avg: 100.83 / Max: 101.341. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-Pass1230.96751.9352.90253.874.8375SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.304.294.271. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-Pass123246810Min: 4.28 / Avg: 4.3 / Max: 4.31Min: 4.26 / Avg: 4.29 / Max: 4.3Min: 4.26 / Avg: 4.27 / Max: 4.291. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility, using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
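
The encode-settings strings below map onto cwebp flags. A hypothetical invocation for the Quality 100, Lossless, Highest Compression case, with placeholder file names:

```python
import subprocess

subprocess.run([
    "cwebp",
    "-q", "100",
    "-lossless",
    "-m", "6",                 # compression method 0..6; 6 is the strongest
    "sample_6000x4000.jpg",
    "-o", "out.webp",
], check=True)
```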

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression123816243240SE +/- 0.23, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 334.7034.8434.611. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression123714212835Min: 34.25 / Avg: 34.7 / Max: 34.94Min: 34.59 / Avg: 34.84 / Max: 35.06Min: 34.51 / Avg: 34.61 / Max: 34.731. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX123306090120150SE +/- 0.92, N = 3SE +/- 0.89, N = 3SE +/- 0.98, N = 3145.47144.87145.79
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX123306090120150Min: 143.84 / Avg: 145.47 / Max: 147.04Min: 143.29 / Avg: 144.87 / Max: 146.38Min: 144 / Avg: 145.79 / Max: 147.37

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-Only123130260390520650SE +/- 0.63, N = 3SE +/- 1.27, N = 3SE +/- 0.47, N = 3584.98587.97584.40
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-Only123100200300400500Min: 583.76 / Avg: 584.98 / Max: 585.88Min: 585.46 / Avg: 587.97 / Max: 589.52Min: 583.49 / Avg: 584.4 / Max: 585.09

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time1231428425670SE +/- 0.13, N = 3SE +/- 0.28, N = 3SE +/- 0.05, N = 361.7561.8562.121. RawTherapee, version 5.8, command line.
OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time1231224364860Min: 61.53 / Avg: 61.75 / Max: 61.99Min: 61.29 / Avg: 61.85 / Max: 62.2Min: 62.06 / Avg: 62.12 / Max: 62.221. RawTherapee, version 5.8, command line.

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31230.3780.7561.1341.5121.89SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.671.681.68MIN: 1.66 / MAX: 1.7MIN: 1.66 / MAX: 1.93MIN: 1.66 / MAX: 1.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3123246810Min: 1.67 / Avg: 1.67 / Max: 1.68Min: 1.67 / Avg: 1.68 / Max: 1.68Min: 1.68 / Avg: 1.68 / Max: 1.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility, using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12348121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 315.6115.6015.691. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12348121620Min: 15.6 / Avg: 15.61 / Max: 15.63Min: 15.59 / Avg: 15.6 / Max: 15.62Min: 15.63 / Avg: 15.69 / Max: 15.821. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CPU-Only12360120180240300SE +/- 0.29, N = 3SE +/- 0.26, N = 3SE +/- 0.29, N = 3271.16269.67270.29
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CPU-Only12350100150200250Min: 270.59 / Avg: 271.16 / Max: 271.52Min: 269.16 / Avg: 269.67 / Max: 270.04Min: 269.74 / Avg: 270.29 / Max: 270.71

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile performs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1231.24882.49763.74644.99526.244SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.535.525.551. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast123246810Min: 5.52 / Avg: 5.53 / Max: 5.54Min: 5.51 / Avg: 5.52 / Max: 5.53Min: 5.54 / Avg: 5.55 / Max: 5.561. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython1238001600240032004000SE +/- 52.97, N = 4SE +/- 52.89, N = 4SE +/- 27.24, N = 4370936983718
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython1236001200180024003000Min: 3607 / Avg: 3709.25 / Max: 3847Min: 3616 / Avg: 3697.5 / Max: 3852Min: 3661 / Avg: 3717.5 / Max: 3792

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
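
avifenc's -s flag is the encoder speed referenced by these results (0 = slowest/best, 10 = fastest). A hypothetical call with placeholder file names:

```python
import subprocess

subprocess.run(["avifenc", "-s", "8", "input.jpg", "output.avif"],
               check=True)
```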

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 81231.132.263.394.525.65SE +/- 0.013, N = 3SE +/- 0.019, N = 3SE +/- 0.017, N = 35.0224.9955.0171. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 8123246810Min: 5 / Avg: 5.02 / Max: 5.05Min: 4.96 / Avg: 4.99 / Max: 5.03Min: 4.99 / Avg: 5.02 / Max: 5.051. (CXX) g++ options: -O3 -fPIC

Caffe

This is a benchmark of the Caffe deep learning framework that currently supports the AlexNet and GoogleNet models, with execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KSE +/- 216.65, N = 3SE +/- 79.24, N = 3SE +/- 179.36, N = 31037861033491039031. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KMin: 103440 / Avg: 103786.33 / Max: 104185Min: 103248 / Avg: 103348.67 / Max: 103505Min: 103621 / Avg: 103903 / Max: 1042361. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT123100M200M300M400M500MSE +/- 493184.05, N = 3SE +/- 93008.17, N = 3SE +/- 1375357.19, N = 3472824990.62474322870.49471834576.351. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT12380M160M240M320M400MMin: 472204306.73 / Avg: 472824990.62 / Max: 473799226.51Min: 474146266.95 / Avg: 474322870.49 / Max: 474461768.19Min: 469295372.86 / Avg: 471834576.35 / Max: 474020232.921. (CC) gcc options: -O3 -march=native -lm

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-Only1232004006008001000SE +/- 1.21, N = 3SE +/- 0.37, N = 3SE +/- 0.65, N = 3787.85783.83787.49
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-Only123140280420560700Min: 786.16 / Avg: 787.85 / Max: 790.19Min: 783.27 / Avg: 783.83 / Max: 784.52Min: 786.81 / Avg: 787.49 / Max: 788.79

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 101231.07032.14063.21094.28125.3515SE +/- 0.009, N = 3SE +/- 0.014, N = 3SE +/- 0.010, N = 34.7574.7384.7331. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 10123246810Min: 4.74 / Avg: 4.76 / Max: 4.77Min: 4.72 / Avg: 4.74 / Max: 4.77Min: 4.72 / Avg: 4.73 / Max: 4.751. (CXX) g++ options: -O3 -fPIC

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiX123714212835SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 327.9027.7627.86
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiX123612182430Min: 27.89 / Avg: 27.9 / Max: 27.92Min: 27.68 / Avg: 27.76 / Max: 27.8Min: 27.83 / Avg: 27.86 / Max: 27.91

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: go1234080120160200200199200

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.93831.87662.81493.75324.6915SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 34.154.174.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow123246810Min: 4.13 / Avg: 4.15 / Max: 4.19Min: 4.15 / Avg: 4.17 / Max: 4.21Min: 4.12 / Avg: 4.15 / Max: 4.191. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loads123510152025SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 321.321.221.2
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loads123510152025Min: 21.2 / Avg: 21.33 / Max: 21.5Min: 21.2 / Avg: 21.2 / Max: 21.2Min: 21.1 / Avg: 21.17 / Max: 21.2

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaos12320406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 385.785.886.1
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaos1231632486480Min: 85.6 / Avg: 85.67 / Max: 85.8Min: 85.7 / Avg: 85.83 / Max: 86Min: 86 / Avg: 86.07 / Max: 86.2

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 417.1217.1417.20MIN: 16.89MIN: 16.92MIN: 16.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620Min: 16.96 / Avg: 17.12 / Max: 17.37Min: 16.99 / Avg: 17.14 / Max: 17.45Min: 16.94 / Avg: 17.2 / Max: 17.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21231326395265SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 358.6658.4058.661. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21231224364860Min: 58.48 / Avg: 58.66 / Max: 58.79Min: 57.86 / Avg: 58.4 / Max: 58.75Min: 58.33 / Avg: 58.66 / Max: 58.881. (CXX) g++ options: -O3 -fPIC

KeyDB

This is a benchmark of KeyDB, a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.
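
A hypothetical memtier_benchmark invocation against a local KeyDB instance; the thread, client, and ratio values are illustrative, not the profile's exact settings:

```python
import subprocess

subprocess.run([
    "memtier_benchmark",
    "-s", "127.0.0.1", "-p", "6379",  # KeyDB speaks the Redis protocol
    "--threads", "4",
    "--clients", "50",
    "--ratio", "1:1",                 # SET:GET mix
    "--test-time", "30",              # seconds
], check=True)
```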

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123160K320K480K640K800KSE +/- 1649.46, N = 3SE +/- 3580.61, N = 3SE +/- 673.14, N = 3737463.31736916.33734176.591. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123130K260K390K520K650KMin: 734781.8 / Avg: 737463.31 / Max: 740468.19Min: 732167.59 / Avg: 736916.33 / Max: 743932.83Min: 732907.25 / Avg: 734176.59 / Max: 735199.771. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080p123130260390520650SE +/- 1.17, N = 3SE +/- 0.67, N = 3SE +/- 1.29, N = 3581.34582.13579.67MIN: 498.37 / MAX: 635.39MIN: 511.78 / MAX: 635.13MIN: 479.84 / MAX: 634.421. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080p123100200300400500Min: 579.93 / Avg: 581.34 / Max: 583.67Min: 580.8 / Avg: 582.13 / Max: 583.01Min: 577.1 / Avg: 579.67 / Max: 581.131. (CC) gcc options: -pthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet123510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 319.3019.2219.23MIN: 18.89 / MAX: 19.7MIN: 18.82 / MAX: 26.38MIN: 18.97 / MAX: 19.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet123510152025Min: 19.2 / Avg: 19.3 / Max: 19.38Min: 19.13 / Avg: 19.22 / Max: 19.28Min: 19.05 / Avg: 19.23 / Max: 19.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

x265

This is a simple test of the x265 H.265 video encoder run on the CPU, with 1080p and 4K options for video encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K1233691215SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 312.3712.3912.341. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K12348121620Min: 12.23 / Avg: 12.37 / Max: 12.47Min: 12.25 / Avg: 12.39 / Max: 12.56Min: 12.3 / Avg: 12.34 / Max: 12.371. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 012320406080100SE +/- 0.38, N = 3SE +/- 0.39, N = 3SE +/- 0.38, N = 399.6799.2799.341. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 012320406080100Min: 98.95 / Avg: 99.67 / Max: 100.25Min: 98.79 / Avg: 99.27 / Max: 100.05Min: 98.59 / Avg: 99.34 / Max: 99.811. (CXX) g++ options: -O3 -fPIC

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility, using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default1230.29660.59320.88981.18641.483SE +/- 0.000, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 31.3131.3151.3181. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default123246810Min: 1.31 / Avg: 1.31 / Max: 1.31Min: 1.31 / Avg: 1.32 / Max: 1.32Min: 1.31 / Avg: 1.32 / Max: 1.331. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2123600K1200K1800K2400K3000KSE +/- 93.33, N = 3SE +/- 2042.95, N = 3SE +/- 867.58, N = 3295417729465802957670
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2123500K1000K1500K2000K2500KMin: 2954030 / Avg: 2954176.67 / Max: 2954350Min: 2942500 / Avg: 2946580 / Max: 2948810Min: 2956560 / Avg: 2957670 / Max: 2959380

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-Only1234080120160200SE +/- 0.68, N = 3SE +/- 1.07, N = 3SE +/- 1.31, N = 3186.64186.19186.87
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-Only123306090120150Min: 185.5 / Avg: 186.64 / Max: 187.86Min: 184.41 / Avg: 186.19 / Max: 188.1Min: 184.25 / Avg: 186.87 / Max: 188.3

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile performs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough123816243240SE +/- 0.40, N = 3SE +/- 0.42, N = 3SE +/- 0.40, N = 633.3333.2433.341. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough123714212835Min: 32.52 / Avg: 33.33 / Max: 33.74Min: 32.39 / Avg: 33.24 / Max: 33.68Min: 31.33 / Avg: 33.34 / Max: 33.771. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4123700K1400K2100K2800K3500KSE +/- 1176.00, N = 3SE +/- 1202.26, N = 3SE +/- 369.56, N = 3326935332607103270433
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4123600K1200K1800K2400K3000KMin: 3267420 / Avg: 3269353.33 / Max: 3271480Min: 3258800 / Avg: 3260710 / Max: 3262930Min: 3269800 / Avg: 3270433.33 / Max: 3271080

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pickle_pure_python12370140210280350338339339

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet1230.82351.6472.47053.2944.1175SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.663.653.66MIN: 3.6 / MAX: 3.77MIN: 3.6 / MAX: 3.7MIN: 3.61 / MAX: 3.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet123246810Min: 3.65 / Avg: 3.66 / Max: 3.68Min: 3.63 / Avg: 3.65 / Max: 3.66Min: 3.65 / Avg: 3.66 / Max: 3.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, CUDA, or NVIDIA OptiX is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CPU-Only123140280420560700SE +/- 1.32, N = 3SE +/- 1.12, N = 3SE +/- 0.68, N = 3651.27650.83650.06
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CPU-Only123110220330440550Min: 648.83 / Avg: 651.27 / Max: 653.38Min: 648.71 / Avg: 650.83 / Max: 652.53Min: 649.25 / Avg: 650.06 / Max: 651.41

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric12320K40K60K80K100K9997799832997921. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lpthread -ldl -luuid -lm

Caffe

This is a benchmark of the Caffe deep learning framework that currently supports the AlexNet and GoogleNet models, with execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012340K80K120K160K200KSE +/- 124.50, N = 3SE +/- 420.35, N = 3SE +/- 147.41, N = 32079022078892082641. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012340K80K120K160K200KMin: 207662 / Avg: 207901.67 / Max: 208080Min: 207245 / Avg: 207889 / Max: 208679Min: 208054 / Avg: 208263.67 / Max: 2085481. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  1: 2.58081 (SE +/- 0.02285, N = 3; Min: 2.56 / Avg: 2.58 / Max: 2.63; MIN: 2.21)
  2: 2.57926 (SE +/- 0.03056, N = 3; Min: 2.55 / Avg: 2.58 / Max: 2.64; MIN: 2.16)
  3: 2.58232 (SE +/- 0.02621, N = 3; Min: 2.55 / Avg: 2.58 / Max: 2.63; MIN: 2.21)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

PyPerformance 1.0.0 - Benchmark: crypto_pyaes (Milliseconds, Fewer Is Better)
  1: 85.8 (SE +/- 0.03, N = 3; Min: 85.7 / Avg: 85.77 / Max: 85.8)
  2: 85.8 (SE +/- 0.06, N = 3; Min: 85.7 / Avg: 85.8 / Max: 85.9)
  3: 85.9 (SE +/- 0.03, N = 3; Min: 85.9 / Avg: 85.93 / Max: 86)

PyPerformance 1.0.0 - Benchmark: float (Milliseconds, Fewer Is Better)
  1: 90.0 (SE +/- 0.03, N = 3; Min: 89.9 / Avg: 89.97 / Max: 90)
  2: 90.0 (SE +/- 0.06, N = 3; Min: 89.9 / Avg: 90 / Max: 90.1)
  3: 90.1 (SE +/- 0.36, N = 3; Min: 89.6 / Avg: 90.1 / Max: 90.8)

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  1: 269.71 (SE +/- 0.03, N = 3; Min: 269.66 / Avg: 269.71 / Max: 269.75; MIN: 268.38 / MAX: 284.03)
  2: 269.66 (SE +/- 0.07, N = 3; Min: 269.54 / Avg: 269.66 / Max: 269.77; MIN: 268.59 / MAX: 282.48)
  3: 269.44 (SE +/- 0.13, N = 3; Min: 269.21 / Avg: 269.43 / Max: 269.65; MIN: 268.04 / MAX: 281.47)
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encoding utility, using a sample 6000x4000 pixel JPEG image as input. Learn more via the OpenBenchmarking.org test page.
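The two encode settings below presumably map to cwebp's -q (quality) and -m (compression method, 0-6) switches. A minimal timing sketch, assuming cwebp is on the PATH and using a hypothetical input filename:

    import subprocess
    import time

    # -q 100 alone corresponds to "Quality 100"; adding -m 6, the slowest
    # and highest-effort method, gives the "Highest Compression" variant.
    start = time.perf_counter()
    subprocess.run(["cwebp", "-q", "100", "-m", "6", "sample.jpg",
                    "-o", "out.webp"], check=True, capture_output=True)
    print(f"encode time: {time.perf_counter() - start:.3f} s")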

WebP Image Encode 1.1 - Encode Settings: Quality 100 (Encode Time - Seconds, Fewer Is Better)
  1: 2.077 (SE +/- 0.002, N = 3; Min: 2.07 / Avg: 2.08 / Max: 2.08)
  2: 2.076 (SE +/- 0.000, N = 3; Min: 2.08 / Avg: 2.08 / Max: 2.08)
  3: 2.078 (SE +/- 0.001, N = 3; Min: 2.08 / Avg: 2.08 / Max: 2.08)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode 1.1 - Encode Settings: Quality 100, Highest Compression (Encode Time - Seconds, Fewer Is Better)
  1: 6.385 (SE +/- 0.014, N = 3; Min: 6.37 / Avg: 6.38 / Max: 6.41)
  2: 6.386 (SE +/- 0.012, N = 3; Min: 6.37 / Avg: 6.39 / Max: 6.41)
  3: 6.382 (SE +/- 0.013, N = 3; Min: 6.37 / Avg: 6.38 / Max: 6.41)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is single-threaded and measures the time to denoise a sample 26-minute-long 16-bit RAW audio file with this noise suppression library. Learn more via the OpenBenchmarking.org test page.
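RNNoise's C API works on fixed 480-sample float frames (10 ms at 48 kHz) fed through rnnoise_process_frame. A hedged ctypes sketch of a single frame; the shared-library name is an assumption about the local build:

    import ctypes

    lib = ctypes.CDLL("librnnoise.so")  # assumed library name/location
    lib.rnnoise_create.restype = ctypes.c_void_p
    lib.rnnoise_create.argtypes = [ctypes.c_void_p]
    lib.rnnoise_process_frame.restype = ctypes.c_float
    lib.rnnoise_process_frame.argtypes = [
        ctypes.c_void_p,
        ctypes.POINTER(ctypes.c_float),  # denoised output frame
        ctypes.POINTER(ctypes.c_float),  # raw input frame
    ]

    state = lib.rnnoise_create(None)  # NULL selects the built-in model
    Frame = ctypes.c_float * 480
    raw, denoised = Frame(), Frame()  # one silent input frame
    vad = lib.rnnoise_process_frame(state, denoised, raw)
    print(f"voice activity probability: {vad:.3f}")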

RNNoise 2020-06-28 (Seconds, Fewer Is Better)
  1: 21.59 (SE +/- 0.02, N = 3; Min: 21.56 / Avg: 21.59 / Max: 21.63)
  2: 21.59 (SE +/- 0.01, N = 3; Min: 21.57 / Avg: 21.59 / Max: 21.6)
  3: 21.60 (SE +/- 0.04, N = 3; Min: 21.52 / Avg: 21.6 / Max: 21.65)
  1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.
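The measurement boils down to configuring the kernel tree with its defaults and timing one full parallel build. A minimal sketch, assuming an extracted Linux 5.4 source tree in ./linux-5.4:

    import multiprocessing
    import subprocess
    import time

    src = "linux-5.4"  # assumed location of the unpacked kernel sources
    subprocess.run(["make", "defconfig"], cwd=src, check=True)
    start = time.perf_counter()
    subprocess.run(["make", f"-j{multiprocessing.cpu_count()}"],
                   cwd=src, check=True)
    print(f"Time To Compile: {time.perf_counter() - start:.2f} s")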

Timed Linux Kernel Compilation 5.4 - Time To Compile (Seconds, Fewer Is Better; runs 1 and 2 only)
  1: 98.18 (SE +/- 0.05, N = 3; Min: 98.11 / Avg: 98.18 / Max: 98.29)
  2: 98.15 (SE +/- 0.19, N = 3; Min: 97.9 / Avg: 98.15 / Max: 98.51)

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

PyPerformance 1.0.0 - Benchmark: nbody (Milliseconds, Fewer Is Better)
  1: 103
  2: 103
  3: 103

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.
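The "Speed N Two-Pass" modes below presumably map to aomenc's --cpu-used and --passes switches. A hedged sketch with a hypothetical Y4M input file:

    import subprocess

    # Two-pass encode at speed level 4; lowering --cpu-used to 0 gives the
    # much slower "Speed 0" configuration. --fpf names the first-pass
    # statistics file shared between the two passes.
    subprocess.run(
        ["aomenc", "--passes=2", "--cpu-used=4", "--fpf=first_pass.log",
         "-o", "output.ivf", "input.y4m"],
        check=True,
    )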

AOM AV1 2.0 - Encoder Mode: Speed 4 Two-Pass (Frames Per Second, More Is Better)
  1: 2.72 (SE +/- 0.00, N = 3; Min: 2.72 / Avg: 2.72 / Max: 2.73)
  2: 2.72 (SE +/- 0.00, N = 3; Min: 2.71 / Avg: 2.72 / Max: 2.72)
  3: 2.72 (SE +/- 0.01, N = 3; Min: 2.71 / Avg: 2.72 / Max: 2.73)
  1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1 2.0 - Encoder Mode: Speed 0 Two-Pass (Frames Per Second, More Is Better)
  1: 0.33 (SE +/- 0.00, N = 3; Min: 0.32 / Avg: 0.33 / Max: 0.33)
  2: 0.33 (SE +/- 0.00, N = 3; Min: 0.32 / Avg: 0.33 / Max: 0.33)
  3: 0.33 (SE +/- 0.00, N = 3; Min: 0.32 / Avg: 0.33 / Max: 0.33)
  1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU, with optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

NeatBench 5 - Acceleration: GPU (FPS, More Is Better)
  1: 29.4 (SE +/- 0.06, N = 3; Min: 29.3 / Avg: 29.4 / Max: 29.5)
  2: 30.5 (SE +/- 0.50, N = 15; Min: 29.3 / Avg: 30.53 / Max: 34.8)
  3: 30.6 (SE +/- 0.50, N = 15; Min: 29.3 / Avg: 30.64 / Max: 34.6)

NeatBench 5 - Acceleration: CPU (FPS, More Is Better)
  1: 10.9 (SE +/- 1.26, N = 16; Min: 6 / Avg: 10.89 / Max: 15.9)
  2: 10.8 (SE +/- 1.23, N = 16; Min: 6 / Avg: 10.75 / Max: 15.6)
  3: 10.9 (SE +/- 1.26, N = 16; Min: 6 / Avg: 10.89 / Max: 15.9)

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
  1: 8.67 (SE +/- 0.18, N = 3; Min: 8.32 / Avg: 8.67 / Max: 8.9; MIN: 8.03 / MAX: 53)
  2: 8.32 (SE +/- 0.02, N = 3; Min: 8.3 / Avg: 8.32 / Max: 8.35; MIN: 8.01 / MAX: 8.7)
  3: 8.70 (SE +/- 0.34, N = 3; Min: 8.34 / Avg: 8.7 / Max: 9.38; MIN: 8.04 / MAX: 83.54)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
  1: 3.53 (SE +/- 0.31, N = 3; Min: 3.22 / Avg: 3.53 / Max: 4.15; MIN: 3.2 / MAX: 21.92)
  2: 3.22 (SE +/- 0.00, N = 3; Min: 3.22 / Avg: 3.22 / Max: 3.22; MIN: 3.21 / MAX: 3.25)
  3: 3.23 (SE +/- 0.00, N = 3; Min: 3.23 / Avg: 3.23 / Max: 3.24; MIN: 3.22 / MAX: 3.35)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench to facilitate the database benchmarks. Learn more via the OpenBenchmarking.org test page.
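For a closed-loop benchmark like pgbench, throughput and average latency are two views of the same run: with every client always waiting on one in-flight transaction, TPS is roughly clients divided by average latency. A sanity check against run 1 of the 250-client read-write results below:

    clients = 250
    avg_latency_s = 0.41973  # 419.73 ms average latency (run 1)
    print(f"~{clients / avg_latency_s:.0f} TPS")  # ~596 vs. 599 reported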

PostgreSQL pgbench 13.0 - Scaling Factor: 1 - Clients: 250 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
  1: 419.73 (SE +/- 7.83, N = 15; Min: 353.46 / Avg: 419.73 / Max: 450.05)
  2: 424.26 (SE +/- 4.74, N = 15; Min: 381.95 / Avg: 424.26 / Max: 450.12)
  3: 449.11 (SE +/- 4.44, N = 3; Min: 440.96 / Avg: 449.11 / Max: 456.25)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0 - Scaling Factor: 1 - Clients: 250 - Mode: Read Write (TPS, More Is Better)
  1: 599 (SE +/- 11.83, N = 15; Min: 555.78 / Avg: 598.98 / Max: 707.59)
  2: 591 (SE +/- 6.84, N = 15; Min: 555.59 / Avg: 590.58 / Max: 654.7)
  3: 557 (SE +/- 5.55, N = 3; Min: 548.17 / Avg: 556.98 / Max: 567.22)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0 - Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
  1: 2.084 (SE +/- 0.025, N = 3; Min: 2.05 / Avg: 2.08 / Max: 2.13)
  2: 2.412 (SE +/- 0.138, N = 12; Min: 1.98 / Avg: 2.41 / Max: 3.19)
  3: 2.312 (SE +/- 0.108, N = 12; Min: 1.97 / Avg: 2.31 / Max: 3.07)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0 - Scaling Factor: 1 - Clients: 1 - Mode: Read Write (TPS, More Is Better)
  1: 480 (SE +/- 5.79, N = 3; Min: 468.53 / Avg: 479.93 / Max: 487.4)
  2: 429 (SE +/- 22.30, N = 12; Min: 313.38 / Avg: 428.64 / Max: 504.63)
  3: 442 (SE +/- 17.55, N = 12; Min: 326.1 / Avg: 441.57 / Max: 506.73)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

DaCapo Benchmark

This test runs the DaCapo Benchmarks, a suite written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.
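Each DaCapo workload launches straight from the suite's jar. A minimal sketch for the H2 in-memory database workload reported below; the jar filename is assumed from the 9.12-MR1 release naming:

    import subprocess

    # Runs the H2 workload once; DaCapo prints the wall time in msec,
    # which is the figure reported below.
    subprocess.run(["java", "-jar", "dacapo-9.12-MR1-bach.jar", "h2"],
                   check=True)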

DaCapo Benchmark 9.12-MR1 - Java Test: H2 (msec, Fewer Is Better)
  1: 2864 (SE +/- 46.69, N = 19; Min: 2537 / Avg: 2863.58 / Max: 3247)
  2: 2925 (SE +/- 39.90, N = 20; Min: 2663 / Avg: 2925.3 / Max: 3239)
  3: 2775 (SE +/- 38.16, N = 4; Min: 2697 / Avg: 2775.25 / Max: 2880)

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
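LAMMPS reports throughput as simulated nanoseconds per day of wall time. A worked example of the unit with hypothetical run parameters (rhodopsin inputs commonly use a 2 fs timestep; the step count and wall time here are made up for illustration):

    timestep_fs = 2.0   # assumed timestep
    steps = 100         # hypothetical number of MD steps
    wall_seconds = 2.6  # hypothetical wall time for those steps

    simulated_ns = timestep_fs * steps * 1e-6  # fs -> ns
    ns_per_day = simulated_ns / wall_seconds * 86400
    print(f"{ns_per_day:.3f} ns/day")  # ~6.6, the range reported below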

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
  1: 6.600 (SE +/- 0.086, N = 15; Min: 5.57 / Avg: 6.6 / Max: 6.83)
  2: 6.805 (SE +/- 0.019, N = 3; Min: 6.77 / Avg: 6.81 / Max: 6.83)
  3: 6.322 (SE +/- 0.197, N = 12; Min: 4.38 / Avg: 6.32 / Max: 6.85)
  1. (CXX) g++ options: -O3 -pthread -lm

Rodinia

Rodinia is a suite of compute-intensive applications focused on accelerators. The included applications support the CUDA, OpenMP, and OpenCL parallel models. This profile currently utilizes select OpenCL, NVIDIA CUDA, and OpenMP test binaries. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
  1: 19.56 (SE +/- 0.06, N = 3; Min: 19.48 / Avg: 19.56 / Max: 19.69)
  2: 19.63 (SE +/- 0.08, N = 3; Min: 19.48 / Avg: 19.63 / Max: 19.73)
  3: 69.37 (SE +/- 7.66, N = 12; Min: 21.89 / Avg: 69.37 / Max: 90.86)
  1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
  1: 22.65 (SE +/- 0.23, N = 3; Min: 22.23 / Avg: 22.65 / Max: 23.04)
  2: 22.74 (SE +/- 0.15, N = 3; Min: 22.5 / Avg: 22.74 / Max: 23.02)
  3: 265.62 (SE +/- 37.41, N = 7; Min: 44.22 / Avg: 265.62 / Max: 329.01)
  1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP HotSpot3D (Seconds, Fewer Is Better)
  1: 92.67 (SE +/- 1.38, N = 3; Min: 90.04 / Avg: 92.67 / Max: 94.72)
  2: 89.73 (SE +/- 1.37, N = 3; Min: 88.17 / Avg: 89.73 / Max: 92.47)
  3: 213.52 (SE +/- 13.20, N = 12; Min: 112.13 / Avg: 213.51 / Max: 250.91)
  1. (CXX) g++ options: -O2 -lOpenCL

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
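lc0 includes a benchmark subcommand, and the compute backend is chosen with --backend. A hedged sketch matching the Eigen (CPU) runs below; the weights filename is hypothetical, as a network file must be supplied separately:

    import subprocess

    # Searches a fixed set of positions and reports nodes per second.
    subprocess.run(
        ["lc0", "benchmark", "--backend=eigen", "--weights=network.pb.gz"],
        check=True,
    )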

LeelaChessZero 0.26 - Backend: Eigen (Nodes Per Second, More Is Better)
  1: 751 (SE +/- 8.19, N = 3; Min: 740 / Avg: 751 / Max: 767)
  2: 762 (SE +/- 4.63, N = 3; Min: 754 / Avg: 762.33 / Max: 770)
  3: 671 (SE +/- 13.55, N = 9; Min: 609 / Avg: 670.89 / Max: 722)
  1. (CXX) g++ options: -flto -pthread

165 Results Shown

FFTE
Incompact3D
Rodinia
NAMD
oneDNN:
  Deconvolution Batch deconv_3d - u8s8f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  IP Batch 1D - f32 - CPU
Rodinia
NCNN
Kvazaar
DaCapo Benchmark
oneDNN
OCRMyPDF
Blender
oneDNN
DaCapo Benchmark
LeelaChessZero
NCNN:
  Vulkan GPU - mobilenet
  CPU-v2-v2 - mobilenet-v2
oneDNN
NCNN
Rodinia
NCNN:
  CPU - blazeface
  CPU-v3-v3 - mobilenet-v3
PostgreSQL pgbench
NCNN
Kvazaar
x265
NCNN:
  CPU - yolov4-tiny
  CPU - shufflenet-v2
  Vulkan GPU-v2-v2 - mobilenet-v2
  CPU - alexnet
  CPU - resnet18
Kvazaar
oneDNN
dav1d
NCNN
dav1d
Kvazaar
PostgreSQL pgbench
NCNN
PostgreSQL pgbench:
  1 - 100 - Read Only - Average Latency
  1 - 50 - Read Write
  1 - 100 - Read Only
  1 - 50 - Read Write - Average Latency
  1 - 50 - Read Only - Average Latency
  1 - 50 - Read Only
  1 - 250 - Read Only - Average Latency
  1 - 250 - Read Only
oneDNN
Dolfyn
oneDNN
NCNN
PostgreSQL pgbench
OpenSSL
NCNN
PostgreSQL pgbench
LibRaw
oneDNN
PyPerformance:
  django_template
  regex_compile
InfluxDB
BYTE Unix Benchmark
PyPerformance
Caffe
oneDNN
NCNN
WireGuard + Linux Networking Stack Stress Test
NCNN
ASTC Encoder
TNN
InfluxDB
Kvazaar
Blender
Java Gradle Build
dav1d
NCNN
Kvazaar
PyPerformance
Timed LLVM Compilation
oneDNN
AOM AV1
LAMMPS Molecular Dynamics Simulator
Sunflow Rendering System
NCNN
Blender
TensorFlow Lite
Tesseract OCR
PyPerformance
TensorFlow Lite
InfluxDB
PyPerformance
NCNN
ASTC Encoder
AOM AV1
NCNN:
  Vulkan GPU - shufflenet-v2
  Vulkan GPU - efficientnet-b0
TensorFlow Lite
GROMACS
Caffe
Kvazaar
TensorFlow Lite
Timed HMMer Search
AOM AV1
WebP Image Encode
Blender:
  Pabellon Barcelona - NVIDIA OptiX
  Classroom - CPU-Only
RawTherapee
NCNN
WebP Image Encode
Blender
ASTC Encoder
DaCapo Benchmark
libavif avifenc
Caffe
Hierarchical INTegration
Blender
libavif avifenc
Blender
PyPerformance
Kvazaar
PyPerformance:
  json_loads
  chaos
oneDNN
libavif avifenc
KeyDB
dav1d
NCNN
x265
libavif avifenc
WebP Image Encode
TensorFlow Lite
Blender
ASTC Encoder
TensorFlow Lite
PyPerformance
NCNN
Blender
BRL-CAD
Caffe
oneDNN
PyPerformance:
  crypto_pyaes
  float
TNN
WebP Image Encode:
  Quality 100
  Quality 100, Highest Compression
RNNoise
Timed Linux Kernel Compilation
PyPerformance
AOM AV1:
  Speed 4 Two-Pass
  Speed 0 Two-Pass
NeatBench:
  GPU
  CPU
NCNN:
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - googlenet
PostgreSQL pgbench:
  1 - 250 - Read Write - Average Latency
  1 - 250 - Read Write
  1 - 1 - Read Write - Average Latency
  1 - 1 - Read Write
DaCapo Benchmark
LAMMPS Molecular Dynamics Simulator
Rodinia:
  OpenMP Streamcluster
  OpenMP CFD Solver
  OpenMP HotSpot3D
LeelaChessZero