CompuLab Airtop 3

Intel Xeon E-2288G testing with a Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2011040-FI-COMPULABA81
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 3 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
Timed Code Compilation 2 Tests
C/C++ Compiler Tests 11 Tests
CPU Massive 19 Tests
Creator Workloads 16 Tests
Database Test Suite 3 Tests
Encoding 5 Tests
Fortran Tests 4 Tests
Game Development 2 Tests
HPC - High Performance Computing 15 Tests
Imaging 4 Tests
Java 3 Tests
Common Kernel Benchmarks 3 Tests
Machine Learning 7 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 3 Tests
Multi-Core 15 Tests
NVIDIA GPU Compute 7 Tests
OCR 2 Tests
OpenMPI Tests 4 Tests
Programmer / Developer System Benchmarks 3 Tests
Python Tests 4 Tests
Scientific Computing 7 Tests
Server 4 Tests
Server CPU Tests 11 Tests
Single-Threaded 3 Tests
Video Encoding 5 Tests
Common Workstation Benchmarks 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
October 31 2020
  13 Hours, 46 Minutes
2
November 02 2020
  13 Hours, 25 Minutes
3
November 03 2020
  14 Hours, 45 Minutes
Invert Hiding All Results Option
  13 Hours, 59 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


CompuLab Airtop 3ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution123Intel Xeon E-2288G @ 5.00GHz (8 Cores / 16 Threads)Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS)Intel Cannon Lake PCH64GBSamsung SSD 970 EVO Plus 250GBNVIDIA Quadro RTX 4000 8GB (1005/6500MHz)Intel Cannon Lake PCH cAVSVE228Intel I219-LM + Intel I210Ubuntu 20.105.8.0-26-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9NVIDIA 455.284.6.0OpenCL 1.2 CUDA 11.1.961.2.142GCC 10.2.0ext41920x1080NVIDIA Quadro RTX 4000 8GB (300/405MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 2.3OpenCL Details- GPU Compute Cores: 2304Java Details- OpenJDK Runtime Environment (build 11.0.9+11-Ubuntu-0ubuntu1)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

123Result OverviewPhoronix Test Suite100%134%168%202%236%RodiniaFFTEIncompact3DNAMDLeelaChessZeroOCRMyPDFLAMMPS Molecular Dynamics SimulatorDaCapo BenchmarkPostgreSQL pgbenchKvazaaroneDNNNeatBenchDolfynx265OpenSSLLibRawNCNNdav1dBYTE Unix BenchmarkWireGuard + Linux Networking Stack Stress TestJava Gradle BuildInfluxDBSunflow Rendering SystemBlenderTesseract OCRGROMACSASTC EncoderTimed HMMer SearchTensorFlow LiteRawTherapeeTNNHierarchical INTegrationKeyDBlibavif avifencPyPerformanceAOM AV1BRL-CADWebP Image EncodeRNNoiseCaffe

CompuLab Airtop 3lammps: 20k Atomsblender: Barbershop - NVIDIA OptiXjava-gradle-perf: Reactorblender: Barbershop - CPU-Onlybuild-llvm: Time To Compileblender: Pabellon Barcelona - CPU-Onlyblender: Classroom - CPU-Onlylczero: Eigenincompact3d: Cylinderhint: FLOATlczero: BLASrodinia: OpenMP HotSpot3Dbrl-cad: VGR Performance Metricrodinia: OpenMP LavaMDastcenc: Exhaustiveblender: Fishy Cat - CPU-Onlyrodinia: OpenMP CFD Solvergromacs: Water Benchmarkcaffe: GoogleNet - CPU - 200blender: BMW27 - CPU-Onlyrodinia: OpenMP Leukocytewireguard: tensorflow-lite: Inception V4tensorflow-lite: Inception ResNet V2blender: Pabellon Barcelona - NVIDIA OptiXkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumdav1d: Chimera 1080p 10-bitbyte: Dhrystone 2rodinia: OpenMP Streamclusterblender: Classroom - NVIDIA OptiXcaffe: GoogleNet - CPU - 100namd: ATPase Simulation - 327,506 Atomshmmer: Pfam Database Searchavifenc: 0build-linux-kernel: Time To Compilepgbench: 1 - 250 - Read Write - Average Latencypgbench: 1 - 250 - Read Writecaffe: AlexNet - CPU - 200pgbench: 1 - 1 - Read Write - Average Latencypgbench: 1 - 1 - Read Writeinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000influxdb: 1024 - 10000 - 2,5000,1 - 10000onednn: Deconvolution Batch deconv_1d - f32 - CPUkeydb: onednn: IP Batch 1D - f32 - CPUpyperformance: raytracencnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: CPU - squeezenetrawtherapee: Total Benchmark Timetensorflow-lite: SqueezeNettensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Quanttensorflow-lite: Mobilenet Floatavifenc: 2onednn: IP Batch All - u8s8f32 - CPUonednn: IP Batch All - f32 - CPUsunflow: Global Illumination + Image Synthesispyperformance: python_startupblender: Fishy Cat - NVIDIA OptiXrodinia: OpenCL Myocytekvazaar: Bosphorus 4K - Very Fastastcenc: Thoroughx265: Bosphorus 4Kpyperformance: 2to3onednn: IP Batch 1D - u8s8f32 - CPUcaffe: AlexNet - CPU - 100pyperformance: goonednn: Recurrent Neural Network Training - f32 - CPUpgbench: 1 - 250 - Read Only - Average Latencypgbench: 1 - 250 - Read Onlyonednn: Recurrent Neural Network Inference - f32 - CPUlibraw: Post-Processing Benchmarkwebp: Quality 100, Lossless, Highest Compressionkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumpyperformance: floataom-av1: Speed 0 Two-Passpyperformance: django_templatepyperformance: chaospyperformance: crypto_pyaesdacapobench: H2blender: BMW27 - NVIDIA OptiXpyperformance: regex_compilekvazaar: Bosphorus 4K - Ultra Fastaom-av1: Speed 6 Realtimepgbench: 1 - 100 - Read Write - Average Latencypgbench: 1 - 100 - Read Writepgbench: 1 - 50 - Read Write - Average Latencypgbench: 1 - 50 - Read Writepgbench: 1 - 1 - Read Only - Average Latencypgbench: 1 - 1 - Read Onlypgbench: 1 - 100 - Read Only - Average Latencypgbench: 1 - 100 - Read Onlypgbench: 1 - 50 - Read Only - Average Latencypgbench: 1 - 50 - Read Onlyaom-av1: Speed 6 Two-Passdav1d: Summer Nature 4Kpyperformance: pathlibocrmypdf: Processing 60 Page PDF Documentrnnoise: onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPUtesseract-ocr: Time To OCR 7 Imagesopenssl: RSA 4096-bit Performancetnn: CPU - MobileNet v2pyperformance: pickle_pure_pythonpyperformance: json_loadsneatbench: CPUtnn: CPU - SqueezeNet v1.1pyperformance: nbodydav1d: Chimera 1080paom-av1: Speed 4 Two-Passncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU - squeezenetdolfyn: Computational Fluid Dynamicswebp: Quality 100, Losslesskvazaar: Bosphorus 1080p - Very Fastaom-av1: Speed 8 Realtimedacapobench: Tradesoaponednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUlammps: Rhodopsin Proteinkvazaar: Bosphorus 1080p - Ultra Fastx265: Bosphorus 1080pdacapobench: Tradebeansastcenc: Mediumonednn: Deconvolution Batch deconv_3d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUwebp: Quality 100, Highest Compressiondacapobench: Jythondav1d: Summer Nature 1080ponednn: Convolution Batch Shapes Auto - u8s8f32 - CPUastcenc: Fastavifenc: 8avifenc: 10onednn: Deconvolution Batch deconv_3d - f32 - CPUneatbench: GPUffte: N=256, 3D Complex FFT Routinewebp: Quality 100webp: Default1235.8511312.28197.762787.85771.207651.27584.98751378.478658472824990.6232584492.66899977276.215274.72271.1622.6490.820207902186.64140.313160.51332693532954177145.474.154.21114.5548134384.719.563104.111037861.94453100.12399.66998.182419.725599802732.0844801507808.81534293.61535930.84.89651737463.314.4962737428.7028.8014.3815.2067.2214.991.466.693.923.214.065.1119.3016.1061.74922785220101515725015391958.66427.636468.05251.1386.8054.9334.78611.9533.3312.372641.8010340095200321.8030.982254678149.61833.3134.70018.2418.6990.00.3338.685.785.8286427.9013821.8723.54176.12656881.8656110.03334140.3652744630.1732894014.30156.4014.422.02421.5875.7059420.1342431.8286.03533821.310.9269.712103639.972.728.673.772.101.688.363.530.612.671.481.301.671.454.633.6616.83315.61346.7345.9236523.999112.580816.60087.1654.4926908.483.5870117.11836.3853709581.3416.44185.535.0224.7576.6426129.431760.0230383122.0771.3135.9091264.86199.454783.83762.884650.83587.97762377.557241474322870.4901683789.72999832273.155273.46269.6722.7400.815207889186.19138.389158.36032607102946580144.874.174.23116.3648808319.119.631103.421033491.93847100.23699.27398.147424.257591799222.4124291526391.31543312.51557794.24.67392736916.334.1315337529.5828.9114.3315.4966.6015.091.456.683.913.174.075.1319.2216.1061.85422623919950415629915322158.39927.370768.99941.1326.7454.7134.51412.0533.2412.392611.8057139759199326.9600.995251429139.87133.6634.84418.5818.9290.00.3338.985.885.8292527.7613722.4523.44173.12057883.8865960.030332670.3562812910.1692960884.29160.4714.321.06721.5935.5935720.3152467.9287.55933921.210.8269.656103647.742.728.323.742.081.728.323.220.622.651.481.311.681.424.583.6516.71115.60348.2446.2635504.069702.579266.80592.5356.1426458.443.2418917.13976.3863698582.1316.74415.524.9954.7386.4734930.531509.6340982312.0761.3155.9051325.38197.054787.49650.06584.40671605.889669471834576.34719809213.51599792296.269275.59270.29265.6240.821208264186.87214.308159.75332704332957670145.794.154.20113.4148679363.369.374104.691039032.62968100.83299.341449.114557805072.3124421508093.51530241.01539617.54.70756734176.594.1312937729.0729.2014.7215.6167.8215.381.506.854.173.264.205.3119.2316.3862.12422759920126515771915436258.66427.347368.59481.1276.7555.2335.76011.9133.3412.342611.8670640315200324.3351.004249348136.15133.8734.61218.1118.6990.10.3339.286.185.9277527.8613921.9223.36173.23957882.7336040.031325790.3602780190.1712919954.27156.8414.522.09421.5975.7069020.2242474.6284.02533921.210.9269.435103641.982.728.703.762.091.668.423.230.632.651.501.311.681.464.773.6617.02715.69446.7945.7837043.876142.582326.32287.1854.4228018.553.3040217.19676.3823718579.6716.60465.555.0174.7336.6117230.619217.7211940452.0781.318OpenBenchmarking.org

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1231.32952.6593.98855.3186.6475SE +/- 0.013, N = 3SE +/- 0.005, N = 3SE +/- 0.046, N = 35.8515.9095.9051. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms123246810Min: 5.83 / Avg: 5.85 / Max: 5.88Min: 5.9 / Avg: 5.91 / Max: 5.92Min: 5.84 / Avg: 5.91 / Max: 5.991. (CXX) g++ options: -O3 -pthread -lm

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiX12330060090012001500SE +/- 19.68, N = 4SE +/- 20.32, N = 3SE +/- 22.81, N = 31312.281264.861325.38
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiX1232004006008001000Min: 1253.45 / Avg: 1312.28 / Max: 1334.9Min: 1224.32 / Avg: 1264.86 / Max: 1287.66Min: 1279.77 / Avg: 1325.38 / Max: 1348.74

Java Gradle Build

This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: Reactor1234080120160200SE +/- 2.21, N = 12SE +/- 2.22, N = 12SE +/- 2.36, N = 12197.76199.45197.05
OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: Reactor1234080120160200Min: 188.13 / Avg: 197.76 / Max: 216Min: 192.34 / Avg: 199.45 / Max: 220.25Min: 181.51 / Avg: 197.05 / Max: 210.63

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-Only1232004006008001000SE +/- 1.21, N = 3SE +/- 0.37, N = 3SE +/- 0.65, N = 3787.85783.83787.49
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-Only123140280420560700Min: 786.16 / Avg: 787.85 / Max: 790.19Min: 783.27 / Avg: 783.83 / Max: 784.52Min: 786.81 / Avg: 787.49 / Max: 788.79

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile12170340510680850SE +/- 2.53, N = 3SE +/- 1.50, N = 3771.21762.88
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile12140280420560700Min: 767.83 / Avg: 771.21 / Max: 776.17Min: 760.16 / Avg: 762.88 / Max: 765.34

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CPU-Only123140280420560700SE +/- 1.32, N = 3SE +/- 1.12, N = 3SE +/- 0.68, N = 3651.27650.83650.06
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CPU-Only123110220330440550Min: 648.83 / Avg: 651.27 / Max: 653.38Min: 648.71 / Avg: 650.83 / Max: 652.53Min: 649.25 / Avg: 650.06 / Max: 651.41

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-Only123130260390520650SE +/- 0.63, N = 3SE +/- 1.27, N = 3SE +/- 0.47, N = 3584.98587.97584.40
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-Only123100200300400500Min: 583.76 / Avg: 584.98 / Max: 585.88Min: 585.46 / Avg: 587.97 / Max: 589.52Min: 583.49 / Avg: 584.4 / Max: 585.09

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Eigen123160320480640800SE +/- 8.19, N = 3SE +/- 4.63, N = 3SE +/- 13.55, N = 97517626711. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Eigen123130260390520650Min: 740 / Avg: 751 / Max: 767Min: 754 / Avg: 762.33 / Max: 770Min: 609 / Avg: 670.89 / Max: 7221. (CXX) g++ options: -flto -pthread

Incompact3D

Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterIncompact3D 2020-09-17Input: Cylinder123130260390520650SE +/- 1.16, N = 3SE +/- 1.24, N = 3SE +/- 3.73, N = 3378.48377.56605.891. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
OpenBenchmarking.orgSeconds, Fewer Is BetterIncompact3D 2020-09-17Input: Cylinder123110220330440550Min: 376.21 / Avg: 378.48 / Max: 380.06Min: 375.1 / Avg: 377.56 / Max: 379Min: 600.85 / Avg: 605.89 / Max: 613.171. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT123100M200M300M400M500MSE +/- 493184.05, N = 3SE +/- 93008.17, N = 3SE +/- 1375357.19, N = 3472824990.62474322870.49471834576.351. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT12380M160M240M320M400MMin: 472204306.73 / Avg: 472824990.62 / Max: 473799226.51Min: 474146266.95 / Avg: 474322870.49 / Max: 474461768.19Min: 469295372.86 / Avg: 471834576.35 / Max: 474020232.921. (CC) gcc options: -O3 -march=native -lm

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS1232004006008001000SE +/- 10.82, N = 38448378091. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS123150300450600750Min: 823 / Avg: 844 / Max: 8591. (CXX) g++ options: -flto -pthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3D12350100150200250SE +/- 1.38, N = 3SE +/- 1.37, N = 3SE +/- 13.20, N = 1292.6789.73213.521. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3D1234080120160200Min: 90.04 / Avg: 92.67 / Max: 94.72Min: 88.17 / Avg: 89.73 / Max: 92.47Min: 112.13 / Avg: 213.51 / Max: 250.911. (CXX) g++ options: -O2 -lOpenCL

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric12320K40K60K80K100K9997799832997921. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lpthread -ldl -luuid -lm

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMD12360120180240300SE +/- 1.21, N = 3SE +/- 0.19, N = 3SE +/- 3.61, N = 3276.22273.16296.271. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMD12350100150200250Min: 273.93 / Avg: 276.21 / Max: 278.07Min: 272.86 / Avg: 273.15 / Max: 273.52Min: 291.76 / Avg: 296.27 / Max: 303.411. (CXX) g++ options: -O2 -lOpenCL

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive12360120180240300SE +/- 0.80, N = 3SE +/- 0.56, N = 3SE +/- 0.84, N = 3274.72273.46275.591. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive12350100150200250Min: 273.3 / Avg: 274.72 / Max: 276.07Min: 272.37 / Avg: 273.46 / Max: 274.26Min: 274.09 / Avg: 275.59 / Max: 276.991. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CPU-Only12360120180240300SE +/- 0.29, N = 3SE +/- 0.26, N = 3SE +/- 0.29, N = 3271.16269.67270.29
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CPU-Only12350100150200250Min: 270.59 / Avg: 271.16 / Max: 271.52Min: 269.16 / Avg: 269.67 / Max: 270.04Min: 269.74 / Avg: 270.29 / Max: 270.71

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solver12360120180240300SE +/- 0.23, N = 3SE +/- 0.15, N = 3SE +/- 37.41, N = 722.6522.74265.621. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solver12350100150200250Min: 22.23 / Avg: 22.65 / Max: 23.04Min: 22.5 / Avg: 22.74 / Max: 23.02Min: 44.22 / Avg: 265.62 / Max: 329.011. (CXX) g++ options: -O2 -lOpenCL

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.18470.36940.55410.73880.9235SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 30.8200.8150.8211. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark123246810Min: 0.82 / Avg: 0.82 / Max: 0.82Min: 0.81 / Avg: 0.82 / Max: 0.82Min: 0.82 / Avg: 0.82 / Max: 0.821. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012340K80K120K160K200KSE +/- 124.50, N = 3SE +/- 420.35, N = 3SE +/- 147.41, N = 32079022078892082641. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012340K80K120K160K200KMin: 207662 / Avg: 207901.67 / Max: 208080Min: 207245 / Avg: 207889 / Max: 208679Min: 208054 / Avg: 208263.67 / Max: 2085481. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-Only1234080120160200SE +/- 0.68, N = 3SE +/- 1.07, N = 3SE +/- 1.31, N = 3186.64186.19186.87
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-Only123306090120150Min: 185.5 / Avg: 186.64 / Max: 187.86Min: 184.41 / Avg: 186.19 / Max: 188.1Min: 184.25 / Avg: 186.87 / Max: 188.3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocyte12350100150200250SE +/- 1.02, N = 3SE +/- 0.74, N = 3SE +/- 0.84, N = 3140.31138.39214.311. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocyte1234080120160200Min: 138.61 / Avg: 140.31 / Max: 142.13Min: 136.96 / Avg: 138.39 / Max: 139.41Min: 213.19 / Avg: 214.31 / Max: 215.961. (CXX) g++ options: -O2 -lOpenCL

WireGuard + Linux Networking Stack Stress Test

This is a benchmark of the WireGuard secure VPN tunnel and Linux networking stack stress test. The test runs on the local host but does require root permissions to run. The way it works is it creates three namespaces. ns0 has a loopback device. ns1 and ns2 each have wireguard devices. Those two wireguard devices send traffic through the loopback device of ns0. The end result of this is that tests wind up testing encryption and decryption at the same time -- a pretty CPU and scheduler-heavy workflow. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWireGuard + Linux Networking Stack Stress Test1234080120160200SE +/- 0.83, N = 3SE +/- 0.95, N = 3SE +/- 0.79, N = 3160.51158.36159.75
OpenBenchmarking.orgSeconds, Fewer Is BetterWireGuard + Linux Networking Stack Stress Test123306090120150Min: 159.13 / Avg: 160.51 / Max: 162Min: 156.69 / Avg: 158.36 / Max: 159.99Min: 158.37 / Avg: 159.75 / Max: 161.09

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4123700K1400K2100K2800K3500KSE +/- 1176.00, N = 3SE +/- 1202.26, N = 3SE +/- 369.56, N = 3326935332607103270433
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4123600K1200K1800K2400K3000KMin: 3267420 / Avg: 3269353.33 / Max: 3271480Min: 3258800 / Avg: 3260710 / Max: 3262930Min: 3269800 / Avg: 3270433.33 / Max: 3271080

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2123600K1200K1800K2400K3000KSE +/- 93.33, N = 3SE +/- 2042.95, N = 3SE +/- 867.58, N = 3295417729465802957670
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2123500K1000K1500K2000K2500KMin: 2954030 / Avg: 2954176.67 / Max: 2954350Min: 2942500 / Avg: 2946580 / Max: 2948810Min: 2956560 / Avg: 2957670 / Max: 2959380

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX123306090120150SE +/- 0.92, N = 3SE +/- 0.89, N = 3SE +/- 0.98, N = 3145.47144.87145.79
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX123306090120150Min: 143.84 / Avg: 145.47 / Max: 147.04Min: 143.29 / Avg: 144.87 / Max: 146.38Min: 144 / Avg: 145.79 / Max: 147.37

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.93831.87662.81493.75324.6915SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 34.154.174.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow123246810Min: 4.13 / Avg: 4.15 / Max: 4.19Min: 4.15 / Avg: 4.17 / Max: 4.21Min: 4.12 / Avg: 4.15 / Max: 4.191. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.95181.90362.85543.80724.759SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.214.234.201. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium123246810Min: 4.2 / Avg: 4.21 / Max: 4.22Min: 4.22 / Avg: 4.23 / Max: 4.25Min: 4.19 / Avg: 4.2 / Max: 4.21. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bit123306090120150SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 3114.55116.36113.41MIN: 72.75 / MAX: 275.84MIN: 73.44 / MAX: 275.45MIN: 72.47 / MAX: 269.271. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bit12320406080100Min: 114.36 / Avg: 114.55 / Max: 114.77Min: 116.01 / Avg: 116.36 / Max: 116.78Min: 113.29 / Avg: 113.41 / Max: 113.481. (CC) gcc options: -pthread

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 212310M20M30M40M50MSE +/- 543289.97, N = 3SE +/- 22614.50, N = 3SE +/- 80326.86, N = 348134384.748808319.148679363.3
OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 21238M16M24M32M40MMin: 47057724.3 / Avg: 48134384.73 / Max: 48799575.9Min: 48770683.9 / Avg: 48808319.07 / Max: 48848860.9Min: 48550770.6 / Avg: 48679363.3 / Max: 48827057.2

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamcluster1231530456075SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 7.66, N = 1219.5619.6369.371. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamcluster1231326395265Min: 19.48 / Avg: 19.56 / Max: 19.69Min: 19.48 / Avg: 19.63 / Max: 19.73Min: 21.89 / Avg: 69.37 / Max: 90.861. (CXX) g++ options: -O2 -lOpenCL

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiX12320406080100SE +/- 0.79, N = 3SE +/- 0.75, N = 3SE +/- 0.86, N = 3104.11103.42104.69
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiX12320406080100Min: 102.69 / Avg: 104.11 / Max: 105.44Min: 102.09 / Avg: 103.42 / Max: 104.67Min: 103.1 / Avg: 104.69 / Max: 106.07

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KSE +/- 216.65, N = 3SE +/- 79.24, N = 3SE +/- 179.36, N = 31037861033491039031. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KMin: 103440 / Avg: 103786.33 / Max: 104185Min: 103248 / Avg: 103348.67 / Max: 103505Min: 103621 / Avg: 103903 / Max: 1042361. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms1230.59171.18341.77512.36682.9585SE +/- 0.00650, N = 3SE +/- 0.00297, N = 3SE +/- 0.04166, N = 31.944531.938472.62968
OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms123246810Min: 1.93 / Avg: 1.94 / Max: 1.95Min: 1.94 / Avg: 1.94 / Max: 1.94Min: 2.55 / Avg: 2.63 / Max: 2.7

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.30, N = 3100.12100.24100.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100Min: 100.06 / Avg: 100.12 / Max: 100.22Min: 100.03 / Avg: 100.24 / Max: 100.35Min: 100.31 / Avg: 100.83 / Max: 101.341. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 012320406080100SE +/- 0.38, N = 3SE +/- 0.39, N = 3SE +/- 0.38, N = 399.6799.2799.341. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 012320406080100Min: 98.95 / Avg: 99.67 / Max: 100.25Min: 98.79 / Avg: 99.27 / Max: 100.05Min: 98.59 / Avg: 99.34 / Max: 99.811. (CXX) g++ options: -O3 -fPIC

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compile1220406080100SE +/- 0.05, N = 3SE +/- 0.19, N = 398.1898.15
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compile1220406080100Min: 98.11 / Avg: 98.18 / Max: 98.29Min: 97.9 / Avg: 98.15 / Max: 98.51

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Write - Average Latency123100200300400500SE +/- 7.83, N = 15SE +/- 4.74, N = 15SE +/- 4.44, N = 3419.73424.26449.111. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Write - Average Latency12380160240320400Min: 353.46 / Avg: 419.73 / Max: 450.05Min: 381.95 / Avg: 424.26 / Max: 450.12Min: 440.96 / Avg: 449.11 / Max: 456.251. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Write123130260390520650SE +/- 11.83, N = 15SE +/- 6.84, N = 15SE +/- 5.55, N = 35995915571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Write123110220330440550Min: 555.78 / Avg: 598.98 / Max: 707.59Min: 555.59 / Avg: 590.58 / Max: 654.7Min: 548.17 / Avg: 556.98 / Max: 567.221. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012320K40K60K80K100KSE +/- 78.77, N = 3SE +/- 143.26, N = 3SE +/- 112.10, N = 38027379922805071. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012314K28K42K56K70KMin: 80131 / Avg: 80273.33 / Max: 80403Min: 79648 / Avg: 79921.67 / Max: 80132Min: 80318 / Avg: 80507.33 / Max: 807061. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency1230.54271.08541.62812.17082.7135SE +/- 0.025, N = 3SE +/- 0.138, N = 12SE +/- 0.108, N = 122.0842.4122.3121. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency123246810Min: 2.05 / Avg: 2.08 / Max: 2.13Min: 1.98 / Avg: 2.41 / Max: 3.19Min: 1.97 / Avg: 2.31 / Max: 3.071. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write123100200300400500SE +/- 5.79, N = 3SE +/- 22.30, N = 12SE +/- 17.55, N = 124804294421. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write12390180270360450Min: 468.53 / Avg: 479.93 / Max: 487.4Min: 313.38 / Avg: 428.64 / Max: 504.63Min: 326.1 / Avg: 441.57 / Max: 506.731. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 2205.46, N = 3SE +/- 2194.00, N = 3SE +/- 2673.51, N = 31507808.81526391.31508093.5
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KMin: 1503409.4 / Avg: 1507808.83 / Max: 1510284.1Min: 1523213.5 / Avg: 1526391.27 / Max: 1530600.7Min: 1505179.9 / Avg: 1508093.5 / Max: 1513433.1

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 5130.21, N = 3SE +/- 4759.58, N = 3SE +/- 4617.21, N = 31534293.61543312.51530241.0
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KMin: 1525190.4 / Avg: 1534293.57 / Max: 1542944.7Min: 1538259.6 / Avg: 1543312.5 / Max: 1552825.5Min: 1522977.7 / Avg: 1530241 / Max: 1538811.2

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 4676.23, N = 3SE +/- 6187.13, N = 3SE +/- 1033.19, N = 31535930.81557794.21539617.5
OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KMin: 1530612.2 / Avg: 1535930.83 / Max: 1545252.4Min: 1545905.9 / Avg: 1557794.23 / Max: 1566712Min: 1537861.5 / Avg: 1539617.47 / Max: 1541438.8

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU1231.10172.20343.30514.40685.5085SE +/- 0.04290, N = 15SE +/- 0.05834, N = 12SE +/- 0.04582, N = 34.896514.673924.70756MIN: 3.92MIN: 3.78MIN: 4.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU123246810Min: 4.75 / Avg: 4.9 / Max: 5.36Min: 4.04 / Avg: 4.67 / Max: 4.83Min: 4.62 / Avg: 4.71 / Max: 4.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

KeyDB

A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123160K320K480K640K800KSE +/- 1649.46, N = 3SE +/- 3580.61, N = 3SE +/- 673.14, N = 3737463.31736916.33734176.591. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123130K260K390K520K650KMin: 734781.8 / Avg: 737463.31 / Max: 740468.19Min: 732167.59 / Avg: 736916.33 / Max: 743932.83Min: 732907.25 / Avg: 734176.59 / Max: 735199.771. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: f32 - Engine: CPU1231.01172.02343.03514.04685.0585SE +/- 0.04422, N = 15SE +/- 0.03723, N = 13SE +/- 0.04076, N = 124.496274.131534.13129MIN: 3.83MIN: 3.59MIN: 3.591. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: f32 - Engine: CPU123246810Min: 3.99 / Avg: 4.5 / Max: 4.77Min: 3.73 / Avg: 4.13 / Max: 4.29Min: 3.75 / Avg: 4.13 / Max: 4.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: raytrace12380160240320400374375377

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: yolov4-tiny123714212835SE +/- 0.04, N = 3SE +/- 0.35, N = 3SE +/- 0.12, N = 328.7029.5829.07MIN: 28.5 / MAX: 29.09MIN: 28.17 / MAX: 140.34MIN: 28.8 / MAX: 29.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: yolov4-tiny123714212835Min: 28.65 / Avg: 28.7 / Max: 28.78Min: 28.88 / Avg: 29.58 / Max: 29.96Min: 28.89 / Avg: 29.07 / Max: 29.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet50123714212835SE +/- 0.17, N = 3SE +/- 0.43, N = 3SE +/- 0.53, N = 328.8028.9129.20MIN: 27.99 / MAX: 145.54MIN: 27.49 / MAX: 158.03MIN: 27.61 / MAX: 140.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet50123612182430Min: 28.63 / Avg: 28.8 / Max: 29.13Min: 28.39 / Avg: 28.91 / Max: 29.76Min: 28.52 / Avg: 29.2 / Max: 30.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: alexnet12348121620SE +/- 0.11, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 314.3814.3314.72MIN: 14.11 / MAX: 17.2MIN: 14.26 / MAX: 14.55MIN: 14.22 / MAX: 122.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: alexnet12348121620Min: 14.19 / Avg: 14.38 / Max: 14.58Min: 14.31 / Avg: 14.33 / Max: 14.35Min: 14.41 / Avg: 14.72 / Max: 14.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet1812348121620SE +/- 0.20, N = 3SE +/- 0.04, N = 3SE +/- 0.25, N = 315.2015.4915.61MIN: 14.69 / MAX: 15.68MIN: 14.84 / MAX: 15.88MIN: 15.01 / MAX: 51.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet1812348121620Min: 14.8 / Avg: 15.2 / Max: 15.4Min: 15.43 / Avg: 15.49 / Max: 15.58Min: 15.36 / Avg: 15.61 / Max: 16.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: vgg161231530456075SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.37, N = 367.2266.6067.82MIN: 65.96 / MAX: 186.53MIN: 66.46 / MAX: 67.69MIN: 66.24 / MAX: 187.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: vgg161231326395265Min: 66.13 / Avg: 67.22 / Max: 68.09Min: 66.6 / Avg: 66.6 / Max: 66.61Min: 67.37 / Avg: 67.82 / Max: 68.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: googlenet12348121620SE +/- 0.31, N = 3SE +/- 0.26, N = 3SE +/- 0.01, N = 314.9915.0915.38MIN: 14.17 / MAX: 16.1MIN: 14.25 / MAX: 15.64MIN: 15.04 / MAX: 15.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: googlenet12348121620Min: 14.37 / Avg: 14.99 / Max: 15.31Min: 14.57 / Avg: 15.09 / Max: 15.35Min: 15.37 / Avg: 15.38 / Max: 15.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: blazeface1230.33750.6751.01251.351.6875SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 31.461.451.50MIN: 1.38 / MAX: 1.57MIN: 1.34 / MAX: 1.66MIN: 1.42 / MAX: 1.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: blazeface123246810Min: 1.4 / Avg: 1.46 / Max: 1.49Min: 1.38 / Avg: 1.45 / Max: 1.5Min: 1.5 / Avg: 1.5 / Max: 1.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: efficientnet-b0123246810SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.01, N = 36.696.686.85MIN: 6.36 / MAX: 7.32MIN: 6.34 / MAX: 7.11MIN: 6.72 / MAX: 7.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: efficientnet-b01233691215Min: 6.4 / Avg: 6.69 / Max: 6.83Min: 6.38 / Avg: 6.68 / Max: 6.83Min: 6.84 / Avg: 6.85 / Max: 6.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mnasnet1230.93831.87662.81493.75324.6915SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 33.923.914.17MIN: 3.87 / MAX: 4.28MIN: 3.86 / MAX: 4.25MIN: 3.88 / MAX: 4.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mnasnet123246810Min: 3.9 / Avg: 3.92 / Max: 3.93Min: 3.89 / Avg: 3.91 / Max: 3.93Min: 3.91 / Avg: 4.17 / Max: 4.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: shufflenet-v21230.73351.4672.20052.9343.6675SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 33.213.173.26MIN: 2.98 / MAX: 3.47MIN: 2.99 / MAX: 3.47MIN: 2.99 / MAX: 3.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: shufflenet-v2123246810Min: 3.13 / Avg: 3.21 / Max: 3.26Min: 3.12 / Avg: 3.17 / Max: 3.21Min: 3.19 / Avg: 3.26 / Max: 3.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v3-v3 - Model: mobilenet-v31230.9451.892.8353.784.725SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.15, N = 34.064.074.20MIN: 4.01 / MAX: 4.32MIN: 4.03 / MAX: 4.44MIN: 4.01 / MAX: 4.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v3-v3 - Model: mobilenet-v3123246810Min: 4.05 / Avg: 4.06 / Max: 4.07Min: 4.05 / Avg: 4.07 / Max: 4.08Min: 4.04 / Avg: 4.2 / Max: 4.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v2-v2 - Model: mobilenet-v21231.19482.38963.58444.77925.974SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.14, N = 35.115.135.31MIN: 5.01 / MAX: 5.37MIN: 5.01 / MAX: 5.35MIN: 5.02 / MAX: 8.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v2-v2 - Model: mobilenet-v2123246810Min: 5.1 / Avg: 5.11 / Max: 5.13Min: 5.12 / Avg: 5.13 / Max: 5.13Min: 5.14 / Avg: 5.31 / Max: 5.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet123510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 319.3019.2219.23MIN: 18.89 / MAX: 19.7MIN: 18.82 / MAX: 26.38MIN: 18.97 / MAX: 19.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet123510152025Min: 19.2 / Avg: 19.3 / Max: 19.38Min: 19.13 / Avg: 19.22 / Max: 19.28Min: 19.05 / Avg: 19.23 / Max: 19.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: squeezenet12348121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 316.1016.1016.38MIN: 15.74 / MAX: 17.79MIN: 15.94 / MAX: 17.11MIN: 15.97 / MAX: 24.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: squeezenet12348121620Min: 16.05 / Avg: 16.1 / Max: 16.14Min: 16.06 / Avg: 16.1 / Max: 16.16Min: 16.12 / Avg: 16.38 / Max: 16.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time1231428425670SE +/- 0.13, N = 3SE +/- 0.28, N = 3SE +/- 0.05, N = 361.7561.8562.121. RawTherapee, version 5.8, command line.
OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time1231224364860Min: 61.53 / Avg: 61.75 / Max: 61.99Min: 61.29 / Avg: 61.85 / Max: 62.2Min: 62.06 / Avg: 62.12 / Max: 62.221. RawTherapee, version 5.8, command line.

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNet12350K100K150K200K250KSE +/- 345.32, N = 3SE +/- 496.71, N = 3SE +/- 627.42, N = 3227852226239227599
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNet12340K80K120K160K200KMin: 227164 / Avg: 227851.67 / Max: 228251Min: 225661 / Avg: 226239.33 / Max: 227228Min: 226382 / Avg: 227598.67 / Max: 228473

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobile12340K80K120K160K200KSE +/- 363.56, N = 3SE +/- 120.29, N = 3SE +/- 1000.76, N = 3201015199504201265
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobile12330K60K90K120K150KMin: 200409 / Avg: 201015 / Max: 201666Min: 199271 / Avg: 199503.67 / Max: 199673Min: 199347 / Avg: 201265.33 / Max: 202719

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quant12330K60K90K120K150KSE +/- 112.51, N = 3SE +/- 261.09, N = 3SE +/- 197.44, N = 3157250156299157719
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quant12330K60K90K120K150KMin: 157085 / Avg: 157250 / Max: 157465Min: 156026 / Avg: 156299 / Max: 156821Min: 157339 / Avg: 157719 / Max: 158002

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Float12330K60K90K120K150KSE +/- 387.33, N = 3SE +/- 330.84, N = 3SE +/- 188.88, N = 3153919153221154362
OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Float12330K60K90K120K150KMin: 153221 / Avg: 153919 / Max: 154559Min: 152834 / Avg: 153220.67 / Max: 153879Min: 154000 / Avg: 154361.67 / Max: 154637

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21231326395265SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 358.6658.4058.661. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21231224364860Min: 58.48 / Avg: 58.66 / Max: 58.79Min: 57.86 / Avg: 58.4 / Max: 58.75Min: 58.33 / Avg: 58.66 / Max: 58.881. (CXX) g++ options: -O3 -fPIC

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU123714212835SE +/- 0.07, N = 3SE +/- 0.24, N = 3SE +/- 0.04, N = 327.6427.3727.35MIN: 25.19MIN: 24.77MIN: 25.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU123612182430Min: 27.55 / Avg: 27.64 / Max: 27.77Min: 27.12 / Avg: 27.37 / Max: 27.86Min: 27.28 / Avg: 27.35 / Max: 27.431. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU1231530456075SE +/- 0.38, N = 3SE +/- 0.04, N = 3SE +/- 0.54, N = 368.0569.0068.59MIN: 64.46MIN: 63.06MIN: 62.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU1231326395265Min: 67.53 / Avg: 68.05 / Max: 68.79Min: 68.94 / Avg: 69 / Max: 69.07Min: 67.59 / Avg: 68.59 / Max: 69.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.25610.51220.76831.02441.2805SE +/- 0.010, N = 15SE +/- 0.011, N = 15SE +/- 0.011, N = 151.1381.1321.127MIN: 0.97 / MAX: 1.48MIN: 0.94 / MAX: 1.51MIN: 0.95 / MAX: 1.48
OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis123246810Min: 1.03 / Avg: 1.14 / Max: 1.19Min: 1.02 / Avg: 1.13 / Max: 1.17Min: 1.01 / Avg: 1.13 / Max: 1.17

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startup123246810SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.806.746.75
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startup1233691215Min: 6.73 / Avg: 6.8 / Max: 6.85Min: 6.73 / Avg: 6.74 / Max: 6.74Min: 6.75 / Avg: 6.75 / Max: 6.76

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiX1231224364860SE +/- 0.14, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 354.9354.7155.23
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiX1231122334455Min: 54.71 / Avg: 54.93 / Max: 55.2Min: 54.45 / Avg: 54.71 / Max: 55.02Min: 55 / Avg: 55.23 / Max: 55.42

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte123816243240SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.36, N = 834.7934.5135.761. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte123816243240Min: 34.65 / Avg: 34.79 / Max: 35.05Min: 34.43 / Avg: 34.51 / Max: 34.6Min: 34.76 / Avg: 35.76 / Max: 37.71. (CXX) g++ options: -O2 -lOpenCL

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1233691215SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 311.9512.0511.911. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast12348121620Min: 11.9 / Avg: 11.95 / Max: 12Min: 12.01 / Avg: 12.05 / Max: 12.13Min: 11.83 / Avg: 11.91 / Max: 11.981. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough123816243240SE +/- 0.40, N = 3SE +/- 0.42, N = 3SE +/- 0.40, N = 633.3333.2433.341. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough123714212835Min: 32.52 / Avg: 33.33 / Max: 33.74Min: 32.39 / Avg: 33.24 / Max: 33.68Min: 31.33 / Avg: 33.34 / Max: 33.771. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K1233691215SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 312.3712.3912.341. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K12348121620Min: 12.23 / Avg: 12.37 / Max: 12.47Min: 12.25 / Avg: 12.39 / Max: 12.56Min: 12.3 / Avg: 12.34 / Max: 12.371. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: 2to312360120180240300264261261

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU1230.42010.84021.26031.68042.1005SE +/- 0.01882, N = 8SE +/- 0.01584, N = 15SE +/- 0.03006, N = 31.801031.805711.86706MIN: 1.51MIN: 1.49MIN: 1.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU123246810Min: 1.67 / Avg: 1.8 / Max: 1.82Min: 1.61 / Avg: 1.81 / Max: 1.9Min: 1.81 / Avg: 1.87 / Max: 1.91. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001239K18K27K36K45KSE +/- 134.25, N = 3SE +/- 121.90, N = 3SE +/- 115.70, N = 34009539759403151. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001237K14K21K28K35KMin: 39879 / Avg: 40094.67 / Max: 40341Min: 39518 / Avg: 39759.33 / Max: 39910Min: 40090 / Avg: 40314.67 / Max: 404751. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: go1234080120160200200199200

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12370140210280350SE +/- 3.32, N = 3SE +/- 2.20, N = 3SE +/- 1.53, N = 3321.80326.96324.34MIN: 304.56MIN: 315.72MIN: 307.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12360120180240300Min: 315.6 / Avg: 321.8 / Max: 326.96Min: 323.11 / Avg: 326.96 / Max: 330.73Min: 321.71 / Avg: 324.34 / Max: 327.021. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Only - Average Latency1230.22590.45180.67770.90361.1295SE +/- 0.003, N = 3SE +/- 0.013, N = 5SE +/- 0.013, N = 50.9820.9951.0041. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Only - Average Latency123246810Min: 0.98 / Avg: 0.98 / Max: 0.99Min: 0.96 / Avg: 1 / Max: 1.04Min: 0.97 / Avg: 1 / Max: 1.041. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Only12350K100K150K200K250KSE +/- 853.38, N = 3SE +/- 3217.19, N = 5SE +/- 3216.57, N = 52546782514292493481. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Only12340K80K120K160K200KMin: 253004.72 / Avg: 254678.41 / Max: 255804.8Min: 241661.57 / Avg: 251428.53 / Max: 260764.33Min: 240318.41 / Avg: 249347.52 / Max: 2592721. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU123306090120150SE +/- 1.81, N = 3SE +/- 0.87, N = 3SE +/- 1.35, N = 3149.62139.87136.15MIN: 143.31MIN: 136.06MIN: 130.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU123306090120150Min: 146.36 / Avg: 149.62 / Max: 152.63Min: 138.13 / Avg: 139.87 / Max: 140.79Min: 133.47 / Avg: 136.15 / Max: 137.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmark123816243240SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 333.3133.6633.871. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmark123714212835Min: 33.06 / Avg: 33.31 / Max: 33.49Min: 33.44 / Avg: 33.66 / Max: 33.88Min: 33.71 / Avg: 33.87 / Max: 33.991. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression123816243240SE +/- 0.23, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 334.7034.8434.611. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression123714212835Min: 34.25 / Avg: 34.7 / Max: 34.94Min: 34.59 / Avg: 34.84 / Max: 35.06Min: 34.51 / Avg: 34.61 / Max: 34.731. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123510152025SE +/- 0.01, N = 3SE +/- 0.15, N = 3SE +/- 0.02, N = 318.2418.5818.111. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123510152025Min: 18.22 / Avg: 18.24 / Max: 18.27Min: 18.42 / Avg: 18.58 / Max: 18.88Min: 18.08 / Avg: 18.11 / Max: 18.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123510152025SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 318.6918.9218.691. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123510152025Min: 18.58 / Avg: 18.69 / Max: 18.84Min: 18.89 / Avg: 18.92 / Max: 18.95Min: 18.57 / Avg: 18.69 / Max: 18.761. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: float12320406080100SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.36, N = 390.090.090.1
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: float12320406080100Min: 89.9 / Avg: 89.97 / Max: 90Min: 89.9 / Avg: 90 / Max: 90.1Min: 89.6 / Avg: 90.1 / Max: 90.8

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 0 Two-Pass1230.07430.14860.22290.29720.3715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.330.331. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 0 Two-Pass12312345Min: 0.32 / Avg: 0.33 / Max: 0.33Min: 0.32 / Avg: 0.33 / Max: 0.33Min: 0.32 / Avg: 0.33 / Max: 0.331. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_template123918273645SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 338.638.939.2
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_template123816243240Min: 38.5 / Avg: 38.63 / Max: 38.8Min: 38.8 / Avg: 38.9 / Max: 39Min: 39 / Avg: 39.17 / Max: 39.3

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaos12320406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 385.785.886.1
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaos1231632486480Min: 85.6 / Avg: 85.67 / Max: 85.8Min: 85.7 / Avg: 85.83 / Max: 86Min: 86 / Avg: 86.07 / Max: 86.2

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: crypto_pyaes12320406080100SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 385.885.885.9
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: crypto_pyaes1231632486480Min: 85.7 / Avg: 85.77 / Max: 85.8Min: 85.7 / Avg: 85.8 / Max: 85.9Min: 85.9 / Avg: 85.93 / Max: 86

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H21236001200180024003000SE +/- 46.69, N = 19SE +/- 39.90, N = 20SE +/- 38.16, N = 4286429252775
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H21235001000150020002500Min: 2537 / Avg: 2863.58 / Max: 3247Min: 2663 / Avg: 2925.3 / Max: 3239Min: 2697 / Avg: 2775.25 / Max: 2880

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiX123714212835SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 327.9027.7627.86
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiX123612182430Min: 27.89 / Avg: 27.9 / Max: 27.92Min: 27.68 / Avg: 27.76 / Max: 27.8Min: 27.83 / Avg: 27.86 / Max: 27.91

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: regex_compile123306090120150138137139

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast123510152025SE +/- 0.10, N = 3SE +/- 0.23, N = 3SE +/- 0.11, N = 321.8722.4521.921. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast123510152025Min: 21.69 / Avg: 21.87 / Max: 22.03Min: 22.19 / Avg: 22.45 / Max: 22.9Min: 21.73 / Avg: 21.92 / Max: 22.121. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Realtime123612182430SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 323.5423.4423.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Realtime123510152025Min: 23.38 / Avg: 23.54 / Max: 23.68Min: 23.27 / Avg: 23.44 / Max: 23.62Min: 23.22 / Avg: 23.36 / Max: 23.521. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average Latency1234080120160200SE +/- 0.42, N = 3SE +/- 1.60, N = 3SE +/- 1.01, N = 3176.13173.12173.241. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average Latency123306090120150Min: 175.49 / Avg: 176.13 / Max: 176.92Min: 169.99 / Avg: 173.12 / Max: 175.26Min: 171.91 / Avg: 173.24 / Max: 175.221. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write123120240360480600SE +/- 1.34, N = 3SE +/- 5.30, N = 3SE +/- 3.40, N = 35685785781. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write123100200300400500Min: 565.61 / Avg: 568.11 / Max: 570.21Min: 571.02 / Avg: 578.14 / Max: 588.49Min: 570.89 / Avg: 577.54 / Max: 582.081. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency12320406080100SE +/- 1.15, N = 3SE +/- 0.03, N = 3SE +/- 0.67, N = 381.8783.8982.731. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency1231632486480Min: 80 / Avg: 81.86 / Max: 83.98Min: 83.85 / Avg: 83.89 / Max: 83.95Min: 81.43 / Avg: 82.73 / Max: 83.671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write123130260390520650SE +/- 8.58, N = 3SE +/- 0.22, N = 3SE +/- 4.93, N = 36115966041. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write123110220330440550Min: 595.47 / Avg: 611.06 / Max: 625.07Min: 595.68 / Avg: 596.12 / Max: 596.38Min: 597.67 / Avg: 604.49 / Max: 614.081. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency1230.0070.0140.0210.0280.035SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0300.0300.0311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency12312345Min: 0.03 / Avg: 0.03 / Max: 0.03Min: 0.03 / Avg: 0.03 / Max: 0.03Min: 0.03 / Avg: 0.03 / Max: 0.031. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only1237K14K21K28K35KSE +/- 77.57, N = 3SE +/- 373.98, N = 3SE +/- 92.20, N = 33341433267325791. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only1236K12K18K24K30KMin: 33282.52 / Avg: 33413.52 / Max: 33551.01Min: 32547.57 / Avg: 33267.12 / Max: 33803.71Min: 32398.65 / Avg: 32578.8 / Max: 32702.931. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average Latency1230.08210.16420.24630.32840.4105SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 30.3650.3560.3601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average Latency12312345Min: 0.36 / Avg: 0.36 / Max: 0.37Min: 0.35 / Avg: 0.36 / Max: 0.36Min: 0.36 / Avg: 0.36 / Max: 0.361. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only12360K120K180K240K300KSE +/- 1999.95, N = 3SE +/- 3007.11, N = 3SE +/- 753.18, N = 32744632812912780191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only12350K100K150K200K250KMin: 270821.15 / Avg: 274463.42 / Max: 277716.25Min: 278167.59 / Avg: 281290.87 / Max: 287303.58Min: 276740.71 / Avg: 278019.02 / Max: 279348.291. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency1230.03890.07780.11670.15560.1945SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 30.1730.1690.1711. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency12312345Min: 0.17 / Avg: 0.17 / Max: 0.17Min: 0.17 / Avg: 0.17 / Max: 0.17Min: 0.17 / Avg: 0.17 / Max: 0.181. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only12360K120K180K240K300KSE +/- 1001.63, N = 3SE +/- 4333.65, N = 3SE +/- 3723.51, N = 32894012960882919951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only12350K100K150K200K250KMin: 287488.27 / Avg: 289401.29 / Max: 290872.65Min: 287556.43 / Avg: 296087.6 / Max: 301678.34Min: 284577.31 / Avg: 291995.21 / Max: 296273.911. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-Pass1230.96751.9352.90253.874.8375SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.304.294.271. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-Pass123246810Min: 4.28 / Avg: 4.3 / Max: 4.31Min: 4.26 / Avg: 4.29 / Max: 4.3Min: 4.26 / Avg: 4.27 / Max: 4.291. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 4K1234080120160200SE +/- 1.17, N = 3SE +/- 1.07, N = 3SE +/- 0.52, N = 3156.40160.47156.84MIN: 125.77 / MAX: 171.79MIN: 149.07 / MAX: 177.15MIN: 148.02 / MAX: 171.091. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 4K123306090120150Min: 154.17 / Avg: 156.4 / Max: 158.09Min: 159.02 / Avg: 160.47 / Max: 162.57Min: 156.01 / Avg: 156.84 / Max: 157.791. (CC) gcc options: -pthread

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlib12348121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.09, N = 314.414.314.5
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlib12348121620Min: 14.4 / Avg: 14.4 / Max: 14.4Min: 14.3 / Avg: 14.3 / Max: 14.3Min: 14.3 / Avg: 14.47 / Max: 14.6

OCRMyPDF

OCRMyPDF is an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs with the text now selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 10.3.1+dfsgProcessing 60 Page PDF Document123510152025SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.21, N = 322.0221.0722.09
OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 10.3.1+dfsgProcessing 60 Page PDF Document123510152025Min: 21.8 / Avg: 22.02 / Max: 22.2Min: 20.63 / Avg: 21.07 / Max: 21.32Min: 21.67 / Avg: 22.09 / Max: 22.36

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28123510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 321.5921.5921.601. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28123510152025Min: 21.56 / Avg: 21.59 / Max: 21.63Min: 21.57 / Avg: 21.59 / Max: 21.6Min: 21.52 / Avg: 21.6 / Max: 21.651. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU1231.28412.56823.85235.13646.4205SE +/- 0.03901, N = 3SE +/- 0.01208, N = 3SE +/- 0.05691, N = 35.705945.593575.70690MIN: 4.96MIN: 4.91MIN: 4.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU123246810Min: 5.64 / Avg: 5.71 / Max: 5.78Min: 5.58 / Avg: 5.59 / Max: 5.62Min: 5.63 / Avg: 5.71 / Max: 5.821. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Tesseract OCR

Tesseract-OCR is the open-source optical character recognition (OCR) engine for the conversion of text within images to raw text output. This test profile relies upon a system-supplied Tesseract installation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTesseract OCR 4.1.1Time To OCR 7 Images123510152025SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 320.1320.3220.22
OpenBenchmarking.orgSeconds, Fewer Is BetterTesseract OCR 4.1.1Time To OCR 7 Images123510152025Min: 20.12 / Avg: 20.13 / Max: 20.14Min: 20.27 / Avg: 20.31 / Max: 20.39Min: 20.18 / Avg: 20.22 / Max: 20.26

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance1235001000150020002500SE +/- 27.67, N = 3SE +/- 18.02, N = 3SE +/- 14.41, N = 32431.82467.92474.61. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance123400800120016002000Min: 2376.6 / Avg: 2431.8 / Max: 2462.7Min: 2439 / Avg: 2467.87 / Max: 2501Min: 2459.3 / Avg: 2474.6 / Max: 2503.41. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212360120180240300SE +/- 1.30, N = 3SE +/- 0.68, N = 3SE +/- 0.57, N = 3286.04287.56284.03MIN: 283.63 / MAX: 356.82MIN: 285.93 / MAX: 307.5MIN: 282.82 / MAX: 325.111. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212350100150200250Min: 284.04 / Avg: 286.03 / Max: 288.48Min: 286.5 / Avg: 287.56 / Max: 288.84Min: 283.39 / Avg: 284.03 / Max: 285.161. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pickle_pure_python12370140210280350338339339

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loads123510152025SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 321.321.221.2
OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loads123510152025Min: 21.2 / Avg: 21.33 / Max: 21.5Min: 21.2 / Avg: 21.2 / Max: 21.2Min: 21.1 / Avg: 21.17 / Max: 21.2

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: CPU1233691215SE +/- 1.26, N = 16SE +/- 1.23, N = 16SE +/- 1.26, N = 1610.910.810.9
OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: CPU1233691215Min: 6 / Avg: 10.89 / Max: 15.9Min: 6 / Avg: 10.75 / Max: 15.6Min: 6 / Avg: 10.89 / Max: 15.9

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.112360120180240300SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3269.71269.66269.44MIN: 268.38 / MAX: 284.03MIN: 268.59 / MAX: 282.48MIN: 268.04 / MAX: 281.471. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.112350100150200250Min: 269.66 / Avg: 269.71 / Max: 269.75Min: 269.54 / Avg: 269.66 / Max: 269.77Min: 269.21 / Avg: 269.43 / Max: 269.651. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: nbody12320406080100103103103

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p123140280420560700SE +/- 2.78, N = 3SE +/- 1.12, N = 3SE +/- 3.61, N = 3639.97647.74641.98MIN: 466.44 / MAX: 934.79MIN: 471.76 / MAX: 969.53MIN: 460.65 / MAX: 940.691. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p123110220330440550Min: 634.42 / Avg: 639.97 / Max: 642.83Min: 645.5 / Avg: 647.74 / Max: 648.95Min: 634.76 / Avg: 641.98 / Max: 645.691. (CC) gcc options: -pthread

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 4 Two-Pass1230.6121.2241.8362.4483.06SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.722.722.721. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 4 Two-Pass123246810Min: 2.72 / Avg: 2.72 / Max: 2.73Min: 2.71 / Avg: 2.72 / Max: 2.72Min: 2.71 / Avg: 2.72 / Max: 2.731. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: yolov4-tiny123246810SE +/- 0.18, N = 3SE +/- 0.02, N = 3SE +/- 0.34, N = 38.678.328.70MIN: 8.03 / MAX: 53MIN: 8.01 / MAX: 8.7MIN: 8.04 / MAX: 83.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: yolov4-tiny1233691215Min: 8.32 / Avg: 8.67 / Max: 8.9Min: 8.3 / Avg: 8.32 / Max: 8.35Min: 8.34 / Avg: 8.7 / Max: 9.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet501230.84831.69662.54493.39324.2415SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.773.743.76MIN: 3.72 / MAX: 13.18MIN: 3.72 / MAX: 3.76MIN: 3.74 / MAX: 3.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet50123246810Min: 3.76 / Avg: 3.77 / Max: 3.79Min: 3.73 / Avg: 3.74 / Max: 3.75Min: 3.75 / Avg: 3.76 / Max: 3.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet1230.47250.9451.41751.892.3625SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.102.082.09MIN: 1.85 / MAX: 2.59MIN: 2.04 / MAX: 2.57MIN: 1.82 / MAX: 2.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet123246810Min: 2.08 / Avg: 2.1 / Max: 2.11Min: 2.08 / Avg: 2.08 / Max: 2.09Min: 2.09 / Avg: 2.09 / Max: 2.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet181230.3870.7741.1611.5481.935SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 31.681.721.66MIN: 1.64 / MAX: 17.75MIN: 1.64 / MAX: 23.58MIN: 1.64 / MAX: 1.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet18123246810Min: 1.65 / Avg: 1.68 / Max: 1.74Min: 1.65 / Avg: 1.72 / Max: 1.77Min: 1.66 / Avg: 1.66 / Max: 1.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg16123246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 38.368.328.42MIN: 7.77 / MAX: 22.44MIN: 7.77 / MAX: 22.07MIN: 7.88 / MAX: 18.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg161233691215Min: 8.34 / Avg: 8.36 / Max: 8.38Min: 8.27 / Avg: 8.32 / Max: 8.39Min: 8.41 / Avg: 8.42 / Max: 8.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: googlenet1230.79431.58862.38293.17723.9715SE +/- 0.31, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.533.223.23MIN: 3.2 / MAX: 21.92MIN: 3.21 / MAX: 3.25MIN: 3.22 / MAX: 3.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: googlenet123246810Min: 3.22 / Avg: 3.53 / Max: 4.15Min: 3.22 / Avg: 3.22 / Max: 3.22Min: 3.23 / Avg: 3.23 / Max: 3.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: blazeface1230.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 30.610.620.63MIN: 0.6 / MAX: 0.63MIN: 0.6 / MAX: 0.66MIN: 0.61 / MAX: 2.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: blazeface123246810Min: 0.61 / Avg: 0.61 / Max: 0.62Min: 0.61 / Avg: 0.62 / Max: 0.62Min: 0.62 / Avg: 0.63 / Max: 0.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b01230.60081.20161.80242.40323.004SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.672.652.65MIN: 2.64 / MAX: 12.58MIN: 2.63 / MAX: 3.85MIN: 2.63 / MAX: 2.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b0123246810Min: 2.65 / Avg: 2.67 / Max: 2.7Min: 2.64 / Avg: 2.65 / Max: 2.65Min: 2.65 / Avg: 2.65 / Max: 2.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet1230.33750.6751.01251.351.6875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.481.481.50MIN: 1.47 / MAX: 1.72MIN: 1.47 / MAX: 1.53MIN: 1.47 / MAX: 6.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet123246810Min: 1.48 / Avg: 1.48 / Max: 1.48Min: 1.48 / Avg: 1.48 / Max: 1.48Min: 1.49 / Avg: 1.5 / Max: 1.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v21230.29480.58960.88441.17921.474SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.301.311.31MIN: 1.29 / MAX: 1.32MIN: 1.29 / MAX: 1.35MIN: 1.3 / MAX: 1.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v2123246810Min: 1.3 / Avg: 1.3 / Max: 1.31Min: 1.31 / Avg: 1.31 / Max: 1.31Min: 1.31 / Avg: 1.31 / Max: 1.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31230.3780.7561.1341.5121.89SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.671.681.68MIN: 1.66 / MAX: 1.7MIN: 1.66 / MAX: 1.93MIN: 1.66 / MAX: 1.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3123246810Min: 1.67 / Avg: 1.67 / Max: 1.68Min: 1.67 / Avg: 1.68 / Max: 1.68Min: 1.68 / Avg: 1.68 / Max: 1.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21230.32850.6570.98551.3141.6425SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 31.451.421.46MIN: 1.41 / MAX: 20.49MIN: 1.41 / MAX: 1.44MIN: 1.41 / MAX: 20.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2123246810Min: 1.42 / Avg: 1.45 / Max: 1.52Min: 1.42 / Avg: 1.42 / Max: 1.42Min: 1.42 / Avg: 1.46 / Max: 1.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mobilenet1231.07332.14663.21994.29325.3665SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 34.634.584.77MIN: 4.56 / MAX: 5.02MIN: 4.54 / MAX: 4.65MIN: 4.56 / MAX: 34.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mobilenet123246810Min: 4.61 / Avg: 4.63 / Max: 4.67Min: 4.57 / Avg: 4.58 / Max: 4.58Min: 4.64 / Avg: 4.77 / Max: 5.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet1230.82351.6472.47053.2944.1175SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.663.653.66MIN: 3.6 / MAX: 3.77MIN: 3.6 / MAX: 3.7MIN: 3.61 / MAX: 3.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet123246810Min: 3.65 / Avg: 3.66 / Max: 3.68Min: 3.63 / Avg: 3.65 / Max: 3.66Min: 3.65 / Avg: 3.66 / Max: 3.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12348121620SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 316.8316.7117.03
OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12348121620Min: 16.66 / Avg: 16.83 / Max: 17.06Min: 16.66 / Avg: 16.71 / Max: 16.76Min: 16.87 / Avg: 17.03 / Max: 17.12

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12348121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 315.6115.6015.691. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12348121620Min: 15.6 / Avg: 15.61 / Max: 15.63Min: 15.59 / Avg: 15.6 / Max: 15.62Min: 15.63 / Avg: 15.69 / Max: 15.821. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast1231122334455SE +/- 0.52, N = 3SE +/- 0.64, N = 5SE +/- 0.02, N = 346.7348.2446.791. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast1231020304050Min: 45.69 / Avg: 46.73 / Max: 47.27Min: 47.55 / Avg: 48.24 / Max: 50.78Min: 46.75 / Avg: 46.79 / Max: 46.821. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 Realtime1231020304050SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 0.23, N = 345.9246.2645.781. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 Realtime123918273645Min: 45.69 / Avg: 45.92 / Max: 46.13Min: 46.13 / Avg: 46.26 / Max: 46.43Min: 45.34 / Avg: 45.78 / Max: 46.141. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoap1238001600240032004000SE +/- 18.80, N = 4SE +/- 25.69, N = 4SE +/- 42.18, N = 4365235503704
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoap1236001200180024003000Min: 3596 / Avg: 3652.25 / Max: 3675Min: 3504 / Avg: 3549.5 / Max: 3612Min: 3589 / Avg: 3704 / Max: 3792

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.91571.83142.74713.66284.5785SE +/- 0.00919, N = 3SE +/- 0.04507, N = 3SE +/- 0.00529, N = 33.999114.069703.87614MIN: 3.93MIN: 3.86MIN: 3.821. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810Min: 3.98 / Avg: 4 / Max: 4.01Min: 3.98 / Avg: 4.07 / Max: 4.13Min: 3.87 / Avg: 3.88 / Max: 3.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.5811.1621.7432.3242.905SE +/- 0.02285, N = 3SE +/- 0.03056, N = 3SE +/- 0.02621, N = 32.580812.579262.58232MIN: 2.21MIN: 2.16MIN: 2.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810Min: 2.56 / Avg: 2.58 / Max: 2.63Min: 2.55 / Avg: 2.58 / Max: 2.64Min: 2.55 / Avg: 2.58 / Max: 2.631. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein123246810SE +/- 0.086, N = 15SE +/- 0.019, N = 3SE +/- 0.197, N = 126.6006.8056.3221. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1233691215Min: 5.57 / Avg: 6.6 / Max: 6.83Min: 6.77 / Avg: 6.81 / Max: 6.83Min: 4.38 / Avg: 6.32 / Max: 6.851. (CXX) g++ options: -O3 -pthread -lm

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast12320406080100SE +/- 1.07, N = 3SE +/- 0.90, N = 9SE +/- 1.01, N = 387.1692.5387.181. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast12320406080100Min: 85.39 / Avg: 87.16 / Max: 89.09Min: 88.35 / Avg: 92.53 / Max: 97.71Min: 86.15 / Avg: 87.18 / Max: 89.21. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p1231326395265SE +/- 0.06, N = 3SE +/- 0.96, N = 3SE +/- 0.78, N = 354.4956.1454.421. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p1231122334455Min: 54.37 / Avg: 54.49 / Max: 54.56Min: 54.66 / Avg: 56.14 / Max: 57.94Min: 52.87 / Avg: 54.42 / Max: 55.241. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeans1236001200180024003000SE +/- 35.35, N = 5SE +/- 27.27, N = 4SE +/- 21.94, N = 4269026452801
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeans1235001000150020002500Min: 2577 / Avg: 2690.2 / Max: 2793Min: 2572 / Avg: 2645.25 / Max: 2697Min: 2754 / Avg: 2801.25 / Max: 2860

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.488.448.551. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215Min: 8.47 / Avg: 8.48 / Max: 8.49Min: 8.41 / Avg: 8.44 / Max: 8.47Min: 8.54 / Avg: 8.55 / Max: 8.561. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU1230.80711.61422.42133.22844.0355SE +/- 0.03967, N = 3SE +/- 0.05394, N = 3SE +/- 0.03320, N = 153.587013.241893.30402MIN: 3.47MIN: 3.1MIN: 3.11. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU123246810Min: 3.52 / Avg: 3.59 / Max: 3.66Min: 3.19 / Avg: 3.24 / Max: 3.35Min: 3.19 / Avg: 3.3 / Max: 3.591. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 417.1217.1417.20MIN: 16.89MIN: 16.92MIN: 16.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620Min: 16.96 / Avg: 17.12 / Max: 17.37Min: 16.99 / Avg: 17.14 / Max: 17.45Min: 16.94 / Avg: 17.2 / Max: 17.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression123246810SE +/- 0.014, N = 3SE +/- 0.012, N = 3SE +/- 0.013, N = 36.3856.3866.3821. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression1233691215Min: 6.37 / Avg: 6.38 / Max: 6.41Min: 6.37 / Avg: 6.39 / Max: 6.41Min: 6.37 / Avg: 6.38 / Max: 6.411. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython1238001600240032004000SE +/- 52.97, N = 4SE +/- 52.89, N = 4SE +/- 27.24, N = 4370936983718
OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython1236001200180024003000Min: 3607 / Avg: 3709.25 / Max: 3847Min: 3616 / Avg: 3697.5 / Max: 3852Min: 3661 / Avg: 3717.5 / Max: 3792

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080p123130260390520650SE +/- 1.17, N = 3SE +/- 0.67, N = 3SE +/- 1.29, N = 3581.34582.13579.67MIN: 498.37 / MAX: 635.39MIN: 511.78 / MAX: 635.13MIN: 479.84 / MAX: 634.421. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080p123100200300400500Min: 579.93 / Avg: 581.34 / Max: 583.67Min: 580.8 / Avg: 582.13 / Max: 583.01Min: 577.1 / Avg: 579.67 / Max: 581.131. (CC) gcc options: -pthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.01, N = 3SE +/- 0.27, N = 3SE +/- 0.16, N = 316.4416.7416.60MIN: 16.35MIN: 16.38MIN: 16.351. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU12348121620Min: 16.42 / Avg: 16.44 / Max: 16.46Min: 16.46 / Avg: 16.74 / Max: 17.28Min: 16.44 / Avg: 16.6 / Max: 16.931. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1231.24882.49763.74644.99526.244SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.535.525.551. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast123246810Min: 5.52 / Avg: 5.53 / Max: 5.54Min: 5.51 / Avg: 5.52 / Max: 5.53Min: 5.54 / Avg: 5.55 / Max: 5.561. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 81231.132.263.394.525.65SE +/- 0.013, N = 3SE +/- 0.019, N = 3SE +/- 0.017, N = 35.0224.9955.0171. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 8123246810Min: 5 / Avg: 5.02 / Max: 5.05Min: 4.96 / Avg: 4.99 / Max: 5.03Min: 4.99 / Avg: 5.02 / Max: 5.051. (CXX) g++ options: -O3 -fPIC

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 101231.07032.14063.21094.28125.3515SE +/- 0.009, N = 3SE +/- 0.014, N = 3SE +/- 0.010, N = 34.7574.7384.7331. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 10123246810Min: 4.74 / Avg: 4.76 / Max: 4.77Min: 4.72 / Avg: 4.74 / Max: 4.77Min: 4.72 / Avg: 4.73 / Max: 4.751. (CXX) g++ options: -O3 -fPIC

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU123246810SE +/- 0.02212, N = 3SE +/- 0.00067, N = 3SE +/- 0.09853, N = 46.642616.473496.61172MIN: 6.44MIN: 6.31MIN: 6.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU1233691215Min: 6.6 / Avg: 6.64 / Max: 6.67Min: 6.47 / Avg: 6.47 / Max: 6.47Min: 6.5 / Avg: 6.61 / Max: 6.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPU123714212835SE +/- 0.06, N = 3SE +/- 0.50, N = 15SE +/- 0.50, N = 1529.430.530.6
OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPU123714212835Min: 29.3 / Avg: 29.4 / Max: 29.5Min: 29.3 / Avg: 30.53 / Max: 34.8Min: 29.3 / Avg: 30.64 / Max: 34.6

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1237K14K21K28K35KSE +/- 39.81, N = 3SE +/- 21.47, N = 3SE +/- 122.08, N = 331760.0231509.6319217.721. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1236K12K18K24K30KMin: 31694.29 / Avg: 31760.02 / Max: 31831.8Min: 31484.3 / Avg: 31509.63 / Max: 31552.32Min: 18974.27 / Avg: 19217.72 / Max: 19355.581. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 1001230.46760.93521.40281.87042.338SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 32.0772.0762.0781. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100123246810Min: 2.07 / Avg: 2.08 / Max: 2.08Min: 2.08 / Avg: 2.08 / Max: 2.08Min: 2.08 / Avg: 2.08 / Max: 2.081. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default1230.29660.59320.88981.18641.483SE +/- 0.000, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 31.3131.3151.3181. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default123246810Min: 1.31 / Avg: 1.31 / Max: 1.31Min: 1.31 / Avg: 1.32 / Max: 1.32Min: 1.31 / Avg: 1.32 / Max: 1.331. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

165 Results Shown

LAMMPS Molecular Dynamics Simulator
Blender
Java Gradle Build
Blender
Timed LLVM Compilation
Blender:
  Pabellon Barcelona - CPU-Only
  Classroom - CPU-Only
LeelaChessZero
Incompact3D
Hierarchical INTegration
LeelaChessZero
Rodinia
BRL-CAD
Rodinia
ASTC Encoder
Blender
Rodinia
GROMACS
Caffe
Blender
Rodinia
WireGuard + Linux Networking Stack Stress Test
TensorFlow Lite:
  Inception V4
  Inception ResNet V2
Blender
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
dav1d
BYTE Unix Benchmark
Rodinia
Blender
Caffe
NAMD
Timed HMMer Search
libavif avifenc
Timed Linux Kernel Compilation
PostgreSQL pgbench:
  1 - 250 - Read Write - Average Latency
  1 - 250 - Read Write
Caffe
PostgreSQL pgbench:
  1 - 1 - Read Write - Average Latency
  1 - 1 - Read Write
InfluxDB:
  4 - 10000 - 2,5000,1 - 10000
  64 - 10000 - 2,5000,1 - 10000
  1024 - 10000 - 2,5000,1 - 10000
oneDNN
KeyDB
oneDNN
PyPerformance
NCNN:
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
  CPU - squeezenet
RawTherapee
TensorFlow Lite:
  SqueezeNet
  NASNet Mobile
  Mobilenet Quant
  Mobilenet Float
libavif avifenc
oneDNN:
  IP Batch All - u8s8f32 - CPU
  IP Batch All - f32 - CPU
Sunflow Rendering System
PyPerformance
Blender
Rodinia
Kvazaar
ASTC Encoder
x265
PyPerformance
oneDNN
Caffe
PyPerformance
oneDNN
PostgreSQL pgbench:
  1 - 250 - Read Only - Average Latency
  1 - 250 - Read Only
oneDNN
LibRaw
WebP Image Encode
Kvazaar:
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
PyPerformance
AOM AV1
PyPerformance:
  django_template
  chaos
  crypto_pyaes
DaCapo Benchmark
Blender
PyPerformance
Kvazaar
AOM AV1
PostgreSQL pgbench:
  1 - 100 - Read Write - Average Latency
  1 - 100 - Read Write
  1 - 50 - Read Write - Average Latency
  1 - 50 - Read Write
  1 - 1 - Read Only - Average Latency
  1 - 1 - Read Only
  1 - 100 - Read Only - Average Latency
  1 - 100 - Read Only
  1 - 50 - Read Only - Average Latency
  1 - 50 - Read Only
AOM AV1
dav1d
PyPerformance
OCRMyPDF
RNNoise
oneDNN
Tesseract OCR
OpenSSL
TNN
PyPerformance:
  pickle_pure_python
  json_loads
NeatBench
TNN
PyPerformance
dav1d
AOM AV1
NCNN:
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
  Vulkan GPU - squeezenet
Dolfyn
WebP Image Encode
Kvazaar
AOM AV1
DaCapo Benchmark
oneDNN:
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
LAMMPS Molecular Dynamics Simulator
Kvazaar
x265
DaCapo Benchmark
ASTC Encoder
oneDNN:
  Deconvolution Batch deconv_3d - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
WebP Image Encode
DaCapo Benchmark
dav1d
oneDNN
ASTC Encoder
libavif avifenc:
  8
  10
oneDNN
NeatBench
FFTE
WebP Image Encode:
  Quality 100
  Default