Ryzen 3 2200G 2021

AMD Ryzen 3 2200G testing with a ASUS PRIME B350M-E (5220 BIOS) and ASUS AMD Radeon Vega / Mobile 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101191-HA-RYZEN322022
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

Audio Encoding 3 Tests
AV1 3 Tests
Bioinformatics 2 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C++ Boost Tests 2 Tests
Chess Test Suite 4 Tests
Timed Code Compilation 4 Tests
C/C++ Compiler Tests 15 Tests
Compression Tests 2 Tests
CPU Massive 21 Tests
Creator Workloads 24 Tests
Database Test Suite 4 Tests
Encoding 8 Tests
Fortran Tests 6 Tests
Game Development 3 Tests
HPC - High Performance Computing 24 Tests
Imaging 6 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 9 Tests
Molecular Dynamics 9 Tests
MPI Benchmarks 4 Tests
Multi-Core 19 Tests
NVIDIA GPU Compute 7 Tests
Intel oneAPI 2 Tests
OpenMPI Tests 9 Tests
Programmer / Developer System Benchmarks 9 Tests
Python Tests 5 Tests
Scientific Computing 15 Tests
Server 7 Tests
Server CPU Tests 12 Tests
Single-Threaded 6 Tests
Speech 3 Tests
Telephony 3 Tests
Texture Compression 2 Tests
Video Encoding 5 Tests
Vulkan Compute 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
January 16 2021
  18 Hours, 35 Minutes
2
January 17 2021
  20 Hours, 52 Minutes
3
January 18 2021
  19 Hours, 6 Minutes
Invert Hiding All Results Option
  19 Hours, 31 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Ryzen 3 2200G 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123AMD Ryzen 3 2200G @ 3.50GHz (4 Cores)ASUS PRIME B350M-E (5220 BIOS)AMD Raven/Raven26GBSamsung SSD 970 EVO 250GBASUS AMD Radeon Vega / Mobile 2GB (1100/1600MHz)AMD Raven/Raven2/FenghuangG237HLRealtek RTL8111/8168/8411Ubuntu 20.105.8.0-38-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 20.2.6 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8101016 Graphics Details- GLAMORJava Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

123Result OverviewPhoronix Test Suite100%105%110%115%121%LeelaChessZeroRedisSunflow Rendering SystemNode.js V8 Web Tooling BenchmarkSockperfLULESHFFTELibRawGROMACSRNNoiseHuginOSBenchasmFishDarktableStockfishKeyDBCP2K Molecular DynamicsOpenFOAMNAMDTensorFlow LiteCraftyLAMMPS Molecular Dynamics SimulatorBYTE Unix BenchmarkTimed Godot Game Engine CompilationAOM AV1Zstd Compressionrav1eWarsowIncompact3DSQLite SpeedtestIndigoBenchLZ4 CompressionNumpy BenchmarkPHPBenchx265dav1dCoremarkDolfynMonte Carlo Simulations of Ionised NebulaeBasis UniversalWavPack Audio EncodingNCNNOCRMyPDFTimed Eigen CompilationTimed FFmpeg CompilationeSpeak-NG Speech EngineGoogle SynthMarkInfluxDBoneDNNCloverLeafTimed HMMer SearchAlgebraic Multi-Grid BenchmarkMobile Neural NetworkTimed MAFFT AlignmentTNNEmbreeMonkey Audio EncodingVKMarkRawTherapeeOpus Codec EncodingGIMPWaifu2x-NCNN VulkanUnpacking Firefoxyquake2ASTC EncoderRealSR-NCNNKvazaarHierarchical INTegrationGLmark2WebP Image EncodeBuild2CLOMPCaffesimdjson

Ryzen 3 2200G 2021incompact3d: Cylinderlczero: BLASkripke: astcenc: Exhaustivelczero: Eigengromacs: Water Benchmarkbuild2: Time To Compilebuild-godot: Time To Compilecp2k: Fayalite-FIST Datarealsr-ncnn: 4x - Yeskvazaar: Bosphorus 4K - Mediumnamd: ATPase Simulation - 327,506 Atomsopenfoam: Motorbike 30Mcompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedmocassin: Dust 2D tau100.0numpy: compress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUdav1d: Chimera 1080p 10-bitembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer - Asian Dragon Objembree: Pathtracer - Crownasmfish: 1024 Hash Memory, 26 Depthcompress-zstd: 19cloverleaf: Lagrangian-Eulerian Hydrodynamicsembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer - Asian Dragonbuild-ffmpeg: Time To Compiletensorflow-lite: Inception V4ncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenettensorflow-lite: Inception ResNet V2kvazaar: Bosphorus 4K - Very Fastmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0influxdb: 4 - 10000 - 2,5000,1 - 10000hint: FLOATclomp: Static OMP Speedupinfluxdb: 64 - 10000 - 2,5000,1 - 10000ncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetvkmark: 1920 x 1080hmmer: Pfam Database Searchnode-web-tooling: rawtherapee: Total Benchmark Timex265: Bosphorus 4Kbyte: Dhrystone 2build-eigen: Time To Compilecaffe: GoogleNet - CPU - 100glmark2: 1920 x 1080onednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Mediumastcenc: Thoroughkvazaar: Bosphorus 4K - Ultra Fastbasis: UASTC Level 2hugin: Panorama Photo Assistant + Stitching Timebasis: ETC1Ssqlite-speedtest: Timed Time - Size 1,000warsow: 1920 x 1080stockfish: Total Timesunflow: Global Illumination + Image Synthesisdav1d: Summer Nature 4Krav1e: 5keydb: dav1d: Chimera 1080pcompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedsockperf: Latency Under Loadrealsr-ncnn: 4x - Noindigobench: CPU - Bedroomlibraw: Post-Processing Benchmarkindigobench: CPU - Supercartensorflow-lite: SqueezeNettensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quantaom-av1: Speed 6 Realtimesimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDwebp: Quality 100, Lossless, Highest Compressiondarktable: Boat - CPU-onlyrav1e: 6ocrmypdf: Processing 60 Page PDF Documentencode-wavpack: WAV To WavPackonednn: Deconvolution Batch shapes_1d - f32 - CPUsimdjson: Kostyaespeak: Text-To-Speech Synthesisonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUaom-av1: Speed 6 Two-Passcompress-zstd: 3caffe: AlexNet - CPU - 100phpbench: PHP Benchmark Suitekvazaar: Bosphorus 1080p - Very Fastrnnoise: rav1e: 10onednn: IP Shapes 3D - f32 - CPUaom-av1: Speed 4 Two-Passunpack-firefox: firefox-84.0.source.tar.xzcrafty: Elapsed Timex265: Bosphorus 1080psynthmark: VoiceMark_100waifu2x-ncnn: 2x - 3 - Yesencode-ape: WAV To APEdarktable: Masskrug - CPU-onlywebp: Quality 100, Losslessaom-av1: Speed 8 Realtimedarktable: Server Room - CPU-onlykvazaar: Bosphorus 1080p - Ultra Fastdolfyn: Computational Fluid Dynamicsdav1d: Summer Nature 1080ptnn: CPU - SqueezeNet v1.1coremark: CoreMark Size 666 - Iterations Per Secondtnn: CPU - MobileNet v2astcenc: Mediumgimp: unsharp-maskredis: SETgimp: auto-levelsonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUmafft: Multiple Sequence Alignment - LSU RNAencode-opus: WAV To Opus Encodegimp: rotatesockperf: Latency Ping Pongsockperf: Throughputredis: GETgimp: resizeonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUbasis: UASTC Level 0amg: redis: LPUSHredis: LPOPredis: SADDastcenc: Fastonednn: IP Shapes 3D - u8s8f32 - CPUwebp: Quality 100, Highest Compressionyquake2: Software CPU - 1920 x 1080lammps: Rhodopsin Proteinonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUosbench: Create Fileslulesh: osbench: Memory Allocationsosbench: Launch Programsosbench: Create Processesosbench: Create Threadswaifu2x-ncnn: 2x - 3 - Noonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUwebp: Quality 100ffte: N=256, 3D Complex FFT Routinewebp: Defaultyquake2: OpenGL 3.x - 1920 x 1080darktable: Server Rack - CPU-only123810.9547124324811563696.054480.333514.482501.2001448.589482.6091.496.75407342.988565.242.22342242.348554.342.778195.2052.552.58192.81992.99032.7601774804714.0191.413.24313.3140182.189644101718.8859.2859.0071.6823.4629.17117.4632.563.2516.9810.3812.639.5910.8846.4056913103.9463.4157.3955.42450.2169.732706035.5301333349.993822721428.319.1159.3959.2872.5523.3029.04117.1932.593.3117.0710.5012.659.6711.2046.831199127.1497.38123.6844.8135499453.0113.52011008418497721.137913.617746.8736.55968437.128193.616.5184.336.8486.47982.00682.06081.222158.157181693.20651.920.848265074.45184.258722.47994.3053.77563.0280.49419.361.10746774531611230994632873310.130.350.450.4657.67025.2061.08252.67515.08222.54130.3835.13414.82152.222346.04187750810615.6022.2382.57313.07361.3823.847627427519.49596.25426.67515.95524.17224.89427.2020.69527.0121.069182.89287.255102524.442176279.33912.7717.3321489969.2515.92416.542914.632215.0428.93614.4876.9275550552064794.8312.8557.3574411.8982134915331216336.462261210.921735687.339.755.814178.87292.92.60338.839223.021318.2458881180.097181.74268481.52326026.19028114.9202354.11529.709830.80732.59515392.8099519221.648814.10.339821.0557253743117717697.503800.330516.516503.9771461.853482.8391.496.79902339.548552.141.02340241.368547.842.348438.6452.392.56702.83712.96822.7659780282814.2191.453.25043.3113183.024638994319.0659.1659.4074.1023.2429.08118.9132.643.3116.9510.4412.989.6911.3047.0156970833.9463.9977.5265.41950.2699.613696009.5301687480.188422.0725224.218.8759.7159.4273.1423.4229.04119.3832.793.3316.7610.3412.899.4910.9146.491199127.6457.74123.4054.8335791748.5113.65411032018517794.137667.947725.6335.84618277.538356.196.5184.616.8586.54083.58182.17881.423159.456282203.14852.000.844267212.95184.178646.38015.3952.75863.0230.49419.661.09846195531721330627031868510.120.350.450.4657.21625.9011.08352.74115.07722.31440.3835.31915.08252.222358.04167250605515.5222.6932.56613.16851.3823.799625501519.60596.61526.68515.99424.51824.95727.1821.00827.0721.139183.50287.063101765.566926279.62612.8317.3171486411.6315.85316.555814.870015.0008.92314.4706.7515596631931045.2012.8667.3606112.0542142322331213155.041258380.501758200.509.745.841298.87193.32.58638.972823.679118.3158461208.386681.99938281.96751326.47956214.8232784.11029.888931.41472.60115755.5852717621.662807.20.342820.322815353695.243770.326514.799502.6081452.469482.5511.506.83284338.278562.841.23341243.268547.241.818426.8453.512.58282.81512.97822.7779766904314.2191.013.24823.3432182.601646856719.0759.5059.2871.6923.4829.28118.0432.593.3316.9010.4312.759.6010.6846.3256890703.9563.2697.3135.39850.4899.867700554.6301185316.589092723222.118.7559.3459.2871.8223.5129.18117.4532.433.2916.8910.2512.709.5911.0446.321196127.3857.38123.3394.8335427649.2112.96411015718527837.727750.497701.7533.05878342.738419.826.5084.496.8386.34082.18882.07681.933159.456485893.30252.090.839269044.48183.888690.18022.4850.69963.0020.49819.791.10646740431579031321632718710.250.350.450.4657.45225.4511.08952.98615.17422.66190.3835.31215.56192.242324.24157350415915.6222.5802.63913.36611.4023.806632206919.71593.90926.67315.99524.19424.90027.2920.73927.0520.988184.81286.140102339.408641279.27812.7517.3281472539.6715.89416.610815.197515.0358.91514.4246.7905576651930168.3812.8267.3135111.8632140726331223284.421275489.171734495.839.805.791198.86993.32.61338.505523.669718.4418621208.018585.63200682.07321226.85705814.9091084.09729.121030.82082.59015437.4685789981.657807.90.344OpenBenchmarking.org

Incompact3D

Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterIncompact3D 2020-09-17Input: Cylinder1232004006008001000SE +/- 3.54, N = 3SE +/- 10.03, N = 3SE +/- 2.19, N = 3810.95821.06820.321. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS12390180270360450SE +/- 4.54, N = 8SE +/- 6.01, N = 9SE +/- 2.52, N = 34323743531. (CXX) g++ options: -flto -pthread

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4121000K2000K3000K4000K5000KSE +/- 36406.50, N = 2SE +/- 35494.54, N = 3481156331177171. (CXX) g++ options: -O3 -fopenmp

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123150300450600750SE +/- 1.18, N = 3SE +/- 0.48, N = 3SE +/- 0.20, N = 3696.05697.50695.241. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Eigen123100200300400500SE +/- 4.81, N = 3SE +/- 5.13, N = 94483803771. (CXX) g++ options: -flto -pthread

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.07490.14980.22470.29960.3745SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.005, N = 30.3330.3300.3261. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile123110220330440550SE +/- 0.43, N = 3SE +/- 2.15, N = 3SE +/- 1.15, N = 3514.48516.52514.80

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile123110220330440550SE +/- 0.16, N = 3SE +/- 0.30, N = 3SE +/- 0.32, N = 3501.20503.98502.61

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data123300600900120015001448.591461.851452.47

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: Yes123100200300400500SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3482.61482.84482.55

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.33750.6751.01251.351.6875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.491.491.501. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms123246810SE +/- 0.01425, N = 3SE +/- 0.03865, N = 3SE +/- 0.08887, N = 56.754076.799026.83284

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M12370140210280350SE +/- 1.66, N = 3SE +/- 0.27, N = 3SE +/- 2.23, N = 3342.98339.54338.271. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 6.38, N = 13SE +/- 5.73, N = 15SE +/- 3.43, N = 158565.28552.18562.81. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231020304050SE +/- 0.73, N = 13SE +/- 0.55, N = 15SE +/- 0.47, N = 1542.2241.0241.231. (CC) gcc options: -O3

Monte Carlo Simulations of Ionised Nebulae

Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.012370140210280350SE +/- 1.76, N = 3SE +/- 0.67, N = 33423403411. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lrt -lz

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12350100150200250SE +/- 0.34, N = 3SE +/- 0.33, N = 3SE +/- 0.50, N = 3242.34241.36243.26

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 26.74, N = 3SE +/- 8.28, N = 15SE +/- 5.83, N = 158554.38547.88547.21. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231020304050SE +/- 0.58, N = 3SE +/- 0.43, N = 15SE +/- 0.65, N = 1542.7742.3441.811. (CC) gcc options: -O3

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 99.84, N = 5SE +/- 66.22, N = 15SE +/- 46.50, N = 38195.208438.648426.84MIN: 7505MIN: 7752.96MIN: 8003.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit1231224364860SE +/- 0.17, N = 3SE +/- 0.21, N = 3SE +/- 0.31, N = 352.5552.3953.51MIN: 35.45 / MAX: 124.71MIN: 35.47 / MAX: 120.48MIN: 35.6 / MAX: 125.131. (CC) gcc options: -pthread -ldl -lm

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown1230.58111.16221.74332.32442.9055SE +/- 0.0040, N = 3SE +/- 0.0105, N = 3SE +/- 0.0165, N = 32.58192.56702.5828MIN: 2.55 / MAX: 2.62MIN: 2.52 / MAX: 2.63MIN: 2.51 / MAX: 2.65

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj1230.63831.27661.91492.55323.1915SE +/- 0.0119, N = 3SE +/- 0.0157, N = 3SE +/- 0.0152, N = 32.81992.83712.8151MIN: 2.75 / MAX: 2.92MIN: 2.77 / MAX: 2.9MIN: 2.75 / MAX: 2.89

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj1230.67281.34562.01842.69123.364SE +/- 0.0237, N = 3SE +/- 0.0135, N = 3SE +/- 0.0217, N = 32.99032.96822.9782MIN: 2.9 / MAX: 3.08MIN: 2.9 / MAX: 3.07MIN: 2.91 / MAX: 3.08

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1230.6251.251.8752.53.125SE +/- 0.0145, N = 3SE +/- 0.0043, N = 3SE +/- 0.0087, N = 32.76012.76592.7779MIN: 2.71 / MAX: 2.86MIN: 2.73 / MAX: 2.83MIN: 2.75 / MAX: 2.87

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1232M4M6M8M10MSE +/- 28500.12, N = 3SE +/- 50254.79, N = 3SE +/- 29445.58, N = 3774804778028287669043

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 1912348121620SE +/- 0.18, N = 5SE +/- 0.03, N = 3SE +/- 0.06, N = 314.014.214.21. (CC) gcc options: -O3 -pthread -lz -llzma

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics1234080120160200SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3191.41191.45191.011. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon1230.73131.46262.19392.92523.6565SE +/- 0.0133, N = 3SE +/- 0.0130, N = 3SE +/- 0.0025, N = 33.24313.25043.2482MIN: 3.18 / MAX: 3.32MIN: 3.19 / MAX: 3.32MIN: 3.2 / MAX: 3.32

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon1230.75221.50442.25663.00883.761SE +/- 0.0186, N = 3SE +/- 0.0143, N = 3SE +/- 0.0299, N = 33.31403.31133.3432MIN: 3.25 / MAX: 3.4MIN: 3.26 / MAX: 3.4MIN: 3.25 / MAX: 3.45

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1234080120160200SE +/- 0.34, N = 3SE +/- 0.22, N = 3SE +/- 0.72, N = 3182.19183.02182.60

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V41231.4M2.8M4.2M5.6M7MSE +/- 24424.39, N = 3SE +/- 7475.10, N = 3SE +/- 4440.22, N = 3644101763899436468567

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m123510152025SE +/- 0.09, N = 3SE +/- 0.15, N = 4SE +/- 0.01, N = 318.8819.0619.07MIN: 16.76 / MAX: 26.52MIN: 16.61 / MAX: 34.07MIN: 16.77 / MAX: 34.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd1231326395265SE +/- 0.08, N = 3SE +/- 0.14, N = 4SE +/- 0.16, N = 359.2859.1659.50MIN: 53.06 / MAX: 77.54MIN: 51.95 / MAX: 72.82MIN: 52.65 / MAX: 71.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231326395265SE +/- 0.04, N = 3SE +/- 0.06, N = 4SE +/- 0.16, N = 359.0059.4059.28MIN: 55.05 / MAX: 74.18MIN: 55.12 / MAX: 75.24MIN: 54.59 / MAX: 74.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501231632486480SE +/- 0.20, N = 3SE +/- 0.63, N = 4SE +/- 0.29, N = 371.6874.1071.69MIN: 65.87 / MAX: 90.56MIN: 66.26 / MAX: 110.26MIN: 66.47 / MAX: 91.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet123612182430SE +/- 0.08, N = 3SE +/- 0.03, N = 4SE +/- 0.02, N = 323.4623.2423.48MIN: 21.29 / MAX: 37.36MIN: 21.11 / MAX: 37.35MIN: 21.25 / MAX: 36.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18123714212835SE +/- 0.26, N = 3SE +/- 0.14, N = 4SE +/- 0.02, N = 329.1729.0829.28MIN: 26 / MAX: 40.97MIN: 25.74 / MAX: 44.35MIN: 25.57 / MAX: 39.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16123306090120150SE +/- 0.37, N = 3SE +/- 0.24, N = 4SE +/- 0.16, N = 3117.46118.91118.04MIN: 111.97 / MAX: 149.37MIN: 113.22 / MAX: 141.78MIN: 112.22 / MAX: 141.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123816243240SE +/- 0.20, N = 3SE +/- 0.12, N = 4SE +/- 0.13, N = 332.5632.6432.59MIN: 28.77 / MAX: 47.43MIN: 28.94 / MAX: 42.38MIN: 28.42 / MAX: 45.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1230.74931.49862.24792.99723.7465SE +/- 0.01, N = 3SE +/- 0.02, N = 4SE +/- 0.03, N = 33.253.313.33MIN: 2.61 / MAX: 14.28MIN: 2.6 / MAX: 4.93MIN: 2.62 / MAX: 5.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b012348121620SE +/- 0.07, N = 3SE +/- 0.26, N = 4SE +/- 0.14, N = 316.9816.9516.90MIN: 14.01 / MAX: 31.1MIN: 14.04 / MAX: 27.15MIN: 13.97 / MAX: 30.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet1233691215SE +/- 0.08, N = 3SE +/- 0.20, N = 4SE +/- 0.08, N = 310.3810.4410.43MIN: 8.43 / MAX: 17.05MIN: 8.36 / MAX: 26.87MIN: 8.39 / MAX: 21.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21233691215SE +/- 0.09, N = 3SE +/- 0.16, N = 4SE +/- 0.20, N = 312.6312.9812.75MIN: 10.39 / MAX: 25.47MIN: 10.26 / MAX: 24.78MIN: 10.36 / MAX: 21.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31233691215SE +/- 0.13, N = 3SE +/- 0.14, N = 4SE +/- 0.09, N = 39.599.699.60MIN: 7.78 / MAX: 22.63MIN: 7.81 / MAX: 18.31MIN: 7.73 / MAX: 19.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.01, N = 3SE +/- 0.21, N = 4SE +/- 0.38, N = 310.8811.3010.68MIN: 8.93 / MAX: 27.73MIN: 8.86 / MAX: 23.26MIN: 8.89 / MAX: 17.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet1231122334455SE +/- 0.03, N = 3SE +/- 0.66, N = 4SE +/- 0.07, N = 346.4047.0146.32MIN: 42.8 / MAX: 59.97MIN: 42.66 / MAX: 60.93MIN: 42.77 / MAX: 61.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V21231.2M2.4M3.6M4.8M6MSE +/- 3601.25, N = 3SE +/- 3137.20, N = 3SE +/- 6688.08, N = 3569131056970835689070

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1230.88881.77762.66643.55524.444SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.943.943.951. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31231428425670SE +/- 0.18, N = 3SE +/- 0.32, N = 3SE +/- 0.19, N = 363.4264.0063.27MIN: 60.02 / MAX: 120.02MIN: 60.33 / MAX: 93.06MIN: 60.45 / MAX: 98.451. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.0123246810SE +/- 0.021, N = 3SE +/- 0.065, N = 3SE +/- 0.030, N = 37.3957.5267.313MIN: 6.57 / MAX: 16.47MIN: 6.6 / MAX: 20.05MIN: 6.61 / MAX: 17.321. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241231.22042.44083.66124.88166.102SE +/- 0.019, N = 3SE +/- 0.038, N = 3SE +/- 0.037, N = 35.4245.4195.398MIN: 4.8 / MAX: 14.75MIN: 4.88 / MAX: 15.69MIN: 4.83 / MAX: 15.641. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-501231122334455SE +/- 0.51, N = 3SE +/- 0.33, N = 3SE +/- 0.27, N = 350.2250.2750.49MIN: 47.21 / MAX: 83.29MIN: 47.5 / MAX: 72.85MIN: 47.85 / MAX: 147.841. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.01233691215SE +/- 0.127, N = 3SE +/- 0.039, N = 3SE +/- 0.066, N = 39.7329.6139.867MIN: 8.67 / MAX: 18.76MIN: 8.69 / MAX: 20.7MIN: 8.73 / MAX: 39.421. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123150K300K450K600K750KSE +/- 8641.46, N = 3SE +/- 6558.55, N = 3SE +/- 5594.95, N = 3706035.5696009.5700554.6

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT12360M120M180M240M300MSE +/- 252430.64, N = 3SE +/- 109501.99, N = 3SE +/- 702868.97, N = 3301333349.99301687480.19301185316.591. (CC) gcc options: -O3 -march=native -lm

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1230.450.91.351.82.25SE +/- 0.03, N = 32.02.02.01. (CC) gcc options: -fopenmp -O3 -lm

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123160K320K480K640K800KSE +/- 1902.33, N = 3SE +/- 1666.30, N = 3SE +/- 3366.02, N = 3721428.3725224.2723222.1

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m123510152025SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.18, N = 319.1118.8718.75MIN: 16.69 / MAX: 35.81MIN: 16.81 / MAX: 33.23MIN: 16.84 / MAX: 32.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1231326395265SE +/- 0.05, N = 3SE +/- 0.20, N = 3SE +/- 0.27, N = 359.3959.7159.34MIN: 53.07 / MAX: 79.98MIN: 52.9 / MAX: 78.52MIN: 52.64 / MAX: 71.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231326395265SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 359.2859.4259.28MIN: 55.69 / MAX: 71.74MIN: 55.13 / MAX: 75.95MIN: 55.36 / MAX: 72.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501231632486480SE +/- 0.82, N = 3SE +/- 0.39, N = 3SE +/- 0.19, N = 372.5573.1471.82MIN: 66.75 / MAX: 91.97MIN: 66.24 / MAX: 103.74MIN: 65.77 / MAX: 87.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123612182430SE +/- 0.11, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 323.3023.4223.51MIN: 21.24 / MAX: 38.11MIN: 21.27 / MAX: 37.51MIN: 21.21 / MAX: 37.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18123714212835SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 0.33, N = 329.0429.0429.18MIN: 25.89 / MAX: 42.02MIN: 25.69 / MAX: 36.07MIN: 25.64 / MAX: 44.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16123306090120150SE +/- 0.18, N = 3SE +/- 0.23, N = 3SE +/- 0.17, N = 3117.19119.38117.45MIN: 112.25 / MAX: 143.19MIN: 113.9 / MAX: 142.52MIN: 112.54 / MAX: 135.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123816243240SE +/- 0.18, N = 3SE +/- 0.05, N = 3SE +/- 0.18, N = 332.5932.7932.43MIN: 28.72 / MAX: 51.96MIN: 28.77 / MAX: 48.75MIN: 28.44 / MAX: 46.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.74931.49862.24792.99723.7465SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 33.313.333.29MIN: 2.64 / MAX: 9.91MIN: 2.62 / MAX: 5.11MIN: 2.73 / MAX: 4.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b012348121620SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 317.0716.7616.89MIN: 14.1 / MAX: 30.1MIN: 13.96 / MAX: 31.87MIN: 14.11 / MAX: 30.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1233691215SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 310.5010.3410.25MIN: 8.45 / MAX: 18.4MIN: 8.42 / MAX: 16.24MIN: 8.43 / MAX: 24.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21233691215SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 312.6512.8912.70MIN: 10.41 / MAX: 23.3MIN: 10.42 / MAX: 26.77MIN: 10.47 / MAX: 19.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31233691215SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 39.679.499.59MIN: 7.8 / MAX: 15.65MIN: 7.78 / MAX: 14.76MIN: 7.81 / MAX: 16.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 311.2010.9111.04MIN: 8.97 / MAX: 20.5MIN: 8.91 / MAX: 18.07MIN: 8.96 / MAX: 21.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet1231122334455SE +/- 0.46, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 346.8346.4946.32MIN: 42.34 / MAX: 64.41MIN: 42.72 / MAX: 62.2MIN: 43.57 / MAX: 62.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VKMark

VKMark is a collection of Vulkan tests/benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 1080123300600900120015001199119911961. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.03, N = 3SE +/- 0.26, N = 3SE +/- 0.08, N = 3127.15127.65127.391. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Node.js V8 Web Tooling Benchmark

Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark123246810SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 47.387.747.381. Nodejs v12.18.2

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time123306090120150SE +/- 0.06, N = 3SE +/- 0.21, N = 3SE +/- 0.14, N = 3123.68123.41123.341. RawTherapee, version 5.8, command line.

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K1231.08682.17363.26044.34725.434SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 34.814.834.831. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 21238M16M24M32M40MSE +/- 289585.60, N = 3SE +/- 208729.08, N = 3SE +/- 503813.27, N = 335499453.035791748.535427649.2

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile123306090120150SE +/- 0.54, N = 3SE +/- 0.19, N = 3SE +/- 0.21, N = 3113.52113.65112.96

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KSE +/- 106.80, N = 3SE +/- 38.96, N = 3SE +/- 223.03, N = 31100841103201101571. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

GLmark2

This is a test of Linaro's glmark2 port, currently using the X11 OpenGL 2.0 target. GLmark2 is a basic OpenGL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterGLmark2 2020.04Resolution: 1920 x 1080123400800120016002000184918511852

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 11.25, N = 3SE +/- 85.05, N = 3SE +/- 30.28, N = 37721.137794.137837.72MIN: 7547.72MIN: 7534.49MIN: 7613.071. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 101.76, N = 3SE +/- 22.70, N = 3SE +/- 13.61, N = 37913.617667.947750.49MIN: 7617.25MIN: 7509.56MIN: 7562.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12317003400510068008500SE +/- 38.84, N = 3SE +/- 27.04, N = 3SE +/- 16.23, N = 37746.877725.637701.75MIN: 7556.29MIN: 7520.66MIN: 7494.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU123816243240SE +/- 1.59, N = 15SE +/- 1.77, N = 15SE +/- 1.44, N = 1236.5635.8533.06MIN: 27.16MIN: 27.16MIN: 26.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 133.51, N = 3SE +/- 86.65, N = 3SE +/- 61.07, N = 38437.128277.538342.73MIN: 7874.47MIN: 7767.68MIN: 7929.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 128.90, N = 3SE +/- 144.66, N = 3SE +/- 28.77, N = 38193.618356.198419.82MIN: 7671.11MIN: 7776.27MIN: 8053.441. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.516.516.501. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough12320406080100SE +/- 0.21, N = 3SE +/- 0.21, N = 3SE +/- 0.04, N = 384.3384.6184.491. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast123246810SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 36.846.856.831. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212320406080100SE +/- 0.24, N = 3SE +/- 0.18, N = 3SE +/- 0.16, N = 386.4886.5486.341. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Hugin

Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterHuginPanorama Photo Assistant + Stitching Time12320406080100SE +/- 0.57, N = 3SE +/- 0.26, N = 3SE +/- 0.13, N = 382.0183.5882.19

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S12320406080100SE +/- 0.05, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 382.0682.1882.081. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.14, N = 3SE +/- 0.76, N = 3SE +/- 0.70, N = 381.2281.4281.931. (CC) gcc options: -O2 -ldl -lz -lpthread

Warsow

This is a benchmark of Warsow, a popular open-source first-person shooter. This game uses the QFusion engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 10801234080120160200SE +/- 1.30, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 3158.1159.4159.4

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1231.2M2.4M3.6M4.8M6MSE +/- 48644.36, N = 3SE +/- 74806.22, N = 3SE +/- 39149.01, N = 35718169562822056485891. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.7431.4862.2292.9723.715SE +/- 0.041, N = 3SE +/- 0.032, N = 3SE +/- 0.028, N = 153.2063.1483.302MIN: 2.88 / MAX: 3.79MIN: 2.89 / MAX: 3.84MIN: 2.87 / MAX: 4.18

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K1231224364860SE +/- 0.35, N = 3SE +/- 0.37, N = 3SE +/- 0.44, N = 351.9252.0052.09MIN: 48.08 / MAX: 61.73MIN: 48.01 / MAX: 61.72MIN: 48.07 / MAX: 61.571. (CC) gcc options: -pthread -ldl -lm

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 51230.19080.38160.57240.76320.954SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.8480.8440.839

KeyDB

A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.1612360K120K180K240K300KSE +/- 3138.20, N = 3SE +/- 2051.97, N = 3SE +/- 1852.38, N = 3265074.45267212.95269044.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p1234080120160200SE +/- 0.04, N = 3SE +/- 1.76, N = 3SE +/- 1.89, N = 3184.25184.17183.88MIN: 129.28 / MAX: 331.16MIN: 127.79 / MAX: 333.51MIN: 127.66 / MAX: 340.881. (CC) gcc options: -pthread -ldl -lm

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 6.35, N = 3SE +/- 8.66, N = 3SE +/- 57.65, N = 38722.48646.38690.11. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1232K4K6K8K10KSE +/- 51.24, N = 3SE +/- 41.71, N = 3SE +/- 92.88, N = 37994.308015.398022.481. (CC) gcc options: -O3

Sockperf

This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under Load1231224364860SE +/- 2.25, N = 20SE +/- 1.88, N = 25SE +/- 1.93, N = 2553.7852.7650.701. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: No1231428425670SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 363.0363.0263.00

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.11210.22420.33630.44840.5605SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.002, N = 30.4940.4940.498

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmark123510152025SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 319.3619.6619.791. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.24910.49820.74730.99641.2455SE +/- 0.004, N = 3SE +/- 0.009, N = 3SE +/- 0.002, N = 31.1071.0981.106

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNet123100K200K300K400K500KSE +/- 158.13, N = 3SE +/- 1459.42, N = 3SE +/- 564.75, N = 3467745461955467404

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobile12370K140K210K280K350KSE +/- 995.45, N = 3SE +/- 384.90, N = 3SE +/- 504.11, N = 3316112317213315790

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Float12370K140K210K280K350KSE +/- 172.14, N = 3SE +/- 1840.34, N = 3SE +/- 2441.23, N = 3309946306270313216

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quant12370K140K210K280K350KSE +/- 1183.02, N = 3SE +/- 1025.34, N = 3SE +/- 1866.22, N = 3328733318685327187

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Realtime1233691215SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 310.1310.1210.251. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.07880.15760.23640.31520.394SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.350.350.351. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.10130.20260.30390.40520.5065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.450.450.451. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.10350.2070.31050.4140.5175SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.460.460.461. (CXX) g++ options: -O3 -pthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression1231326395265SE +/- 0.26, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 357.6757.2257.451. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.2.1Test: Boat - Acceleration: CPU-only123612182430SE +/- 0.09, N = 3SE +/- 0.25, N = 13SE +/- 0.09, N = 325.2125.9025.45

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61230.2450.490.7350.981.225SE +/- 0.005, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 31.0821.0831.089

OCRMyPDF

OCRMyPDF is an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs with the text now selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 10.3.1+dfsgProcessing 60 Page PDF Document1231224364860SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 352.6852.7452.99

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.09, N = 2115.0815.0815.171. (CXX) g++ options: -rdynamic

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU123510152025SE +/- 0.26, N = 3SE +/- 0.18, N = 15SE +/- 0.14, N = 322.5422.3122.66MIN: 17.75MIN: 17.69MIN: 17.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.08550.1710.25650.3420.4275SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.380.380.381. (CXX) g++ options: -O3 -pthread

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123816243240SE +/- 0.11, N = 4SE +/- 0.12, N = 4SE +/- 0.11, N = 435.1335.3235.311. (CC) gcc options: -O2 -std=c99

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.22, N = 15SE +/- 0.18, N = 15SE +/- 0.09, N = 314.8215.0815.56MIN: 11.81MIN: 12.35MIN: 13.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-Pass1230.5041.0081.5122.0162.52SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.222.222.241. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 31235001000150020002500SE +/- 27.78, N = 3SE +/- 16.00, N = 3SE +/- 7.82, N = 32346.02358.02324.21. (CC) gcc options: -O3 -pthread -lz -llzma

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001239K18K27K36K45KSE +/- 193.32, N = 3SE +/- 90.86, N = 3SE +/- 137.35, N = 34187741672415731. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123110K220K330K440K550KSE +/- 423.52, N = 3SE +/- 1952.23, N = 3SE +/- 2233.09, N = 3508106506055504159

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast12348121620SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 315.6015.5215.621. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28123510152025SE +/- 0.03, N = 3SE +/- 0.23, N = 8SE +/- 0.35, N = 322.2422.6922.581. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 101230.59381.18761.78142.37522.969SE +/- 0.015, N = 3SE +/- 0.008, N = 3SE +/- 0.007, N = 32.5732.5662.639

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.18, N = 15SE +/- 0.17, N = 15SE +/- 0.06, N = 313.0713.1713.37MIN: 10.63MIN: 10.78MIN: 12.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 4 Two-Pass1230.3150.630.9451.261.575SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.381.381.401. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Unpacking Firefox

This simple test profile measures how long it takes to extract the .tar.xz source package of the Mozilla Firefox Web Browser. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz123612182430SE +/- 0.03, N = 4SE +/- 0.03, N = 4SE +/- 0.08, N = 423.8523.8023.81

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.4M2.8M4.2M5.6M7MSE +/- 2855.69, N = 3SE +/- 20483.82, N = 3SE +/- 23149.53, N = 36274275625501563220691. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123510152025SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 319.4919.6019.711. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100123130260390520650SE +/- 1.56, N = 3SE +/- 0.89, N = 3SE +/- 1.20, N = 3596.25596.62593.911. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes123612182430SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 326.6826.6926.67

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE12348121620SE +/- 0.05, N = 5SE +/- 0.08, N = 5SE +/- 0.05, N = 515.9615.9916.001. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.2.1Test: Masskrug - Acceleration: CPU-only123612182430SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 324.1724.5224.19

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless123612182430SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.21, N = 324.8924.9624.901. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 Realtime123612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 327.2027.1827.291. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.2.1Test: Server Room - Acceleration: CPU-only123510152025SE +/- 0.21, N = 3SE +/- 0.12, N = 3SE +/- 0.19, N = 320.7021.0120.74

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123612182430SE +/- 0.16, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 327.0127.0727.051. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics123510152025SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 321.0721.1420.99

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p1234080120160200SE +/- 0.82, N = 3SE +/- 0.30, N = 3SE +/- 0.36, N = 3182.89183.50184.81MIN: 167.96 / MAX: 203.4MIN: 169.65 / MAX: 201.98MIN: 171.93 / MAX: 203.271. (CC) gcc options: -pthread -ldl -lm

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.112360120180240300SE +/- 0.12, N = 3SE +/- 0.28, N = 3SE +/- 0.02, N = 3287.26287.06286.14MIN: 286.22 / MAX: 288.27MIN: 286.17 / MAX: 287.91MIN: 285.41 / MAX: 286.851. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12320K40K60K80K100KSE +/- 816.07, N = 3SE +/- 290.85, N = 3SE +/- 373.76, N = 3102524.44101765.57102339.411. (CC) gcc options: -O2 -lrt" -lrt

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212360120180240300SE +/- 0.05, N = 3SE +/- 0.44, N = 3SE +/- 0.29, N = 3279.34279.63279.28MIN: 276.5 / MAX: 293.72MIN: 276.69 / MAX: 295.47MIN: 276.39 / MAX: 296.431. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 312.7712.8312.751. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

GIMP

GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGIMP 2.10.18Test: unsharp-mask12348121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 317.3317.3217.33

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123300K600K900K1200K1500KSE +/- 5888.95, N = 3SE +/- 19097.96, N = 3SE +/- 15253.86, N = 81489969.251486411.631472539.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GIMP

GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGIMP 2.10.18Test: auto-levels12348121620SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 315.9215.8515.89

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU12348121620SE +/- 0.17, N = 3SE +/- 0.19, N = 3SE +/- 0.24, N = 316.5416.5616.61MIN: 13.54MIN: 13.67MIN: 13.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 314.6314.8715.20MIN: 13.38MIN: 13.38MIN: 13.471. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA12348121620SE +/- 0.21, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 315.0415.0015.041. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode123246810SE +/- 0.033, N = 5SE +/- 0.030, N = 5SE +/- 0.021, N = 58.9368.9238.9151. (CXX) g++ options: -fvisibility=hidden -logg -lm

GIMP

GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGIMP 2.10.18Test: rotate12348121620SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 314.4914.4714.42

Sockperf

This is a network socket API performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping Pong123246810SE +/- 0.065, N = 5SE +/- 0.049, N = 5SE +/- 0.074, N = 56.9276.7516.7901. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: Throughput123120K240K360K480K600KSE +/- 6800.99, N = 5SE +/- 3595.18, N = 5SE +/- 3270.45, N = 55550555596635576651. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123400K800K1200K1600K2000KSE +/- 35016.53, N = 3SE +/- 23617.43, N = 5SE +/- 22292.08, N = 32064794.831931045.201930168.381. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GIMP

GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGIMP 2.10.18Test: resize1233691215SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 312.8612.8712.83

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810SE +/- 0.02489, N = 3SE +/- 0.00711, N = 3SE +/- 0.00751, N = 37.357447.360617.31351MIN: 6.35MIN: 6.34MIN: 6.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.00, N = 311.9012.0511.861. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.212350M100M150M200M250MSE +/- 326563.65, N = 3SE +/- 543041.48, N = 3SE +/- 483314.65, N = 32134915332142322332140726331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 16215.78, N = 3SE +/- 2985.46, N = 3SE +/- 4396.42, N = 31216336.461213155.041223284.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 16948.58, N = 3SE +/- 11741.76, N = 3SE +/- 8952.66, N = 32261210.921258380.501275489.171. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 11595.14, N = 3SE +/- 21162.12, N = 3SE +/- 4337.84, N = 31735687.331758200.501734495.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 39.759.749.801. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1231.31432.62863.94295.25726.5715SE +/- 0.02077, N = 3SE +/- 0.01362, N = 3SE +/- 0.01488, N = 35.814175.841295.79119MIN: 5.17MIN: 5.26MIN: 5.241. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression123246810SE +/- 0.001, N = 3SE +/- 0.011, N = 3SE +/- 0.013, N = 38.8728.8718.8691. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108012320406080100SE +/- 0.38, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 392.993.393.31. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.58791.17581.76372.35162.9395SE +/- 0.015, N = 3SE +/- 0.030, N = 3SE +/- 0.031, N = 32.6032.5862.6131. (CXX) g++ options: -O3 -pthread -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123918273645SE +/- 0.18, N = 3SE +/- 0.23, N = 3SE +/- 0.51, N = 338.8438.9738.51MIN: 35.67MIN: 35.93MIN: 35.631. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123612182430SE +/- 0.34, N = 3SE +/- 0.20, N = 3SE +/- 0.14, N = 323.0223.6823.67MIN: 19.03MIN: 20.06MIN: 20.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OSBench

OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Files123510152025SE +/- 0.24, N = 3SE +/- 0.21, N = 3SE +/- 0.12, N = 318.2518.3218.441. (CC) gcc options: -lm

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.312330060090012001500SE +/- 0.53, N = 3SE +/- 0.69, N = 3SE +/- 2.03, N = 31180.101208.391208.021. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

OSBench

OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory Allocations12320406080100SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 1.45, N = 381.7482.0085.631. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Launch Programs12320406080100SE +/- 0.27, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 381.5281.9782.071. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Processes123612182430SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.21, N = 326.1926.4826.861. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Threads12348121620SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 314.9214.8214.911. (CC) gcc options: -lm

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No1230.92591.85182.77773.70364.6295SE +/- 0.023, N = 3SE +/- 0.005, N = 3SE +/- 0.005, N = 34.1154.1104.097

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU123714212835SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 329.7129.8929.12MIN: 26.4MIN: 26.42MIN: 26.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123714212835SE +/- 0.15, N = 3SE +/- 0.28, N = 3SE +/- 0.14, N = 330.8131.4130.82MIN: 22.57MIN: 22.58MIN: 22.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 1001230.58521.17041.75562.34082.926SE +/- 0.009, N = 3SE +/- 0.007, N = 3SE +/- 0.004, N = 32.5952.6012.5901. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1233K6K9K12K15KSE +/- 120.97, N = 3SE +/- 110.35, N = 3SE +/- 161.49, N = 315392.8115755.5915437.471. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default1230.3740.7481.1221.4961.87SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.009, N = 31.6481.6621.6571. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10801232004006008001000SE +/- 4.11, N = 3SE +/- 3.33, N = 3SE +/- 4.89, N = 3814.1807.2807.91. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.2.1Test: Server Rack - Acceleration: CPU-only1230.07740.15480.23220.30960.387SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.004, N = 30.3390.3420.344

191 Results Shown

Incompact3D
LeelaChessZero
Kripke
ASTC Encoder
LeelaChessZero
GROMACS
Build2
Timed Godot Game Engine Compilation
CP2K Molecular Dynamics
RealSR-NCNN
Kvazaar
NAMD
OpenFOAM
LZ4 Compression:
  9 - Decompression Speed
  9 - Compression Speed
Monte Carlo Simulations of Ionised Nebulae
Numpy Benchmark
LZ4 Compression:
  3 - Decompression Speed
  3 - Compression Speed
oneDNN
dav1d
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer - Asian Dragon Obj
  Pathtracer - Crown
asmFish
Zstd Compression
CloverLeaf
Embree:
  Pathtracer ISPC - Asian Dragon
  Pathtracer - Asian Dragon
Timed FFmpeg Compilation
TensorFlow Lite
NCNN:
  Vulkan GPU - regnety_400m
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
TensorFlow Lite
Kvazaar
Mobile Neural Network:
  inception-v3
  mobilenet-v1-1.0
  MobileNetV2_224
  resnet-v2-50
  SqueezeNetV1.0
InfluxDB
Hierarchical INTegration
CLOMP
InfluxDB
NCNN:
  CPU - regnety_400m
  CPU - squeezenet_ssd
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
VKMark
Timed HMMer Search
Node.js V8 Web Tooling Benchmark
RawTherapee
x265
BYTE Unix Benchmark
Timed Eigen Compilation
Caffe
GLmark2
oneDNN:
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
Kvazaar
ASTC Encoder
Kvazaar
Basis Universal
Hugin
Basis Universal
SQLite Speedtest
Warsow
Stockfish
Sunflow Rendering System
dav1d
rav1e
KeyDB
dav1d
LZ4 Compression:
  1 - Decompression Speed
  1 - Compression Speed
Sockperf
RealSR-NCNN
IndigoBench
LibRaw
IndigoBench
TensorFlow Lite:
  SqueezeNet
  NASNet Mobile
  Mobilenet Float
  Mobilenet Quant
AOM AV1
simdjson:
  LargeRand
  PartialTweets
  DistinctUserID
WebP Image Encode
Darktable
rav1e
OCRMyPDF
WavPack Audio Encoding
oneDNN
simdjson
eSpeak-NG Speech Engine
oneDNN
AOM AV1
Zstd Compression
Caffe
PHPBench
Kvazaar
RNNoise
rav1e
oneDNN
AOM AV1
Unpacking Firefox
Crafty
x265
Google SynthMark
Waifu2x-NCNN Vulkan
Monkey Audio Encoding
Darktable
WebP Image Encode
AOM AV1
Darktable
Kvazaar
Dolfyn
dav1d
TNN
Coremark
TNN
ASTC Encoder
GIMP
Redis
GIMP
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
Timed MAFFT Alignment
Opus Codec Encoding
GIMP
Sockperf:
  Latency Ping Pong
  Throughput
Redis
GIMP
oneDNN
Basis Universal
Algebraic Multi-Grid Benchmark
Redis:
  LPUSH
  LPOP
  SADD
ASTC Encoder
oneDNN
WebP Image Encode
yquake2
LAMMPS Molecular Dynamics Simulator
oneDNN:
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
OSBench
LULESH
OSBench:
  Memory Allocations
  Launch Programs
  Create Processes
  Create Threads
Waifu2x-NCNN Vulkan
oneDNN:
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
WebP Image Encode
FFTE
WebP Image Encode
yquake2
Darktable