ubuntu-2010-onlogic

Intel Xeon E-2278GEL testing with a Logic Supply RXM-181 (Z01-0001A027 BIOS) and Intel UHD P630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102011-HA-UBUNTU20190
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Audio Encoding 3 Tests
AV1 2 Tests
Bioinformatics 3 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C++ Boost Tests 3 Tests
Chess Test Suite 3 Tests
Timed Code Compilation 4 Tests
C/C++ Compiler Tests 11 Tests
Compression Tests 3 Tests
CPU Massive 19 Tests
Creator Workloads 16 Tests
Cryptography 2 Tests
Database Test Suite 2 Tests
Encoding 5 Tests
Finance 2 Tests
Fortran Tests 4 Tests
Game Development 4 Tests
HPC - High Performance Computing 21 Tests
LAPACK (Linear Algebra Pack) Tests 2 Tests
Machine Learning 7 Tests
Molecular Dynamics 6 Tests
MPI Benchmarks 5 Tests
Multi-Core 17 Tests
NVIDIA GPU Compute 6 Tests
Intel oneAPI 2 Tests
OpenMPI Tests 11 Tests
Programmer / Developer System Benchmarks 10 Tests
Python 2 Tests
Scientific Computing 12 Tests
Server 5 Tests
Server CPU Tests 11 Tests
Single-Threaded 5 Tests
Speech 2 Tests
Telephony 2 Tests
Texture Compression 3 Tests
Video Encoding 2 Tests
Vulkan Compute 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
January 30 2021
  1 Hour, 11 Minutes
1a
January 30 2021
  19 Hours, 5 Minutes
2
January 31 2021
  19 Hours, 24 Minutes
3
February 01 2021
  8 Hours, 55 Minutes
Invert Hiding All Results Option
  12 Hours, 9 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ubuntu-2010-onlogicProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution11a23Intel Xeon E-2278GEL @ 3.90GHz (8 Cores / 16 Threads)Logic Supply RXM-181 (Z01-0001A027 BIOS)Intel Cannon Lake PCH16GB512GB TS512GMTE510TIntel UHD P630 3GB (1150MHz)Realtek ALC233DELL P2415QIntel I219-LM + 2 x Intel I210Ubuntu 20.105.8.0-41-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.9intel4.6 Mesa 20.2.61.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

ubuntu-2010-onlogicqe: AUSURF112webp2: Quality 100, Lossless Compressionvkfft: basis: UASTC Level 2 + RDO Post-Processingwebp2: Quality 95, Compression Effort 7cp2k: Fayalite-FIST Dataai-benchmark: Device AI Scoreai-benchmark: Device Training Scoreai-benchmark: Device Inference Scorenpb: EP.Dopenfoam: Motorbike 30Mastcenc: Exhaustivewebp2: Quality 75, Compression Effort 7gromacs: Water Benchmarkbrl-cad: VGR Performance Metriccloverleaf: Lagrangian-Eulerian Hydrodynamicsnumpy: asmfish: 1024 Hash Memory, 26 Depthbuild2: Time To Compilegcrypt: build-godot: Time To Compilekripke: compress-zstd: 19dav1d: Chimera 1080p 10-bitnpb: LU.Chmmer: Pfam Database Searchvkmark: 1920 x 1080askap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingembree: Pathtracer - Crownonnx: fcn-resnet101-11 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: yolov4 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUvkresample: 2x - Doubleonnx: super-resolution-10 - OpenMP CPUmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0coremark: CoreMark Size 666 - Iterations Per Secondembree: Pathtracer - Asian Dragon Objespeak: Text-To-Speech Synthesisncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetstockfish: Total Timebasis: UASTC Level 3onednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUembree: Pathtracer ISPC - Asian Dragon Objclomp: Static OMP Speedupbuild-eigen: Time To Compilebuild-ffmpeg: Time To Compileonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUwarsow: 1920 x 1080onednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUnode-web-tooling: financebench: Bonds OpenMPembree: Pathtracer ISPC - Crownembree: Pathtracer - Asian Dragonsimdjson: Kostyaembree: Pathtracer ISPC - Asian Dragoncompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedonednn: IP Shapes 1D - u8s8f32 - CPUaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingindigobench: CPU - Bedroomindigobench: CPU - Supercarsqlite-speedtest: Timed Time - Size 1,000ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapcython-bench: N-Queensbasis: ETC1Sfinancebench: Repo OpenMPvkresample: 2x - Singlesimdjson: LargeRandbasis: UASTC Level 2npb: FT.Conednn: IP Shapes 1D - f32 - CPUrav1e: 5webp2: Quality 100, Compression Effort 5simdjson: PartialTweetssimdjson: DistinctUserIDastcenc: Thoroughrav1e: 1npb: CG.Cqmcpack: simple-H2Olzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionrav1e: 6unpack-firefox: firefox-84.0.source.tar.xzdav1d: Summer Nature 4Konednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUamg: cryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: AES-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpoolcryptsetup: PBKDF2-sha512quantlib: compress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedlzbench: Crush 0 - Decompressionlzbench: Crush 0 - Compressionphpbench: PHP Benchmark Suitenpb: MG.Csynthmark: VoiceMark_100compress-zstd: 3askap: Hogbom Clean OpenMPetcpak: ETC2rav1e: 10encode-wavpack: WAV To WavPackcrafty: Elapsed Timedav1d: Chimera 1080ptnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1lzbench: Brotli 2 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Zstd 8 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 1 - Compressionencode-ape: WAV To APElulesh: etcpak: ETC1 + Ditheringaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingetcpak: ETC1encode-opus: WAV To Opus Encoderedis: LPUSHlzbench: Libdeflate 1 - Compressionredis: SETredis: LPOPredis: SADDredis: GETmafft: Multiple Sequence Alignment - LSU RNAonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUastcenc: Mediumonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUbasis: UASTC Level 0npb: EP.Cdav1d: Summer Nature 1080pastcenc: Fastonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUwebp2: Defaultetcpak: DXT1lammps: Rhodopsin Proteinonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPU11a23127362.992.07209.31501.9082191.151445.4031296808.351724.5171478.0551430687743870.70464.54387.71392.3120.51968561300.05329.0418947598231.162229.820228.9271752114312.588.1515451.70131.265525993.030613.3176.957145397261141561004.389324348.2124.0964.61642.4717.977243609.1158617.340230.81517.8828.6640.9243.6721.1124.25113.9722.292.449.985.776.645.777.8127.5517.9028.6940.9543.8121.1224.28114.0322.362.4510.005.766.675.787.7627.561176364995.5086004.066000.836006.348.291137.98522.989.90786.5877.991693508.553472.523469.6310.3971041.9687507.68747.99110.488.98445990.043.445983.544.532.905091155.971284.261.0892.50962.03125.82159.95149677.138021498.5320.3748.5937365.588.527851.09621.3460.60.6146.780.3862905.8138.534105381.44521.407101.124.64891122372167403.7400.9723.8737.62783.62784.8404.0400.8723.9736.33394.03387.666733115887512180.56003.65218.114501016551655680.84615.6381611.2104.640165.2423.15816.6937355877436.17372.039343.574650168562416174381160745212.7702738.3525280.7711161.00690.017299.4639.6061718046.712131980808.832827879.752261333.332703703.3312.5857.595456.234.4101118.35829.2281017.82427.627.1331.781733.31555.5161208.8925.0025.3193910.26872164.201446.2931285807.286723.5221478.4821429687742860.27464.94387.97392.4070.52169488303.97339.7418657804230.336229.308228.7551747781312.588.2415459.45131.309518992.413613.9066.927245397261141411005.932319747.9344.0794.58942.2077.901244321.4829327.341831.64119.3428.6840.9843.6821.0524.16113.8222.252.5110.155.846.835.867.8427.5418.6128.7040.9943.7121.1124.15113.7222.262.4710.065.826.866.048.0027.511166246695.5695896.895902.685899.127.669427.95353.087.79188.4277.886453402.403402.033382.7410.3571207.2656257.74667.97860.488.93585983.243.745980.344.842.625101146.111278.111.0902.51261.25426.32760.08049656.072917500.9110.3748.1897371.116.852061.10221.3670.60.6146.890.3852881.7738.049105381.44221.635100.754.55996122317033403.4400.5722.5735.52769.22767.8403.4400.5722.2735.03373.83377.066760615879502185.06043.15224.184501026542725669.09615.8021605.6103.950164.0783.18716.6897462464446.95368.304343.209650170564419174882160745112.8142738.0066279.6581159.32682.729298.8949.6021717450.172141981355.171729976.592247597.672565612.7512.6067.231236.233.9905918.37669.231983.13433.957.1231.034432.13705.4911203.1494.9944.617628.130492158.8012871480.066881.34464.97394.013303.67328.9718601604232.222230.795227.55512.588.2015474.02131.2355176.89251007.994244509.6366687.361130.961118847315898.045893.195884.767.814827.97473.089.59187.3647.936903387.303363.673382.387.74088.01940.488.99525980.143.765981.844.882.6572525.833500.6850.377369.786.645361.0920.60.620.3832901.1938.453105381.443101.584.570501223344672187.16034.75162.574501015661.401606.6165.2553.1547480933451.88651169562417173982161045012.7752739.3128280.583299.2159.61721412.5367.208524.1022818.09861032.05433.4431.026232.11675.4661208.6645.0114.669448.71496OpenBenchmarking.org

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF1121a235001000150020002500SE +/- 20.55, N = 3SE +/- 13.36, N = 3SE +/- 6.30, N = 32191.152164.202158.801. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF1121a23400800120016002000Min: 2154.34 / Avg: 2191.15 / Max: 2225.39Min: 2142.8 / Avg: 2164.2 / Max: 2188.76Min: 2149.35 / Avg: 2158.8 / Max: 2170.731. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression1a230060090012001500SE +/- 1.95, N = 3SE +/- 1.82, N = 31445.401446.291. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression1a230060090012001500Min: 1441.59 / Avg: 1445.4 / Max: 1448.06Min: 1444.18 / Avg: 1446.29 / Max: 1449.921. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

VkFFT

VkFFT is a Fast Fourier Transform (FFT) Library that is GPU accelerated by means of the Vulkan API. The VkFFT benchmark runs FFT performance differences of many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.111a2330060090012001500SE +/- 1.76, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 312731296128512871. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.111a232004006008001000Min: 1270 / Avg: 1273.33 / Max: 1276Min: 1295 / Avg: 1295.67 / Max: 1297Min: 1284 / Avg: 1285 / Max: 12861. (CXX) g++ options: -O3 -pthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1a22004006008001000SE +/- 0.28, N = 3SE +/- 0.31, N = 3808.35807.291. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1a2140280420560700Min: 807.85 / Avg: 808.35 / Max: 808.84Min: 806.81 / Avg: 807.29 / Max: 807.871. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 71a2160320480640800SE +/- 0.77, N = 3SE +/- 1.84, N = 3724.52723.521. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 71a2130260390520650Min: 723.61 / Avg: 724.52 / Max: 726.04Min: 720.08 / Avg: 723.52 / Max: 726.361. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data1a23300600900120015001478.061478.481480.07

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI Score1a23006009001200150014301429

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training Score1a2150300450600750687687

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference Score1a2160320480640800743742

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D1a232004006008001000SE +/- 10.93, N = 12SE +/- 8.64, N = 12SE +/- 13.60, N = 3870.70860.27881.341. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D1a23150300450600750Min: 788.51 / Avg: 870.7 / Max: 903.38Min: 818.02 / Avg: 860.27 / Max: 898.51Min: 854.13 / Avg: 881.34 / Max: 895.231. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M1a23100200300400500SE +/- 0.37, N = 3SE +/- 0.09, N = 3SE +/- 0.21, N = 3464.54464.94464.971. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M1a2380160240320400Min: 463.92 / Avg: 464.54 / Max: 465.21Min: 464.78 / Avg: 464.94 / Max: 465.08Min: 464.55 / Avg: 464.97 / Max: 465.231. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive1a280160240320400SE +/- 0.40, N = 3SE +/- 0.31, N = 3387.71387.971. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive1a270140210280350Min: 386.92 / Avg: 387.71 / Max: 388.15Min: 387.36 / Avg: 387.97 / Max: 388.351. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 71a2390180270360450SE +/- 1.12, N = 3SE +/- 1.91, N = 3SE +/- 1.58, N = 3392.31392.41394.011. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 71a2370140210280350Min: 390.91 / Avg: 392.31 / Max: 394.52Min: 389.16 / Avg: 392.41 / Max: 395.76Min: 392.03 / Avg: 394.01 / Max: 397.131. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1a20.11720.23440.35160.46880.586SE +/- 0.002, N = 3SE +/- 0.001, N = 30.5190.5211. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1a2246810Min: 0.52 / Avg: 0.52 / Max: 0.52Min: 0.52 / Avg: 0.52 / Max: 0.521. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1a215K30K45K60K75K68561694881. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics1a2370140210280350SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3300.05303.97303.671. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics1a2350100150200250Min: 299.99 / Avg: 300.05 / Max: 300.07Min: 303.78 / Avg: 303.97 / Max: 304.1Min: 303.56 / Avg: 303.67 / Max: 303.821. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark1a2370140210280350SE +/- 0.25, N = 3SE +/- 0.16, N = 3SE +/- 0.58, N = 3329.04339.74328.97
OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark1a2360120180240300Min: 328.53 / Avg: 329.04 / Max: 329.34Min: 339.58 / Avg: 339.74 / Max: 340.06Min: 327.81 / Avg: 328.97 / Max: 329.61

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1a234M8M12M16M20MSE +/- 239170.10, N = 3SE +/- 47854.09, N = 3SE +/- 209621.37, N = 3189475981865780418601604
OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1a233M6M9M12M15MMin: 18649100 / Avg: 18947597.67 / Max: 19420546Min: 18602790 / Avg: 18657803.67 / Max: 18753135Min: 18282664 / Avg: 18601604.33 / Max: 18996725

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1a2350100150200250SE +/- 0.90, N = 3SE +/- 1.08, N = 3SE +/- 1.00, N = 3231.16230.34232.22
OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1a234080120160200Min: 229.64 / Avg: 231.16 / Max: 232.75Min: 228.18 / Avg: 230.34 / Max: 231.58Min: 230.22 / Avg: 232.22 / Max: 233.27

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.91a2350100150200250SE +/- 0.49, N = 3SE +/- 0.19, N = 3SE +/- 0.52, N = 3229.82229.31230.801. (CC) gcc options: -O2 -fvisibility=hidden
OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.91a234080120160200Min: 228.92 / Avg: 229.82 / Max: 230.58Min: 228.92 / Avg: 229.31 / Max: 229.52Min: 230 / Avg: 230.8 / Max: 231.771. (CC) gcc options: -O2 -fvisibility=hidden

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile1a2350100150200250SE +/- 1.32, N = 3SE +/- 0.12, N = 3SE +/- 0.14, N = 3228.93228.76227.56
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile1a234080120160200Min: 226.82 / Avg: 228.93 / Max: 231.35Min: 228.52 / Avg: 228.76 / Max: 228.9Min: 227.34 / Avg: 227.56 / Max: 227.82

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.41a24M8M12M16M20MSE +/- 43251.00, N = 3SE +/- 22002.07, N = 317521143174778131. (CXX) g++ options: -O3 -fopenmp
OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.41a23M6M9M12M15MMin: 17435090 / Avg: 17521143.33 / Max: 17571790Min: 17442890 / Avg: 17477813.33 / Max: 175184601. (CXX) g++ options: -O3 -fopenmp

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 191a233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 312.512.512.51. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 191a2348121620Min: 12.5 / Avg: 12.5 / Max: 12.5Min: 12.5 / Avg: 12.5 / Max: 12.5Min: 12.5 / Avg: 12.5 / Max: 12.51. (CC) gcc options: -O3 -pthread -lz -llzma

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit1a2320406080100SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 388.1588.2488.20MIN: 57.28 / MAX: 196.21MIN: 57.38 / MAX: 196.19MIN: 57.19 / MAX: 197.381. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit1a2320406080100Min: 88.11 / Avg: 88.15 / Max: 88.23Min: 88.09 / Avg: 88.24 / Max: 88.32Min: 87.99 / Avg: 88.2 / Max: 88.341. (CC) gcc options: -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C1a233K6K9K12K15KSE +/- 6.60, N = 3SE +/- 11.20, N = 3SE +/- 16.88, N = 315451.7015459.4515474.021. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C1a233K6K9K12K15KMin: 15439.37 / Avg: 15451.7 / Max: 15461.94Min: 15448 / Avg: 15459.45 / Max: 15481.86Min: 15451.1 / Avg: 15474.02 / Max: 15506.951. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search1a23306090120150SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3131.27131.31131.241. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search1a2320406080100Min: 131.19 / Avg: 131.27 / Max: 131.41Min: 131.27 / Avg: 131.31 / Max: 131.35Min: 131.22 / Avg: 131.24 / Max: 131.261. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

VKMark

VKMark is a collection of Vulkan tests/benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801a23110220330440550SE +/- 2.40, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 35255185171. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF
OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801a2390180270360450Min: 522 / Avg: 525.33 / Max: 530Min: 517 / Avg: 518 / Max: 519Min: 516 / Avg: 517 / Max: 5181. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding1a22004006008001000SE +/- 0.27, N = 3SE +/- 0.15, N = 3993.03992.411. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding1a22004006008001000Min: 992.57 / Avg: 993.03 / Max: 993.49Min: 992.1 / Avg: 992.41 / Max: 992.571. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding1a2130260390520650SE +/- 0.23, N = 3SE +/- 0.37, N = 3613.32613.911. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding1a2110220330440550Min: 612.88 / Avg: 613.32 / Max: 613.67Min: 613.49 / Avg: 613.91 / Max: 614.641. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1a23246810SE +/- 0.0934, N = 4SE +/- 0.1026, N = 4SE +/- 0.0890, N = 56.95716.92726.8925MIN: 6.68 / MAX: 8.51MIN: 6.63 / MAX: 8.55MIN: 6.62 / MAX: 8.55
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1a233691215Min: 6.84 / Avg: 6.96 / Max: 7.24Min: 6.78 / Avg: 6.93 / Max: 7.23Min: 6.78 / Avg: 6.89 / Max: 7.25

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU1a21020304050SE +/- 0.00, N = 3SE +/- 0.17, N = 345451. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU1a2918273645Min: 44.5 / Avg: 44.5 / Max: 44.5Min: 44.5 / Avg: 44.67 / Max: 451. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU1a290180270360450SE +/- 1.26, N = 3SE +/- 1.15, N = 33973971. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU1a270140210280350Min: 395.5 / Avg: 397 / Max: 399.5Min: 395 / Avg: 397 / Max: 3991. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU1a260120180240300SE +/- 0.33, N = 3SE +/- 0.17, N = 32612611. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU1a250100150200250Min: 261 / Avg: 261.33 / Max: 262Min: 260.5 / Avg: 260.67 / Max: 2611. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU1a23K6K9K12K15KSE +/- 22.47, N = 3SE +/- 34.58, N = 314156141411. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU1a22K4K6K8K10KMin: 14115 / Avg: 14155.83 / Max: 14192.5Min: 14071.5 / Avg: 14140.5 / Max: 141791. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Double1a232004006008001000SE +/- 3.58, N = 3SE +/- 3.72, N = 3SE +/- 4.35, N = 31004.391005.931007.991. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Double1a232004006008001000Min: 1000.32 / Avg: 1004.39 / Max: 1011.53Min: 1001.07 / Avg: 1005.93 / Max: 1013.24Min: 1000.36 / Avg: 1007.99 / Max: 1015.431. (CXX) g++ options: -O3 -pthread

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1a27001400210028003500SE +/- 6.71, N = 3SE +/- 34.43, N = 3324331971. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1a26001200180024003000Min: 3234 / Avg: 3242.83 / Max: 3256Min: 3129.5 / Avg: 3197.33 / Max: 3241.51. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31a21122334455SE +/- 1.11, N = 3SE +/- 1.29, N = 348.2147.93MIN: 40.03 / MAX: 87.29MIN: 39.27 / MAX: 62.111. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31a21020304050Min: 46.02 / Avg: 48.21 / Max: 49.66Min: 45.35 / Avg: 47.93 / Max: 49.291. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.01a20.92161.84322.76483.68644.608SE +/- 0.008, N = 3SE +/- 0.012, N = 34.0964.079MIN: 3.86 / MAX: 17.27MIN: 3.87 / MAX: 16.371. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.01a2246810Min: 4.08 / Avg: 4.1 / Max: 4.11Min: 4.06 / Avg: 4.08 / Max: 4.11. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241a21.03862.07723.11584.15445.193SE +/- 0.014, N = 3SE +/- 0.008, N = 34.6164.589MIN: 4 / MAX: 5.38MIN: 4 / MAX: 5.391. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241a2246810Min: 4.59 / Avg: 4.62 / Max: 4.64Min: 4.58 / Avg: 4.59 / Max: 4.611. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-501a21020304050SE +/- 0.06, N = 3SE +/- 0.03, N = 342.4742.21MIN: 41.07 / MAX: 45.74MIN: 40.94 / MAX: 44.991. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-501a2918273645Min: 42.36 / Avg: 42.47 / Max: 42.58Min: 42.15 / Avg: 42.21 / Max: 42.241. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.01a2246810SE +/- 0.010, N = 3SE +/- 0.038, N = 37.9777.901MIN: 7.4 / MAX: 21.28MIN: 7.36 / MAX: 20.811. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.01a23691215Min: 7.96 / Avg: 7.98 / Max: 8Min: 7.86 / Avg: 7.9 / Max: 7.981. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second1a2350K100K150K200K250KSE +/- 1679.06, N = 14SE +/- 2311.32, N = 10SE +/- 2406.51, N = 9243609.12244321.48244509.641. (CC) gcc options: -O2 -lrt" -lrt
OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second1a2340K80K120K160K200KMin: 240015 / Avg: 243609.12 / Max: 265164.07Min: 240168.12 / Avg: 244321.48 / Max: 264834.89Min: 239189.74 / Avg: 244509.64 / Max: 263049.731. (CC) gcc options: -O2 -lrt" -lrt

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj1a23246810SE +/- 0.0159, N = 3SE +/- 0.0153, N = 3SE +/- 0.0029, N = 37.34027.34187.3611MIN: 7.12 / MAX: 8.12MIN: 7.13 / MAX: 8.13MIN: 7.12 / MAX: 8.18
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj1a233691215Min: 7.32 / Avg: 7.34 / Max: 7.37Min: 7.32 / Avg: 7.34 / Max: 7.37Min: 7.36 / Avg: 7.36 / Max: 7.37

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1a23714212835SE +/- 0.44, N = 4SE +/- 0.22, N = 20SE +/- 0.19, N = 430.8231.6430.961. (CC) gcc options: -O2 -std=c99
OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1a23714212835Min: 29.81 / Avg: 30.82 / Max: 31.95Min: 31.1 / Avg: 31.64 / Max: 34.74Min: 30.62 / Avg: 30.96 / Max: 31.51. (CC) gcc options: -O2 -std=c99

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m1a2510152025SE +/- 0.18, N = 3SE +/- 0.77, N = 317.8819.34MIN: 17.22 / MAX: 18.84MIN: 17.15 / MAX: 29.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m1a2510152025Min: 17.54 / Avg: 17.88 / Max: 18.14Min: 18.53 / Avg: 19.34 / Max: 20.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1a2714212835SE +/- 0.01, N = 3SE +/- 0.02, N = 328.6628.68MIN: 28.25 / MAX: 29.49MIN: 28.3 / MAX: 29.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1a2612182430Min: 28.65 / Avg: 28.66 / Max: 28.69Min: 28.65 / Avg: 28.68 / Max: 28.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1a2918273645SE +/- 0.02, N = 3SE +/- 0.05, N = 340.9240.98MIN: 40.49 / MAX: 49.83MIN: 40.54 / MAX: 50.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1a2918273645Min: 40.89 / Avg: 40.92 / Max: 40.94Min: 40.9 / Avg: 40.98 / Max: 41.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501a21020304050SE +/- 0.02, N = 3SE +/- 0.04, N = 343.6743.68MIN: 41.66 / MAX: 47.83MIN: 41.61 / MAX: 45.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501a2918273645Min: 43.65 / Avg: 43.67 / Max: 43.71Min: 43.61 / Avg: 43.68 / Max: 43.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1a2510152025SE +/- 0.02, N = 3SE +/- 0.06, N = 321.1121.05MIN: 20.83 / MAX: 21.88MIN: 20.87 / MAX: 21.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1a2510152025Min: 21.08 / Avg: 21.11 / Max: 21.14Min: 20.93 / Avg: 21.05 / Max: 21.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet181a2612182430SE +/- 0.01, N = 3SE +/- 0.09, N = 324.2524.16MIN: 23.71 / MAX: 26.65MIN: 23.71 / MAX: 25.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet181a2612182430Min: 24.23 / Avg: 24.25 / Max: 24.27Min: 23.98 / Avg: 24.16 / Max: 24.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg161a2306090120150SE +/- 0.05, N = 3SE +/- 0.04, N = 3113.97113.82MIN: 113.51 / MAX: 123.78MIN: 113.45 / MAX: 122.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg161a220406080100Min: 113.89 / Avg: 113.97 / Max: 114.07Min: 113.77 / Avg: 113.82 / Max: 113.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet1a2510152025SE +/- 0.27, N = 3SE +/- 0.35, N = 322.2922.25MIN: 21.06 / MAX: 23.6MIN: 21.23 / MAX: 23.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet1a2510152025Min: 21.76 / Avg: 22.29 / Max: 22.6Min: 21.55 / Avg: 22.25 / Max: 22.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1a20.56481.12961.69442.25922.824SE +/- 0.06, N = 3SE +/- 0.01, N = 32.442.51MIN: 2.28 / MAX: 2.98MIN: 2.35 / MAX: 3.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1a2246810Min: 2.32 / Avg: 2.44 / Max: 2.51Min: 2.49 / Avg: 2.51 / Max: 2.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01a23691215SE +/- 0.14, N = 3SE +/- 0.02, N = 39.9810.15MIN: 9.65 / MAX: 19.45MIN: 9.86 / MAX: 10.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01a23691215Min: 9.7 / Avg: 9.98 / Max: 10.15Min: 10.12 / Avg: 10.15 / Max: 10.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1a21.3142.6283.9425.2566.57SE +/- 0.10, N = 3SE +/- 0.10, N = 35.775.84MIN: 5.35 / MAX: 7.36MIN: 5.28 / MAX: 7.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1a2246810Min: 5.62 / Avg: 5.77 / Max: 5.96Min: 5.74 / Avg: 5.84 / Max: 6.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21a2246810SE +/- 0.09, N = 3SE +/- 0.11, N = 36.646.83MIN: 6.5 / MAX: 8.16MIN: 6.65 / MAX: 7.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21a23691215Min: 6.55 / Avg: 6.64 / Max: 6.82Min: 6.7 / Avg: 6.83 / Max: 7.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31a21.31852.6373.95555.2746.5925SE +/- 0.03, N = 3SE +/- 0.04, N = 35.775.86MIN: 5.57 / MAX: 7.01MIN: 5.62 / MAX: 7.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31a2246810Min: 5.72 / Avg: 5.77 / Max: 5.81Min: 5.82 / Avg: 5.86 / Max: 5.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21a2246810SE +/- 0.02, N = 3SE +/- 0.02, N = 37.817.84MIN: 7.6 / MAX: 9.81MIN: 7.59 / MAX: 9.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21a23691215Min: 7.77 / Avg: 7.81 / Max: 7.85Min: 7.8 / Avg: 7.84 / Max: 7.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet1a2612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 327.5527.54MIN: 27.17 / MAX: 28.55MIN: 27.2 / MAX: 30.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet1a2612182430Min: 27.49 / Avg: 27.55 / Max: 27.59Min: 27.51 / Avg: 27.54 / Max: 27.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m1a2510152025SE +/- 0.22, N = 3SE +/- 0.08, N = 317.9018.61MIN: 17.15 / MAX: 18.95MIN: 17.25 / MAX: 201. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m1a2510152025Min: 17.47 / Avg: 17.9 / Max: 18.19Min: 18.45 / Avg: 18.61 / Max: 18.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd1a2714212835SE +/- 0.02, N = 3SE +/- 0.01, N = 328.6928.70MIN: 28.24 / MAX: 29.73MIN: 28.3 / MAX: 29.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd1a2612182430Min: 28.66 / Avg: 28.69 / Max: 28.73Min: 28.69 / Avg: 28.7 / Max: 28.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1a2918273645SE +/- 0.02, N = 3SE +/- 0.04, N = 340.9540.99MIN: 40.51 / MAX: 41.79MIN: 40.54 / MAX: 49.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1a2918273645Min: 40.91 / Avg: 40.95 / Max: 40.99Min: 40.92 / Avg: 40.99 / Max: 41.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501a21020304050SE +/- 0.07, N = 3SE +/- 0.05, N = 343.8143.71MIN: 41.64 / MAX: 53.86MIN: 41.74 / MAX: 46.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501a2918273645Min: 43.66 / Avg: 43.81 / Max: 43.89Min: 43.62 / Avg: 43.71 / Max: 43.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet1a2510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 321.1221.11MIN: 20.84 / MAX: 22.35MIN: 20.86 / MAX: 21.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet1a2510152025Min: 21.09 / Avg: 21.12 / Max: 21.14Min: 21.1 / Avg: 21.11 / Max: 21.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet181a2612182430SE +/- 0.02, N = 3SE +/- 0.07, N = 324.2824.15MIN: 23.77 / MAX: 32.56MIN: 23.68 / MAX: 25.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet181a2612182430Min: 24.24 / Avg: 24.28 / Max: 24.3Min: 24.02 / Avg: 24.15 / Max: 24.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg161a2306090120150SE +/- 0.12, N = 3SE +/- 0.06, N = 3114.03113.72MIN: 113.58 / MAX: 123.46MIN: 113.39 / MAX: 122.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg161a220406080100Min: 113.88 / Avg: 114.03 / Max: 114.27Min: 113.6 / Avg: 113.72 / Max: 113.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet1a2510152025SE +/- 0.23, N = 3SE +/- 0.36, N = 322.3622.26MIN: 21.06 / MAX: 23.65MIN: 21.15 / MAX: 23.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet1a2510152025Min: 21.9 / Avg: 22.36 / Max: 22.59Min: 21.54 / Avg: 22.26 / Max: 22.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1a20.55581.11161.66742.22322.779SE +/- 0.06, N = 3SE +/- 0.05, N = 32.452.47MIN: 2.29 / MAX: 2.85MIN: 2.31 / MAX: 2.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1a2246810Min: 2.32 / Avg: 2.45 / Max: 2.51Min: 2.37 / Avg: 2.47 / Max: 2.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b01a23691215SE +/- 0.13, N = 3SE +/- 0.14, N = 310.0010.06MIN: 9.67 / MAX: 11.41MIN: 9.74 / MAX: 19.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b01a23691215Min: 9.74 / Avg: 10 / Max: 10.15Min: 9.79 / Avg: 10.06 / Max: 10.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet1a21.30952.6193.92855.2386.5475SE +/- 0.11, N = 3SE +/- 0.12, N = 35.765.82MIN: 5.31 / MAX: 7.49MIN: 5.3 / MAX: 7.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet1a2246810Min: 5.63 / Avg: 5.76 / Max: 5.98Min: 5.66 / Avg: 5.82 / Max: 6.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21a2246810SE +/- 0.08, N = 3SE +/- 0.18, N = 36.676.86MIN: 6.51 / MAX: 8.11MIN: 6.5 / MAX: 8.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21a23691215Min: 6.56 / Avg: 6.67 / Max: 6.83Min: 6.67 / Avg: 6.86 / Max: 7.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31a2246810SE +/- 0.03, N = 3SE +/- 0.19, N = 35.786.04MIN: 5.59 / MAX: 6.95MIN: 5.6 / MAX: 7.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31a2246810Min: 5.72 / Avg: 5.78 / Max: 5.83Min: 5.85 / Avg: 6.04 / Max: 6.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21a2246810SE +/- 0.03, N = 3SE +/- 0.14, N = 37.768.00MIN: 7.45 / MAX: 9.82MIN: 7.65 / MAX: 9.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21a23691215Min: 7.71 / Avg: 7.76 / Max: 7.79Min: 7.85 / Avg: 8 / Max: 8.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet1a2612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 327.5627.51MIN: 27.18 / MAX: 28.23MIN: 27.14 / MAX: 29.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet1a2612182430Min: 27.53 / Avg: 27.56 / Max: 27.58Min: 27.47 / Avg: 27.51 / Max: 27.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1a233M6M9M12M15MSE +/- 37547.46, N = 3SE +/- 47603.72, N = 3SE +/- 160953.33, N = 41176364911662466118847311. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1a232M4M6M8M10MMin: 11717851 / Avg: 11763649.33 / Max: 11838088Min: 11574596 / Avg: 11662466 / Max: 11738142Min: 11565800 / Avg: 11884730.5 / Max: 122660531. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 31a220406080100SE +/- 0.31, N = 3SE +/- 0.30, N = 395.5195.571. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 31a220406080100Min: 94.88 / Avg: 95.51 / Max: 95.84Min: 94.96 / Avg: 95.57 / Max: 95.891. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1a2313002600390052006500SE +/- 18.98, N = 3SE +/- 6.73, N = 3SE +/- 12.51, N = 36004.065896.895898.04MIN: 5933.15MIN: 5837.81MIN: 5828.751. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1a2310002000300040005000Min: 5976.33 / Avg: 6004.06 / Max: 6040.37Min: 5886.6 / Avg: 5896.89 / Max: 5909.55Min: 5873.46 / Avg: 5898.04 / Max: 5914.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1a2313002600390052006500SE +/- 19.70, N = 3SE +/- 13.63, N = 3SE +/- 4.88, N = 36000.835902.685893.19MIN: 5915.81MIN: 5841.04MIN: 5838.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1a2310002000300040005000Min: 5961.57 / Avg: 6000.83 / Max: 6023.24Min: 5885.61 / Avg: 5902.68 / Max: 5929.63Min: 5884.41 / Avg: 5893.19 / Max: 5901.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1a2313002600390052006500SE +/- 16.93, N = 3SE +/- 3.24, N = 3SE +/- 6.65, N = 36006.345899.125884.76MIN: 5936.3MIN: 5840.53MIN: 5829.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1a2310002000300040005000Min: 5981.86 / Avg: 6006.34 / Max: 6038.83Min: 5894.5 / Avg: 5899.12 / Max: 5905.37Min: 5872.33 / Avg: 5884.76 / Max: 5895.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1a23246810SE +/- 0.06721, N = 15SE +/- 0.13886, N = 12SE +/- 0.15309, N = 128.291137.669427.81482MIN: 7.1MIN: 5.68MIN: 5.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1a233691215Min: 7.45 / Avg: 8.29 / Max: 8.56Min: 6.16 / Avg: 7.67 / Max: 7.97Min: 6.2 / Avg: 7.81 / Max: 8.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj1a23246810SE +/- 0.0235, N = 3SE +/- 0.0122, N = 3SE +/- 0.0138, N = 37.98527.95357.9747MIN: 7.77 / MAX: 8.68MIN: 7.72 / MAX: 8.73MIN: 7.77 / MAX: 8.72
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj1a233691215Min: 7.95 / Avg: 7.99 / Max: 8.03Min: 7.93 / Avg: 7.95 / Max: 7.97Min: 7.95 / Avg: 7.97 / Max: 8

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1a230.6751.352.0252.73.375SE +/- 0.03, N = 3SE +/- 0.03, N = 9SE +/- 0.03, N = 32.93.03.01. (CC) gcc options: -fopenmp -O3 -lm
OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1a23246810Min: 2.9 / Avg: 2.93 / Max: 3Min: 2.8 / Avg: 2.96 / Max: 3.1Min: 3 / Avg: 3.03 / Max: 3.11. (CC) gcc options: -fopenmp -O3 -lm

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1a2320406080100SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 389.9187.7989.59
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1a2320406080100Min: 89.89 / Avg: 89.91 / Max: 89.94Min: 87.75 / Avg: 87.79 / Max: 87.84Min: 89.44 / Avg: 89.59 / Max: 89.8

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1a2320406080100SE +/- 0.42, N = 3SE +/- 1.19, N = 3SE +/- 0.24, N = 386.5988.4387.36
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1a2320406080100Min: 86.05 / Avg: 86.59 / Max: 87.4Min: 87.2 / Avg: 88.43 / Max: 90.81Min: 86.93 / Avg: 87.36 / Max: 87.76

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1a23246810SE +/- 0.12949, N = 12SE +/- 0.12374, N = 12SE +/- 0.12812, N = 127.991697.886457.93690MIN: 6.38MIN: 6.33MIN: 6.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1a233691215Min: 6.6 / Avg: 7.99 / Max: 8.26Min: 6.55 / Avg: 7.89 / Max: 8.17Min: 6.55 / Avg: 7.94 / Max: 8.241. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Warsow

This is a benchmark of Warsow, a popular open-source first-person shooter. This game uses the QFusion engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 108011428425670SE +/- 0.37, N = 362.9

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1a238001600240032004000SE +/- 12.64, N = 3SE +/- 1.31, N = 3SE +/- 6.49, N = 33508.553402.403387.30MIN: 3442.98MIN: 3364.88MIN: 3338.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1a236001200180024003000Min: 3483.3 / Avg: 3508.55 / Max: 3522.33Min: 3400.18 / Avg: 3402.4 / Max: 3404.7Min: 3375.62 / Avg: 3387.3 / Max: 3398.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1a237001400210028003500SE +/- 42.16, N = 3SE +/- 7.92, N = 3SE +/- 1.67, N = 33472.523402.033363.67MIN: 3371.23MIN: 3340.17MIN: 3322.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1a236001200180024003000Min: 3401.87 / Avg: 3472.52 / Max: 3547.69Min: 3391.2 / Avg: 3402.03 / Max: 3417.45Min: 3360.84 / Avg: 3363.67 / Max: 3366.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1a237001400210028003500SE +/- 27.44, N = 3SE +/- 13.00, N = 3SE +/- 14.91, N = 33469.633382.743382.38MIN: 3389.1MIN: 3319.45MIN: 3321.671. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1a236001200180024003000Min: 3427.09 / Avg: 3469.63 / Max: 3520.93Min: 3358.02 / Avg: 3382.74 / Max: 3402.1Min: 3358.86 / Avg: 3382.38 / Max: 3410.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Node.js V8 Web Tooling Benchmark

Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1a23691215SE +/- 0.08, N = 3SE +/- 0.11, N = 310.3910.351. Nodejs v12.18.2
OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1a23691215Min: 10.29 / Avg: 10.39 / Max: 10.55Min: 10.17 / Avg: 10.35 / Max: 10.551. Nodejs v12.18.2

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP1a215K30K45K60K75KSE +/- 217.85, N = 3SE +/- 331.16, N = 371041.9771207.271. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP1a212K24K36K48K60KMin: 70607.33 / Avg: 71041.97 / Max: 71285.56Min: 70564.23 / Avg: 71207.27 / Max: 71666.21. (CXX) g++ options: -O3 -march=native -fopenmp

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown1a23246810SE +/- 0.1307, N = 3SE +/- 0.0480, N = 3SE +/- 0.0611, N = 37.68747.74667.7408MIN: 7.27 / MAX: 9.52MIN: 7.5 / MAX: 9.39MIN: 7.48 / MAX: 9.45
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown1a233691215Min: 7.45 / Avg: 7.69 / Max: 7.9Min: 7.69 / Avg: 7.75 / Max: 7.84Min: 7.67 / Avg: 7.74 / Max: 7.86

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon1a23246810SE +/- 0.0156, N = 3SE +/- 0.0266, N = 3SE +/- 0.0068, N = 37.99117.97868.0194MIN: 7.78 / MAX: 9.04MIN: 7.81 / MAX: 8.99MIN: 7.79 / MAX: 9.01
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon1a233691215Min: 7.96 / Avg: 7.99 / Max: 8.01Min: 7.94 / Avg: 7.98 / Max: 8.03Min: 8.01 / Avg: 8.02 / Max: 8.03

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1a230.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1a2312345Min: 0.48 / Avg: 0.48 / Max: 0.49Min: 0.48 / Avg: 0.48 / Max: 0.49Min: 0.48 / Avg: 0.48 / Max: 0.491. (CXX) g++ options: -O3 -pthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon1a233691215SE +/- 0.0549, N = 3SE +/- 0.0413, N = 3SE +/- 0.0858, N = 38.98448.93588.9952MIN: 8.77 / MAX: 9.8MIN: 8.75 / MAX: 9.83MIN: 8.75 / MAX: 9.95
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon1a233691215Min: 8.91 / Avg: 8.98 / Max: 9.09Min: 8.88 / Avg: 8.94 / Max: 9.02Min: 8.88 / Avg: 9 / Max: 9.16

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1a2313002600390052006500SE +/- 3.49, N = 3SE +/- 0.86, N = 3SE +/- 1.13, N = 35990.05983.25980.11. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1a2310002000300040005000Min: 5984.9 / Avg: 5990.03 / Max: 5996.7Min: 5981.5 / Avg: 5983.17 / Max: 5984.4Min: 5978.2 / Avg: 5980.13 / Max: 5982.11. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1a231020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 343.4443.7443.761. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1a23918273645Min: 43.43 / Avg: 43.44 / Max: 43.45Min: 43.73 / Avg: 43.74 / Max: 43.75Min: 43.75 / Avg: 43.76 / Max: 43.771. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1a2313002600390052006500SE +/- 1.02, N = 3SE +/- 1.62, N = 3SE +/- 0.40, N = 35983.55980.35981.81. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1a2310002000300040005000Min: 5982.1 / Avg: 5983.53 / Max: 5985.5Min: 5977.5 / Avg: 5980.33 / Max: 5983.1Min: 5981.1 / Avg: 5981.8 / Max: 5982.51. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1a231020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 344.5344.8444.881. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1a23918273645Min: 44.51 / Avg: 44.53 / Max: 44.54Min: 44.83 / Avg: 44.84 / Max: 44.85Min: 44.87 / Avg: 44.88 / Max: 44.881. (CC) gcc options: -O3

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1a230.65361.30721.96082.61443.268SE +/- 0.03413, N = 14SE +/- 0.04605, N = 12SE +/- 0.04568, N = 122.905092.625102.65725MIN: 2.26MIN: 2.08MIN: 2.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1a23246810Min: 2.54 / Avg: 2.91 / Max: 3.09Min: 2.12 / Avg: 2.63 / Max: 2.69Min: 2.16 / Avg: 2.66 / Max: 2.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding1a22004006008001000SE +/- 5.07, N = 3SE +/- 12.47, N = 31155.971146.111. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding1a22004006008001000Min: 1145.83 / Avg: 1155.97 / Max: 1161.04Min: 1121.35 / Avg: 1146.11 / Max: 1161.041. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding1a230060090012001500SE +/- 8.33, N = 3SE +/- 11.46, N = 31284.261278.111. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding1a22004006008001000Min: 1267.61 / Avg: 1284.26 / Max: 1292.59Min: 1255.48 / Avg: 1278.11 / Max: 1292.591. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1a20.24530.49060.73590.98121.2265SE +/- 0.000, N = 3SE +/- 0.001, N = 31.0891.090
OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1a2246810Min: 1.09 / Avg: 1.09 / Max: 1.09Min: 1.09 / Avg: 1.09 / Max: 1.09

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1a20.56521.13041.69562.26082.826SE +/- 0.004, N = 3SE +/- 0.002, N = 32.5092.512
OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1a2246810Min: 2.5 / Avg: 2.51 / Max: 2.52Min: 2.51 / Avg: 2.51 / Max: 2.52

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001a21428425670SE +/- 0.23, N = 3SE +/- 0.20, N = 362.0361.251. (CC) gcc options: -O2 -ldl -lz -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001a21224364860Min: 61.6 / Avg: 62.03 / Max: 62.39Min: 60.97 / Avg: 61.25 / Max: 61.641. (CC) gcc options: -O2 -ldl -lz -lpthread

DDraceNetwork

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time1510152025Min: 8.52 / Avg: 10.85 / Max: 18.851. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2120406080100SE +/- 0.23, N = 392.07MIN: 29.47 / MAX: 1231. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time1510152025Min: 3.69 / Avg: 4.77 / Max: 18.371. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap150100150200250SE +/- 0.86, N = 3209.31MIN: 54.45 / MAX: 280.91. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-Queens1a23612182430SE +/- 0.03, N = 3SE +/- 0.25, N = 15SE +/- 0.09, N = 325.8226.3325.83
OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-Queens1a23612182430Min: 25.76 / Avg: 25.82 / Max: 25.86Min: 25.72 / Avg: 26.33 / Max: 29.11Min: 25.74 / Avg: 25.83 / Max: 26.02

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1a21326395265SE +/- 0.33, N = 3SE +/- 0.31, N = 359.9560.081. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1a21224364860Min: 59.28 / Avg: 59.95 / Max: 60.3Min: 59.45 / Avg: 60.08 / Max: 60.421. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP1a211K22K33K44K55KSE +/- 562.64, N = 3SE +/- 734.76, N = 349677.1449656.071. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP1a29K18K27K36K45KMin: 48557.77 / Avg: 49677.14 / Max: 50336.59Min: 48225.09 / Avg: 49656.07 / Max: 50661.081. (CXX) g++ options: -O3 -march=native -fopenmp

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single11a23110220330440550SE +/- 0.26, N = 3SE +/- 0.33, N = 3SE +/- 0.18, N = 3SE +/- 0.25, N = 3501.91498.53500.91500.691. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single11a2390180270360450Min: 501.44 / Avg: 501.91 / Max: 502.35Min: 497.9 / Avg: 498.53 / Max: 499.03Min: 500.61 / Avg: 500.91 / Max: 501.24Min: 500.3 / Avg: 500.69 / Max: 501.161. (CXX) g++ options: -O3 -pthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1a230.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1a2312345Min: 0.37 / Avg: 0.37 / Max: 0.37Min: 0.37 / Avg: 0.37 / Max: 0.37Min: 0.37 / Avg: 0.37 / Max: 0.371. (CXX) g++ options: -O3 -pthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 21a21122334455SE +/- 0.69, N = 4SE +/- 0.77, N = 348.5948.191. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 21a21020304050Min: 46.59 / Avg: 48.59 / Max: 49.7Min: 46.64 / Avg: 48.19 / Max: 48.971. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C1a2316003200480064008000SE +/- 10.76, N = 3SE +/- 9.46, N = 3SE +/- 14.95, N = 37365.587371.117369.781. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C1a2313002600390052006500Min: 7350.88 / Avg: 7365.58 / Max: 7386.54Min: 7353.97 / Avg: 7371.11 / Max: 7386.63Min: 7353.25 / Avg: 7369.78 / Max: 7399.621. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1a23246810SE +/- 0.11268, N = 3SE +/- 0.07900, N = 15SE +/- 0.04844, N = 158.527856.852066.64536MIN: 8.22MIN: 6.11MIN: 5.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1a233691215Min: 8.34 / Avg: 8.53 / Max: 8.73Min: 6.56 / Avg: 6.85 / Max: 7.41Min: 6.06 / Avg: 6.65 / Max: 6.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 51a230.2480.4960.7440.9921.24SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 31.0961.1021.092
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 51a23246810Min: 1.09 / Avg: 1.1 / Max: 1.1Min: 1.1 / Avg: 1.1 / Max: 1.11Min: 1.09 / Avg: 1.09 / Max: 1.09

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 51a2510152025SE +/- 0.21, N = 8SE +/- 0.23, N = 721.3521.371. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 51a2510152025Min: 19.86 / Avg: 21.35 / Max: 21.6Min: 19.98 / Avg: 21.37 / Max: 21.651. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1a230.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1a23246810Min: 0.6 / Avg: 0.6 / Max: 0.6Min: 0.6 / Avg: 0.6 / Max: 0.6Min: 0.6 / Avg: 0.6 / Max: 0.61. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1a230.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.610.610.621. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1a23246810Min: 0.61 / Avg: 0.61 / Max: 0.62Min: 0.61 / Avg: 0.61 / Max: 0.61Min: 0.61 / Avg: 0.62 / Max: 0.621. (CXX) g++ options: -O3 -pthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1a21122334455SE +/- 0.50, N = 3SE +/- 0.49, N = 346.7846.891. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1a21020304050Min: 45.79 / Avg: 46.78 / Max: 47.32Min: 45.92 / Avg: 46.89 / Max: 47.381. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 11a230.08690.17380.26070.34760.4345SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3860.3850.383
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 11a2312345Min: 0.39 / Avg: 0.39 / Max: 0.39Min: 0.38 / Avg: 0.39 / Max: 0.39Min: 0.38 / Avg: 0.38 / Max: 0.38

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C1a236001200180024003000SE +/- 1.91, N = 3SE +/- 0.89, N = 3SE +/- 2.44, N = 32905.812881.772901.191. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C1a235001000150020002500Min: 2903.78 / Avg: 2905.81 / Max: 2909.62Min: 2880.77 / Avg: 2881.77 / Max: 2883.55Min: 2896.34 / Avg: 2901.19 / Max: 2904.081. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1a23918273645SE +/- 0.55, N = 4SE +/- 0.63, N = 3SE +/- 0.55, N = 438.5338.0538.451. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1a23816243240Min: 36.89 / Avg: 38.53 / Max: 39.21Min: 36.81 / Avg: 38.05 / Max: 38.84Min: 36.83 / Avg: 38.45 / Max: 39.211. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression1a2320406080100SE +/- 0.33, N = 31051051051. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression1a2320406080100Min: 104 / Avg: 104.67 / Max: 1051. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression1a239182736453838381. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61a230.32510.65020.97531.30041.6255SE +/- 0.002, N = 3SE +/- 0.005, N = 3SE +/- 0.001, N = 31.4451.4421.443
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61a23246810Min: 1.44 / Avg: 1.45 / Max: 1.45Min: 1.43 / Avg: 1.44 / Max: 1.45Min: 1.44 / Avg: 1.44 / Max: 1.45

Unpacking Firefox

This simple test profile measures how long it takes to extract the .tar.xz source package of the Mozilla Firefox Web Browser. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz1a2510152025SE +/- 0.24, N = 7SE +/- 0.32, N = 421.4121.64
OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz1a2510152025Min: 20.66 / Avg: 21.41 / Max: 22.31Min: 20.77 / Avg: 21.63 / Max: 22.15

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K1a2320406080100SE +/- 0.28, N = 3SE +/- 0.37, N = 3SE +/- 0.36, N = 3101.12100.75101.58MIN: 91.52 / MAX: 106.08MIN: 83.92 / MAX: 106.22MIN: 87.94 / MAX: 106.991. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K1a2320406080100Min: 100.84 / Avg: 101.12 / Max: 101.67Min: 100.03 / Avg: 100.75 / Max: 101.23Min: 100.86 / Avg: 101.58 / Max: 101.961. (CC) gcc options: -pthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1a231.0462.0923.1384.1845.23SE +/- 0.04669, N = 8SE +/- 0.04208, N = 10SE +/- 0.04478, N = 94.648914.559964.57050MIN: 3.94MIN: 3.76MIN: 3.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1a23246810Min: 4.32 / Avg: 4.65 / Max: 4.71Min: 4.18 / Avg: 4.56 / Max: 4.63Min: 4.21 / Avg: 4.57 / Max: 4.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.21a2330M60M90M120M150MSE +/- 10051.26, N = 3SE +/- 7169.92, N = 3SE +/- 8824.65, N = 31223721671223170331223344671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.21a2320M40M60M80M100MMin: 122353000 / Avg: 122372166.67 / Max: 122387000Min: 122302900 / Avg: 122317033.33 / Max: 122326200Min: 122325000 / Avg: 122334466.67 / Max: 1223521001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption1a290180270360450SE +/- 0.15, N = 3SE +/- 0.40, N = 2403.7403.4
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption1a270140210280350Min: 403.4 / Avg: 403.7 / Max: 403.9Min: 403 / Avg: 403.4 / Max: 403.8

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption1a290180270360450SE +/- 0.12, N = 3SE +/- 0.31, N = 3400.9400.5
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption1a270140210280350Min: 400.7 / Avg: 400.93 / Max: 401.1Min: 400.1 / Avg: 400.5 / Max: 401.1

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption1a2160320480640800SE +/- 0.50, N = 3SE +/- 1.30, N = 2723.8722.5
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption1a2130260390520650Min: 722.8 / Avg: 723.8 / Max: 724.4Min: 721.2 / Avg: 722.5 / Max: 723.8

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption1a2160320480640800SE +/- 0.83, N = 3737.6735.5
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption1a2130260390520650Min: 734.3 / Avg: 735.5 / Max: 737.1

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption1a26001200180024003000SE +/- 4.37, N = 3SE +/- 10.39, N = 32783.62769.2
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption1a25001000150020002500Min: 2777.3 / Avg: 2783.6 / Max: 2792Min: 2752.9 / Avg: 2769.2 / Max: 2788.5

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption1a26001200180024003000SE +/- 4.28, N = 3SE +/- 12.45, N = 32784.82767.8
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption1a25001000150020002500Min: 2778.9 / Avg: 2784.77 / Max: 2793.1Min: 2748.9 / Avg: 2767.83 / Max: 2791.3

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption1a290180270360450SE +/- 0.15, N = 3SE +/- 0.34, N = 3404.0403.4
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption1a270140210280350Min: 403.7 / Avg: 403.97 / Max: 404.2Min: 403 / Avg: 403.43 / Max: 404.1

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption1a290180270360450SE +/- 0.32, N = 3SE +/- 0.38, N = 3400.8400.5
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption1a270140210280350Min: 400.3 / Avg: 400.83 / Max: 401.4Min: 399.9 / Avg: 400.53 / Max: 401.2

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption1a2160320480640800SE +/- 0.38, N = 3SE +/- 0.94, N = 3723.9722.2
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption1a2130260390520650Min: 723.2 / Avg: 723.93 / Max: 724.5Min: 720.9 / Avg: 722.17 / Max: 724

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption1a2160320480640800SE +/- 0.66, N = 3SE +/- 0.32, N = 3736.3735.0
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption1a2130260390520650Min: 735.2 / Avg: 736.33 / Max: 737.5Min: 734.4 / Avg: 734.97 / Max: 735.5

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption1a27001400210028003500SE +/- 8.42, N = 3SE +/- 18.72, N = 33394.03373.8
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption1a26001200180024003000Min: 3381.4 / Avg: 3394.03 / Max: 3410Min: 3343.5 / Avg: 3373.83 / Max: 3408

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption1a27001400210028003500SE +/- 1.88, N = 3SE +/- 20.76, N = 33387.63377.0
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption1a26001200180024003000Min: 3384.3 / Avg: 3387.6 / Max: 3390.8Min: 3339.9 / Avg: 3377 / Max: 3411.7

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool1a2140K280K420K560K700KSE +/- 2257.33, N = 3SE +/- 1574.42, N = 3667331667606
OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool1a2120K240K360K480K600KMin: 662816 / Avg: 667330.67 / Max: 669588Min: 664496 / Avg: 667606 / Max: 669588

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha5121a2300K600K900K1200K1500KSE +/- 801.33, N = 315887511587950
OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha5121a2300K600K900K1200K1500KMin: 1586347 / Avg: 1587949.67 / Max: 1588751

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.211a235001000150020002500SE +/- 18.57, N = 3SE +/- 14.18, N = 3SE +/- 17.07, N = 32180.52185.02187.11. (CXX) g++ options: -O3 -march=native -rdynamic
OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.211a23400800120016002000Min: 2143.9 / Avg: 2180.53 / Max: 2204.1Min: 2157.3 / Avg: 2184.97 / Max: 2204.2Min: 2153.1 / Avg: 2187.13 / Max: 2206.41. (CXX) g++ options: -O3 -march=native -rdynamic

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1a2313002600390052006500SE +/- 5.95, N = 3SE +/- 0.40, N = 3SE +/- 2.77, N = 36003.66043.16034.71. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1a2310002000300040005000Min: 5994.7 / Avg: 6003.6 / Max: 6014.9Min: 6042.6 / Avg: 6043.1 / Max: 6043.9Min: 6031.4 / Avg: 6034.7 / Max: 6040.21. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1a2311002200330044005500SE +/- 1.32, N = 3SE +/- 3.55, N = 3SE +/- 4.86, N = 35218.115224.185162.571. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1a239001800270036004500Min: 5215.47 / Avg: 5218.11 / Max: 5219.48Min: 5217.53 / Avg: 5224.18 / Max: 5229.64Min: 5155.52 / Avg: 5162.57 / Max: 5171.91. (CC) gcc options: -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression1a231002003004005004504504501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression1a2320406080100SE +/- 0.33, N = 3SE +/- 1.00, N = 31011021011. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression1a2320406080100Min: 100 / Avg: 100.67 / Max: 101Min: 100 / Avg: 101 / Max: 1031. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite1a2140K280K420K560K700KSE +/- 1381.24, N = 3SE +/- 1621.04, N = 3655165654272
OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite1a2110K220K330K440K550KMin: 652411 / Avg: 655165.33 / Max: 656726Min: 651371 / Avg: 654272.33 / Max: 656976

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C1a2312002400360048006000SE +/- 4.97, N = 3SE +/- 1.64, N = 3SE +/- 9.93, N = 35680.845669.095661.401. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C1a2310002000300040005000Min: 5671.94 / Avg: 5680.84 / Max: 5689.14Min: 5665.91 / Avg: 5669.09 / Max: 5671.34Min: 5641.69 / Avg: 5661.4 / Max: 5673.341. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_1001a2130260390520650SE +/- 0.20, N = 3SE +/- 0.18, N = 3615.64615.801. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_1001a2110220330440550Min: 615.36 / Avg: 615.64 / Max: 616.03Min: 615.49 / Avg: 615.8 / Max: 616.111. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 31a2330060090012001500SE +/- 1.09, N = 3SE +/- 3.28, N = 3SE +/- 5.21, N = 31611.21605.61606.61. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 31a2330060090012001500Min: 1609.1 / Avg: 1611.17 / Max: 1612.8Min: 1599.3 / Avg: 1605.57 / Max: 1610.4Min: 1596.3 / Avg: 1606.63 / Max: 1612.91. (CC) gcc options: -O3 -pthread -lz -llzma

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP1a220406080100SE +/- 0.26, N = 3SE +/- 0.06, N = 3104.64103.951. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP1a220406080100Min: 104.17 / Avg: 104.64 / Max: 105.04Min: 103.84 / Avg: 103.95 / Max: 104.061. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC21a234080120160200SE +/- 0.01, N = 3SE +/- 0.90, N = 3SE +/- 0.01, N = 3165.24164.08165.261. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC21a23306090120150Min: 165.21 / Avg: 165.24 / Max: 165.26Min: 162.31 / Avg: 164.08 / Max: 165.23Min: 165.24 / Avg: 165.26 / Max: 165.271. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 101a230.71711.43422.15132.86843.5855SE +/- 0.004, N = 3SE +/- 0.027, N = 3SE +/- 0.002, N = 33.1583.1873.154
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 101a23246810Min: 3.15 / Avg: 3.16 / Max: 3.16Min: 3.16 / Avg: 3.19 / Max: 3.24Min: 3.15 / Avg: 3.15 / Max: 3.16

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack1a248121620SE +/- 0.01, N = 5SE +/- 0.01, N = 516.6916.691. (CXX) g++ options: -rdynamic
OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack1a248121620Min: 16.68 / Avg: 16.69 / Max: 16.73Min: 16.67 / Avg: 16.69 / Max: 16.731. (CXX) g++ options: -rdynamic

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1a231.6M3.2M4.8M6.4M8MSE +/- 25940.20, N = 3SE +/- 26469.09, N = 3SE +/- 8686.26, N = 37355877746246474809331. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1a231.3M2.6M3.9M5.2M6.5MMin: 7305305 / Avg: 7355877.33 / Max: 7391189Min: 7410315 / Avg: 7462464.33 / Max: 7496424Min: 7467738 / Avg: 7480933.33 / Max: 74973171. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p1a23100200300400500SE +/- 0.72, N = 3SE +/- 1.54, N = 3SE +/- 1.05, N = 3436.17446.95451.88MIN: 338.67 / MAX: 653.63MIN: 335.55 / MAX: 660.93MIN: 337.72 / MAX: 664.931. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p1a2380160240320400Min: 434.74 / Avg: 436.17 / Max: 436.98Min: 444.73 / Avg: 446.95 / Max: 449.92Min: 449.81 / Avg: 451.88 / Max: 453.221. (CC) gcc options: -pthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v21a280160240320400SE +/- 2.01, N = 3SE +/- 0.47, N = 3372.04368.30MIN: 368.17 / MAX: 376.4MIN: 366.89 / MAX: 369.961. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v21a270140210280350Min: 368.8 / Avg: 372.04 / Max: 375.72Min: 367.39 / Avg: 368.3 / Max: 368.941. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.11a270140210280350SE +/- 0.07, N = 3SE +/- 0.09, N = 3343.57343.21MIN: 343.23 / MAX: 344.21MIN: 342.85 / MAX: 343.841. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.11a260120180240300Min: 343.49 / Avg: 343.57 / Max: 343.72Min: 343.07 / Avg: 343.21 / Max: 343.371. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression1a23140280420560700SE +/- 0.58, N = 36506506511. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression1a23110220330440550Min: 649 / Avg: 650 / Max: 6511. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression1a2340801201602001681701691. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression1a23120240360480600SE +/- 0.88, N = 3SE +/- 0.33, N = 35625645621. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression1a23100200300400500Min: 560 / Avg: 561.67 / Max: 563Min: 563 / Avg: 563.67 / Max: 5641. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression1a2390180270360450SE +/- 0.33, N = 34164194171. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression1a2370140210280350Min: 416 / Avg: 416.67 / Max: 4171. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression1a23400800120016002000SE +/- 11.50, N = 3SE +/- 8.19, N = 31743174817391. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression1a2330060090012001500Min: 1720 / Avg: 1743 / Max: 1755Min: 1723 / Avg: 1739 / Max: 17501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression1a23204060801008182821. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression1a2330060090012001500SE +/- 3.61, N = 3SE +/- 2.89, N = 3SE +/- 1.76, N = 31607160716101. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression1a2330060090012001500Min: 1602 / Avg: 1607 / Max: 1614Min: 1602 / Avg: 1607 / Max: 1612Min: 1607 / Avg: 1609.67 / Max: 16131. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression1a23100200300400500SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 1.53, N = 34524514501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression1a2380160240320400Min: 451 / Avg: 452 / Max: 453Min: 450 / Avg: 450.67 / Max: 452Min: 447 / Avg: 450 / Max: 4521. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1a233691215SE +/- 0.04, N = 5SE +/- 0.04, N = 5SE +/- 0.07, N = 512.7712.8112.781. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1a2348121620Min: 12.66 / Avg: 12.77 / Max: 12.92Min: 12.68 / Avg: 12.81 / Max: 12.93Min: 12.64 / Avg: 12.78 / Max: 12.971. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.31a236001200180024003000SE +/- 2.52, N = 3SE +/- 1.49, N = 3SE +/- 1.38, N = 32738.352738.012739.311. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.31a235001000150020002500Min: 2734.03 / Avg: 2738.35 / Max: 2742.77Min: 2735.89 / Avg: 2738.01 / Max: 2740.89Min: 2736.92 / Avg: 2739.31 / Max: 2741.711. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering1a2360120180240300SE +/- 0.11, N = 3SE +/- 0.99, N = 3SE +/- 0.04, N = 3280.77279.66280.581. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering1a2350100150200250Min: 280.56 / Avg: 280.77 / Max: 280.88Min: 277.68 / Avg: 279.66 / Max: 280.74Min: 280.52 / Avg: 280.58 / Max: 280.641. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding1a22004006008001000SE +/- 1.69, N = 3SE +/- 1.69, N = 31161.001159.321. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding1a22004006008001000Min: 1157.63 / Avg: 1161 / Max: 1162.69Min: 1157.63 / Avg: 1159.32 / Max: 1162.691. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding1a2150300450600750SE +/- 9.06, N = 3SE +/- 2.68, N = 3690.02682.731. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding1a2120240360480600Min: 680.96 / Avg: 690.02 / Max: 708.13Min: 679.22 / Avg: 682.73 / Max: 6881. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC11a2370140210280350SE +/- 0.06, N = 3SE +/- 0.37, N = 3SE +/- 0.33, N = 3299.46298.89299.221. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC11a2350100150200250Min: 299.35 / Avg: 299.46 / Max: 299.56Min: 298.4 / Avg: 298.89 / Max: 299.61Min: 298.57 / Avg: 299.22 / Max: 299.591. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1a233691215SE +/- 0.014, N = 5SE +/- 0.014, N = 5SE +/- 0.013, N = 59.6069.6029.6171. (CXX) g++ options: -fvisibility=hidden -logg -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1a233691215Min: 9.59 / Avg: 9.61 / Max: 9.66Min: 9.57 / Avg: 9.6 / Max: 9.65Min: 9.6 / Avg: 9.62 / Max: 9.671. (CXX) g++ options: -fvisibility=hidden -logg -lm

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH1a2400K800K1200K1600K2000KSE +/- 8997.19, N = 3SE +/- 3103.90, N = 31718046.711717450.171. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH1a2300K600K900K1200K1500KMin: 1703298.25 / Avg: 1718046.71 / Max: 1734349Min: 1712627.5 / Avg: 1717450.17 / Max: 1723246.621. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression1a23501001502002502132142141. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET1a2400K800K1200K1600K2000KSE +/- 17544.22, N = 3SE +/- 8366.29, N = 31980808.831981355.171. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET1a2300K600K900K1200K1500KMin: 1945935 / Avg: 1980808.83 / Max: 2001601.38Min: 1964649 / Avg: 1981355.17 / Max: 1990522.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP1a2600K1200K1800K2400K3000KSE +/- 6710.43, N = 3SE +/- 6466.66, N = 32827879.751729976.591. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP1a2500K1000K1500K2000K2500KMin: 2816153 / Avg: 2827879.75 / Max: 2839395.75Min: 1717917.88 / Avg: 1729976.59 / Max: 1740054.881. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD1a2500K1000K1500K2000K2500KSE +/- 6947.45, N = 3SE +/- 3348.75, N = 32261333.332247597.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD1a2400K800K1200K1600K2000KMin: 2251752.25 / Avg: 2261333.33 / Max: 2274839Min: 2241707.25 / Avg: 2247597.67 / Max: 2253303.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET1a2600K1200K1800K2400K3000KSE +/- 17672.66, N = 3SE +/- 10642.86, N = 32703703.332565612.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET1a2500K1000K1500K2000K2500KMin: 2684632.5 / Avg: 2703703.33 / Max: 2739010.75Min: 2553064 / Avg: 2565612.75 / Max: 25867771. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1a233691215SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 312.5912.6112.541. (CC) gcc options: -std=c99 -O3 -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1a2348121620Min: 12.55 / Avg: 12.58 / Max: 12.61Min: 12.46 / Avg: 12.61 / Max: 12.78Min: 12.5 / Avg: 12.54 / Max: 12.591. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1a23246810SE +/- 0.02011, N = 3SE +/- 0.02083, N = 3SE +/- 0.00510, N = 37.595457.231237.20852MIN: 7.47MIN: 7.1MIN: 7.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1a233691215Min: 7.56 / Avg: 7.6 / Max: 7.63Min: 7.21 / Avg: 7.23 / Max: 7.27Min: 7.2 / Avg: 7.21 / Max: 7.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1a2246810SE +/- 0.00, N = 3SE +/- 0.00, N = 36.236.231. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1a2246810Min: 6.23 / Avg: 6.23 / Max: 6.23Min: 6.23 / Avg: 6.23 / Max: 6.241. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1a230.99231.98462.97693.96924.9615SE +/- 0.03942, N = 3SE +/- 0.00961, N = 3SE +/- 0.02669, N = 34.410113.990594.10228MIN: 4.14MIN: 3.92MIN: 3.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1a23246810Min: 4.33 / Avg: 4.41 / Max: 4.46Min: 3.97 / Avg: 3.99 / Max: 4Min: 4.05 / Avg: 4.1 / Max: 4.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1a23510152025SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 318.3618.3818.10MIN: 17.58MIN: 17.99MIN: 17.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1a23510152025Min: 18.1 / Avg: 18.36 / Max: 18.61Min: 18.31 / Avg: 18.38 / Max: 18.43Min: 18 / Avg: 18.1 / Max: 18.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01a23691215SE +/- 0.005, N = 3SE +/- 0.005, N = 39.2289.2311. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01a23691215Min: 9.22 / Avg: 9.23 / Max: 9.24Min: 9.22 / Avg: 9.23 / Max: 9.241. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C1a232004006008001000SE +/- 16.69, N = 3SE +/- 1.43, N = 3SE +/- 1.30, N = 31017.82983.131032.051. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C1a232004006008001000Min: 984.59 / Avg: 1017.82 / Max: 1037.17Min: 980.51 / Avg: 983.13 / Max: 985.43Min: 1029.52 / Avg: 1032.05 / Max: 1033.831. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p1a2390180270360450SE +/- 0.99, N = 3SE +/- 0.48, N = 3SE +/- 1.00, N = 3427.62433.95433.44MIN: 366.19 / MAX: 462MIN: 372.19 / MAX: 467.89MIN: 370.03 / MAX: 468.991. (CC) gcc options: -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p1a2380160240320400Min: 425.74 / Avg: 427.62 / Max: 429.12Min: 433 / Avg: 433.95 / Max: 434.54Min: 431.6 / Avg: 433.44 / Max: 435.061. (CC) gcc options: -pthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1a2246810SE +/- 0.03, N = 3SE +/- 0.00, N = 37.137.121. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1a23691215Min: 7.09 / Avg: 7.13 / Max: 7.18Min: 7.12 / Avg: 7.12 / Max: 7.131. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1a23714212835SE +/- 0.18, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 331.7831.0331.03MIN: 31.28MIN: 30.81MIN: 30.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1a23714212835Min: 31.42 / Avg: 31.78 / Max: 31.97Min: 30.94 / Avg: 31.03 / Max: 31.16Min: 30.96 / Avg: 31.03 / Max: 31.111. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1a23816243240SE +/- 0.39, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 333.3232.1432.12MIN: 32.4MIN: 31.94MIN: 31.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1a23714212835Min: 32.54 / Avg: 33.32 / Max: 33.83Min: 32.05 / Avg: 32.14 / Max: 32.32Min: 32.03 / Avg: 32.12 / Max: 32.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default1a231.24112.48223.72334.96446.2055SE +/- 0.029, N = 3SE +/- 0.020, N = 3SE +/- 0.030, N = 35.5165.4915.4661. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default1a23246810Min: 5.46 / Avg: 5.52 / Max: 5.56Min: 5.46 / Avg: 5.49 / Max: 5.53Min: 5.41 / Avg: 5.47 / Max: 5.521. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT11a2330060090012001500SE +/- 1.99, N = 3SE +/- 0.69, N = 3SE +/- 1.26, N = 31208.891203.151208.661. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT11a232004006008001000Min: 1206.27 / Avg: 1208.89 / Max: 1212.8Min: 1202.28 / Avg: 1203.15 / Max: 1204.52Min: 1206.15 / Avg: 1208.66 / Max: 1210.171. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1a231.12752.2553.38254.515.6375SE +/- 0.016, N = 3SE +/- 0.013, N = 3SE +/- 0.004, N = 35.0024.9945.0111. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1a23246810Min: 4.97 / Avg: 5 / Max: 5.03Min: 4.97 / Avg: 4.99 / Max: 5.01Min: 5 / Avg: 5.01 / Max: 5.021. (CXX) g++ options: -O3 -pthread -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1a231.19692.39383.59074.78765.9845SE +/- 0.03383, N = 3SE +/- 0.01117, N = 3SE +/- 0.02611, N = 35.319394.617624.66944MIN: 5.1MIN: 4.49MIN: 4.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1a23246810Min: 5.27 / Avg: 5.32 / Max: 5.38Min: 4.6 / Avg: 4.62 / Max: 4.64Min: 4.62 / Avg: 4.67 / Max: 4.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1a233691215SE +/- 0.04090, N = 3SE +/- 0.02351, N = 3SE +/- 0.13403, N = 310.268708.130498.71496MIN: 10.12MIN: 8.05MIN: 8.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1a233691215Min: 10.19 / Avg: 10.27 / Max: 10.31Min: 8.09 / Avg: 8.13 / Max: 8.17Min: 8.57 / Avg: 8.71 / Max: 8.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

200 Results Shown

Quantum ESPRESSO
WebP2 Image Encode
VkFFT
Basis Universal
WebP2 Image Encode
CP2K Molecular Dynamics
AI Benchmark Alpha:
  Device AI Score
  Device Training Score
  Device Inference Score
NAS Parallel Benchmarks
OpenFOAM
ASTC Encoder
WebP2 Image Encode
GROMACS
BRL-CAD
CloverLeaf
Numpy Benchmark
asmFish
Build2
Gcrypt Library
Timed Godot Game Engine Compilation
Kripke
Zstd Compression
dav1d
NAS Parallel Benchmarks
Timed HMMer Search
VKMark
ASKAP:
  tConvolve MT - Degridding
  tConvolve MT - Gridding
Embree
ONNX Runtime:
  fcn-resnet101-11 - OpenMP CPU
  bertsquad-10 - OpenMP CPU
  yolov4 - OpenMP CPU
  shufflenet-v2-10 - OpenMP CPU
VkResample
ONNX Runtime
Mobile Neural Network:
  inception-v3
  mobilenet-v1-1.0
  MobileNetV2_224
  resnet-v2-50
  SqueezeNetV1.0
Coremark
Embree
eSpeak-NG Speech Engine
NCNN:
  CPU - regnety_400m
  CPU - squeezenet_ssd
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
  Vulkan GPU - regnety_400m
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
Stockfish
Basis Universal
oneDNN:
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
Embree
CLOMP
Timed Eigen Compilation
Timed FFmpeg Compilation
oneDNN
Warsow
oneDNN:
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Node.js V8 Web Tooling Benchmark
FinanceBench
Embree:
  Pathtracer ISPC - Crown
  Pathtracer - Asian Dragon
simdjson
Embree
LZ4 Compression:
  9 - Decompression Speed
  9 - Compression Speed
  3 - Decompression Speed
  3 - Compression Speed
oneDNN
ASKAP:
  tConvolve MPI - Gridding
  tConvolve MPI - Degridding
IndigoBench:
  CPU - Bedroom
  CPU - Supercar
SQLite Speedtest
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
Cython Benchmark
Basis Universal
FinanceBench
VkResample
simdjson
Basis Universal
NAS Parallel Benchmarks
oneDNN
rav1e
WebP2 Image Encode
simdjson:
  PartialTweets
  DistinctUserID
ASTC Encoder
rav1e
NAS Parallel Benchmarks
QMCPACK
lzbench:
  XZ 0 - Decompression
  XZ 0 - Compression
rav1e
Unpacking Firefox
dav1d
oneDNN
Algebraic Multi-Grid Benchmark
Cryptsetup:
  Twofish-XTS 512b Decryption
  Twofish-XTS 512b Encryption
  Serpent-XTS 512b Decryption
  Serpent-XTS 512b Encryption
  AES-XTS 512b Decryption
  AES-XTS 512b Encryption
  Twofish-XTS 256b Decryption
  Twofish-XTS 256b Encryption
  Serpent-XTS 256b Decryption
  Serpent-XTS 256b Encryption
  AES-XTS 256b Decryption
  AES-XTS 256b Encryption
  PBKDF2-whirlpool
  PBKDF2-sha512
QuantLib
LZ4 Compression:
  1 - Decompression Speed
  1 - Compression Speed
lzbench:
  Crush 0 - Decompression
  Crush 0 - Compression
PHPBench
NAS Parallel Benchmarks
Google SynthMark
Zstd Compression
ASKAP
Etcpak
rav1e
WavPack Audio Encoding
Crafty
dav1d
TNN:
  CPU - MobileNet v2
  CPU - SqueezeNet v1.1
lzbench:
  Brotli 2 - Decompression
  Brotli 2 - Compression
  Brotli 0 - Decompression
  Brotli 0 - Compression
  Zstd 8 - Decompression
  Zstd 8 - Compression
  Zstd 1 - Decompression
  Zstd 1 - Compression
Monkey Audio Encoding
LULESH
Etcpak
ASKAP:
  tConvolve OpenMP - Degridding
  tConvolve OpenMP - Gridding
Etcpak
Opus Codec Encoding
Redis
lzbench
Redis:
  SET
  LPOP
  SADD
  GET
Timed MAFFT Alignment
oneDNN
ASTC Encoder
oneDNN:
  IP Shapes 3D - u8s8f32 - CPU
  IP Shapes 3D - f32 - CPU
Basis Universal
NAS Parallel Benchmarks
dav1d
ASTC Encoder
oneDNN:
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
WebP2 Image Encode
Etcpak
LAMMPS Molecular Dynamics Simulator
oneDNN:
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU