ubuntu-2010-onlogic

Intel Xeon E-2278GEL testing with a Logic Supply RXM-181 (Z01-0001A027 BIOS) and Intel UHD P630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102011-HA-UBUNTU20190
Run Management

Run   Date               Test Duration
1     January 30 2021    1 Hour, 11 Minutes
1a    January 30 2021    19 Hours, 5 Minutes
2     January 31 2021    19 Hours, 24 Minutes
3     February 01 2021   8 Hours, 55 Minutes
Average test duration: 12 Hours, 9 Minutes
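
The final duration in the run table (12 Hours, 9 Minutes) is consistent with the mean of the four run durations. The sketch below is one way to check that arithmetic; the helper names `to_minutes` and `format_minutes` are illustrative and not part of the Phoronix Test Suite.

```python
import re

# Durations copied from the run table above, keyed by run identifier.
DURATIONS = {
    "1": "1 Hour, 11 Minutes",
    "1a": "19 Hours, 5 Minutes",
    "2": "19 Hours, 24 Minutes",
    "3": "8 Hours, 55 Minutes",
}


def to_minutes(text):
    """Convert a string such as '19 Hours, 5 Minutes' to total minutes."""
    hours = re.search(r"(\d+)\s+Hours?", text)
    minutes = re.search(r"(\d+)\s+Minutes?", text)
    total = int(hours.group(1)) * 60 if hours else 0
    total += int(minutes.group(1)) if minutes else 0
    return total


def format_minutes(total):
    """Format a minute count (rounded to the nearest minute) as hours and minutes."""
    h, m = divmod(round(total), 60)
    return f"{h} Hours, {m} Minutes"


mean_minutes = sum(to_minutes(d) for d in DURATIONS.values()) / len(DURATIONS)
print(format_minutes(mean_minutes))
```

The four durations sum to 2915 minutes, so the mean is 728.75 minutes, which rounds to the figure shown in the table.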



System Details (identical for runs 1, 1a, 2 and 3)

Processor: Intel Xeon E-2278GEL @ 3.90GHz (8 Cores / 16 Threads)
Motherboard: Logic Supply RXM-181 (Z01-0001A027 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 16GB
Disk: 512GB TS512GMTE510T
Graphics: Intel UHD P630 3GB (1150MHz)
Audio: Realtek ALC233
Monitor: DELL P2415Q
Network: Intel I219-LM + 2 x Intel I210
OS: Ubuntu 20.10
Kernel: 5.8.0-41-generic (x86_64)
Desktop: GNOME Shell 3.38.2
Display Server: X Server 1.20.9
Display Driver: intel
OpenGL: 4.6 Mesa 20.2.6
Vulkan: 1.2.145
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3

Python Details: Python 3.8.6

Security Details: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

[Result overview table: one row per test, with columns for runs 1, 1a, 2 and 3. The cell values were fused together during extraction and cannot be separated reliably. Test profiles covered: redis, onednn, npb, ncnn, dav1d, clomp, numpy, espeak, build-eigen, build-ffmpeg, cython-bench, stockfish, asmfish, vkfft, crafty, simdjson, vkmark, qe, onnx, brl-cad, cloverleaf, qmcpack, sqlite-speedtest, lzbench, compress-lz4, askap, unpack-firefox, rav1e, tnn, mnn, embree, webp2, basis, build2, etcpak, vkresample, gcrypt, cryptsetup, build-godot, mafft, node-web-tooling, gromacs, coremark, compress-zstd, encode-ape, lammps, quantlib, kripke, astcenc, financebench, encode-opus, phpbench, cp2k, ai-benchmark, indigobench, openfoam, hmmer, lulesh, amg, synthmark, encode-wavpack, warsow, ddnet.]

DDraceNetwork

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time
Milliseconds, Fewer Is Better
Min: 3.69 / Avg: 4.77 / Max: 18.37
1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time
Milliseconds, Fewer Is Better
Min: 8.52 / Avg: 10.85 / Max: 18.85
1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: LPOP
Requests Per Second, More Is Better
1a: 2827879.75 (SE +/- 6710.43, N = 3; Min: 2816153 / Avg: 2827879.75 / Max: 2839395.75)
2: 1729976.59 (SE +/- 6466.66, N = 3; Min: 1717917.88 / Avg: 1729976.59 / Max: 1740054.88)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
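
The "SE +/-" figures can be cross-checked against the Min / Avg / Max values. The sketch below assumes that SE means the sample standard deviation divided by the square root of N; the page does not state this, but the Redis LPOP numbers are consistent with it. With N = 3, the one sample that is not printed follows from the average. The function names are illustrative only.

```python
import math
import statistics


def third_sample(minimum, average, maximum):
    """With exactly three samples, recover the middle one from min, avg and max."""
    return 3 * average - minimum - maximum


def standard_error(samples):
    """Sample standard deviation divided by sqrt(N) (assumed definition of SE)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


# (min, avg, max) for Redis LPOP, copied from the result above.
LPOP = {
    "1a": (2816153.0, 2827879.75, 2839395.75),
    "2": (1717917.88, 1729976.59, 1740054.88),
}

for run, (lo, avg, hi) in LPOP.items():
    samples = [lo, third_sample(lo, avg, hi), hi]
    print(run, round(standard_error(samples), 2))
```

Because the published average is rounded to two decimals, the recomputed values can differ from the reported 6710.43 and 6466.66 in the last digits.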

oneDNN

This test exercises Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
1a: 8.52785 (SE +/- 0.11268, N = 3; MIN: 8.22; Min: 8.34 / Avg: 8.53 / Max: 8.73)
2: 6.85206 (SE +/- 0.07900, N = 15; MIN: 6.11; Min: 6.56 / Avg: 6.85 / Max: 7.41)
3: 6.64536 (SE +/- 0.04844, N = 15; MIN: 5.91; Min: 6.06 / Avg: 6.65 / Max: 6.88)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
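
Results on this page mix "Fewer Is Better" and "More Is Better" metrics, so a run-to-run comparison has to account for direction. The sketch below shows one possible convention: a signed relative improvement where positive always means the newer run did better. The function name and the `higher_is_better` flag are illustrative choices, not part of the Phoronix Test Suite.

```python
def relative_improvement(baseline, candidate, higher_is_better):
    """Signed fractional improvement of candidate over baseline.

    Positive means the candidate is better, whichever direction the
    metric runs in.
    """
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    change = (candidate - baseline) / baseline
    return change if higher_is_better else -change


# oneDNN IP Shapes 1D, f32 (ms, fewer is better): run 1a vs. run 2.
onednn = relative_improvement(8.52785, 6.85206, higher_is_better=False)

# Redis LPOP (requests per second, more is better): run 1a vs. run 2.
redis = relative_improvement(2827879.75, 1729976.59, higher_is_better=True)

print(f"oneDNN IP Shapes 1D f32: {onednn:+.1%}")
print(f"Redis LPOP: {redis:+.1%}")
```

With the values from this page, run 2 comes out ahead of run 1a on the oneDNN harness and behind it on Redis LPOP.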

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
1a: 10.26870 (SE +/- 0.04090, N = 3; MIN: 10.12; Min: 10.19 / Avg: 10.27 / Max: 10.31)
2: 8.13049 (SE +/- 0.02351, N = 3; MIN: 8.05; Min: 8.09 / Avg: 8.13 / Max: 8.17)
3: 8.71496 (SE +/- 0.13403, N = 3; MIN: 8.53; Min: 8.57 / Avg: 8.71 / Max: 8.98)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
1a: 5.31939 (SE +/- 0.03383, N = 3; MIN: 5.1; Min: 5.27 / Avg: 5.32 / Max: 5.38)
2: 4.61762 (SE +/- 0.01117, N = 3; MIN: 4.49; Min: 4.6 / Avg: 4.62 / Max: 4.64)
3: 4.66944 (SE +/- 0.02611, N = 3; MIN: 4.56; Min: 4.62 / Avg: 4.67 / Max: 4.71)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
1a: 4.41011 (SE +/- 0.03942, N = 3; MIN: 4.14; Min: 4.33 / Avg: 4.41 / Max: 4.46)
2: 3.99059 (SE +/- 0.00961, N = 3; MIN: 3.92; Min: 3.97 / Avg: 3.99 / Max: 4)
3: 4.10228 (SE +/- 0.02669, N = 3; MIN: 3.98; Min: 4.05 / Avg: 4.1 / Max: 4.15)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Redis 6.0.9 - Test: GET
Requests Per Second, More Is Better
1a: 2703703.33 (SE +/- 17672.66, N = 3; Min: 2684632.5 / Avg: 2703703.33 / Max: 2739010.75)
2: 2565612.75 (SE +/- 10642.86, N = 3; Min: 2553064 / Avg: 2565612.75 / Max: 2586777)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
1a: 7.59545 (SE +/- 0.02011, N = 3; MIN: 7.47; Min: 7.56 / Avg: 7.6 / Max: 7.63)
2: 7.23123 (SE +/- 0.02083, N = 3; MIN: 7.1; Min: 7.21 / Avg: 7.23 / Max: 7.27)
3: 7.20852 (SE +/- 0.00510, N = 3; MIN: 7.12; Min: 7.2 / Avg: 7.21 / Max: 7.21)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

NPB (NAS Parallel Benchmarks) is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and lets you select among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C
Total Mop/s, More Is Better
1a: 1017.82 (SE +/- 16.69, N = 3; Min: 984.59 / Avg: 1017.82 / Max: 1037.17)
2: 983.13 (SE +/- 1.43, N = 3; Min: 980.51 / Avg: 983.13 / Max: 985.43)
3: 1032.05 (SE +/- 1.30, N = 3; Min: 1029.52 / Avg: 1032.05 / Max: 1033.83)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better
1a: 5.78 (SE +/- 0.03, N = 3; MIN: 5.59 / MAX: 6.95; Min: 5.72 / Avg: 5.78 / Max: 5.83)
2: 6.04 (SE +/- 0.19, N = 3; MIN: 5.6 / MAX: 7.71; Min: 5.85 / Avg: 6.04 / Max: 6.41)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: regnety_400m
ms, Fewer Is Better
1a: 17.90 (SE +/- 0.22, N = 3; MIN: 17.15 / MAX: 18.95; Min: 17.47 / Avg: 17.9 / Max: 18.19)
2: 18.61 (SE +/- 0.08, N = 3; MIN: 17.25 / MAX: 20; Min: 18.45 / Avg: 18.61 / Max: 18.7)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
1a: 33.32 (SE +/- 0.39, N = 3; MIN: 32.4; Min: 32.54 / Avg: 33.32 / Max: 33.83)
2: 32.14 (SE +/- 0.09, N = 3; MIN: 31.94; Min: 32.05 / Avg: 32.14 / Max: 32.32)
3: 32.12 (SE +/- 0.08, N = 3; MIN: 31.91; Min: 32.03 / Avg: 32.12 / Max: 32.28)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Chimera 1080p
FPS, More Is Better
1a: 436.17 (SE +/- 0.72, N = 3; MIN: 338.67 / MAX: 653.63; Min: 434.74 / Avg: 436.17 / Max: 436.98)
2: 446.95 (SE +/- 1.54, N = 3; MIN: 335.55 / MAX: 660.93; Min: 444.73 / Avg: 446.95 / Max: 449.92)
3: 451.88 (SE +/- 1.05, N = 3; MIN: 337.72 / MAX: 664.93; Min: 449.81 / Avg: 451.88 / Max: 453.22)
1. (CC) gcc options: -pthread

oneDNN

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
1a: 3508.55 (SE +/- 12.64, N = 3; MIN: 3442.98; Min: 3483.3 / Avg: 3508.55 / Max: 3522.33)
2: 3402.40 (SE +/- 1.31, N = 3; MIN: 3364.88; Min: 3400.18 / Avg: 3402.4 / Max: 3404.7)
3: 3387.30 (SE +/- 6.49, N = 3; MIN: 3338.56; Min: 3375.62 / Avg: 3387.3 / Max: 3398.06)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 1.2 - Static OMP Speedup
Speedup, More Is Better
1a: 2.9 (SE +/- 0.03, N = 3; Min: 2.9 / Avg: 2.93 / Max: 3)
2: 3.0 (SE +/- 0.03, N = 9; Min: 2.8 / Avg: 2.96 / Max: 3.1)
3: 3.0 (SE +/- 0.03, N = 3; Min: 3 / Avg: 3.03 / Max: 3.1)
1. (CC) gcc options: -fopenmp -O3 -lm
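
A speedup figure is easier to interpret next to the number of workers that produced it. The sketch below computes parallel efficiency (speedup divided by worker count) for the CLOMP results. The page does not say how many OpenMP threads CLOMP used, so both the 8 physical cores and the 16 hardware threads of the Xeon E-2278GEL are shown as assumptions, and `parallel_efficiency` is an illustrative helper rather than anything CLOMP reports.

```python
# Speedup values for runs 1a, 2 and 3, copied from the CLOMP result above.
SPEEDUPS = {"1a": 2.9, "2": 3.0, "3": 3.0}

# From the system details: 8 cores / 16 threads. Which count applies to the
# CLOMP run is an assumption, so both are evaluated.
CORES = 8
THREADS = 16


def parallel_efficiency(speedup, workers):
    """Fraction of ideal linear scaling achieved: speedup / workers."""
    if workers <= 0:
        raise ValueError("workers must be positive")
    return speedup / workers


for run, speedup in SPEEDUPS.items():
    per_core = parallel_efficiency(speedup, CORES)
    per_thread = parallel_efficiency(speedup, THREADS)
    print(f"run {run}: {per_core:.1%} of ideal on {CORES} cores, "
          f"{per_thread:.1%} on {THREADS} threads")
```

Under either assumption the measured speedup is well below linear scaling, which fits a benchmark designed to expose OpenMP overheads.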

Numpy Benchmark

This test measures general NumPy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark
Score, More Is Better
1a: 329.04 (SE +/- 0.25, N = 3; Min: 328.53 / Avg: 329.04 / Max: 329.34)
2: 339.74 (SE +/- 0.16, N = 3; Min: 339.58 / Avg: 339.74 / Max: 340.06)
3: 328.97 (SE +/- 0.58, N = 3; Min: 327.81 / Avg: 328.97 / Max: 329.61)

oneDNN

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
1a: 3472.52 (SE +/- 42.16, N = 3; MIN: 3371.23; Min: 3401.87 / Avg: 3472.52 / Max: 3547.69)
2: 3402.03 (SE +/- 7.92, N = 3; MIN: 3340.17; Min: 3391.2 / Avg: 3402.03 / Max: 3417.45)
3: 3363.67 (SE +/- 1.67, N = 3; MIN: 3322.79; Min: 3360.84 / Avg: 3363.67 / Max: 3366.62)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

NCNN 20201218 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better
1a: 7.76 (SE +/- 0.03, N = 3; MIN: 7.45 / MAX: 9.82; Min: 7.71 / Avg: 7.76 / Max: 7.79)
2: 8.00 (SE +/- 0.14, N = 3; MIN: 7.65 / MAX: 9.88; Min: 7.85 / Avg: 8 / Max: 8.28)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: blazeface
ms, Fewer Is Better
1a: 2.44 (SE +/- 0.06, N = 3; MIN: 2.28 / MAX: 2.98; Min: 2.32 / Avg: 2.44 / Max: 2.51)
2: 2.51 (SE +/- 0.01, N = 3; MIN: 2.35 / MAX: 3.03; Min: 2.49 / Avg: 2.51 / Max: 2.52)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: shufflenet-v2
ms, Fewer Is Better
1a: 6.64 (SE +/- 0.09, N = 3; MIN: 6.5 / MAX: 8.16; Min: 6.55 / Avg: 6.64 / Max: 6.82)
2: 6.83 (SE +/- 0.11, N = 3; MIN: 6.65 / MAX: 7.98; Min: 6.7 / Avg: 6.83 / Max: 7.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: shufflenet-v2
ms, Fewer Is Better
1a: 6.67 (SE +/- 0.08, N = 3; MIN: 6.51 / MAX: 8.11; Min: 6.56 / Avg: 6.67 / Max: 6.83)
2: 6.86 (SE +/- 0.18, N = 3; MIN: 6.5 / MAX: 8.1; Min: 6.67 / Avg: 6.86 / Max: 7.22)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis
Seconds, Fewer Is Better
1a: 30.82 (SE +/- 0.44, N = 4; Min: 29.81 / Avg: 30.82 / Max: 31.95)
2: 31.64 (SE +/- 0.22, N = 20; Min: 31.1 / Avg: 31.64 / Max: 34.74)
3: 30.96 (SE +/- 0.19, N = 4; Min: 30.62 / Avg: 30.96 / Max: 31.5)
1. (CC) gcc options: -O2 -std=c99

oneDNN

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better
1a: 3469.63 (SE +/- 27.44, N = 3; MIN: 3389.1; Min: 3427.09 / Avg: 3469.63 / Max: 3520.93)
2: 3382.74 (SE +/- 13.00, N = 3; MIN: 3319.45; Min: 3358.02 / Avg: 3382.74 / Max: 3402.1)
3: 3382.38 (SE +/- 14.91, N = 3; MIN: 3321.67; Min: 3358.86 / Avg: 3382.38 / Max: 3410.01)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D
Total Mop/s, More Is Better
1a: 870.70 (SE +/- 10.93, N = 12; Min: 788.51 / Avg: 870.7 / Max: 903.38)
2: 860.27 (SE +/- 8.64, N = 12; Min: 818.02 / Avg: 860.27 / Max: 898.51)
3: 881.34 (SE +/- 13.60, N = 3; Min: 854.13 / Avg: 881.34 / Max: 895.23)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

oneDNN

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
1a: 31.78 (SE +/- 0.18, N = 3; Min: 31.42 / Avg: 31.78 / Max: 31.97; MIN: 31.28)
2: 31.03 (SE +/- 0.06, N = 3; Min: 30.94 / Avg: 31.03 / Max: 31.16; MIN: 30.81)
3: 31.03 (SE +/- 0.04, N = 3; Min: 30.96 / Avg: 31.03 / Max: 31.11; MIN: 30.84)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

Timed Eigen Compilation 3.3.9 - Time To Compile (Seconds, Fewer Is Better)
1a: 89.91 (SE +/- 0.01, N = 3; Min: 89.89 / Avg: 89.91 / Max: 89.94)
2: 87.79 (SE +/- 0.03, N = 3; Min: 87.75 / Avg: 87.79 / Max: 87.84)
3: 89.59 (SE +/- 0.11, N = 3; Min: 89.44 / Avg: 89.59 / Max: 89.8)

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation 4.2.2 - Time To Compile (Seconds, Fewer Is Better)
1a: 86.59 (SE +/- 0.42, N = 3; Min: 86.05 / Avg: 86.59 / Max: 87.4)
2: 88.43 (SE +/- 1.19, N = 3; Min: 87.2 / Avg: 88.43 / Max: 90.81)
3: 87.36 (SE +/- 0.24, N = 3; Min: 86.93 / Avg: 87.36 / Max: 87.76)

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
1a: 6006.34 (SE +/- 16.93, N = 3; Min: 5981.86 / Avg: 6006.34 / Max: 6038.83; MIN: 5936.3)
2: 5899.12 (SE +/- 3.24, N = 3; Min: 5894.5 / Avg: 5899.12 / Max: 5905.37; MIN: 5840.53)
3: 5884.76 (SE +/- 6.65, N = 3; Min: 5872.33 / Avg: 5884.76 / Max: 5895.09; MIN: 5829.64)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

Cython Benchmark 0.29.21 - Test: N-Queens (Seconds, Fewer Is Better)
1a: 25.82 (SE +/- 0.03, N = 3; Min: 25.76 / Avg: 25.82 / Max: 25.86)
2: 26.33 (SE +/- 0.25, N = 15; Min: 25.72 / Avg: 26.33 / Max: 29.11)
3: 25.83 (SE +/- 0.09, N = 3; Min: 25.74 / Avg: 25.83 / Max: 26.02)
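To illustrate the kind of workload an N-Queens test exercises, here is a minimal pure-Python sketch. It is not the Cython bundled benchmark used by this test profile; the function name `count_n_queens` and the board size of 8 are assumptions chosen for illustration.

```python
import time


def count_n_queens(n):
    """Count placements of n mutually non-attacking queens via backtracking."""

    def place(row, cols, diag1, diag2):
        if row == n:
            return 1
        total = 0
        for col in range(n):
            d1, d2 = row - col, row + col
            # Skip squares attacked along a column or either diagonal.
            if col in cols or d1 in diag1 or d2 in diag2:
                continue
            total += place(row + 1, cols | {col}, diag1 | {d1}, diag2 | {d2})
        return total

    return place(0, frozenset(), frozenset(), frozenset())


start = time.perf_counter()
solutions = count_n_queens(8)
elapsed = time.perf_counter() - start
print(f"8 queens: {solutions} solutions in {elapsed:.3f} s")
```

The search is branch-heavy and single-threaded, which is why a test of this shape mostly reflects per-core interpreter or compiled-code speed rather than core count.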

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
1a: 4.64891 (SE +/- 0.04669, N = 8; Min: 4.32 / Avg: 4.65 / Max: 4.71; MIN: 3.94)
2: 4.55996 (SE +/- 0.04208, N = 10; Min: 4.18 / Avg: 4.56 / Max: 4.63; MIN: 3.76)
3: 4.57050 (SE +/- 0.04478, N = 9; Min: 4.21 / Avg: 4.57 / Max: 4.64; MIN: 3.81)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 12 - Total Time (Nodes Per Second, More Is Better)
1a: 11763649 (SE +/- 37547.46, N = 3; Min: 11717851 / Avg: 11763649.33 / Max: 11838088)
2: 11662466 (SE +/- 47603.72, N = 3; Min: 11574596 / Avg: 11662466 / Max: 11738142)
3: 11884731 (SE +/- 160953.33, N = 4; Min: 11565800 / Avg: 11884730.5 / Max: 12266053)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
1a: 18947598 (SE +/- 239170.10, N = 3; Min: 18649100 / Avg: 18947597.67 / Max: 19420546)
2: 18657804 (SE +/- 47854.09, N = 3; Min: 18602790 / Avg: 18657803.67 / Max: 18753135)
3: 18601604 (SE +/- 209621.37, N = 3; Min: 18282664 / Avg: 18601604.33 / Max: 18996725)

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
1a: 6000.83 (SE +/- 19.70, N = 3; Min: 5961.57 / Avg: 6000.83 / Max: 6023.24; MIN: 5915.81)
2: 5902.68 (SE +/- 13.63, N = 3; Min: 5885.61 / Avg: 5902.68 / Max: 5929.63; MIN: 5841.04)
3: 5893.19 (SE +/- 4.88, N = 3; Min: 5884.41 / Avg: 5893.19 / Max: 5901.27; MIN: 5838.51)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
1a: 6004.06 (SE +/- 18.98, N = 3; Min: 5976.33 / Avg: 6004.06 / Max: 6040.37; MIN: 5933.15)
2: 5896.89 (SE +/- 6.73, N = 3; Min: 5886.6 / Avg: 5896.89 / Max: 5909.55; MIN: 5837.81)
3: 5898.04 (SE +/- 12.51, N = 3; Min: 5873.46 / Avg: 5898.04 / Max: 5914.36; MIN: 5828.75)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VkFFT

VkFFT is a Fast Fourier Transform (FFT) Library that is GPU accelerated by means of the Vulkan API. The VkFFT benchmark runs FFT performance differences of many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

VkFFT 1.1.1 (Benchmark Score, More Is Better)
1a: 1273 (SE +/- 1.76, N = 3; Min: 1270 / Avg: 1273.33 / Max: 1276)
2: 1296 (SE +/- 0.67, N = 3; Min: 1295 / Avg: 1295.67 / Max: 1297)
3: 1285 (SE +/- 0.58, N = 3; Min: 1284 / Avg: 1285 / Max: 1286)
A fourth value, 1287, is listed without a run label, SE, or Min/Avg/Max.
1. (CXX) g++ options: -O3 -pthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
1a: 9.98 (SE +/- 0.14, N = 3; Min: 9.7 / Avg: 9.98 / Max: 10.15; MIN: 9.65 / MAX: 19.45)
2: 10.15 (SE +/- 0.02, N = 3; Min: 10.12 / Avg: 10.15 / Max: 10.19; MIN: 9.86 / MAX: 10.88)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

Crafty 25.2 - Elapsed Time (Nodes Per Second, More Is Better)
1a: 7355877 (SE +/- 25940.20, N = 3; Min: 7305305 / Avg: 7355877.33 / Max: 7391189)
2: 7462464 (SE +/- 26469.09, N = 3; Min: 7410315 / Avg: 7462464.33 / Max: 7496424)
3: 7480933 (SE +/- 8686.26, N = 3; Min: 7467738 / Avg: 7480933.33 / Max: 7497317)
1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 - Throughput Test: DistinctUserID (GB/s, More Is Better)
1a: 0.61 (SE +/- 0.00, N = 3; Min: 0.61 / Avg: 0.61 / Max: 0.62)
2: 0.61 (SE +/- 0.00, N = 3; Min: 0.61 / Avg: 0.61 / Max: 0.61)
3: 0.62 (SE +/- 0.00, N = 3; Min: 0.61 / Avg: 0.62 / Max: 0.62)
1. (CXX) g++ options: -O3 -pthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
1a: 5.77 (SE +/- 0.03, N = 3; Min: 5.72 / Avg: 5.77 / Max: 5.81; MIN: 5.57 / MAX: 7.01)
2: 5.86 (SE +/- 0.04, N = 3; Min: 5.82 / Avg: 5.86 / Max: 5.94; MIN: 5.62 / MAX: 7.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VKMark

VKMark is a collection of Vulkan tests/benchmarks. Learn more via the OpenBenchmarking.org test page.

VKMark 2020-05-21 - Resolution: 1920 x 1080 (VKMark Score, More Is Better)
1a: 525 (SE +/- 2.40, N = 3; Min: 522 / Avg: 525.33 / Max: 530)
2: 518 (SE +/- 0.58, N = 3; Min: 517 / Avg: 518 / Max: 519)
3: 517 (SE +/- 0.58, N = 3; Min: 516 / Avg: 517 / Max: 518)
1. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
1a: 18.36 (SE +/- 0.15, N = 3; Min: 18.1 / Avg: 18.36 / Max: 18.61; MIN: 17.58)
2: 18.38 (SE +/- 0.04, N = 3; Min: 18.31 / Avg: 18.38 / Max: 18.43; MIN: 17.99)
3: 18.10 (SE +/- 0.05, N = 3; Min: 18 / Avg: 18.1 / Max: 18.17; MIN: 17.7)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

Quantum ESPRESSO 6.7 - Input: AUSURF112 (Seconds, Fewer Is Better)
1a: 2191.15 (SE +/- 20.55, N = 3; Min: 2154.34 / Avg: 2191.15 / Max: 2225.39)
2: 2164.20 (SE +/- 13.36, N = 3; Min: 2142.8 / Avg: 2164.2 / Max: 2188.76)
3: 2158.80 (SE +/- 6.30, N = 3; Min: 2149.35 / Avg: 2158.8 / Max: 2170.73)
1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Summer Nature 1080p (FPS, More Is Better)
1a: 427.62 (SE +/- 0.99, N = 3; Min: 425.74 / Avg: 427.62 / Max: 429.12; MIN: 366.19 / MAX: 462)
2: 433.95 (SE +/- 0.48, N = 3; Min: 433 / Avg: 433.95 / Max: 434.54; MIN: 372.19 / MAX: 467.89)
3: 433.44 (SE +/- 1.00, N = 3; Min: 431.6 / Avg: 433.44 / Max: 435.06; MIN: 370.03 / MAX: 468.99)
1. (CC) gcc options: -pthread

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
1a: 3243 (SE +/- 6.71, N = 3; Min: 3234 / Avg: 3242.83 / Max: 3256)
2: 3197 (SE +/- 34.43, N = 3; Min: 3129.5 / Avg: 3197.33 / Max: 3241.5)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

BRL-CAD 7.30.8 - VGR Performance Metric (VGR Performance Metric, More Is Better)
1a: 68561
2: 69488
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
1a: 7.99169 (SE +/- 0.12949, N = 12; Min: 6.6 / Avg: 7.99 / Max: 8.26; MIN: 6.38)
2: 7.88645 (SE +/- 0.12374, N = 12; Min: 6.55 / Avg: 7.89 / Max: 8.17; MIN: 6.33)
3: 7.93690 (SE +/- 0.12812, N = 12; Min: 6.55 / Avg: 7.94 / Max: 8.24; MIN: 6.33)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

CloverLeaf - Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
1a: 300.05 (SE +/- 0.03, N = 3; Min: 299.99 / Avg: 300.05 / Max: 300.07)
2: 303.97 (SE +/- 0.10, N = 3; Min: 303.78 / Avg: 303.97 / Max: 304.1)
3: 303.67 (SE +/- 0.08, N = 3; Min: 303.56 / Avg: 303.67 / Max: 303.82)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

QMCPACK 3.10 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
1a: 38.53 (SE +/- 0.55, N = 4; Min: 36.89 / Avg: 38.53 / Max: 39.21)
2: 38.05 (SE +/- 0.63, N = 3; Min: 36.81 / Avg: 38.05 / Max: 38.84)
3: 38.45 (SE +/- 0.55, N = 4; Min: 36.83 / Avg: 38.45 / Max: 39.21)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
1a: 62.03 (SE +/- 0.23, N = 3; Min: 61.6 / Avg: 62.03 / Max: 62.39)
2: 61.25 (SE +/- 0.20, N = 3; Min: 60.97 / Avg: 61.25 / Max: 61.64)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
1a: 81
2: 82
3: 82
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
1a: 5.77 (SE +/- 0.10, N = 3; Min: 5.62 / Avg: 5.77 / Max: 5.96; MIN: 5.35 / MAX: 7.36)
2: 5.84 (SE +/- 0.10, N = 3; Min: 5.74 / Avg: 5.84 / Max: 6.03; MIN: 5.28 / MAX: 7.04)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, More Is Better)
1a: 5218.11 (SE +/- 1.32, N = 3; Min: 5215.47 / Avg: 5218.11 / Max: 5219.48)
2: 5224.18 (SE +/- 3.55, N = 3; Min: 5217.53 / Avg: 5224.18 / Max: 5229.64)
3: 5162.57 (SE +/- 4.86, N = 3; Min: 5155.52 / Avg: 5162.57 / Max: 5171.9)
1. (CC) gcc options: -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
1a: 168
2: 170
3: 169
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resampling Benchmark (tConvolve), with some previous ASKAP benchmarks also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
1a: 690.02 (SE +/- 9.06, N = 3; Min: 680.96 / Avg: 690.02 / Max: 708.13)
2: 682.73 (SE +/- 2.68, N = 3; Min: 679.22 / Avg: 682.73 / Max: 688)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Unpacking Firefox

This simple test profile measures how long it takes to extract the .tar.xz source package of the Mozilla Firefox Web Browser. Learn more via the OpenBenchmarking.org test page.

Unpacking Firefox 84.0 - Extracting: firefox-84.0.source.tar.xz (Seconds, Fewer Is Better)
1a: 21.41 (SE +/- 0.24, N = 7; Min: 20.66 / Avg: 21.41 / Max: 22.31)
2: 21.64 (SE +/- 0.32, N = 4; Min: 20.77 / Avg: 21.63 / Max: 22.15)
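The measurement itself is simple: start a timer, extract the archive, stop the timer. The sketch below illustrates this with Python's standard `tarfile` module. It is not the test profile's own script; the helper `time_extraction` is hypothetical, and a small generated archive stands in for firefox-84.0.source.tar.xz.

```python
import os
import tarfile
import tempfile
import time


def time_extraction(archive_path, dest_dir):
    """Return the seconds taken to extract a .tar.xz archive into dest_dir."""
    start = time.perf_counter()
    with tarfile.open(archive_path, "r:xz") as archive:
        archive.extractall(dest_dir)
    return time.perf_counter() - start


with tempfile.TemporaryDirectory() as workdir:
    # Build a small stand-in archive (assumption: not the Firefox source).
    sample = os.path.join(workdir, "sample.txt")
    with open(sample, "w") as handle:
        handle.write("hello\n" * 1000)
    archive_path = os.path.join(workdir, "sample.tar.xz")
    with tarfile.open(archive_path, "w:xz") as archive:
        archive.add(sample, arcname="sample.txt")

    out_dir = os.path.join(workdir, "out")
    os.mkdir(out_dir)
    elapsed = time_extraction(archive_path, out_dir)
    extracted_ok = os.path.exists(os.path.join(out_dir, "sample.txt"))
    print(f"extracted in {elapsed:.4f} s, file present: {extracted_ok}")
```

On a real source tarball the result depends on both single-threaded xz decompression and filesystem write speed, which may explain why the run-to-run spread here is larger than in the pure CPU tests.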

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 - Speed: 10 (Frames Per Second, More Is Better)
1a: 3.158 (SE +/- 0.004, N = 3; Min: 3.15 / Avg: 3.16 / Max: 3.16)
2: 3.187 (SE +/- 0.027, N = 3; Min: 3.16 / Avg: 3.19 / Max: 3.24)
3: 3.154 (SE +/- 0.002, N = 3; Min: 3.15 / Avg: 3.15 / Max: 3.16)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
1a: 5.76 (SE +/- 0.11, N = 3; Min: 5.63 / Avg: 5.76 / Max: 5.98; MIN: 5.31 / MAX: 7.49)
2: 5.82 (SE +/- 0.12, N = 3; Min: 5.66 / Avg: 5.82 / Max: 6.05; MIN: 5.3 / MAX: 7.26)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
1a: 372.04 (SE +/- 2.01, N = 3; Min: 368.8 / Avg: 372.04 / Max: 375.72; MIN: 368.17 / MAX: 376.4)
2: 368.30 (SE +/- 0.47, N = 3; Min: 367.39 / Avg: 368.3 / Max: 368.94; MIN: 366.89 / MAX: 369.96)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Crush 0 - Process: Compression (MB/s, More Is Better)
1a: 101 (SE +/- 0.33, N = 3; Min: 100 / Avg: 100.67 / Max: 101)
2: 102
3: 101 (SE +/- 1.00, N = 3; Min: 100 / Avg: 101 / Max: 103)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.1 - Model: SqueezeNetV1.0 (ms, Fewer Is Better)
1a: 7.977 (SE +/- 0.010, N = 3; Min: 7.96 / Avg: 7.98 / Max: 8; MIN: 7.4 / MAX: 21.28)
2: 7.901 (SE +/- 0.038, N = 3; Min: 7.86 / Avg: 7.9 / Max: 7.98; MIN: 7.36 / MAX: 20.81)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
1a: 6.9571 (SE +/- 0.0934, N = 4; Min: 6.84 / Avg: 6.96 / Max: 7.24; MIN: 6.68 / MAX: 8.51)
2: 6.9272 (SE +/- 0.1026, N = 4; Min: 6.78 / Avg: 6.93 / Max: 7.23; MIN: 6.63 / MAX: 8.55)
3: 6.8925 (SE +/- 0.0890, N = 5; Min: 6.78 / Avg: 6.89 / Max: 7.25; MIN: 6.62 / MAX: 8.55)

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 - Speed: 5 (Frames Per Second, More Is Better)
1a: 1.096 (SE +/- 0.004, N = 3; Min: 1.09 / Avg: 1.1 / Max: 1.1)
2: 1.102 (SE +/- 0.003, N = 3; Min: 1.1 / Avg: 1.1 / Max: 1.11)
3: 1.092 (SE +/- 0.001, N = 3; Min: 1.09 / Avg: 1.09 / Max: 1.09)

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode 20210126 - Encode Settings: Default (Seconds, Fewer Is Better)
1a: 5.516 (SE +/- 0.029, N = 3; Min: 5.46 / Avg: 5.52 / Max: 5.56)
2: 5.491 (SE +/- 0.020, N = 3; Min: 5.46 / Avg: 5.49 / Max: 5.53)
3: 5.466 (SE +/- 0.030, N = 3; Min: 5.41 / Avg: 5.47 / Max: 5.52)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resampling Benchmark (tConvolve), with some previous ASKAP benchmarks also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
1a: 1155.97 (SE +/- 5.07, N = 3; Min: 1145.83 / Avg: 1155.97 / Max: 1161.04)
2: 1146.11 (SE +/- 12.47, N = 3; Min: 1121.35 / Avg: 1146.11 / Max: 1161.04)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 2 (Seconds, Fewer Is Better)
1a: 48.59 (SE +/- 0.69, N = 4; Min: 46.59 / Avg: 48.59 / Max: 49.7)
2: 48.19 (SE +/- 0.77, N = 3; Min: 46.64 / Avg: 48.19 / Max: 48.97)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
1a: 2905.81 (SE +/- 1.91, N = 3; Min: 2903.78 / Avg: 2905.81 / Max: 2909.62)
2: 2881.77 (SE +/- 0.89, N = 3; Min: 2880.77 / Avg: 2881.77 / Max: 2883.55)
3: 2901.19 (SE +/- 2.44, N = 3; Min: 2896.34 / Avg: 2901.19 / Max: 2904.08)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Summer Nature 4K (FPS, More Is Better)
1a: 101.12 (SE +/- 0.28, N = 3; Min: 100.84 / Avg: 101.12 / Max: 101.67; MIN: 91.52 / MAX: 106.08)
2: 100.75 (SE +/- 0.37, N = 3; Min: 100.03 / Avg: 100.75 / Max: 101.23; MIN: 83.92 / MAX: 106.22)
3: 101.58 (SE +/- 0.36, N = 3; Min: 100.86 / Avg: 101.58 / Max: 101.96; MIN: 87.94 / MAX: 106.99)
1. (CC) gcc options: -pthread

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code that offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13 - Time To Compile (Seconds, Fewer Is Better)
1a: 231.16 | SE +/- 0.90, N = 3 | Min: 229.64 / Avg: 231.16 / Max: 232.75
2: 230.34 | SE +/- 1.08, N = 3 | Min: 228.18 / Avg: 230.34 / Max: 231.58
3: 232.22 | SE +/- 1.00, N = 3 | Min: 230.22 / Avg: 232.22 / Max: 233.27

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
1a: 2.45 | SE +/- 0.06, N = 3 | Min: 2.32 / Avg: 2.45 / Max: 2.51 | MIN: 2.29 / MAX: 2.85
2: 2.47 | SE +/- 0.05, N = 3 | Min: 2.37 / Avg: 2.47 / Max: 2.52 | MIN: 2.31 / MAX: 2.77
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
1a: 44.53 | SE +/- 0.01, N = 3 | Min: 44.51 / Avg: 44.53 / Max: 44.54
2: 44.84 | SE +/- 0.01, N = 3 | Min: 44.83 / Avg: 44.84 / Max: 44.85
3: 44.88 | SE +/- 0.00, N = 3 | Min: 44.87 / Avg: 44.88 / Max: 44.88
1. (CC) gcc options: -O3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 - Speed: 1 (Frames Per Second, More Is Better)
1a: 0.386 | SE +/- 0.001, N = 3 | Min: 0.39 / Avg: 0.39 / Max: 0.39
2: 0.385 | SE +/- 0.001, N = 3 | Min: 0.38 / Avg: 0.39 / Max: 0.39
3: 0.383 | SE +/- 0.000, N = 3 | Min: 0.38 / Avg: 0.38 / Max: 0.38

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
1a: 7.6874 | SE +/- 0.1307, N = 3 | Min: 7.45 / Avg: 7.69 / Max: 7.9 | MIN: 7.27 / MAX: 9.52
2: 7.7466 | SE +/- 0.0480, N = 3 | Min: 7.69 / Avg: 7.75 / Max: 7.84 | MIN: 7.5 / MAX: 9.39
3: 7.7408 | SE +/- 0.0611, N = 3 | Min: 7.67 / Avg: 7.74 / Max: 7.86 | MIN: 7.48 / MAX: 9.45

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
1a: 43.44 | SE +/- 0.01, N = 3 | Min: 43.43 / Avg: 43.44 / Max: 43.45
2: 43.74 | SE +/- 0.01, N = 3 | Min: 43.73 / Avg: 43.74 / Max: 43.75
3: 43.76 | SE +/- 0.01, N = 3 | Min: 43.75 / Avg: 43.76 / Max: 43.77
1. (CC) gcc options: -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
1a: 416
2: 419
3: 417
Only one spread is reported, without a run label: SE +/- 0.33, N = 3 | Min: 416 / Avg: 416.67 / Max: 417
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: ETC2 (Mpx/s, More Is Better)
1a: 165.24 | SE +/- 0.01, N = 3 | Min: 165.21 / Avg: 165.24 / Max: 165.26
2: 164.08 | SE +/- 0.90, N = 3 | Min: 162.31 / Avg: 164.08 / Max: 165.23
3: 165.26 | SE +/- 0.01, N = 3 | Min: 165.24 / Avg: 165.26 / Max: 165.27
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

VkResample 1.0 - Upscale: 2x - Precision: Single (ms, Fewer Is Better)
1: 501.91 | SE +/- 0.26, N = 3 | Min: 501.44 / Avg: 501.91 / Max: 502.35
1a: 498.53 | SE +/- 0.33, N = 3 | Min: 497.9 / Avg: 498.53 / Max: 499.03
2: 500.91 | SE +/- 0.18, N = 3 | Min: 500.61 / Avg: 500.91 / Max: 501.24
3: 500.69 | SE +/- 0.25, N = 3 | Min: 500.3 / Avg: 500.69 / Max: 501.16
1. (CXX) g++ options: -O3 -pthread
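
The four runs of this result sit very close together. One way to put a number on that is the gap between the best and worst run average, expressed as a percentage of the best. The sketch below is illustrative only: `percent_spread` is a hypothetical helper, and it is not claimed to match how OpenBenchmarking.org defines "little change/spread".

```python
def percent_spread(results):
    """Gap between the best and worst run, as a percentage of the best.

    `results` maps a run name to its average. Because the gap is taken
    relative to the smallest value, this suits "fewer is better" metrics.
    """
    best = min(results.values())
    worst = max(results.values())
    return (worst - best) / best * 100.0

# VkResample 2x / Single averages (ms) from the result above.
vkresample_single = {"1": 501.91, "1a": 498.53, "2": 500.91, "3": 500.69}
spread = percent_spread(vkresample_single)
print(f"spread across runs: {spread:.2f}%")
```

For a "more is better" metric, the same helper could be reused by dividing by the largest value instead. That variation is a design choice, not something the source specifies.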

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
1a: 8.9844 | SE +/- 0.0549, N = 3 | Min: 8.91 / Avg: 8.98 / Max: 9.09 | MIN: 8.77 / MAX: 9.8
2: 8.9358 | SE +/- 0.0413, N = 3 | Min: 8.88 / Avg: 8.94 / Max: 9.02 | MIN: 8.75 / MAX: 9.83
3: 8.9952 | SE +/- 0.0858, N = 3 | Min: 8.88 / Avg: 9 / Max: 9.16 | MIN: 8.75 / MAX: 9.95

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
1a: 104.64 | SE +/- 0.26, N = 3 | Min: 104.17 / Avg: 104.64 / Max: 105.04
2: 103.95 | SE +/- 0.06, N = 3 | Min: 103.84 / Avg: 103.95 / Max: 104.06
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
1a: 6003.6 | SE +/- 5.95, N = 3 | Min: 5994.7 / Avg: 6003.6 / Max: 6014.9
2: 6043.1 | SE +/- 0.40, N = 3 | Min: 6042.6 / Avg: 6043.1 / Max: 6043.9
3: 6034.7 | SE +/- 2.77, N = 3 | Min: 6031.4 / Avg: 6034.7 / Max: 6040.2
1. (CC) gcc options: -O3

Gcrypt Library

Libgcrypt is a general-purpose cryptographic library developed as part of the GnuPG project. This test runs libgcrypt's integrated benchmark and measures the time to run the benchmark command with the cipher/mac/hash repetition count set to 50, as a simple, high-level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

Gcrypt Library 1.9 (Seconds, Fewer Is Better)
1a: 229.82 | SE +/- 0.49, N = 3 | Min: 228.92 / Avg: 229.82 / Max: 230.58
2: 229.31 | SE +/- 0.19, N = 3 | Min: 228.92 / Avg: 229.31 / Max: 229.52
3: 230.80 | SE +/- 0.52, N = 3 | Min: 230 / Avg: 230.8 / Max: 231.77
1. (CC) gcc options: -O2 -fvisibility=hidden

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.1 - Model: resnet-v2-50 (ms, Fewer Is Better)
1a: 42.47 | SE +/- 0.06, N = 3 | Min: 42.36 / Avg: 42.47 / Max: 42.58 | MIN: 41.07 / MAX: 45.74
2: 42.21 | SE +/- 0.03, N = 3 | Min: 42.15 / Avg: 42.21 / Max: 42.24 | MIN: 40.94 / MAX: 44.99
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - AES-XTS 512b Encryption (MiB/s, More Is Better)
1a: 2784.8 | SE +/- 4.28, N = 3 | Min: 2778.9 / Avg: 2784.77 / Max: 2793.1
2: 2767.8 | SE +/- 12.45, N = 3 | Min: 2748.9 / Avg: 2767.83 / Max: 2791.3

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
1a: 2261333.33 | SE +/- 6947.45, N = 3 | Min: 2251752.25 / Avg: 2261333.33 / Max: 2274839
2: 2247597.67 | SE +/- 3348.75, N = 3 | Min: 2241707.25 / Avg: 2247597.67 / Max: 2253303.25
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine; it is built here using the SCons build system, targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 3.2.3 - Time To Compile (Seconds, Fewer Is Better)
1a: 228.93 | SE +/- 1.32, N = 3 | Min: 226.82 / Avg: 228.93 / Max: 231.35
2: 228.76 | SE +/- 0.12, N = 3 | Min: 228.52 / Avg: 228.76 / Max: 228.9
3: 227.56 | SE +/- 0.14, N = 3 | Min: 227.34 / Avg: 227.56 / Max: 227.82

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
1a: 10.00 | SE +/- 0.13, N = 3 | Min: 9.74 / Avg: 10 / Max: 10.15 | MIN: 9.67 / MAX: 11.41
2: 10.06 | SE +/- 0.14, N = 3 | Min: 9.79 / Avg: 10.06 / Max: 10.23 | MIN: 9.74 / MAX: 19.59
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - AES-XTS 256b Decryption (MiB/s, More Is Better)
1a: 3394.0 | SE +/- 8.42, N = 3 | Min: 3381.4 / Avg: 3394.03 / Max: 3410
2: 3373.8 | SE +/- 18.72, N = 3 | Min: 3343.5 / Avg: 3373.83 / Max: 3408

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.1 - Model: MobileNetV2_224 (ms, Fewer Is Better)
1a: 4.616 | SE +/- 0.014, N = 3 | Min: 4.59 / Avg: 4.62 / Max: 4.64 | MIN: 4 / MAX: 5.38
2: 4.589 | SE +/- 0.008, N = 3 | Min: 4.58 / Avg: 4.59 / Max: 4.61 | MIN: 4 / MAX: 5.39
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1 - Model: inception-v3 (ms, Fewer Is Better)
1a: 48.21 | SE +/- 1.11, N = 3 | Min: 46.02 / Avg: 48.21 / Max: 49.66 | MIN: 40.03 / MAX: 87.29
2: 47.93 | SE +/- 1.29, N = 3 | Min: 45.35 / Avg: 47.93 / Max: 49.29 | MIN: 39.27 / MAX: 62.11
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
1a: 12.59 | SE +/- 0.02, N = 3 | Min: 12.55 / Avg: 12.58 / Max: 12.61
2: 12.61 | SE +/- 0.10, N = 3 | Min: 12.46 / Avg: 12.61 / Max: 12.78
3: 12.54 | SE +/- 0.03, N = 3 | Min: 12.5 / Avg: 12.54 / Max: 12.59
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
1a: 24.28 | SE +/- 0.02, N = 3 | Min: 24.24 / Avg: 24.28 / Max: 24.3 | MIN: 23.77 / MAX: 32.56
2: 24.15 | SE +/- 0.07, N = 3 | Min: 24.02 / Avg: 24.15 / Max: 24.22 | MIN: 23.68 / MAX: 25.72
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - AES-XTS 512b Decryption (MiB/s, More Is Better)
1a: 2783.6 | SE +/- 4.37, N = 3 | Min: 2777.3 / Avg: 2783.6 / Max: 2792
2: 2769.2 | SE +/- 10.39, N = 3 | Min: 2752.9 / Avg: 2769.2 / Max: 2788.5

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
1a: 1743 | SE +/- 11.50, N = 3 | Min: 1720 / Avg: 1743 / Max: 1755
2: 1748
3: 1739 | SE +/- 8.19, N = 3 | Min: 1723 / Avg: 1739 / Max: 1750
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
1a: 7.9911 | SE +/- 0.0156, N = 3 | Min: 7.96 / Avg: 7.99 / Max: 8.01 | MIN: 7.78 / MAX: 9.04
2: 7.9786 | SE +/- 0.0266, N = 3 | Min: 7.94 / Avg: 7.98 / Max: 8.03 | MIN: 7.81 / MAX: 8.99
3: 8.0194 | SE +/- 0.0068, N = 3 | Min: 8.01 / Avg: 8.02 / Max: 8.03 | MIN: 7.79 / MAX: 9.01

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
1a: 1284.26 | SE +/- 8.33, N = 3 | Min: 1267.61 / Avg: 1284.26 / Max: 1292.59
2: 1278.11 | SE +/- 11.46, N = 3 | Min: 1255.48 / Avg: 1278.11 / Max: 1292.59
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: DXT1 (Mpx/s, More Is Better)
1a: 1208.89 | SE +/- 1.99, N = 3 | Min: 1206.27 / Avg: 1208.89 / Max: 1212.8
2: 1203.15 | SE +/- 0.69, N = 3 | Min: 1202.28 / Avg: 1203.15 / Max: 1204.52
3: 1208.66 | SE +/- 1.26, N = 3 | Min: 1206.15 / Avg: 1208.66 / Max: 1210.17
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
1a: 213
2: 214
3: 214
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
1a: 22.36 | SE +/- 0.23, N = 3 | Min: 21.9 / Avg: 22.36 / Max: 22.59 | MIN: 21.06 / MAX: 23.65
2: 22.26 | SE +/- 0.36, N = 3 | Min: 21.54 / Avg: 22.26 / Max: 22.63 | MIN: 21.15 / MAX: 23.73
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
1a: 452 | SE +/- 0.58, N = 3 | Min: 451 / Avg: 452 / Max: 453
2: 451 | SE +/- 0.67, N = 3 | Min: 450 / Avg: 450.67 / Max: 452
3: 450 | SE +/- 1.53, N = 3 | Min: 447 / Avg: 450 / Max: 452
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation, and full multi-threading. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode 20210126 - Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better)
1a: 392.31 | SE +/- 1.12, N = 3 | Min: 390.91 / Avg: 392.31 / Max: 394.52
2: 392.41 | SE +/- 1.91, N = 3 | Min: 389.16 / Avg: 392.41 / Max: 395.76
3: 394.01 | SE +/- 1.58, N = 3 | Min: 392.03 / Avg: 394.01 / Max: 397.13
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.1 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
1a: 4.096 | SE +/- 0.008, N = 3 | Min: 4.08 / Avg: 4.1 / Max: 4.11 | MIN: 3.86 / MAX: 17.27
2: 4.079 | SE +/- 0.012, N = 3 | Min: 4.06 / Avg: 4.08 / Max: 4.1 | MIN: 3.87 / MAX: 16.37
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
1a: 7.9852 | SE +/- 0.0235, N = 3 | Min: 7.95 / Avg: 7.99 / Max: 8.03 | MIN: 7.77 / MAX: 8.68
2: 7.9535 | SE +/- 0.0122, N = 3 | Min: 7.93 / Avg: 7.95 / Max: 7.97 | MIN: 7.72 / MAX: 8.73
3: 7.9747 | SE +/- 0.0138, N = 3 | Min: 7.95 / Avg: 7.97 / Max: 8 | MIN: 7.77 / MAX: 8.72

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
1a: 280.77 | SE +/- 0.11, N = 3 | Min: 280.56 / Avg: 280.77 / Max: 280.88
2: 279.66 | SE +/- 0.99, N = 3 | Min: 277.68 / Avg: 279.66 / Max: 280.74
3: 280.58 | SE +/- 0.04, N = 3 | Min: 280.52 / Avg: 280.58 / Max: 280.64
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Node.js V8 Web Tooling Benchmark

This test runs the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript workloads common to web developers, such as Babel, TypeScript and Babylon. This test profile measures the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
1a: 10.39 | SE +/- 0.08, N = 3 | Min: 10.29 / Avg: 10.39 / Max: 10.55
2: 10.35 | SE +/- 0.11, N = 3 | Min: 10.17 / Avg: 10.35 / Max: 10.55
1. Nodejs v12.18.2

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3 - Water Benchmark (Ns Per Day, More Is Better)
1a: 0.519 | SE +/- 0.002, N = 3 | Min: 0.52 / Avg: 0.52 / Max: 0.52
2: 0.521 | SE +/- 0.001, N = 3 | Min: 0.52 / Avg: 0.52 / Max: 0.52
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
1a: 7.81 | SE +/- 0.02, N = 3 | Min: 7.77 / Avg: 7.81 / Max: 7.85 | MIN: 7.6 / MAX: 9.81
2: 7.84 | SE +/- 0.02, N = 3 | Min: 7.8 / Avg: 7.84 / Max: 7.88 | MIN: 7.59 / MAX: 9.75
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
1a: 24.25 | SE +/- 0.01, N = 3 | Min: 24.23 / Avg: 24.25 / Max: 24.27 | MIN: 23.71 / MAX: 26.65
2: 24.16 | SE +/- 0.09, N = 3 | Min: 23.98 / Avg: 24.16 / Max: 24.28 | MIN: 23.71 / MAX: 25.08
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Coremark

This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
1a: 243609.12 | SE +/- 1679.06, N = 14 | Min: 240015 / Avg: 243609.12 / Max: 265164.07
2: 244321.48 | SE +/- 2311.32, N = 10 | Min: 240168.12 / Avg: 244321.48 / Max: 264834.89
3: 244509.64 | SE +/- 2406.51, N = 9 | Min: 239189.74 / Avg: 244509.64 / Max: 263049.73
1. (CC) gcc options: -O2 -lrt" -lrt
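
The CoreMark averages differ by under a thousand iterations per second, while each SE is in the thousands. One rough way to judge whether such a gap means anything is to divide it by the combined standard error of the two runs. The sketch below assumes the runs are independent, so that their standard errors add in quadrature; the helper name `difference_in_se_units` is hypothetical and this is not a formal significance test.

```python
import math

def difference_in_se_units(a, se_a, b, se_b):
    """Gap between two run averages, in units of their combined standard error.

    Assumption: the runs are independent, so the standard errors combine
    as sqrt(se_a**2 + se_b**2).
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(a - b) / combined

# CoreMark averages and SE values for runs 1a and 3 from the result above.
ratio = difference_in_se_units(243609.12, 1679.06, 244509.64, 2406.51)
print(f"difference = {ratio:.2f} combined standard errors")
```

A ratio well below 1, as here, suggests the two runs are indistinguishable within run-to-run noise.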

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

VkResample 1.0 - Upscale: 2x - Precision: Double (ms, Fewer Is Better)
1a: 1004.39 | SE +/- 3.58, N = 3 | Min: 1000.32 / Avg: 1004.39 / Max: 1011.53
2: 1005.93 | SE +/- 3.72, N = 3 | Min: 1001.07 / Avg: 1005.93 / Max: 1013.24
3: 1007.99 | SE +/- 4.35, N = 3 | Min: 1000.36 / Avg: 1007.99 / Max: 1015.43
1. (CXX) g++ options: -O3 -pthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
1a: 562
2: 564
3: 562
Two spreads are reported, without run labels:
SE +/- 0.88, N = 3 | Min: 560 / Avg: 561.67 / Max: 563
SE +/- 0.33, N = 3 | Min: 563 / Avg: 563.67 / Max: 564
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.5 - Compression Level: 3 (MB/s, More Is Better)
1a: 1611.2 | SE +/- 1.09, N = 3 | Min: 1609.1 / Avg: 1611.17 / Max: 1612.8
2: 1605.6 | SE +/- 3.28, N = 3 | Min: 1599.3 / Avg: 1605.57 / Max: 1610.4
3: 1606.6 | SE +/- 5.21, N = 3 | Min: 1596.3 / Avg: 1606.63 / Max: 1612.9
1. (CC) gcc options: -O3 -pthread -lz -llzma

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

Monkey Audio Encoding 3.99.6 - WAV To APE (Seconds, Fewer Is Better)
1a: 12.77 | SE +/- 0.04, N = 5 | Min: 12.66 / Avg: 12.77 / Max: 12.92
2: 12.81 | SE +/- 0.04, N = 5 | Min: 12.68 / Avg: 12.81 / Max: 12.93
3: 12.78 | SE +/- 0.07, N = 5 | Min: 12.64 / Avg: 12.78 / Max: 12.97
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

NAS Parallel Benchmarks

NPB (NAS Parallel Benchmarks) is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and lets you select among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
1a: 5680.84 | SE +/- 4.97, N = 3 | Min: 5671.94 / Avg: 5680.84 / Max: 5689.14
2: 5669.09 | SE +/- 1.64, N = 3 | Min: 5665.91 / Avg: 5669.09 / Max: 5671.34
3: 5661.40 | SE +/- 9.93, N = 3 | Min: 5641.69 / Avg: 5661.4 / Max: 5673.34
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
1a: 5.002 | SE +/- 0.016, N = 3 | Min: 4.97 / Avg: 5 / Max: 5.03
2: 4.994 | SE +/- 0.013, N = 3 | Min: 4.97 / Avg: 4.99 / Max: 5.01
3: 5.011 | SE +/- 0.004, N = 3 | Min: 5 / Avg: 5.01 / Max: 5.02
1. (CXX) g++ options: -O3 -pthread -lm

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - AES-XTS 256b Encryption (MiB/s, More Is Better)
1a: 3387.6 | SE +/- 1.88, N = 3 | Min: 3384.3 / Avg: 3387.6 / Max: 3390.8
2: 3377.0 | SE +/- 20.76, N = 3 | Min: 3339.9 / Avg: 3377 / Max: 3411.7

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.21 (MFLOPS, More Is Better)
1a: 2180.5 (SE +/- 18.57, N = 3; Min: 2143.9 / Max: 2204.1)
2: 2185.0 (SE +/- 14.18, N = 3; Min: 2157.3 / Max: 2204.2)
3: 2187.1 (SE +/- 17.07, N = 3; Min: 2153.1 / Max: 2206.4)
1. (CXX) g++ options: -O3 -march=native -rdynamic

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Serpent-XTS 512b Encryption (MiB/s, More Is Better)
1a: 737.6 (no SE reported)
2: 735.5 (SE +/- 0.83, N = 3; Min: 734.3 / Max: 737.1)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
1a: 21.11 (SE +/- 0.02, N = 3; Min: 21.08 / Max: 21.14; reported MIN: 20.83 / MAX: 21.88)
2: 21.05 (SE +/- 0.06, N = 3; Min: 20.93 / Max: 21.12; reported MIN: 20.87 / MAX: 21.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
1a: 7.3402 (SE +/- 0.0159, N = 3; Min: 7.32 / Max: 7.37; reported MIN: 7.12 / MAX: 8.12)
2: 7.3418 (SE +/- 0.0153, N = 3; Min: 7.32 / Max: 7.37; reported MIN: 7.13 / MAX: 8.13)
3: 7.3611 (SE +/- 0.0029, N = 3; Min: 7.36 / Max: 7.37; reported MIN: 7.12 / MAX: 8.18)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
1a: 114.03 (SE +/- 0.12, N = 3; Min: 113.88 / Max: 114.27; reported MIN: 113.58 / MAX: 123.46)
2: 113.72 (SE +/- 0.06, N = 3; Min: 113.6 / Max: 113.82; reported MIN: 113.39 / MAX: 122.08)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Kripke 1.2.4 (Throughput FoM, More Is Better)
1a: 17521143 (SE +/- 43251.00, N = 3; Min: 17435090 / Max: 17571790)
2: 17477813 (SE +/- 22002.07, N = 3; Min: 17442890 / Max: 17518460)
1. (CXX) g++ options: -O3 -fopenmp
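Whether two runs actually differ can be judged roughly by comparing the gap between their averages with their standard errors. The sketch below is only a rule-of-thumb check, not something the Phoronix Test Suite reports: it assumes the runs are independent and combines the two SE values in quadrature. The function name is hypothetical; the inputs are the Kripke figures for runs 1a and 2.

```python
import math


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| expressed in combined standard errors."""
    # Assumption: independent runs, so the SEs add in quadrature.
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# Kripke 1.2.4: run 1a vs run 2
ratio = difference_in_standard_errors(17521143, 43251.00, 17477813, 22002.07)
print(f"difference = {ratio:.2f} combined standard errors")
```

A ratio below about 2 is commonly read as "within run-to-run noise"; with only three samples per run, that reading is indicative at best.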

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Serpent-XTS 256b Decryption (MiB/s, More Is Better)
1a: 723.9 (SE +/- 0.38, N = 3; Min: 723.2 / Max: 724.5)
2: 722.2 (SE +/- 0.94, N = 3; Min: 720.9 / Max: 724)

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Thorough (Seconds, Fewer Is Better)
1a: 46.78 (SE +/- 0.50, N = 3; Min: 45.79 / Max: 47.32)
2: 46.89 (SE +/- 0.49, N = 3; Min: 45.92 / Max: 47.38)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Scholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP (ms, Fewer Is Better)
1a: 71041.97 (SE +/- 217.85, N = 3; Min: 70607.33 / Max: 71285.56)
2: 71207.27 (SE +/- 331.16, N = 3; Min: 70564.23 / Max: 71666.2)
1. (CXX) g++ options: -O3 -march=native -fopenmp
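For readers unfamiliar with the Black-Scholes-Merton analytic European option case named in the FinanceBench description, the following is a minimal sketch of the textbook closed-form call price. It is not FinanceBench's code (the result above is for the Bonds case, and FinanceBench itself is C++ with OpenMP/OpenCL); the function names and sample parameters are illustrative assumptions.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function via erf."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(spot, strike, rate, volatility, years):
    """Closed-form price of a European call on a non-dividend-paying asset."""
    vol_sqrt_t = volatility * math.sqrt(years)
    d1 = (math.log(spot / strike) + (rate + 0.5 * volatility ** 2) * years) / vol_sqrt_t
    d2 = d1 - vol_sqrt_t
    return spot * norm_cdf(d1) - strike * math.exp(-rate * years) * norm_cdf(d2)


# Illustrative inputs: at-the-money call, 5% rate, 20% volatility, 1 year.
price = black_scholes_call(100.0, 100.0, 0.05, 0.2, 1.0)
print(f"call price = {price:.4f}")
```

A benchmark built on this kind of formula evaluates it for a large batch of options, which is why it parallelizes well across cores.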

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
1a: 43.81 (SE +/- 0.07, N = 3; Min: 43.66 / Max: 43.89; reported MIN: 41.64 / MAX: 53.86)
2: 43.71 (SE +/- 0.05, N = 3; Min: 43.62 / Max: 43.77; reported MIN: 41.74 / MAX: 46.8)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: ETC1S (Seconds, Fewer Is Better)
1a: 59.95 (SE +/- 0.33, N = 3; Min: 59.28 / Max: 60.3)
2: 60.08 (SE +/- 0.31, N = 3; Min: 59.45 / Max: 60.42)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 - Speed: 6 (Frames Per Second, More Is Better)
1a: 1.445 (SE +/- 0.002, N = 3; Min: 1.44 / Max: 1.45)
2: 1.442 (SE +/- 0.005, N = 3; Min: 1.43 / Max: 1.45)
3: 1.443 (SE +/- 0.001, N = 3; Min: 1.44 / Max: 1.45)

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: ETC1 (Mpx/s, More Is Better)
1a: 299.46 (SE +/- 0.06, N = 3; Min: 299.35 / Max: 299.56)
2: 298.89 (SE +/- 0.37, N = 3; Min: 298.4 / Max: 299.61)
3: 299.22 (SE +/- 0.33, N = 3; Min: 298.57 / Max: 299.59)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
1a: 1607 (SE +/- 3.61, N = 3; Min: 1602 / Max: 1614)
2: 1607 (SE +/- 2.89, N = 3; Min: 1602 / Max: 1612)
3: 1610 (SE +/- 1.76, N = 3; Min: 1607 / Max: 1613)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
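To illustrate what an in-memory decompression throughput figure measures, here is a small sketch that times decompression of a buffer held entirely in RAM. It uses Python's standard `zlib` module as a stand-in, since Zstd is not assumed to be available; it is not how lzbench is implemented, and the function name, payload and repeat count are illustrative assumptions.

```python
import time
import zlib


def decompression_throughput_mb_s(payload, level=1, repeats=5):
    """Return the best-of-N decompression speed in MB/s of original data."""
    compressed = zlib.compress(payload, level)
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        restored = zlib.decompress(compressed)
        best = min(best, time.perf_counter() - start)
    # Verify the round trip before trusting the timing.
    if restored != payload:
        raise ValueError("round trip failed")
    # Guard against a zero reading from a coarse timer.
    return len(payload) / max(best, 1e-9) / 1e6


# Synthetic, highly repetitive input; real source trees compress differently.
payload = b"int main(void) { return 0; }\n" * 200_000
speed = decompression_throughput_mb_s(payload)
print(f"{speed:.0f} MB/s")
```

Because nothing touches the disk, a figure like this reflects CPU and memory performance rather than storage speed.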

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
1a: 27.56 (SE +/- 0.02, N = 3; Min: 27.53 / Max: 27.58; reported MIN: 27.18 / MAX: 28.23)
2: 27.51 (SE +/- 0.02, N = 3; Min: 27.47 / Max: 27.54; reported MIN: 27.14 / MAX: 29.4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Serpent-XTS 512b Decryption (MiB/s, More Is Better)
1a: 723.8 (SE +/- 0.50, N = 3; Min: 722.8 / Max: 724.4)
2: 722.5 (SE +/- 1.30, N = 2; Min: 721.2 / Max: 723.8)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
1a: 22.29 (SE +/- 0.27, N = 3; Min: 21.76 / Max: 22.6; reported MIN: 21.06 / MAX: 23.6)
2: 22.25 (SE +/- 0.35, N = 3; Min: 21.55 / Max: 22.61; reported MIN: 21.23 / MAX: 23.53)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Serpent-XTS 256b Encryption (MiB/s, More Is Better)
1a: 736.3 (SE +/- 0.66, N = 3; Min: 735.2 / Max: 737.5)
2: 735.0 (SE +/- 0.32, N = 3; Min: 734.4 / Max: 735.5)

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
1a: 5990.0 (SE +/- 3.49, N = 3; Min: 5984.9 / Max: 5996.7)
2: 5983.2 (SE +/- 0.86, N = 3; Min: 5981.5 / Max: 5984.4)
3: 5980.1 (SE +/- 1.13, N = 3; Min: 5978.2 / Max: 5982.1)
1. (CC) gcc options: -O3

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

Opus Codec Encoding 1.3.1 - WAV To Opus Encode (Seconds, Fewer Is Better)
1a: 9.606 (SE +/- 0.014, N = 5; Min: 9.59 / Max: 9.66)
2: 9.602 (SE +/- 0.014, N = 5; Min: 9.57 / Max: 9.65)
3: 9.617 (SE +/- 0.013, N = 5; Min: 9.6 / Max: 9.67)
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
1a: 650
2: 650
3: 651
SE +/- 0.58, N = 3 (Min: 649 / Avg: 650 / Max: 651) is reported for only one run averaging 650.
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Twofish-XTS 256b Decryption (MiB/s, More Is Better)
1a: 404.0 (SE +/- 0.15, N = 3; Min: 403.7 / Max: 404.2)
2: 403.4 (SE +/- 0.34, N = 3; Min: 403 / Max: 404.1)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
1a: 40.92 (SE +/- 0.02, N = 3; Min: 40.89 / Max: 40.94; reported MIN: 40.49 / MAX: 49.83)
2: 40.98 (SE +/- 0.05, N = 3; Min: 40.9 / Max: 41.06; reported MIN: 40.54 / MAX: 50.16)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resampling Benchmark (tConvolve); some previous ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
1a: 1161.00 (SE +/- 1.69, N = 3; Min: 1157.63 / Max: 1162.69)
2: 1159.32 (SE +/- 1.69, N = 3; Min: 1157.63 / Max: 1162.69)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
1a: 15451.70 (SE +/- 6.60, N = 3; Min: 15439.37 / Max: 15461.94)
2: 15459.45 (SE +/- 11.20, N = 3; Min: 15448 / Max: 15481.86)
3: 15474.02 (SE +/- 16.88, N = 3; Min: 15451.1 / Max: 15506.95)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Fast (Seconds, Fewer Is Better)
1a: 7.13 (SE +/- 0.03, N = 3; Min: 7.09 / Max: 7.18)
2: 7.12 (SE +/- 0.00, N = 3; Min: 7.12 / Max: 7.13)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode 20210126 - Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
1a: 724.52 (SE +/- 0.77, N = 3; Min: 723.61 / Max: 726.04)
2: 723.52 (SE +/- 1.84, N = 3; Min: 720.08 / Max: 726.36)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1 - PHP Benchmark Suite (Score, More Is Better)
1a: 655165 (SE +/- 1381.24, N = 3; Min: 652411 / Max: 656726)
2: 654272 (SE +/- 1621.04, N = 3; Min: 651371 / Max: 656976)

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. Learn more via the OpenBenchmarking.org test page.

CP2K Molecular Dynamics 8.1 - Fayalite-FIST Data (Seconds, Fewer Is Better)
1a: 1478.06
2: 1478.48
3: 1480.07

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha 0.1.2 - Device Inference Score (Score, More Is Better)
1a: 743
2: 742

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 2 + RDO Post-Processing (Seconds, Fewer Is Better)
1a: 808.35 (SE +/- 0.28, N = 3; Min: 807.85 / Max: 808.84)
2: 807.29 (SE +/- 0.31, N = 3; Min: 806.81 / Max: 807.87)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
1a: 113.97 (SE +/- 0.05, N = 3; Min: 113.89 / Max: 114.07; reported MIN: 113.51 / MAX: 123.78)
2: 113.82 (SE +/- 0.04, N = 3; Min: 113.77 / Max: 113.89; reported MIN: 113.45 / MAX: 122.83)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better)
1a: 2.509 (SE +/- 0.004, N = 3; Min: 2.5 / Max: 2.52)
2: 2.512 (SE +/- 0.002, N = 3; Min: 2.51 / Max: 2.52)

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
1a: 343.57 (SE +/- 0.07, N = 3; Min: 343.49 / Max: 343.72; reported MIN: 343.23 / MAX: 344.21)
2: 343.21 (SE +/- 0.09, N = 3; Min: 343.07 / Max: 343.37; reported MIN: 342.85 / MAX: 343.84)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
1a: 14156 (SE +/- 22.47, N = 3; Min: 14115 / Max: 14192.5)
2: 14141 (SE +/- 34.58, N = 3; Min: 14071.5 / Max: 14179)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
1a: 88.15 (SE +/- 0.04, N = 3; Min: 88.11 / Max: 88.23; reported MIN: 57.28 / MAX: 196.21)
2: 88.24 (SE +/- 0.08, N = 3; Min: 88.09 / Max: 88.32; reported MIN: 57.38 / MAX: 196.19)
3: 88.20 (SE +/- 0.11, N = 3; Min: 87.99 / Max: 88.34; reported MIN: 57.19 / MAX: 197.38)
1. (CC) gcc options: -pthread

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Twofish-XTS 512b Encryption (MiB/s, More Is Better)
1a: 400.9 (SE +/- 0.12, N = 3; Min: 400.7 / Max: 401.1)
2: 400.5 (SE +/- 0.31, N = 3; Min: 400.1 / Max: 401.1)

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better)
1a: 21.35 (SE +/- 0.21, N = 8; Min: 19.86 / Max: 21.6)
2: 21.37 (SE +/- 0.23, N = 7; Min: 19.98 / Max: 21.65)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
1a: 40.95 (SE +/- 0.02, N = 3; Min: 40.91 / Max: 40.99; reported MIN: 40.51 / MAX: 41.79)
2: 40.99 (SE +/- 0.04, N = 3; Min: 40.92 / Max: 41.03; reported MIN: 40.54 / MAX: 49.71)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resampling Benchmark (tConvolve); some previous ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
1a: 613.32 (SE +/- 0.23, N = 3; Min: 612.88 / Max: 613.67)
2: 613.91 (SE +/- 0.37, N = 3; Min: 613.49 / Max: 614.64)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better)
1a: 464.54 (SE +/- 0.37, N = 3; Min: 463.92 / Max: 465.21)
2: 464.94 (SE +/- 0.09, N = 3; Min: 464.78 / Max: 465.08)
3: 464.97 (SE +/- 0.21, N = 3; Min: 464.55 / Max: 465.23)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better)
1a: 1.089 (SE +/- 0.000, N = 3; Min: 1.09 / Max: 1.09)
2: 1.090 (SE +/- 0.001, N = 3; Min: 1.09 / Max: 1.09)

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better)
1a: 7365.58 (SE +/- 10.76, N = 3; Min: 7350.88 / Max: 7386.54)
2: 7371.11 (SE +/- 9.46, N = 3; Min: 7353.97 / Max: 7386.63)
3: 7369.78 (SE +/- 14.95, N = 3; Min: 7353.25 / Max: 7399.62)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Twofish-XTS 256b Encryption (MiB/s, More Is Better)
1a: 400.8 (SE +/- 0.32, N = 3; Min: 400.3 / Max: 401.4)
2: 400.5 (SE +/- 0.38, N = 3; Min: 399.9 / Max: 401.2)

Cryptsetup - Twofish-XTS 512b Decryption (MiB/s, More Is Better)
1a: 403.7 (SE +/- 0.15, N = 3; Min: 403.4 / Max: 403.9)
2: 403.4 (SE +/- 0.40, N = 2; Min: 403 / Max: 403.8)

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha 0.1.2 - Device AI Score (Score, More Is Better)
1a: 1430
2: 1429

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
1a: 28.66 (SE +/- 0.01, N = 3; Min: 28.65 / Max: 28.69; reported MIN: 28.25 / MAX: 29.49)
2: 28.68 (SE +/- 0.02, N = 3; Min: 28.65 / Max: 28.7; reported MIN: 28.3 / MAX: 29.59)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0: Preset: Exhaustive (Seconds, Fewer Is Better)
  1a: 387.71 (SE +/- 0.40, N = 3; Min: 386.92 / Avg: 387.71 / Max: 388.15)
  2: 387.97 (SE +/- 0.31, N = 3; Min: 387.36 / Avg: 387.97 / Max: 388.35)
  1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12: Settings: UASTC Level 3 (Seconds, Fewer Is Better)
  1a: 95.51 (SE +/- 0.31, N = 3; Min: 94.88 / Avg: 95.51 / Max: 95.84)
  2: 95.57 (SE +/- 0.30, N = 3; Min: 94.96 / Avg: 95.57 / Max: 95.89)
  1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0: Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  1a: 993.03 (SE +/- 0.27, N = 3; Min: 992.57 / Avg: 993.03 / Max: 993.49)
  2: 992.41 (SE +/- 0.15, N = 3; Min: 992.1 / Avg: 992.41 / Max: 992.57)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode 20210126: Encode Settings: Quality 100, Lossless Compression (Seconds, Fewer Is Better)
  1a: 1445.40 (SE +/- 1.95, N = 3; Min: 1441.59 / Avg: 1445.4 / Max: 1448.06)
  2: 1446.29 (SE +/- 1.82, N = 3; Min: 1444.18 / Avg: 1446.29 / Max: 1449.92)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1: Pfam Database Search (Seconds, Fewer Is Better)
  1a: 131.27 (SE +/- 0.07, N = 3; Min: 131.19 / Avg: 131.27 / Max: 131.41)
  2: 131.31 (SE +/- 0.02, N = 3; Min: 131.27 / Avg: 131.31 / Max: 131.35)
  3: 131.24 (SE +/- 0.01, N = 3; Min: 131.22 / Avg: 131.24 / Max: 131.26)
  1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3: Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  1a: 5983.5 (SE +/- 1.02, N = 3; Min: 5982.1 / Avg: 5983.53 / Max: 5985.5)
  2: 5980.3 (SE +/- 1.62, N = 3; Min: 5977.5 / Avg: 5980.33 / Max: 5983.1)
  3: 5981.8 (SE +/- 0.40, N = 3; Min: 5981.1 / Avg: 5981.8 / Max: 5982.5)
  1. (CC) gcc options: -O3

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup: PBKDF2-sha512 (Iterations Per Second, More Is Better)
  1a: 1588751
  2: 1587950 (SE +/- 801.33, N = 3; Min: 1586347 / Avg: 1587949.67 / Max: 1588751)
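
To make the "Iterations Per Second" unit concrete, here is a minimal sketch that times PBKDF2 with Python's standard `hashlib.pbkdf2_hmac`. It is not how cryptsetup measures its figure. The function name, password, salt, and iteration count are hypothetical, and the resulting numbers are not comparable with the values above.

```python
import hashlib
import time


def pbkdf2_iterations_per_second(hash_name="sha512", iterations=100_000):
    """Time one PBKDF2 derivation and return iterations per second."""
    password = b"benchmark-password"  # hypothetical input, not from the test profile
    salt = b"benchmark-salt"          # hypothetical input, not from the test profile
    start = time.perf_counter()
    hashlib.pbkdf2_hmac(hash_name, password, salt, iterations)
    elapsed = time.perf_counter() - start
    return iterations / elapsed


rate = pbkdf2_iterations_per_second("sha512")
print(f"PBKDF2-sha512: {rate:,.0f} iterations per second")
```

A single timed derivation is noisy; repeating the call several times and reporting the average with a standard error, as the page does, gives a steadier figure.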

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, More Is Better)
  1a: 2738.35 (SE +/- 2.52, N = 3; Min: 2734.03 / Avg: 2738.35 / Max: 2742.77)
  2: 2738.01 (SE +/- 1.49, N = 3; Min: 2735.89 / Avg: 2738.01 / Max: 2740.89)
  3: 2739.31 (SE +/- 1.38, N = 3; Min: 2736.92 / Avg: 2739.31 / Max: 2741.71)
  1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218: Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
  1a: 21.12 (SE +/- 0.01, N = 3; Min: 21.09 / Avg: 21.12 / Max: 21.14; MIN: 20.84 / MAX: 22.35)
  2: 21.11 (SE +/- 0.01, N = 3; Min: 21.1 / Avg: 21.11 / Max: 21.13; MIN: 20.86 / MAX: 21.73)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  1a: 122372167 (SE +/- 10051.26, N = 3; Min: 122353000 / Avg: 122372166.67 / Max: 122387000)
  2: 122317033 (SE +/- 7169.92, N = 3; Min: 122302900 / Avg: 122317033.33 / Max: 122326200)
  3: 122334467 (SE +/- 8824.65, N = 3; Min: 122325000 / Avg: 122334466.67 / Max: 122352100)
  1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases focus on the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte Carlo method (equity option example), Bonds (a fixed-rate bond with a flat forward curve), and Repo (a securities repurchase agreement). FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25: Benchmark: Repo OpenMP (ms, Fewer Is Better)
  1a: 49677.14 (SE +/- 562.64, N = 3; Min: 48557.77 / Avg: 49677.14 / Max: 50336.59)
  2: 49656.07 (SE +/- 734.76, N = 3; Min: 48225.09 / Avg: 49656.07 / Max: 50661.08)
  1. (CXX) g++ options: -O3 -march=native -fopenmp

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup: PBKDF2-whirlpool (Iterations Per Second, More Is Better)
  1a: 667331 (SE +/- 2257.33, N = 3; Min: 662816 / Avg: 667330.67 / Max: 669588)
  2: 667606 (SE +/- 1574.42, N = 3; Min: 664496 / Avg: 667606 / Max: 669588)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218: Target: CPU - Model: mobilenet (ms, Fewer Is Better)
  1a: 27.55 (SE +/- 0.03, N = 3; Min: 27.49 / Avg: 27.55 / Max: 27.59; MIN: 27.17 / MAX: 28.55)
  2: 27.54 (SE +/- 0.02, N = 3; Min: 27.51 / Avg: 27.54 / Max: 27.58; MIN: 27.2 / MAX: 30.25)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218: Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
  1a: 28.69 (SE +/- 0.02, N = 3; Min: 28.66 / Avg: 28.69 / Max: 28.73; MIN: 28.24 / MAX: 29.73)
  2: 28.70 (SE +/- 0.01, N = 3; Min: 28.69 / Avg: 28.7 / Max: 28.71; MIN: 28.3 / MAX: 29.54)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9: Test: LPUSH (Requests Per Second, More Is Better)
  1a: 1718046.71 (SE +/- 8997.19, N = 3; Min: 1703298.25 / Avg: 1718046.71 / Max: 1734349)
  2: 1717450.17 (SE +/- 3103.90, N = 3; Min: 1712627.5 / Avg: 1717450.17 / Max: 1723246.62)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12: Settings: UASTC Level 0 (Seconds, Fewer Is Better)
  1a: 9.228 (SE +/- 0.005, N = 3; Min: 9.22 / Avg: 9.23 / Max: 9.24)
  2: 9.231 (SE +/- 0.005, N = 3; Min: 9.22 / Avg: 9.23 / Max: 9.24)
  1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9: Test: SET (Requests Per Second, More Is Better)
  1a: 1980808.83 (SE +/- 17544.22, N = 3; Min: 1945935 / Avg: 1980808.83 / Max: 2001601.38)
  2: 1981355.17 (SE +/- 8366.29, N = 3; Min: 1964649 / Avg: 1981355.17 / Max: 1990522.25)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Google SynthMark 20201109: Test: VoiceMark_100 (Voices, More Is Better)
  1a: 615.64 (SE +/- 0.20, N = 3; Min: 615.36 / Avg: 615.64 / Max: 616.03)
  2: 615.80 (SE +/- 0.18, N = 3; Min: 615.49 / Avg: 615.8 / Max: 616.11)
  1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

WavPack Audio Encoding 5.3: WAV To WavPack (Seconds, Fewer Is Better)
  1a: 16.69 (SE +/- 0.01, N = 5; Min: 16.68 / Avg: 16.69 / Max: 16.73)
  2: 16.69 (SE +/- 0.01, N = 5; Min: 16.67 / Avg: 16.69 / Max: 16.73)
  1. (CXX) g++ options: -rdynamic

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218: Target: CPU - Model: resnet50 (ms, Fewer Is Better)
  1a: 43.67 (SE +/- 0.02, N = 3; Min: 43.65 / Avg: 43.67 / Max: 43.71; MIN: 41.66 / MAX: 47.83)
  2: 43.68 (SE +/- 0.04, N = 3; Min: 43.61 / Avg: 43.68 / Max: 43.72; MIN: 41.61 / MAX: 45.34)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha 0.1.2: Device Training Score (Score, More Is Better)
  1a: 687
  2: 687

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.6: Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1a: 45 (SE +/- 0.00, N = 3; Min: 44.5 / Avg: 44.5 / Max: 44.5)
  2: 45 (SE +/- 0.17, N = 3; Min: 44.5 / Avg: 44.67 / Max: 45)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6: Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1a: 397 (SE +/- 1.26, N = 3; Min: 395.5 / Avg: 397 / Max: 399.5)
  2: 397 (SE +/- 1.15, N = 3; Min: 395 / Avg: 397 / Max: 399)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6: Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1a: 261 (SE +/- 0.33, N = 3; Min: 261 / Avg: 261.33 / Max: 262)
  2: 261 (SE +/- 0.17, N = 3; Min: 260.5 / Avg: 260.67 / Max: 261)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0: Preset: Medium (Seconds, Fewer Is Better)
  1a: 6.23 (SE +/- 0.00, N = 3; Min: 6.23 / Avg: 6.23 / Max: 6.23)
  2: 6.23 (SE +/- 0.00, N = 3; Min: 6.23 / Avg: 6.23 / Max: 6.24)
  1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.5: Compression Level: 19 (MB/s, More Is Better)
  1a: 12.5 (SE +/- 0.00, N = 3; Min: 12.5 / Avg: 12.5 / Max: 12.5)
  2: 12.5 (SE +/- 0.00, N = 3; Min: 12.5 / Avg: 12.5 / Max: 12.5)
  3: 12.5 (SE +/- 0.00, N = 3; Min: 12.5 / Avg: 12.5 / Max: 12.5)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1: Throughput Test: PartialTweets (GB/s, More Is Better)
  1a: 0.6 (SE +/- 0.00, N = 3; Min: 0.6 / Avg: 0.6 / Max: 0.6)
  2: 0.6 (SE +/- 0.00, N = 3; Min: 0.6 / Avg: 0.6 / Max: 0.6)
  3: 0.6 (SE +/- 0.00, N = 3; Min: 0.6 / Avg: 0.6 / Max: 0.6)
  1. (CXX) g++ options: -O3 -pthread

simdjson 0.7.1: Throughput Test: LargeRandom (GB/s, More Is Better)
  1a: 0.37 (SE +/- 0.00, N = 3; Min: 0.37 / Avg: 0.37 / Max: 0.37)
  2: 0.37 (SE +/- 0.00, N = 3; Min: 0.37 / Avg: 0.37 / Max: 0.37)
  3: 0.37 (SE +/- 0.00, N = 3; Min: 0.37 / Avg: 0.37 / Max: 0.37)
  1. (CXX) g++ options: -O3 -pthread

simdjson 0.7.1: Throughput Test: Kostya (GB/s, More Is Better)
  1a: 0.48 (SE +/- 0.00, N = 3; Min: 0.48 / Avg: 0.48 / Max: 0.49)
  2: 0.48 (SE +/- 0.00, N = 3; Min: 0.48 / Avg: 0.48 / Max: 0.49)
  3: 0.48 (SE +/- 0.00, N = 3; Min: 0.48 / Avg: 0.48 / Max: 0.49)
  1. (CXX) g++ options: -O3 -pthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8: Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  1a: 450
  2: 450
  3: 450
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8: Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  1a: 105
  2: 105
  3: 105
  SE +/- 0.33, N = 3 and Min: 104 / Avg: 104.67 / Max: 105 are reported for one of the runs.
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8: Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  1a: 38
  2: 38
  3: 38
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Warsow

This is a benchmark of Warsow, a popular open-source first-person shooter. This game uses the QFusion engine. Learn more via the OpenBenchmarking.org test page.

Warsow 2.5 Beta: Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
  1: 62.9 (SE +/- 0.37, N = 3)

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

DDraceNetwork 15.2.3: Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap (Frames Per Second, More Is Better)
  1: 209.31 (SE +/- 0.86, N = 3; MIN: 54.45 / MAX: 280.9)
  1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork 15.2.3: Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 (Frames Per Second, More Is Better)
  1: 92.07 (SE +/- 0.23, N = 3; MIN: 29.47 / MAX: 123)
  1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218: Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
  1a: 17.88 (SE +/- 0.18, N = 3; Min: 17.54 / Avg: 17.88 / Max: 18.14; MIN: 17.22 / MAX: 18.84)
  2: 19.34 (SE +/- 0.77, N = 3; Min: 18.53 / Avg: 19.34 / Max: 20.88; MIN: 17.15 / MAX: 29.77)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
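
The two regnety_400m averages (17.88 ms and 19.34 ms) differ more than most results on this page. As a rough illustration, the sketch below computes the relative difference and checks whether the per-run Min/Max ranges overlap. The helper names are hypothetical, and a range-overlap check is only a crude heuristic, not a statistical test.

```python
def relative_change(baseline, other):
    """Percent change of `other` relative to `baseline`."""
    return (other - baseline) / baseline * 100.0


def ranges_overlap(a_min, a_max, b_min, b_max):
    """True if the closed intervals [a_min, a_max] and [b_min, b_max] intersect."""
    return a_min <= b_max and b_min <= a_max


# Averages and Min/Max values taken from the regnety_400m result.
change = relative_change(17.88, 19.34)
print(f"{change:.2f}%")                             # 8.17%
print(ranges_overlap(17.54, 18.14, 18.53, 20.88))   # False
```

Since this is a latency in milliseconds (fewer is better), a positive change means the second run was slower. The non-overlapping ranges hint that the gap is larger than run-to-run spread, though with N = 3 that conclusion is tentative.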

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0: Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  1a: 8.29113 (SE +/- 0.06721, N = 15; Min: 7.45 / Avg: 8.29 / Max: 8.56; MIN: 7.1)
  2: 7.66942 (SE +/- 0.13886, N = 12; Min: 6.16 / Avg: 7.67 / Max: 7.97; MIN: 5.68)
  3: 7.81482 (SE +/- 0.15309, N = 12; Min: 6.2 / Avg: 7.81 / Max: 8.3; MIN: 5.76)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0: Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  1a: 2.90509 (SE +/- 0.03413, N = 14; Min: 2.54 / Avg: 2.91 / Max: 3.09; MIN: 2.26)
  2: 2.62510 (SE +/- 0.04605, N = 12; Min: 2.12 / Avg: 2.63 / Max: 2.69; MIN: 2.08)
  3: 2.65725 (SE +/- 0.04568, N = 12; Min: 2.16 / Avg: 2.66 / Max: 2.71; MIN: 2.1)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

200 Results Shown

DDraceNetwork:
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymap - Total Frame Time
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2 - Total Frame Time
Redis
oneDNN:
  IP Shapes 1D - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
Redis
oneDNN
NAS Parallel Benchmarks
NCNN:
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU - regnety_400m
oneDNN
dav1d
oneDNN
CLOMP
Numpy Benchmark
oneDNN
NCNN:
  Vulkan GPU-v2-v2 - mobilenet-v2
  CPU - blazeface
  CPU - shufflenet-v2
  Vulkan GPU - shufflenet-v2
eSpeak-NG Speech Engine
oneDNN
NAS Parallel Benchmarks
oneDNN
Timed Eigen Compilation
Timed FFmpeg Compilation
oneDNN
Cython Benchmark
oneDNN
Stockfish
asmFish
oneDNN:
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
VkFFT
NCNN
Crafty
simdjson
NCNN
VKMark
oneDNN
Quantum ESPRESSO
dav1d
ONNX Runtime
BRL-CAD
oneDNN
CloverLeaf
QMCPACK
SQLite Speedtest
lzbench
NCNN
LZ4 Compression
lzbench
ASKAP
Unpacking Firefox
rav1e
NCNN
TNN
lzbench
Mobile Neural Network
Embree
rav1e
WebP2 Image Encode
ASKAP
Basis Universal
NAS Parallel Benchmarks
dav1d
Build2
NCNN
LZ4 Compression
rav1e
Embree
LZ4 Compression
lzbench
Etcpak
VkResample
Embree
ASKAP
LZ4 Compression
Gcrypt Library
Mobile Neural Network
Cryptsetup
Redis
Timed Godot Game Engine Compilation
NCNN
Cryptsetup
Mobile Neural Network:
  MobileNetV2_224
  inception-v3
Timed MAFFT Alignment
NCNN
Cryptsetup
lzbench
Embree
ASKAP
Etcpak
lzbench
NCNN
lzbench
WebP2 Image Encode
Mobile Neural Network
Embree
Etcpak
Node.js V8 Web Tooling Benchmark
GROMACS
NCNN:
  CPU-v2-v2 - mobilenet-v2
  CPU - resnet18
Coremark
VkResample
lzbench
Zstd Compression
Monkey Audio Encoding
NAS Parallel Benchmarks
LAMMPS Molecular Dynamics Simulator
Cryptsetup
QuantLib
Cryptsetup
NCNN
Embree
NCNN
Kripke
Cryptsetup
ASTC Encoder
FinanceBench
NCNN
Basis Universal
rav1e
Etcpak
lzbench
NCNN
Cryptsetup
NCNN
Cryptsetup
LZ4 Compression
Opus Codec Encoding
lzbench
Cryptsetup
NCNN
ASKAP
NAS Parallel Benchmarks
ASTC Encoder
WebP2 Image Encode
PHPBench
CP2K Molecular Dynamics
AI Benchmark Alpha
Basis Universal
NCNN
IndigoBench
TNN
ONNX Runtime
dav1d
Cryptsetup
WebP2 Image Encode
NCNN
ASKAP
OpenFOAM
IndigoBench
NAS Parallel Benchmarks
Cryptsetup:
  Twofish-XTS 256b Encryption
  Twofish-XTS 512b Decryption
AI Benchmark Alpha
NCNN
ASTC Encoder
Basis Universal
ASKAP
WebP2 Image Encode
Timed HMMer Search
LZ4 Compression
Cryptsetup
LULESH
NCNN
Algebraic Multi-Grid Benchmark
FinanceBench
Cryptsetup
NCNN:
  CPU - mobilenet
  Vulkan GPU - squeezenet_ssd
Redis
Basis Universal
Redis
Google SynthMark
WavPack Audio Encoding
NCNN
AI Benchmark Alpha
ONNX Runtime:
  fcn-resnet101-11 - OpenMP CPU
  bertsquad-10 - OpenMP CPU
  yolov4 - OpenMP CPU
ASTC Encoder
Zstd Compression
simdjson:
  PartialTweets
  LargeRandom
  Kostya
lzbench:
  Crush 0 - Decompression
  XZ 0 - Decompression
  XZ 0 - Compression
Warsow
DDraceNetwork:
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymap
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2
NCNN
oneDNN:
  Deconvolution Batch shapes_1d - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU