HP Zbook

Intel Core i9-10885H testing with a HP 8736 (S91 Ver. 01.02.01 BIOS) and NVIDIA Quadro RTX 5000 with Max-Q Design 16GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101076-HA-HPZBOOK6247
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

Audio Encoding 2 Tests
AV1 2 Tests
Bioinformatics 2 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
Chess Test Suite 4 Tests
Timed Code Compilation 4 Tests
C/C++ Compiler Tests 13 Tests
Compression Tests 2 Tests
CPU Massive 23 Tests
Creator Workloads 22 Tests
Database Test Suite 3 Tests
Encoding 4 Tests
Fortran Tests 2 Tests
Game Development 4 Tests
HPC - High Performance Computing 19 Tests
Imaging 5 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 12 Tests
Molecular Dynamics 2 Tests
MPI Benchmarks 3 Tests
Multi-Core 22 Tests
NVIDIA GPU Compute 24 Tests
Intel oneAPI 3 Tests
OpenCL 6 Tests
OpenGL Demos Test Suite 2 Tests
OpenMPI Tests 4 Tests
Productivity 2 Tests
Programmer / Developer System Benchmarks 10 Tests
Python Tests 4 Tests
Renderers 2 Tests
Scientific Computing 5 Tests
Server 6 Tests
Server CPU Tests 11 Tests
Single-Threaded 6 Tests
Speech 3 Tests
Telephony 3 Tests
Texture Compression 3 Tests
Unigine Test Suite 2 Tests
Video Encoding 2 Tests
Vulkan Compute 6 Tests
Common Workstation Benchmarks 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
r1
January 04 2021
  21 Hours, 19 Minutes
r2
January 05 2021
  21 Hours, 8 Minutes
r3
January 06 2021
  20 Hours, 49 Minutes
Invert Hiding All Results Option
  21 Hours, 5 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


HP ZbookOpenBenchmarking.orgPhoronix Test SuiteIntel Core i9-10885H @ 5.30GHz (8 Cores / 16 Threads)HP 8736 (S91 Ver. 01.02.01 BIOS)Intel Comet Lake PCH32GB2048GB KXG50PNV2T04 KIOXIANVIDIA Quadro RTX 5000 with Max-Q Design 16GB (600/6000MHz)Intel Comet Lake PCH cAVSIntel Wi-Fi 6 AX201Ubuntu 20.045.6.0-1034-oem (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.80.024.6.0OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 10.1ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionHP Zbook BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe0 - Thermald 1.9.1- GPU Compute Cores: 3072- Python 3.8.3- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

r1r2r3Result OverviewPhoronix Test Suite100%112%124%136%CLOMPDDraceNetworkRedisViennaCLTNNeSpeak-NG Speech EngineRNNoiseNCNNMonkey Audio EncodingLuxCoreRender OpenCLStockfishNeatBenchLeelaChessZeroTimed Eigen CompilationSQLite SpeedtestWaifu2x-NCNN VulkanWarsowasmFishBetsy GPU CompressorRodiniaGROMACSCryptsetupTimed MAFFT AlignmentBlenderPHPBenchHashcatNode.js V8 Web Tooling BenchmarkCraftyASTC EncoderMobile Neural NetworkPlaidMLArrayFireUnpacking FirefoxVkFFTLZ4 CompressionGraphicsMagickUnigine SuperpositionNumpy BenchmarkNAMD CUDAVkResampleLAMMPS Molecular Dynamics SimulatorUnigine HeavensimdjsonOpenVINORedShift DemoTimed Linux Kernel Compilationrav1edav1dLevelDBBuild2RawTherapeecl-memBasis UniversalRealSR-NCNNInkscapeTensorFlow LiteMandelGPUBRL-CADDeepSpeechEmbreeOpus Codec EncodingclpeakCoremarkZstd CompressionTimed FFmpeg CompilationHigh Performance Conjugate GradientGEGLAI Benchmark AlphaOctaneBenchTimed HMMer SearchFAHBenchoneDNNyquake2DarktableIndigoBenchFinanceBench

HP Zbookvkfft: dav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitplaidml: No - Inference - IMDB LSTM - OpenCLplaidml: No - Inference - Mobilenet - OpenCLplaidml: Yes - Inference - Mobilenet - OpenCLplaidml: No - Inference - DenseNet 201 - OpenCLopenvino: Face Detection 0106 FP16 - CPUopenvino: Face Detection 0106 FP32 - CPUopenvino: Person Detection 0106 FP16 - CPUopenvino: Person Detection 0106 FP32 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP32 - CPUneatbench: GPUddnet: 1920 x 1080 - Fullscreen - OpenGL 3.0 - Default - RaiNyMore2ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.0 - Default - Multeasymapddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapunigine-heaven: 1920 x 1080 - Fullscreen - OpenGLunigine-super: 1920 x 1080 - Fullscreen - Low - OpenGLunigine-super: 1920 x 1080 - Fullscreen - High - OpenGLunigine-super: 1920 x 1080 - Fullscreen - Ultra - OpenGLunigine-super: 1920 x 1080 - Fullscreen - Medium - OpenGLwarsow: 1920 x 1080yquake2: OpenGL 1.x - 1920 x 1080yquake2: OpenGL 3.x - 1920 x 1080yquake2: Software CPU - 1920 x 1080embree: Pathtracer - Crownembree: Pathtracer ISPC - Crownembree: Pathtracer - Asian Dragonembree: Pathtracer ISPC - Asian Dragonrav1e: 1rav1e: 5rav1e: 6rav1e: 10cl-mem: Copycl-mem: Readcl-mem: Writesimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDclpeak: Global Memory Bandwidthhpcg: viennacl: OpenCL LU Factorizationclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Integer Compute INThashcat: MD5hashcat: SHA1hashcat: 7-Ziphashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTSgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacecryptsetup: PBKDF2-sha512cryptsetup: PBKDF2-whirlpoolcoremark: CoreMark Size 666 - Iterations Per Secondindigobench: CPU - Bedroomindigobench: CPU - Supercarluxcorerender-cl: DLSCluxcorerender-cl: Foodluxcorerender-cl: LuxCore Benchmarkluxcorerender-cl: Rainbow Colors and Prismleveldb: Fill Syncleveldb: Overwriteleveldb: Rand Fillleveldb: Seq Fillcompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedcompress-zstd: 3compress-zstd: 19cryptsetup: AES-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptionlczero: OpenCLcrafty: Elapsed Timestockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthfahbench: gromacs: Water Benchmarklammps: Rhodopsin Proteinredis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETnode-web-tooling: mandelgpu: GPUoctanebench: Total Scorenumpy: ai-benchmark: Device Inference Scoreai-benchmark: Device Training Scoreai-benchmark: Device AI Scorephpbench: PHP Benchmark Suiteclomp: Static OMP Speedupbrl-cad: VGR Performance Metricnamd-cuda: ATPase Simulation - 327,506 Atomsleveldb: Hot Readleveldb: Fill Syncleveldb: Overwriteleveldb: Rand Fillleveldb: Rand Readleveldb: Seek Randleveldb: Rand Deleteleveldb: Seq Filltensorflow-lite: SqueezeNettensorflow-lite: Inception V4tensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quanttensorflow-lite: Inception ResNet V2financebench: Black-Scholes OpenCLvkresample: 2x - Doublevkresample: 2x - Singlearrayfire: Conjugate Gradient OpenCLonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUmnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1openvino: Face Detection 0106 FP16 - CPUopenvino: Face Detection 0106 FP32 - CPUopenvino: Person Detection 0106 FP16 - CPUopenvino: Person Detection 0106 FP32 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP32 - CPUrealsr-ncnn: 4x - Norealsr-ncnn: 4x - Yeswaifu2x-ncnn: 2x - 3 - Yesbetsy: ETC1 - Highestbetsy: ETC2 RGB - Highestredshift: rodinia: OpenCL Particle Filterhmmer: Pfam Database Searchmafft: Multiple Sequence Alignment - LSU RNAbuild-ffmpeg: Time To Compilebuild-linux-kernel: Time To Compilebuild2: Time To Compilebuild-eigen: Time To Compiledeepspeech: CPUencode-ape: WAV To APEencode-opus: WAV To Opus Encodeespeak: Text-To-Speech Synthesisrnnoise: astcenc: Fastastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3basis: UASTC Level 2 + RDO Post-Processingsqlite-speedtest: Timed Time - Size 1,000darktable: Boat - CPU-onlydarktable: Masskrug - CPU-onlydarktable: Server Rack - CPU-onlydarktable: Server Room - CPU-onlygegl: Cropgegl: Scalegegl: Cartoongegl: Reflectgegl: Antialiasgegl: Tile Glassgegl: Wavelet Blurgegl: Color Enhancegegl: Rotate 90 Degreesinkscape: SVG Files To PNGrawtherapee: Total Benchmark Timeblender: BMW27 - CUDAblender: Classroom - CUDAblender: Fishy Cat - CUDAblender: Barbershop - CUDAblender: BMW27 - NVIDIA OptiXblender: Classroom - NVIDIA OptiXblender: Fishy Cat - NVIDIA OptiXblender: Barbershop - NVIDIA OptiXblender: Pabellon Barcelona - CUDAblender: Pabellon Barcelona - NVIDIA OptiXunpack-firefox: firefox-84.0.source.tar.xzr1r2r325820489.84112.75460.0286.08463.341246.781819.24110.071.281.260.800.793442.783363.5527.5170.36158.21413.88435.20139.126177.765.925.190.4955.659.96060.76.08067.07357.55559.13430.3471.0691.4443.422236.6330.3215.70.760.50.860.89324.633.9617768.29245940.64340.425504.352433486666785857666673736671023100000301233207902721155521467751919349816282223414.8075580.9392.1472.701.272.265.300.543.243.137.58120.679823.257.889676.355.729679.82833.628.84005.64002.4874.1872.3482.0482.53346.83348.3878.0871.7482.7483.0132779497414970313315984719186.46110.6175.1983394660.22660539.422041750.083248596.082375800.2513.06251986408.7189.085068419.5873081615468379113.7639090.221036.9463361.77740.92541.0379.62012.69447.22847.2353548925163190302594239119236716466019717.477667256.86724.9922.5497.1657512.47213.177622.7255821.68719.775949.8789318.00808.967824.747727155.413795.027140.503795.814.363817144.233797.324.455648.89958.1615.23910.64662.56826.627.315.747.936.6710.002.5419.9872.0918.6215.5037.8135.9527.6419.1626.527.235.747.926.6310.012.5520.0571.9618.6215.4437.2535.5227.5819.16321.420272.9073165.243202.534961.995069.441.171.2114.73499.8126.0205.8548.0164617.115105.52610.497100.257151.656210.05168.74481.2951710.5127.62426.47422.0845.447.6854.29447.9957.8247.28855.499110.838840.34749.54715.9147.1280.1814.1818.9006.95486.78928.18336.55628.24357.99354.11437.69720.99680.58691.00250.78168.87734.8141.47116.7660.351192.96608.80196.2116.02825647486.46112.03459.6185.83477.391244.951823.06109.981.281.270.800.793403.453307.5327.1169.30100.58412.43429.37139.905178.166.525.490.6967.959.96060.76.09896.97947.56569.25960.3461.0641.4403.404235.4329.9215.60.750.50.870.88324.583.9606864.23355858.32340.465519.392426020000085445000003704001020000000301433207875721155511477741943008830020223304.9832860.9382.1502.771.322.315.390.543.243.137.48127.789839.957.369653.756.079664.82831.028.74080.54055.1881.4876.6487.4486.33381.93388.5882.1878.1485.7486.4131739584148983929215974611186.47770.6105.1692104092.332628039.252094056.313012560.832413657.013.17252826584.8189.101719419.3673081415448324172.5638220.222387.0993424.91840.95541.0279.69212.62947.29647.2873560345168183304756239224237129467056717.476257.06225.1902.5317.0440412.44473.167692.7767021.69929.764689.7770117.90359.006924.714577159.483797.057159.423800.414.378527154.663799.454.470628.98258.5305.29110.67563.18026.637.225.816.955.969.052.6020.0171.9118.7115.4637.3035.5927.5118.9126.537.225.736.985.869.022.2918.2071.8218.3315.5337.3435.5127.5217.15295.547264.9483166.573207.354978.255079.891.191.2314.656100.6176.1025.7897.9124607.055105.57210.564100.397152.208210.71267.54381.0731610.8617.60227.17821.3165.597.6154.38449.3758.0637.34555.742110.926840.31950.26815.8707.1500.1814.1748.8396.97387.31928.49636.55728.24257.95054.31237.54121.04880.93490.82251.90167.96731.6738.07116.1560.181190.05609.56196.2816.13725683487.57112.65459.7185.95478.731247.931817.78109.991.281.270.800.793405.923347.9327.6151.49130.66412.38434.24139.184177.466.225.390.5968.659.96060.66.06416.99767.54969.19670.3471.0641.4433.420235.1329.9214.80.750.50.860.88324.783.9545765.91805892.70340.595540.442419690000085353333333664331016800000298133207900731155511477761886103810352223892.4447260.9352.1562.761.302.295.410.543.443.237.38079.189810.058.899685.257.019695.22835.128.84023.04026.9874.1870.9483.6483.03336.03362.9874.4873.5483.0483.8134169560012962935316180674186.61580.6145.1792809233.482634908.832083566.293009326.752433543.813.18252822614.4189.316553417.0373081415448297053.6640330.221717.1283386.08440.76040.9819.57312.64447.38847.4153562585178263304079239537237406467747317.476333257.61525.2252.5487.1457412.60893.112912.7487421.62109.737329.8123818.03269.066284.737287169.033797.727151.583798.124.385357147.093792.874.466568.94458.7865.28510.65863.56326.537.235.817.035.969.062.5720.2171.8618.6615.4937.2235.6627.6319.3826.517.195.817.055.918.992.2918.2671.8618.3815.5037.2635.5327.5517.60299.396272.6763164.513212.105006.345073.091.191.2214.694100.7486.0935.7927.9034597.027105.50510.608100.203151.478210.94568.69981.0398310.5927.61627.71322.0445.637.5854.65449.9058.0627.35355.766111.040841.22850.26115.8637.1550.1814.1788.8267.00086.99328.31336.64628.05557.84354.10137.69121.06880.71290.93251.80168.08733.0238.07116.2660.251192.80608.62196.4116.103OpenBenchmarking.org

VkFFT

VkFFT is a Fast Fourier Transform (FFT) Library that is GPU accelerated by means of the Vulkan API. The VkFFT benchmark runs FFT performance differences of many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1r1r2r36K12K18K24K30KSE +/- 62.93, N = 3SE +/- 58.68, N = 3SE +/- 108.37, N = 32582025647256831. (CXX) g++ options: -O3 -pthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080pr1r2r3110220330440550SE +/- 5.73, N = 14SE +/- 3.02, N = 14SE +/- 3.24, N = 13489.84486.46487.57MIN: 317.1 / MAX: 898.12MIN: 316.37 / MAX: 900.57MIN: 316.7 / MAX: 911.471. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4Kr1r2r3306090120150SE +/- 1.06, N = 6SE +/- 1.08, N = 6SE +/- 1.07, N = 6112.75112.03112.65MIN: 99.69 / MAX: 158.99MIN: 99.17 / MAX: 157.08MIN: 99.62 / MAX: 158.581. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080pr1r2r3100200300400500SE +/- 3.60, N = 14SE +/- 3.46, N = 13SE +/- 3.80, N = 13460.02459.61459.71MIN: 375.05 / MAX: 590.01MIN: 374.03 / MAX: 582.97MIN: 374.63 / MAX: 587.931. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitr1r2r320406080100SE +/- 0.99, N = 4SE +/- 1.05, N = 4SE +/- 1.03, N = 486.0885.8385.95MIN: 54.34 / MAX: 256.39MIN: 54.27 / MAX: 257.58MIN: 54.21 / MAX: 255.721. (CC) gcc options: -pthread

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLr1r2r3100200300400500SE +/- 0.36, N = 3SE +/- 1.92, N = 3SE +/- 2.86, N = 3463.34477.39478.73

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLr1r2r330060090012001500SE +/- 3.10, N = 3SE +/- 2.03, N = 3SE +/- 4.92, N = 31246.781244.951247.93

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLr1r2r3400800120016002000SE +/- 7.57, N = 3SE +/- 3.54, N = 3SE +/- 8.53, N = 31819.241823.061817.78

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCLr1r2r320406080100SE +/- 0.19, N = 3SE +/- 0.42, N = 3SE +/- 0.40, N = 3110.07109.98109.99

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP16 - Device: CPUr1r2r30.2880.5760.8641.1521.44SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.281.281.281. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP32 - Device: CPUr1r2r30.28580.57160.85741.14321.429SE +/- 0.01, N = 3SE +/- 0.02, N = 4SE +/- 0.02, N = 31.261.271.271. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP16 - Device: CPUr1r2r30.180.360.540.720.9SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 30.800.800.801. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP32 - Device: CPUr1r2r30.17780.35560.53340.71120.889SE +/- 0.01, N = 3SE +/- 0.01, N = 9SE +/- 0.01, N = 50.790.790.791. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUr1r2r37001400210028003500SE +/- 33.67, N = 3SE +/- 38.35, N = 4SE +/- 34.05, N = 63442.783403.453405.921. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP32 - Device: CPUr1r2r37001400210028003500SE +/- 35.01, N = 3SE +/- 33.23, N = 5SE +/- 40.89, N = 43363.553307.533347.931. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPUr1r2r3612182430SE +/- 0.57, N = 15SE +/- 0.47, N = 15SE +/- 0.60, N = 1527.527.127.6

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.0 - Zoom: Default - Demo: RaiNyMore2r1r2r34080120160200SE +/- 9.09, N = 15SE +/- 9.59, N = 15SE +/- 11.09, N = 15170.36169.30151.49MIN: 2.43 / MAX: 499.5MIN: 2.38 / MAX: 499.5MIN: 2.37 / MAX: 499.751. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2r1r2r3306090120150SE +/- 13.14, N = 12SE +/- 9.86, N = 15158.21100.58130.66MIN: 7.02 / MAX: 449.03MIN: 6.72 / MAX: 493.34MIN: 6.67 / MAX: 498.751. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.0 - Zoom: Default - Demo: Multeasymapr1r2r390180270360450SE +/- 0.79, N = 3SE +/- 2.87, N = 3SE +/- 4.35, N = 3413.88412.43412.38MIN: 119.86 / MAX: 499.75MIN: 103.17 / MAX: 499.75MIN: 127.91 / MAX: 499.751. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymapr1r2r390180270360450SE +/- 0.25, N = 3SE +/- 2.73, N = 3SE +/- 2.45, N = 3435.20429.37434.24MIN: 99.45 / MAX: 499.75MIN: 112.88 / MAX: 499.75MIN: 115.25 / MAX: 499.751. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Unigine Heaven

This test calculates the average frame-rate within the Heaven demo for the Unigine engine. This engine is extremely demanding on the system's graphics card. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterUnigine Heaven 4.0Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGLr1r2r3306090120150SE +/- 0.71, N = 3SE +/- 0.96, N = 3SE +/- 0.56, N = 3139.13139.91139.18

Unigine Superposition

This test calculates the average frame-rate within the Superposition demo for the Unigine engine, released in 2017. This engine is extremely demanding on the system's graphics card. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterUnigine Superposition 1.0Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: Low - Renderer: OpenGLr1r2r34080120160200SE +/- 0.23, N = 3SE +/- 0.71, N = 3SE +/- 0.52, N = 3177.7178.1177.4MAX: 260.1MAX: 259.4MAX: 263.9

OpenBenchmarking.orgFrames Per Second, More Is BetterUnigine Superposition 1.0Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: High - Renderer: OpenGLr1r2r31530456075SE +/- 0.19, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 365.966.566.2MAX: 81.6MAX: 80.8MAX: 80.3

OpenBenchmarking.orgFrames Per Second, More Is BetterUnigine Superposition 1.0Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: Ultra - Renderer: OpenGLr1r2r3612182430SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 325.125.425.3MAX: 29.3MAX: 29.4MAX: 29.7

OpenBenchmarking.orgFrames Per Second, More Is BetterUnigine Superposition 1.0Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: Medium - Renderer: OpenGLr1r2r320406080100SE +/- 0.15, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 390.490.690.5MAX: 114.5MAX: 114.4MAX: 113

Warsow

This is a benchmark of Warsow, a popular open-source first-person shooter. This game uses the QFusion engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080r1r2r32004006008001000SE +/- 13.76, N = 12SE +/- 1.46, N = 3SE +/- 1.81, N = 3955.6967.9968.6

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 1080r1r2r31326395265SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 359.959.959.91. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 1080r1r2r313263952656060601. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 1080r1r2r31428425670SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 360.760.760.61. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crownr1r2r3246810SE +/- 0.0766, N = 3SE +/- 0.0737, N = 3SE +/- 0.0667, N = 36.08066.09896.0641MIN: 5.86 / MAX: 11.02MIN: 5.88 / MAX: 10.98MIN: 5.86 / MAX: 10.95

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crownr1r2r3246810SE +/- 0.0830, N = 3SE +/- 0.0728, N = 4SE +/- 0.0756, N = 57.07356.97946.9976MIN: 6.66 / MAX: 12.73MIN: 6.57 / MAX: 12.32MIN: 6.56 / MAX: 12.56

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragonr1r2r3246810SE +/- 0.0643, N = 3SE +/- 0.0719, N = 3SE +/- 0.0754, N = 37.55557.56567.5496MIN: 7.18 / MAX: 12.55MIN: 7.18 / MAX: 12.51MIN: 7.19 / MAX: 12.66

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragonr1r2r33691215SE +/- 0.0822, N = 3SE +/- 0.0236, N = 3SE +/- 0.1308, N = 39.13439.25969.1967MIN: 8.81 / MAX: 15.06MIN: 8.82 / MAX: 14.99MIN: 8.85 / MAX: 15

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 1r1r2r30.07810.15620.23430.31240.3905SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 30.3470.3460.347

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 5r1r2r30.24050.4810.72150.9621.2025SE +/- 0.005, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 31.0691.0641.064

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 6r1r2r30.32490.64980.97471.29961.6245SE +/- 0.010, N = 3SE +/- 0.006, N = 3SE +/- 0.012, N = 31.4441.4401.443

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 10r1r2r30.771.542.313.083.85SE +/- 0.044, N = 3SE +/- 0.035, N = 3SE +/- 0.027, N = 33.4223.4043.420

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copyr1r2r350100150200250SE +/- 0.22, N = 3SE +/- 0.24, N = 3SE +/- 0.27, N = 3236.6235.4235.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readr1r2r370140210280350SE +/- 0.18, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3330.3329.9329.91. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writer1r2r350100150200250SE +/- 0.47, N = 3SE +/- 0.26, N = 3SE +/- 0.50, N = 3215.7215.6214.81. (CC) gcc options: -O2 -flto -lOpenCL

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostyar1r2r30.1710.3420.5130.6840.855SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.760.750.751. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandomr1r2r30.11250.2250.33750.450.5625SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.50.50.51. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweetsr1r2r30.19580.39160.58740.78320.979SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.860.870.861. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserIDr1r2r30.20030.40060.60090.80121.0015SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.890.880.881. (CXX) g++ options: -O3 -pthread

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidthr1r2r370140210280350SE +/- 0.32, N = 3SE +/- 0.28, N = 3SE +/- 0.28, N = 3324.63324.58324.781. (CXX) g++ options: -O3 -rdynamic -lOpenCL

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1r1r2r30.89141.78282.67423.56564.457SE +/- 0.00082, N = 3SE +/- 0.00692, N = 3SE +/- 0.01196, N = 33.961773.960683.954571. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU Factorizationr1r2r31530456075SE +/- 0.36, N = 3SE +/- 0.08, N = 3SE +/- 0.44, N = 368.2964.2365.921. (CXX) g++ options: -rdynamic -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Floatr1r2r313002600390052006500SE +/- 83.30, N = 15SE +/- 64.05, N = 3SE +/- 47.53, N = 35940.645858.325892.701. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Doubler1r2r370140210280350SE +/- 3.78, N = 3SE +/- 3.68, N = 3SE +/- 3.74, N = 3340.42340.46340.591. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTr1r2r312002400360048006000SE +/- 71.93, N = 15SE +/- 81.08, N = 15SE +/- 81.16, N = 155504.355519.395540.441. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD5r1r2r35000M10000M15000M20000M25000MSE +/- 110495102.96, N = 3SE +/- 81107726.72, N = 3SE +/- 49256167.13, N = 3243348666672426020000024196900000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1r1r2r32000M4000M6000M8000M10000MSE +/- 31347213.24, N = 3SE +/- 17380832.35, N = 3SE +/- 18653000.95, N = 3858576666785445000008535333333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-Zipr1r2r380K160K240K320K400KSE +/- 1589.90, N = 3SE +/- 1858.31, N = 3SE +/- 3670.30, N = 3373667370400366433

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-512r1r2r3200M400M600M800M1000MSE +/- 11546345.54, N = 15SE +/- 2594224.35, N = 3SE +/- 1852025.92, N = 3102310000010200000001016800000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTSr1r2r360K120K180K240K300KSE +/- 1322.04, N = 3SE +/- 851.14, N = 3SE +/- 545.69, N = 3301233301433298133

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirlr1r2r350100150200250SE +/- 1.72, N = 8SE +/- 1.60, N = 10SE +/- 1.72, N = 82072072071. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotater1r2r32004006008001000SE +/- 2.52, N = 3SE +/- 3.18, N = 3SE +/- 1.86, N = 39028759001. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpenr1r2r31632486480SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 37272731. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhancedr1r2r3306090120150SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 31151151151. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizingr1r2r3120240360480600SE +/- 2.73, N = 3SE +/- 5.00, N = 3SE +/- 5.36, N = 35525515511. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussianr1r2r3306090120150SE +/- 1.33, N = 3SE +/- 1.00, N = 3SE +/- 1.20, N = 31461471471. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spacer1r2r32004006008001000SE +/- 5.03, N = 3SE +/- 5.70, N = 3SE +/- 4.51, N = 37757747761. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512r1r2r3400K800K1200K1600K2000KSE +/- 7117.07, N = 3SE +/- 1201.00, N = 3SE +/- 12877.64, N = 3191934919430081886103

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpoolr1r2r3200K400K600K800K1000KSE +/- 4903.32, N = 3SE +/- 2314.28, N = 3SE +/- 2497.33, N = 3816282830020810352

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondr1r2r350K100K150K200K250KSE +/- 2532.07, N = 3SE +/- 1894.03, N = 3SE +/- 2209.16, N = 3223414.81223304.98223892.441. (CC) gcc options: -O2 -lrt" -lrt

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroomr1r2r30.21130.42260.63390.84521.0565SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.9390.9380.935

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercarr1r2r30.48510.97021.45531.94042.4255SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 32.1472.1502.156

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: DLSCr1r2r30.62331.24661.86992.49323.1165SE +/- 0.06, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 32.702.772.76MIN: 0.69 / MAX: 2.81MIN: 2.57 / MAX: 2.84MIN: 2.56 / MAX: 2.84

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: Foodr1r2r30.2970.5940.8911.1881.485SE +/- 0.04, N = 12SE +/- 0.01, N = 3SE +/- 0.02, N = 31.271.321.30MIN: 0.13 / MAX: 1.57MIN: 0.29 / MAX: 1.57MIN: 0.26 / MAX: 1.57

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: LuxCore Benchmarkr1r2r30.51981.03961.55942.07922.599SE +/- 0.04, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 32.262.312.29MIN: 0.14 / MAX: 2.63MIN: 0.27 / MAX: 2.63MIN: 0.27 / MAX: 2.64

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: Rainbow Colors and Prismr1r2r31.21732.43463.65194.86926.0865SE +/- 0.12, N = 12SE +/- 0.02, N = 3SE +/- 0.02, N = 35.305.395.41MIN: 1.66 / MAX: 5.7MIN: 4.6 / MAX: 5.67MIN: 4.58 / MAX: 5.7

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Fill Syncr1r2r30.11250.2250.33750.450.5625SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.50.50.51. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Overwriter1r2r31020304050SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 343.243.243.41. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Random Fillr1r2r31020304050SE +/- 0.21, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 343.143.143.21. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential Fillr1r2r3918273645SE +/- 0.44, N = 4SE +/- 0.46, N = 4SE +/- 0.39, N = 537.537.437.31. (CXX) g++ options: -O3 -lsnappy -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speedr1r2r32K4K6K8K10KSE +/- 6.52, N = 3SE +/- 4.75, N = 3SE +/- 11.24, N = 38120.678127.788079.181. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speedr1r2r32K4K6K8K10KSE +/- 3.96, N = 3SE +/- 2.38, N = 3SE +/- 10.11, N = 39823.29839.99810.01. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speedr1r2r31326395265SE +/- 0.61, N = 5SE +/- 0.58, N = 3SE +/- 0.48, N = 357.8857.3658.891. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speedr1r2r32K4K6K8K10KSE +/- 1.84, N = 5SE +/- 16.28, N = 3SE +/- 0.67, N = 39676.39653.79685.21. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speedr1r2r31326395265SE +/- 0.59, N = 5SE +/- 0.36, N = 3SE +/- 0.66, N = 355.7256.0757.011. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speedr1r2r32K4K6K8K10KSE +/- 1.80, N = 5SE +/- 15.38, N = 3SE +/- 0.78, N = 39679.89664.89695.21. (CC) gcc options: -O3

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3r1r2r36001200180024003000SE +/- 7.25, N = 3SE +/- 8.65, N = 3SE +/- 4.18, N = 32833.62831.02835.11. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 19r1r2r3714212835SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 328.828.728.81. (CC) gcc options: -O3 -pthread -lz -llzma

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryptionr1r2r39001800270036004500SE +/- 1.66, N = 3SE +/- 25.91, N = 3SE +/- 20.10, N = 34005.64080.54023.0

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryptionr1r2r39001800270036004500SE +/- 4.92, N = 3SE +/- 17.20, N = 3SE +/- 15.07, N = 34002.44055.14026.9

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryptionr1r2r32004006008001000SE +/- 0.92, N = 3SE +/- 1.25, N = 3SE +/- 2.67, N = 3874.1881.4874.1

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryptionr1r2r32004006008001000SE +/- 1.62, N = 3SE +/- 1.50, N = 3SE +/- 4.03, N = 3872.3876.6870.9

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryptionr1r2r3110220330440550SE +/- 0.75, N = 3SE +/- 1.08, N = 3SE +/- 2.51, N = 3482.0487.4483.6

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryptionr1r2r3110220330440550SE +/- 0.34, N = 3SE +/- 1.43, N = 3SE +/- 2.21, N = 3482.5486.3483.0

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryptionr1r2r37001400210028003500SE +/- 3.15, N = 3SE +/- 15.69, N = 3SE +/- 25.61, N = 33346.83381.93336.0

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryptionr1r2r37001400210028003500SE +/- 1.21, N = 3SE +/- 10.03, N = 3SE +/- 13.02, N = 33348.33388.53362.9

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryptionr1r2r32004006008001000SE +/- 0.83, N = 3SE +/- 0.87, N = 3SE +/- 4.25, N = 3878.0882.1874.4

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryptionr1r2r32004006008001000SE +/- 1.28, N = 3SE +/- 1.17, N = 3SE +/- 4.24, N = 3871.7878.1873.5

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryptionr1r2r3110220330440550SE +/- 0.10, N = 3SE +/- 1.44, N = 3SE +/- 2.34, N = 3482.7485.7483.0

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryptionr1r2r3110220330440550SE +/- 0.30, N = 2SE +/- 0.97, N = 3SE +/- 2.12, N = 3483.0486.4483.8

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCLr1r2r33K6K9K12K15KSE +/- 160.45, N = 3SE +/- 176.76, N = 3SE +/- 44.68, N = 31327713173134161. (CXX) g++ options: -flto -pthread

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Timer1r2r32M4M6M8M10MSE +/- 45086.65, N = 3SE +/- 7176.35, N = 3SE +/- 16578.83, N = 39497414958414895600121. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Timer1r2r32M4M6M8M10MSE +/- 85083.98, N = 8SE +/- 85742.14, N = 3SE +/- 67987.28, N = 129703133983929296293531. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthr1r2r33M6M9M12M15MSE +/- 174263.56, N = 3SE +/- 148124.86, N = 3SE +/- 142852.80, N = 3159847191597461116180674

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2r1r2r34080120160200SE +/- 0.23, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3186.46186.48186.62

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmarkr1r2r30.13880.27760.41640.55520.694SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 30.6170.6100.6141. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Proteinr1r2r31.16962.33923.50884.67845.848SE +/- 0.111, N = 15SE +/- 0.109, N = 15SE +/- 0.110, N = 155.1985.1695.1791. (CXX) g++ options: -O3 -pthread -lm

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOPr1r2r3700K1400K2100K2800K3500KSE +/- 36042.05, N = 3SE +/- 3702.86, N = 3SE +/- 181152.66, N = 123394660.202104092.332809233.481. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDr1r2r3600K1200K1800K2400K3000KSE +/- 28020.60, N = 3SE +/- 23332.27, N = 15SE +/- 27994.25, N = 32660539.422628039.252634908.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHr1r2r3400K800K1200K1600K2000KSE +/- 25221.07, N = 3SE +/- 21753.96, N = 4SE +/- 8925.21, N = 32041750.082094056.312083566.291. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETr1r2r3700K1400K2100K2800K3500KSE +/- 41615.25, N = 3SE +/- 13828.40, N = 3SE +/- 8077.93, N = 33248596.083012560.833009326.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETr1r2r3500K1000K1500K2000K2500KSE +/- 17218.21, N = 3SE +/- 3903.32, N = 3SE +/- 6859.51, N = 32375800.252413657.002433543.801. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Node.js V8 Web Tooling Benchmark

Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmarkr1r2r33691215SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 313.0613.1713.181. Nodejs v10.19.0

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUr1r2r350M100M150M200M250MSE +/- 1032565.22, N = 3SE +/- 157365.45, N = 3SE +/- 1449538.54, N = 3251986408.7252826584.8252822614.41. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total Scorer1r2r34080120160200189.09189.10189.32

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmarkr1r2r390180270360450SE +/- 1.54, N = 3SE +/- 0.84, N = 3SE +/- 0.70, N = 3419.58419.36417.03

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference Scorer1r2r3160320480640800730730730

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training Scorer1r2r32004006008001000816814814

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI Scorer1r2r330060090012001500154615441544

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suiter1r2r3200K400K600K800K1000KSE +/- 4346.11, N = 3SE +/- 2600.83, N = 3SE +/- 587.84, N = 3837911832417829705

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedupr1r2r30.83251.6652.49753.334.1625SE +/- 0.03, N = 3SE +/- 0.03, N = 15SE +/- 0.03, N = 153.72.53.61. (CC) gcc options: -fopenmp -O3 -lm

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metricr1r2r314K28K42K56K70K6390963822640331. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 Atomsr1r2r30.050.10.150.20.25SE +/- 0.00131, N = 3SE +/- 0.00245, N = 5SE +/- 0.00272, N = 40.221030.222380.22171

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot Readr1r2r3246810SE +/- 0.013, N = 3SE +/- 0.075, N = 3SE +/- 0.049, N = 36.9467.0997.1281. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Fill Syncr1r2r37001400210028003500SE +/- 33.91, N = 3SE +/- 25.98, N = 3SE +/- 60.32, N = 33361.783424.923386.081. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Overwriter1r2r3918273645SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 340.9340.9640.761. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Fillr1r2r3918273645SE +/- 0.19, N = 3SE +/- 0.20, N = 3SE +/- 0.07, N = 341.0441.0340.981. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Readr1r2r33691215SE +/- 0.250, N = 12SE +/- 0.206, N = 15SE +/- 0.214, N = 159.6209.6929.5731. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek Randomr1r2r33691215SE +/- 0.11, N = 15SE +/- 0.10, N = 15SE +/- 0.11, N = 1412.6912.6312.641. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random Deleter1r2r31122334455SE +/- 0.49, N = 5SE +/- 0.57, N = 4SE +/- 0.56, N = 447.2347.3047.391. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential Fillr1r2r31122334455SE +/- 0.54, N = 4SE +/- 0.58, N = 4SE +/- 0.48, N = 547.2447.2947.421. (CXX) g++ options: -O3 -lsnappy -lpthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetr1r2r380K160K240K320K400KSE +/- 2566.21, N = 3SE +/- 2576.61, N = 3SE +/- 2539.06, N = 3354892356034356258

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4r1r2r31.1M2.2M3.3M4.4M5.5MSE +/- 5618.75, N = 3SE +/- 7685.69, N = 3SE +/- 8609.77, N = 3516319051681835178263

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobiler1r2r370K140K210K280K350KSE +/- 3140.84, N = 3SE +/- 2025.87, N = 3SE +/- 1284.72, N = 3302594304756304079

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Floatr1r2r350K100K150K200K250KSE +/- 1996.41, N = 3SE +/- 1820.00, N = 3SE +/- 1638.46, N = 3239119239224239537

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quantr1r2r350K100K150K200K250KSE +/- 1686.36, N = 3SE +/- 1668.46, N = 3SE +/- 1810.35, N = 3236716237129237406

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2r1r2r31000K2000K3000K4000K5000KSE +/- 8775.31, N = 3SE +/- 8796.49, N = 3SE +/- 8398.83, N = 3466019746705674677473

DDraceNetwork

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.0 - Zoom: Default - Demo: Multeasymap - Total Frame Timer1r2r33691215Min: 2 / Avg: 2.43 / Max: 6.55Min: 2 / Avg: 2.46 / Max: 6.5Min: 2 / Avg: 2.39 / Max: 7.281. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Timer1r2r33691215Min: 2 / Avg: 2.3 / Max: 10.06Min: 2 / Avg: 2.32 / Max: 5.18Min: 2 / Avg: 2.32 / Max: 8.681. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLr1r2r348121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 317.4817.4817.481. (CXX) g++ options: -O3 -lOpenCL

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Doubler1r2r360120180240300SE +/- 0.20, N = 3SE +/- 0.11, N = 3SE +/- 0.20, N = 3256.87257.06257.621. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Singler1r2r3612182430SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 324.9925.1925.231. (CXX) g++ options: -O3 -pthread

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLr1r2r30.57351.1471.72052.2942.8675SE +/- 0.015, N = 3SE +/- 0.022, N = 3SE +/- 0.018, N = 32.5492.5312.5481. (CXX) g++ options: -rdynamic

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUr1r2r3246810SE +/- 0.05152, N = 3SE +/- 0.11582, N = 12SE +/- 0.02993, N = 37.165757.044047.14574MIN: 5.58MIN: 4.11MIN: 5.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUr1r2r33691215SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 312.4712.4412.61MIN: 12.08MIN: 12.09MIN: 12.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUr1r2r30.7151.432.1452.863.575SE +/- 0.01732, N = 3SE +/- 0.02081, N = 3SE +/- 0.06527, N = 123.177623.167693.11291MIN: 2.58MIN: 2.39MIN: 1.861. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUr1r2r30.62481.24961.87442.49923.124SE +/- 0.00400, N = 3SE +/- 0.01530, N = 3SE +/- 0.00352, N = 32.725582.776702.74874MIN: 2.54MIN: 2.56MIN: 2.541. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUr1r2r3510152025SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 321.6921.7021.62MIN: 21.47MIN: 21.48MIN: 21.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUr1r2r33691215SE +/- 0.04555, N = 3SE +/- 0.03928, N = 3SE +/- 0.03582, N = 39.775949.764689.73732MIN: 8.77MIN: 8.72MIN: 8.751. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUr1r2r33691215SE +/- 0.23621, N = 12SE +/- 0.15643, N = 15SE +/- 0.22537, N = 129.878939.777019.81238MIN: 6.66MIN: 6.67MIN: 6.651. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUr1r2r348121620SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 318.0117.9018.03MIN: 17.22MIN: 17.18MIN: 17.241. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUr1r2r33691215SE +/- 0.04374, N = 3SE +/- 0.01715, N = 3SE +/- 0.11418, N = 38.967829.006929.06628MIN: 8.14MIN: 8.15MIN: 81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUr1r2r31.06822.13643.20464.27285.341SE +/- 0.10403, N = 12SE +/- 0.06823, N = 15SE +/- 0.07477, N = 154.747724.714574.73728MIN: 3.29MIN: 3.29MIN: 3.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUr1r2r315003000450060007500SE +/- 12.55, N = 3SE +/- 1.75, N = 3SE +/- 6.55, N = 37155.417159.487169.03MIN: 7025.22MIN: 7040.61MIN: 7046.491. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUr1r2r38001600240032004000SE +/- 2.45, N = 3SE +/- 2.65, N = 3SE +/- 3.77, N = 33795.023797.053797.72MIN: 3682.24MIN: 3673.18MIN: 3684.191. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUr1r2r315003000450060007500SE +/- 2.95, N = 3SE +/- 4.70, N = 3SE +/- 6.73, N = 37140.507159.427151.58MIN: 7021.68MIN: 7041.4MIN: 7027.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUr1r2r38001600240032004000SE +/- 6.76, N = 3SE +/- 4.34, N = 3SE +/- 3.22, N = 33795.813800.413798.12MIN: 3687.23MIN: 3681.23MIN: 3685.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUr1r2r30.98671.97342.96013.94684.9335SE +/- 0.00310, N = 3SE +/- 0.00806, N = 3SE +/- 0.00559, N = 34.363814.378524.38535MIN: 4.23MIN: 4.25MIN: 4.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUr1r2r315003000450060007500SE +/- 3.89, N = 3SE +/- 0.92, N = 3SE +/- 2.23, N = 37144.237154.667147.09MIN: 7028.46MIN: 7035.88MIN: 7033.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUr1r2r38001600240032004000SE +/- 1.61, N = 3SE +/- 1.20, N = 3SE +/- 1.33, N = 33797.323799.453792.87MIN: 3686.53MIN: 3692.97MIN: 3672.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUr1r2r31.00592.01183.01774.02365.0295SE +/- 0.00967, N = 3SE +/- 0.01661, N = 3SE +/- 0.00726, N = 34.455644.470624.46656MIN: 4.02MIN: 4.02MIN: 4.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by ALibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2020-09-17Model: SqueezeNetV1.0r1r2r33691215SE +/- 0.373, N = 10SE +/- 0.316, N = 11SE +/- 0.373, N = 108.8998.9828.944MIN: 4.96 / MAX: 31.21MIN: 5.05 / MAX: 31.35MIN: 5.01 / MAX: 31.891. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2020-09-17Model: resnet-v2-50r1r2r31326395265SE +/- 0.40, N = 10SE +/- 0.35, N = 11SE +/- 0.40, N = 1058.1658.5358.79MIN: 36.86 / MAX: 81.73MIN: 37.33 / MAX: 83.74MIN: 36.87 / MAX: 85.771. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2020-09-17Model: MobileNetV2_224r1r2r31.19052.3813.57154.7625.9525SE +/- 0.210, N = 10SE +/- 0.185, N = 11SE +/- 0.209, N = 105.2395.2915.285MIN: 3.19 / MAX: 26.27MIN: 3.3 / MAX: 27.38MIN: 3.27 / MAX: 26.821. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2020-09-17Model: mobilenet-v1-1.0r1r2r33691215SE +/- 0.01, N = 10SE +/- 0.01, N = 11SE +/- 0.01, N = 1010.6510.6810.66MIN: 10.33 / MAX: 34.53MIN: 10.35 / MAX: 33.35MIN: 10.33 / MAX: 32.251. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2020-09-17Model: inception-v3r1r2r31428425670SE +/- 0.15, N = 10SE +/- 0.18, N = 11SE +/- 0.22, N = 1062.5763.1863.56MIN: 60.82 / MAX: 96.05MIN: 61.02 / MAX: 104.39MIN: 60.92 / MAX: 102.851. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenetr1r2r3612182430SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 326.6226.6326.53MIN: 25.69 / MAX: 38.05MIN: 25.7 / MAX: 41.21MIN: 25.78 / MAX: 41.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2r1r2r3246810SE +/- 0.67, N = 3SE +/- 0.73, N = 3SE +/- 0.73, N = 37.317.227.23MIN: 5.51 / MAX: 16.43MIN: 5.54 / MAX: 12.03MIN: 5.55 / MAX: 12.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3r1r2r31.30732.61463.92195.22926.5365SE +/- 0.65, N = 3SE +/- 0.65, N = 3SE +/- 0.62, N = 35.745.815.81MIN: 4.3 / MAX: 7.75MIN: 4.43 / MAX: 17.76MIN: 4.48 / MAX: 10.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2r1r2r3246810SE +/- 0.03, N = 3SE +/- 0.94, N = 3SE +/- 0.95, N = 37.936.957.03MIN: 7.52 / MAX: 16.61MIN: 5.01 / MAX: 9.68MIN: 5.04 / MAX: 20.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnetr1r2r3246810SE +/- 0.02, N = 3SE +/- 0.75, N = 3SE +/- 0.74, N = 36.675.965.96MIN: 5.99 / MAX: 21.18MIN: 4.32 / MAX: 14.32MIN: 4.33 / MAX: 28.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0r1r2r33691215SE +/- 0.05, N = 3SE +/- 0.96, N = 3SE +/- 0.96, N = 310.009.059.06MIN: 9.46 / MAX: 24.32MIN: 6.99 / MAX: 21.76MIN: 7.04 / MAX: 12.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazefacer1r2r30.5851.171.7552.342.925SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 32.542.602.57MIN: 2.35 / MAX: 2.74MIN: 2.45 / MAX: 10.37MIN: 2.45 / MAX: 2.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenetr1r2r3510152025SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 319.9820.0120.21MIN: 18.95 / MAX: 23.24MIN: 18.96 / MAX: 24.67MIN: 19.11 / MAX: 32.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16r1r2r31632486480SE +/- 0.20, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 372.0971.9171.86MIN: 70.5 / MAX: 88.28MIN: 70.43 / MAX: 92.47MIN: 70.48 / MAX: 881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18r1r2r3510152025SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 318.6218.7118.66MIN: 17.08 / MAX: 32.57MIN: 17.06 / MAX: 33.58MIN: 17.05 / MAX: 30.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnetr1r2r348121620SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 315.5015.4615.49MIN: 14.41 / MAX: 55.15MIN: 14.35 / MAX: 27.24MIN: 14.41 / MAX: 24.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50r1r2r3918273645SE +/- 0.51, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 337.8137.3037.22MIN: 34.04 / MAX: 52.8MIN: 33.91 / MAX: 56.28MIN: 33.9 / MAX: 52.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tinyr1r2r3816243240SE +/- 0.48, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 335.9535.5935.66MIN: 34.4 / MAX: 55.63MIN: 34.42 / MAX: 51.24MIN: 34.45 / MAX: 49.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssdr1r2r3714212835SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 327.6427.5127.63MIN: 27 / MAX: 40.14MIN: 26.93 / MAX: 43.6MIN: 27.02 / MAX: 46.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400mr1r2r3510152025SE +/- 0.06, N = 3SE +/- 0.24, N = 3SE +/- 0.10, N = 319.1618.9119.38MIN: 18.07 / MAX: 22.36MIN: 13.5 / MAX: 30.63MIN: 14.45 / MAX: 42.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenetr1r2r3612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 326.5226.5326.51MIN: 25.69 / MAX: 43.81MIN: 25.76 / MAX: 43.91MIN: 25.69 / MAX: 45.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2r1r2r3246810SE +/- 0.74, N = 3SE +/- 0.79, N = 3SE +/- 0.73, N = 37.237.227.19MIN: 5.54 / MAX: 9.59MIN: 5.41 / MAX: 20.72MIN: 5.52 / MAX: 9.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3r1r2r31.30732.61463.92195.22926.5365SE +/- 0.62, N = 3SE +/- 0.65, N = 3SE +/- 0.64, N = 35.745.735.81MIN: 4.43 / MAX: 9.64MIN: 4.33 / MAX: 10.47MIN: 4.41 / MAX: 25.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v2r1r2r3246810SE +/- 0.07, N = 3SE +/- 0.96, N = 3SE +/- 0.93, N = 37.926.987.05MIN: 7.27 / MAX: 20.3MIN: 4.98 / MAX: 27.09MIN: 5.04 / MAX: 20.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnetr1r2r3246810SE +/- 0.00, N = 3SE +/- 0.71, N = 3SE +/- 0.76, N = 36.635.865.91MIN: 6.21 / MAX: 8.85MIN: 4.3 / MAX: 15.47MIN: 4.32 / MAX: 7.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0r1r2r33691215SE +/- 0.10, N = 3SE +/- 0.95, N = 3SE +/- 0.94, N = 310.019.028.99MIN: 9.44 / MAX: 29.57MIN: 7 / MAX: 19.29MIN: 6.99 / MAX: 13.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazefacer1r2r30.57381.14761.72142.29522.869SE +/- 0.02, N = 3SE +/- 0.26, N = 3SE +/- 0.25, N = 32.552.292.29MIN: 2.43 / MAX: 2.76MIN: 1.68 / MAX: 8.91MIN: 1.69 / MAX: 12.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenetr1r2r3510152025SE +/- 0.06, N = 3SE +/- 1.77, N = 3SE +/- 1.84, N = 320.0518.2018.26MIN: 18.94 / MAX: 32.96MIN: 14.26 / MAX: 31.74MIN: 14.28 / MAX: 36.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16r1r2r31632486480SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 371.9671.8271.86MIN: 70.52 / MAX: 88.3MIN: 70.37 / MAX: 86.67MIN: 70.4 / MAX: 88.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18r1r2r3510152025SE +/- 0.00, N = 3SE +/- 0.34, N = 3SE +/- 0.27, N = 318.6218.3318.38MIN: 17.13 / MAX: 20.97MIN: 14.43 / MAX: 32.39MIN: 14.4 / MAX: 32.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnetr1r2r348121620SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 315.4415.5315.50MIN: 14.41 / MAX: 26.42MIN: 14.41 / MAX: 25.62MIN: 14.41 / MAX: 26.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet50r1r2r3918273645SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 337.2537.3437.26MIN: 34.07 / MAX: 48.19MIN: 33.97 / MAX: 56.32MIN: 33.79 / MAX: 52.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tinyr1r2r3816243240SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 335.5235.5135.53MIN: 34.38 / MAX: 51.44MIN: 33.05 / MAX: 50.05MIN: 32.99 / MAX: 52.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssdr1r2r3612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 327.5827.5227.55MIN: 26.94 / MAX: 43.23MIN: 26.95 / MAX: 42.6MIN: 26.92 / MAX: 41.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400mr1r2r3510152025SE +/- 0.09, N = 3SE +/- 1.83, N = 3SE +/- 1.77, N = 319.1617.1517.60MIN: 17.94 / MAX: 21.24MIN: 13.3 / MAX: 38.12MIN: 13.79 / MAX: 32.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2r1r2r370140210280350SE +/- 2.78, N = 8SE +/- 0.81, N = 3SE +/- 0.36, N = 3321.42295.55299.40MIN: 300.42 / MAX: 371.06MIN: 292.39 / MAX: 306.56MIN: 297.92 / MAX: 315.551. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1r1r2r360120180240300SE +/- 1.46, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 3272.91264.95272.68MIN: 264.43 / MAX: 277.05MIN: 264.07 / MAX: 268.01MIN: 271.53 / MAX: 277.61. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP16 - Device: CPUr1r2r37001400210028003500SE +/- 4.35, N = 3SE +/- 3.88, N = 3SE +/- 7.78, N = 33165.243166.573164.511. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP32 - Device: CPUr1r2r37001400210028003500SE +/- 2.58, N = 3SE +/- 1.22, N = 4SE +/- 2.51, N = 33202.533207.353212.101. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP16 - Device: CPUr1r2r311002200330044005500SE +/- 4.97, N = 3SE +/- 19.24, N = 3SE +/- 4.20, N = 34961.994978.255006.341. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP32 - Device: CPUr1r2r311002200330044005500SE +/- 15.43, N = 3SE +/- 9.68, N = 9SE +/- 14.45, N = 55069.445079.895073.091. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUr1r2r30.26780.53560.80341.07121.339SE +/- 0.00, N = 3SE +/- 0.00, N = 4SE +/- 0.00, N = 61.171.191.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP32 - Device: CPUr1r2r30.27680.55360.83041.10721.384SE +/- 0.00, N = 3SE +/- 0.00, N = 5SE +/- 0.00, N = 41.211.231.221. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: Nor1r2r348121620SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 314.7314.6614.69

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: Yesr1r2r320406080100SE +/- 0.31, N = 3SE +/- 0.48, N = 3SE +/- 0.35, N = 399.81100.62100.75

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yesr1r2r3246810SE +/- 0.004, N = 3SE +/- 0.007, N = 3SE +/- 0.011, N = 36.0206.1026.093

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highestr1r2r31.31722.63443.95165.26886.586SE +/- 0.068, N = 12SE +/- 0.008, N = 3SE +/- 0.024, N = 35.8545.7895.7921. (CXX) g++ options: -O3 -O2 -lpthread -ldl

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highestr1r2r3246810SE +/- 0.064, N = 13SE +/- 0.018, N = 3SE +/- 0.023, N = 38.0167.9127.9031. (CXX) g++ options: -O3 -O2 -lpthread -ldl

RedShift Demo

This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRedShift Demo 3.0r1r2r3100200300400500SE +/- 0.88, N = 3SE +/- 0.33, N = 3461460459

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle Filterr1r2r3246810SE +/- 0.065, N = 3SE +/- 0.013, N = 3SE +/- 0.016, N = 37.1157.0557.0271. (CXX) g++ options: -O2 -lOpenCL

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Searchr1r2r320406080100SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3105.53105.57105.511. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAr1r2r33691215SE +/- 0.08, N = 12SE +/- 0.10, N = 15SE +/- 0.10, N = 1410.5010.5610.611. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compiler1r2r320406080100SE +/- 0.78, N = 3SE +/- 0.39, N = 3SE +/- 0.30, N = 3100.26100.40100.20

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compiler1r2r3306090120150SE +/- 0.33, N = 3SE +/- 0.24, N = 3SE +/- 0.75, N = 3151.66152.21151.48

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compiler1r2r350100150200250SE +/- 0.40, N = 3SE +/- 0.49, N = 3SE +/- 0.85, N = 3210.05210.71210.95

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compiler1r2r31530456075SE +/- 0.16, N = 3SE +/- 0.30, N = 3SE +/- 0.22, N = 368.7467.5468.70

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDeepSpeech 0.6Acceleration: CPUr1r2r320406080100SE +/- 0.21, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 381.3081.0781.04

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APEr1r2r33691215SE +/- 0.03, N = 5SE +/- 0.04, N = 5SE +/- 0.01, N = 510.5110.8610.591. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encoder1r2r3246810SE +/- 0.009, N = 5SE +/- 0.004, N = 5SE +/- 0.008, N = 57.6247.6027.6161. (CXX) g++ options: -fvisibility=hidden -logg -lm

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesisr1r2r3714212835SE +/- 0.29, N = 4SE +/- 0.12, N = 4SE +/- 0.04, N = 426.4727.1827.711. (CC) gcc options: -O2 -std=c99 -lpthread -lm

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28r1r2r3510152025SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 322.0821.3222.041. (CC) gcc options: -O2 -pedantic -fvisibility=hidden -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fastr1r2r31.26682.53363.80045.06726.334SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 125.445.595.631. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Mediumr1r2r3246810SE +/- 0.14, N = 15SE +/- 0.11, N = 15SE +/- 0.16, N = 157.687.617.581. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thoroughr1r2r31224364860SE +/- 0.54, N = 3SE +/- 0.54, N = 3SE +/- 0.42, N = 354.2954.3854.651. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustiver1r2r3100200300400500SE +/- 0.52, N = 3SE +/- 0.81, N = 3SE +/- 0.54, N = 3447.99449.37449.901. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1Sr1r2r31326395265SE +/- 0.38, N = 3SE +/- 0.15, N = 3SE +/- 0.56, N = 357.8258.0658.061. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 0r1r2r3246810SE +/- 0.079, N = 3SE +/- 0.061, N = 3SE +/- 0.095, N = 37.2887.3457.3531. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2r1r2r31326395265SE +/- 0.55, N = 3SE +/- 0.41, N = 3SE +/- 0.58, N = 355.5055.7455.771. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3r1r2r320406080100SE +/- 0.55, N = 3SE +/- 0.55, N = 3SE +/- 0.53, N = 3110.84110.93111.041. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processingr1r2r32004006008001000SE +/- 0.74, N = 3SE +/- 0.35, N = 3SE +/- 0.62, N = 3840.35840.32841.231. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000r1r2r31122334455SE +/- 0.25, N = 3SE +/- 0.17, N = 3SE +/- 0.13, N = 349.5550.2750.261. (CC) gcc options: -O2 -ldl -lz -lpthread

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Boat - Acceleration: CPU-onlyr1r2r348121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 315.9115.8715.86

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Masskrug - Acceleration: CPU-onlyr1r2r3246810SE +/- 0.097, N = 12SE +/- 0.096, N = 12SE +/- 0.099, N = 127.1287.1507.155

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Rack - Acceleration: CPU-onlyr1r2r30.04070.08140.12210.16280.2035SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1810.1810.181

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Room - Acceleration: CPU-onlyr1r2r30.94071.88142.82213.76284.7035SE +/- 0.010, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 34.1814.1744.178

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Cropr1r2r3246810SE +/- 0.065, N = 11SE +/- 0.073, N = 9SE +/- 0.077, N = 88.9008.8398.826

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Scaler1r2r3246810SE +/- 0.055, N = 12SE +/- 0.059, N = 13SE +/- 0.056, N = 146.9546.9737.000

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Cartoonr1r2r320406080100SE +/- 0.12, N = 3SE +/- 0.19, N = 3SE +/- 0.09, N = 386.7987.3286.99

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Reflectr1r2r3714212835SE +/- 0.29, N = 3SE +/- 0.30, N = 3SE +/- 0.22, N = 328.1828.5028.31

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Antialiasr1r2r3816243240SE +/- 0.45, N = 3SE +/- 0.35, N = 3SE +/- 0.38, N = 336.5636.5636.65

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Tile Glassr1r2r3714212835SE +/- 0.36, N = 3SE +/- 0.27, N = 3SE +/- 0.39, N = 328.2428.2428.06

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Wavelet Blurr1r2r31326395265SE +/- 0.25, N = 3SE +/- 0.39, N = 3SE +/- 0.25, N = 357.9957.9557.84

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Color Enhancer1r2r31224364860SE +/- 0.22, N = 3SE +/- 0.04, N = 3SE +/- 0.28, N = 354.1154.3154.10

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Rotate 90 Degreesr1r2r3918273645SE +/- 0.31, N = 3SE +/- 0.36, N = 3SE +/- 0.43, N = 337.7037.5437.69

Inkscape

Inkscape is an open-source vector graphics editor. This test profile times how long it takes to complete various operations by Inkscape. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterInkscapeOperation: SVG Files To PNGr1r2r3510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 321.0021.0521.071. Inkscape 0.92.5 (2060ec1f9f, 2020-04-08)

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Timer1r2r320406080100SE +/- 0.53, N = 3SE +/- 0.46, N = 3SE +/- 0.45, N = 380.5980.9380.711. RawTherapee, version 5.8, command line.

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CUDAr1r2r320406080100SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 391.0090.8290.93

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CUDAr1r2r360120180240300SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3250.78251.90251.80

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CUDAr1r2r34080120160200SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3168.87167.96168.08

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CUDAr1r2r3160320480640800SE +/- 0.24, N = 3SE +/- 0.26, N = 3SE +/- 0.41, N = 3734.81731.67733.02

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiXr1r2r3918273645SE +/- 3.33, N = 15SE +/- 0.02, N = 3SE +/- 0.05, N = 341.4738.0738.07

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiXr1r2r3306090120150SE +/- 0.13, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3116.76116.15116.26

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiXr1r2r31428425670SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 360.3560.1860.25

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiXr1r2r330060090012001500SE +/- 0.44, N = 3SE +/- 0.85, N = 3SE +/- 2.01, N = 31192.961190.051192.80

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CUDAr1r2r3130260390520650SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3608.80609.56608.62

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXr1r2r34080120160200SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3196.21196.28196.41

Unpacking Firefox

This simple test profile measures how long it takes to extract the .tar.xz source package of the Mozilla Firefox Web Browser. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xzr1r2r348121620SE +/- 0.08, N = 4SE +/- 0.09, N = 4SE +/- 0.14, N = 416.0316.1416.10

253 Results Shown

VkFFT
dav1d:
  Chimera 1080p
  Summer Nature 4K
  Summer Nature 1080p
  Chimera 1080p 10-bit
PlaidML:
  No - Inference - IMDB LSTM - OpenCL
  No - Inference - Mobilenet - OpenCL
  Yes - Inference - Mobilenet - OpenCL
  No - Inference - DenseNet 201 - OpenCL
OpenVINO:
  Face Detection 0106 FP16 - CPU
  Face Detection 0106 FP32 - CPU
  Person Detection 0106 FP16 - CPU
  Person Detection 0106 FP32 - CPU
  Age Gender Recognition Retail 0013 FP16 - CPU
  Age Gender Recognition Retail 0013 FP32 - CPU
NeatBench
DDraceNetwork:
  1920 x 1080 - Fullscreen - OpenGL 3.0 - Default - RaiNyMore2
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2
  1920 x 1080 - Fullscreen - OpenGL 3.0 - Default - Multeasymap
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymap
Unigine Heaven
Unigine Superposition:
  1920 x 1080 - Fullscreen - Low - OpenGL
  1920 x 1080 - Fullscreen - High - OpenGL
  1920 x 1080 - Fullscreen - Ultra - OpenGL
  1920 x 1080 - Fullscreen - Medium - OpenGL
Warsow
yquake2:
  OpenGL 1.x - 1920 x 1080
  OpenGL 3.x - 1920 x 1080
  Software CPU - 1920 x 1080
Embree:
  Pathtracer - Crown
  Pathtracer ISPC - Crown
  Pathtracer - Asian Dragon
  Pathtracer ISPC - Asian Dragon
rav1e:
  1
  5
  6
  10
cl-mem:
  Copy
  Read
  Write
simdjson:
  Kostya
  LargeRand
  PartialTweets
  DistinctUserID
clpeak
High Performance Conjugate Gradient
ViennaCL
clpeak:
  Single-Precision Float
  Double-Precision Double
  Integer Compute INT
Hashcat:
  MD5
  SHA1
  7-Zip
  SHA-512
  TrueCrypt RIPEMD160 + XTS
GraphicsMagick:
  Swirl
  Rotate
  Sharpen
  Enhanced
  Resizing
  Noise-Gaussian
  HWB Color Space
Cryptsetup:
  PBKDF2-sha512
  PBKDF2-whirlpool
Coremark
IndigoBench:
  CPU - Bedroom
  CPU - Supercar
LuxCoreRender OpenCL:
  DLSC
  Food
  LuxCore Benchmark
  Rainbow Colors and Prism
LevelDB:
  Fill Sync
  Overwrite
  Rand Fill
  Seq Fill
LZ4 Compression:
  1 - Compression Speed
  1 - Decompression Speed
  3 - Compression Speed
  3 - Decompression Speed
  9 - Compression Speed
  9 - Decompression Speed
Zstd Compression:
  3
  19
Cryptsetup:
  AES-XTS 256b Encryption
  AES-XTS 256b Decryption
  Serpent-XTS 256b Encryption
  Serpent-XTS 256b Decryption
  Twofish-XTS 256b Encryption
  Twofish-XTS 256b Decryption
  AES-XTS 512b Encryption
  AES-XTS 512b Decryption
  Serpent-XTS 512b Encryption
  Serpent-XTS 512b Decryption
  Twofish-XTS 512b Decryption
  Twofish-XTS 512b Encryption
LeelaChessZero
Crafty
Stockfish
asmFish
FAHBench
GROMACS
LAMMPS Molecular Dynamics Simulator
Redis:
  LPOP
  SADD
  LPUSH
  GET
  SET
Node.js V8 Web Tooling Benchmark
MandelGPU
OctaneBench
Numpy Benchmark
AI Benchmark Alpha:
  Device Inference Score
  Device Training Score
  Device AI Score
PHPBench
CLOMP
BRL-CAD
NAMD CUDA
LevelDB:
  Hot Read
  Fill Sync
  Overwrite
  Rand Fill
  Rand Read
  Seek Rand
  Rand Delete
  Seq Fill
TensorFlow Lite:
  SqueezeNet
  Inception V4
  NASNet Mobile
  Mobilenet Float
  Mobilenet Quant
  Inception ResNet V2
DDraceNetwork:
  1920 x 1080 - Fullscreen - OpenGL 3.0 - Default - Multeasymap - Total Frame Time
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymap - Total Frame Time
FinanceBench
VkResample:
  2x - Double
  2x - Single
ArrayFire
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
Mobile Neural Network:
  SqueezeNetV1.0
  resnet-v2-50
  MobileNetV2_224
  mobilenet-v1-1.0
  inception-v3
NCNN:
  CPU - mobilenet
  CPU-v2-v2 - mobilenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU - shufflenet-v2
  CPU - mnasnet
  CPU - efficientnet-b0
  CPU - blazeface
  CPU - googlenet
  CPU - vgg16
  CPU - resnet18
  CPU - alexnet
  CPU - resnet50
  CPU - yolov4-tiny
  CPU - squeezenet_ssd
  CPU - regnety_400m
  Vulkan GPU - mobilenet
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU - shufflenet-v2
  Vulkan GPU - mnasnet
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - blazeface
  Vulkan GPU - googlenet
  Vulkan GPU - vgg16
  Vulkan GPU - resnet18
  Vulkan GPU - alexnet
  Vulkan GPU - resnet50
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - regnety_400m
TNN:
  CPU - MobileNet v2
  CPU - SqueezeNet v1.1
OpenVINO:
  Face Detection 0106 FP16 - CPU
  Face Detection 0106 FP32 - CPU
  Person Detection 0106 FP16 - CPU
  Person Detection 0106 FP32 - CPU
  Age Gender Recognition Retail 0013 FP16 - CPU
  Age Gender Recognition Retail 0013 FP32 - CPU
RealSR-NCNN:
  4x - No
  4x - Yes
Waifu2x-NCNN Vulkan
Betsy GPU Compressor:
  ETC1 - Highest
  ETC2 RGB - Highest
RedShift Demo
Rodinia
Timed HMMer Search
Timed MAFFT Alignment
Timed FFmpeg Compilation
Timed Linux Kernel Compilation
Build2
Timed Eigen Compilation
DeepSpeech
Monkey Audio Encoding
Opus Codec Encoding
eSpeak-NG Speech Engine
RNNoise
ASTC Encoder:
  Fast
  Medium
  Thorough
  Exhaustive
Basis Universal:
  ETC1S
  UASTC Level 0
  UASTC Level 2
  UASTC Level 3
  UASTC Level 2 + RDO Post-Processing
SQLite Speedtest
Darktable:
  Boat - CPU-only
  Masskrug - CPU-only
  Server Rack - CPU-only
  Server Room - CPU-only
GEGL:
  Crop
  Scale
  Cartoon
  Reflect
  Antialias
  Tile Glass
  Wavelet Blur
  Color Enhance
  Rotate 90 Degrees
Inkscape
RawTherapee
Blender:
  BMW27 - CUDA
  Classroom - CUDA
  Fishy Cat - CUDA
  Barbershop - CUDA
  BMW27 - NVIDIA OptiX
  Classroom - NVIDIA OptiX
  Fishy Cat - NVIDIA OptiX
  Barbershop - NVIDIA OptiX
  Pabellon Barcelona - CUDA
  Pabellon Barcelona - NVIDIA OptiX
Unpacking Firefox