Broadwell 2021

Intel Core i7-5600U testing with a LENOVO 20BSCTO1WW (N14ET49W 1.27 BIOS) and Intel HD 5500 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101027-HA-BROADWELL87&grr&sor.

Broadwell 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionR123Intel Core i7-5600U @ 3.20GHz (2 Cores / 4 Threads)LENOVO 20BSCTO1WW (N14ET49W 1.27 BIOS)Intel Broadwell-U-OPI8GB128GB SAMSUNG MZNTE128Intel HD 5500 3GB (950MHz)Intel Broadwell-U AudioIntel I218-LM + Intel 7265Ubuntu 20.105.9.1-050901-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 21.0.0-devel (git-bd69765 2021-01-01 groovy-oibaf-ppa)OpenCL 3.01.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x2f - Thermald 2.3Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Broadwell 2021vkfft: build2: Time To Compilebuild-ffmpeg: Time To Compileclomp: Static OMP Speedupbrl-cad: VGR Performance Metricncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUwarsow: 1280 x 1024vkmark: 1920 x 1080vkmark: 1280 x 1024vkmark: 800 x 600vkmark: 1024 x 768hmmer: Pfam Database Searchonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUbuild-eigen: Time To Compilenode-web-tooling: sqlite-speedtest: Timed Time - Size 1,000unpack-firefox: firefox-84.0.source.tar.xzwarsow: 1920 x 1080simdjson: Kostyasimdjson: LargeRandlibplacebo: av1_grain_laplibplacebo: hdr_peakdetectlibplacebo: polar_nocomputelibplacebo: deband_heavysimdjson: PartialTweetssimdjson: DistinctUserIDphpbench: PHP Benchmark Suiteonednn: IP Shapes 3D - u8s8f32 - CPUbetsy: ETC1 - Highestbetsy: ETC2 RGB - Highestencode-wavpack: WAV To WavPackcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: AES-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpoolcryptsetup: PBKDF2-sha512encode-ape: WAV To APEencode-ogg: WAV To Oggcoremark: CoreMark Size 666 - Iterations Per Secondonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUvkresample: 2x - Doublevkresample: 2x - Singleencode-opus: WAV To Opus Encodemafft: Multiple Sequence Alignment - LSU RNAonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUR1231126919.504341.9281.31458027.8271.3379.44101.1136.8445.51153.1847.185.6422.5014.2621.4012.6314.5561.4927.8171.2379.35101.5537.0845.64153.0047.335.6522.4614.4521.5312.6914.5661.5520853.020903.320830.570.746569316761096129.00611261.411309.111297.4116.4137.2495.58127.46245.60.490.33537.1132875.8923.7338.650.580.595291007.0195216.14816.18518.946315.0314.9488.7505.21285.11304.9314.8313.2489.0505.11577.41577.5513672126334816.84927.95560703.73328932.113729.1499152.680152.24010.89417.00821.004013.294710.109415.453220.908033.884027.796626.201435.35401122918.462340.6391.21456227.7271.0379.04101.1737.0245.65152.8847.015.6222.1114.0521.3412.5614.4461.3827.1370.9778.90101.2236.7045.41152.9846.265.4821.9714.1321.3612.5714.4761.4120912.720853.420852.070.546168516911097129.09211332.211320.111316.9116.8607.3094.57727.40445.80.490.33534.5132911.3323.7238.610.580.595284476.8080916.17716.22118.948315.5315.7490.5506.81289.31287.5316.0314.3488.6505.41586.71581.4513001126284316.84327.94260082.67301931.695629.4045155.244152.37710.90317.19621.087513.33259.1228715.544820.517233.500327.746326.222535.63731124906.249340.2011.31450427.6471.2678.88101.0136.8145.51152.9147.165.6322.3014.1621.3012.6514.5361.6127.2571.1879.12101.2336.7645.73152.9346.705.4722.0113.9821.2112.6114.5461.4720847.720858.020812.269.546368916701092129.22011243.511214.911200.4116.0357.3195.18727.36445.80.490.33534.1532916.0823.7338.630.580.595302306.8142316.20816.24418.946315.4314.4487.8506.41273.41285.9315.9315.2488.7503.41559.61590.4513336126334516.86827.92460345.61161631.969129.4752156.008153.24510.92217.36921.077213.460110.0792815.563820.625133.473627.301026.248635.8033OpenBenchmarking.org

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1R1322004006008001000SE +/- 2.52, N = 3SE +/- 1.45, N = 3SE +/- 2.00, N = 31126112411221. (CXX) g++ options: -O3 -pthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile32R12004006008001000SE +/- 0.82, N = 3SE +/- 0.55, N = 3SE +/- 0.32, N = 3906.25918.46919.50

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile32R170140210280350SE +/- 0.31, N = 3SE +/- 0.15, N = 3SE +/- 0.25, N = 3340.20340.64341.93

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup3R120.29250.5850.87751.171.4625SE +/- 0.00, N = 3SE +/- 0.01, N = 12SE +/- 0.00, N = 31.31.31.21. (CC) gcc options: -fopenmp -O3 -lm

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricR1233K6K9K12K15K1458014562145041. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m32R1714212835SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 327.6427.7227.82MIN: 27.49 / MAX: 29.98MIN: 27.48 / MAX: 31MIN: 27.57 / MAX: 37.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd23R11632486480SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 371.0371.2671.33MIN: 70.28 / MAX: 76.92MIN: 70.52 / MAX: 78.34MIN: 70.87 / MAX: 74.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny32R120406080100SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.27, N = 378.8879.0479.44MIN: 78.2 / MAX: 90.69MIN: 78.34 / MAX: 85.24MIN: 78.27 / MAX: 87.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet503R1220406080100SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3101.01101.11101.17MIN: 100.5 / MAX: 114.21MIN: 100.49 / MAX: 113.37MIN: 100.65 / MAX: 111.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet3R12918273645SE +/- 0.09, N = 3SE +/- 0.18, N = 3SE +/- 0.15, N = 336.8136.8437.02MIN: 34.96 / MAX: 79.9MIN: 35.27 / MAX: 39.39MIN: 35.32 / MAX: 103.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18R1321020304050SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 345.5145.5145.65MIN: 45.16 / MAX: 47.35MIN: 45.23 / MAX: 48.14MIN: 45.26 / MAX: 56.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1623R1306090120150SE +/- 0.24, N = 3SE +/- 0.19, N = 3SE +/- 0.18, N = 3152.88152.91153.18MIN: 151.28 / MAX: 160.12MIN: 151.95 / MAX: 164.13MIN: 152.21 / MAX: 161.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet23R11122334455SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 347.0147.1647.18MIN: 46.74 / MAX: 49.94MIN: 46.7 / MAX: 59.52MIN: 46.85 / MAX: 56.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface23R11.2692.5383.8075.0766.345SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.625.635.64MIN: 5.54 / MAX: 6.01MIN: 5.5 / MAX: 5.84MIN: 5.57 / MAX: 5.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b023R1510152025SE +/- 0.35, N = 3SE +/- 0.39, N = 3SE +/- 0.41, N = 322.1122.3022.50MIN: 20.59 / MAX: 33.72MIN: 21.02 / MAX: 25.56MIN: 20.99 / MAX: 25.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet23R148121620SE +/- 0.41, N = 3SE +/- 0.40, N = 3SE +/- 0.37, N = 314.0514.1614.26MIN: 13.01 / MAX: 18.86MIN: 13.06 / MAX: 28.01MIN: 13.24 / MAX: 17.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v232R1510152025SE +/- 0.57, N = 3SE +/- 0.63, N = 3SE +/- 0.57, N = 321.3021.3421.40MIN: 20.01 / MAX: 23.24MIN: 20 / MAX: 24.44MIN: 20.12 / MAX: 23.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v32R133691215SE +/- 0.13, N = 3SE +/- 0.19, N = 3SE +/- 0.19, N = 312.5612.6312.65MIN: 12.21 / MAX: 15.08MIN: 12.19 / MAX: 14.68MIN: 12.18 / MAX: 14.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v223R148121620SE +/- 0.13, N = 3SE +/- 0.18, N = 3SE +/- 0.19, N = 314.4414.5314.55MIN: 14.04 / MAX: 16.1MIN: 13.97 / MAX: 26.42MIN: 14.04 / MAX: 28.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet2R131428425670SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.16, N = 361.3861.4961.61MIN: 61.05 / MAX: 87.54MIN: 61.05 / MAX: 63.53MIN: 61.11 / MAX: 113.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m23R1714212835SE +/- 0.60, N = 3SE +/- 0.53, N = 3SE +/- 0.07, N = 327.1327.2527.81MIN: 25.79 / MAX: 37.93MIN: 26 / MAX: 29.97MIN: 26.38 / MAX: 30.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd23R11632486480SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 370.9771.1871.23MIN: 70.58 / MAX: 80.98MIN: 70.63 / MAX: 78.41MIN: 70.8 / MAX: 121.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny23R120406080100SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 378.9079.1279.35MIN: 78.21 / MAX: 91.76MIN: 78.31 / MAX: 86.73MIN: 78.46 / MAX: 91.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet5023R120406080100SE +/- 0.21, N = 3SE +/- 0.28, N = 3SE +/- 0.16, N = 3101.22101.23101.55MIN: 100.01 / MAX: 110.27MIN: 100.55 / MAX: 114.34MIN: 100.72 / MAX: 112.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet23R1918273645SE +/- 0.23, N = 3SE +/- 0.12, N = 3SE +/- 0.27, N = 336.7036.7637.08MIN: 35.28 / MAX: 38.63MIN: 35.25 / MAX: 46.5MIN: 34.94 / MAX: 39.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet182R131020304050SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.29, N = 345.4145.6445.73MIN: 45.11 / MAX: 48.07MIN: 45.25 / MAX: 47.78MIN: 45.08 / MAX: 48.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1632R1306090120150SE +/- 0.31, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 3152.93152.98153.00MIN: 151.38 / MAX: 166.09MIN: 152.01 / MAX: 164.13MIN: 151.95 / MAX: 164.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet23R11122334455SE +/- 0.81, N = 3SE +/- 0.77, N = 3SE +/- 0.16, N = 346.2646.7047.33MIN: 43.6 / MAX: 59.76MIN: 44.45 / MAX: 57.93MIN: 46.92 / MAX: 49.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface32R11.27132.54263.81395.08526.3565SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.01, N = 35.475.485.65MIN: 5.12 / MAX: 5.79MIN: 5.14 / MAX: 5.72MIN: 5.56 / MAX: 7.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b023R1510152025SE +/- 0.66, N = 3SE +/- 0.59, N = 3SE +/- 0.30, N = 321.9722.0122.46MIN: 20.55 / MAX: 27.24MIN: 20.61 / MAX: 33.2MIN: 21.52 / MAX: 24.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet32R148121620SE +/- 0.45, N = 3SE +/- 0.47, N = 3SE +/- 0.26, N = 313.9814.1314.45MIN: 13.03 / MAX: 14.74MIN: 13.1 / MAX: 14.93MIN: 13.46 / MAX: 16.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v232R1510152025SE +/- 0.57, N = 3SE +/- 0.58, N = 3SE +/- 0.37, N = 321.2121.3621.53MIN: 19.96 / MAX: 25.67MIN: 19.98 / MAX: 35.94MIN: 20.55 / MAX: 22.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v323R13691215SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 312.5712.6112.69MIN: 12.24 / MAX: 14.08MIN: 12.23 / MAX: 13.54MIN: 12.21 / MAX: 24.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v223R148121620SE +/- 0.15, N = 3SE +/- 0.20, N = 3SE +/- 0.14, N = 314.4714.5414.56MIN: 14.06 / MAX: 28.34MIN: 14.01 / MAX: 17.8MIN: 14.07 / MAX: 17.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet23R11428425670SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 361.4161.4761.55MIN: 61.02 / MAX: 63.86MIN: 61.15 / MAX: 64.55MIN: 61.05 / MAX: 64.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3R124K8K12K16K20KSE +/- 5.57, N = 3SE +/- 16.92, N = 3SE +/- 36.91, N = 320847.720853.020912.7MIN: 20792.3MIN: 20801.4MIN: 20792.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU23R14K8K12K16K20KSE +/- 9.04, N = 3SE +/- 3.46, N = 3SE +/- 20.57, N = 320853.420858.020903.3MIN: 20778.6MIN: 20799.8MIN: 20835.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU3R124K8K12K16K20KSE +/- 37.82, N = 3SE +/- 29.30, N = 3SE +/- 51.92, N = 320812.220830.520852.0MIN: 20694.5MIN: 20740.9MIN: 20691.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Warsow

Resolution: 1280 x 1024

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1280 x 1024R1231632486480SE +/- 0.09, N = 3SE +/- 0.44, N = 3SE +/- 0.63, N = 1370.770.569.5

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 1080R132100200300400500SE +/- 1.76, N = 3SE +/- 2.33, N = 3SE +/- 0.67, N = 34654634611. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 1280 x 1024

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1280 x 1024R132150300450600750SE +/- 0.67, N = 3SE +/- 1.53, N = 3SE +/- 2.19, N = 36936896851. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 800 x 600

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 800 x 6002R13400800120016002000SE +/- 0.88, N = 3SE +/- 7.54, N = 3SE +/- 10.20, N = 31691167616701. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 1024 x 768

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1024 x 7682R132004006008001000SE +/- 6.11, N = 3SE +/- 6.69, N = 3SE +/- 4.18, N = 31097109610921. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchR123306090120150SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 3129.01129.09129.221. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU3R122K4K6K8K10KSE +/- 44.20, N = 3SE +/- 48.62, N = 3SE +/- 15.69, N = 311243.511261.411332.2MIN: 11158.2MIN: 11146.6MIN: 11227.91. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU3R122K4K6K8K10KSE +/- 13.64, N = 3SE +/- 65.97, N = 3SE +/- 44.81, N = 311214.911309.111320.1MIN: 11171.5MIN: 11154.3MIN: 11234.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU3R122K4K6K8K10KSE +/- 18.25, N = 3SE +/- 25.05, N = 3SE +/- 15.94, N = 311200.411297.411316.9MIN: 11129.9MIN: 11225.5MIN: 11169.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile3R12306090120150SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3116.04116.41116.86

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark32R1246810SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 37.317.307.241. Nodejs v12.18.2

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00023R120406080100SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.57, N = 394.5895.1995.581. (CC) gcc options: -O2 -ldl -lz -lpthread

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz32R1612182430SE +/- 0.16, N = 19SE +/- 0.32, N = 6SE +/- 0.33, N = 627.3627.4027.46

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 108032R11020304050SE +/- 0.20, N = 3SE +/- 0.13, N = 3SE +/- 0.13, N = 345.845.845.6

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya32R10.11030.22060.33090.44120.5515SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.490.490.491. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom32R10.07430.14860.22290.29720.3715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.330.331. (CXX) g++ options: -O3 -pthread

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lapR123120240360480600SE +/- 0.34, N = 3SE +/- 1.77, N = 3SE +/- 1.17, N = 3537.11534.51534.151. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect32R17K14K21K28K35KSE +/- 401.65, N = 3SE +/- 358.35, N = 3SE +/- 290.63, N = 332916.0832911.3332875.891. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute3R12612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 323.7323.7323.721. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavyR132918273645SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 338.6538.6338.611. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets32R10.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID32R10.13280.26560.39840.53120.664SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.590.590.591. (CXX) g++ options: -O3 -pthread

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite3R12110K220K330K440K550KSE +/- 222.67, N = 3SE +/- 498.65, N = 3SE +/- 388.18, N = 3530230529100528447

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU23R1246810SE +/- 0.09974, N = 3SE +/- 0.10395, N = 15SE +/- 0.06965, N = 156.808096.814237.01952MIN: 6.36MIN: 6.35MIN: 6.611. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: HighestR12348121620SE +/- 0.14, N = 13SE +/- 0.23, N = 4SE +/- 0.27, N = 316.1516.1816.211. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: HighestR12348121620SE +/- 0.20, N = 13SE +/- 0.26, N = 3SE +/- 0.28, N = 316.1916.2216.241. (CXX) g++ options: -O3 -O2 -lpthread -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackR132510152025SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 518.9518.9518.951. (CXX) g++ options: -rdynamic

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption23R170140210280350SE +/- 1.09, N = 3SE +/- 0.48, N = 3SE +/- 0.58, N = 3315.5315.4315.0

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption2R1370140210280350SE +/- 0.57, N = 3SE +/- 0.73, N = 3SE +/- 0.15, N = 3315.7314.9314.4

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption2R13110220330440550SE +/- 1.01, N = 3SE +/- 1.22, N = 3SE +/- 0.51, N = 3490.5488.7487.8

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption23R1110220330440550SE +/- 0.47, N = 3SE +/- 1.01, N = 3SE +/- 0.87, N = 3506.8506.4505.2

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption2R1330060090012001500SE +/- 13.32, N = 3SE +/- 7.00, N = 3SE +/- 6.22, N = 31289.31285.11273.4

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b EncryptionR12330060090012001500SE +/- 6.50, N = 3SE +/- 6.69, N = 3SE +/- 2.40, N = 31304.91287.51285.9

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption23R170140210280350SE +/- 0.26, N = 3SE +/- 0.86, N = 3SE +/- 0.48, N = 3316.0315.9314.8

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption32R170140210280350SE +/- 0.35, N = 3SE +/- 0.43, N = 3SE +/- 0.84, N = 3315.2314.3313.2

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b DecryptionR132110220330440550SE +/- 0.20, N = 3SE +/- 1.51, N = 3SE +/- 1.59, N = 3489.0488.7488.6

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption2R13110220330440550SE +/- 0.92, N = 3SE +/- 0.99, N = 3SE +/- 1.12, N = 3505.4505.1503.4

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption2R1330060090012001500SE +/- 15.21, N = 3SE +/- 11.32, N = 3SE +/- 11.68, N = 31586.71577.41559.6

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption32R130060090012001500SE +/- 1.02, N = 3SE +/- 15.55, N = 3SE +/- 19.58, N = 31590.41581.41577.5

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpoolR132110K220K330K440K550KSE +/- 335.33, N = 3SE +/- 335.33, N = 3513672513336513001

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512R132300K600K900K1200K1500KSE +/- 1520.33, N = 3SE +/- 878.73, N = 3SE +/- 2024.67, N = 3126334812633451262843

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE2R1348121620SE +/- 0.06, N = 5SE +/- 0.06, N = 5SE +/- 0.04, N = 516.8416.8516.871. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg32R1714212835SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 327.9227.9427.961. (CC) gcc options: -O2 -ffast-math -fsigned-char

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondR13213K26K39K52K65KSE +/- 220.80, N = 3SE +/- 283.31, N = 3SE +/- 445.62, N = 360703.7360345.6160082.671. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU23R1714212835SE +/- 0.34, N = 3SE +/- 0.27, N = 3SE +/- 0.29, N = 331.7031.9732.11MIN: 30.17MIN: 30.65MIN: 30.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUR123714212835SE +/- 0.32, N = 3SE +/- 0.29, N = 3SE +/- 0.24, N = 329.1529.4029.48MIN: 27.38MIN: 28MIN: 27.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleR123306090120150SE +/- 1.71, N = 3SE +/- 1.45, N = 3SE +/- 0.61, N = 3152.68155.24156.011. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleR123306090120150SE +/- 0.49, N = 3SE +/- 1.86, N = 3SE +/- 1.40, N = 3152.24152.38153.251. (CXX) g++ options: -O3 -pthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeR1233691215SE +/- 0.03, N = 5SE +/- 0.02, N = 5SE +/- 0.04, N = 510.8910.9010.921. (CXX) g++ options: -fvisibility=hidden -logg -lm

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAR12348121620SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 317.0117.2017.371. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUR132510152025SE +/- 0.31, N = 3SE +/- 0.23, N = 3SE +/- 0.27, N = 321.0021.0821.09MIN: 19.61MIN: 19.85MIN: 20.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUR1233691215SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 313.2913.3313.46MIN: 12.99MIN: 13.01MIN: 13.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU23R13691215SE +/- 0.02320, N = 3SE +/- 0.13687, N = 4SE +/- 0.04347, N = 39.1228710.0792810.10940MIN: 8.62MIN: 9.14MIN: 9.651. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUR12348121620SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 315.4515.5415.56MIN: 14.57MIN: 14.82MIN: 14.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU23R1510152025SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.18, N = 320.5220.6320.91MIN: 20.16MIN: 20.29MIN: 20.391. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU32R1816243240SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 333.4733.5033.88MIN: 33.25MIN: 33.31MIN: 33.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU32R1714212835SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 327.3027.7527.80MIN: 26.98MIN: 27.23MIN: 27.491. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUR123612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 326.2026.2226.25MIN: 26.09MIN: 26.12MIN: 26.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUR123816243240SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 335.3535.6435.80MIN: 33.41MIN: 33.73MIN: 33.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.4