Broadwell 2021

Intel Core i7-5600U testing with a LENOVO 20BSCTO1WW (N14ET49W 1.27 BIOS) and Intel HD 5500 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101027-HA-BROADWELL87&rdt&grr.

Broadwell 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionR123Intel Core i7-5600U @ 3.20GHz (2 Cores / 4 Threads)LENOVO 20BSCTO1WW (N14ET49W 1.27 BIOS)Intel Broadwell-U-OPI8GB128GB SAMSUNG MZNTE128Intel HD 5500 3GB (950MHz)Intel Broadwell-U AudioIntel I218-LM + Intel 7265Ubuntu 20.105.9.1-050901-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 21.0.0-devel (git-bd69765 2021-01-01 groovy-oibaf-ppa)OpenCL 3.01.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x2f - Thermald 2.3Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Broadwell 2021vkfft: build2: Time To Compilebuild-ffmpeg: Time To Compileclomp: Static OMP Speedupbrl-cad: VGR Performance Metricncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUwarsow: 1280 x 1024vkmark: 1920 x 1080vkmark: 1280 x 1024vkmark: 800 x 600vkmark: 1024 x 768hmmer: Pfam Database Searchonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUbuild-eigen: Time To Compilenode-web-tooling: sqlite-speedtest: Timed Time - Size 1,000unpack-firefox: firefox-84.0.source.tar.xzwarsow: 1920 x 1080simdjson: Kostyasimdjson: LargeRandlibplacebo: av1_grain_laplibplacebo: hdr_peakdetectlibplacebo: polar_nocomputelibplacebo: deband_heavysimdjson: PartialTweetssimdjson: DistinctUserIDphpbench: PHP Benchmark Suiteonednn: IP Shapes 3D - u8s8f32 - CPUbetsy: ETC1 - Highestbetsy: ETC2 RGB - Highestencode-wavpack: WAV To WavPackcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: AES-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpoolcryptsetup: PBKDF2-sha512encode-ape: WAV To APEencode-ogg: WAV To Oggcoremark: CoreMark Size 666 - Iterations Per Secondonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUvkresample: 2x - Doublevkresample: 2x - Singleencode-opus: WAV To Opus Encodemafft: Multiple Sequence Alignment - LSU RNAonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUR1231126919.504341.9281.31458027.8271.3379.44101.1136.8445.51153.1847.185.6422.5014.2621.4012.6314.5561.4927.8171.2379.35101.5537.0845.64153.0047.335.6522.4614.4521.5312.6914.5661.5520853.020903.320830.570.746569316761096129.00611261.411309.111297.4116.4137.2495.58127.46245.60.490.33537.1132875.8923.7338.650.580.595291007.0195216.14816.18518.946315.0314.9488.7505.21285.11304.9314.8313.2489.0505.11577.41577.5513672126334816.84927.95560703.73328932.113729.1499152.680152.24010.89417.00821.004013.294710.109415.453220.908033.884027.796626.201435.35401122918.462340.6391.21456227.7271.0379.04101.1737.0245.65152.8847.015.6222.1114.0521.3412.5614.4461.3827.1370.9778.90101.2236.7045.41152.9846.265.4821.9714.1321.3612.5714.4761.4120912.720853.420852.070.546168516911097129.09211332.211320.111316.9116.8607.3094.57727.40445.80.490.33534.5132911.3323.7238.610.580.595284476.8080916.17716.22118.948315.5315.7490.5506.81289.31287.5316.0314.3488.6505.41586.71581.4513001126284316.84327.94260082.67301931.695629.4045155.244152.37710.90317.19621.087513.33259.1228715.544820.517233.500327.746326.222535.63731124906.249340.2011.31450427.6471.2678.88101.0136.8145.51152.9147.165.6322.3014.1621.3012.6514.5361.6127.2571.1879.12101.2336.7645.73152.9346.705.4722.0113.9821.2112.6114.5461.4720847.720858.020812.269.546368916701092129.22011243.511214.911200.4116.0357.3195.18727.36445.80.490.33534.1532916.0823.7338.630.580.595302306.8142316.20816.24418.946315.4314.4487.8506.41273.41285.9315.9315.2488.7503.41559.61590.4513336126334516.86827.92460345.61161631.969129.4752156.008153.24510.92217.36921.077213.460110.0792815.563820.625133.473627.301026.248635.8033OpenBenchmarking.org

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1R1232004006008001000SE +/- 2.52, N = 3SE +/- 2.00, N = 3SE +/- 1.45, N = 31126112211241. (CXX) g++ options: -O3 -pthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileR1232004006008001000SE +/- 0.32, N = 3SE +/- 0.55, N = 3SE +/- 0.82, N = 3919.50918.46906.25

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To CompileR12370140210280350SE +/- 0.25, N = 3SE +/- 0.15, N = 3SE +/- 0.31, N = 3341.93340.64340.20

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupR1230.29250.5850.87751.171.4625SE +/- 0.01, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 31.31.21.31. (CC) gcc options: -fopenmp -O3 -lm

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricR1233K6K9K12K15K1458014562145041. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400mR123714212835SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 327.8227.7227.64MIN: 27.57 / MAX: 37.34MIN: 27.48 / MAX: 31MIN: 27.49 / MAX: 29.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssdR1231632486480SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 371.3371.0371.26MIN: 70.87 / MAX: 74.75MIN: 70.28 / MAX: 76.92MIN: 70.52 / MAX: 78.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tinyR12320406080100SE +/- 0.27, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 379.4479.0478.88MIN: 78.27 / MAX: 87.18MIN: 78.34 / MAX: 85.24MIN: 78.2 / MAX: 90.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50R12320406080100SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3101.11101.17101.01MIN: 100.49 / MAX: 113.37MIN: 100.65 / MAX: 111.1MIN: 100.5 / MAX: 114.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnetR123918273645SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 336.8437.0236.81MIN: 35.27 / MAX: 39.39MIN: 35.32 / MAX: 103.85MIN: 34.96 / MAX: 79.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18R1231020304050SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 345.5145.6545.51MIN: 45.16 / MAX: 47.35MIN: 45.26 / MAX: 56.06MIN: 45.23 / MAX: 48.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16R123306090120150SE +/- 0.18, N = 3SE +/- 0.24, N = 3SE +/- 0.19, N = 3153.18152.88152.91MIN: 152.21 / MAX: 161.01MIN: 151.28 / MAX: 160.12MIN: 151.95 / MAX: 164.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenetR1231122334455SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 347.1847.0147.16MIN: 46.85 / MAX: 56.58MIN: 46.74 / MAX: 49.94MIN: 46.7 / MAX: 59.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazefaceR1231.2692.5383.8075.0766.345SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.645.625.63MIN: 5.57 / MAX: 5.86MIN: 5.54 / MAX: 6.01MIN: 5.5 / MAX: 5.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0R123510152025SE +/- 0.41, N = 3SE +/- 0.35, N = 3SE +/- 0.39, N = 322.5022.1122.30MIN: 20.99 / MAX: 25.9MIN: 20.59 / MAX: 33.72MIN: 21.02 / MAX: 25.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnetR12348121620SE +/- 0.37, N = 3SE +/- 0.41, N = 3SE +/- 0.40, N = 314.2614.0514.16MIN: 13.24 / MAX: 17.01MIN: 13.01 / MAX: 18.86MIN: 13.06 / MAX: 28.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2R123510152025SE +/- 0.57, N = 3SE +/- 0.63, N = 3SE +/- 0.57, N = 321.4021.3421.30MIN: 20.12 / MAX: 23.22MIN: 20 / MAX: 24.44MIN: 20.01 / MAX: 23.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3R1233691215SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.19, N = 312.6312.5612.65MIN: 12.19 / MAX: 14.68MIN: 12.21 / MAX: 15.08MIN: 12.18 / MAX: 14.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2R12348121620SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.18, N = 314.5514.4414.53MIN: 14.04 / MAX: 28.86MIN: 14.04 / MAX: 16.1MIN: 13.97 / MAX: 26.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenetR1231428425670SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.16, N = 361.4961.3861.61MIN: 61.05 / MAX: 63.53MIN: 61.05 / MAX: 87.54MIN: 61.11 / MAX: 113.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400mR123714212835SE +/- 0.07, N = 3SE +/- 0.60, N = 3SE +/- 0.53, N = 327.8127.1327.25MIN: 26.38 / MAX: 30.33MIN: 25.79 / MAX: 37.93MIN: 26 / MAX: 29.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssdR1231632486480SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 371.2370.9771.18MIN: 70.8 / MAX: 121.35MIN: 70.58 / MAX: 80.98MIN: 70.63 / MAX: 78.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tinyR12320406080100SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 379.3578.9079.12MIN: 78.46 / MAX: 91.83MIN: 78.21 / MAX: 91.76MIN: 78.31 / MAX: 86.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet50R12320406080100SE +/- 0.16, N = 3SE +/- 0.21, N = 3SE +/- 0.28, N = 3101.55101.22101.23MIN: 100.72 / MAX: 112.98MIN: 100.01 / MAX: 110.27MIN: 100.55 / MAX: 114.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnetR123918273645SE +/- 0.27, N = 3SE +/- 0.23, N = 3SE +/- 0.12, N = 337.0836.7036.76MIN: 34.94 / MAX: 39.37MIN: 35.28 / MAX: 38.63MIN: 35.25 / MAX: 46.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18R1231020304050SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.29, N = 345.6445.4145.73MIN: 45.25 / MAX: 47.78MIN: 45.11 / MAX: 48.07MIN: 45.08 / MAX: 48.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16R123306090120150SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 0.31, N = 3153.00152.98152.93MIN: 151.95 / MAX: 164.37MIN: 152.01 / MAX: 164.13MIN: 151.38 / MAX: 166.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenetR1231122334455SE +/- 0.16, N = 3SE +/- 0.81, N = 3SE +/- 0.77, N = 347.3346.2646.70MIN: 46.92 / MAX: 49.74MIN: 43.6 / MAX: 59.76MIN: 44.45 / MAX: 57.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazefaceR1231.27132.54263.81395.08526.3565SE +/- 0.01, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 35.655.485.47MIN: 5.56 / MAX: 7.88MIN: 5.14 / MAX: 5.72MIN: 5.12 / MAX: 5.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0R123510152025SE +/- 0.30, N = 3SE +/- 0.66, N = 3SE +/- 0.59, N = 322.4621.9722.01MIN: 21.52 / MAX: 24.07MIN: 20.55 / MAX: 27.24MIN: 20.61 / MAX: 33.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnetR12348121620SE +/- 0.26, N = 3SE +/- 0.47, N = 3SE +/- 0.45, N = 314.4514.1313.98MIN: 13.46 / MAX: 16.94MIN: 13.1 / MAX: 14.93MIN: 13.03 / MAX: 14.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v2R123510152025SE +/- 0.37, N = 3SE +/- 0.58, N = 3SE +/- 0.57, N = 321.5321.3621.21MIN: 20.55 / MAX: 22.52MIN: 19.98 / MAX: 35.94MIN: 19.96 / MAX: 25.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3R1233691215SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 312.6912.5712.61MIN: 12.21 / MAX: 24.64MIN: 12.24 / MAX: 14.08MIN: 12.23 / MAX: 13.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2R12348121620SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.20, N = 314.5614.4714.54MIN: 14.07 / MAX: 17.44MIN: 14.06 / MAX: 28.34MIN: 14.01 / MAX: 17.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenetR1231428425670SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 361.5561.4161.47MIN: 61.05 / MAX: 64.2MIN: 61.02 / MAX: 63.86MIN: 61.15 / MAX: 64.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUR1234K8K12K16K20KSE +/- 16.92, N = 3SE +/- 36.91, N = 3SE +/- 5.57, N = 320853.020912.720847.7MIN: 20801.4MIN: 20792.3MIN: 20792.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUR1234K8K12K16K20KSE +/- 20.57, N = 3SE +/- 9.04, N = 3SE +/- 3.46, N = 320903.320853.420858.0MIN: 20835.5MIN: 20778.6MIN: 20799.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUR1234K8K12K16K20KSE +/- 29.30, N = 3SE +/- 51.92, N = 3SE +/- 37.82, N = 320830.520852.020812.2MIN: 20740.9MIN: 20691.5MIN: 20694.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Warsow

Resolution: 1280 x 1024

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1280 x 1024R1231632486480SE +/- 0.09, N = 3SE +/- 0.44, N = 3SE +/- 0.63, N = 1370.770.569.5

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 1080R123100200300400500SE +/- 1.76, N = 3SE +/- 0.67, N = 3SE +/- 2.33, N = 34654614631. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 1280 x 1024

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1280 x 1024R123150300450600750SE +/- 0.67, N = 3SE +/- 2.19, N = 3SE +/- 1.53, N = 36936856891. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 800 x 600

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 800 x 600R123400800120016002000SE +/- 7.54, N = 3SE +/- 0.88, N = 3SE +/- 10.20, N = 31676169116701. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 1024 x 768

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1024 x 768R1232004006008001000SE +/- 6.69, N = 3SE +/- 6.11, N = 3SE +/- 4.18, N = 31096109710921. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchR123306090120150SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 3129.01129.09129.221. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUR1232K4K6K8K10KSE +/- 48.62, N = 3SE +/- 15.69, N = 3SE +/- 44.20, N = 311261.411332.211243.5MIN: 11146.6MIN: 11227.9MIN: 11158.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUR1232K4K6K8K10KSE +/- 65.97, N = 3SE +/- 44.81, N = 3SE +/- 13.64, N = 311309.111320.111214.9MIN: 11154.3MIN: 11234.3MIN: 11171.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUR1232K4K6K8K10KSE +/- 25.05, N = 3SE +/- 15.94, N = 3SE +/- 18.25, N = 311297.411316.911200.4MIN: 11225.5MIN: 11169.5MIN: 11129.91. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To CompileR123306090120150SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3116.41116.86116.04

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling BenchmarkR123246810SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 37.247.307.311. Nodejs v12.18.2

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000R12320406080100SE +/- 0.57, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 395.5894.5895.191. (CC) gcc options: -O2 -ldl -lz -lpthread

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xzR123612182430SE +/- 0.33, N = 6SE +/- 0.32, N = 6SE +/- 0.16, N = 1927.4627.4027.36

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080R1231020304050SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.20, N = 345.645.845.8

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: KostyaR1230.11030.22060.33090.44120.5515SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.490.490.491. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandomR1230.07430.14860.22290.29720.3715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.330.331. (CXX) g++ options: -O3 -pthread

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lapR123120240360480600SE +/- 0.34, N = 3SE +/- 1.77, N = 3SE +/- 1.17, N = 3537.11534.51534.151. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetectR1237K14K21K28K35KSE +/- 290.63, N = 3SE +/- 358.35, N = 3SE +/- 401.65, N = 332875.8932911.3332916.081. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocomputeR123612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 323.7323.7223.731. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavyR123918273645SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 338.6538.6138.631. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweetsR1230.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserIDR1230.13280.26560.39840.53120.664SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.590.590.591. (CXX) g++ options: -O3 -pthread

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteR123110K220K330K440K550KSE +/- 498.65, N = 3SE +/- 388.18, N = 3SE +/- 222.67, N = 3529100528447530230

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUR123246810SE +/- 0.06965, N = 15SE +/- 0.09974, N = 3SE +/- 0.10395, N = 157.019526.808096.81423MIN: 6.61MIN: 6.36MIN: 6.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: HighestR12348121620SE +/- 0.14, N = 13SE +/- 0.23, N = 4SE +/- 0.27, N = 316.1516.1816.211. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: HighestR12348121620SE +/- 0.20, N = 13SE +/- 0.26, N = 3SE +/- 0.28, N = 316.1916.2216.241. (CXX) g++ options: -O3 -O2 -lpthread -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackR123510152025SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 518.9518.9518.951. (CXX) g++ options: -rdynamic

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b DecryptionR12370140210280350SE +/- 0.58, N = 3SE +/- 1.09, N = 3SE +/- 0.48, N = 3315.0315.5315.4

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b EncryptionR12370140210280350SE +/- 0.73, N = 3SE +/- 0.57, N = 3SE +/- 0.15, N = 3314.9315.7314.4

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b DecryptionR123110220330440550SE +/- 1.22, N = 3SE +/- 1.01, N = 3SE +/- 0.51, N = 3488.7490.5487.8

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b EncryptionR123110220330440550SE +/- 0.87, N = 3SE +/- 0.47, N = 3SE +/- 1.01, N = 3505.2506.8506.4

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b DecryptionR12330060090012001500SE +/- 7.00, N = 3SE +/- 13.32, N = 3SE +/- 6.22, N = 31285.11289.31273.4

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b EncryptionR12330060090012001500SE +/- 6.50, N = 3SE +/- 6.69, N = 3SE +/- 2.40, N = 31304.91287.51285.9

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b DecryptionR12370140210280350SE +/- 0.48, N = 3SE +/- 0.26, N = 3SE +/- 0.86, N = 3314.8316.0315.9

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b EncryptionR12370140210280350SE +/- 0.84, N = 3SE +/- 0.43, N = 3SE +/- 0.35, N = 3313.2314.3315.2

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b DecryptionR123110220330440550SE +/- 0.20, N = 3SE +/- 1.59, N = 3SE +/- 1.51, N = 3489.0488.6488.7

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b EncryptionR123110220330440550SE +/- 0.99, N = 3SE +/- 0.92, N = 3SE +/- 1.12, N = 3505.1505.4503.4

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b DecryptionR12330060090012001500SE +/- 11.32, N = 3SE +/- 15.21, N = 3SE +/- 11.68, N = 31577.41586.71559.6

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b EncryptionR12330060090012001500SE +/- 19.58, N = 3SE +/- 15.55, N = 3SE +/- 1.02, N = 31577.51581.41590.4

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpoolR123110K220K330K440K550KSE +/- 335.33, N = 3SE +/- 335.33, N = 3513672513001513336

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512R123300K600K900K1200K1500KSE +/- 1520.33, N = 3SE +/- 2024.67, N = 3SE +/- 878.73, N = 3126334812628431263345

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APER12348121620SE +/- 0.06, N = 5SE +/- 0.06, N = 5SE +/- 0.04, N = 516.8516.8416.871. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To OggR123714212835SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 327.9627.9427.921. (CC) gcc options: -O2 -ffast-math -fsigned-char

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondR12313K26K39K52K65KSE +/- 220.80, N = 3SE +/- 445.62, N = 3SE +/- 283.31, N = 360703.7360082.6760345.611. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUR123714212835SE +/- 0.29, N = 3SE +/- 0.34, N = 3SE +/- 0.27, N = 332.1131.7031.97MIN: 30.06MIN: 30.17MIN: 30.651. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUR123714212835SE +/- 0.32, N = 3SE +/- 0.29, N = 3SE +/- 0.24, N = 329.1529.4029.48MIN: 27.38MIN: 28MIN: 27.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleR123306090120150SE +/- 1.71, N = 3SE +/- 1.45, N = 3SE +/- 0.61, N = 3152.68155.24156.011. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleR123306090120150SE +/- 0.49, N = 3SE +/- 1.86, N = 3SE +/- 1.40, N = 3152.24152.38153.251. (CXX) g++ options: -O3 -pthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeR1233691215SE +/- 0.03, N = 5SE +/- 0.02, N = 5SE +/- 0.04, N = 510.8910.9010.921. (CXX) g++ options: -fvisibility=hidden -logg -lm

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAR12348121620SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 317.0117.2017.371. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUR123510152025SE +/- 0.31, N = 3SE +/- 0.27, N = 3SE +/- 0.23, N = 321.0021.0921.08MIN: 19.61MIN: 20.08MIN: 19.851. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUR1233691215SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 313.2913.3313.46MIN: 12.99MIN: 13.01MIN: 13.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUR1233691215SE +/- 0.04347, N = 3SE +/- 0.02320, N = 3SE +/- 0.13687, N = 410.109409.1228710.07928MIN: 9.65MIN: 8.62MIN: 9.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUR12348121620SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 315.4515.5415.56MIN: 14.57MIN: 14.82MIN: 14.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUR123510152025SE +/- 0.18, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 320.9120.5220.63MIN: 20.39MIN: 20.16MIN: 20.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUR123816243240SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 333.8833.5033.47MIN: 33.42MIN: 33.31MIN: 33.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUR123714212835SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 327.8027.7527.30MIN: 27.49MIN: 27.23MIN: 26.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUR123612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 326.2026.2226.25MIN: 26.09MIN: 26.12MIN: 26.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUR123816243240SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 335.3535.6435.80MIN: 33.41MIN: 33.73MIN: 33.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.5