tigerlake ncnn cassandra

Intel Core i7-1165G7 testing with a Dell 0GG9PT (3.0.3 BIOS) and Intel Xe TGL GT2 3GB on Ubuntu 21.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2107284-IB-TIGERLAKE93.

tigerlake ncnn cassandra: system details
Runs: 1, 2, 3, 4

Processor: Intel Core i7-1165G7 @ 4.70GHz (4 Cores / 8 Threads)
Motherboard: Dell 0GG9PT (3.0.3 BIOS)
Chipset: Intel Tiger Lake-LP
Memory: 16GB
Disk: Kioxia KBG40ZNS256G NVMe 256GB
Graphics: Intel Xe TGL GT2 3GB (1300MHz)
Audio: Realtek ALC289
Network: Intel Wi-Fi 6 AX201
OS: Ubuntu 21.04
Kernel: 5.13.0-051300-generic (x86_64)
Desktop: GNOME Shell 3.38.4
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 21.2.0-devel (git-dd98918 2021-07-12 hirsute-oibaf-ppa)
Vulkan: 1.2.182
Compiler: GCC 10.3.0
File-System: ext4
Screen Resolution: 1920x1200

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x88 - Thermald 2.4.3

Java Details: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2)

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

tigerlake ncnn cassandra: results overview
(ncnn results in ms, fewer is better; cassandra results in Op/s, more is better; "-" means no result for that run)

Test                                    1        2        3        4
ncnn: CPU - mobilenet                   20.76    20.73    25.83    25.61
ncnn: CPU-v2-v2 - mobilenet-v2          5.16     5.1      6.41     6.48
ncnn: CPU-v3-v3 - mobilenet-v3          4.21     4.16     5.36     5.62
ncnn: CPU - shufflenet-v2               4.42     4.42     5.78     6.32
ncnn: CPU - mnasnet                     4.38     4.39     5.75     6.33
ncnn: CPU - efficientnet-b0             6.87     6.93     9.64     9.70
ncnn: CPU - blazeface                   1.58     1.58     2.35     2.32
ncnn: CPU - googlenet                   14.66    14.69    20.37    20.43
ncnn: CPU - vgg16                       54.8     54.93    65.73    65.79
ncnn: CPU - resnet18                    14.19    14.23    18.76    18.77
ncnn: CPU - alexnet                     13.02    13.05    16.76    16.78
ncnn: CPU - resnet50                    30.18    29.83    37.10    36.88
ncnn: CPU - yolov4-tiny                 29.41    29.26    36.60    36.23
ncnn: CPU - squeezenet_ssd              25.85    25.55    33.28    33.17
ncnn: CPU - regnety_400m                9.58     9.54     13.64    13.65
ncnn: Vulkan GPU - mobilenet            19.58    21.18    20.43    21.71
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2   11.62    11.49    11.66    11.44
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3   15.77    15.53    15.33    14.37
ncnn: Vulkan GPU - shufflenet-v2        14.12    11.47    12.14    9.61
ncnn: Vulkan GPU - mnasnet              11.93    11.93    11.76    10.91
ncnn: Vulkan GPU - efficientnet-b0      16.42    16.34    16.70    15.20
ncnn: Vulkan GPU - blazeface            3.01     2.99     2.99     3.07
ncnn: Vulkan GPU - googlenet            16.43    17.45    16.00    15.39
ncnn: Vulkan GPU - vgg16                36.72    36.78    36.79    36.86
ncnn: Vulkan GPU - resnet18             12.84    12.73    12.21    11.58
ncnn: Vulkan GPU - alexnet              11.24    11.13    11.45    11.91
ncnn: Vulkan GPU - resnet50             15.27    15.21    15.57    15.70
ncnn: Vulkan GPU - yolov4-tiny          45.99    42.63    42.78    43.98
ncnn: Vulkan GPU - squeezenet_ssd       12       14.89    13.28    14.15
ncnn: Vulkan GPU - regnety_400m         15.51    14.72    15.73    15.00
cassandra: Reads                        36312    38604    31725    -
cassandra: Writes                       47906    48732    40411    -
cassandra: Mixed 1:1                    33908    32932    -        -
cassandra: Mixed 1:3                    33407    32650    -        -

NCNN

Target: CPU - Model: mobilenet

NCNN 20210720 (ms, fewer is better)
Run 1: 20.76 (MIN: 20.21 / MAX: 31.73)
Run 2: 20.73 (MIN: 20.16 / MAX: 31.55)
Run 3: 25.83 (MIN: 24.93 / MAX: 39.44)
Run 4: 25.61 (MIN: 24.96 / MAX: 39.28)
SE +/- 0.25, N = 3; SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20210720 (ms, fewer is better)
Run 1: 5.16 (MIN: 5.07 / MAX: 13.41)
Run 2: 5.10 (MIN: 5 / MAX: 8.58)
Run 3: 6.41 (MIN: 5.06 / MAX: 19.58)
Run 4: 6.48 (MIN: 5.07 / MAX: 17.24)
SE +/- 0.62, N = 3; SE +/- 0.61, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20210720 (ms, fewer is better)
Run 1: 4.21 (MIN: 4.1 / MAX: 11.21)
Run 2: 4.16 (MIN: 4.09 / MAX: 8.27)
Run 3: 5.36 (MIN: 4.11 / MAX: 17.33)
Run 4: 5.62 (MIN: 4.13 / MAX: 16.16)
SE +/- 0.56, N = 3; SE +/- 0.33, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20210720 (ms, fewer is better)
Run 1: 4.42 (MIN: 4.32 / MAX: 7.16)
Run 2: 4.42 (MIN: 4.3 / MAX: 7.83)
Run 3: 5.78 (MIN: 4.33 / MAX: 17.73)
Run 4: 6.32 (MIN: 5.92 / MAX: 17.84)
SE +/- 0.64, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20210720 (ms, fewer is better)
Run 1: 4.38 (MIN: 4.24 / MAX: 15.06)
Run 2: 4.39 (MIN: 4.29 / MAX: 12.65)
Run 3: 5.75 (MIN: 4.27 / MAX: 18.15)
Run 4: 6.33 (MIN: 5.9 / MAX: 16.02)
SE +/- 0.66, N = 3; SE +/- 0.04, N = 2
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20210720 (ms, fewer is better)
Run 1: 6.87 (MIN: 6.78 / MAX: 10.6)
Run 2: 6.93 (MIN: 6.81 / MAX: 17.97)
Run 3: 9.64 (MIN: 9.24 / MAX: 21.07)
Run 4: 9.70 (MIN: 9.37 / MAX: 21.6)
SE +/- 0.03, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

NCNN 20210720 (ms, fewer is better)
Run 1: 1.58 (MIN: 1.55 / MAX: 1.81)
Run 2: 1.58 (MIN: 1.55 / MAX: 1.82)
Run 3: 2.35 (MIN: 2.27 / MAX: 11.55)
Run 4: 2.32 (MIN: 2.24 / MAX: 8.7)
SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

NCNN 20210720 (ms, fewer is better)
Run 1: 14.66 (MIN: 14.52 / MAX: 26.1)
Run 2: 14.69 (MIN: 14.46 / MAX: 25.44)
Run 3: 20.37 (MIN: 19.94 / MAX: 32.31)
Run 4: 20.43 (MIN: 19.79 / MAX: 37.06)
SE +/- 0.04, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

NCNN 20210720 (ms, fewer is better)
Run 1: 54.80 (MIN: 53.87 / MAX: 66.81)
Run 2: 54.93 (MIN: 54.22 / MAX: 65.37)
Run 3: 65.73 (MIN: 63.4 / MAX: 82.52)
Run 4: 65.79 (MIN: 63.74 / MAX: 172.41)
SE +/- 0.31, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

NCNN 20210720 (ms, fewer is better)
Run 1: 14.19 (MIN: 13.87 / MAX: 25.18)
Run 2: 14.23 (MIN: 13.85 / MAX: 25.47)
Run 3: 18.76 (MIN: 18.3 / MAX: 30.77)
Run 4: 18.77 (MIN: 18.35 / MAX: 31.46)
SE +/- 0.02, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

NCNN 20210720 (ms, fewer is better)
Run 1: 13.02 (MIN: 12.74 / MAX: 16.73)
Run 2: 13.05 (MIN: 12.71 / MAX: 22.09)
Run 3: 16.76 (MIN: 16.36 / MAX: 28.34)
Run 4: 16.78 (MIN: 16.43 / MAX: 27.64)
SE +/- 0.05, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

NCNN 20210720 (ms, fewer is better)
Run 1: 30.18 (MIN: 28.19 / MAX: 42.88)
Run 2: 29.83 (MIN: 27.81 / MAX: 41.79)
Run 3: 37.10 (MIN: 36.02 / MAX: 90.9)
Run 4: 36.88 (MIN: 36.04 / MAX: 52.6)
SE +/- 0.23, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20210720 (ms, fewer is better)
Run 1: 29.41 (MIN: 28.58 / MAX: 41.83)
Run 2: 29.26 (MIN: 28.32 / MAX: 40.02)
Run 3: 36.60 (MIN: 35.29 / MAX: 48.79)
Run 4: 36.23 (MIN: 35.39 / MAX: 49.67)
SE +/- 0.27, N = 3; SE +/- 0.08, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20210720 (ms, fewer is better)
Run 1: 25.85 (MIN: 25.11 / MAX: 35.16)
Run 2: 25.55 (MIN: 24.94 / MAX: 37.73)
Run 3: 33.28 (MIN: 32.43 / MAX: 44.71)
Run 4: 33.17 (MIN: 32.55 / MAX: 51.33)
SE +/- 0.15, N = 3; SE +/- 0.07, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20210720 (ms, fewer is better)
Run 1: 9.58 (MIN: 9.41 / MAX: 21.37)
Run 2: 9.54 (MIN: 9.39 / MAX: 22.14)
Run 3: 13.64 (MIN: 13.29 / MAX: 25.27)
Run 4: 13.65 (MIN: 13.33 / MAX: 25.63)
SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20210720 (ms, fewer is better)
Run 1: 19.58 (MIN: 17.39 / MAX: 27.41)
Run 2: 21.18 (MIN: 17.49 / MAX: 31.49)
Run 3: 20.43 (MIN: 16.5 / MAX: 30.15)
Run 4: 21.71 (MIN: 17.43 / MAX: 37.02)
SE +/- 0.12, N = 3; SE +/- 0.23, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20210720 (ms, fewer is better)
Run 1: 11.62 (MIN: 10.36 / MAX: 13.31)
Run 2: 11.49 (MIN: 10.23 / MAX: 15.56)
Run 3: 11.66 (MIN: 10.25 / MAX: 13.43)
Run 4: 11.44 (MIN: 10.04 / MAX: 12.89)
SE +/- 0.06, N = 3; SE +/- 0.07, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20210720 (ms, fewer is better)
Run 1: 15.77 (MIN: 14.57 / MAX: 16.84)
Run 2: 15.53 (MIN: 14.5 / MAX: 16.15)
Run 3: 15.33 (MIN: 14.22 / MAX: 16.99)
Run 4: 14.37 (MIN: 12.6 / MAX: 16.83)
SE +/- 0.30, N = 3; SE +/- 0.43, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20210720 (ms, fewer is better)
Run 1: 14.12 (MIN: 13.69 / MAX: 16.12)
Run 2: 11.47 (MIN: 7.66 / MAX: 19.94)
Run 3: 12.14 (MIN: 8.42 / MAX: 18.12)
Run 4: 9.61 (MIN: 7.62 / MAX: 14.14)
SE +/- 1.25, N = 3; SE +/- 0.62, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20210720 (ms, fewer is better)
Run 1: 11.93 (MIN: 11.57 / MAX: 12.12)
Run 2: 11.93 (MIN: 11.63 / MAX: 12.39)
Run 3: 11.76 (MIN: 10.37 / MAX: 13.13)
Run 4: 10.91 (MIN: 10.29 / MAX: 12.03)
SE +/- 0.09, N = 3; SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20210720 (ms, fewer is better)
Run 1: 16.42 (MIN: 16.16 / MAX: 18.42)
Run 2: 16.34 (MIN: 15.22 / MAX: 18.97)
Run 3: 16.70 (MIN: 15.67 / MAX: 19.67)
Run 4: 15.20 (MIN: 13.79 / MAX: 19.67)
SE +/- 0.28, N = 3; SE +/- 0.25, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20210720 (ms, fewer is better)
Run 1: 3.01 (MIN: 2.39 / MAX: 4.02)
Run 2: 2.99 (MIN: 2.37 / MAX: 4.31)
Run 3: 2.99 (MIN: 2.38 / MAX: 7.68)
Run 4: 3.07 (MIN: 2.37 / MAX: 4.04)
SE +/- 0.23, N = 3; SE +/- 0.18, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20210720 (ms, fewer is better)
Run 1: 16.43 (MIN: 16.08 / MAX: 17.64)
Run 2: 17.45 (MIN: 17.24 / MAX: 18.17)
Run 3: 16.00 (MIN: 15.05 / MAX: 18.21)
Run 4: 15.39 (MIN: 14.17 / MAX: 18.64)
SE +/- 0.22, N = 3; SE +/- 0.14, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20210720 (ms, fewer is better)
Run 1: 36.72 (MIN: 36 / MAX: 37.31)
Run 2: 36.78 (MIN: 36.33 / MAX: 37.58)
Run 3: 36.79 (MIN: 35.93 / MAX: 37.31)
Run 4: 36.86 (MIN: 36.04 / MAX: 37.48)
SE +/- 0.05, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20210720 (ms, fewer is better)
Run 1: 12.84 (MIN: 12.38 / MAX: 13.09)
Run 2: 12.73 (MIN: 11.03 / MAX: 15.29)
Run 3: 12.21 (MIN: 11.12 / MAX: 13.08)
Run 4: 11.58 (MIN: 10.83 / MAX: 12.94)
SE +/- 0.31, N = 3; SE +/- 0.22, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20210720 (ms, fewer is better)
Run 1: 11.24 (MIN: 10.22 / MAX: 15.13)
Run 2: 11.13 (MIN: 10.17 / MAX: 14.29)
Run 3: 11.45 (MIN: 10.23 / MAX: 15.09)
Run 4: 11.91 (MIN: 10.69 / MAX: 14.01)
SE +/- 0.09, N = 3; SE +/- 0.38, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20210720 (ms, fewer is better)
Run 1: 15.27 (MIN: 14.66 / MAX: 15.6)
Run 2: 15.21 (MIN: 14.33 / MAX: 15.56)
Run 3: 15.57 (MIN: 14.83 / MAX: 16.09)
Run 4: 15.70 (MIN: 14.71 / MAX: 16.12)
SE +/- 0.12, N = 2; SE +/- 0.05, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20210720 (ms, fewer is better)
Run 1: 45.99 (MIN: 30 / MAX: 62.2)
Run 2: 42.63 (MIN: 27.86 / MAX: 56.53)
Run 3: 42.78 (MIN: 27.08 / MAX: 63.78)
Run 4: 43.98 (MIN: 28.09 / MAX: 72.61)
SE +/- 0.71, N = 3; SE +/- 0.10, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20210720 (ms, fewer is better)
Run 1: 12.00 (MIN: 11.4 / MAX: 12.65)
Run 2: 14.89 (MIN: 13.1 / MAX: 25.87)
Run 3: 13.28 (MIN: 11.64 / MAX: 15.59)
Run 4: 14.15 (MIN: 11.78 / MAX: 17.84)
SE +/- 0.80, N = 3; SE +/- 1.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20210720 (ms, fewer is better)
Run 1: 15.51 (MIN: 14.64 / MAX: 16.49)
Run 2: 14.72 (MIN: 14.07 / MAX: 14.92)
Run 3: 15.73 (MIN: 14.39 / MAX: 17.03)
Run 4: 15.00 (MIN: 13.17 / MAX: 16.4)
SE +/- 0.34, N = 3; SE +/- 0.30, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

Apache Cassandra

Test: Reads

Apache Cassandra 4.0 (Op/s, more is better)
Run 1: 36312
Run 2: 38604
Run 3: 31725
SE +/- 804.65, N = 12

Apache Cassandra

Test: Writes

Apache Cassandra 4.0 (Op/s, more is better)
Run 1: 47906
Run 2: 48732
Run 3: 40411
SE +/- 108.17, N = 3

Apache Cassandra

Test: Mixed 1:1

Apache Cassandra 4.0 (Op/s, more is better)
Run 1: 33908
Run 2: 32932

Apache Cassandra

Test: Mixed 1:3

Apache Cassandra 4.0 (Op/s, more is better)
Run 1: 33407
Run 2: 32650


Phoronix Test Suite v10.8.4