newnew

Intel Core i7-1165G7 testing with a Dell 0GG9PT (3.15.0 BIOS) and Intel Xe TGL GT2 15GB on Ubuntu 23.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2308049-NE-NEWNEW95665&grw&sro.

newnewProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionabcIntel Core i7-1165G7 @ 4.70GHz (4 Cores / 8 Threads)Dell 0GG9PT (3.15.0 BIOS)Intel Tiger Lake-LP16GBKioxia KBG40ZNS256G NVMe 256GBIntel Xe TGL GT2 15GB (1300MHz)Realtek ALC289Intel Wi-Fi 6 AX201Ubuntu 23.046.2.0-24-generic (x86_64)GNOME Shell 44.0X Server + Wayland4.6 Mesa 23.0.2GCC 12.2.0ext41920x1200OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xa6 - Thermald 2.5.2 Java Details- OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu123.04)Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

newnewbrl-cad: VGR Performance Metricncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - FastestDetncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - FastestDetbuild-gcc: Time To Compilevvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fastervvenc: Bosphorus 1080p - Fastvvenc: Bosphorus 1080p - Fastervkfft: FFT + iFFT R2C / C2Rvkfft: FFT + iFFT C2C 1D batched in half precisionvkfft: FFT + iFFT C2C Bluestein in single precisionvkfft: FFT + iFFT C2C 1D batched in single precisionvkfft: FFT + iFFT C2C multidimensional in single precisionvkfft: FFT + iFFT C2C 1D batched in single precision, no reshufflingvkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp16-scalarvkpeak: fp16-vec4vkpeak: int32-scalarvkpeak: int32-vec4vkpeak: int16-scalarvkpeak: int16-vec4vkresample: 2x - Singleapache-iotdb: 100 - 1 - 200apache-iotdb: 100 - 1 - 200apache-iotdb: 100 - 1 - 500apache-iotdb: 100 - 1 - 500apache-iotdb: 200 - 1 - 200apache-iotdb: 200 - 1 - 200apache-iotdb: 200 - 1 - 500apache-iotdb: 200 - 1 - 500apache-iotdb: 100 - 100 - 200apache-iotdb: 100 - 100 - 200apache-iotdb: 100 - 100 - 500apache-iotdb: 100 - 100 - 500apache-iotdb: 200 - 100 - 200apache-iotdb: 200 - 100 - 200apache-iotdb: 200 - 100 - 500apache-iotdb: 200 - 100 - 500dragonflydb: 10 - 1:5dragonflydb: 20 - 1:5dragonflydb: 50 - 1:5dragonflydb: 10 - 1:10dragonflydb: 20 - 1:10dragonflydb: 50 - 1:10dragonflydb: 10 - 1:100dragonflydb: 20 - 1:100dragonflydb: 50 - 1:100cassandra: Writesabc5200820.744.603.584.034.558.851.3215.8259.0911.178.6128.7429.4412.9511.76189.403.9721.074.663.603.503.936.991.2615.8659.0311.188.6828.8429.6013.0511.19207.463.972404.7741.6543.6805.1412.7965585142461033748649448176934.741478.582309.293182.23474.88493.63907.91979.25100.009640563.2216.781265418.9124.981035973.61121696504.0222.0822588608.6768.4821059562.6211.9617866468.2691.6716383690.96284.411331051.901620397.171509161.501283252.401553942.491559408.511267626.251504800.261590238.43398645196720.744.603.593.473.866.920.9412.7854.259.257.4326.5729.0312.238.61204.363.9120.654.633.593.493.896.980.9914.0957.8810.228.0828.7129.2313.018.57193.903.942404.3201.6503.7815.12412.6375589142321034748249378176935.021478.502309.193182.01474.86493.63907.82979.3100.01064483316.41124779625.141007122.6212.521651562.3623.121253793.5374.0821812047.21200.9718637109.8385.6410851137.82370.361310891.481572424.891546553.321275557.391532531.221545294.471305698.481532061.631538729.61408235188020.744.533.513.473.836.640.9612.4254.139.197.3026.5729.2512.058.39200.233.8920.694.603.543.493.876.811.0814.0158.2810.408.1528.7829.1113.009.62188.794.032394.1391.6773.8535.24913.8025688142411035747850878183934.811479.042309.283182.29474.89493.65907.87979.23100.009655887.0316.031266283.9424.73995398.4312.781659495.2922.6317612257.1376.878804114.19499.989365385.68164.995724897.6757.341394989.581561730.491559859.231325104.131620928.161574780.451297415.251655250.861548401.5439536OpenBenchmarking.org

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metricabc11K22K33K44K55KSE +/- 16.50, N = 2SE +/- 147.00, N = 2SE +/- 34.00, N = 25200851967518801. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetabc510152025SE +/- 0.05, N = 2SE +/- 0.04, N = 2SE +/- 0.07, N = 220.7420.7420.74MIN: 20.17 / MAX: 31.42MIN: 20.23 / MAX: 31.9MIN: 20.26 / MAX: 32.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2abc1.0352.073.1054.145.175SE +/- 0.04, N = 2SE +/- 0.02, N = 2SE +/- 0.02, N = 24.604.604.53MIN: 4.38 / MAX: 12.37MIN: 4.34 / MAX: 14.83MIN: 4.3 / MAX: 15.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3abc0.80781.61562.42343.23124.039SE +/- 0.03, N = 2SE +/- 0.04, N = 2SE +/- 0.00, N = 23.583.593.51MIN: 3.35 / MAX: 13.78MIN: 3.37 / MAX: 14.17MIN: 3.32 / MAX: 11.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2abc0.90681.81362.72043.62724.534SE +/- 0.57, N = 2SE +/- 0.01, N = 2SE +/- 0.02, N = 24.033.473.47MIN: 3.35 / MAX: 10.89MIN: 3.32 / MAX: 12.41MIN: 3.36 / MAX: 13.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetabc1.02382.04763.07144.09525.119SE +/- 0.63, N = 2SE +/- 0.07, N = 2SE +/- 0.01, N = 24.553.863.83MIN: 3.81 / MAX: 15.97MIN: 3.69 / MAX: 12.48MIN: 3.68 / MAX: 12.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0abc246810SE +/- 0.29, N = 2SE +/- 0.04, N = 2SE +/- 0.05, N = 28.856.926.64MIN: 6.6 / MAX: 21.28MIN: 6.5 / MAX: 16.04MIN: 6.38 / MAX: 17.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefaceabc0.2970.5940.8911.1881.485SE +/- 0.03, N = 2SE +/- 0.02, N = 2SE +/- 0.03, N = 21.320.940.96MIN: 1.2 / MAX: 4.08MIN: 0.9 / MAX: 1.09MIN: 0.9 / MAX: 5.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetabc48121620SE +/- 0.07, N = 2SE +/- 0.20, N = 2SE +/- 0.27, N = 215.8212.7812.42MIN: 15.12 / MAX: 28.62MIN: 11.9 / MAX: 22.46MIN: 11.68 / MAX: 23.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16abc1326395265SE +/- 0.11, N = 2SE +/- 1.96, N = 2SE +/- 2.37, N = 259.0954.2554.13MIN: 57.54 / MAX: 76.29MIN: 50.32 / MAX: 71.7MIN: 48.55 / MAX: 72.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18abc3691215SE +/- 0.04, N = 2SE +/- 0.11, N = 2SE +/- 0.50, N = 211.179.259.19MIN: 10.66 / MAX: 21.33MIN: 8.51 / MAX: 24.74MIN: 8.38 / MAX: 20.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetabc246810SE +/- 0.04, N = 2SE +/- 0.09, N = 2SE +/- 0.25, N = 28.617.437.30MIN: 8.27 / MAX: 20.05MIN: 6.9 / MAX: 16.46MIN: 6.79 / MAX: 17.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50abc714212835SE +/- 0.18, N = 2SE +/- 2.23, N = 2SE +/- 2.20, N = 228.7426.5726.57MIN: 27.89 / MAX: 40MIN: 23.28 / MAX: 38.91MIN: 23.37 / MAX: 38.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinyabc714212835SE +/- 0.42, N = 2SE +/- 0.03, N = 2SE +/- 0.03, N = 229.4429.0329.25MIN: 28.43 / MAX: 46.03MIN: 28.45 / MAX: 40.14MIN: 28.42 / MAX: 40.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssdabc3691215SE +/- 0.03, N = 2SE +/- 0.86, N = 2SE +/- 0.97, N = 212.9512.2312.05MIN: 12.67 / MAX: 23.3MIN: 10.77 / MAX: 31.43MIN: 10.72 / MAX: 23.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mabc3691215SE +/- 0.10, N = 2SE +/- 0.03, N = 2SE +/- 0.01, N = 211.768.618.39MIN: 11.26 / MAX: 22.16MIN: 8.21 / MAX: 17.64MIN: 8.14 / MAX: 18.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerabc4080120160200SE +/- 4.42, N = 2SE +/- 3.85, N = 2SE +/- 5.48, N = 2189.40204.36200.23MIN: 169.61 / MAX: 225.69MIN: 168.8 / MAX: 231.88MIN: 169.16 / MAX: 234.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetabc0.89331.78662.67993.57324.4665SE +/- 0.00, N = 2SE +/- 0.00, N = 2SE +/- 0.03, N = 23.973.913.89MIN: 3.76 / MAX: 14.59MIN: 3.71 / MAX: 13.7MIN: 3.67 / MAX: 13.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mobilenetabc510152025SE +/- 0.19, N = 2SE +/- 0.03, N = 2SE +/- 0.03, N = 221.0720.6520.69MIN: 20.27 / MAX: 31.83MIN: 20.22 / MAX: 31.66MIN: 20.24 / MAX: 31.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2abc1.04852.0973.14554.1945.2425SE +/- 0.00, N = 2SE +/- 0.03, N = 2SE +/- 0.04, N = 24.664.634.60MIN: 4.46 / MAX: 14.54MIN: 4.44 / MAX: 14.09MIN: 4.38 / MAX: 141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3abc0.811.622.433.244.05SE +/- 0.03, N = 2SE +/- 0.02, N = 2SE +/- 0.01, N = 23.603.593.54MIN: 3.38 / MAX: 12.52MIN: 3.4 / MAX: 12.21MIN: 3.33 / MAX: 11.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: shufflenet-v2abc0.78751.5752.36253.153.9375SE +/- 0.01, N = 2SE +/- 0.01, N = 2SE +/- 0.02, N = 23.503.493.49MIN: 3.35 / MAX: 12.33MIN: 3.33 / MAX: 13.62MIN: 3.35 / MAX: 11.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mnasnetabc0.88431.76862.65293.53724.4215SE +/- 0.01, N = 2SE +/- 0.03, N = 2SE +/- 0.04, N = 23.933.893.87MIN: 3.73 / MAX: 14.41MIN: 3.74 / MAX: 12.97MIN: 3.71 / MAX: 12.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: efficientnet-b0abc246810SE +/- 0.02, N = 2SE +/- 0.02, N = 2SE +/- 0.15, N = 26.996.986.81MIN: 6.55 / MAX: 16.31MIN: 6.57 / MAX: 17.79MIN: 6.42 / MAX: 17.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: blazefaceabc0.28350.5670.85051.1341.4175SE +/- 0.02, N = 2SE +/- 0.05, N = 2SE +/- 0.14, N = 21.260.991.08MIN: 1.16 / MAX: 4.12MIN: 0.91 / MAX: 3.77MIN: 0.91 / MAX: 3.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: googlenetabc48121620SE +/- 0.02, N = 2SE +/- 1.53, N = 2SE +/- 1.79, N = 215.8614.0914.01MIN: 15.16 / MAX: 26.57MIN: 12.03 / MAX: 25.74MIN: 11.83 / MAX: 25.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vgg16abc1326395265SE +/- 0.15, N = 2SE +/- 1.40, N = 2SE +/- 0.65, N = 259.0357.8858.28MIN: 57.6 / MAX: 79.49MIN: 52 / MAX: 70.84MIN: 52.14 / MAX: 69.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet18abc3691215SE +/- 0.02, N = 2SE +/- 1.05, N = 2SE +/- 0.82, N = 211.1810.2210.40MIN: 10.62 / MAX: 21.34MIN: 8.49 / MAX: 20.91MIN: 8.57 / MAX: 21.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: alexnetabc246810SE +/- 0.04, N = 2SE +/- 0.52, N = 2SE +/- 0.55, N = 28.688.088.15MIN: 8.25 / MAX: 19.5MIN: 7.01 / MAX: 17.55MIN: 7.01 / MAX: 17.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet50abc714212835SE +/- 0.19, N = 2SE +/- 0.12, N = 2SE +/- 0.15, N = 228.8428.7128.78MIN: 27.92 / MAX: 41.94MIN: 27.9 / MAX: 39.22MIN: 27.96 / MAX: 39.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: yolov4-tinyabc714212835SE +/- 0.26, N = 2SE +/- 0.15, N = 2SE +/- 0.05, N = 229.6029.2329.11MIN: 28.6 / MAX: 44.5MIN: 28.34 / MAX: 41.02MIN: 28.38 / MAX: 40.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: squeezenet_ssdabc3691215SE +/- 0.04, N = 2SE +/- 0.01, N = 2SE +/- 0.03, N = 213.0513.0113.00MIN: 12.7 / MAX: 23.56MIN: 12.67 / MAX: 27.85MIN: 12.68 / MAX: 23.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: regnety_400mabc3691215SE +/- 0.30, N = 2SE +/- 0.01, N = 2SE +/- 1.13, N = 211.198.579.62MIN: 10.38 / MAX: 21.88MIN: 8.18 / MAX: 21.08MIN: 8.23 / MAX: 23.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vision_transformerabc50100150200250SE +/- 8.01, N = 2SE +/- 0.73, N = 2SE +/- 0.68, N = 2207.46193.90188.79MIN: 169.78 / MAX: 244.36MIN: 167.64 / MAX: 231.56MIN: 169.1 / MAX: 243.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: FastestDetabc0.90681.81362.72043.62724.534SE +/- 0.09, N = 2SE +/- 0.00, N = 2SE +/- 0.10, N = 23.973.944.03MIN: 3.7 / MAX: 15.04MIN: 3.73 / MAX: 12.78MIN: 3.74 / MAX: 10.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 13.2Time To Compileabc5001000150020002500SE +/- 0.32, N = 2SE +/- 2.12, N = 2SE +/- 1.40, N = 22404.772404.322394.14

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fastabc0.37730.75461.13191.50921.8865SE +/- 0.037, N = 2SE +/- 0.028, N = 2SE +/- 0.036, N = 21.6541.6501.6771. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fasterabc0.86691.73382.60073.46764.3345SE +/- 0.014, N = 2SE +/- 0.014, N = 2SE +/- 0.081, N = 23.6803.7813.8531. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fastabc1.1812.3623.5434.7245.905SE +/- 0.000, N = 2SE +/- 0.008, N = 2SE +/- 0.115, N = 25.1405.1245.2491. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fasterabc48121620SE +/- 0.08, N = 2SE +/- 0.11, N = 2SE +/- 0.79, N = 212.8012.6413.801. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VkFFT

Test: FFT + iFFT R2C / C2R

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT R2C / C2Rabc12002400360048006000SE +/- 3.50, N = 2SE +/- 30.50, N = 2SE +/- 69.00, N = 25585558956881. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in half precisionabc3K6K9K12K15KSE +/- 5.00, N = 2SE +/- 1.50, N = 2SE +/- 4.00, N = 21424614232142411. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C Bluestein in single precisionabc2004006008001000SE +/- 0.50, N = 2SE +/- 1.00, N = 2SE +/- 1.00, N = 21033103410351. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in single precisionabc16003200480064008000SE +/- 14.00, N = 2SE +/- 4.00, N = 2SE +/- 3.50, N = 27486748274781. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C multidimensional in single precisionabc11002200330044005500SE +/- 17.50, N = 2SE +/- 6.50, N = 2SE +/- 9.00, N = 24944493750871. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in single precision, no reshufflingabc2K4K6K8K10KSE +/- 4.00, N = 2SE +/- 0.50, N = 2SE +/- 5.00, N = 28176817681831. (CXX) g++ options: -O3

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-scalarabc2004006008001000SE +/- 0.27, N = 2SE +/- 0.00, N = 2SE +/- 0.29, N = 2934.74935.02934.81

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-vec4abc30060090012001500SE +/- 0.52, N = 2SE +/- 0.51, N = 2SE +/- 0.00, N = 21478.581478.501479.04

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-scalarabc5001000150020002500SE +/- 0.07, N = 2SE +/- 0.02, N = 2SE +/- 0.04, N = 22309.292309.192309.28

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-vec4abc7001400210028003500SE +/- 0.11, N = 2SE +/- 0.04, N = 2SE +/- 0.12, N = 23182.233182.013182.29

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-scalarabc100200300400500SE +/- 0.03, N = 2SE +/- 0.00, N = 2SE +/- 0.00, N = 2474.88474.86474.89

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-vec4abc110220330440550SE +/- 0.02, N = 2SE +/- 0.00, N = 2SE +/- 0.00, N = 2493.63493.63493.65

vkpeak

int16-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-scalarabc2004006008001000SE +/- 0.10, N = 2SE +/- 0.01, N = 2SE +/- 0.03, N = 2907.91907.82907.87

vkpeak

int16-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-vec4abc2004006008001000SE +/- 0.08, N = 2SE +/- 0.05, N = 2SE +/- 0.00, N = 2979.25979.30979.23

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Singleabc20406080100SE +/- 0.00, N = 2SE +/- 0.00, N = 2SE +/- 0.00, N = 2100.01100.01100.011. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200abc140K280K420K560K700K640563.22644833.00655887.03

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200abc4812162016.7816.4116.03MAX: 906.87MAX: 1017.33MAX: 1027.73

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500abc300K600K900K1200K1500K1265418.911247796.001266283.94

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500abc61218243024.9825.1424.73MAX: 1064.56MAX: 1026.24MAX: 998.23

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200abc200K400K600K800K1000K1035973.611007122.62995398.43

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200abc369121512.0012.5212.78MAX: 771.24MAX: 794.23MAX: 800.68

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500abc400K800K1200K1600K2000K1696504.021651562.361659495.29

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500abc61218243022.0823.1022.63MAX: 863.43MAX: 906.47MAX: 909.27

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200abc5M10M15M20M25M22588608.6721253793.5317612257.13

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200abc2040608010068.4874.0876.87MAX: 1696.56MAX: 1547.17MAX: 9168.34

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500abc5M10M15M20M25M21059562.6021812047.218804114.19

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500abc110220330440550211.96200.97499.98MAX: 1593.95MAX: 1387.04MAX: 7223.32

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200abc4M8M12M16M20M17866468.2618637109.839365385.68

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200abc408012016020091.6785.64164.99MAX: 7883.93MAX: 1651.27MAX: 11718.16

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500abc4M8M12M16M20M16383690.9610851137.825724897.60

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500abc160320480640800284.41370.36757.34MAX: 2551.38MAX: 7948.19MAX: 21117.8

Dragonflydb

Clients Per Thread: 10 - Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:5abc300K600K900K1200K1500KSE +/- 124523.27, N = 2SE +/- 115490.54, N = 2SE +/- 156954.29, N = 21331051.901310891.481394989.581. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 20 - Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:5abc300K600K900K1200K1500KSE +/- 138897.05, N = 2SE +/- 146499.45, N = 2SE +/- 157442.25, N = 21620397.171572424.891561730.491. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 50 - Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:5abc300K600K900K1200K1500KSE +/- 35029.57, N = 2SE +/- 81765.99, N = 2SE +/- 148775.44, N = 21509161.501546553.321559859.231. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 10 - Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:10abc300K600K900K1200K1500KSE +/- 119195.59, N = 2SE +/- 90004.70, N = 2SE +/- 143882.01, N = 21283252.401275557.391325104.131. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 20 - Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:10abc300K600K900K1200K1500KSE +/- 93795.63, N = 2SE +/- 144264.17, N = 2SE +/- 153108.99, N = 21553942.491532531.221620928.161. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 50 - Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:10abc300K600K900K1200K1500KSE +/- 99951.62, N = 2SE +/- 111709.66, N = 2SE +/- 128692.32, N = 21559408.511545294.471574780.451. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 10 - Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:100abc300K600K900K1200K1500KSE +/- 117619.34, N = 2SE +/- 95110.31, N = 2SE +/- 141978.48, N = 21267626.251305698.481297415.251. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 20 - Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:100abc400K800K1200K1600K2000KSE +/- 66621.90, N = 2SE +/- 71249.46, N = 2SE +/- 168860.22, N = 21504800.261532061.631655250.861. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 50 - Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:100abc300K600K900K1200K1500KSE +/- 113274.82, N = 2SE +/- 98659.37, N = 2SE +/- 65233.84, N = 21590238.431538729.611548401.541. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writesabc9K18K27K36K45KSE +/- 2472.50, N = 2SE +/- 1520.50, N = 2SE +/- 2294.50, N = 2398644082339536


Phoronix Test Suite v10.8.5