Intel Core i7-1165G7 testing with a Dell 0GG9PT (3.15.0 BIOS) and Intel Xe TGL GT2 15GB on Ubuntu 23.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2308049-NE-NEWNEW95665 newnew - Phoronix Test Suite newnew Intel Core i7-1165G7 testing with a Dell 0GG9PT (3.15.0 BIOS) and Intel Xe TGL GT2 15GB on Ubuntu 23.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2308049-NE-NEWNEW95665&export=txt&grr&sro .
newnew Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution a b c Intel Core i7-1165G7 @ 4.70GHz (4 Cores / 8 Threads) Dell 0GG9PT (3.15.0 BIOS) Intel Tiger Lake-LP 16GB Kioxia KBG40ZNS256G NVMe 256GB Intel Xe TGL GT2 15GB (1300MHz) Realtek ALC289 Intel Wi-Fi 6 AX201 Ubuntu 23.04 6.2.0-24-generic (x86_64) GNOME Shell 44.0 X Server + Wayland 4.6 Mesa 23.0.2 GCC 12.2.0 ext4 1920x1200 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xa6 - Thermald 2.5.2 Java Details - OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu123.04) Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
newnew build-gcc: Time To Compile brl-cad: VGR Performance Metric vvenc: Bosphorus 4K - Fast vkfft: FFT + iFFT C2C 1D batched in single precision apache-iotdb: 200 - 100 - 500 apache-iotdb: 200 - 100 - 500 vvenc: Bosphorus 4K - Faster vkpeak: int16-vec4 vkpeak: int16-scalar vkpeak: int32-vec4 vkpeak: int32-scalar vkpeak: fp16-vec4 vkpeak: fp16-scalar vkpeak: fp32-vec4 vkpeak: fp32-scalar ncnn: Vulkan GPU - FastestDet ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet ncnn: CPU - FastestDet ncnn: CPU - vision_transformer ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet cassandra: Writes vvenc: Bosphorus 1080p - Fast vkfft: FFT + iFFT C2C 1D batched in single precision, no reshuffling vkfft: FFT + iFFT C2C 1D batched in half precision dragonflydb: 50 - 1:100 dragonflydb: 50 - 1:5 dragonflydb: 50 - 1:10 dragonflydb: 10 - 1:10 dragonflydb: 20 - 1:5 dragonflydb: 10 - 1:5 dragonflydb: 10 - 1:100 dragonflydb: 20 - 1:100 dragonflydb: 20 - 1:10 vkfft: FFT + iFFT C2C Bluestein in single precision apache-iotdb: 100 - 100 - 500 apache-iotdb: 100 - 100 - 500 apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 100 - 200 vvenc: Bosphorus 1080p - Faster vkfft: FFT + iFFT C2C multidimensional in single precision apache-iotdb: 100 - 100 - 200 apache-iotdb: 100 - 100 - 200 vkresample: 2x - Single apache-iotdb: 200 - 1 - 500 apache-iotdb: 200 - 1 - 500 apache-iotdb: 200 - 1 - 200 apache-iotdb: 200 - 1 - 200 apache-iotdb: 100 - 1 - 500 apache-iotdb: 100 - 1 - 500 vkfft: FFT + iFFT R2C / C2R apache-iotdb: 100 - 1 - 200 apache-iotdb: 100 - 1 - 200 vkfft: FFT + iFFT C2C 1D batched in double precision a b c 2404.774 52008 1.654 7486 284.41 16383690.96 3.680 979.25 907.91 493.63 474.88 3182.23 2309.29 1478.58 934.74 3.97 207.46 11.19 13.05 29.60 28.84 8.68 11.18 59.03 15.86 1.26 6.99 3.93 3.50 3.60 4.66 21.07 3.97 189.40 11.76 12.95 29.44 28.74 8.61 11.17 59.09 15.82 1.32 8.85 4.55 4.03 3.58 4.60 20.74 39864 5.14 8176 14246 1590238.43 1509161.50 1559408.51 1283252.40 1620397.17 1331051.90 1267626.25 1504800.26 1553942.49 1033 211.96 21059562.6 91.67 17866468.26 12.796 4944 68.48 22588608.67 100.009 22.08 1696504.02 12 1035973.61 24.98 1265418.91 5585 16.78 640563.22 2404.320 51967 1.650 7482 370.36 10851137.82 3.781 979.3 907.82 493.63 474.86 3182.01 2309.19 1478.50 935.02 3.94 193.90 8.57 13.01 29.23 28.71 8.08 10.22 57.88 14.09 0.99 6.98 3.89 3.49 3.59 4.63 20.65 3.91 204.36 8.61 12.23 29.03 26.57 7.43 9.25 54.25 12.78 0.94 6.92 3.86 3.47 3.59 4.60 20.74 40823 5.124 8176 14232 1538729.61 1546553.32 1545294.47 1275557.39 1572424.89 1310891.48 1305698.48 1532061.63 1532531.22 1034 200.97 21812047.21 85.64 18637109.83 12.637 4937 74.08 21253793.53 100.010 23.1 1651562.36 12.52 1007122.62 25.14 1247796 5589 16.41 644833 2394.139 51880 1.677 7478 757.34 5724897.6 3.853 979.23 907.87 493.65 474.89 3182.29 2309.28 1479.04 934.81 4.03 188.79 9.62 13.00 29.11 28.78 8.15 10.40 58.28 14.01 1.08 6.81 3.87 3.49 3.54 4.60 20.69 3.89 200.23 8.39 12.05 29.25 26.57 7.30 9.19 54.13 12.42 0.96 6.64 3.83 3.47 3.51 4.53 20.74 39536 5.249 8183 14241 1548401.54 1559859.23 1574780.45 1325104.13 1561730.49 1394989.58 1297415.25 1655250.86 1620928.16 1035 499.98 8804114.19 164.99 9365385.68 13.802 5087 76.87 17612257.13 100.009 22.63 1659495.29 12.78 995398.43 24.73 1266283.94 5688 16.03 655887.03 OpenBenchmarking.org
Timed GCC Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GCC Compilation 13.2 Time To Compile a b c 500 1000 1500 2000 2500 SE +/- 0.32, N = 2 SE +/- 2.12, N = 2 SE +/- 1.40, N = 2 2404.77 2404.32 2394.14
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.36 VGR Performance Metric a b c 11K 22K 33K 44K 55K SE +/- 16.50, N = 2 SE +/- 147.00, N = 2 SE +/- 34.00, N = 2 52008 51967 51880 1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast a b c 0.3773 0.7546 1.1319 1.5092 1.8865 SE +/- 0.037, N = 2 SE +/- 0.028, N = 2 SE +/- 0.036, N = 2 1.654 1.650 1.677 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
VkFFT Test: FFT + iFFT C2C 1D batched in single precision OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in single precision a b c 1600 3200 4800 6400 8000 SE +/- 14.00, N = 2 SE +/- 4.00, N = 2 SE +/- 3.50, N = 2 7486 7482 7478 1. (CXX) g++ options: -O3
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 a b c 160 320 480 640 800 284.41 370.36 757.34 MAX: 2551.38 MAX: 7948.19 MAX: 21117.8
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 a b c 4M 8M 12M 16M 20M 16383690.96 10851137.82 5724897.60
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster a b c 0.8669 1.7338 2.6007 3.4676 4.3345 SE +/- 0.014, N = 2 SE +/- 0.014, N = 2 SE +/- 0.081, N = 2 3.680 3.781 3.853 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
vkpeak int16-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20230730 int16-vec4 a b c 200 400 600 800 1000 SE +/- 0.08, N = 2 SE +/- 0.05, N = 2 SE +/- 0.00, N = 2 979.25 979.30 979.23
vkpeak int16-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20230730 int16-scalar a b c 200 400 600 800 1000 SE +/- 0.10, N = 2 SE +/- 0.01, N = 2 SE +/- 0.03, N = 2 907.91 907.82 907.87
vkpeak int32-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20230730 int32-vec4 a b c 110 220 330 440 550 SE +/- 0.02, N = 2 SE +/- 0.00, N = 2 SE +/- 0.00, N = 2 493.63 493.63 493.65
vkpeak int32-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20230730 int32-scalar a b c 100 200 300 400 500 SE +/- 0.03, N = 2 SE +/- 0.00, N = 2 SE +/- 0.00, N = 2 474.88 474.86 474.89
vkpeak fp16-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20230730 fp16-vec4 a b c 700 1400 2100 2800 3500 SE +/- 0.11, N = 2 SE +/- 0.04, N = 2 SE +/- 0.12, N = 2 3182.23 3182.01 3182.29
vkpeak fp16-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20230730 fp16-scalar a b c 500 1000 1500 2000 2500 SE +/- 0.07, N = 2 SE +/- 0.02, N = 2 SE +/- 0.04, N = 2 2309.29 2309.19 2309.28
vkpeak fp32-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20230730 fp32-vec4 a b c 300 600 900 1200 1500 SE +/- 0.52, N = 2 SE +/- 0.51, N = 2 SE +/- 0.00, N = 2 1478.58 1478.50 1479.04
vkpeak fp32-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20230730 fp32-scalar a b c 200 400 600 800 1000 SE +/- 0.27, N = 2 SE +/- 0.00, N = 2 SE +/- 0.29, N = 2 934.74 935.02 934.81
NCNN Target: Vulkan GPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet a b c 0.9068 1.8136 2.7204 3.6272 4.534 SE +/- 0.09, N = 2 SE +/- 0.00, N = 2 SE +/- 0.10, N = 2 3.97 3.94 4.03 MIN: 3.7 / MAX: 15.04 MIN: 3.73 / MAX: 12.78 MIN: 3.74 / MAX: 10.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer a b c 50 100 150 200 250 SE +/- 8.01, N = 2 SE +/- 0.73, N = 2 SE +/- 0.68, N = 2 207.46 193.90 188.79 MIN: 169.78 / MAX: 244.36 MIN: 167.64 / MAX: 231.56 MIN: 169.1 / MAX: 243.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m a b c 3 6 9 12 15 SE +/- 0.30, N = 2 SE +/- 0.01, N = 2 SE +/- 1.13, N = 2 11.19 8.57 9.62 MIN: 10.38 / MAX: 21.88 MIN: 8.18 / MAX: 21.08 MIN: 8.23 / MAX: 23.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd a b c 3 6 9 12 15 SE +/- 0.04, N = 2 SE +/- 0.01, N = 2 SE +/- 0.03, N = 2 13.05 13.01 13.00 MIN: 12.7 / MAX: 23.56 MIN: 12.67 / MAX: 27.85 MIN: 12.68 / MAX: 23.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny a b c 7 14 21 28 35 SE +/- 0.26, N = 2 SE +/- 0.15, N = 2 SE +/- 0.05, N = 2 29.60 29.23 29.11 MIN: 28.6 / MAX: 44.5 MIN: 28.34 / MAX: 41.02 MIN: 28.38 / MAX: 40.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 a b c 7 14 21 28 35 SE +/- 0.19, N = 2 SE +/- 0.12, N = 2 SE +/- 0.15, N = 2 28.84 28.71 28.78 MIN: 27.92 / MAX: 41.94 MIN: 27.9 / MAX: 39.22 MIN: 27.96 / MAX: 39.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet a b c 2 4 6 8 10 SE +/- 0.04, N = 2 SE +/- 0.52, N = 2 SE +/- 0.55, N = 2 8.68 8.08 8.15 MIN: 8.25 / MAX: 19.5 MIN: 7.01 / MAX: 17.55 MIN: 7.01 / MAX: 17.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 a b c 3 6 9 12 15 SE +/- 0.02, N = 2 SE +/- 1.05, N = 2 SE +/- 0.82, N = 2 11.18 10.22 10.40 MIN: 10.62 / MAX: 21.34 MIN: 8.49 / MAX: 20.91 MIN: 8.57 / MAX: 21.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 a b c 13 26 39 52 65 SE +/- 0.15, N = 2 SE +/- 1.40, N = 2 SE +/- 0.65, N = 2 59.03 57.88 58.28 MIN: 57.6 / MAX: 79.49 MIN: 52 / MAX: 70.84 MIN: 52.14 / MAX: 69.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet a b c 4 8 12 16 20 SE +/- 0.02, N = 2 SE +/- 1.53, N = 2 SE +/- 1.79, N = 2 15.86 14.09 14.01 MIN: 15.16 / MAX: 26.57 MIN: 12.03 / MAX: 25.74 MIN: 11.83 / MAX: 25.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface a b c 0.2835 0.567 0.8505 1.134 1.4175 SE +/- 0.02, N = 2 SE +/- 0.05, N = 2 SE +/- 0.14, N = 2 1.26 0.99 1.08 MIN: 1.16 / MAX: 4.12 MIN: 0.91 / MAX: 3.77 MIN: 0.91 / MAX: 3.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 a b c 2 4 6 8 10 SE +/- 0.02, N = 2 SE +/- 0.02, N = 2 SE +/- 0.15, N = 2 6.99 6.98 6.81 MIN: 6.55 / MAX: 16.31 MIN: 6.57 / MAX: 17.79 MIN: 6.42 / MAX: 17.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet a b c 0.8843 1.7686 2.6529 3.5372 4.4215 SE +/- 0.01, N = 2 SE +/- 0.03, N = 2 SE +/- 0.04, N = 2 3.93 3.89 3.87 MIN: 3.73 / MAX: 14.41 MIN: 3.74 / MAX: 12.97 MIN: 3.71 / MAX: 12.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 a b c 0.7875 1.575 2.3625 3.15 3.9375 SE +/- 0.01, N = 2 SE +/- 0.01, N = 2 SE +/- 0.02, N = 2 3.50 3.49 3.49 MIN: 3.35 / MAX: 12.33 MIN: 3.33 / MAX: 13.62 MIN: 3.35 / MAX: 11.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 a b c 0.81 1.62 2.43 3.24 4.05 SE +/- 0.03, N = 2 SE +/- 0.02, N = 2 SE +/- 0.01, N = 2 3.60 3.59 3.54 MIN: 3.38 / MAX: 12.52 MIN: 3.4 / MAX: 12.21 MIN: 3.33 / MAX: 11.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 a b c 1.0485 2.097 3.1455 4.194 5.2425 SE +/- 0.00, N = 2 SE +/- 0.03, N = 2 SE +/- 0.04, N = 2 4.66 4.63 4.60 MIN: 4.46 / MAX: 14.54 MIN: 4.44 / MAX: 14.09 MIN: 4.38 / MAX: 14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet a b c 5 10 15 20 25 SE +/- 0.19, N = 2 SE +/- 0.03, N = 2 SE +/- 0.03, N = 2 21.07 20.65 20.69 MIN: 20.27 / MAX: 31.83 MIN: 20.22 / MAX: 31.66 MIN: 20.24 / MAX: 31.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet a b c 0.8933 1.7866 2.6799 3.5732 4.4665 SE +/- 0.00, N = 2 SE +/- 0.00, N = 2 SE +/- 0.03, N = 2 3.97 3.91 3.89 MIN: 3.76 / MAX: 14.59 MIN: 3.71 / MAX: 13.7 MIN: 3.67 / MAX: 13.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer a b c 40 80 120 160 200 SE +/- 4.42, N = 2 SE +/- 3.85, N = 2 SE +/- 5.48, N = 2 189.40 204.36 200.23 MIN: 169.61 / MAX: 225.69 MIN: 168.8 / MAX: 231.88 MIN: 169.16 / MAX: 234.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m a b c 3 6 9 12 15 SE +/- 0.10, N = 2 SE +/- 0.03, N = 2 SE +/- 0.01, N = 2 11.76 8.61 8.39 MIN: 11.26 / MAX: 22.16 MIN: 8.21 / MAX: 17.64 MIN: 8.14 / MAX: 18.42 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd a b c 3 6 9 12 15 SE +/- 0.03, N = 2 SE +/- 0.86, N = 2 SE +/- 0.97, N = 2 12.95 12.23 12.05 MIN: 12.67 / MAX: 23.3 MIN: 10.77 / MAX: 31.43 MIN: 10.72 / MAX: 23.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny a b c 7 14 21 28 35 SE +/- 0.42, N = 2 SE +/- 0.03, N = 2 SE +/- 0.03, N = 2 29.44 29.03 29.25 MIN: 28.43 / MAX: 46.03 MIN: 28.45 / MAX: 40.14 MIN: 28.42 / MAX: 40.16 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 a b c 7 14 21 28 35 SE +/- 0.18, N = 2 SE +/- 2.23, N = 2 SE +/- 2.20, N = 2 28.74 26.57 26.57 MIN: 27.89 / MAX: 40 MIN: 23.28 / MAX: 38.91 MIN: 23.37 / MAX: 38.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet a b c 2 4 6 8 10 SE +/- 0.04, N = 2 SE +/- 0.09, N = 2 SE +/- 0.25, N = 2 8.61 7.43 7.30 MIN: 8.27 / MAX: 20.05 MIN: 6.9 / MAX: 16.46 MIN: 6.79 / MAX: 17.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 a b c 3 6 9 12 15 SE +/- 0.04, N = 2 SE +/- 0.11, N = 2 SE +/- 0.50, N = 2 11.17 9.25 9.19 MIN: 10.66 / MAX: 21.33 MIN: 8.51 / MAX: 24.74 MIN: 8.38 / MAX: 20.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 a b c 13 26 39 52 65 SE +/- 0.11, N = 2 SE +/- 1.96, N = 2 SE +/- 2.37, N = 2 59.09 54.25 54.13 MIN: 57.54 / MAX: 76.29 MIN: 50.32 / MAX: 71.7 MIN: 48.55 / MAX: 72.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet a b c 4 8 12 16 20 SE +/- 0.07, N = 2 SE +/- 0.20, N = 2 SE +/- 0.27, N = 2 15.82 12.78 12.42 MIN: 15.12 / MAX: 28.62 MIN: 11.9 / MAX: 22.46 MIN: 11.68 / MAX: 23.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface a b c 0.297 0.594 0.891 1.188 1.485 SE +/- 0.03, N = 2 SE +/- 0.02, N = 2 SE +/- 0.03, N = 2 1.32 0.94 0.96 MIN: 1.2 / MAX: 4.08 MIN: 0.9 / MAX: 1.09 MIN: 0.9 / MAX: 5.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 a b c 2 4 6 8 10 SE +/- 0.29, N = 2 SE +/- 0.04, N = 2 SE +/- 0.05, N = 2 8.85 6.92 6.64 MIN: 6.6 / MAX: 21.28 MIN: 6.5 / MAX: 16.04 MIN: 6.38 / MAX: 17.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet a b c 1.0238 2.0476 3.0714 4.0952 5.119 SE +/- 0.63, N = 2 SE +/- 0.07, N = 2 SE +/- 0.01, N = 2 4.55 3.86 3.83 MIN: 3.81 / MAX: 15.97 MIN: 3.69 / MAX: 12.48 MIN: 3.68 / MAX: 12.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 a b c 0.9068 1.8136 2.7204 3.6272 4.534 SE +/- 0.57, N = 2 SE +/- 0.01, N = 2 SE +/- 0.02, N = 2 4.03 3.47 3.47 MIN: 3.35 / MAX: 10.89 MIN: 3.32 / MAX: 12.41 MIN: 3.36 / MAX: 13.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 a b c 0.8078 1.6156 2.4234 3.2312 4.039 SE +/- 0.03, N = 2 SE +/- 0.04, N = 2 SE +/- 0.00, N = 2 3.58 3.59 3.51 MIN: 3.35 / MAX: 13.78 MIN: 3.37 / MAX: 14.17 MIN: 3.32 / MAX: 11.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 a b c 1.035 2.07 3.105 4.14 5.175 SE +/- 0.04, N = 2 SE +/- 0.02, N = 2 SE +/- 0.02, N = 2 4.60 4.60 4.53 MIN: 4.38 / MAX: 12.37 MIN: 4.34 / MAX: 14.83 MIN: 4.3 / MAX: 15.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mobilenet a b c 5 10 15 20 25 SE +/- 0.05, N = 2 SE +/- 0.04, N = 2 SE +/- 0.07, N = 2 20.74 20.74 20.74 MIN: 20.17 / MAX: 31.42 MIN: 20.23 / MAX: 31.9 MIN: 20.26 / MAX: 32.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.1.3 Test: Writes a b c 9K 18K 27K 36K 45K SE +/- 2472.50, N = 2 SE +/- 1520.50, N = 2 SE +/- 2294.50, N = 2 39864 40823 39536
VVenC Video Input: Bosphorus 1080p - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 1080p - Video Preset: Fast a b c 1.181 2.362 3.543 4.724 5.905 SE +/- 0.000, N = 2 SE +/- 0.008, N = 2 SE +/- 0.115, N = 2 5.140 5.124 5.249 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
VkFFT Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling a b c 2K 4K 6K 8K 10K SE +/- 4.00, N = 2 SE +/- 0.50, N = 2 SE +/- 5.00, N = 2 8176 8176 8183 1. (CXX) g++ options: -O3
VkFFT Test: FFT + iFFT C2C 1D batched in half precision OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in half precision a b c 3K 6K 9K 12K 15K SE +/- 5.00, N = 2 SE +/- 1.50, N = 2 SE +/- 4.00, N = 2 14246 14232 14241 1. (CXX) g++ options: -O3
Dragonflydb Clients Per Thread: 50 - Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 50 - Set To Get Ratio: 1:100 a b c 300K 600K 900K 1200K 1500K SE +/- 113274.82, N = 2 SE +/- 98659.37, N = 2 SE +/- 65233.84, N = 2 1590238.43 1538729.61 1548401.54 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 50 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 50 - Set To Get Ratio: 1:5 a b c 300K 600K 900K 1200K 1500K SE +/- 35029.57, N = 2 SE +/- 81765.99, N = 2 SE +/- 148775.44, N = 2 1509161.50 1546553.32 1559859.23 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 50 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 50 - Set To Get Ratio: 1:10 a b c 300K 600K 900K 1200K 1500K SE +/- 99951.62, N = 2 SE +/- 111709.66, N = 2 SE +/- 128692.32, N = 2 1559408.51 1545294.47 1574780.45 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 10 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:10 a b c 300K 600K 900K 1200K 1500K SE +/- 119195.59, N = 2 SE +/- 90004.70, N = 2 SE +/- 143882.01, N = 2 1283252.40 1275557.39 1325104.13 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 20 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 20 - Set To Get Ratio: 1:5 a b c 300K 600K 900K 1200K 1500K SE +/- 138897.05, N = 2 SE +/- 146499.45, N = 2 SE +/- 157442.25, N = 2 1620397.17 1572424.89 1561730.49 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 10 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:5 a b c 300K 600K 900K 1200K 1500K SE +/- 124523.27, N = 2 SE +/- 115490.54, N = 2 SE +/- 156954.29, N = 2 1331051.90 1310891.48 1394989.58 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 10 - Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:100 a b c 300K 600K 900K 1200K 1500K SE +/- 117619.34, N = 2 SE +/- 95110.31, N = 2 SE +/- 141978.48, N = 2 1267626.25 1305698.48 1297415.25 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 20 - Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 20 - Set To Get Ratio: 1:100 a b c 400K 800K 1200K 1600K 2000K SE +/- 66621.90, N = 2 SE +/- 71249.46, N = 2 SE +/- 168860.22, N = 2 1504800.26 1532061.63 1655250.86 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Dragonflydb Clients Per Thread: 20 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 20 - Set To Get Ratio: 1:10 a b c 300K 600K 900K 1200K 1500K SE +/- 93795.63, N = 2 SE +/- 144264.17, N = 2 SE +/- 153108.99, N = 2 1553942.49 1532531.22 1620928.16 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
VkFFT Test: FFT + iFFT C2C Bluestein in single precision OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C Bluestein in single precision a b c 200 400 600 800 1000 SE +/- 0.50, N = 2 SE +/- 1.00, N = 2 SE +/- 1.00, N = 2 1033 1034 1035 1. (CXX) g++ options: -O3
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 a b c 110 220 330 440 550 211.96 200.97 499.98 MAX: 1593.95 MAX: 1387.04 MAX: 7223.32
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 a b c 5M 10M 15M 20M 25M 21059562.60 21812047.21 8804114.19
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 a b c 40 80 120 160 200 91.67 85.64 164.99 MAX: 7883.93 MAX: 1651.27 MAX: 11718.16
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 a b c 4M 8M 12M 16M 20M 17866468.26 18637109.83 9365385.68
VVenC Video Input: Bosphorus 1080p - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 1080p - Video Preset: Faster a b c 4 8 12 16 20 SE +/- 0.08, N = 2 SE +/- 0.11, N = 2 SE +/- 0.79, N = 2 12.80 12.64 13.80 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
VkFFT Test: FFT + iFFT C2C multidimensional in single precision OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C multidimensional in single precision a b c 1100 2200 3300 4400 5500 SE +/- 17.50, N = 2 SE +/- 6.50, N = 2 SE +/- 9.00, N = 2 4944 4937 5087 1. (CXX) g++ options: -O3
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 a b c 20 40 60 80 100 68.48 74.08 76.87 MAX: 1696.56 MAX: 1547.17 MAX: 9168.34
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 a b c 5M 10M 15M 20M 25M 22588608.67 21253793.53 17612257.13
VkResample Upscale: 2x - Precision: Single OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Single a b c 20 40 60 80 100 SE +/- 0.00, N = 2 SE +/- 0.00, N = 2 SE +/- 0.00, N = 2 100.01 100.01 100.01 1. (CXX) g++ options: -O3
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 a b c 6 12 18 24 30 22.08 23.10 22.63 MAX: 863.43 MAX: 906.47 MAX: 909.27
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 a b c 400K 800K 1200K 1600K 2000K 1696504.02 1651562.36 1659495.29
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 a b c 3 6 9 12 15 12.00 12.52 12.78 MAX: 771.24 MAX: 794.23 MAX: 800.68
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 a b c 200K 400K 600K 800K 1000K 1035973.61 1007122.62 995398.43
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 a b c 6 12 18 24 30 24.98 25.14 24.73 MAX: 1064.56 MAX: 1026.24 MAX: 998.23
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 a b c 300K 600K 900K 1200K 1500K 1265418.91 1247796.00 1266283.94
VkFFT Test: FFT + iFFT R2C / C2R OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.2.31 Test: FFT + iFFT R2C / C2R a b c 1200 2400 3600 4800 6000 SE +/- 3.50, N = 2 SE +/- 30.50, N = 2 SE +/- 69.00, N = 2 5585 5589 5688 1. (CXX) g++ options: -O3
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 a b c 4 8 12 16 20 16.78 16.41 16.03 MAX: 906.87 MAX: 1017.33 MAX: 1027.73
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 a b c 140K 280K 420K 560K 700K 640563.22 644833.00 655887.03
Phoronix Test Suite v10.8.4