Xeon E February Intel Xeon E-2288G testing with a Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102034-HA-XEONEFEBR09&sor .
Xeon E February Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenCL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Xeon E-2288G @ 5.00GHz (8 Cores / 16 Threads) Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) Intel Cannon Lake PCH 2 x 32 GB DDR4-2667MT/s Samsung M378A4G43MB1-CTD Samsung SSD 970 EVO Plus 250GB NVIDIA Quadro RTX 4000 8GB Intel Cannon Lake PCH cAVS Intel I219-LM + Intel I210 Ubuntu 20.10 5.8.0-41-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.9 NVIDIA OpenCL 1.2 CUDA 11.2.109 1.2.155 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled
Xeon E February quantlib: etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression qe: AUSURF112 lammps: Rhodopsin Protein lulesh: simdjson: Kostya simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID stockfish: Total Time build2: Time To Compile ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m kripke: 1 2 3 2798.9 1476.929 377.841 209.651 354.413 49 134 574 2018 103 2197 123 573 535 715 216 826 271 1565.14 6.853 4921.7766 0.61 0.47 0.77 0.79 17031729 170.558 19.23 5.09 3.92 4.78 3.87 6.78 1.87 14.74 64.33 15.21 12.78 28.22 27.89 20.06 13.53 32200890 2802.3 1412.188 376.421 209.535 353.819 49 134 569 2020 102 2186 122 572 532 712 216 826 271 1594.09 6.843 4908.8063 0.61 0.47 0.77 0.79 16996646 169.668 18.98 5.08 3.94 4.90 3.89 6.79 1.85 15.09 65.30 15.30 13.10 28.76 27.87 20.08 13.60 32018747 2795.5 1478.440 377.581 209.485 354.134 49 134 572 2020 102 2186 123 573 533 715 216 827 271 1546.90 6.826 4922.5669 0.61 0.47 0.77 0.79 17127732 169.128 18.87 5.07 4.10 4.78 4.09 6.86 1.89 14.87 64.50 15.36 12.92 28.18 28.25 20.27 13.42 31764537 OpenBenchmarking.org
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 2 1 3 600 1200 1800 2400 3000 SE +/- 3.82, N = 3 SE +/- 1.13, N = 3 SE +/- 8.87, N = 3 2802.3 2798.9 2795.5 1. (CXX) g++ options: -O3 -march=native -rdynamic
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 3 1 2 300 600 900 1200 1500 SE +/- 0.36, N = 3 SE +/- 2.04, N = 3 SE +/- 2.29, N = 3 1478.44 1476.93 1412.19 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 3 2 80 160 240 320 400 SE +/- 0.48, N = 3 SE +/- 0.34, N = 3 SE +/- 1.05, N = 3 377.84 377.58 376.42 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 50 100 150 200 250 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 209.65 209.54 209.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 3 2 80 160 240 320 400 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 SE +/- 0.24, N = 3 354.41 354.13 353.82 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 3 2 1 11 22 33 44 55 SE +/- 0.33, N = 3 49 49 49 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 3 2 1 30 60 90 120 150 SE +/- 0.33, N = 3 134 134 134 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 3 2 120 240 360 480 600 574 572 569 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 3 2 1 400 800 1200 1600 2000 SE +/- 1.20, N = 3 SE +/- 0.88, N = 3 SE +/- 1.76, N = 3 2020 2020 2018 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 3 2 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 1.00, N = 3 103 102 102 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 3 2 500 1000 1500 2000 2500 SE +/- 1.15, N = 3 SE +/- 5.67, N = 3 2197 2186 2186 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 3 1 2 30 60 90 120 150 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 123 123 122 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 3 1 2 120 240 360 480 600 SE +/- 0.67, N = 3 573 573 572 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 3 2 120 240 360 480 600 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 535 533 532 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 3 1 2 150 300 450 600 750 SE +/- 0.58, N = 3 715 715 712 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 3 2 1 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 216 216 216 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 3 2 1 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 827 826 826 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 3 2 1 60 120 180 240 300 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 271 271 271 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.7 Input: AUSURF112 3 1 2 300 600 900 1200 1500 SE +/- 2.60, N = 3 SE +/- 18.61, N = 3 SE +/- 20.18, N = 3 1546.90 1565.14 1594.09 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 2 4 6 8 10 SE +/- 0.031, N = 3 SE +/- 0.017, N = 3 SE +/- 0.004, N = 3 6.853 6.843 6.826 1. (CXX) g++ options: -O3 -pthread -lm
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 3 1 2 1100 2200 3300 4400 5500 SE +/- 2.96, N = 3 SE +/- 1.29, N = 3 SE +/- 11.46, N = 3 4922.57 4921.78 4908.81 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya 3 2 1 0.1373 0.2746 0.4119 0.5492 0.6865 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.61 0.61 0.61 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom 3 2 1 0.1058 0.2116 0.3174 0.4232 0.529 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.47 0.47 0.47 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets 3 2 1 0.1733 0.3466 0.5199 0.6932 0.8665 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.77 0.77 0.77 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID 3 2 1 0.1778 0.3556 0.5334 0.7112 0.889 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.79 0.79 0.79 1. (CXX) g++ options: -O3 -pthread
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 3 1 2 4M 8M 12M 16M 20M SE +/- 185355.58, N = 3 SE +/- 34943.70, N = 3 SE +/- 220484.45, N = 5 17127732 17031729 16996646 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 3 2 1 40 80 120 160 200 SE +/- 0.41, N = 3 SE +/- 0.48, N = 3 SE +/- 0.69, N = 3 169.13 169.67 170.56
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 3 2 1 5 10 15 20 25 SE +/- 0.13, N = 3 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 18.87 18.98 19.23 MIN: 18.56 / MAX: 19.41 MIN: 18.58 / MAX: 37.45 MIN: 18.19 / MAX: 79.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 3 2 1 1.1453 2.2906 3.4359 4.5812 5.7265 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 5.07 5.08 5.09 MIN: 4.97 / MAX: 5.33 MIN: 4.97 / MAX: 5.36 MIN: 4.99 / MAX: 6.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 0.9225 1.845 2.7675 3.69 4.6125 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.19, N = 3 3.92 3.94 4.10 MIN: 3.87 / MAX: 4.36 MIN: 3.9 / MAX: 4.57 MIN: 3.88 / MAX: 40.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 1 3 2 1.1025 2.205 3.3075 4.41 5.5125 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 4.78 4.78 4.90 MIN: 4.66 / MAX: 10.99 MIN: 4.68 / MAX: 5.13 MIN: 4.78 / MAX: 5.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 1 2 3 0.9203 1.8406 2.7609 3.6812 4.6015 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.21, N = 3 3.87 3.89 4.09 MIN: 3.81 / MAX: 4.16 MIN: 3.83 / MAX: 4.18 MIN: 3.84 / MAX: 100.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 SE +/- 0.21, N = 3 6.78 6.79 6.86 MIN: 6.45 / MAX: 7.15 MIN: 6.45 / MAX: 7.27 MIN: 6.43 / MAX: 7.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 2 1 3 0.4253 0.8506 1.2759 1.7012 2.1265 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 1.85 1.87 1.89 MIN: 1.72 / MAX: 2.07 MIN: 1.69 / MAX: 2.15 MIN: 1.68 / MAX: 2.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 1 3 2 4 8 12 16 20 SE +/- 0.41, N = 3 SE +/- 0.52, N = 3 SE +/- 0.68, N = 3 14.74 14.87 15.09 MIN: 13.7 / MAX: 16.27 MIN: 13.77 / MAX: 17.68 MIN: 13.78 / MAX: 111.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 3 2 15 30 45 60 75 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 0.32, N = 3 64.33 64.50 65.30 MIN: 63.96 / MAX: 67.59 MIN: 64.12 / MAX: 78.38 MIN: 64.15 / MAX: 177.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 15.21 15.30 15.36 MIN: 14.69 / MAX: 16.3 MIN: 14.59 / MAX: 15.79 MIN: 14.69 / MAX: 16.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 3 2 3 6 9 12 15 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 SE +/- 0.17, N = 3 12.78 12.92 13.10 MIN: 12.49 / MAX: 13.2 MIN: 12.55 / MAX: 13.91 MIN: 12.6 / MAX: 69.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 3 1 2 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.08, N = 3 SE +/- 0.23, N = 3 28.18 28.22 28.76 MIN: 27.79 / MAX: 35.47 MIN: 27.8 / MAX: 29.66 MIN: 27.36 / MAX: 145.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 2 1 3 7 14 21 28 35 SE +/- 0.28, N = 3 SE +/- 0.08, N = 3 SE +/- 0.56, N = 3 27.87 27.89 28.25 MIN: 27.13 / MAX: 28.96 MIN: 27.63 / MAX: 28.51 MIN: 27.12 / MAX: 134.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 1 2 3 5 10 15 20 25 SE +/- 0.14, N = 3 SE +/- 0.09, N = 3 SE +/- 0.46, N = 3 20.06 20.08 20.27 MIN: 19.26 / MAX: 87.17 MIN: 19.69 / MAX: 20.86 MIN: 19.36 / MAX: 122.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 3 1 2 3 6 9 12 15 SE +/- 0.23, N = 3 SE +/- 0.26, N = 3 SE +/- 0.24, N = 3 13.42 13.53 13.60 MIN: 12.88 / MAX: 14.1 MIN: 12.93 / MAX: 14.66 MIN: 13.03 / MAX: 14.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 2 3 7M 14M 21M 28M 35M SE +/- 56385.48, N = 3 SE +/- 108765.20, N = 3 SE +/- 61243.70, N = 3 32200890 32018747 31764537 1. (CXX) g++ options: -O3 -fopenmp
Phoronix Test Suite v10.8.4