3900XT August AMD Ryzen 9 3900XT 12-Core testing with a MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) and AMD Radeon RX 56/64 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108220-PTS-3900XTAU43&grw&rdt .
3900XT August Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 AMD Ryzen 9 3900XT 12-Core @ 3.80GHz (12 Cores / 24 Threads) MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) AMD Starship/Matisse 16GB 500GB Seagate FireCuda 520 SSD ZP500GM30002 AMD Radeon RX 56/64 8GB (1630/945MHz) AMD Vega 10 HDMI Audio ASUS MG28U Realtek Device 2600 + Realtek Device 3000 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.0-rc1-phx (x86_64) 20201228 GNOME Shell 3.38.1 X Server 1.20.9 4.6 Mesa 20.2.1 (LLVM 11.0.0) 1.2.131 GCC 10.3.0 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8701021 Graphics Details - GLAMOR Java Details - OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10) Python Details - Python 2.7.18 + Python 3.8.10 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3900XT August ecp-candle: P1B2 ecp-candle: P3B1 ecp-candle: P3B2 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface gravitymark: 3840 x 2160 - Vulkan gravitymark: 2560 x 1440 - Vulkan ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface gravitymark: 1920 x 1200 - Vulkan ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m qe: AUSURF112 tachyon: Total Time dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit openvkl: vklBenchmark ISPC openvkl: vklBenchmark Scalar yafaray: Total Time For Sample Scene nginx: 1 nginx: 20 nginx: 100 nginx: 200 nginx: 500 nginx: 1000 apache: 1 apache: 20 apache: 100 apache: 200 apache: 500 apache: 1000 gravitymark: 1920 x 1080 - Vulkan keydb: cassandra: Reads cassandra: Writes cassandra: Mixed 1:1 cassandra: Mixed 1:3 1 2 3 4 187.563 1242.733 760.932 16.15 5.19 5.02 4.61 6.84 2.13 61.4 85.2 4.55 17.02 59.59 16.92 12.63 25.88 25.86 18.38 11.84 8.45 2.85 3.81 2.66 3.01 9.6 1.57 100.2 5.69 9.97 2.42 3.95 6 11.45 5.54 4.98 544.33 60.559 498.18 193.64 437.38 361.53 66 38 101.974 51487.16 266318.67 276959.91 260145.65 242814.58 233289.6 9819.79 66526.1 65160.84 63185.75 58422.38 55930.05 100.9 600652.06 123479 127426 95847 92247 41.063 1241.392 759.863 16.09 5.15 5.03 4.61 6.8 2.1 61.5 86.3 4.54 15.98 59.56 16 12.36 25.81 25.84 18.53 11.8 8.41 2.86 3.84 2.66 3.02 9.69 1.58 99.7 5.71 9.9 2.38 3.95 6 11.45 5.54 4.99 534.37 60.7376 503.15 209.38 441.14 368 66 38 102.554 51412.14 265150.9 274678.21 259053.33 241096.6 232542.88 9833.46 67436.4 64993.78 62187.44 58016.4 55861.84 100.9 602060.78 108039 137378 109185 104788 41.807 1243.462 763.125 16.08 5.18 5.00 4.60 6.79 2.1 61.1 86.3 4.55 16.01 59.70 16.01 12.45 25.83 25.80 18.45 11.79 8.39 2.86 3.82 2.66 3.01 10.36 1.58 99.4 5.70 10.15 2.39 3.96 6.02 11.37 5.56 5.00 541.64 60.5657 503.72 207.39 435.40 367.16 66 38 102.969 51412.19 263918.89 276564.47 259157.05 242729.86 231805.36 9868.89 67073.14 64743.08 62443.50 58124.42 55057.75 100.5 602741.90 106972 131031 103053 97929 40.991 1244.139 747.022 16.16 5.21 5.01 4.62 6.8 2.1 61 85.9 4.56 15.96 59.52 16.05 12.37 25.87 25.97 18.52 11.75 8.61 2.86 3.81 2.64 3.01 9.93 1.58 100.1 5.68 9.72 2.37 3.96 6.01 11.42 5.59 5.01 532.54 60.7096 496.97 205.4 433.79 365.79 66 38 102.552 51364.07 262979.25 274091.83 256384.27 240391.97 229994.85 10254.01 66776.03 64953.13 62016.76 58148.68 55570.45 100.8 594994.72 103621 137372 102546 101007 OpenBenchmarking.org
ECP-CANDLE Benchmark: P1B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P1B2 1 2 3 4 40 80 120 160 200 187.56 41.06 41.81 40.99
ECP-CANDLE Benchmark: P3B1 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B1 1 2 3 4 300 600 900 1200 1500 1242.73 1241.39 1243.46 1244.14
ECP-CANDLE Benchmark: P3B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B2 1 2 3 4 160 320 480 640 800 760.93 759.86 763.13 747.02
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet 2 1 3 4 4 8 12 16 20 SE +/- 0.02, N = 3 16.01 16.15 16.08 16.16 MIN: 15.76 / MAX: 16.56 MIN: 15.89 / MAX: 16.79 MIN: 15.82 / MAX: 20.04 MIN: 15.87 / MAX: 18.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 2 1 3 4 1.1723 2.3446 3.5169 4.6892 5.8615 SE +/- 0.01, N = 3 5.21 5.19 5.18 5.21 MIN: 5.08 / MAX: 11.5 MIN: 5.07 / MAX: 5.41 MIN: 5.06 / MAX: 5.47 MIN: 5.11 / MAX: 5.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 2 1 3 4 1.1318 2.2636 3.3954 4.5272 5.659 SE +/- 0.00, N = 3 5.01 5.02 5.00 5.01 MIN: 4.96 / MAX: 6.46 MIN: 4.99 / MAX: 5.08 MIN: 4.96 / MAX: 5.19 MIN: 4.97 / MAX: 5.14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet 2 1 3 4 1.0395 2.079 3.1185 4.158 5.1975 SE +/- 0.01, N = 3 4.59 4.61 4.60 4.62 MIN: 4.55 / MAX: 4.79 MIN: 4.55 / MAX: 4.8 MIN: 4.55 / MAX: 5.11 MIN: 4.56 / MAX: 4.79 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 2 1 3 4 2 4 6 8 10 SE +/- 0.00, N = 3 6.77 6.84 6.79 6.80 MIN: 6.71 / MAX: 6.87 MIN: 6.72 / MAX: 12.01 MIN: 6.72 / MAX: 7.38 MIN: 6.73 / MAX: 7.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface 2 1 3 4 0.4793 0.9586 1.4379 1.9172 2.3965 SE +/- 0.00, N = 3 2.09 2.13 2.10 2.10 MIN: 2.06 / MAX: 2.18 MIN: 2.09 / MAX: 2.22 MIN: 2.06 / MAX: 2.21 MIN: 2.08 / MAX: 2.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
GravityMark Resolution: 3840 x 2160 - Renderer: Vulkan OpenBenchmarking.org Frames Per Second, More Is Better GravityMark 1.2 Resolution: 3840 x 2160 - Renderer: Vulkan 1 2 3 4 14 28 42 56 70 SE +/- 0.06, N = 3 61.4 61.5 61.1 61.0
GravityMark Resolution: 2560 x 1440 - Renderer: Vulkan OpenBenchmarking.org Frames Per Second, More Is Better GravityMark 1.2 Resolution: 2560 x 1440 - Renderer: Vulkan 1 2 3 4 20 40 60 80 100 SE +/- 0.22, N = 3 85.2 86.3 86.3 85.9
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 2 1 3 4 1.026 2.052 3.078 4.104 5.13 SE +/- 0.01, N = 3 4.55 4.55 4.55 4.56 MIN: 4.48 / MAX: 6.2 MIN: 4.48 / MAX: 4.64 MIN: 4.47 / MAX: 6.8 MIN: 4.49 / MAX: 4.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet 2 1 3 4 4 8 12 16 20 SE +/- 0.08, N = 3 15.90 17.02 16.01 15.96 MIN: 15.55 / MAX: 16.84 MIN: 16.66 / MAX: 17.5 MIN: 15.56 / MAX: 16.67 MIN: 15.6 / MAX: 16.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 2 1 3 4 13 26 39 52 65 SE +/- 0.02, N = 3 59.34 59.59 59.70 59.52 MIN: 58.74 / MAX: 60.09 MIN: 58.73 / MAX: 63.45 MIN: 58.94 / MAX: 64.68 MIN: 58.76 / MAX: 60 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 1 2 3 4 4 8 12 16 20 SE +/- 0.02, N = 3 16.92 16.00 16.01 16.05 MIN: 16.77 / MAX: 17.18 MIN: 15.88 / MAX: 16.12 MIN: 15.86 / MAX: 17.46 MIN: 15.94 / MAX: 16.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet 2 1 3 4 3 6 9 12 15 SE +/- 0.08, N = 3 12.35 12.63 12.45 12.37 MIN: 12.24 / MAX: 12.91 MIN: 12.51 / MAX: 12.76 MIN: 12.24 / MAX: 14.38 MIN: 12.26 / MAX: 12.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 2 1 3 4 6 12 18 24 30 SE +/- 0.01, N = 3 25.83 25.88 25.83 25.87 MIN: 25.68 / MAX: 28.12 MIN: 25.72 / MAX: 33.46 MIN: 25.66 / MAX: 27.89 MIN: 25.71 / MAX: 27.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny 2 1 3 4 6 12 18 24 30 SE +/- 0.01, N = 3 25.78 25.86 25.80 25.97 MIN: 25.63 / MAX: 26.06 MIN: 25.72 / MAX: 26.33 MIN: 25.58 / MAX: 33.68 MIN: 25.67 / MAX: 49.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd 2 1 3 4 5 10 15 20 25 SE +/- 0.02, N = 3 18.39 18.38 18.45 18.52 MIN: 18.08 / MAX: 20.38 MIN: 18.02 / MAX: 18.83 MIN: 18.01 / MAX: 18.9 MIN: 18.17 / MAX: 19.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m 2 1 3 4 3 6 9 12 15 SE +/- 0.01, N = 3 11.72 11.84 11.79 11.75 MIN: 11.64 / MAX: 12.25 MIN: 11.76 / MAX: 12.06 MIN: 11.69 / MAX: 19.48 MIN: 11.67 / MAX: 14.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: mobilenet 1 2 3 4 2 4 6 8 10 SE +/- 0.06, N = 3 8.45 8.41 8.39 8.61 MIN: 7.95 / MAX: 14.18 MIN: 7.95 / MAX: 10.38 MIN: 7.93 / MAX: 10.4 MIN: 7.92 / MAX: 10.42 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 4 0.6435 1.287 1.9305 2.574 3.2175 SE +/- 0.01, N = 3 2.85 2.86 2.86 2.86 MIN: 2.76 / MAX: 3.74 MIN: 2.76 / MAX: 4.19 MIN: 2.76 / MAX: 3.75 MIN: 2.77 / MAX: 3.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 0.864 1.728 2.592 3.456 4.32 SE +/- 0.01, N = 3 3.81 3.84 3.82 3.81 MIN: 3.75 / MAX: 4.95 MIN: 3.75 / MAX: 9.33 MIN: 3.75 / MAX: 4.96 MIN: 3.75 / MAX: 4.95 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 4 0.5985 1.197 1.7955 2.394 2.9925 SE +/- 0.00, N = 3 2.66 2.66 2.66 2.64 MIN: 2.53 / MAX: 3.8 MIN: 2.53 / MAX: 3.7 MIN: 2.54 / MAX: 3.62 MIN: 2.53 / MAX: 3.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: mnasnet 1 2 3 4 0.6795 1.359 2.0385 2.718 3.3975 SE +/- 0.00, N = 3 3.01 3.02 3.01 3.01 MIN: 2.93 / MAX: 3.96 MIN: 2.93 / MAX: 4.36 MIN: 2.93 / MAX: 3.97 MIN: 2.93 / MAX: 3.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 4 3 6 9 12 15 SE +/- 0.33, N = 3 9.60 9.69 10.36 9.93 MIN: 8.9 / MAX: 24.56 MIN: 8.96 / MAX: 27.17 MIN: 8.92 / MAX: 32.86 MIN: 8.97 / MAX: 27.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: blazeface 1 2 3 4 0.3555 0.711 1.0665 1.422 1.7775 SE +/- 0.00, N = 3 1.57 1.58 1.58 1.58 MIN: 1.5 / MAX: 2.04 MIN: 1.51 / MAX: 2.48 MIN: 1.51 / MAX: 2.31 MIN: 1.51 / MAX: 2.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
GravityMark Resolution: 1920 x 1200 - Renderer: Vulkan OpenBenchmarking.org Frames Per Second, More Is Better GravityMark 1.2 Resolution: 1920 x 1200 - Renderer: Vulkan 1 2 3 4 20 40 60 80 100 SE +/- 0.15, N = 3 100.2 99.7 99.4 100.1
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: googlenet 1 2 3 4 1.2848 2.5696 3.8544 5.1392 6.424 SE +/- 0.01, N = 3 5.69 5.71 5.70 5.68 MIN: 5.64 / MAX: 9.58 MIN: 5.65 / MAX: 9.16 MIN: 5.64 / MAX: 9.66 MIN: 5.64 / MAX: 6.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: vgg16 1 2 3 4 3 6 9 12 15 SE +/- 0.10, N = 3 9.97 9.90 10.15 9.72 MIN: 9.24 / MAX: 38.42 MIN: 9.18 / MAX: 25.61 MIN: 9.24 / MAX: 33.58 MIN: 9.22 / MAX: 20.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: resnet18 1 2 3 4 0.5445 1.089 1.6335 2.178 2.7225 SE +/- 0.01, N = 3 2.42 2.38 2.39 2.37 MIN: 2.24 / MAX: 4.75 MIN: 2.25 / MAX: 2.99 MIN: 2.24 / MAX: 8.24 MIN: 2.25 / MAX: 2.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: alexnet 1 2 3 4 0.891 1.782 2.673 3.564 4.455 SE +/- 0.01, N = 3 3.95 3.95 3.96 3.96 MIN: 3.87 / MAX: 4.96 MIN: 3.87 / MAX: 4.85 MIN: 3.88 / MAX: 4.94 MIN: 3.87 / MAX: 4.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: resnet50 1 2 3 4 2 4 6 8 10 SE +/- 0.01, N = 3 6.00 6.00 6.02 6.01 MIN: 5.96 / MAX: 6.87 MIN: 5.96 / MAX: 6.88 MIN: 5.97 / MAX: 9.98 MIN: 5.98 / MAX: 6.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 4 3 6 9 12 15 SE +/- 0.03, N = 3 11.45 11.45 11.37 11.42 MIN: 11.21 / MAX: 11.77 MIN: 11.15 / MAX: 16.14 MIN: 11.15 / MAX: 15.33 MIN: 11.23 / MAX: 11.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: squeezenet_ssd 1 2 3 4 1.2578 2.5156 3.7734 5.0312 6.289 SE +/- 0.02, N = 3 5.54 5.54 5.56 5.59 MIN: 5.06 / MAX: 8.46 MIN: 5.03 / MAX: 8.42 MIN: 5.03 / MAX: 10.66 MIN: 5.05 / MAX: 8.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: regnety_400m 1 2 3 4 1.1273 2.2546 3.3819 4.5092 5.6365 SE +/- 0.01, N = 3 4.98 4.99 5.00 5.01 MIN: 4.94 / MAX: 6.33 MIN: 4.95 / MAX: 6.39 MIN: 4.94 / MAX: 6.39 MIN: 4.96 / MAX: 6.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.8 Input: AUSURF112 1 2 3 4 120 240 360 480 600 SE +/- 2.87, N = 3 544.33 534.37 541.64 532.54 1. (F9X) gfortran options: -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time 1 2 3 4 14 28 42 56 70 SE +/- 0.08, N = 3 60.56 60.74 60.57 60.71 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Chimera 1080p 1 2 3 4 110 220 330 440 550 SE +/- 0.79, N = 3 498.18 503.15 503.72 496.97 MIN: 386.3 / MAX: 616.42 MIN: 388.29 / MAX: 622.26 MIN: 387.68 / MAX: 629.54 MIN: 385.12 / MAX: 617.82 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Summer Nature 4K 1 2 3 4 50 100 150 200 250 SE +/- 0.42, N = 3 193.64 209.38 207.39 205.40 MIN: 155.23 / MAX: 200.34 MIN: 173.79 / MAX: 219.59 MIN: 165.27 / MAX: 219.97 MIN: 168.49 / MAX: 215.3 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Summer Nature 1080p 1 2 3 4 100 200 300 400 500 SE +/- 0.50, N = 3 437.38 441.14 435.40 433.79 MIN: 373.76 / MAX: 477.61 MIN: 376.9 / MAX: 483.06 MIN: 364.25 / MAX: 476.12 MIN: 367.81 / MAX: 472.79 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Chimera 1080p 10-bit 1 2 3 4 80 160 240 320 400 SE +/- 0.27, N = 3 361.53 368.00 367.16 365.79 MIN: 277.3 / MAX: 484.09 MIN: 280.67 / MAX: 494.91 MIN: 279.54 / MAX: 495.81 MIN: 278.77 / MAX: 497.08 1. (CC) gcc options: -pthread
OpenVKL Benchmark: vklBenchmark ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.0 Benchmark: vklBenchmark ISPC 1 2 3 4 15 30 45 60 75 66 66 66 66 MIN: 6 / MAX: 896 MIN: 6 / MAX: 895 MIN: 6 / MAX: 896 MIN: 6 / MAX: 893
OpenVKL Benchmark: vklBenchmark Scalar OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.0 Benchmark: vklBenchmark Scalar 1 2 3 4 9 18 27 36 45 38 38 38 38 MIN: 3 / MAX: 795 MIN: 3 / MAX: 800 MIN: 3 / MAX: 796 MIN: 3 / MAX: 799
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene 1 2 3 4 20 40 60 80 100 SE +/- 0.02, N = 3 101.97 102.55 102.97 102.55 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
nginx Concurrent Requests: 1 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1 1 2 3 4 11K 22K 33K 44K 55K SE +/- 86.10, N = 3 51487.16 51412.14 51412.19 51364.07 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 20 1 2 3 4 60K 120K 180K 240K 300K SE +/- 364.62, N = 3 266318.67 265150.90 263918.89 262979.25 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 100 1 2 3 4 60K 120K 180K 240K 300K SE +/- 536.26, N = 3 276959.91 274678.21 276564.47 274091.83 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 200 1 2 3 4 60K 120K 180K 240K 300K SE +/- 453.03, N = 3 260145.65 259053.33 259157.05 256384.27 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 500 1 2 3 4 50K 100K 150K 200K 250K SE +/- 754.09, N = 3 242814.58 241096.60 242729.86 240391.97 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1000 1 2 3 4 50K 100K 150K 200K 250K SE +/- 502.05, N = 3 233289.60 232542.88 231805.36 229994.85 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Apache HTTP Server Concurrent Requests: 1 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1 1 2 3 4 2K 4K 6K 8K 10K SE +/- 101.52, N = 5 9819.79 9833.46 9868.89 10254.01 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 20 1 2 3 4 14K 28K 42K 56K 70K SE +/- 94.76, N = 3 66526.10 67436.40 67073.14 66776.03 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 100 1 2 3 4 14K 28K 42K 56K 70K SE +/- 95.02, N = 3 65160.84 64993.78 64743.08 64953.13 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 200 1 2 3 4 14K 28K 42K 56K 70K SE +/- 180.87, N = 3 63185.75 62187.44 62443.50 62016.76 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 500 1 2 3 4 13K 26K 39K 52K 65K SE +/- 93.60, N = 3 58422.38 58016.40 58124.42 58148.68 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1000 1 2 3 4 12K 24K 36K 48K 60K SE +/- 90.81, N = 3 55930.05 55861.84 55057.75 55570.45 1. (CC) gcc options: -shared -fPIC -O2 -pthread
GravityMark Resolution: 1920 x 1080 - Renderer: Vulkan OpenBenchmarking.org Frames Per Second, More Is Better GravityMark 1.2 Resolution: 1920 x 1080 - Renderer: Vulkan 1 2 3 4 20 40 60 80 100 SE +/- 0.28, N = 3 100.9 100.9 100.5 100.8
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.2.0 1 2 3 4 130K 260K 390K 520K 650K SE +/- 1493.71, N = 3 600652.06 602060.78 602741.90 594994.72 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Apache Cassandra Test: Reads OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Reads 1 2 3 4 30K 60K 90K 120K 150K SE +/- 5575.52, N = 9 123479 108039 106972 103621
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Writes 1 2 3 4 30K 60K 90K 120K 150K SE +/- 1139.42, N = 8 127426 137378 131031 137372
Apache Cassandra Test: Mixed 1:1 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Mixed 1:1 1 2 3 4 20K 40K 60K 80K 100K SE +/- 1119.54, N = 12 95847 109185 103053 102546
Apache Cassandra Test: Mixed 1:3 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Mixed 1:3 1 2 3 4 20K 40K 60K 80K 100K SE +/- 1985.99, N = 12 92247 104788 97929 101007
Phoronix Test Suite v10.8.4