nvidia-tests AMD FX-8350 Eight-Core testing with a ASUS SABERTOOTH 990FX R2.0 (2901 BIOS) and MSI NVIDIA GeForce GT 1030 2GB on Gentoo/Linux via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111177-TJ-NVIDIATES81&grr&export=pdf&rdt .
nvidia-tests Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Display Server Display Driver OpenCL Vulkan Compiler File-System Screen Resolution Desktop OpenGL Gentoo Gentoo_OC Win10_OC AMD FX-8350 Eight-Core @ 4.00GHz (4 Cores / 8 Threads) ASUS SABERTOOTH 990FX R2.0 (2901 BIOS) AMD RD9x0/RX980 16GB 1000GB CT1000MX500SSD1 + 3001GB TOSHIBA DT01ACA3 + 2000GB Seagate ST2000DL003-9VT1 + 500GB CT500MX500SSD1 MSI NVIDIA GeForce GT 1030 2GB Realtek ALC892 G27QC Realtek RTL8111/8168/8411 Gentoo/Linux 5.15.2-gentoo-x86_64 (x86_64) X Server 1.20.13 NVIDIA OpenCL 3.0 CUDA 11.5.100 1.2.186 GCC 11.2.0 + Clang 13.0.0 + LLVM 13.0.0 + CUDA 11.5 ext4 1280x1024 MSI NVIDIA GeForce GT 1030 2GB OC KDE Plasma 5.22.5 4.6.0 2560x1440 2 x 8192 MB 800MHz Kingston 466GB CT500MX5 00SSD1 SATA Disk + 2795GB TOSHIBA DT01ACA300 SATA Disk + 932GB CT1000MX 500SSD1 SATA Disk + 1863GB ST2000DL 003-9VT166 SATA Disk NVIDIA GeForce GT 1030 2GB NVIDIA HD Audio + Realtek HD Audio + NVIDIA Virtual Audio Device (Wave Extensible) (WDM) + HD Webcam C510 G7QC VirtualBox Host-Only + Symantec TAP Driver + RAS Async + Bluetooth Device (Personal Area ) + Bluetooth Device (Personal Area ) #2 Microsoft Windows 10 Pro Build 19043 10.0 (x86_64) 496.49 (30.0.14.9649) OpenCL 3.0 CUDA 11.5.76 + OpenCL 1.2 AMD-APP (937.2) GCC 8.3.0 + Clang 6.0.0 NTFS OpenBenchmarking.org Kernel Details - Gentoo, Gentoo_OC: Transparent Huge Pages: madvise Compiler Details - Gentoo, Gentoo_OC: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/11.2.0 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/11.2.0 --disable-esp --disable-fixed-point --disable-libada --disable-libssp --disable-libunwind-exceptions --disable-libvtv --disable-systemtap --disable-valgrind-annotations --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-languages=c,c++,fortran --enable-libgomp --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/11.2.0/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/11.2.0/man --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/11.2.0/python --without-isl --without-zstd Processor Details - Gentoo: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x6000852 - Gentoo_OC: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x6000852 - Win10_OC: CPU Microcode: 5208000600000000 Graphics Details - Gentoo: BAR1 / Visible vRAM Size: 256 MiB Security Details - Gentoo: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected - Gentoo_OC: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected - Win10_OC: __user pointer sanitization: Disabled + Retpoline: Full + IBPB: Always Environment Details - Win10_OC: windows_tracing_flags=3
nvidia-tests realsr-ncnn: 4x - Yes ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet lczero: OpenCL lczero: Eigen luxcorerender: Danish Mood - GPU lczero: CUDA + cuDNN lczero: BLAS ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet luxcorerender: LuxCore Benchmark - CPU luxcorerender: Danish Mood - CPU fahbench: luxcorerender: Orange Juice - CPU luxcorerender: LuxCore Benchmark - GPU luxcorerender: DLSC - CPU octanebench: Total Score indigobench: OpenCL GPU - Bedroom vkpeak: int32-vec4 vkpeak: int32-scalar vkpeak: fp64-vec4 vkpeak: fp64-scalar vkpeak: fp32-vec4 vkpeak: fp32-scalar luxcorerender: DLSC - GPU indigobench: OpenCL GPU - Supercar indigobench: CPU - Supercar realsr-ncnn: 4x - No luxcorerender: Orange Juice - GPU indigobench: CPU - Bedroom luxcorerender: Rainbow Colors and Prism - GPU luxcorerender: Rainbow Colors and Prism - CPU neatbench: CPU cl-mem: Copy cl-mem: Write cl-mem: Read vkresample: 2x - Double waifu2x-ncnn: 2x - 3 - Yes vkresample: 2x - Single waifu2x-ncnn: 2x - 3 - No clpeak: Double-Precision Double clpeak: Transfer Bandwidth enqueueWriteBuffer clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Integer Compute INT clpeak: Single-Precision Float neatbench: GPU clpeak: Global Memory Bandwidth clpeak: Kernel Latency Gentoo Gentoo_OC Win10_OC Gentoo2 7.83 24.36 25.50 19.87 12.51 71.55 14.66 2.38 11.03 6.18 4.42 6.89 5.94 21.12 0.08 49.26 59.62 123.20 118.69 35.84 58.36 400.80 55.53 3.83 23.55 16.06 11.46 14.71 18.16 60.06 0.18 0.15 25.7380 0.76 0.11 0.52 0.3 0.33 1.68 2.11 37.3 39.4 40.7 41.27 1.49 1.69 353.72 1139.82 39.52 6.43 564.855 6.86 22.96 35.78 22.89 15.96 11.27 58.08 13.49 2.35 9.60 5.65 4.09 5.86 5.06 21.11 2487 153 0.09 6201 31.6520 0.13 24.612607 1.095 452.10 487.20 46.01 46.02 1431.54 1429.71 0.34 3.122 1.226 74.730 0.39 0.487 2.11 5.87 48.8 51.3 54.3 4.917 28.687 10.009 4.899 45.57 1.49 1.69 385.06 1243.14 1030 52.31 6.37 563.714 432 0.06 682 0.16 0.11 26.4165 0.68 0.11 0.48 24.583162 1.103 445.74 480.54 45.4 45.40 1411.57 1410.30 0.32 3.161 1.144 73.507 0.38 0.452 2.08 1.33 5.86 0.145 28.643 159.514 5.299 1030 37.70 OpenBenchmarking.org
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Gentoo_OC Win10_OC 120 240 360 480 600 SE +/- 0.20, N = 3 SE +/- 0.05, N = 3 564.86 563.71
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: regnety_400m Gentoo Gentoo_OC 2 4 6 8 10 SE +/- 0.12, N = 15 SE +/- 0.15, N = 12 7.83 6.86 MIN: 6.27 / MAX: 16.64 MIN: 5.7 / MAX: 11.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: squeezenet_ssd Gentoo Gentoo_OC 6 12 18 24 30 SE +/- 0.24, N = 15 SE +/- 0.18, N = 12 24.36 22.96 MIN: 18.49 / MAX: 37.19 MIN: 16.96 / MAX: 33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: yolov4-tiny Gentoo2 Gentoo_OC 9 18 27 36 45 SE +/- 0.63, N = 15 SE +/- 1.03, N = 12 37.70 35.78 MIN: 28.5 / MAX: 63 MIN: 26.39 / MAX: 56.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: resnet50 Gentoo Gentoo_OC 6 12 18 24 30 SE +/- 0.05, N = 15 SE +/- 0.07, N = 12 25.50 22.89 MIN: 22.49 / MAX: 36.56 MIN: 19.87 / MAX: 37.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: alexnet Gentoo Gentoo_OC 5 10 15 20 25 SE +/- 0.03, N = 15 SE +/- 0.05, N = 12 19.87 15.96 MIN: 18.06 / MAX: 30.37 MIN: 14.07 / MAX: 21.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: resnet18 Gentoo Gentoo_OC 3 6 9 12 15 SE +/- 0.09, N = 15 SE +/- 0.13, N = 12 12.51 11.27 MIN: 10.3 / MAX: 21.73 MIN: 9.16 / MAX: 33.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: vgg16 Gentoo Gentoo_OC 16 32 48 64 80 SE +/- 0.10, N = 15 SE +/- 0.06, N = 12 71.55 58.08 MIN: 68.13 / MAX: 94.57 MIN: 54.62 / MAX: 81.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: googlenet Gentoo Gentoo_OC 4 8 12 16 20 SE +/- 0.11, N = 15 SE +/- 0.12, N = 12 14.66 13.49 MIN: 11.86 / MAX: 22.78 MIN: 10.68 / MAX: 20.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: blazeface Gentoo Gentoo_OC 0.5355 1.071 1.6065 2.142 2.6775 SE +/- 0.17, N = 15 SE +/- 0.20, N = 12 2.38 2.35 MIN: 1.7 / MAX: 8.13 MIN: 1.63 / MAX: 9.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: efficientnet-b0 Gentoo Gentoo_OC 3 6 9 12 15 SE +/- 0.09, N = 15 SE +/- 0.16, N = 12 11.03 9.60 MIN: 8.25 / MAX: 26.42 MIN: 7.33 / MAX: 15.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: mnasnet Gentoo Gentoo_OC 2 4 6 8 10 SE +/- 0.13, N = 15 SE +/- 0.15, N = 12 6.18 5.65 MIN: 5.08 / MAX: 10.21 MIN: 4.55 / MAX: 11.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: shufflenet-v2 Gentoo Gentoo_OC 0.9945 1.989 2.9835 3.978 4.9725 SE +/- 0.11, N = 15 SE +/- 0.12, N = 12 4.42 4.09 MIN: 3.91 / MAX: 9.88 MIN: 3.64 / MAX: 7.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Gentoo Gentoo_OC 2 4 6 8 10 SE +/- 0.11, N = 15 SE +/- 0.15, N = 12 6.89 5.86 MIN: 5.58 / MAX: 12.25 MIN: 4.99 / MAX: 10.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Gentoo Gentoo_OC 1.3365 2.673 4.0095 5.346 6.6825 SE +/- 0.10, N = 15 SE +/- 0.14, N = 12 5.94 5.06 MIN: 4.9 / MAX: 11.53 MIN: 4.38 / MAX: 8.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: mobilenet Gentoo Gentoo_OC 5 10 15 20 25 SE +/- 0.41, N = 15 SE +/- 0.70, N = 12 21.12 21.11 MIN: 16.16 / MAX: 40.01 MIN: 14.97 / MAX: 34.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: OpenCL Gentoo_OC 500 1000 1500 2000 2500 SE +/- 10.37, N = 3 2487 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: Eigen Gentoo_OC Win10_OC 90 180 270 360 450 SE +/- 0.67, N = 3 153 432 1. (CXX) g++ options: -flto -pthread
LuxCoreRender Scene: Danish Mood - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Danish Mood - Acceleration: GPU Gentoo Gentoo_OC Win10_OC 0.0203 0.0406 0.0609 0.0812 0.1015 SE +/- 0.00, N = 15 SE +/- 0.00, N = 15 SE +/- 0.01, N = 15 0.08 0.09 0.06 MAX: 0.17 MAX: 0.2 MAX: 0.16
LeelaChessZero Backend: CUDA + cuDNN OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: CUDA + cuDNN Gentoo_OC 1300 2600 3900 5200 6500 SE +/- 21.79, N = 3 6201 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: BLAS Win10_OC 150 300 450 600 750 SE +/- 2.96, N = 3 682
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m Gentoo 11 22 33 44 55 SE +/- 0.06, N = 3 49.26 MIN: 48.84 / MAX: 57.16 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd Gentoo 13 26 39 52 65 SE +/- 0.22, N = 3 59.62 MIN: 58.66 / MAX: 66.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny Gentoo 30 60 90 120 150 SE +/- 0.07, N = 3 123.20 MIN: 121.61 / MAX: 130.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 Gentoo 30 60 90 120 150 SE +/- 0.17, N = 3 118.69 MIN: 117.93 / MAX: 125.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet Gentoo 8 16 24 32 40 SE +/- 0.03, N = 3 35.84 MIN: 35.14 / MAX: 41.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 Gentoo 13 26 39 52 65 SE +/- 0.04, N = 3 58.36 MIN: 57.89 / MAX: 64.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 Gentoo 90 180 270 360 450 SE +/- 0.10, N = 3 400.80 MIN: 398.23 / MAX: 409.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet Gentoo 12 24 36 48 60 SE +/- 0.05, N = 3 55.53 MIN: 54.84 / MAX: 62.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface Gentoo 0.8618 1.7236 2.5854 3.4472 4.309 SE +/- 0.01, N = 3 3.83 MIN: 3.71 / MAX: 7.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 Gentoo 6 12 18 24 30 SE +/- 0.12, N = 3 23.55 MIN: 22.86 / MAX: 30.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet Gentoo 4 8 12 16 20 SE +/- 0.01, N = 3 16.06 MIN: 15.83 / MAX: 22.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 Gentoo 3 6 9 12 15 SE +/- 0.05, N = 3 11.46 MIN: 11.21 / MAX: 18.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 Gentoo 4 8 12 16 20 SE +/- 0.05, N = 3 14.71 MIN: 14.46 / MAX: 21.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 Gentoo 4 8 12 16 20 SE +/- 0.10, N = 3 18.16 MIN: 17.53 / MAX: 24.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet Gentoo 13 26 39 52 65 SE +/- 0.13, N = 3 60.06 MIN: 59.06 / MAX: 66.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: LuxCore Benchmark - Acceleration: CPU Gentoo Win10_OC 0.0405 0.081 0.1215 0.162 0.2025 SE +/- 0.00, N = 15 SE +/- 0.00, N = 15 0.18 0.16 MIN: 0.03 / MAX: 0.32 MIN: 0.1 / MAX: 0.21
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Danish Mood - Acceleration: CPU Gentoo Win10_OC 0.0338 0.0676 0.1014 0.1352 0.169 SE +/- 0.01, N = 15 SE +/- 0.01, N = 12 0.15 0.11 MIN: 0.03 / MAX: 0.32 MIN: 0.02 / MAX: 0.23
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 Gentoo Gentoo_OC Win10_OC 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.23, N = 3 25.74 31.65 26.42
LuxCoreRender Scene: Orange Juice - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Orange Juice - Acceleration: CPU Gentoo Win10_OC 0.171 0.342 0.513 0.684 0.855 SE +/- 0.01, N = 15 SE +/- 0.00, N = 3 0.76 0.68 MIN: 0.62 / MAX: 0.83 MIN: 0.67
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: LuxCore Benchmark - Acceleration: GPU Gentoo Gentoo_OC Win10_OC 0.0293 0.0586 0.0879 0.1172 0.1465 SE +/- 0.00, N = 13 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.11 0.13 0.11 MAX: 0.18 MAX: 0.21 MIN: 0.06 / MAX: 0.16
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: DLSC - Acceleration: CPU Gentoo Win10_OC 0.117 0.234 0.351 0.468 0.585 SE +/- 0.01, N = 15 SE +/- 0.00, N = 3 0.52 0.48 MIN: 0.47 / MAX: 0.54
OctaneBench Total Score OpenBenchmarking.org Score, More Is Better OctaneBench 2020.1 Total Score Gentoo_OC Win10_OC 6 12 18 24 30 24.61 24.58
IndigoBench Acceleration: OpenCL GPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom Gentoo_OC Win10_OC 0.2482 0.4964 0.7446 0.9928 1.241 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.095 1.103
vkpeak int32-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-vec4 Gentoo_OC Win10_OC 100 200 300 400 500 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 452.10 445.74
vkpeak int32-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-scalar Gentoo_OC Win10_OC 110 220 330 440 550 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 487.20 480.54
vkpeak fp64-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-vec4 Gentoo_OC Win10_OC 10 20 30 40 50 SE +/- 0.00, N = 3 46.01 45.40
vkpeak fp64-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-scalar Gentoo_OC Win10_OC 10 20 30 40 50 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 46.02 45.40
vkpeak fp32-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-vec4 Gentoo_OC Win10_OC 300 600 900 1200 1500 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 1431.54 1411.57
vkpeak fp32-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-scalar Gentoo_OC Win10_OC 300 600 900 1200 1500 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 1429.71 1410.30
LuxCoreRender Scene: DLSC - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: DLSC - Acceleration: GPU Gentoo Gentoo_OC Win10_OC 0.0765 0.153 0.2295 0.306 0.3825 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.30 0.34 0.32 MIN: 0.24 / MAX: 0.31 MIN: 0.29 / MAX: 0.35 MIN: 0.31
IndigoBench Acceleration: OpenCL GPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar Gentoo_OC Win10_OC 0.7112 1.4224 2.1336 2.8448 3.556 SE +/- 0.016, N = 3 SE +/- 0.003, N = 3 3.122 3.161
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar Gentoo_OC Win10_OC 0.2759 0.5518 0.8277 1.1036 1.3795 SE +/- 0.003, N = 3 SE +/- 0.013, N = 4 1.226 1.144
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Gentoo_OC Win10_OC 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 74.73 73.51
LuxCoreRender Scene: Orange Juice - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Orange Juice - Acceleration: GPU Gentoo Gentoo_OC Win10_OC 0.0878 0.1756 0.2634 0.3512 0.439 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.33 0.39 0.38 MIN: 0.02 / MAX: 0.4 MIN: 0.02 / MAX: 0.44 MIN: 0.36 / MAX: 0.39
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom Gentoo_OC Win10_OC 0.1096 0.2192 0.3288 0.4384 0.548 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.487 0.452
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Rainbow Colors and Prism - Acceleration: GPU Gentoo Gentoo_OC Win10_OC 0.4748 0.9496 1.4244 1.8992 2.374 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 1.68 2.11 2.08 MIN: 0.74 / MAX: 1.8 MIN: 0.9 / MAX: 2.25 MIN: 2.03 / MAX: 2.11
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Rainbow Colors and Prism - Acceleration: CPU Gentoo Win10_OC 0.4748 0.9496 1.4244 1.8992 2.374 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 2.11 1.33 MIN: 2.07 / MAX: 2.14 MIN: 1.32 / MAX: 1.36
NeatBench Acceleration: CPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: CPU Gentoo_OC Win10_OC 1.3208 2.6416 3.9624 5.2832 6.604 SE +/- 0.55, N = 16 SE +/- 0.55, N = 16 5.87 5.86
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Gentoo Gentoo_OC 11 22 33 44 55 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 37.3 48.8 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Gentoo Gentoo_OC 12 24 36 48 60 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 39.4 51.3 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Gentoo Gentoo_OC 12 24 36 48 60 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 40.7 54.3 1. (CC) gcc options: -O2 -flto -lOpenCL
VkResample Upscale: 2x - Precision: Double OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Double Gentoo_OC Win10_OC 1.1063 2.2126 3.3189 4.4252 5.5315 SE +/- 0.365, N = 15 SE +/- 0.109, N = 12 4.917 0.145 1. (CXX) g++ options: -O3 -pthread
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Gentoo_OC Win10_OC 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 28.69 28.64
VkResample Upscale: 2x - Precision: Single OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Single Gentoo_OC Win10_OC 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 10.01 159.51 1. (CXX) g++ options: -O3 -pthread
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Gentoo_OC Win10_OC 1.1923 2.3846 3.5769 4.7692 5.9615 SE +/- 0.059, N = 8 SE +/- 0.047, N = 13 4.899 5.299
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Gentoo Gentoo_OC 10 20 30 40 50 SE +/- 0.17, N = 3 SE +/- 0.44, N = 3 41.27 45.57 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Gentoo Gentoo_OC 0.3353 0.6706 1.0059 1.3412 1.6765 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.49 1.49 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Gentoo Gentoo_OC 0.3803 0.7606 1.1409 1.5212 1.9015 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.69 1.69 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT Gentoo Gentoo_OC 80 160 240 320 400 SE +/- 6.73, N = 15 SE +/- 9.00, N = 15 353.72 385.06 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Gentoo Gentoo_OC 300 600 900 1200 1500 SE +/- 26.36, N = 15 SE +/- 40.31, N = 15 1139.82 1243.14 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU Gentoo_OC Win10_OC 200 400 600 800 1000 1030 1030
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Gentoo Gentoo_OC 12 24 36 48 60 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 39.52 52.31 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Gentoo Gentoo_OC 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 6.43 6.37 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.5