nvidia-tests

AMD FX-8350 Eight-Core testing with a ASUS SABERTOOTH 990FX R2.0 (2901 BIOS) and MSI NVIDIA GeForce GT 1030 2GB on Gentoo/Linux via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2111171-TJ-NVIDIATES55&sor.

nvidia-tests ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDisplay ServerDisplay DriverOpenCLVulkanCompilerFile-SystemScreen ResolutionDesktopOpenGLGentooGentoo_OCWin10_OCAMD FX-8350 Eight-Core @ 4.00GHz (4 Cores / 8 Threads)ASUS SABERTOOTH 990FX R2.0 (2901 BIOS)AMD RD9x0/RX98016GB1000GB CT1000MX500SSD1 + 3001GB TOSHIBA DT01ACA3 + 2000GB Seagate ST2000DL003-9VT1 + 500GB CT500MX500SSD1MSI NVIDIA GeForce GT 1030 2GBRealtek ALC892G27QCRealtek RTL8111/8168/8411Gentoo/Linux5.15.2-gentoo-x86_64 (x86_64)X Server 1.20.13NVIDIAOpenCL 3.0 CUDA 11.5.1001.2.186GCC 11.2.0 + Clang 13.0.0 + LLVM 13.0.0 + CUDA 11.5ext41280x1024MSI NVIDIA GeForce GT 1030 2GB OCKDE Plasma 5.22.54.6.02560x14402 x 8192 MB 800MHz Kingston466GB CT500MX5 00SSD1 SATA Disk + 2795GB TOSHIBA DT01ACA300 SATA Disk + 932GB CT1000MX 500SSD1 SATA Disk + 1863GB ST2000DL 003-9VT166 SATA DiskNVIDIA GeForce GT 1030 2GBNVIDIA HD Audio + Realtek HD Audio + NVIDIA Virtual Audio Device (Wave Extensible) (WDM) + HD Webcam C510G7QCVirtualBox Host-Only + Symantec TAP Driver + RAS Async + Bluetooth Device (Personal Area ) + Bluetooth Device (Personal Area ) #2Microsoft Windows 10 Pro Build 1904310.0 (x86_64)496.49 (30.0.14.9649)OpenCL 3.0 CUDA 11.5.76 + OpenCL 1.2 AMD-APP (937.2)GCC 8.3.0 + Clang 6.0.0NTFSOpenBenchmarking.orgKernel Details- Gentoo, Gentoo_OC: Transparent Huge Pages: madviseCompiler Details- Gentoo, Gentoo_OC: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/11.2.0 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/11.2.0 --disable-esp --disable-fixed-point --disable-libada --disable-libssp --disable-libunwind-exceptions --disable-libvtv --disable-systemtap --disable-valgrind-annotations --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-languages=c,c++,fortran --enable-libgomp --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/11.2.0/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/11.2.0/man --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/11.2.0/python --without-isl --without-zstd Processor Details- Gentoo: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x6000852- Gentoo_OC: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x6000852- Win10_OC: CPU Microcode: 5208000600000000Graphics Details- Gentoo: BAR1 / Visible vRAM Size: 256 MiBSecurity Details- Gentoo: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected- Gentoo_OC: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected- Win10_OC: __user pointer sanitization: Disabled + Retpoline: Full + IBPB: AlwaysEnvironment Details- Win10_OC: windows_tracing_flags=3

nvidia-tests cl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferfahbench: luxcorerender: DLSC - GPUluxcorerender: Danish Mood - GPUluxcorerender: Orange Juice - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Rainbow Colors and Prism - CPUncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mindigobench: CPU - Bedroomindigobench: CPU - Supercarindigobench: OpenCL GPU - Bedroomindigobench: OpenCL GPU - Supercarlczero: BLASlczero: Eigenlczero: OpenCLlczero: CUDA + cuDNNneatbench: CPUneatbench: GPUoctanebench: Total Scorerealsr-ncnn: 4x - Norealsr-ncnn: 4x - Yesvkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp64-scalarvkpeak: fp64-vec4vkpeak: int32-scalarvkpeak: int32-vec4vkresample: 2x - Doublevkresample: 2x - Singlewaifu2x-ncnn: 2x - 3 - Nowaifu2x-ncnn: 2x - 3 - YesGentooGentoo_OCWin10_OC37.340.739.46.43353.721139.8241.2739.521.691.4925.73800.30.080.330.111.680.520.150.760.182.1160.0618.1614.7111.4616.0623.553.8355.53400.8058.3635.84118.69123.2059.6249.2621.125.946.894.426.1811.032.3814.6671.5512.5119.8725.5037.7024.367.8348.854.351.36.37385.061243.1445.5752.311.691.4931.65200.340.090.390.132.1121.115.065.864.095.659.602.3513.4958.0811.2715.9622.8935.7822.966.860.4871.2261.0953.122153248762015.87103024.61260774.730564.8551429.711431.5446.0246.01487.20452.104.91710.0094.89928.68726.41650.320.060.380.112.080.480.110.680.161.330.4521.1441.1033.1616824325.86103024.58316273.507563.7141410.301411.5745.4045.4480.54445.740.145159.5145.29928.643OpenBenchmarking.org

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGentoo_OCGentoo1122334455SE +/- 0.00, N = 3SE +/- 0.00, N = 348.837.31. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGentoo_OCGentoo1224364860SE +/- 0.00, N = 3SE +/- 0.00, N = 354.340.71. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGentoo_OCGentoo1224364860SE +/- 0.00, N = 3SE +/- 0.00, N = 351.339.41. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyGentoo_OCGentoo246810SE +/- 0.01, N = 3SE +/- 0.03, N = 36.376.431. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGentoo_OCGentoo80160240320400SE +/- 9.00, N = 15SE +/- 6.73, N = 15385.06353.721. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGentoo_OCGentoo30060090012001500SE +/- 40.31, N = 15SE +/- 26.36, N = 151243.141139.821. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGentoo_OCGentoo1020304050SE +/- 0.44, N = 3SE +/- 0.17, N = 345.5741.271. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGentoo_OCGentoo1224364860SE +/- 0.14, N = 3SE +/- 0.10, N = 352.3139.521. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferGentoo_OCGentoo0.38030.76061.14091.52121.9015SE +/- 0.00, N = 3SE +/- 0.00, N = 31.691.691. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferGentoo_OCGentoo0.33530.67061.00591.34121.6765SE +/- 0.00, N = 3SE +/- 0.00, N = 31.491.491. (CXX) g++ options: -O3 -rdynamic -lOpenCL

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2Gentoo_OCWin10_OCGentoo714212835SE +/- 0.02, N = 3SE +/- 0.23, N = 3SE +/- 0.01, N = 331.6526.4225.74

LuxCoreRender

Scene: DLSC - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: GPUGentoo_OCWin10_OCGentoo0.07650.1530.22950.3060.3825SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.340.320.30MIN: 0.29 / MAX: 0.35MIN: 0.31MIN: 0.24 / MAX: 0.31

LuxCoreRender

Scene: Danish Mood - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: GPUGentoo_OCGentooWin10_OC0.02030.04060.06090.08120.1015SE +/- 0.00, N = 15SE +/- 0.00, N = 15SE +/- 0.01, N = 150.090.080.06MAX: 0.2MAX: 0.17MAX: 0.16

LuxCoreRender

Scene: Orange Juice - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: GPUGentoo_OCWin10_OCGentoo0.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.380.33MIN: 0.02 / MAX: 0.44MIN: 0.36 / MAX: 0.39MIN: 0.02 / MAX: 0.4

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: GPUGentoo_OCWin10_OCGentoo0.02930.05860.08790.11720.1465SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 130.130.110.11MAX: 0.21MIN: 0.06 / MAX: 0.16MAX: 0.18

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: GPUGentoo_OCWin10_OCGentoo0.47480.94961.42441.89922.374SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.112.081.68MIN: 0.9 / MAX: 2.25MIN: 2.03 / MAX: 2.11MIN: 0.74 / MAX: 1.8

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: CPUGentooWin10_OC0.1170.2340.3510.4680.585SE +/- 0.01, N = 15SE +/- 0.00, N = 30.520.48MIN: 0.47 / MAX: 0.54

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: CPUGentooWin10_OC0.03380.06760.10140.13520.169SE +/- 0.01, N = 15SE +/- 0.01, N = 120.150.11MIN: 0.03 / MAX: 0.32MIN: 0.02 / MAX: 0.23

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: CPUGentooWin10_OC0.1710.3420.5130.6840.855SE +/- 0.01, N = 15SE +/- 0.00, N = 30.760.68MIN: 0.62 / MAX: 0.83MIN: 0.67

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: CPUGentooWin10_OC0.04050.0810.12150.1620.2025SE +/- 0.00, N = 15SE +/- 0.00, N = 150.180.16MIN: 0.03 / MAX: 0.32MIN: 0.1 / MAX: 0.21

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: CPUGentooWin10_OC0.47480.94961.42441.89922.374SE +/- 0.02, N = 3SE +/- 0.01, N = 32.111.33MIN: 2.07 / MAX: 2.14MIN: 1.32 / MAX: 1.36

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetGentoo1326395265SE +/- 0.13, N = 360.06MIN: 59.06 / MAX: 66.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v2-v2 - Model: mobilenet-v2Gentoo48121620SE +/- 0.10, N = 318.16MIN: 17.53 / MAX: 24.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v3-v3 - Model: mobilenet-v3Gentoo48121620SE +/- 0.05, N = 314.71MIN: 14.46 / MAX: 21.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2Gentoo3691215SE +/- 0.05, N = 311.46MIN: 11.21 / MAX: 18.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetGentoo48121620SE +/- 0.01, N = 316.06MIN: 15.83 / MAX: 22.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0Gentoo612182430SE +/- 0.12, N = 323.55MIN: 22.86 / MAX: 30.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceGentoo0.86181.72362.58543.44724.309SE +/- 0.01, N = 33.83MIN: 3.71 / MAX: 7.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetGentoo1224364860SE +/- 0.05, N = 355.53MIN: 54.84 / MAX: 62.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: vgg16Gentoo90180270360450SE +/- 0.10, N = 3400.80MIN: 398.23 / MAX: 409.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet18Gentoo1326395265SE +/- 0.04, N = 358.36MIN: 57.89 / MAX: 64.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetGentoo816243240SE +/- 0.03, N = 335.84MIN: 35.14 / MAX: 41.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50Gentoo306090120150SE +/- 0.17, N = 3118.69MIN: 117.93 / MAX: 125.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyGentoo306090120150SE +/- 0.07, N = 3123.20MIN: 121.61 / MAX: 130.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdGentoo1326395265SE +/- 0.22, N = 359.62MIN: 58.66 / MAX: 66.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mGentoo1122334455SE +/- 0.06, N = 349.26MIN: 48.84 / MAX: 57.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mobilenetGentoo_OCGentoo510152025SE +/- 0.70, N = 12SE +/- 0.41, N = 1521.1121.12MIN: 14.97 / MAX: 34.03MIN: 16.16 / MAX: 40.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2Gentoo_OCGentoo1.33652.6734.00955.3466.6825SE +/- 0.14, N = 12SE +/- 0.10, N = 155.065.94MIN: 4.38 / MAX: 8.96MIN: 4.9 / MAX: 11.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3Gentoo_OCGentoo246810SE +/- 0.15, N = 12SE +/- 0.11, N = 155.866.89MIN: 4.99 / MAX: 10.61MIN: 5.58 / MAX: 12.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: shufflenet-v2Gentoo_OCGentoo0.99451.9892.98353.9784.9725SE +/- 0.12, N = 12SE +/- 0.11, N = 154.094.42MIN: 3.64 / MAX: 7.82MIN: 3.91 / MAX: 9.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mnasnetGentoo_OCGentoo246810SE +/- 0.15, N = 12SE +/- 0.13, N = 155.656.18MIN: 4.55 / MAX: 11.7MIN: 5.08 / MAX: 10.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: efficientnet-b0Gentoo_OCGentoo3691215SE +/- 0.16, N = 12SE +/- 0.09, N = 159.6011.03MIN: 7.33 / MAX: 15.26MIN: 8.25 / MAX: 26.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: blazefaceGentoo_OCGentoo0.53551.0711.60652.1422.6775SE +/- 0.20, N = 12SE +/- 0.17, N = 152.352.38MIN: 1.63 / MAX: 9.35MIN: 1.7 / MAX: 8.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: googlenetGentoo_OCGentoo48121620SE +/- 0.12, N = 12SE +/- 0.11, N = 1513.4914.66MIN: 10.68 / MAX: 20.26MIN: 11.86 / MAX: 22.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: vgg16Gentoo_OCGentoo1632486480SE +/- 0.06, N = 12SE +/- 0.10, N = 1558.0871.55MIN: 54.62 / MAX: 81.74MIN: 68.13 / MAX: 94.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet18Gentoo_OCGentoo3691215SE +/- 0.13, N = 12SE +/- 0.09, N = 1511.2712.51MIN: 9.16 / MAX: 33.12MIN: 10.3 / MAX: 21.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: alexnetGentoo_OCGentoo510152025SE +/- 0.05, N = 12SE +/- 0.03, N = 1515.9619.87MIN: 14.07 / MAX: 21.71MIN: 18.06 / MAX: 30.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet50Gentoo_OCGentoo612182430SE +/- 0.07, N = 12SE +/- 0.05, N = 1522.8925.50MIN: 19.87 / MAX: 37.85MIN: 22.49 / MAX: 36.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: yolov4-tinyGentoo_OCGentoo918273645SE +/- 1.03, N = 12SE +/- 0.63, N = 1535.7837.70MIN: 26.39 / MAX: 56.26MIN: 28.5 / MAX: 631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: squeezenet_ssdGentoo_OCGentoo612182430SE +/- 0.18, N = 12SE +/- 0.24, N = 1522.9624.36MIN: 16.96 / MAX: 33MIN: 18.49 / MAX: 37.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: regnety_400mGentoo_OCGentoo246810SE +/- 0.15, N = 12SE +/- 0.12, N = 156.867.83MIN: 5.7 / MAX: 11.24MIN: 6.27 / MAX: 16.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomGentoo_OCWin10_OC0.10960.21920.32880.43840.548SE +/- 0.002, N = 3SE +/- 0.002, N = 30.4870.452

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarGentoo_OCWin10_OC0.27590.55180.82771.10361.3795SE +/- 0.003, N = 3SE +/- 0.013, N = 41.2261.144

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomWin10_OCGentoo_OC0.24820.49640.74460.99281.241SE +/- 0.001, N = 3SE +/- 0.002, N = 31.1031.095

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarWin10_OCGentoo_OC0.71121.42242.13362.84483.556SE +/- 0.003, N = 3SE +/- 0.016, N = 33.1613.122

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASWin10_OC150300450600750SE +/- 2.96, N = 3682

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenWin10_OCGentoo_OC90180270360450SE +/- 0.67, N = 34321531. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLGentoo_OC5001000150020002500SE +/- 10.37, N = 324871. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: CUDA + cuDNN

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: CUDA + cuDNNGentoo_OC13002600390052006500SE +/- 21.79, N = 362011. (CXX) g++ options: -flto -pthread

NeatBench

Acceleration: CPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: CPUGentoo_OCWin10_OC1.32082.64163.96245.28326.604SE +/- 0.55, N = 16SE +/- 0.55, N = 165.875.86

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPUGentoo_OCWin10_OC200400600800100010301030

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total ScoreGentoo_OCWin10_OC61218243024.6124.58

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoWin10_OCGentoo_OC20406080100SE +/- 0.14, N = 3SE +/- 0.01, N = 373.5174.73

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesWin10_OCGentoo_OC120240360480600SE +/- 0.05, N = 3SE +/- 0.20, N = 3563.71564.86

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarGentoo_OCWin10_OC30060090012001500SE +/- 0.03, N = 3SE +/- 0.01, N = 31429.711410.30

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4Gentoo_OCWin10_OC30060090012001500SE +/- 0.09, N = 3SE +/- 0.06, N = 31431.541411.57

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarGentoo_OCWin10_OC1020304050SE +/- 0.00, N = 3SE +/- 0.01, N = 346.0245.40

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4Gentoo_OCWin10_OC1020304050SE +/- 0.00, N = 346.0145.40

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarGentoo_OCWin10_OC110220330440550SE +/- 0.07, N = 3SE +/- 0.06, N = 3487.20480.54

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4Gentoo_OCWin10_OC100200300400500SE +/- 0.01, N = 3SE +/- 0.02, N = 3452.10445.74

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleWin10_OCGentoo_OC1.10632.21263.31894.42525.5315SE +/- 0.109, N = 12SE +/- 0.365, N = 150.1454.9171. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleGentoo_OCWin10_OC4080120160200SE +/- 0.00, N = 3SE +/- 0.06, N = 310.01159.511. (CXX) g++ options: -O3 -pthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: NoGentoo_OCWin10_OC1.19232.38463.57694.76925.9615SE +/- 0.059, N = 8SE +/- 0.047, N = 134.8995.299

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesWin10_OCGentoo_OC714212835SE +/- 0.09, N = 3SE +/- 0.03, N = 328.6428.69


Phoronix Test Suite v10.8.4