2024-10-17-ncnn-gpu

AMD Ryzen Threadripper 3960X 24-Core testing with a ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS) and AMD Radeon RX 6900 XT 16GB on Fedora Linux 40 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2410170-ENB-2024101798
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX
October 17
  6 Hours, 58 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


2024-10-17-ncnn-gpuOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS)AMD Starship/Matisse128GB2 x 2000GB Sabrent Rocket QAMD Radeon RX 6900 XT 16GBAMD Navi 21/23MPCP28UHD + MP MonitorAquantia AQtion AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200Fedora Linux 406.11.3-200.fc40.x86_64 (x86_64)GNOME Shell 46.5X Server 1.20.14 + Wayland4.6 Mesa 24.1.7 (LLVM 18.1.6 DRM 3.59)OpenCL 2.1 AMD-APP (3614.0) + OpenCL 3.0 PoCL 5.0 Linux RELOC SPIR LLVM 17.0.6 SLEEF DISTRO POCL_DEBUGGCC 14.2.1 20240912 + Clang 18.1.8 + LLVM 18.1.8btrfs7680x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen Resolution2024-10-17-ncnn-gpu BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x830107c- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

2024-10-17-ncnn-gpuncnn: Vulkan GPU - FastestDetncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX9.7173.1322.2818.5832.2527.6411.7816.0143.2825.043.709.276.458.786.197.7118.78OpenBenchmarking.org

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: FastestDetAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX3691215SE +/- 0.35, N = 129.71MIN: 8.07 / MAX: 424.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vision_transformerAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1632486480SE +/- 1.76, N = 1273.13MIN: 62.69 / MAX: 1079.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: regnety_400mAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.66, N = 1222.28MIN: 18.51 / MAX: 1014.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: squeezenet_ssdAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.61, N = 1118.58MIN: 15.49 / MAX: 1036.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: yolov4-tinyAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX714212835SE +/- 0.74, N = 1232.25MIN: 25.08 / MAX: 650.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX714212835SE +/- 0.86, N = 1227.64MIN: 19.54 / MAX: 1095.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: alexnetAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX3691215SE +/- 0.34, N = 1211.78MIN: 8.2 / MAX: 373.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet18AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX48121620SE +/- 0.65, N = 1216.01MIN: 11.05 / MAX: 586.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vgg16AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1020304050SE +/- 1.27, N = 1243.28MIN: 33.71 / MAX: 423.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: googlenetAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX612182430SE +/- 1.88, N = 1225.04MIN: 16.8 / MAX: 988.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: blazefaceAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX0.83251.6652.49753.334.1625SE +/- 0.22, N = 123.70MIN: 2.87 / MAX: 278.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: efficientnet-b0AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX3691215SE +/- 0.50, N = 129.27MIN: 7.86 / MAX: 315.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mnasnetAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.36, N = 126.45MIN: 5.4 / MAX: 916.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: shufflenet-v2AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.50, N = 128.78MIN: 7.24 / MAX: 5041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.24, N = 126.19MIN: 5.46 / MAX: 262.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.71, N = 127.71MIN: 6.19 / MAX: 556.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mobilenetAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.36, N = 1218.78MIN: 15.72 / MAX: 589.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread