ddda

AMD Ryzen AI 9 365 testing with an ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 (UM5606WA.308 BIOS) and AMD Radeon 512MB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2412307-NE-DDDA8846655&sor.

System details (runs a, b, c, d):

Processor: AMD Ryzen AI 9 365 @ 4.31GHz (10 Cores / 20 Threads)
Motherboard: ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 (UM5606WA.308 BIOS)
Chipset: AMD Device 1507
Memory: 4 x 6GB LPDDR5-7500MT/s Micron MT62F1536M32D4DS-026
Disk: 1024GB MTFDKBA1T0QFM-1BD1AABGB
Graphics: AMD Radeon 512MB
Audio: AMD Rembrandt Radeon HD Audio
Network: MEDIATEK Device 7925
OS: Ubuntu 24.10
Kernel: 6.12.0-rc7-phx-eraps (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.2.3-1ubuntu1 (LLVM 19.1.0 DRM 3.59)
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 2880x1800

Kernel Details: amdgpu.dcdebugmask=0x600 - Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: amd-pstate-epp powersave (Boost: Enabled EPP: balance_performance) - Platform Profile: balanced - CPU Microcode: 0xb204011 - ACPI Profile: balanced

Security Details: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; PBRSB-eIBRS: Not affected; BHI: Not affected; ERAPS hardware RSB flush + srbds: Not affected + tsx_async_abort: Not affected

Result overview (ncnn results in ms, llama-cpp results in tokens per second):

Test | a | b | c | d
ncnn: CPU - mobilenet | 11.88 | 10.73 | 11.87 | 10.51
ncnn: CPU-v2-v2 - mobilenet-v2 | 4.68 | 4.42 | 4.47 | 4.54
ncnn: CPU-v3-v3 - mobilenet-v3 | 3.76 | 3.4 | 3.5 | 3.51
ncnn: CPU - shufflenet-v2 | 3.23 | 3.12 | 3.11 | 3.22
ncnn: CPU - mnasnet | 3.73 | 3.99 | 3.86 | 3.93
ncnn: CPU - efficientnet-b0 | 5.14 | 5 | 4.97 | 4.99
ncnn: CPU - blazeface | 1.17 | 1.33 | 1.27 | 1.28
ncnn: CPU - googlenet | 10.22 | 10.57 | 9.49 | 9.19
ncnn: CPU - vgg16 | 33.83 | 33.42 | 32.75 | 33.19
ncnn: CPU - resnet18 | 7.05 | 5.96 | 6.12 | 5.93
ncnn: CPU - alexnet | 5.83 | 5.29 | 4.97 | 5.08
ncnn: CPU - resnet50 | 13.92 | 14.36 | 14.65 | 14.44
ncnn: CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 | 11.88 | 10.73 | 11.87 | 10.51
ncnn: CPU - yolov4-tiny | 15.88 | 14.69 | 16.19 | 14.85
ncnn: CPU - squeezenet_ssd | 8.44 | 8.04 | 9.13 | 8.03
ncnn: CPU - regnety_400m | 9.17 | 9.19 | 9.2 | 8.87
ncnn: CPU - vision_transformer | 70.92 | 71.62 | 71.61 | 71.74
ncnn: CPU - FastestDet | 4.12 | 4.24 | 4.17 | 4.25
ncnn: Vulkan GPU - mobilenet | 11.39 | 11.52 | 11.51 | 11.69
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 4.34 | 4.29 | 4.33 | 4.32
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 3.65 | 3.56 | 3.69 | 3.41
ncnn: Vulkan GPU - shufflenet-v2 | 2.99 | 2.82 | 3 | 2.91
ncnn: Vulkan GPU - mnasnet | 3.49 | 3.41 | 3.46 | 3.26
ncnn: Vulkan GPU - efficientnet-b0 | 4.84 | 4.92 | 5.05 | 4.8
ncnn: Vulkan GPU - blazeface | 1.24 | 1.12 | 1.19 | 1.07
ncnn: Vulkan GPU - googlenet | 9.01 | 9.35 | 9.53 | 8.74
ncnn: Vulkan GPU - vgg16 | 33.38 | 32.29 | 32.38 | 33.23
ncnn: Vulkan GPU - resnet18 | 5.97 | 6.01 | 6.6 | 6.14
ncnn: Vulkan GPU - alexnet | 4.96 | 5.04 | 5.17 | 5.23
ncnn: Vulkan GPU - resnet50 | 14.1 | 14 | 14.1 | 14.38
ncnn: Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 | 11.39 | 11.52 | 11.51 | 11.69
ncnn: Vulkan GPU - yolov4-tiny | 15.18 | 17.06 | 16.17 | 16.52
ncnn: Vulkan GPU - squeezenet_ssd | 8.05 | 8.73 | 8.53 | 8.31
ncnn: Vulkan GPU - regnety_400m | 9.12 | 8.89 | 9.14 | 8.74
ncnn: Vulkan GPU - vision_transformer | 71.32 | 71.77 | 68.18 | 72.36
ncnn: Vulkan GPU - FastestDet | 2.77 | 3.49 | 2.75 | 3.98
llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128 | 10.3 | 10.33 | 10.32 | 10.3
llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 512 | 38.71 | 37.42 | 38.4 | 37.34
llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 1024 | 37.28 | 37.42 | 36.94 | 37.38
llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 2048 | 31.62 | 35.4 | 35.16 | 32.98
llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128 | 10.75 | 10.86 | 10.8 | 10.81
llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 512 | 36.11 | 37.46 | 37.97 | 36.32
llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 1024 | 34.3 | 37.73 | 36.57 | 35.25
llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048 | 31.58 | 33.12 | 33.81 | 32.63
llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128 | 59.22 | 60.7 | 60.75 | 58.28
llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 512 | 151.41 | 158.37 | 151.53 | 131.74
llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 1024 | 151.53 | 158.82 | 150.48 | 141.29
llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048 | 139.28 | 139.57 | 140.36 | 139.91

NCNN

Target: CPU - Model: mobilenet

NCNN 20241226, ms (Fewer Is Better)
d: 10.51 (MIN: 10.4 / MAX: 12.15)
b: 10.73 (MIN: 10.62 / MAX: 12.77)
c: 11.87 (MIN: 11.6 / MAX: 35.77)
a: 11.88 (MIN: 11.76 / MAX: 17.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20241226, ms (Fewer Is Better)
b: 4.42 (MIN: 3.66 / MAX: 33.56)
c: 4.47 (MIN: 3.78 / MAX: 33.88)
d: 4.54 (MIN: 3.8 / MAX: 29.67)
a: 4.68 (MIN: 4.11 / MAX: 35.42)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20241226, ms (Fewer Is Better)
b: 3.40 (MIN: 3.25 / MAX: 26.22)
c: 3.50 (MIN: 3.3 / MAX: 45.42)
d: 3.51 (MIN: 3.35 / MAX: 27.78)
a: 3.76 (MIN: 3.53 / MAX: 28.89)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20241226, ms (Fewer Is Better)
c: 3.11 (MIN: 2.86 / MAX: 32.43)
b: 3.12 (MIN: 2.99 / MAX: 25.69)
d: 3.22 (MIN: 3.07 / MAX: 34.03)
a: 3.23 (MIN: 2.93 / MAX: 35.88)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20241226, ms (Fewer Is Better)
a: 3.73 (MIN: 3.52 / MAX: 25.76)
c: 3.86 (MIN: 3.51 / MAX: 53.01)
d: 3.93 (MIN: 3.64 / MAX: 32.62)
b: 3.99 (MIN: 3.66 / MAX: 30.25)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20241226, ms (Fewer Is Better)
c: 4.97 (MIN: 4.92 / MAX: 6.74)
d: 4.99 (MIN: 4.93 / MAX: 5.87)
b: 5.00 (MIN: 4.92 / MAX: 9.16)
a: 5.14 (MIN: 5.1 / MAX: 5.68)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20241226, ms (Fewer Is Better)
a: 1.17 (MIN: 1.15 / MAX: 1.32)
c: 1.27 (MIN: 1.26 / MAX: 1.43)
d: 1.28 (MIN: 1.26 / MAX: 1.87)
b: 1.33 (MIN: 1.3 / MAX: 2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20241226, ms (Fewer Is Better)
d: 9.19 (MIN: 8.9 / MAX: 49.46)
c: 9.49 (MIN: 8.97 / MAX: 57.01)
a: 10.22 (MIN: 10.02 / MAX: 14.16)
b: 10.57 (MIN: 9.04 / MAX: 50.3)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20241226, ms (Fewer Is Better)
c: 32.75 (MIN: 31.39 / MAX: 55.09)
d: 33.19 (MIN: 31.59 / MAX: 64.74)
b: 33.42 (MIN: 32.44 / MAX: 75.15)
a: 33.83 (MIN: 31.89 / MAX: 74.06)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20241226, ms (Fewer Is Better)
d: 5.93 (MIN: 5.83 / MAX: 7.8)
b: 5.96 (MIN: 5.89 / MAX: 7.95)
c: 6.12 (MIN: 5.81 / MAX: 30.87)
a: 7.05 (MIN: 7 / MAX: 7.99)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20241226, ms (Fewer Is Better)
c: 4.97 (MIN: 4.66 / MAX: 10.74)
d: 5.08 (MIN: 4.97 / MAX: 6.57)
b: 5.29 (MIN: 5.01 / MAX: 17.98)
a: 5.83 (MIN: 5.76 / MAX: 7.28)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20241226, ms (Fewer Is Better)
a: 13.92 (MIN: 13.33 / MAX: 58.77)
b: 14.36 (MIN: 12.97 / MAX: 61.42)
d: 14.44 (MIN: 13.49 / MAX: 34.89)
c: 14.65 (MIN: 14.54 / MAX: 15.47)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3

NCNN 20241226, ms (Fewer Is Better)
d: 10.51 (MIN: 10.4 / MAX: 12.15)
b: 10.73 (MIN: 10.62 / MAX: 12.77)
c: 11.87 (MIN: 11.6 / MAX: 35.77)
a: 11.88 (MIN: 11.76 / MAX: 17.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20241226, ms (Fewer Is Better)
b: 14.69 (MIN: 14.27 / MAX: 69.64)
d: 14.85 (MIN: 14.14 / MAX: 85.67)
a: 15.88 (MIN: 15.09 / MAX: 54.54)
c: 16.19 (MIN: 14.04 / MAX: 25.8)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20241226, ms (Fewer Is Better)
d: 8.03 (MIN: 7.87 / MAX: 12.13)
b: 8.04 (MIN: 7.86 / MAX: 25.17)
a: 8.44 (MIN: 8.25 / MAX: 27.11)
c: 9.13 (MIN: 8.78 / MAX: 67.75)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20241226, ms (Fewer Is Better)
d: 8.87 (MIN: 8.63 / MAX: 46.65)
a: 9.17 (MIN: 9.11 / MAX: 11.17)
b: 9.19 (MIN: 9.1 / MAX: 12.28)
c: 9.20 (MIN: 8.93 / MAX: 57.09)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

NCNN 20241226, ms (Fewer Is Better)
a: 70.92 (MIN: 68.66 / MAX: 103.73)
c: 71.61 (MIN: 67.29 / MAX: 115.35)
b: 71.62 (MIN: 69.18 / MAX: 102.55)
d: 71.74 (MIN: 69.59 / MAX: 91.14)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

NCNN 20241226, ms (Fewer Is Better)
a: 4.12 (MIN: 4.08 / MAX: 5.73)
c: 4.17 (MIN: 3.99 / MAX: 32.13)
b: 4.24 (MIN: 4.12 / MAX: 25.65)
d: 4.25 (MIN: 4.21 / MAX: 8.33)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20241226, ms (Fewer Is Better)
a: 11.39 (MIN: 11.22 / MAX: 16.67)
c: 11.51 (MIN: 11.28 / MAX: 31.2)
b: 11.52 (MIN: 11.29 / MAX: 40.04)
d: 11.69 (MIN: 11.56 / MAX: 17.13)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20241226, ms (Fewer Is Better)
b: 4.29 (MIN: 3.64 / MAX: 29.66)
d: 4.32 (MIN: 3.64 / MAX: 27.83)
c: 4.33 (MIN: 3.66 / MAX: 31.75)
a: 4.34 (MIN: 3.65 / MAX: 34.66)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20241226, ms (Fewer Is Better)
d: 3.41 (MIN: 3.11 / MAX: 34.27)
b: 3.56 (MIN: 3.32 / MAX: 26.9)
a: 3.65 (MIN: 3.38 / MAX: 31.52)
c: 3.69 (MIN: 3.34 / MAX: 53.01)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20241226, ms (Fewer Is Better)
b: 2.82 (MIN: 2.69 / MAX: 25.04)
d: 2.91 (MIN: 2.66 / MAX: 51.34)
a: 2.99 (MIN: 2.87 / MAX: 25.33)
c: 3.00 (MIN: 2.79 / MAX: 47.27)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20241226, ms (Fewer Is Better)
d: 3.26 (MIN: 3.09 / MAX: 31.76)
b: 3.41 (MIN: 3.26 / MAX: 31.3)
c: 3.46 (MIN: 3.33 / MAX: 25.82)
a: 3.49 (MIN: 3.36 / MAX: 29.39)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20241226, ms (Fewer Is Better)
d: 4.80 (MIN: 4.72 / MAX: 9.79)
a: 4.84 (MIN: 4.8 / MAX: 6.4)
b: 4.92 (MIN: 4.88 / MAX: 6.43)
c: 5.05 (MIN: 4.86 / MAX: 28.25)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20241226, ms (Fewer Is Better)
d: 1.07 (MIN: 1.06 / MAX: 1.69)
b: 1.12 (MIN: 1.11 / MAX: 1.34)
c: 1.19 (MIN: 1.18 / MAX: 1.23)
a: 1.24 (MIN: 1.18 / MAX: 5.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20241226, ms (Fewer Is Better)
d: 8.74 (MIN: 8.35 / MAX: 66.31)
a: 9.01 (MIN: 8.55 / MAX: 49.29)
b: 9.35 (MIN: 9.02 / MAX: 29.75)
c: 9.53 (MIN: 8.98 / MAX: 50.27)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20241226, ms (Fewer Is Better)
b: 32.29 (MIN: 30.48 / MAX: 69.49)
c: 32.38 (MIN: 31.27 / MAX: 63.71)
d: 33.23 (MIN: 30.69 / MAX: 148.44)
a: 33.38 (MIN: 31.56 / MAX: 65.09)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20241226, ms (Fewer Is Better)
a: 5.97 (MIN: 5.88 / MAX: 7.65)
b: 6.01 (MIN: 5.92 / MAX: 7.29)
d: 6.14 (MIN: 5.83 / MAX: 26.68)
c: 6.60 (MIN: 5.87 / MAX: 53.8)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20241226, ms (Fewer Is Better)
a: 4.96 (MIN: 4.89 / MAX: 6.2)
b: 5.04 (MIN: 4.89 / MAX: 6.96)
c: 5.17 (MIN: 4.98 / MAX: 20.08)
d: 5.23 (MIN: 5.08 / MAX: 5.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20241226, ms (Fewer Is Better)
b: 14.00 (MIN: 13.83 / MAX: 15.63)
a: 14.10 (MIN: 13.4 / MAX: 38.99)
c: 14.10 (MIN: 13.45 / MAX: 36.9)
d: 14.38 (MIN: 13.78 / MAX: 58.62)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3

NCNN 20241226, ms (Fewer Is Better)
a: 11.39 (MIN: 11.22 / MAX: 16.67)
c: 11.51 (MIN: 11.28 / MAX: 31.2)
b: 11.52 (MIN: 11.29 / MAX: 40.04)
d: 11.69 (MIN: 11.56 / MAX: 17.13)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20241226, ms (Fewer Is Better)
a: 15.18 (MIN: 14.42 / MAX: 56.53)
c: 16.17 (MIN: 15.1 / MAX: 60.38)
d: 16.52 (MIN: 15.5 / MAX: 86.62)
b: 17.06 (MIN: 16.45 / MAX: 52.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20241226, ms (Fewer Is Better)
a: 8.05 (MIN: 7.93 / MAX: 13.96)
d: 8.31 (MIN: 8.2 / MAX: 13.89)
c: 8.53 (MIN: 8.45 / MAX: 10.18)
b: 8.73 (MIN: 8.63 / MAX: 9.35)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20241226, ms (Fewer Is Better)
d: 8.74 (MIN: 8.64 / MAX: 9.66)
b: 8.89 (MIN: 8.74 / MAX: 10.45)
a: 9.12 (MIN: 8.89 / MAX: 34.9)
c: 9.14 (MIN: 9.05 / MAX: 13.54)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20241226, ms (Fewer Is Better)
c: 68.18 (MIN: 61.64 / MAX: 135.53)
a: 71.32 (MIN: 66.94 / MAX: 100.96)
b: 71.77 (MIN: 66.86 / MAX: 121.7)
d: 72.36 (MIN: 70.52 / MAX: 91.57)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20241226, ms (Fewer Is Better)
c: 2.75 (MIN: 2.71 / MAX: 4.51)
a: 2.77 (MIN: 2.72 / MAX: 4.61)
b: 3.49 (MIN: 3.46 / MAX: 3.88)
d: 3.98 (MIN: 3.94 / MAX: 5.42)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 10.33
c: 10.32
d: 10.30
a: 10.30
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512

Llama.cpp b4397, Tokens Per Second (More Is Better)
a: 38.71
c: 38.40
b: 37.42
d: 37.34
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 37.42
d: 37.38
a: 37.28
c: 36.94
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 35.40
c: 35.16
d: 32.98
a: 31.62
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 10.86
d: 10.81
c: 10.80
a: 10.75
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512

Llama.cpp b4397, Tokens Per Second (More Is Better)
c: 37.97
b: 37.46
d: 36.32
a: 36.11
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 37.73
c: 36.57
d: 35.25
a: 34.30
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048

Llama.cpp b4397, Tokens Per Second (More Is Better)
c: 33.81
b: 33.12
d: 32.63
a: 31.58
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128

Llama.cpp b4397, Tokens Per Second (More Is Better)
c: 60.75
b: 60.70
a: 59.22
d: 58.28
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 158.37
c: 151.53
a: 151.41
d: 131.74
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024

Llama.cpp b4397, Tokens Per Second (More Is Better)
b: 158.82
a: 151.53
c: 150.48
d: 141.29
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048

Llama.cpp b4397, Tokens Per Second (More Is Better)
c: 140.36
d: 139.91
b: 139.57
a: 139.28
1. (CXX) g++ options: -O3
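
Run-to-run spread

Since runs a, b, c and d share the same system configuration, one way to read the results is to look at how far the slowest run falls from the fastest one in each test. The following is a minimal sketch of that arithmetic, not part of the Phoronix Test Suite: the helper name `spread_percent` is hypothetical, and the two sample data sets are copied from the NCNN "Target: CPU - Model: mobilenet" result (ms, fewer is better) and the Llama.cpp granite-3.0-3b-a800m-instruct-Q8_0 "Prompt Processing 512" result (tokens per second, more is better).

```python
def spread_percent(results, higher_is_better):
    """Return (best_run, worst_run, gap) where gap is the distance of the
    worst run from the best run, as a percentage of the best run's value.

    Hypothetical helper for illustration only.
    """
    pick_best = max if higher_is_better else min
    pick_worst = min if higher_is_better else max
    best = pick_best(results, key=results.get)
    worst = pick_worst(results, key=results.get)
    gap = abs(results[worst] - results[best]) / results[best] * 100.0
    return best, worst, gap


# NCNN, Target: CPU - Model: mobilenet (ms, fewer is better)
ncnn_cpu_mobilenet = {"a": 11.88, "b": 10.73, "c": 11.87, "d": 10.51}

# Llama.cpp, granite-3.0-3b-a800m-instruct-Q8_0, Prompt Processing 512
# (tokens per second, more is better)
granite_pp512 = {"a": 151.41, "b": 158.37, "c": 151.53, "d": 131.74}

for name, results, higher_is_better in [
    ("ncnn CPU mobilenet", ncnn_cpu_mobilenet, False),
    ("llama-cpp granite PP512", granite_pp512, True),
]:
    best, worst, gap = spread_percent(results, higher_is_better)
    print(f"{name}: best={best}, worst={worst}, gap={gap:.1f}%")
```

The direction flag matters because the NCNN charts are "Fewer Is Better" while the Llama.cpp charts are "More Is Better"; using the wrong direction would swap the best and worst runs.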


Phoronix Test Suite v10.8.5