m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

KVM testing and AMD Ryzen 9 7940HS testing with a Win element M600 (SR500P03_P5C2V07 BIOS) and AMD Phoenix1 16GB on EndeavourOS rolling via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2410047-NE-2309026NE47&gru.

System Details

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2
  Processor: AMD Ryzen 9 7940HS @ 4.00GHz (8 Cores / 16 Threads)
  Motherboard: Win element M600 (SR500P03_P5C2V07 BIOS)
  Chipset: AMD Device 14e8
  Memory: 80GB
  Disk: Western Digital WD_BLACK SN850X 2000GB
  Graphics: AMD Phoenix1 16GB
  Audio: AMD Rembrandt Radeon HD Audio
  Monitor: DELL S3422DW
  Network: 2 x Realtek RTL8125 2.5GbE + Intel Wi-Fi 6 AX200
  OS: EndeavourOS rolling
  Kernel: 6.4.12-arch1-1 (x86_64)
  Desktop: Xfce 4.18
  Display Server: X Server 1.21.1.8
  OpenGL: 4.6 Mesa 23.1.6-arch1.4 (LLVM 16.0.6 DRM 3.52)
  Compiler: GCC 13.2.1 20230801
  File-System: ext4
  Screen Resolution: 3440x1440

m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03 (only the fields listed for this run in the export)
  Processor: AMD Ryzen 9 7940HS (14 Cores)
  Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (4.2023.08-4 BIOS)
  Chipset: Intel 82G33/G31/P35/P31 + ICH9
  Memory: 60GB
  Disk: Western Digital WD_BLACK SN850X 2000GB + 34GB QEMU HDD
  Graphics: AMD Radeon 780M 16GB (2799/2800MHz)
  Audio: Intel 82801I
  Network: Red Hat Virtio device
  Kernel: 6.11.1-arch1-1 (x86_64)
  Display Server: X Server 1.21.1.13
  OpenGL: 4.6 Mesa 24.2.3-arch1.1 (LLVM 18.1.8 DRM 3.58)
  Compiler: GCC 14.2.1 20240910 + Clang 18.1.8 + LLVM 18.1.8
  System Layer: KVM

Kernel Details
- Transparent Huge Pages: always

Compiler Details
- m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
- m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++,rust --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu

Processor Details
- m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa704101
- m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: CPU Microcode: 0xa704101

Graphics Details
- m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: GLAMOR - BAR1 / Visible vRAM Size: 16384 MB
- m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: GLAMOR - BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-PHXGENERIC-001

Python Details
- m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: Python 3.11.5
- m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: Python 3.12.6

Security Details
- m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result Overview

Tests covered for m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 and m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03:
- numpy
- tensorflow-lite: SqueezeNet; Inception V4; NASNet Mobile; Mobilenet Float; Mobilenet Quant; Inception ResNet V2
- onednn (each harness on CPU with f32, u8s8f32, and bf16bf16bf16): IP Shapes 1D; IP Shapes 3D; Convolution Batch Shapes Auto; Deconvolution Batch shapes_1d; Deconvolution Batch shapes_3d; Recurrent Neural Network Training; Recurrent Neural Network Inference
- ncnn (each model on the CPU and Vulkan GPU targets): mobilenet; mobilenet-v2; mobilenet-v3; shufflenet-v2; mnasnet; efficientnet-b0; blazeface; googlenet; vgg16; resnet18; alexnet; resnet50; yolov4-tiny; squeezenet_ssd; regnety_400m; vision_transformer; FastestDet
- deepspeech: CPU
- rnnoise
- numenta-nab: KNN CAD; Relative Entropy; Windowed Gaussian; Earthgecko Skyline; Bayesian Changepoint; Contextual Anomaly Detector OSE
- scikit-learn: GLM; Tree; Lasso; Sparsify; Plot Ward; MNIST Dataset; SGD Regression; SGDOneClassSVM; Plot Fast KMeans; Plot Hierarchical; Plot OMP vs. LARS; Feature Expansions; TSNE MNIST Dataset; Isotonic / Logistic; Hist Gradient Boosting

Numpy Benchmark

Numpy Benchmark (Score, More Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 692.94 (SE +/- 5.43, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 673.14 (SE +/- 2.26, N = 3)
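To put the two Numpy scores (692.94 and 673.14) on a common footing, the gap can be expressed as a percentage of the first run. The snippet below is a minimal sketch of that arithmetic; the function and variable names are illustrative and not part of the Phoronix Test Suite output.

```python
def relative_change_pct(baseline: float, other: float) -> float:
    """Percent change of `other` relative to `baseline`."""
    return (other - baseline) / baseline * 100.0


# Scores copied from the Numpy Benchmark result (more is better).
first_run_score = 692.94   # 2023-09-01-2 run
guest_run_score = 673.14   # 2024-10-03 guest-os-on-proxmox-host run

change = relative_change_pct(first_run_score, guest_run_score)
print(f"Guest score relative to first run: {change:+.2f}%")
```

A negative value on a "More Is Better" metric means the guest run scored lower than the first run.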

TensorFlow Lite

Model: SqueezeNet

TensorFlow Lite 2022-05-18 - Model: SqueezeNet (Microseconds, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1921.51 (SE +/- 5.32, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2043.14 (SE +/- 20.97, N = 3)

TensorFlow Lite

Model: Inception V4

TensorFlow Lite 2022-05-18 - Model: Inception V4 (Microseconds, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 28464.8 (SE +/- 57.85, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 29628.0 (SE +/- 104.54, N = 3)

TensorFlow Lite

Model: NASNet Mobile

TensorFlow Lite 2022-05-18 - Model: NASNet Mobile (Microseconds, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 7083.46 (SE +/- 47.37, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 7052.86 (SE +/- 2.29, N = 3)

TensorFlow Lite

Model: Mobilenet Float

TensorFlow Lite 2022-05-18 - Model: Mobilenet Float (Microseconds, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1515.79 (SE +/- 9.91, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1566.17 (SE +/- 4.17, N = 3)

TensorFlow Lite

Model: Mobilenet Quant

TensorFlow Lite 2022-05-18 - Model: Mobilenet Quant (Microseconds, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 3105.79 (SE +/- 3.46, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3581.76 (SE +/- 49.13, N = 3)

TensorFlow Lite

Model: Inception ResNet V2

TensorFlow Lite 2022-05-18 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 28085.8 (SE +/- 247.53, N = 3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 28135.5 (SE +/- 99.72, N = 3)
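The six TensorFlow Lite results can be condensed into one figure by taking the geometric mean of the guest-to-first-run time ratios. The sketch below uses the values reported above. The dictionary layout and names are illustrative assumptions, and the geometric mean is a summary chosen here for illustration, not a figure taken from the export.

```python
from statistics import geometric_mean

# (first run, guest run) inference times in microseconds, fewer is better.
tflite_times_us = {
    "SqueezeNet": (1921.51, 2043.14),
    "Inception V4": (28464.8, 29628.0),
    "NASNet Mobile": (7083.46, 7052.86),
    "Mobilenet Float": (1515.79, 1566.17),
    "Mobilenet Quant": (3105.79, 3581.76),
    "Inception ResNet V2": (28085.8, 28135.5),
}

# Ratio above 1.0 means the guest run took longer than the first run.
ratios = {model: guest / first for model, (first, guest) in tflite_times_us.items()}
overall_ratio = geometric_mean(ratios.values())

for model, ratio in ratios.items():
    print(f"{model}: {ratio:.3f}x")
print(f"Geometric mean of time ratios: {overall_ratio:.3f}x")
```

The geometric mean is used rather than the arithmetic mean so that models with very different absolute run times contribute equally to the summary.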

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.54620 (SE +/- 0.04068, N = 3; MIN: 3.79)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.50833 (SE +/- 0.04121, N = 3; MIN: 4.83)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.61966 (SE +/- 0.04248, N = 6; MIN: 4.37)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 4.90876 (SE +/- 0.01808, N = 3; MIN: 4.75)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 0.720656 (SE +/- 0.006523, N = 15; MIN: 0.57)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 0.788278 (SE +/- 0.009359, N = 3; MIN: 0.7)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1.58351 (SE +/- 0.02084, N = 3; MIN: 1.39)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1.80107 (SE +/- 0.01662, N = 6; MIN: 1.54)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1.58579 (SE +/- 0.00921, N = 3; MIN: 1.32)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1.75032 (SE +/- 0.00736, N = 3; MIN: 1.61)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.65992 (SE +/- 0.01990, N = 3; MIN: 2.43)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.60027 (SE +/- 0.02758, N = 3; MIN: 2.45)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
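For results as close as this one (2.65992 ms versus 2.60027 ms), the reported standard errors give a rough sense of whether the gap stands out from run-to-run noise. The sketch below divides the difference of the means by the combined standard error. It assumes the two runs are independent and that the SE values are standard errors of the mean; it is a quick heuristic rather than a formal significance test, and the function name is illustrative.

```python
import math


def gap_in_combined_se(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of two means, in units of their combined standard error."""
    combined_se = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    return abs(mean_a - mean_b) / combined_se


# oneDNN IP Shapes 3D, bf16bf16bf16, CPU (ms, fewer is better).
gap = gap_in_combined_se(2.65992, 0.01990, 2.60027, 0.02758)
print(f"Gap between runs: {gap:.2f} combined standard errors")
```

With only three samples per run, a gap below roughly two combined standard errors is weak evidence of a real difference.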

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 10.53300 (SE +/- 0.10514, N = 3; MIN: 8.9)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 9.38883 (SE +/- 0.00450, N = 3; MIN: 9.03)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 6.74907 (SE +/- 0.11025, N = 12; MIN: 4.3)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 6.14016 (SE +/- 0.02214, N = 3; MIN: 5.37)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.50492 (SE +/- 0.13143, N = 15; MIN: 4.07)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.90961 (SE +/- 0.03712, N = 3; MIN: 5.63)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 9.06392 (SE +/- 0.04724, N = 3; MIN: 8.73)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 9.00015 (SE +/- 0.00233, N = 3; MIN: 8.75)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 0.912151 (SE +/- 0.003553, N = 3; MIN: 0.82)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1.085030 (SE +/- 0.006131, N = 3; MIN: 1)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1.07871 (SE +/- 0.00135, N = 3; MIN: 0.96)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1.50450 (SE +/- 0.00207, N = 3; MIN: 1.4)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2672.08 (SE +/- 22.64, N = 8; MIN: 2490.39)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3134.37 (SE +/- 8.47, N = 3; MIN: 3100.85)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 3.1 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1436.72 (SE +/- 4.68, N = 3; MIN: 1361.41)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1620.26 (SE +/- 3.35, N = 3; MIN: 1590.57)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2718.50 (SE +/- 9.78, N = 3; MIN: 2636.72)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3155.67 (SE +/- 8.76, N = 3; MIN: 3117.68)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.21333 (SE +/- 0.06060, N = 3; MIN: 3.94)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 4.27091 (SE +/- 0.02962, N = 3; MIN: 4.05)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 8.55150 (SE +/- 0.03672, N = 3; MIN: 7.52)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 9.90374 (SE +/- 0.09024, N = 15; MIN: 8.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.75936 (SE +/- 0.00335, N = 3; MIN: 2.51)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3.66035 (SE +/- 0.01273, N = 3; MIN: 3.45)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.1 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1409.84 (SE +/- 17.30, N = 4; MIN: 1312.69)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1604.76 (SE +/- 7.75, N = 3; MIN: 1553.39)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2714.85 (SE +/- 5.55, N = 3; MIN: 2634.21)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3157.49 (SE +/- 14.23, N = 3; MIN: 3102.28)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.1 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1402.63 (SE +/- 2.23, N = 3; MIN: 1343.03)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1634.65 (SE +/- 2.00, N = 3; MIN: 1607.74)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

NCNN

Target: CPU - Model: mobilenet

NCNN 20230517 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 8.07 (SE +/- 0.01, N = 3; MIN: 7.82 / MAX: 11.67)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 9.61 (SE +/- 0.21, N = 12; MIN: 8.79 / MAX: 298.81)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.43 (SE +/- 0.01, N = 3; MIN: 2.28 / MAX: 5.61)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.74 (SE +/- 0.05, N = 12; MIN: 2.42 / MAX: 6.15)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.23 (SE +/- 0.01, N = 3; MIN: 2.13 / MAX: 5.1)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.52 (SE +/- 0.04, N = 12; MIN: 2.18 / MAX: 5.83)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20230517 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1.97 (SE +/- 0.00, N = 3; MIN: 1.85 / MAX: 6.58)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.35 (SE +/- 0.03, N = 12; MIN: 2.04 / MAX: 7.08)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20230517 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.18 (SE +/- 0.03, N = 3; MIN: 2.01 / MAX: 5.91)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.59 (SE +/- 0.05, N = 11; MIN: 2.21 / MAX: 6.71)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20230517 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 3.30 (SE +/- 0.03, N = 3; MIN: 3.09 / MAX: 6.22)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3.74 (SE +/- 0.06, N = 12; MIN: 3.31 / MAX: 7.14)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20230517 - Target: CPU - Model: blazeface (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 0.73 (SE +/- 0.01, N = 3; MIN: 0.69 / MAX: 3.56)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 0.82 (SE +/- 0.01, N = 12; MIN: 0.72 / MAX: 3.92)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20230517 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 6.33 (SE +/- 0.02, N = 3; MIN: 6.06 / MAX: 9.4)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 7.78 (SE +/- 0.15, N = 12; MIN: 6.82 / MAX: 11.68)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20230517 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 31.14 (SE +/- 0.46, N = 3; MIN: 30.03 / MAX: 44.67)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 33.56 (SE +/- 0.11, N = 12; MIN: 30.78 / MAX: 59.11)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20230517 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.64 (SE +/- 0.03, N = 3; MIN: 4.44 / MAX: 7.49)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.65 (SE +/- 0.13, N = 12; MIN: 5.12 / MAX: 10.06)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20230517 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.52 (SE +/- 0.03, N = 3; MIN: 4.35 / MAX: 8.37)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.28 (SE +/- 0.03, N = 12; MIN: 4.94 / MAX: 12.32)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20230517 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 10.45 (SE +/- 0.55, N = 3; MIN: 9.44 / MAX: 15.88)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 12.53 (SE +/- 0.05, N = 12; MIN: 11.97 / MAX: 19.24)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20230517 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 13.42 (SE +/- 0.08, N = 3; MIN: 12.96 / MAX: 17.77)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 15.85 (SE +/- 0.58, N = 12; MIN: 14.11 / MAX: 189.35)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20230517 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 6.30 (SE +/- 0.27, N = 3; MIN: 5.74 / MAX: 10.52)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 6.67 (SE +/- 0.05, N = 11; MIN: 6.28 / MAX: 10.23)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20230517 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 5.14 (SE +/- 0.01, N = 3; MIN: 4.92 / MAX: 11.22)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.80 (SE +/- 0.08, N = 12; MIN: 5.11 / MAX: 9.84)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

NCNN 20230517 - Target: CPU - Model: vision_transformer (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 53.79 (SE +/- 0.05, N = 3; MIN: 51.57 / MAX: 61.95)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 63.82 (SE +/- 0.30, N = 12; MIN: 60.84 / MAX: 93.31)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

NCNN 20230517 - Target: CPU - Model: FastestDet (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.49 (SE +/- 0.04, N = 3; MIN: 2.33 / MAX: 5.65)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.94 (SE +/- 0.05, N = 12; MIN: 2.56 / MAX: 7.8)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20230517 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 8.24 (SE +/- 0.09, N = 3; MIN: 7.85 / MAX: 12.18)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 9.32 (SE +/- 0.02, N = 3; MIN: 9.11 / MAX: 12.62)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.47 (SE +/- 0.03, N = 3; MIN: 2.28 / MAX: 5.9)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.74 (SE +/- 0.06, N = 3; MIN: 2.54 / MAX: 6.31)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.28 (SE +/- 0.02, N = 3; MIN: 2.14 / MAX: 5.25)
m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.54 (SE +/- 0.06, N = 3; MIN: 2.36 / MAX: 5.95)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20230517 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.01 (SE +/- 0.01, N = 3; MIN: 1.9 / MAX: 4.96)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.36 (SE +/- 0.04, N = 3; MIN: 2.24 / MAX: 5.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20230517 - Target: Vulkan GPU - Model: mnasnet (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.21 (SE +/- 0.02, N = 3; MIN: 2.08 / MAX: 5.13)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.56 (SE +/- 0.06, N = 3; MIN: 2.31 / MAX: 5.87)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20230517 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 3.38 (SE +/- 0.06, N = 3; MIN: 3.17 / MAX: 6.42)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3.76 (SE +/- 0.07, N = 3; MIN: 3.52 / MAX: 7.15)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20230517 - Target: Vulkan GPU - Model: blazeface (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 0.75 (SE +/- 0.01, N = 3; MIN: 0.69 / MAX: 3.66)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 0.84 (SE +/- 0.01, N = 3; MIN: 0.77 / MAX: 3.88)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20230517 - Target: Vulkan GPU - Model: googlenet (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 6.66 (SE +/- 0.07, N = 3; MIN: 6.35 / MAX: 10.74)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 7.83 (SE +/- 0.33, N = 3; MIN: 7.27 / MAX: 11.84)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20230517 - Target: Vulkan GPU - Model: vgg16 (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 31.20 (SE +/- 0.02, N = 3; MIN: 30.51 / MAX: 39.52)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 33.58 (SE +/- 0.18, N = 3; MIN: 30.97 / MAX: 56.84)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20230517 - Target: Vulkan GPU - Model: resnet18 (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.90 (SE +/- 0.03, N = 3; MIN: 4.68 / MAX: 10.25)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.74 (SE +/- 0.35, N = 3; MIN: 5.13 / MAX: 11.56)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20230517 - Target: Vulkan GPU - Model: alexnet (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 4.91 (SE +/- 0.10, N = 3; MIN: 4.69 / MAX: 11.39)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.35 (SE +/- 0.03, N = 3; MIN: 5.21 / MAX: 8.57)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20230517 - Target: Vulkan GPU - Model: resnet50 (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 10.37 (SE +/- 0.02, N = 3; MIN: 9.94 / MAX: 16.03)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 12.72 (SE +/- 0.07, N = 3; MIN: 12.47 / MAX: 17.66)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20230517 - Target: Vulkan GPU - Model: yolov4-tiny (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 13.60 (SE +/- 0.06, N = 3; MIN: 13.22 / MAX: 18.63)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 15.54 (SE +/- 0.07, N = 3; MIN: 13.95 / MAX: 19.75)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20230517 - Target: Vulkan GPU - Model: squeezenet_ssd (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 6.10 (SE +/- 0.02, N = 3; MIN: 5.84 / MAX: 14.12)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 6.71 (SE +/- 0.04, N = 3; MIN: 6.47 / MAX: 10.15)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20230517 - Target: Vulkan GPU - Model: regnety_400m (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 5.22 (SE +/- 0.05, N = 3; MIN: 4.91 / MAX: 8.27)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 5.88 (SE +/- 0.07, N = 3; MIN: 5.67 / MAX: 9.13)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20230517 - Target: Vulkan GPU - Model: vision_transformer (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 54.04 (SE +/- 0.06, N = 3; MIN: 52.55 / MAX: 64.52)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 64.38 (SE +/- 0.24, N = 3; MIN: 61.47 / MAX: 73.22)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20230517 - Target: Vulkan GPU - Model: FastestDet (ms, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 2.48 (SE +/- 0.05, N = 3; MIN: 2.34 / MAX: 6)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 2.88 (SE +/- 0.07, N = 3; MIN: 2.66 / MAX: 5.97)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

DeepSpeech

Acceleration: CPU

DeepSpeech 0.6 - Acceleration: CPU (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 46.96 (SE +/- 0.02, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 48.00 (SE +/- 0.03, N = 3)

RNNoise

RNNoise 2020-06-28 (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 14.30 (SE +/- 0.02, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 14.87 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden -lm

Numenta Anomaly Benchmark

Detector: KNN CAD

Numenta Anomaly Benchmark 1.1 - Detector: KNN CAD (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 107.12 (SE +/- 0.38, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 126.16 (SE +/- 1.55, N = 4)

Numenta Anomaly Benchmark

Detector: Relative Entropy

Numenta Anomaly Benchmark 1.1 - Detector: Relative Entropy (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 9.716 (SE +/- 0.090, N = 6)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 119.934 (SE +/- 1.498, N = 4)

Numenta Anomaly Benchmark

Detector: Windowed Gaussian

Numenta Anomaly Benchmark 1.1 - Detector: Windowed Gaussian (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 6.466 (SE +/- 0.012, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 8.848 (SE +/- 0.058, N = 3)

Numenta Anomaly Benchmark

Detector: Earthgecko Skyline

Numenta Anomaly Benchmark 1.1 - Detector: Earthgecko Skyline (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 86.57 (SE +/- 0.91, N = 15)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 103.75 (SE +/- 0.60, N = 3)

Numenta Anomaly Benchmark

Detector: Bayesian Changepoint

Numenta Anomaly Benchmark 1.1 - Detector: Bayesian Changepoint (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 23.27 (SE +/- 0.20, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 22.99 (SE +/- 0.02, N = 3)

Numenta Anomaly Benchmark

Detector: Contextual Anomaly Detector OSE

Numenta Anomaly Benchmark 1.1 - Detector: Contextual Anomaly Detector OSE (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 30.66 (SE +/- 0.17, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 37.68 (SE +/- 0.14, N = 3)

Scikit-Learn

Benchmark: GLM

Scikit-Learn 1.2.2 - Benchmark: GLM (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1649.84 (SE +/- 2.64, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1642.14 (SE +/- 2.20, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Tree

Scikit-Learn 1.2.2 - Benchmark: Tree (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 44.32 (SE +/- 0.36, N = 15)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 46.58 (SE +/- 0.39, N = 15)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Lasso

Scikit-Learn 1.2.2 - Benchmark: Lasso (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 3207.15 (SE +/- 12.72, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 3192.21 (SE +/- 9.69, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Sparsify

Scikit-Learn 1.2.2 - Benchmark: Sparsify (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 104.15 (SE +/- 0.18, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 76.27 (SE +/- 0.28, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Plot Ward

Scikit-Learn 1.2.2 - Benchmark: Plot Ward (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 39.65 (SE +/- 0.04, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 39.93 (SE +/- 0.03, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: MNIST Dataset

Scikit-Learn 1.2.2 - Benchmark: MNIST Dataset (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 55.46 (SE +/- 0.50, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 56.57 (SE +/- 0.18, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: SGD Regression

Scikit-Learn 1.2.2 - Benchmark: SGD Regression (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 655.59 (SE +/- 2.72, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 662.91 (SE +/- 0.37, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: SGDOneClassSVM

Scikit-Learn 1.2.2 - Benchmark: SGDOneClassSVM (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 209.02 (SE +/- 2.78, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 200.24 (SE +/- 0.65, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Plot Fast KMeans

Scikit-Learn 1.2.2 - Benchmark: Plot Fast KMeans (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 512.43 (SE +/- 2.66, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 525.46 (SE +/- 3.25, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Plot Hierarchical

Scikit-Learn 1.2.2 - Benchmark: Plot Hierarchical (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 128.61 (SE +/- 1.33, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 128.65 (SE +/- 0.66, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Plot OMP vs. LARS

Scikit-Learn 1.2.2 - Benchmark: Plot OMP vs. LARS (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 627.89 (SE +/- 1.88, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Feature Expansions

Scikit-Learn 1.2.2 - Benchmark: Feature Expansions (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 110.20 (SE +/- 0.08, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 110.21 (SE +/- 0.39, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: TSNE MNIST Dataset

Scikit-Learn 1.2.2 - Benchmark: TSNE MNIST Dataset (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 233.66 (SE +/- 0.28, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 254.29 (SE +/- 1.77, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Isotonic / Logistic

Scikit-Learn 1.2.2 - Benchmark: Isotonic / Logistic (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 1118.01 (SE +/- 4.00, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 1365.69 (SE +/- 2.94, N = 3)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Hist Gradient Boosting

Scikit-Learn 1.2.2 - Benchmark: Hist Gradient Boosting (Seconds, fewer is better)
  m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2: 62.78 (SE +/- 0.08, N = 3)
  m600_7940hs-guest-os-on-proxmox-host-60gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2024-10-03: 70.43 (SE +/- 0.50, N = 15)
1. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc
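
Every metric in this section is "fewer is better", so the two runs can be compared as a relative change against the 2023-09-01 bare-metal run. The following is a minimal sketch, not part of the Phoronix Test Suite output: the function name and the choice of three sample results are illustrative assumptions, and the figures are copied from the results above. The two runs also differ in kernel, Mesa, and compiler versions, so the percentage should not be read as pure KVM overhead.

```python
# Mean results copied from this section: (bare-metal run, KVM guest run).
# All are "fewer is better" timings.
results = {
    "NCNN CPU vision_transformer (ms)": (53.79, 63.82),
    "DeepSpeech CPU (s)": (46.96, 48.00),
    "Scikit-Learn Sparsify (s)": (104.15, 76.27),
}


def guest_change_pct(bare_metal, guest):
    """Percent change of the guest time relative to bare metal.

    Positive means the guest run was slower, negative means it was faster.
    """
    return (guest - bare_metal) / bare_metal * 100.0


changes = {name: guest_change_pct(*pair) for name, pair in results.items()}

for name, pct in changes.items():
    print(f"{name}: {pct:+.1f}%")
```

A negative value, as for Sparsify, means the guest run finished faster than the earlier bare-metal run.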


Phoronix Test Suite v10.8.5