newd

AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.8 BIOS) and NVIDIA GeForce RTX 4090 24GB on EndeavourOS rolling via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2404233-NE-NEWD5443293&grr.

newd: AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB

Processor: AMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads)
Motherboard: Supermicro H12SSL-I v1.02 (2.8 BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 15363GB Micron_7450_MTFDKCC15T3TFR
Graphics: NVIDIA GeForce RTX 4090 24GB
Audio: NVIDIA AD102 HD Audio
Monitor: 38GN950
Network: 2 x Intel X710 for 10GbE SFP+
OS: EndeavourOS rolling
Kernel: 6.8.7-zen1-1-zen (x86_64)
Desktop: Xfce 4.18
Display Server: X Server 1.21.1.13
Display Driver: NVIDIA 550.76
OpenGL: 4.6.0
Compiler: GCC 13.2.1 20230801 + Clang 17.0.6 + LLVM 17.0.6 + CUDA 12.4
File-System: btrfs
Screen Resolution: 3840x1600

- Transparent Huge Pages: always
- NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"
- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
- Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d3
- Python 3.11.8
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

newd: AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB

openvino: Face Detection FP16 - CPU (ms): 2306.36
openvino: Face Detection FP16 - CPU (FPS): 10.31
openvino: Person Detection FP16 - CPU (ms): 228.80
openvino: Person Detection FP16 - CPU (FPS): 104.73
openvino: Person Detection FP32 - CPU (ms): 227.07
openvino: Person Detection FP32 - CPU (FPS): 105.53
llama-cpp: llama-2-13b.Q4_0.gguf (Tokens Per Second): 15.84

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2024.0, ms, Fewer Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 2306.36 (SE +/- 3.56, N = 3; MIN: 1274.94 / MAX: 2469.91)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2024.0, FPS, More Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 10.31 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0, ms, Fewer Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 228.80 (SE +/- 0.31, N = 3; MIN: 115.55 / MAX: 317.77)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0, FPS, More Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 104.73 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2024.0, ms, Fewer Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 227.07 (SE +/- 0.81, N = 3; MIN: 130.78 / MAX: 319.21)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2024.0, FPS, More Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 105.53 (SE +/- 0.39, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Llama.cpp

Model: llama-2-13b.Q4_0.gguf

Llama.cpp b1808, Tokens Per Second, More Is Better
AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB: 15.84 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -fopenmp -lopenblas
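
The reported means and standard errors can be put on a common scale by expressing each SE as a percentage of its mean, and the Llama.cpp throughput can be restated as time per token. The Python sketch below is illustrative only and is not part of the Phoronix Test Suite output. The numbers are copied from the results in this export, while the helper names `relative_se_percent` and `ms_per_token` are hypothetical.

```python
# Reported (mean, standard error) pairs copied from the results in this export.
results = {
    "OpenVINO Face Detection FP16 (ms)": (2306.36, 3.56),
    "OpenVINO Face Detection FP16 (FPS)": (10.31, 0.02),
    "OpenVINO Person Detection FP16 (ms)": (228.80, 0.31),
    "OpenVINO Person Detection FP16 (FPS)": (104.73, 0.13),
    "OpenVINO Person Detection FP32 (ms)": (227.07, 0.81),
    "OpenVINO Person Detection FP32 (FPS)": (105.53, 0.39),
    "Llama.cpp llama-2-13b.Q4_0.gguf (tokens/s)": (15.84, 0.02),
}


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean (hypothetical helper)."""
    return 100.0 * se / mean


def ms_per_token(tokens_per_second):
    """Convert a tokens-per-second rate into milliseconds per token (hypothetical helper)."""
    return 1000.0 / tokens_per_second


if __name__ == "__main__":
    for name, (mean, se) in results.items():
        print(f"{name}: {mean} +/- {se} ({relative_se_percent(mean, se):.2f}% of mean)")
    llama_mean, _ = results["Llama.cpp llama-2-13b.Q4_0.gguf (tokens/s)"]
    print(f"Llama.cpp time per token: {ms_per_token(llama_mean):.2f} ms")
```

A small relative SE across the three runs (N = 3) suggests the measurements were stable from run to run. It says nothing about the spread within a run, which the MIN/MAX latency figures describe separately.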


Phoronix Test Suite v10.8.4