KVM testing on Ubuntu 24.04 via the Phoronix Test Suite.
ASPEED - 2 x Intel Xeon Gold 6226R Processor: 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads), Motherboard: (5.14 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 512GB, Disk: 2 x 8002GB INTEL SSDPE2KX080T8, Graphics: ASPEED 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: 27B2G5, Network: 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb
OS: Ubuntu 24.04, Kernel: 6.8.0-38-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08
Python Notes: Python 3.8.13
Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
5x A5000 kw-dl580-3-4 NVIDIA Processor: 4 x Intel Xeon E7-4880 v2 (60 Cores / 120 Threads) , Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (edk2-20240813-1.fc40 BIOS) , Chipset: Intel 82G33/G31/P35/P31 + ICH9 , Memory: 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 4 GB RAM , Disk: 21GB VIRTUAL-DISK , Graphics: Red Hat QXL paravirtual graphic card 22GB , Audio: QEMU Generic , Network: 2 x Red Hat Virtio 1.0 device
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.0, File-System: ext4, Screen Resolution: 1024x768, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0x715
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.6d.00.0d
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Mitigation of PTE Inversion; VMX: flush not necessary SMT vulnerable + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Retpoline + srbds: Not affected + tsx_async_abort: Not affected
ArrayFire Test: Conjugate Gradient OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: ./arrayfire: 7: ./cg_opencl: not found
5x A5000 kw-dl580-3-4 NVIDIA: The test run did not produce a result. E: ./arrayfire: 7: ./cg_opencl: not found
cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 584.4 (SE +/- 0.43, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 380.1 (SE +/- 0.03, N = 3)
cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 547.4 (SE +/- 0.17, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 376.4 (SE +/- 0.10, N = 3)
(CC) gcc options for both cl-mem results: -O2 -flto -lOpenCL
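Every result on this page is reported as a mean with "SE +/- x, N = n". Assuming SE denotes the standard error of the mean over the N recorded runs, it can be reproduced from per-run values as sketched below. The run values in the example are hypothetical and are not taken from this result file, which only publishes the mean and the SE; the helper name `standard_error` is likewise illustrative.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run bandwidths in GB/s (NOT from the result file).
runs = [583.6, 584.5, 585.1]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.1f} GB/s, SE +/- {se:.2f}, N = {len(runs)}")
```

A small SE relative to the mean suggests the runs were consistent; a large one suggests the mean should be read with caution.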
clpeak 1.1.2, OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 26836.38 (SE +/- 0.40, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 18602.65 (SE +/- 76.05, N = 3)
clpeak 1.1.2, OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 483.57 (SE +/- 1.64, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 365.83 (SE +/- 0.36, N = 3)
clpeak 1.1.2, OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 582.46 (SE +/- 0.02, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 377.04 (SE +/- 0.01, N = 3)
(CXX) g++ options for all clpeak results: -O3
FinanceBench
FinanceBench is a collection of financial program benchmarks with support for GPU benchmarking via OpenCL and CPU benchmarking via OpenMP. The FinanceBench test cases focus on the Black-Scholes-Merton process with an analytic European option engine, a QMC (Sobol) Monte Carlo method (equity option example), a fixed-rate bond with a flat forward curve, and a securities repurchase agreement (repo). FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.
FinanceBench 2016-07-25, Benchmark: Black-Scholes OpenCL (ms, Fewer Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 8.049 (SE +/- 0.107, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 10.460 (SE +/- 0.016, N = 3)
(CXX) g++ options: -O3 -march=native -fopenmp
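For readers unfamiliar with the workload, the analytic Black-Scholes price of a European call can be sketched in a few lines of Python. This is the textbook closed-form formula, not FinanceBench's OpenCL kernel; the function and parameter names are illustrative assumptions, and the benchmark itself prices large batches of options on the GPU.

```python
import math


def norm_cdf(x):
    """Cumulative distribution function of the standard normal."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(spot, strike, rate, vol, years):
    """Closed-form price of a European call without dividends."""
    sqrt_t = math.sqrt(years)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol * vol) * years) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    return spot * norm_cdf(d1) - strike * math.exp(-rate * years) * norm_cdf(d2)


# Illustrative inputs: at-the-money option, 5% rate, 20% volatility, 1 year.
print(round(black_scholes_call(100.0, 100.0, 0.05, 0.20, 1.0), 4))
```

Each option is priced independently of the others, which is why this kind of workload maps well onto a GPU.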
Hashcat
Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.
Hashcat 6.2.4, Benchmark: MD5 (H/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 156178112500 (SE +/- 31239876948.73, N = 16)
5x A5000 kw-dl580-3-4 NVIDIA: 137067950000 (SE +/- 23688235109.70, N = 16)
Hashcat 6.2.4, Benchmark: SHA1 (H/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 91940033333 (SE +/- 113680610.09, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: 75698633333 (SE +/- 133865807.60, N = 3)
Hashcat 6.2.4, Benchmark: 7-Zip (H/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 4224700 (SE +/- 9462.73, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: 3519467 (SE +/- 9837.57, N = 3)
Hashcat 6.2.4, Benchmark: SHA-512 (H/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 13308000000 (SE +/- 26463244.95, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: 10923433333 (SE +/- 27986087.81, N = 3)
Hashcat 6.2.4, Benchmark: TrueCrypt RIPEMD160 + XTS (H/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 3454200 (SE +/- 1039.23, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: 2853933 (SE +/- 4272.52, N = 3)
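The raw SE figures for Hashcat are hard to judge at a glance because the hash rates are so large. A minimal sketch that expresses each SE as a percentage of its mean is given below. The numbers are copied from the MD5 and SHA1 results above, assuming the values appear in the same order as the system names on each chart; the helper name `relative_se_percent` is illustrative.

```python
def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


# (mean, SE) pairs in H/s, copied from the Hashcat MD5 and SHA1 results.
results = {
    "MD5 / ASPEED - 2 x Intel Xeon Gold 6226R": (156178112500, 31239876948.73),
    "MD5 / 5x A5000 kw-dl580-3-4 NVIDIA": (137067950000, 23688235109.70),
    "SHA1 / ASPEED - 2 x Intel Xeon Gold 6226R": (91940033333, 113680610.09),
    "SHA1 / 5x A5000 kw-dl580-3-4 NVIDIA": (75698633333, 133865807.60),
}

for label, (mean, se) in results.items():
    print(f"{label}: SE is {relative_se_percent(mean, se):.2f}% of the mean")
```

On these figures the MD5 runs stand out: their SE is roughly a fifth of the mean on both systems (N = 16), while the SHA1 runs vary by well under one percent.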
LeelaChessZero
LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
Backend: OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
LuxCoreRender
LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
LuxCoreRender 2.6, Scene: DLSC - Acceleration: GPU (M samples/sec, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 57.44 (SE +/- 5.22, N = 12; MAX: 65.49)
5x A5000 kw-dl580-3-4 NVIDIA: 48.60 (SE +/- 0.60, N = 3; MIN: 29.6 / MAX: 52.28)
LuxCoreRender 2.6, Scene: Danish Mood - Acceleration: GPU (M samples/sec, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 35.82 (SE +/- 0.26, N = 3; MIN: 12.8 / MAX: 46.64)
5x A5000 kw-dl580-3-4 NVIDIA: 24.72 (SE +/- 0.08, N = 3; MIN: 2.11 / MAX: 34.96)
LuxCoreRender 2.6, Scene: Orange Juice - Acceleration: GPU (M samples/sec, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 50.64 (SE +/- 0.09, N = 3; MIN: 44.98 / MAX: 64.06)
5x A5000 kw-dl580-3-4 NVIDIA: 34.46 (SE +/- 0.29, N = 15; MIN: 0.45 / MAX: 47.01)
LuxCoreRender 2.6, Scene: LuxCore Benchmark - Acceleration: GPU (M samples/sec, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 26.70 (SE +/- 2.44, N = 12; MAX: 43.01)
5x A5000 kw-dl580-3-4 NVIDIA: 16.69 (SE +/- 1.54, N = 12; MAX: 33.09)
LuxCoreRender 2.6, Scene: Rainbow Colors and Prism - Acceleration: GPU (M samples/sec, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 122.06 (SE +/- 0.62, N = 3; MIN: 106.98 / MAX: 141.43)
5x A5000 kw-dl580-3-4 NVIDIA: 76.39 (SE +/- 2.42, N = 12; MIN: 61.59 / MAX: 129.05)
MandelGPU
MandelGPU is an OpenCL benchmark. This test runs the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
OpenCL Device: GPU
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status.
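Although MandelGPU did not run on either system, the kind of work it measures can be illustrated with the classic escape-time iteration for the Mandelbrot set. The sketch below is scalar Python, not MandelGPU's float4 OpenCL kernel; the function name is hypothetical, and only the 4096-iteration cap comes from the test description.

```python
MAX_ITERATIONS = 4096  # iteration cap quoted in the test description


def escape_iterations(cx, cy, max_iter=MAX_ITERATIONS):
    """Count iterations of z -> z*z + c before |z| exceeds 2 (capped)."""
    x = y = 0.0
    for i in range(max_iter):
        if x * x + y * y > 4.0:
            return i
        x, y = x * x - y * y + cx, 2.0 * x * y + cy
    return max_iter


# A point inside the set runs to the cap; a point far outside escapes at once.
print(escape_iterations(0.0, 0.0), escape_iterations(2.0, 2.0))
```

A GPU implementation evaluates this loop for every pixel in parallel, so points inside the set (which run all 4096 iterations) dominate the runtime.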
Mixbench 2020-06-23, Backend: OpenCL - Benchmark: Double Precision (GFLOPS, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 391.91 (SE +/- 4.99, N = 15)
ASPEED - 2 x Intel Xeon Gold 6226R: 309.39 (SE +/- 0.92, N = 3)
Mixbench 2020-06-23, Backend: OpenCL - Benchmark: Single Precision (GFLOPS, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 27753.40 (SE +/- 327.84, N = 15)
ASPEED - 2 x Intel Xeon Gold 6226R: 18670.28 (SE +/- 27.29, N = 3)
(CXX) g++ options for both Mixbench results: -lm -lstdc++ -lOpenCL -lrt -O2
NCNN
NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.
NCNN 20230517, Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 18.93 (SE +/- 0.22, N = 12; MIN: 17.49 / MAX: 22.25)
5x A5000 kw-dl580-3-4 NVIDIA: 77.05 (SE +/- 2.12, N = 9; MIN: 37.56 / MAX: 842.19)
NCNN 20230517, Target: Vulkan GPU - Model: mobilenet-v2 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 8.37 (SE +/- 0.10, N = 12; MIN: 7.43 / MAX: 30.63)
5x A5000 kw-dl580-3-4 NVIDIA: 40.16 (SE +/- 1.67, N = 9; MIN: 19.36 / MAX: 731.71)
NCNN 20230517, Target: Vulkan GPU - Model: mobilenet-v3 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 8.52 (SE +/- 0.06, N = 12; MIN: 7.93 / MAX: 87.35)
5x A5000 kw-dl580-3-4 NVIDIA: 40.31 (SE +/- 1.76, N = 9; MIN: 19.09 / MAX: 864.81)
NCNN 20230517, Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 9.72 (SE +/- 0.08, N = 12; MIN: 8.97 / MAX: 16.96)
5x A5000 kw-dl580-3-4 NVIDIA: 47.98 (SE +/- 2.22, N = 9; MIN: 22.19 / MAX: 949.82)
NCNN 20230517, Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 7.30 (SE +/- 0.12, N = 12; MIN: 6.57 / MAX: 71.25)
5x A5000 kw-dl580-3-4 NVIDIA: 39.58 (SE +/- 1.96, N = 9; MIN: 17.71 / MAX: 715.88)
NCNN 20230517, Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 11.02 (SE +/- 0.13, N = 12; MIN: 9.86 / MAX: 77.38)
5x A5000 kw-dl580-3-4 NVIDIA: 57.12 (SE +/- 1.20, N = 9; MIN: 26.93 / MAX: 1232.95)
NCNN 20230517, Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 4.13 (SE +/- 0.07, N = 12; MIN: 3.7 / MAX: 4.65)
5x A5000 kw-dl580-3-4 NVIDIA: 21.06 (SE +/- 1.40, N = 9; MIN: 10.19 / MAX: 621.35)
NCNN 20230517, Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 18.15 (SE +/- 0.33, N = 12; MIN: 15.73 / MAX: 36.32)
5x A5000 kw-dl580-3-4 NVIDIA: 90.21 (SE +/- 3.17, N = 9; MIN: 41.1 / MAX: 1221.73)
NCNN 20230517, Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 45.73 (SE +/- 0.42, N = 12; MIN: 41.81 / MAX: 716.68)
5x A5000 kw-dl580-3-4 NVIDIA: 140.86 (SE +/- 1.79, N = 9; MIN: 71.64 / MAX: 393.59)
NCNN 20230517, Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 10.92 (SE +/- 0.11, N = 12; MIN: 10.17 / MAX: 12.21)
5x A5000 kw-dl580-3-4 NVIDIA: 43.28 (SE +/- 1.32, N = 9; MIN: 20.79 / MAX: 504.17)
NCNN 20230517, Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 7.97 (SE +/- 0.10, N = 12; MIN: 7.31 / MAX: 10.33)
5x A5000 kw-dl580-3-4 NVIDIA: 28.16 (SE +/- 0.75, N = 9; MIN: 13.44 / MAX: 256.84)
NCNN 20230517, Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 21.90 (SE +/- 0.24, N = 12; MIN: 20.15 / MAX: 31.1)
5x A5000 kw-dl580-3-4 NVIDIA: 93.89 (SE +/- 2.01, N = 9; MIN: 44.98 / MAX: 1038.18)
NCNN 20230517, Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 33.43 (SE +/- 0.49, N = 12; MIN: 29.11 / MAX: 256.67)
5x A5000 kw-dl580-3-4 NVIDIA: 106.30 (SE +/- 1.69, N = 9; MIN: 52.73 / MAX: 505.17)
NCNN 20230517, Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 20.32 (SE +/- 0.34, N = 12; MIN: 18.17 / MAX: 30.87)
5x A5000 kw-dl580-3-4 NVIDIA: 77.37 (SE +/- 3.32, N = 9; MIN: 35.36 / MAX: 1176.72)
NCNN 20230517, Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 32.77 (SE +/- 0.23, N = 12)
5x A5000 kw-dl580-3-4 NVIDIA: 227.14 (SE +/- 27.31, N = 9; MIN: 93.69 / MAX: 5948.7)
NCNN 20230517, Target: Vulkan GPU - Model: vision_transformer (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 58.46 (SE +/- 0.72, N = 12; MIN: 52.56 / MAX: 125.76)
5x A5000 kw-dl580-3-4 NVIDIA: 228.16 (SE +/- 3.18, N = 9; MIN: 122.48 / MAX: 1174.54)
NCNN 20230517, Target: Vulkan GPU - Model: FastestDet (ms, Fewer Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 10.39 (SE +/- 0.41, N = 11; MIN: 8.52 / MAX: 30.47)
5x A5000 kw-dl580-3-4 NVIDIA: 50.17 (SE +/- 1.46, N = 9; MIN: 22.35 / MAX: 948.14)
NCNN 20230517, Target: Vulkan GPU - Model: mobilenetv2-yolov3 (ms, Fewer Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 77.05 (SE +/- 2.12, N = 9; MIN: 37.56 / MAX: 842.19)
(CXX) g++ options for all NCNN results: -O3 -rdynamic -lgomp -lpthread
NeatBench
NeatBench is a benchmark of the cross-platform Neat Video software on the CPU, with optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.
Acceleration: GPU
ASPEED - 2 x Intel Xeon Gold 6226R: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.
5x A5000 kw-dl580-3-4 NVIDIA: The test run did not produce a result.
PlaidML
This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
PlaidML, FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL (FPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 751.93 (SE +/- 1.13, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
PlaidML, FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 1898.70 (SE +/- 2.99, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
PlaidML, FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 2201.61 (SE +/- 0.40, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
PlaidML, FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL (FPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 179.21 (SE +/- 0.34, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
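The PlaidML failures on the VM all report `ModuleNotFoundError: No module named 'tensorflow'`. A minimal sketch for checking, before a run, whether a module is importable by a given interpreter is shown below. It should be executed with the same Python interpreter the test profile uses (the two systems report different Python versions); the helper name `module_available` is illustrative, and whether installing the module alone would fix the test is not something the result file confirms.

```python
import importlib.util
import sys


def module_available(name):
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


print(f"Python {sys.version_info.major}.{sys.version_info.minor}")
for name in ("tensorflow",):
    state = "found" if module_available(name) else "MISSING"
    print(f"{name}: {state}")
```

`find_spec` only locates the module, so the check stays fast and has no side effects even for heavy packages.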
RedShift Demo
This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./redshift: 3: /usr/redshift/bin/redshiftBenchmark: not found
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./redshift: 3: /usr/redshift/bin/redshiftBenchmark: not found
Rodinia
Rodinia is a suite focused on speeding up compute-intensive applications with accelerators. The included applications support the CUDA, OpenMP, and OpenCL parallel models. This profile currently uses select OpenCL, NVIDIA CUDA, and OpenMP test binaries. Learn more via the OpenBenchmarking.org test page.
Rodinia 3.1, Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 6.694 (SE +/- 0.079, N = 15)
ASPEED - 2 x Intel Xeon Gold 6226R: 7.105 (SE +/- 0.096, N = 3)
(CXX) g++ options: -O2 -lOpenCL -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
SHOC Scalable HeterOgeneous Computing
The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: S3D (GFLOPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 211.90 (SE +/- 0.07, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 12.12 (SE +/- 0.00, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 1094.66 (SE +/- 0.17, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 22.57 (SE +/- 0.00, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Reduction (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 324.18 (SE +/- 0.27, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 3630.55 (SE +/- 44.25, N = 4)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 21619.9 (SE +/- 305.65, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 12.33 (SE +/- 0.00, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 13.15 (SE +/- 0.00, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 1998.58 (SE +/- 4.68, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
(CXX) g++ options for all SHOC results: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
ViennaCL
ViennaCL is an open-source linear algebra library written in C++ with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
ViennaCL 1.7.1, Test: CPU BLAS - sCOPY (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 299.3 (SE +/- 34.09, N = 12)
ASPEED - 2 x Intel Xeon Gold 6226R: 228.0 (SE +/- 3.89, N = 15)
ViennaCL 1.7.1, Test: CPU BLAS - sAXPY (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 392.9 (SE +/- 53.50, N = 12)
ASPEED - 2 x Intel Xeon Gold 6226R: 382.0 (SE +/- 5.93, N = 15)
ViennaCL 1.7.1, Test: CPU BLAS - sDOT (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 261.0 (SE +/- 1.23, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 199.8 (SE +/- 23.95, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dCOPY (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 117.9 (SE +/- 2.79, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 46.1 (SE +/- 1.21, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dAXPY (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 183.0 (SE +/- 1.77, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 67.8 (SE +/- 2.44, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dDOT (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 175.0 (SE +/- 1.40, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 56.6 (SE +/- 2.95, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dGEMV-N (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 108.0 (SE +/- 0.64, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 51.6 (SE +/- 1.70, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dGEMV-T (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 196.0 (SE +/- 1.22, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 191.3 (SE +/- 20.15, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 59.7 (SE +/- 1.20, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 57.0 (SE +/- 0.22, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 59.4 (SE +/- 1.11, N = 14)
5x A5000 kw-dl580-3-4 NVIDIA: 56.1 (SE +/- 0.35, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 62.2 (SE +/- 1.35, N = 14)
5x A5000 kw-dl580-3-4 NVIDIA: 58.3 (SE +/- 0.13, N = 12)
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 58.3 (SE +/- 1.46, N = 15)
5x A5000 kw-dl580-3-4 NVIDIA: 57.2 (SE +/- 0.14, N = 12)
ViennaCL 1.7.1, Test: OpenCL BLAS - sCOPY (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 309 (SE +/- 0.67, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 266 (SE +/- 0.67, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - sAXPY (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 403 (SE +/- 1.53, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 345 (SE +/- 0.33, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - sDOT (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 312 (SE +/- 0.67, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: 291 (SE +/- 0.67, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dCOPY (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 473 (SE +/- 1.45, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 359 (SE +/- 0.58, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dAXPY (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 534 (SE +/- 1.15, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 385 (SE +/- 0.00, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dDOT (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 477 (SE +/- 0.33, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 383 (SE +/- 0.33, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMV-N (GB/s, More Is Better)
ASPEED - 2 x Intel Xeon Gold 6226R: 170 (SE +/- 0.00, N = 3)
5x A5000 kw-dl580-3-4 NVIDIA: 164 (SE +/- 0.58, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMV-T (GB/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 324 (SE +/- 0.88, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 317 (SE +/- 1.76, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 440 (SE +/- 1.15, N = 3)
ASPEED - 2 x Intel Xeon Gold 6226R: 344 (SE +/- 1.45, N = 3)
ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 443
ASPEED - 2 x Intel Xeon Gold 6226R: 348
SE +/- 2.03, N = 3 (single value reported for this chart)
ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 442
ASPEED - 2 x Intel Xeon Gold 6226R: 340
SE +/- 2.03, N = 3 (single value reported for this chart)
ViennaCL 1.7.1, Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
5x A5000 kw-dl580-3-4 NVIDIA: 443 (SE +/- 1.76, N = 3)
(CXX) g++ options for all ViennaCL results: -fopenmp -O3 -rdynamic -lOpenCL
ASPEED - 2 x Intel Xeon Gold 6226R Processor: 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads), Motherboard: (5.14 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 512GB, Disk: 2 x 8002GB INTEL SSDPE2KX080T8, Graphics: ASPEED 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: 27B2G5, Network: 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb
OS: Ubuntu 24.04, Kernel: 6.8.0-38-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08
Python Notes: Python 3.8.13
Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 19 July 2024 23:42 by user malogica.
5x A5000 kw-dl580-3-4 NVIDIA Processor: 4 x Intel Xeon E7-4880 v2 (60 Cores / 120 Threads), Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (edk2-20240813-1.fc40 BIOS), Chipset: Intel 82G33/G31/P35/P31 + ICH9, Memory: 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 4 GB RAM, Disk: 21GB VIRTUAL-DISK, Graphics: Red Hat QXL paravirtual graphic card 22GB, Audio: QEMU Generic, Network: 2 x Red Hat Virtio 1.0 device
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.0, File-System: ext4, Screen Resolution: 1024x768, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0x715
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.6d.00.0d
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Mitigation of PTE Inversion; VMX: flush not necessary SMT vulnerable + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Retpoline + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 7 October 2024 22:58 by user root.