2 x Intel Xeon Gold 6226R testing with a (5.14 BIOS) and ASPEED 16GB on Ubuntu 24.04 via the Phoronix Test Suite.
ASPEED - 2 x Intel Xeon Gold 6226R Processor: 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads), Motherboard: (5.14 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 512GB, Disk: 2 x 8002GB INTEL SSDPE2KX080T8, Graphics: ASPEED 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: 27B2G5, Network: 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb
OS: Ubuntu 24.04, Kernel: 6.8.0-38-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08Python Notes: Python 3.8.13Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Nvidia OpenBenchmarking.org Phoronix Test Suite 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads) (5.14 BIOS) Intel Sky Lake-E DMI3 Registers 512GB 2 x 8002GB INTEL SSDPE2KX080T8 ASPEED 16GB NVIDIA GA104 HD Audio 27B2G5 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb Ubuntu 24.04 6.8.0-38-generic (x86_64) X Server NVIDIA OpenCL 3.0 CUDA 12.4.131 GCC 13.2.0 + CUDA 12.4 ext4 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Display Server Display Driver OpenCL Compiler File-System Screen Resolution Nvidia Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08 - Python 3.8.13 - gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Nvidia hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS mixbench: OpenCL - Integer mixbench: OpenCL - Double Precision mixbench: OpenCL - Single Precision shoc: OpenCL - S3D shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Reduction shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write fahbench: clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth rodinia: OpenCL Particle Filter luxcorerender: DLSC - GPU luxcorerender: Danish Mood - GPU luxcorerender: Orange Juice - GPU luxcorerender: LuxCore Benchmark - GPU luxcorerender: Rainbow Colors and Prism - GPU financebench: Black-Scholes OpenCL viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TT ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - FastestDet plaidml: No - Inference - IMDB LSTM - OpenCL plaidml: No - Inference - Mobilenet - OpenCL plaidml: Yes - Inference - Mobilenet - OpenCL plaidml: No - Inference - DenseNet 201 - OpenCL neatbench: GPU ASPEED - 2 x Intel Xeon Gold 6226R 156178112500 91940033333 4224700 13308000000 3454200 11601.09 309.39 18670.28 211.904 12.1173 1094.66 22.5655 324.182 3630.55 21619.9 12.3250 13.1527 1998.58 283.4 380.1 376.4 240.1385 9617.49 18602.65 365.83 377.04 7.105 57.44 35.82 50.64 26.70 122.06 10.460 228 382 261 117.9 183 175 108 196 59.7 59.4 62.2 58.3 266 345 312 359 385 383 170 317 344 348 340 18.93 8.37 8.52 9.72 7.30 11.02 4.13 18.15 45.73 10.92 7.97 21.90 33.43 20.32 32.77 58.46 10.39 751.93 1898.70 2201.61 179.21 OpenBenchmarking.org
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA1 ASPEED - 2 x Intel Xeon Gold 6226R 20000M 40000M 60000M 80000M 100000M SE +/- 113680610.09, N = 3 91940033333
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA-512 ASPEED - 2 x Intel Xeon Gold 6226R 3000M 6000M 9000M 12000M 15000M SE +/- 26463244.95, N = 3 13308000000
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS ASPEED - 2 x Intel Xeon Gold 6226R 700K 1400K 2100K 2800K 3500K SE +/- 1039.23, N = 3 3454200
OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.92, N = 3 309.39 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision ASPEED - 2 x Intel Xeon Gold 6226R 4K 8K 12K 16K 20K SE +/- 27.29, N = 3 18670.28 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
SHOC Scalable HeterOgeneous Computing The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D ASPEED - 2 x Intel Xeon Gold 6226R 50 100 150 200 250 SE +/- 0.07, N = 3 211.90 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
RedShift Demo This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./redshift: 3: /usr/redshift/bin/redshiftBenchmark: not found
OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Float ASPEED - 2 x Intel Xeon Gold 6226R 4K 8K 12K 16K 20K SE +/- 76.05, N = 3 18602.65 1. (CXX) g++ options: -O3
OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Double ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 SE +/- 0.36, N = 3 365.83 1. (CXX) g++ options: -O3
OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 SE +/- 0.01, N = 3 377.04 1. (CXX) g++ options: -O3
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
Backend: OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter ASPEED - 2 x Intel Xeon Gold 6226R 2 4 6 8 10 SE +/- 0.096, N = 3 7.105 1. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 5.22, N = 12 57.44 MAX: 65.49
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU ASPEED - 2 x Intel Xeon Gold 6226R 8 16 24 32 40 SE +/- 0.26, N = 3 35.82 MIN: 12.8 / MAX: 46.64
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU ASPEED - 2 x Intel Xeon Gold 6226R 11 22 33 44 55 SE +/- 0.09, N = 3 50.64 MIN: 44.98 / MAX: 64.06
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU ASPEED - 2 x Intel Xeon Gold 6226R 6 12 18 24 30 SE +/- 2.44, N = 12 26.70 MAX: 43.01
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU ASPEED - 2 x Intel Xeon Gold 6226R 30 60 90 120 150 SE +/- 0.62, N = 3 122.06 MIN: 106.98 / MAX: 141.43
ArrayFire Test: Conjugate Gradient OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: ./arrayfire: 7: ./cg_opencl: not found
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.02, N = 3 10.46 1. (CXX) g++ options: -O3 -march=native -fopenmp
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY ASPEED - 2 x Intel Xeon Gold 6226R 50 100 150 200 250 SE +/- 3.89, N = 15 228 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 SE +/- 5.93, N = 15 382 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT ASPEED - 2 x Intel Xeon Gold 6226R 60 120 180 240 300 SE +/- 1.23, N = 15 261 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY ASPEED - 2 x Intel Xeon Gold 6226R 30 60 90 120 150 SE +/- 2.79, N = 15 117.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 1.77, N = 15 183 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 1.40, N = 15 175 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 0.64, N = 15 108 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 1.22, N = 15 196 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 1.20, N = 15 59.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 1.11, N = 14 59.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN ASPEED - 2 x Intel Xeon Gold 6226R 14 28 42 56 70 SE +/- 1.35, N = 14 62.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 1.46, N = 15 58.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY ASPEED - 2 x Intel Xeon Gold 6226R 60 120 180 240 300 SE +/- 0.67, N = 3 266 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.33, N = 3 345 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.67, N = 3 312 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 SE +/- 0.58, N = 3 359 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 SE +/- 0.00, N = 3 385 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 SE +/- 0.33, N = 3 383 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 0.00, N = 3 170 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 1.76, N = 3 317 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 1.45, N = 3 344 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT ASPEED - 2 x Intel Xeon Gold 6226R 80 160 240 320 400 348 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 340 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ASPEED - 2 x Intel Xeon Gold 6226R 2 4 6 8 10 SE +/- 0.10, N = 12 8.37 MIN: 7.43 / MAX: 30.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ASPEED - 2 x Intel Xeon Gold 6226R 2 4 6 8 10 SE +/- 0.06, N = 12 8.52 MIN: 7.93 / MAX: 87.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.08, N = 12 9.72 MIN: 8.97 / MAX: 16.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ASPEED - 2 x Intel Xeon Gold 6226R 2 4 6 8 10 SE +/- 0.12, N = 12 7.30 MIN: 6.57 / MAX: 71.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.13, N = 12 11.02 MIN: 9.86 / MAX: 77.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface ASPEED - 2 x Intel Xeon Gold 6226R 0.9293 1.8586 2.7879 3.7172 4.6465 SE +/- 0.07, N = 12 4.13 MIN: 3.7 / MAX: 4.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet ASPEED - 2 x Intel Xeon Gold 6226R 4 8 12 16 20 SE +/- 0.33, N = 12 18.15 MIN: 15.73 / MAX: 36.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ASPEED - 2 x Intel Xeon Gold 6226R 10 20 30 40 50 SE +/- 0.42, N = 12 45.73 MIN: 41.81 / MAX: 716.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.11, N = 12 10.92 MIN: 10.17 / MAX: 12.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet ASPEED - 2 x Intel Xeon Gold 6226R 2 4 6 8 10 SE +/- 0.10, N = 12 7.97 MIN: 7.31 / MAX: 10.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ASPEED - 2 x Intel Xeon Gold 6226R 5 10 15 20 25 SE +/- 0.24, N = 12 21.90 MIN: 20.15 / MAX: 31.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ASPEED - 2 x Intel Xeon Gold 6226R 8 16 24 32 40 SE +/- 0.49, N = 12 33.43 MIN: 29.11 / MAX: 256.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ASPEED - 2 x Intel Xeon Gold 6226R 5 10 15 20 25 SE +/- 0.34, N = 12 20.32 MIN: 18.17 / MAX: 30.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ASPEED - 2 x Intel Xeon Gold 6226R 8 16 24 32 40 SE +/- 0.23, N = 12 32.77 MIN: 31.13 / MAX: 37.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 0.72, N = 12 58.46 MIN: 52.56 / MAX: 125.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.41, N = 11 10.39 MIN: 8.52 / MAX: 30.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
PlaidML This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 160 320 480 640 800 SE +/- 1.13, N = 3 751.93
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 400 800 1200 1600 2000 SE +/- 2.99, N = 3 1898.70
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 500 1000 1500 2000 2500 SE +/- 0.40, N = 3 2201.61
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 0.34, N = 3 179.21
MandelGPU MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
OpenCL Device: GPU
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.
NeatBench NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.
Acceleration: GPU
ASPEED - 2 x Intel Xeon Gold 6226R: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.
ASPEED - 2 x Intel Xeon Gold 6226R Processor: 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads), Motherboard: (5.14 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 512GB, Disk: 2 x 8002GB INTEL SSDPE2KX080T8, Graphics: ASPEED 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: 27B2G5, Network: 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb
OS: Ubuntu 24.04, Kernel: 6.8.0-38-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08Python Notes: Python 3.8.13Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 19 July 2024 23:42 by user malogica.