HA1 HyperScalers with L40S

2 x AMD EPYC 9174F 16-Core testing with a ASUS RS700A-E12-RS4U K14PP-D24 (1002 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2402011-NE-HA1HYPERS12.

HA1 HyperScalers with L40SProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionL40S test2 x AMD EPYC 9174F 16-Core @ 4.10GHz (32 Cores / 63 Threads)ASUS RS700A-E12-RS4U K14PP-D24 (1002 BIOS)AMD Device 14a424 x 32 GB 4800MT/s Samsung M321R4GA3BB6-CQKEG500GB CT500P3PSSD8 + 3841GB DAPUSTOR DPH311T4T003T8 + 960GB HUSMR7696BDP3Y1llvmpipeACER KA220HQ2 x Intel X710 for 10GBASE-T + 2 x Intel X710 for 10GbE SFP+Ubuntu 20.045.15.0-91-generic (x86_64)GNOME Shell 3.36.9X Server 1.20.13NVIDIA 535.154.054.5 Mesa 21.2.6 (LLVM 12.0.0 256 bits)OpenCL 3.0 CUDA 12.2.1481.3.242GCC 9.4.0ext41280x1024OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-9QDOt0/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa10113e- GPU Compute Cores: 18176- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

HA1 HyperScalers with L40Saskap: tConvolve MT - Griddingaskap: tConvolve MT - Degriddingaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingaskap: tConvolve OpenMP - Griddingaskap: tConvolve OpenMP - Degriddingaskap: Hogbom Clean OpenMPL40S test11285.913366.322021.427148.512381.121619.1756.469OpenBenchmarking.org

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - GriddingL40S test2K4K6K8K10KSE +/- 252.68, N = 1511285.91. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - DegriddingL40S test3K6K9K12K15KSE +/- 126.76, N = 1513366.31. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingL40S test5K10K15K20K25KSE +/- 155.07, N = 322021.41. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingL40S test6K12K18K24K30KSE +/- 236.07, N = 327148.51. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - GriddingL40S test3K6K9K12K15KSE +/- 116.35, N = 1512381.11. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - DegriddingL40S test5K10K15K20K25KSE +/- 215.04, N = 1521619.11. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPL40S test160320480640800SE +/- 8.95, N = 4756.471. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp


Phoronix Test Suite v10.8.4