a100-memory

2 x AMD EPYC 7742 64-Core testing with a NVIDIA DGXA100 v555.06901.0004 (0.34 BIOS) and ASPEED 40GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2109285-IB-A100MEMOR84.

a100-memoryProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDisplay ServerDisplay DriverOpenCLCompilerFile-SystemScreen Resolution16 x 64 GB DDR4-3200MT2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)NVIDIA DGXA100 v555.06901.0004 (0.34 BIOS)AMD Starship/Matisse16 x 64 GB DDR4-3200MT/s 36ASF8G72PZ-3G2B24 x 3841GB SAMSUNG MZWLJ3T8HBLS-00007 + 2 x 1920GB SAMSUNG MZ1LB1T9HALS-00007ASPEED 40GB2 x Intel 82599ES 10-Gigabit SFI/SFP+ + 3 x Mellanox MT28908 + Intel I210Ubuntu 20.045.4.0-80-generic (x86_64)X ServerNVIDIAOpenCL 1.2 CUDA 11.2.109GCC 9.3.0 + CUDA 11.2ext4800x600OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

a100-memoryramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integerramspeed: Add - Floating Pointramspeed: Copy - Floating Pointramspeed: Scale - Floating Pointramspeed: Triad - Floating Pointramspeed: Average - Floating Pointtinymembench: Standard Memcpytinymembench: Standard Memsetmbw: Memory Copy - 1024 MiBmbw: Memory Copy, Fixed Block Size - 1024 MiBt-test1: 1t-test1: 2cachebench: Read Cachecachebench: Write Cache16 x 64 GB DDR4-3200MT40023.2134899.9837587.4942097.1539332.7441087.7634671.5435841.7441403.0038080.038280.813366.815032.3368033.64622.4077.9082342.46139724492.006409OpenBenchmarking.org

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 366.34, N = 340023.211. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer16 x 64 GB DDR4-3200MT7K14K21K28K35KSE +/- 222.45, N = 334899.981. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 165.03, N = 337587.491. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 231.52, N = 342097.151. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 66.96, N = 339332.741. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating Point16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 98.40, N = 341087.761. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating Point16 x 64 GB DDR4-3200MT7K14K21K28K35KSE +/- 290.98, N = 334671.541. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating Point16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 495.92, N = 335841.741. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Floating Point16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 399.39, N = 341403.001. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating Point16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 304.57, N = 338080.031. (CC) gcc options: -O3 -march=native

Tinymembench

Standard Memcpy

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memcpy16 x 64 GB DDR4-3200MT2K4K6K8K10KSE +/- 79.53, N = 38280.81. (CC) gcc options: -O2 -lm

Tinymembench

Standard Memset

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memset16 x 64 GB DDR4-3200MT3K6K9K12K15KSE +/- 97.86, N = 313366.81. (CC) gcc options: -O2 -lm

MBW

Test: Memory Copy - Array Size: 1024 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 1024 MiB16 x 64 GB DDR4-3200MT3K6K9K12K15KSE +/- 168.08, N = 315032.341. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB16 x 64 GB DDR4-3200MT2K4K6K8K10KSE +/- 70.20, N = 158033.651. (CC) gcc options: -O3 -march=native

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 116 x 64 GB DDR4-3200MT510152025SE +/- 0.16, N = 1522.411. (CC) gcc options: -pthread

t-test1

Threads: 2

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 216 x 64 GB DDR4-3200MT246810SE +/- 0.019, N = 37.9081. (CC) gcc options: -pthread

CacheBench

Read Cache

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchRead Cache16 x 64 GB DDR4-3200MT5001000150020002500SE +/- 1.70, N = 32342.46MIN: 2320.67 / MAX: 2351.231. (CC) gcc options: -lrt

CacheBench

Write Cache

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchWrite Cache16 x 64 GB DDR4-3200MT5K10K15K20K25KSE +/- 25.22, N = 324492.01MIN: 21290.4 / MAX: 25792.761. (CC) gcc options: -lrt


Phoronix Test Suite v10.8.4