a100-memory

2 x AMD EPYC 7742 64-Core testing with a NVIDIA DGXA100 v555.06901.0004 (0.34 BIOS) and ASPEED 40GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2109285-IB-A100MEMOR84
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
16 x 64 GB DDR4-3200MT
September 28 2021
  1 Hour, 16 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


a100-memoryOpenBenchmarking.orgPhoronix Test Suite2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)NVIDIA DGXA100 v555.06901.0004 (0.34 BIOS)AMD Starship/Matisse16 x 64 GB DDR4-3200MT/s 36ASF8G72PZ-3G2B24 x 3841GB SAMSUNG MZWLJ3T8HBLS-00007 + 2 x 1920GB SAMSUNG MZ1LB1T9HALS-00007ASPEED 40GB2 x Intel 82599ES 10-Gigabit SFI/SFP+ + 3 x Mellanox MT28908 + Intel I210Ubuntu 20.045.4.0-80-generic (x86_64)X ServerNVIDIAOpenCL 1.2 CUDA 11.2.109GCC 9.3.0 + CUDA 11.2ext4800x600ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDisplay ServerDisplay DriverOpenCLCompilerFile-SystemScreen ResolutionA100-memory BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

a100-memoryramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integerramspeed: Add - Floating Pointramspeed: Copy - Floating Pointramspeed: Scale - Floating Pointramspeed: Triad - Floating Pointramspeed: Average - Floating Pointtinymembench: Standard Memcpytinymembench: Standard Memsetmbw: Memory Copy - 1024 MiBmbw: Memory Copy, Fixed Block Size - 1024 MiBt-test1: 1t-test1: 2cachebench: Read Cachecachebench: Write Cache16 x 64 GB DDR4-3200MT40023.2134899.9837587.4942097.1539332.7441087.7634671.5435841.7441403.0038080.038280.813366.815032.3368033.64622.4077.9082342.46139724492.006409OpenBenchmarking.org

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 366.34, N = 340023.211. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer16 x 64 GB DDR4-3200MT7K14K21K28K35KSE +/- 222.45, N = 334899.981. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 165.03, N = 337587.491. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 231.52, N = 342097.151. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 66.96, N = 339332.741. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating Point16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 98.40, N = 341087.761. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating Point16 x 64 GB DDR4-3200MT7K14K21K28K35KSE +/- 290.98, N = 334671.541. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating Point16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 495.92, N = 335841.741. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Floating Point16 x 64 GB DDR4-3200MT9K18K27K36K45KSE +/- 399.39, N = 341403.001. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating Point16 x 64 GB DDR4-3200MT8K16K24K32K40KSE +/- 304.57, N = 338080.031. (CC) gcc options: -O3 -march=native

Tinymembench

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memcpy16 x 64 GB DDR4-3200MT2K4K6K8K10KSE +/- 79.53, N = 38280.81. (CC) gcc options: -O2 -lm

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memset16 x 64 GB DDR4-3200MT3K6K9K12K15KSE +/- 97.86, N = 313366.81. (CC) gcc options: -O2 -lm

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 1024 MiB16 x 64 GB DDR4-3200MT3K6K9K12K15KSE +/- 168.08, N = 315032.341. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB16 x 64 GB DDR4-3200MT2K4K6K8K10KSE +/- 70.20, N = 158033.651. (CC) gcc options: -O3 -march=native

t-test1

This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 116 x 64 GB DDR4-3200MT510152025SE +/- 0.16, N = 1522.411. (CC) gcc options: -pthread

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 216 x 64 GB DDR4-3200MT246810SE +/- 0.019, N = 37.9081. (CC) gcc options: -pthread

CacheBench

This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchRead Cache16 x 64 GB DDR4-3200MT5001000150020002500SE +/- 1.70, N = 32342.46MIN: 2320.67 / MAX: 2351.231. (CC) gcc options: -lrt

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchWrite Cache16 x 64 GB DDR4-3200MT5K10K15K20K25KSE +/- 25.22, N = 324492.01MIN: 21290.4 / MAX: 25792.761. (CC) gcc options: -lrt