ha1-hyperscalers-with-l40s-pytorch

2 x AMD EPYC 9174F 16-Core testing with a ASUS RS700A-E12-RS4U K14PP-D24 (1002 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2402041-NE-HA1HYPERS87
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
ha1-hyperscalers-with-l40s-pytorch
February 02
  7 Hours, 38 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ha1-hyperscalers-with-l40s-pytorchOpenBenchmarking.orgPhoronix Test Suite2 x AMD EPYC 9174F 16-Core @ 4.10GHz (32 Cores / 63 Threads)ASUS RS700A-E12-RS4U K14PP-D24 (1002 BIOS)AMD Device 14a424 x 32 GB 4800MT/s Samsung M321R4GA3BB6-CQKEG500GB CT500P3PSSD8 + 3841GB DAPUSTOR DPH311T4T003T8 + 960GB HUSMR7696BDP3Y1llvmpipeACER KA220HQ2 x Intel X710 for 10GBASE-T + 2 x Intel X710 for 10GbE SFP+Ubuntu 20.045.15.0-91-generic (x86_64)GNOME Shell 3.36.9X Server 1.20.13NVIDIA 535.154.054.5 Mesa 21.2.6 (LLVM 12.0.0 256 bits)OpenCL 3.0 CUDA 12.2.1481.3.242GCC 9.4.0ext41280x1024ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionHa1-hyperscalers-with-l40s-pytorch BenchmarksSystem Logs- Transparent Huge Pages: madvise- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa10113e- Python 3.8.10- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

ha1-hyperscalers-with-l40s-pytorchpytorch: CPU - 1 - ResNet-50pytorch: CPU - 1 - ResNet-152pytorch: CPU - 16 - ResNet-50pytorch: CPU - 32 - ResNet-50pytorch: CPU - 64 - ResNet-50pytorch: CPU - 16 - ResNet-152pytorch: CPU - 256 - ResNet-50pytorch: CPU - 32 - ResNet-152pytorch: CPU - 512 - ResNet-50pytorch: CPU - 64 - ResNet-152pytorch: CPU - 256 - ResNet-152pytorch: CPU - 512 - ResNet-152pytorch: CPU - 1 - Efficientnet_v2_lpytorch: CPU - 16 - Efficientnet_v2_lpytorch: CPU - 32 - Efficientnet_v2_lpytorch: CPU - 64 - Efficientnet_v2_lpytorch: CPU - 256 - Efficientnet_v2_lpytorch: CPU - 512 - Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch19.4110.1131.2531.1830.0611.8629.9011.9029.5011.8712.0011.707.441.341.351.361.361.35OpenBenchmarking.org

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Currently this test profile is catered to CPU-based testing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 1 - Model: ResNet-50ha1-hyperscalers-with-l40s-pytorch510152025SE +/- 1.96, N = 1219.41MIN: 0.32 / MAX: 36.03

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 1 - Model: ResNet-152ha1-hyperscalers-with-l40s-pytorch3691215SE +/- 0.35, N = 1210.11MIN: 0.36 / MAX: 14.68

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 16 - Model: ResNet-50ha1-hyperscalers-with-l40s-pytorch714212835SE +/- 0.39, N = 331.25MIN: 1.05 / MAX: 34.01

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 32 - Model: ResNet-50ha1-hyperscalers-with-l40s-pytorch714212835SE +/- 0.34, N = 331.18MIN: 1.66 / MAX: 33.91

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: ResNet-50ha1-hyperscalers-with-l40s-pytorch714212835SE +/- 0.28, N = 330.06MIN: 1.35 / MAX: 32.82

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 16 - Model: ResNet-152ha1-hyperscalers-with-l40s-pytorch3691215SE +/- 0.14, N = 411.86MIN: 0.71 / MAX: 13.16

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: ResNet-50ha1-hyperscalers-with-l40s-pytorch714212835SE +/- 0.15, N = 329.90MIN: 1.12 / MAX: 32.3

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 32 - Model: ResNet-152ha1-hyperscalers-with-l40s-pytorch3691215SE +/- 0.09, N = 1211.90MIN: 0.25 / MAX: 13.59

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 512 - Model: ResNet-50ha1-hyperscalers-with-l40s-pytorch714212835SE +/- 0.30, N = 329.50MIN: 0.4 / MAX: 32.68

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: ResNet-152ha1-hyperscalers-with-l40s-pytorch3691215SE +/- 0.17, N = 311.87MIN: 0.34 / MAX: 13.03

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: ResNet-152ha1-hyperscalers-with-l40s-pytorch3691215SE +/- 0.08, N = 312.00MIN: 0.36 / MAX: 13.08

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 512 - Model: ResNet-152ha1-hyperscalers-with-l40s-pytorch3691215SE +/- 0.11, N = 611.70MIN: 0.6 / MAX: 13.06

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch246810SE +/- 0.05, N = 37.44MIN: 0.51 / MAX: 8.22

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch0.30150.6030.90451.2061.5075SE +/- 0.00, N = 31.34MIN: 0.41 / MAX: 1.95

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch0.30380.60760.91141.21521.519SE +/- 0.01, N = 31.35MIN: 0.71 / MAX: 1.9

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch0.3060.6120.9181.2241.53SE +/- 0.01, N = 31.36MIN: 0.84 / MAX: 1.89

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch0.3060.6120.9181.2241.53SE +/- 0.00, N = 31.36MIN: 0.61 / MAX: 1.94

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_lha1-hyperscalers-with-l40s-pytorch0.30380.60760.91141.21521.519SE +/- 0.00, N = 31.35MIN: 0.6 / MAX: 1.91