Microsoft Azure EPYC 7003 HBv3 Benchmarks

Azure HBv3 benchmarks against other Microsoft Azure HPC instance types. Benchmarks by Michael Larabel for a future article on Phoronix.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2104126-PTS-MSAZURE208

Run Management

Result Identifier | Date Run | Test Duration
Azure HBv3 - EPYC 7V13 | April 09 2021 | 7 Hours, 26 Minutes
Azure HBv2 - EPYC 7V12 | April 10 2021 | 7 Hours, 35 Minutes
Azure HBv1 - EPYC 7551 | April 11 2021 | 9 Hours, 54 Minutes
Azure HC - Xeon 8168 | April 12 2021 | 4 Hours, 27 Minutes
Average | | 7 Hours, 20 Minutes

HTML result view exported from: https://openbenchmarking.org/result/2104126-PTS-MSAZURE208&grs&sor.

Microsoft Azure EPYC 7003 HBv3 Benchmarks: System Details

Azure HBv3 - EPYC 7V13
  Processor: 2 x AMD EPYC 7V13 64-Core (120 Cores)
  Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.1 BIOS)
  Memory: 442GB
  Disk: 2 x 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk
  Graphics: hyperv_fb
  OS: CentOS Linux 8
  Kernel: 4.18.0-147.8.1.el8_1.x86_64 (x86_64)
  Compiler: GCC 8.3.1 20190507
  File-System: nfs
  Screen Resolution: 1152x864
  System Layer: microsoft

Azure HBv2 - EPYC 7V12
  Processor: 2 x AMD EPYC 7V12 64-Core (120 Cores)
  Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.0 BIOS)
  Memory: 450GB
  Disk: 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk

Azure HBv1 - EPYC 7551
  Processor: 2 x AMD EPYC 7551 32-Core (60 Cores)
  Memory: 226GB
  Disk: 32GB Virtual Disk + 752GB Virtual Disk

Azure HC - Xeon 8168
  Processor: 2 x Intel Xeon Platinum 8168 (44 Cores)
  Memory: 348GB

Fields omitted for a system are not listed in the export.

Kernel Details
- Transparent Huge Pages: always

Compiler Details
- --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver

Processor Details
- CPU Microcode: 0xffffffff

Python Details
- Python 3.6.8

Security Details
- Azure HBv3 - EPYC 7V13: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected
- Azure HBv2 - EPYC 7V12: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected
- Azure HBv1 - EPYC 7551: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected
- Azure HC - Xeon 8168: itlb_multihit: vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Mitigation of Clear buffers; SMT Host state unknown

Microsoft Azure EPYC 7003 HBv3 Benchmarks: Result Overview

Test | Azure HBv3 - EPYC 7V13 | Azure HBv2 - EPYC 7V12 | Azure HBv1 - EPYC 7551 | Azure HC - Xeon 8168
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 0.455682 | 0.732130 | 2.73621 | 0.668641
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 9.50445 | 12.1051 | 7.91569 | 2.67914
pennant: sedovbig | 5.833419 | 6.026449 | 15.52899 | 23.87126
compress-zstd: 8 - Compression Speed | 3184.2 | 2653.1 | 920.1 | 1188.2
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 1.58182 | 1.59557 | 3.94077 | 1.21181
pennant: leblancbig | 3.337201 | 3.486503 | 10.33764 | 10.52920
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.445223 | 0.524737 | 1.15465 | 0.398005
gromacs: Water Benchmark | 9.041 | 8.458 | 3.288 | 4.866
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 337.71 | 140.59 | 129.44 | 299.95
namd: ATPase Simulation - 327,506 Atoms | 0.27566 | 0.30056 | 0.70947 | 0.52569
rodinia: OpenMP LavaMD | 37.803 | 39.198 | 90.175 | 94.119
svt-av1: Enc Mode 8 - 1080p | 94.028 | 78.104 | 40.146 | 57.377
npb: LU.C | 56682.82 | 53829.34 | 25304.19 | (no result)
mnn: inception-v3 | 34.710 | 55.356 | 76.799 | 35.673
botan: ChaCha20Poly1305 | 667.765 | 615.795 | 303.243 | 633.217
botan: ChaCha20Poly1305 - Decrypt | 658.370 | 610.861 | 300.178 | 626.919
lulesh | 41636.603 | 34803.539 | 19011.458 | 20846.590
plaidml: No - Inference - VGG19 - CPU | 34.53 | 23.12 | 15.97 | 34.66
svt-hevc: 1 - Bosphorus 1080p | 45.39 | 36.12 | 21.12 | 23.87
onednn: IP Shapes 1D - f32 - CPU | 0.881307 | 1.12484 | 1.87834 | 0.931073
svt-av1: Enc Mode 0 - 1080p | 0.161 | 0.124 | 0.076 | 0.111
plaidml: No - Inference - VGG16 - CPU | 38.39 | 25.57 | 19.88 | 41.54
compress-zstd: 19 - Decompression Speed | 3717.5 | 2631.0 | 2005.4 | 1903.9
compress-zstd: 19, Long Mode - Decompression Speed | 3727.5 | 2674.2 | 2019.3 | 1991.3
financebench: Bonds OpenMP | 104884.294271 | 133719.328125 | 195702.796875 | 130250.449219
quantlib | 2299.3 | 1725.2 | 1251.7 | 1812.9
financebench: Repo OpenMP | 59196.546875 | 75154.257813 | 108606.502604 | 71609.065104
compress-zstd: 8, Long Mode - Decompression Speed | 4256.8 | 3037.5 | 2339.6 | 2350.2
rodinia: OpenMP HotSpot3D | 82.489 | 107.976 | 145.569 | 115.832
incompact3d: X3D-benchmarking input.i3d | 287.599508 | 318.905009 | 455.812378 | 507.295970
build-nodejs: Time To Compile | 111.327 | 118.969 | 194.550 | 189.096
build-llvm: Time To Compile | 163.760 | 174.743 | 283.291 | 251.744
botan: AES-256 | 5412.131 | 3934.401 | 3144.313 | 3155.391
botan: AES-256 - Decrypt | 5407.477 | 3938.444 | 3151.228 | 3159.852
compress-zstd: 8, Long Mode - Compression Speed | 771.9 | 588.0 | 468.8 | 649.3
gmpbench: Total Time | 4893.4 | 4155.5 | 3021.3 | 4196.5
botan: Twofish | 346.319 | 280.902 | 217.312 | 293.551
botan: Twofish - Decrypt | 345.776 | 283.257 | 218.034 | 294.381
botan: Blowfish - Decrypt | 425.158 | 349.147 | 272.733 | 365.547
botan: Blowfish | 424.738 | 348.425 | 273.039 | 367.942
tnn: CPU - SqueezeNet v1.1 | 272.614 | 323.615 | 420.047 | 398.404
mafft: Multiple Sequence Alignment - LSU RNA | 14.326 | 16.064 | 22.067 | 15.627
botan: CAST-256 | 133.691 | 113.600 | 88.545 | 116.630
botan: KASUMI | 87.480 | 73.175 | 57.950 | 76.149
botan: CAST-256 - Decrypt | 133.665 | 113.724 | 88.555 | 116.573
botan: KASUMI - Decrypt | 84.059 | 71.101 | 55.980 | 74.287
hpcg | 39.0620 | 37.2686 | 30.4314 | 26.0381
build-linux-kernel: Time To Compile | 42.008 | 44.922 | 58.738 | 55.324
plaidml: No - Inference - ResNet 50 - CPU | 6.27 | 5.60 | 4.80 | 4.91
mnn: resnet-v2-50 | 29.086 | 52.533 | 62.446 | 29.006
mnn: SqueezeNetV1.0 | 8.224 | 14.136 | 17.194 | 8.226
tensorflow-lite: SqueezeNet | 66320.4 | 74885.4 | 95206.2 | 73517.3
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 540.194 | 796.816 | 3423.15 | 458.201
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 843.103 | 1294.53 | 5355.24 | 753.356
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.255501 | 0.350347 | 1.51271 | 0.352180
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 561.773 | 791.460 | 3373.88 | 459.576
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 962.336 | 1287.03 | 5539.48 | 746.464
onednn: Recurrent Neural Network Inference - f32 - CPU | 530.898 | 778.305 | 3344.14 | 462.855
onednn: Recurrent Neural Network Training - f32 - CPU | 875.166 | 1314.00 | 5524.50 | 744.355
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 0.406686 | 1.031942 | 2.40561 | 0.551467
onednn: IP Shapes 3D - u8s8f32 - CPU | 0.383855 | 0.441012 | 1.76355 | 0.474311
onednn: IP Shapes 3D - f32 - CPU | 3.21204 | 9.68321 | 10.22263 | 3.28160
svt-hevc: 10 - Bosphorus 1080p | 548.33 | 379.91 | 141.93 | 480.79
svt-hevc: 7 - Bosphorus 1080p | 378.01 | 166.90 | 119.20 | 256.28
svt-av1: Enc Mode 4 - 1080p | 12.275 | 9.512 | 4.669 | 7.621
compress-zstd: 19 - Compression Speed | 78.2 | 69.5 | 48.5 | 76.3
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 16.66 | 23.78 | 25.00 | 20.19
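Because the overview mixes "Fewer Is Better" and "More Is Better" metrics, the raw numbers cannot be averaged directly. The following is a minimal sketch, not the Phoronix Test Suite's own implementation, of one way to turn a handful of the HBv3 and HBv2 results above into speedup ratios and combine them with a geometric mean. The selection of four tests, the `speedup` helper, and the variable names are assumptions made for illustration only.

```python
from statistics import geometric_mean

# (test name, direction, HBv3 value, HBv2 value), copied from the result
# overview; the choice of these four tests is arbitrary (an assumption).
results = [
    ("pennant: sedovbig", "fewer", 5.833419, 6.026449),
    ("compress-zstd: 8 - Compression Speed", "more", 3184.2, 2653.1),
    ("gromacs: Water Benchmark", "more", 9.041, 8.458),
    ("namd: ATPase Simulation - 327,506 Atoms", "fewer", 0.27566, 0.30056),
]


def speedup(direction: str, candidate: float, baseline: float) -> float:
    """Return how many times faster `candidate` is than `baseline`.

    Hypothetical helper: for "more is better" metrics the ratio is
    candidate / baseline, for "fewer is better" metrics it is inverted.
    """
    if direction == "more":
        return candidate / baseline
    if direction == "fewer":
        return baseline / candidate
    raise ValueError(f"unknown direction: {direction!r}")


# Per-test speedup of HBv3 over HBv2; values above 1.0 mean HBv3 is faster.
ratios = {name: speedup(d, hbv3, hbv2) for name, d, hbv3, hbv2 in results}

# The geometric mean is the usual way to average ratios across tests.
overall = geometric_mean(ratios.values())

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.3f}x")
print(f"geometric mean over {len(ratios)} tests: {overall:.3f}x")
```

A geometric mean is used here rather than an arithmetic mean so that no single test with a large ratio dominates the summary; the figure it prints covers only the four tests chosen above, not the full result set.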

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 0.455682 (SE +/- 0.002442, N = 3)
Azure HC - Xeon 8168: 0.668641 (SE +/- 0.006150, N = 15; MIN: 0.63)
Azure HBv2 - EPYC 7V12: 0.732130 (SE +/- 0.002363, N = 3; MIN: 0.65)
Azure HBv1 - EPYC 7551: 2.736210 (SE +/- 0.005596, N = 3; MIN: 2.7)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Azure HC - Xeon 8168: 2.67914 (SE +/- 0.01628, N = 3; MIN: 2)
Azure HBv1 - EPYC 7551: 7.91569 (SE +/- 0.06125, N = 3; MIN: 5.11)
Azure HBv3 - EPYC 7V13: 9.50445 (SE +/- 0.09389, N = 3; MIN: 4.19)
Azure HBv2 - EPYC 7V12: 12.10510 (SE +/- 0.11811, N = 6; MIN: 5.8)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 5.833419 (SE +/- 0.003905, N = 3)
Azure HBv2 - EPYC 7V12: 6.026449 (SE +/- 0.012548, N = 3)
Azure HBv1 - EPYC 7551: 15.528990 (SE +/- 0.085157, N = 3)
Azure HC - Xeon 8168: 23.871260 (SE +/- 0.048781, N = 3)
1. (CXX) g++ options: -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 3184.2 (SE +/- 45.10, N = 3)
Azure HBv2 - EPYC 7V12: 2653.1 (SE +/- 33.03, N = 15)
Azure HC - Xeon 8168: 1188.2 (SE +/- 6.93, N = 3)
Azure HBv1 - EPYC 7551: 920.1 (SE +/- 6.06, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Azure HC - Xeon 8168: 1.21181 (SE +/- 0.00120, N = 3; MIN: 1.18)
Azure HBv3 - EPYC 7V13: 1.58182 (SE +/- 0.00509, N = 3; MIN: 1.49)
Azure HBv2 - EPYC 7V12: 1.59557 (SE +/- 0.01470, N = 3; MIN: 1.5)
Azure HBv1 - EPYC 7551: 3.94077 (SE +/- 0.00862, N = 3; MIN: 3.87)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 3.337201 (SE +/- 0.017738, N = 3)
Azure HBv2 - EPYC 7V12: 3.486503 (SE +/- 0.009939, N = 3)
Azure HBv1 - EPYC 7551: 10.337640 (SE +/- 0.027196, N = 3)
Azure HC - Xeon 8168: 10.529200 (SE +/- 0.037598, N = 3)
1. (CXX) g++ options: -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Azure HC - Xeon 8168: 0.398005 (SE +/- 0.004555, N = 15; MIN: 0.31)
Azure HBv3 - EPYC 7V13: 0.445223 (SE +/- 0.004912, N = 4; MIN: 0.39)
Azure HBv2 - EPYC 7V12: 0.524737 (SE +/- 0.004967, N = 3; MIN: 0.48)
Azure HBv1 - EPYC 7551: 1.154650 (SE +/- 0.003421, N = 3; MIN: 1.1)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

GROMACS

Water Benchmark

GROMACS 2020.3 (Ns Per Day, More Is Better)
Azure HBv3 - EPYC 7V13: 9.041 (SE +/- 0.009, N = 3)
Azure HBv2 - EPYC 7V12: 8.458 (SE +/- 0.009, N = 3)
Azure HC - Xeon 8168: 4.866 (SE +/- 0.003, N = 3)
Azure HBv1 - EPYC 7551: 3.288 (SE +/- 0.025, N = 15)
1. (CXX) g++ options: -O2 -pthread -lrt -lpthread -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 337.71 (SE +/- 4.36, N = 3)
Azure HC - Xeon 8168: 299.95 (SE +/- 4.18, N = 3)
Azure HBv2 - EPYC 7V12: 140.59 (SE +/- 0.20, N = 3)
Azure HBv1 - EPYC 7551: 129.44 (SE +/- 1.10, N = 8)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.14 (days/ns, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 0.27566 (SE +/- 0.00027, N = 3)
Azure HBv2 - EPYC 7V12: 0.30056 (SE +/- 0.00059, N = 3)
Azure HC - Xeon 8168: 0.52569 (SE +/- 0.00126, N = 3)
Azure HBv1 - EPYC 7551: 0.70947 (SE +/- 0.00507, N = 3)

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 37.80 (SE +/- 0.23, N = 3)
Azure HBv2 - EPYC 7V12: 39.20 (SE +/- 0.20, N = 3)
Azure HBv1 - EPYC 7551: 90.18 (SE +/- 0.39, N = 3)
Azure HC - Xeon 8168: 94.12 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 94.03 (SE +/- 0.34, N = 3)
Azure HBv2 - EPYC 7V12: 78.10 (SE +/- 0.65, N = 3)
Azure HC - Xeon 8168: 57.38 (SE +/- 0.24, N = 3)
Azure HBv1 - EPYC 7551: 40.15 (SE +/- 0.54, N = 12)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Azure HBv3 - EPYC 7V13: 56682.82 (SE +/- 428.57, N = 14)
Azure HBv2 - EPYC 7V12: 53829.34 (SE +/- 21.24, N = 3)
Azure HBv1 - EPYC 7551: 25304.19 (SE +/- 114.01, N = 3)
1. (F9X) gfortran options: -O3 -march=native -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 34.71 (SE +/- 0.24, N = 15; MIN: 31.09 / MAX: 427.77)
Azure HC - Xeon 8168: 35.67 (SE +/- 0.32, N = 3; MIN: 34.52 / MAX: 128.55)
Azure HBv2 - EPYC 7V12: 55.36 (SE +/- 0.95, N = 12; MIN: 47.56 / MAX: 509.74)
Azure HBv1 - EPYC 7551: 76.80 (SE +/- 0.96, N = 12; MIN: 69.85 / MAX: 676.14)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Botan

Test: ChaCha20Poly1305

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 667.77 (SE +/- 0.27, N = 3)
Azure HC - Xeon 8168: 633.22 (SE +/- 0.30, N = 3)
Azure HBv2 - EPYC 7V12: 615.80 (SE +/- 0.90, N = 3)
Azure HBv1 - EPYC 7551: 303.24 (SE +/- 1.39, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 658.37 (SE +/- 0.06, N = 3)
Azure HC - Xeon 8168: 626.92 (SE +/- 0.55, N = 3)
Azure HBv2 - EPYC 7V12: 610.86 (SE +/- 0.78, N = 3)
Azure HBv1 - EPYC 7551: 300.18 (SE +/- 1.37, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

LULESH

LULESH 2.0.3 (z/s, More Is Better)
Azure HBv3 - EPYC 7V13: 41636.60 (SE +/- 476.10, N = 3)
Azure HBv2 - EPYC 7V12: 34803.54 (SE +/- 44.77, N = 3)
Azure HC - Xeon 8168: 20846.59 (SE +/- 25.50, N = 3)
Azure HBv1 - EPYC 7551: 19011.46 (SE +/- 81.47, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -fexceptions -pthread -lmpi_cxx -lmpi

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

PlaidML (FPS, More Is Better)
Azure HC - Xeon 8168: 34.66 (SE +/- 0.19, N = 3)
Azure HBv3 - EPYC 7V13: 34.53 (SE +/- 0.42, N = 4)
Azure HBv2 - EPYC 7V12: 23.12 (SE +/- 0.31, N = 15)
Azure HBv1 - EPYC 7551: 15.97 (SE +/- 0.21, N = 12)

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 45.39 (SE +/- 0.60, N = 3)
Azure HBv2 - EPYC 7V12: 36.12 (SE +/- 0.13, N = 3)
Azure HC - Xeon 8168: 23.87 (SE +/- 0.02, N = 3)
Azure HBv1 - EPYC 7551: 21.12 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 0.881307 (SE +/- 0.012358, N = 3; MIN: 0.76)
Azure HC - Xeon 8168: 0.931073 (SE +/- 0.008475, N = 3; MIN: 0.87)
Azure HBv2 - EPYC 7V12: 1.124840 (SE +/- 0.011740, N = 15; MIN: 1.01)
Azure HBv1 - EPYC 7551: 1.878340 (SE +/- 0.021017, N = 3; MIN: 1.75)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 0.161 (SE +/- 0.000, N = 3)
Azure HBv2 - EPYC 7V12: 0.124 (SE +/- 0.001, N = 3)
Azure HC - Xeon 8168: 0.111 (SE +/- 0.000, N = 3)
Azure HBv1 - EPYC 7551: 0.076 (SE +/- 0.000, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

PlaidML (FPS, More Is Better)
Azure HC - Xeon 8168: 41.54 (SE +/- 0.22, N = 3)
Azure HBv3 - EPYC 7V13: 38.39 (SE +/- 0.39, N = 3)
Azure HBv2 - EPYC 7V12: 25.57 (SE +/- 0.21, N = 3)
Azure HBv1 - EPYC 7551: 19.88 (SE +/- 0.21, N = 15)

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 3717.5 (SE +/- 6.14, N = 15)
Azure HBv2 - EPYC 7V12: 2631.0 (SE +/- 1.17, N = 15)
Azure HBv1 - EPYC 7551: 2005.4 (SE +/- 0.66, N = 15)
Azure HC - Xeon 8168: 1903.9 (SE +/- 2.22, N = 15)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 3727.5 (SE +/- 12.70, N = 15)
Azure HBv2 - EPYC 7V12: 2674.2 (SE +/- 2.70, N = 15)
Azure HBv1 - EPYC 7551: 2019.3 (SE +/- 0.32, N = 12)
Azure HC - Xeon 8168: 1991.3 (SE +/- 2.46, N = 15)
1. (CC) gcc options: -O3 -pthread -lz -llzma

FinanceBench

Benchmark: Bonds OpenMP

FinanceBench 2016-07-25 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 104884.29 (SE +/- 454.05, N = 3)
Azure HC - Xeon 8168: 130250.45 (SE +/- 1475.96, N = 4)
Azure HBv2 - EPYC 7V12: 133719.33 (SE +/- 476.22, N = 3)
Azure HBv1 - EPYC 7551: 195702.80 (SE +/- 1191.05, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

QuantLib

QuantLib 1.21 (MFLOPS, More Is Better)
Azure HBv3 - EPYC 7V13: 2299.3 (SE +/- 4.80, N = 3)
Azure HC - Xeon 8168: 1812.9 (SE +/- 7.80, N = 3)
Azure HBv2 - EPYC 7V12: 1725.2 (SE +/- 0.77, N = 3)
Azure HBv1 - EPYC 7551: 1251.7 (SE +/- 1.79, N = 3)
1. (CXX) g++ options: -O3 -march=native -O2 -rdynamic -lboost_timer -lboost_system -lboost_chrono

FinanceBench

Benchmark: Repo OpenMP

FinanceBench 2016-07-25 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 59196.55 (SE +/- 228.72, N = 3)
Azure HC - Xeon 8168: 71609.07 (SE +/- 15.23, N = 3)
Azure HBv2 - EPYC 7V12: 75154.26 (SE +/- 166.77, N = 3)
Azure HBv1 - EPYC 7551: 108606.50 (SE +/- 324.87, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 4256.8 (SE +/- 4.89, N = 7)
Azure HBv2 - EPYC 7V12: 3037.5 (SE +/- 4.57, N = 3)
Azure HC - Xeon 8168: 2350.2 (SE +/- 2.50, N = 3)
Azure HBv1 - EPYC 7551: 2339.6 (SE +/- 1.19, N = 15)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Rodinia

Test: OpenMP HotSpot3D

Rodinia 3.1 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 82.49 (SE +/- 1.18, N = 15)
Azure HBv2 - EPYC 7V12: 107.98 (SE +/- 0.89, N = 3)
Azure HC - Xeon 8168: 115.83 (SE +/- 0.21, N = 3)
Azure HBv1 - EPYC 7551: 145.57 (SE +/- 1.35, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 287.60 (SE +/- 0.17, N = 3)
Azure HBv2 - EPYC 7V12: 318.91 (SE +/- 0.07, N = 3)
Azure HBv1 - EPYC 7551: 455.81 (SE +/- 2.96, N = 3)
Azure HC - Xeon 8168: 507.30 (SE +/- 7.11, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 111.33 (SE +/- 1.09, N = 3)
Azure HBv2 - EPYC 7V12: 118.97 (SE +/- 0.83, N = 3)
Azure HC - Xeon 8168: 189.10 (SE +/- 0.40, N = 3)
Azure HBv1 - EPYC 7551: 194.55 (SE +/- 0.16, N = 3)

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 10.0 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 163.76 (SE +/- 1.89, N = 3)
Azure HBv2 - EPYC 7V12: 174.74 (SE +/- 1.99, N = 3)
Azure HC - Xeon 8168: 251.74 (SE +/- 2.31, N = 3)
Azure HBv1 - EPYC 7551: 283.29 (SE +/- 3.41, N = 4)

Botan

Test: AES-256

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 5412.13 (SE +/- 3.73, N = 3)
Azure HBv2 - EPYC 7V12: 3934.40 (SE +/- 2.68, N = 3)
Azure HC - Xeon 8168: 3155.39 (SE +/- 0.18, N = 3)
Azure HBv1 - EPYC 7551: 3144.31 (SE +/- 16.79, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 5407.48 (SE +/- 7.85, N = 3)
Azure HBv2 - EPYC 7V12: 3938.44 (SE +/- 3.26, N = 3)
Azure HC - Xeon 8168: 3159.85 (SE +/- 0.03, N = 3)
Azure HBv1 - EPYC 7551: 3151.23 (SE +/- 1.16, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 771.9 (SE +/- 6.99, N = 7)
Azure HC - Xeon 8168: 649.3 (SE +/- 6.29, N = 3)
Azure HBv2 - EPYC 7V12: 588.0 (SE +/- 0.81, N = 3)
Azure HBv1 - EPYC 7551: 468.8 (SE +/- 3.02, N = 15)
1. (CC) gcc options: -O3 -pthread -lz -llzma

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 (GMPbench Score, More Is Better)
Azure HBv3 - EPYC 7V13: 4893.4
Azure HC - Xeon 8168: 4196.5
Azure HBv2 - EPYC 7V12: 4155.5
Azure HBv1 - EPYC 7551: 3021.3
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Botan

Test: Twofish

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 346.32 (SE +/- 1.10, N = 3)
Azure HC - Xeon 8168: 293.55 (SE +/- 0.20, N = 3)
Azure HBv2 - EPYC 7V12: 280.90 (SE +/- 0.15, N = 3)
Azure HBv1 - EPYC 7551: 217.31 (SE +/- 2.54, N = 4)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 345.78 (SE +/- 0.77, N = 3)
Azure HC - Xeon 8168: 294.38 (SE +/- 0.13, N = 3)
Azure HBv2 - EPYC 7V12: 283.26 (SE +/- 0.22, N = 3)
Azure HBv1 - EPYC 7551: 218.03 (SE +/- 2.43, N = 4)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 425.16 (SE +/- 0.42, N = 3)
Azure HC - Xeon 8168: 365.55 (SE +/- 0.12, N = 3)
Azure HBv2 - EPYC 7V12: 349.15 (SE +/- 0.56, N = 3)
Azure HBv1 - EPYC 7551: 272.73 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 424.74 (SE +/- 0.47, N = 3)
Azure HC - Xeon 8168: 367.94 (SE +/- 0.10, N = 3)
Azure HBv2 - EPYC 7V12: 348.43 (SE +/- 0.42, N = 3)
Azure HBv1 - EPYC 7551: 273.04 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 272.61 (SE +/- 0.15, N = 3; MIN: 272.1 / MAX: 273.5)
Azure HBv2 - EPYC 7V12: 323.62 (SE +/- 0.19, N = 3; MIN: 322.04 / MAX: 326.05)
Azure HC - Xeon 8168: 398.40 (SE +/- 0.06, N = 3; MIN: 398 / MAX: 399.03)
Azure HBv1 - EPYC 7551: 420.05 (SE +/- 0.16, N = 3; MIN: 418.9 / MAX: 430.09)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

Timed MAFFT Alignment 7.471 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 14.33 (SE +/- 0.10, N = 15)
Azure HC - Xeon 8168: 15.63 (SE +/- 0.13, N = 3)
Azure HBv2 - EPYC 7V12: 16.06 (SE +/- 0.06, N = 3)
Azure HBv1 - EPYC 7551: 22.07 (SE +/- 0.24, N = 3)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Botan

Test: CAST-256

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 133.69 (SE +/- 0.03, N = 3)
Azure HC - Xeon 8168: 116.63 (SE +/- 0.01, N = 3)
Azure HBv2 - EPYC 7V12: 113.60 (SE +/- 0.07, N = 3)
Azure HBv1 - EPYC 7551: 88.55 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 87.48 (SE +/- 0.04, N = 3)
Azure HC - Xeon 8168: 76.15 (SE +/- 0.03, N = 3)
Azure HBv2 - EPYC 7V12: 73.18 (SE +/- 0.07, N = 3)
Azure HBv1 - EPYC 7551: 57.95 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 133.67 (SE +/- 0.02, N = 3)
Azure HC - Xeon 8168: 116.57 (SE +/- 0.01, N = 3)
Azure HBv2 - EPYC 7V12: 113.72 (SE +/- 0.01, N = 3)
Azure HBv1 - EPYC 7551: 88.56 (SE +/- 0.03, N = 2)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 84.06 (SE +/- 0.02, N = 3)
Azure HC - Xeon 8168: 74.29 (SE +/- 0.03, N = 3)
Azure HBv2 - EPYC 7V12: 71.10 (SE +/- 0.04, N = 3)
Azure HBv1 - EPYC 7551: 55.98 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
Azure HBv3 - EPYC 7V13: 39.06 (SE +/- 0.06, N = 3)
Azure HBv2 - EPYC 7V12: 37.27 (SE +/- 0.06, N = 3)
Azure HBv1 - EPYC 7551: 30.43 (SE +/- 0.17, N = 3)
Azure HC - Xeon 8168: 26.04 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -fexceptions -pthread -lmpi_cxx -lmpi

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.10.20 (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 42.01 (SE +/- 0.58, N = 15)
Azure HBv2 - EPYC 7V12: 44.92 (SE +/- 0.58, N = 13)
Azure HC - Xeon 8168: 55.32 (SE +/- 0.60, N = 15)
Azure HBv1 - EPYC 7551: 58.74 (SE +/- 0.45, N = 13)

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

PlaidML (FPS, More Is Better)
Azure HBv3 - EPYC 7V13: 6.27 (SE +/- 0.16, N = 9)
Azure HBv2 - EPYC 7V12: 5.60 (SE +/- 0.03, N = 3)
Azure HC - Xeon 8168: 4.91 (SE +/- 0.01, N = 3)
Azure HBv1 - EPYC 7551: 4.80 (SE +/- 0.02, N = 3)

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Azure HC - Xeon 8168: 29.01 (SE +/- 0.12, N = 3; MIN: 27.94 / MAX: 76.84)
Azure HBv3 - EPYC 7V13: 29.09 (SE +/- 0.25, N = 15; MIN: 25.79 / MAX: 252.29)
Azure HBv2 - EPYC 7V12: 52.53 (SE +/- 1.18, N = 12; MIN: 41.27 / MAX: 326.88)
Azure HBv1 - EPYC 7551: 62.45 (SE +/- 1.07, N = 12; MIN: 54.15 / MAX: 188.44)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 8.224 (SE +/- 0.113, N = 15; MIN: 5.66 / MAX: 54.17)
Azure HC - Xeon 8168: 8.226 (SE +/- 0.058, N = 3; MIN: 7.29 / MAX: 23.89)
Azure HBv2 - EPYC 7V12: 14.136 (SE +/- 0.657, N = 12; MIN: 10.98 / MAX: 78.48)
Azure HBv1 - EPYC 7551: 17.194 (SE +/- 0.162, N = 12; MIN: 15.74 / MAX: 54.06)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

TensorFlow Lite

Model: SqueezeNet

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 66320.4 (SE +/- 2854.14, N = 15)
Azure HC - Xeon 8168: 73517.3 (SE +/- 698.77, N = 3)
Azure HBv2 - EPYC 7V12: 74885.4 (SE +/- 1151.94, N = 15)
Azure HBv1 - EPYC 7551: 95206.2 (SE +/- 1151.23, N = 15)

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Azure HC - Xeon 8168: 458.20 (SE +/- 2.64, N = 3; MIN: 447.82)
Azure HBv3 - EPYC 7V13: 540.19 (SE +/- 8.06, N = 15; MIN: 462.7)
Azure HBv2 - EPYC 7V12: 796.82 (SE +/- 6.35, N = 15; MIN: 726.18)
Azure HBv1 - EPYC 7551: 3423.15 (SE +/- 87.86, N = 15; MIN: 2651.39)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Azure HC - Xeon 8168: 753.36 (SE +/- 4.09, N = 3; MIN: 736.05)
Azure HBv3 - EPYC 7V13: 843.10 (SE +/- 14.29, N = 15; MIN: 711.66)
Azure HBv2 - EPYC 7V12: 1294.53 (SE +/- 11.10, N = 15; MIN: 1179.19)
Azure HBv1 - EPYC 7551: 5355.24 (SE +/- 122.64, N = 15; MIN: 4475.57)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 0.255501 (SE +/- 0.000278, N = 3)
Azure HBv2 - EPYC 7V12: 0.350347 (SE +/- 0.006544, N = 12)
Azure HC - Xeon 8168: 0.352180 (SE +/- 0.011574, N = 15)
Azure HBv1 - EPYC 7551: 1.512710 (SE +/- 0.088179, N = 15)
MIN (three values listed, not attributable per system): 0.3, 0.27, 0.57
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Azure HC - Xeon 8168: 459.58 (SE +/- 1.52, N = 3; MIN: 450.73)
Azure HBv3 - EPYC 7V13: 561.77 (SE +/- 8.80, N = 15; MIN: 471.83)
Azure HBv2 - EPYC 7V12: 791.46 (SE +/- 5.73, N = 15; MIN: 718.74)
Azure HBv1 - EPYC 7551: 3373.88 (SE +/- 110.58, N = 12; MIN: 2418.62)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Azure HC - Xeon 8168: 746.46 (SE +/- 3.40, N = 3)
Azure HBv3 - EPYC 7V13: 962.34 (SE +/- 102.39, N = 14)
Azure HBv2 - EPYC 7V12: 1287.03 (SE +/- 13.52, N = 15)
Azure HBv1 - EPYC 7551: 5539.48 (SE +/- 103.35, N = 12)
MIN (three values listed, not attributable per system): 722.13, 1149.25, 4475.65
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Azure HC - Xeon 8168: 462.86 (SE +/- 2.05, N = 3; MIN: 450.78)
Azure HBv3 - EPYC 7V13: 530.90 (SE +/- 8.62, N = 15; MIN: 458.96)
Azure HBv2 - EPYC 7V12: 778.31 (SE +/- 8.13, N = 3; MIN: 738.78)
Azure HBv1 - EPYC 7551: 3344.14 (SE +/- 113.65, N = 12; MIN: 2662.07)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Azure HC - Xeon 8168: 744.36 (SE +/- 4.60, N = 3)
Azure HBv3 - EPYC 7V13: 875.17 (SE +/- 25.25, N = 12)
Azure HBv2 - EPYC 7V12: 1314.00 (SE +/- 16.54, N = 15)
Azure HBv1 - EPYC 7551: 5524.50 (SE +/- 83.42, N = 13)
MIN (three values listed, not attributable per system): 727.38, 1140.59, 4658.38
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 0.406686 (SE +/- 0.004961, N = 3)
Azure HC - Xeon 8168: 0.551467 (SE +/- 0.003751, N = 3)
Azure HBv2 - EPYC 7V12: 1.031942 (SE +/- 0.030825, N = 15)
Azure HBv1 - EPYC 7551: 2.405610 (SE +/- 0.045752, N = 12)
MIN (three values listed, not attributable per system): 0.51, 0.83, 1.96
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 0.383855 (SE +/- 0.009558, N = 12; MIN: 0.32)
Azure HBv2 - EPYC 7V12: 0.441012 (SE +/- 0.003995, N = 7; MIN: 0.4)
Azure HC - Xeon 8168: 0.474311 (SE +/- 0.017521, N = 12; MIN: 0.34)
Azure HBv1 - EPYC 7551: 1.763550 (SE +/- 0.040927, N = 15; MIN: 0.89)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.1.2, Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 3.21204 (SE +/- 0.02162, N = 3; MIN: 2.58)
Azure HC - Xeon 8168: 3.28160 (SE +/- 0.00454, N = 3; MIN: 3.2)
Azure HBv2 - EPYC 7V12: 9.68321 (SE +/- 0.28898, N = 15; MIN: 5.56)
Azure HBv1 - EPYC 7551: 10.22263 (SE +/- 0.43651, N = 15; MIN: 4.1)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.org: SVT-HEVC 1.5.0, Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 548.33 (SE +/- 7.76, N = 3)
Azure HC - Xeon 8168: 480.79 (SE +/- 2.25, N = 3)
Azure HBv2 - EPYC 7V12: 379.91 (SE +/- 21.86, N = 15)
Azure HBv1 - EPYC 7551: 141.93 (SE +/- 2.68, N = 15)
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.org: SVT-HEVC 1.5.0, Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 378.01 (SE +/- 4.22, N = 3)
Azure HC - Xeon 8168: 256.28 (SE +/- 1.61, N = 3)
Azure HBv2 - EPYC 7V12: 166.90 (SE +/- 6.53, N = 12)
Azure HBv1 - EPYC 7551: 119.20 (SE +/- 1.15, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.org: SVT-AV1 0.8, Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, More Is Better)
Azure HBv3 - EPYC 7V13: 12.275 (SE +/- 0.031, N = 3)
Azure HBv2 - EPYC 7V12: 9.512 (SE +/- 0.218, N = 15)
Azure HC - Xeon 8168: 7.621 (SE +/- 0.023, N = 3)
Azure HBv1 - EPYC 7551: 4.669 (SE +/- 0.034, N = 11)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.org: Zstd Compression 1.4.9, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
Azure HBv3 - EPYC 7V13: 78.2 (SE +/- 0.71, N = 15)
Azure HC - Xeon 8168: 76.3 (SE +/- 1.01, N = 15)
Azure HBv2 - EPYC 7V12: 69.5 (SE +/- 1.18, N = 15)
Azure HBv1 - EPYC 7551: 48.5 (SE +/- 0.83, N = 15)
1. (CC) gcc options: -O3 -pthread -lz -llzma

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.org: CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
Azure HBv3 - EPYC 7V13: 16.66 (SE +/- 0.83, N = 15)
Azure HC - Xeon 8168: 20.19 (SE +/- 0.26, N = 3)
Azure HBv2 - EPYC 7V12: 23.78 (SE +/- 0.45, N = 12)
Azure HBv1 - EPYC 7551: 25.00 (SE +/- 0.11, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp


Phoronix Test Suite v10.8.4