Microsoft Azure EPYC Milan-X HBv3 Benchmarks

Microsoft Azure HBv3 (Milan) versus HBv3 (Milan-X) benchmarking by Michael Larabel for a future article on Phoronix.com. Looking at performance of AMD EPYC Milan-X in Microsoft Azure cloud for a variety of workloads.

HTML result view exported from: https://openbenchmarking.org/result/2203201-PTS-AZUREHBV49&gru.

ProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionSystem LayerHBv3HBv3 Milan-XHBv3HBv3 Milan-X 64 Cores 64 Cores 120 Cores 120 Cores2 x AMD EPYC 7V13 64-Core (64 Cores)Microsoft Virtual Machine (Hyper-V UEFI v4.1 BIOS)442GB2 x 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Diskhyperv_fbMellanox MT27710CentOS Linux 84.18.0-147.8.1.el8_1.x86_64 (x86_64)GCC 8.3.1 20190507ext41152x864microsoft2 x AMD EPYC 7V73X 64-Core (64 Cores)2 x AMD EPYC 7V13 64-Core (120 Cores)2 x AMD EPYC 7V73X 64-Core (120 Cores)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- CPU Microcode: 0xffffffffPython Details- Python 3.6.8Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected

ospray: San Miguel - SciVisospray: XFrog Forest - SciVisospray: San Miguel - Path Tracerospray: NASA Streamlines - SciVisospray: XFrog Forest - Path Tracerospray: Magnetic Reconnection - SciVisospray: NASA Streamlines - Path Tracerembree: Pathtracer - Crownembree: Pathtracer ISPC - Crownembree: Pathtracer - Asian Dragonembree: Pathtracer ISPC - Asian Dragonhpcc: Rand Ring Bandwidthhpcg: hpcc: G-HPLonnx: super-resolution-10 - CPUopenvkl: vklBenchmark ISPCopenvkl: vklBenchmark Scalargraphics-magick: Noise-Gaussianhpcc: Max Ping Pong Bandwidthcompress-zstd: 19 - Compression Speedcompress-zstd: 19, Long Mode - Compression Speedaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddinggromacs: MPI CPU - water_GMX50_barelammps: 20k Atomslammps: Rhodopsin Proteinrocksdb: Rand Readrocksdb: Read Rand Write Randjohn-the-ripper: MD5kripke: npb: CG.Cbrl-cad: VGR Performance Metriclulesh: namd: ATPase Simulation - 327,506 Atomswrf: conus 2.5kmopenfoam: Motorbike 60Mnwchem: C240 Buckyballincompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 193 Cells Per Directionrelion: Basic - CPUparboil: OpenMP CUTCPbuild-linux-kernel: Time To Compilebuild-nodejs: Time To CompileHBv3HBv3 Milan-XHBv3HBv3 Milan-X 64 Cores 64 Cores 120 Cores 120 Cores52.6310.754.3271.435.7038.4615.8740.780038.905141.859642.00151.8299240.023399.5661061071207258517174.75385.139.835988.038175.07.47631.60532.958324822447135734956974677363552120940.7061849244262.2270.4115710150.06789.652256.6348.11460413.6781092418.4791.51554824.15996.34855.5611.364.9383.336.204017.2446.092944.252345.683345.65666.2024241.1303175.0270063541267487418347.866106.259.840896.341160.17.97732.37333.955330410911138115759130009737330122323.2365518354759.6890.408029294.70365.502219.8322.87511210.8780505414.5411.12716623.90693.79583.3316.957.39111.119.0962.524.5966.504963.286264.371763.43840.7641438.718089.35550585216610672115815.90382.036.641724.741287.09.05436.88135.409502728808158774371432678820114220926.52104436840262.2060.276198766.5480.602557.1287.76138312.2859945312.7970.97647019.06575.71485.8618.418.501259.9066.6727.7875.940971.980278.771076.52423.4153839.4368139.044006485177111112316082.14893.852.857881.457042.59.70539.53538.467522387301168465481414009393654121914.51110948647341.1280.269007804.4654.032467.9255.8364119.93829823274.3530.84772018.55972.452OpenBenchmarking.org

OSPray

Demo: San Miguel - Renderer: SciVis

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVis64 Cores120 Cores20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.95, N = 1552.6383.3355.5685.86

OSPray

Demo: XFrog Forest - Renderer: SciVis

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVis64 Cores120 Cores510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.11, N = 310.7516.9511.3618.41

OSPray

Demo: San Miguel - Renderer: Path Tracer

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path Tracer64 Cores120 Cores246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 34.327.394.938.50

OSPray

Demo: NASA Streamlines - Renderer: SciVis

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVis64 Cores120 Cores306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 371.43111.1183.33125.00

OSPray

Demo: XFrog Forest - Renderer: Path Tracer

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path Tracer64 Cores120 Cores3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.709.096.209.90

OSPray

Demo: Magnetic Reconnection - Renderer: SciVis

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVis64 Cores120 Cores1530456075SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 338.4662.5040.0066.67

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

HBv3HBv3 Milan-XOpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path Tracer64 Cores120 Cores714212835SE +/- 0.00, N = 3SE +/- 0.20, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 315.8724.5917.2427.78

Embree

Binary: Pathtracer - Model: Crown

HBv3HBv3 Milan-XOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer - Model: Crown64 Cores120 Cores20406080100SE +/- 0.05, N = 3SE +/- 0.17, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 340.7866.5046.0975.94

Embree

Binary: Pathtracer ISPC - Model: Crown

HBv3HBv3 Milan-XOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Crown64 Cores120 Cores1632486480SE +/- 0.08, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 338.9163.2944.2571.98

Embree

Binary: Pathtracer - Model: Asian Dragon

HBv3HBv3 Milan-XOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer - Model: Asian Dragon64 Cores120 Cores20406080100SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.29, N = 3SE +/- 0.21, N = 341.8664.3745.6878.77

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

HBv3HBv3 Milan-XOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian Dragon64 Cores120 Cores20406080100SE +/- 0.21, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 342.0063.4445.6676.52

HPC Challenge

Test / Class: Random Ring Bandwidth

HBv3HBv3 Milan-XOpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth64 Cores120 Cores2468101.829920.764146.202423.415381. (CC) gcc options: -lblas -lm -fexceptions -pthread -lmpi2. ATLAS + Open MPI 4.0.5

High Performance Conjugate Gradient

HBv3HBv3 Milan-XOpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.164 Cores120 Cores918273645SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 340.0238.7241.1339.441. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi

HPC Challenge

Test / Class: G-HPL

HBv3HBv3 Milan-XOpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL64 Cores120 Cores408012016020099.5789.36175.03139.041. (CC) gcc options: -lblas -lm -fexceptions -pthread -lmpi2. ATLAS + Open MPI 4.0.5

ONNX Runtime

Model: super-resolution-10 - Device: CPU

HBv3HBv3 Milan-XOpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.9.1Model: super-resolution-10 - Device: CPU64 Cores120 Cores14002800420056007000SE +/- 62.83, N = 3SE +/- 56.15, N = 3SE +/- 100.53, N = 9SE +/- 117.46, N = 961075852635464851. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O2 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread

OpenVKL

Benchmark: vklBenchmark ISPC

HBv3HBv3 Milan-XOpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPC64 Cores120 Cores4080120160200SE +/- 0.88, N = 3SE +/- 1.75, N = 5120166126177

OpenVKL

Benchmark: vklBenchmark Scalar

HBv3HBv3 Milan-XOpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark Scalar64 Cores120 Cores20406080100SE +/- 0.88, N = 3SE +/- 1.21, N = 9SE +/- 0.67, N = 3SE +/- 1.11, N = 67210674111

GraphicsMagick

Operation: Noise-Gaussian

HBv3HBv3 Milan-XOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussian64 Cores120 Cores2004006008001000SE +/- 6.24, N = 4SE +/- 4.18, N = 3SE +/- 4.48, N = 3SE +/- 11.24, N = 1558572187411231. (CC) gcc options: -fopenmp -O2 -pthread -ltiff -ljpeg -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

HBv3HBv3 Milan-XOpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth64 Cores120 Cores4K8K12K16K20K17174.7515815.9018347.8716082.151. (CC) gcc options: -lblas -lm -fexceptions -pthread -lmpi2. ATLAS + Open MPI 4.0.5

Zstd Compression

Compression Level: 19 - Compression Speed

HBv3HBv3 Milan-XOpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speed64 Cores120 Cores20406080100SE +/- 0.93, N = 15SE +/- 0.83, N = 6SE +/- 1.05, N = 15SE +/- 1.33, N = 385.182.0106.293.81. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

HBv3HBv3 Milan-XOpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speed64 Cores120 Cores1326395265SE +/- 0.34, N = 15SE +/- 0.29, N = 3SE +/- 0.64, N = 3SE +/- 0.50, N = 1539.836.659.852.81. (CC) gcc options: -O3 -pthread -lz -llzma

ASKAP

Test: tConvolve MPI - Degridding

HBv3HBv3 Milan-XOpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding64 Cores120 Cores12K24K36K48K60KSE +/- 204.47, N = 3SE +/- 146.90, N = 3SE +/- 263.83, N = 3SE +/- 0.00, N = 335988.041724.740896.357881.41. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

HBv3HBv3 Milan-XOpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding64 Cores120 Cores12K24K36K48K60KSE +/- 400.79, N = 3SE +/- 143.87, N = 3SE +/- 0.00, N = 338175.041287.041160.157042.51. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

HBv3HBv3 Milan-XOpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021.2Implementation: MPI CPU - Input: water_GMX50_bare64 Cores120 Cores3691215SE +/- 0.061, N = 3SE +/- 0.051, N = 3SE +/- 0.020, N = 3SE +/- 0.061, N = 37.4769.0547.9779.7051. (CXX) g++ options: -O2 -pthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

HBv3HBv3 Milan-XOpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms64 Cores120 Cores918273645SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 331.6136.8832.3739.541. (CXX) g++ options: -O2 -pthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

HBv3HBv3 Milan-XOpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein64 Cores120 Cores918273645SE +/- 0.24, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.29, N = 332.9635.4133.9638.471. (CXX) g++ options: -O2 -pthread -lm

Facebook RocksDB

Test: Random Read

HBv3HBv3 Milan-XOpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Random Read64 Cores120 Cores110M220M330M440M550MSE +/- 95304.17, N = 3SE +/- 4680557.08, N = 7SE +/- 648314.38, N = 3SE +/- 1522212.52, N = 33248224475027288083304109115223873011. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -O2 -fno-rtti -lgflags

Facebook RocksDB

Test: Read Random Write Random

HBv3HBv3 Milan-XOpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Read Random Write Random64 Cores120 Cores400K800K1200K1600K2000KSE +/- 10799.70, N = 3SE +/- 5368.09, N = 3SE +/- 6520.37, N = 3SE +/- 6175.84, N = 313573491587743138115716846541. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -O2 -fno-rtti -lgflags

John The Ripper

Test: MD5

HBv3HBv3 Milan-XOpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD564 Cores120 Cores2M4M6M8M10MSE +/- 54210.51, N = 15SE +/- 283586.65, N = 15SE +/- 10969.66, N = 3SE +/- 271831.96, N = 1556974677143267591300081414001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Kripke

HBv3HBv3 Milan-XOpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.464 Cores120 Cores20M40M60M80M100MSE +/- 1812363.94, N = 15SE +/- 2974209.65, N = 15SE +/- 2522711.91, N = 15SE +/- 2167036.06, N = 15736355218820114297373301939365411. (CXX) g++ options: -O2 -fopenmp

NAS Parallel Benchmarks

Test / Class: CG.C

HBv3HBv3 Milan-XOpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C64 Cores120 Cores5K10K15K20K25KSE +/- 51.72, N = 3SE +/- 26.14, N = 3SE +/- 34.77, N = 3SE +/- 70.02, N = 320940.7020926.5222323.2321914.511. (F9X) gfortran options: -O3 -march=native -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

BRL-CAD

VGR Performance Metric

HBv3HBv3 Milan-XOpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.32.2VGR Performance Metric64 Cores120 Cores200K400K600K800K1000K618492104436865518311094861. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm

LULESH

HBv3HBv3 Milan-XOpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.364 Cores120 Cores12K24K36K48K60KSE +/- 36.72, N = 3SE +/- 286.97, N = 3SE +/- 258.47, N = 3SE +/- 209.57, N = 344262.2340262.2154759.6947341.131. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi

NAMD

ATPase Simulation - 327,506 Atoms

HBv3HBv3 Milan-XOpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms64 Cores120 Cores0.09260.18520.27780.37040.463SE +/- 0.00048, N = 3SE +/- 0.00012, N = 3SE +/- 0.00005, N = 3SE +/- 0.00007, N = 30.411570.276190.408020.26900

WRF

Input: conus 2.5km

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5km64 Cores120 Cores2K4K6K8K10K10150.078766.549294.707804.461. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenFOAM

Input: Motorbike 60M

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60M64 Cores120 Cores20406080100SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.22, N = 389.6580.6065.5054.031. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

NWChem

Input: C240 Buckyball

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyball64 Cores120 Cores50010001500200025002256.62557.12219.82467.91. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lcomex -lm -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d64 Cores120 Cores80160240320400SE +/- 0.83, N = 3SE +/- 0.24, N = 3SE +/- 0.69, N = 3SE +/- 0.52, N = 3348.11287.76322.88255.841. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction64 Cores120 Cores48121620SE +/- 0.43535891, N = 15SE +/- 0.02189533, N = 3SE +/- 0.04593850, N = 3SE +/- 0.02331367, N = 313.6781092012.2859945010.878050509.938298231. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

RELION

Test: Basic - Device: CPU

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPU64 Cores120 Cores90180270360450SE +/- 1.03, N = 3SE +/- 1.58, N = 3SE +/- 0.68, N = 3SE +/- 1.22, N = 3418.48312.80414.54274.351. (CXX) g++ options: -fopenmp -std=c++0x -O2 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -fexceptions -pthread -lmpi_cxx -lmpi

Parboil

Test: OpenMP CUTCP

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP64 Cores120 Cores0.3410.6821.0231.3641.705SE +/- 0.014450, N = 3SE +/- 0.006046, N = 3SE +/- 0.011448, N = 6SE +/- 0.022647, N = 121.5155480.9764701.1271660.8477201. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Timed Linux Kernel Compilation

Time To Compile

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To Compile64 Cores120 Cores612182430SE +/- 0.22, N = 7SE +/- 0.16, N = 8SE +/- 0.21, N = 8SE +/- 0.12, N = 1324.1619.0723.9118.56

Timed Node.js Compilation

Time To Compile

HBv3HBv3 Milan-XOpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile64 Cores120 Cores20406080100SE +/- 0.27, N = 3SE +/- 0.28, N = 3SE +/- 0.31, N = 3SE +/- 0.32, N = 396.3575.7193.8072.45


Phoronix Test Suite v10.8.5