EPYC 7702 Sunny

AMD EPYC 7702 64-Core testing with an ASRockRack EPYCD8 (P2.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101184-PTS-EPYC770264&sor&grr.

EPYC 7702 Sunny system details (runs 1, 2, 2a, 4)

Processor: AMD EPYC 7702 64-Core @ 2.00GHz (64 Cores / 128 Threads)
Motherboard: ASRockRack EPYCD8 (P2.10 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: 280GB INTEL SSDPED1D280GA
Graphics: llvmpipe
Audio: AMD Starship/Matisse
Monitor: VE228
Network: 2 x Intel I350
OS: Ubuntu 20.10
Kernel: 5.10.2-051002-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
Display Driver: modesetting 1.20.9
OpenGL: 4.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0x830101c

Python Details
- Python 3.8.6

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

EPYC 7702 Sunny result overview (a dash marks a test with no result for that run)

Test                                           1          2          2a         4
onnx: shufflenet-v2-10 - OpenMP CPU            4867       -          4132       -
onnx: bertsquad-10 - OpenMP CPU                251        238        231        -
lammps: 20k Atoms                              24.630     24.713     24.231     -
mnn: inception-v3                              38.994     39.577     39.542     -
mnn: mobilenet-v1-1.0                          3.331      3.228      3.304      -
mnn: MobileNetV2_224                           5.084      5.114      5.042      -
mnn: resnet-v2-50                              28.647     28.564     28.340     -
mnn: SqueezeNetV1.0                            8.362      8.348      8.705      -
onnx: super-resolution-10 - OpenMP CPU         2157       -          2095       -
onnx: fcn-resnet101-11 - OpenMP CPU            74         72         70         -
onnx: yolov4 - OpenMP CPU                      211        201        199        -
build-godot: Time To Compile                   66.327     66.298     66.518     -
rav1e: 5                                       0.908      0.908      0.903      -
rav1e: 1                                       0.332      0.334      0.334      -
rav1e: 6                                       1.178      1.178      1.174      -
rav1e: 10                                      2.341      2.345      2.371      -
lulesh:                                        14396.471  14392.747  14315.958  -
cloverleaf: Lagrangian-Eulerian Hydrodynamics  18.58      20.46      20.90      27.07
synthmark: VoiceMark_100                       644.467    644.649    643.648    -
tnn: CPU - MobileNet v2                        353.684    369.317    363.758    -
tnn: CPU - SqueezeNet v1.1                     304.447    304.588    304.819    -
lammps: Rhodopsin Protein                      18.246     17.631     17.499     -

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)

Run  Result  SE +/-  N
1    4867    231.65  12
2a   4132    215.45  12

(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)

Run  Result  SE +/-  N
1    251     3.22    12
2    238     0.73    3
2a   231     2.53    12

(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)

Run  Result  SE +/-  N
2    24.71   0.08    3
1    24.63   0.03    3
2a   24.23   0.05    3

(CXX) g++ options: -O3 -pthread -lm

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)

Run  Result  SE +/-  N   MIN    MAX
1    38.99   0.38    15  35.51  46.58
2a   39.54   0.35    15  35.89  48.53
2    39.58   1.17    3   35.74  45.57

(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)

Run  Result  SE +/-  N   MIN   MAX
2    3.228   0.022   3   3.11  5.38
2a   3.304   0.009   15  3.13  6.39
1    3.331   0.012   15  3.14  4.06

(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)

Run  Result  SE +/-  N   MIN   MAX
2a   5.042   0.033   15  4.78  8.1
1    5.084   0.037   15  4.78  6.22
2    5.114   0.100   3   4.85  6.71

(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)

Run  Result  SE +/-  N   MIN    MAX
2a   28.34   0.05    15  27.16  32.53
2    28.56   0.25    3   27.48  33.5
1    28.65   0.15    15  27.13  39.22

(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)

Run  Result  SE +/-  N   MIN   MAX
2    8.348   0.059   3   8.1   9.98
1    8.362   0.170   15  7.54  11.23
2a   8.705   0.239   15  7.5   14.18

(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)

Run  Result  SE +/-  N
1    2157    28.81   4
2a   2095    18.77   3

(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)

Run  Result  SE +/-  N
1    74      0.17    3
2    72      0.00    3
2a   70      0.17    3

(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)

Run  Result  SE +/-  N
1    211     1.45    3
2    201     0.44    3
2a   199     0.44    3

(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)

Run  Result  SE +/-  N
2    66.30   0.15    3
1    66.33   0.30    3
2a   66.52   0.04    3

rav1e

Speed: 5

rav1e 0.4 (Frames Per Second, More Is Better)

Run  Result  SE +/-  N
2    0.908   0.001   3
1    0.908   0.001   3
2a   0.903   0.002   3

rav1e

Speed: 1

rav1e 0.4 (Frames Per Second, More Is Better)

Run  Result  SE +/-  N
2a   0.334   0.001   3
2    0.334   0.001   3
1    0.332   0.001   3

rav1e

Speed: 6

rav1e 0.4 (Frames Per Second, More Is Better)

Run  Result  SE +/-  N
2    1.178   0.002   3
1    1.178   0.005   3
2a   1.174   0.001   3

rav1e

Speed: 10

rav1e 0.4 (Frames Per Second, More Is Better)

Run  Result  SE +/-  N
2a   2.371   0.029   3
2    2.345   0.007   3
1    2.341   0.012   3

LULESH

LULESH 2.0.3 (z/s, More Is Better)

Run  Result    SE +/-  N
1    14396.47  96.69   3
2    14392.75  111.33  3
2a   14315.96  25.04   3

(CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

CloverLeaf (Seconds, Fewer Is Better)

Run  Result  SE +/-  N
1    18.58   0.09    3
2    20.46   0.15    3
2a   20.90   0.03    3
4    27.07   0.29    7

(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Google SynthMark

Test: VoiceMark_100

Google SynthMark 20201109 (Voices, More Is Better)

Run  Result  SE +/-  N
2    644.65  0.11    3
1    644.47  0.44    3
2a   643.65  0.33    3

(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 (ms, Fewer Is Better)

Run  Result  SE +/-  N  MIN     MAX
1    353.68  3.83    3  315.66  447.76
2a   363.76  2.21    3  314.2   474.09
2    369.32  3.76    3  313.18  460.92

(CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 (ms, Fewer Is Better)

Run  Result  SE +/-  N  MIN     MAX
1    304.45  0.10    3  303.75  305.53
2    304.59  0.15    3  303.74  305.38
2a   304.82  0.37    3  303.72  320.85

(CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)

Run  Result  SE +/-  N
1    18.25   0.55    13
2    17.63   0.24    12
2a   17.50   0.24    3

(CXX) g++ options: -O3 -pthread -lm


Phoronix Test Suite v10.8.4