EPYC 7502

AMD EPYC 7502 32-Core testing with an ASRockRack EPYCD8 (P2.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012126-HA-EPYC7502356&sor&gru.

EPYC 7502
Result identifiers: 1, 2, 3

Processor: AMD EPYC 7502 32-Core @ 2.50GHz (32 Cores / 64 Threads)
Motherboard: ASRockRack EPYCD8 (P2.10 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: 280GB INTEL SSDPED1D280GA
Graphics: llvmpipe
Audio: AMD Starship/Matisse
Monitor: VE228
Network: 2 x Intel I350
OS: Ubuntu 20.10
Kernel: 5.8.0-31-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
Display Driver: modesetting 1.20.9
OpenGL: 4.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1024x768, 1920x1080

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x830101c
Python Details: Python 3.8.6
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7502

Test | 1 | 2 | 3
graphics-magick: Swirl | 1198 | 1194 | 1200
graphics-magick: Rotate | 520 | 508 | 524
graphics-magick: Sharpen | 372 | 372 | 375
graphics-magick: Enhanced | 567 | 564 | 567
graphics-magick: Resizing | 1767 | 1759 | 1769
graphics-magick: Noise-Gaussian | 514 | 506 | 513
graphics-magick: HWB Color Space | 1124 | 1101 | 1133
coremark: CoreMark Size 666 - Iterations Per Second | 1128026.259987 | 1129442.350265 | 1128056.110863
compilebench: Compile | 1826.99 | 1840.71 | 1831.66
compilebench: Initial Create | 529.46 | 543.80 | 544.25
compilebench: Read Compiled Tree | 2774.39 | 2757.12 | 2783.90
apache-siege: 10 | 19503.99 | 20108.13 | -
apache-siege: 50 | 29285.90 | 29257.88 | -
apache-siege: 100 | 30541.45 | 30529.88 | -
apache-siege: 200 | 33062.47 | 32895.05 | -
apache-siege: 250 | 37395.23 | 34655.33 | -
apache-siege: 500 | 46298.75 | - | -
brl-cad: VGR Performance Metric | 362291 | - | -
onednn: IP Shapes 1D - f32 - CPU | 1.59488 | 1.60043 | 1.59918
onednn: IP Shapes 3D - f32 - CPU | 2.99323 | 2.99882 | 2.99084
onednn: IP Shapes 1D - u8s8f32 - CPU | 1.18770 | 1.18465 | 1.18670
onednn: IP Shapes 3D - u8s8f32 - CPU | 1.02232 | 1.004974 | 1.00896
onednn: Convolution Batch Shapes Auto - f32 - CPU | 3.50587 | 3.50799 | 3.50153
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 2.13223 | 2.12522 | 2.13462
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 3.69957 | 3.69580 | 3.68999
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 4.02660 | 3.99404 | 4.00453
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 2.21053 | 2.20864 | 2.21550
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 1.95466 | 1.95672 | 1.96404
onednn: Recurrent Neural Network Training - f32 - CPU | 2780.11 | 2782.40 | 2764.71
onednn: Recurrent Neural Network Inference - f32 - CPU | 1006.148 | 1014.80 | 1011.406
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 2958.90 | 2787.42 | 2772.04
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 1002.238 | 996.484 | 1026.04
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.538154 | 0.541569 | 0.541623
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 2782.71 | 2790.87 | 2772.14
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 1023.25 | 1012.515 | 1022.60
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 1.17432 | 1.17368 | 1.17332
hmmer: Pfam Database Search | 176.770 | 176.857 | 176.754
mafft: Multiple Sequence Alignment - LSU RNA | 10.410 | 10.317 | 10.256
build-ffmpeg: Time To Compile | 26.174 | 26.261 | 26.231
sqlite-speedtest: Timed Time - Size 1,000 | 82.971 | 84.917 | 83.411

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33, Operation: Swirl (Iterations Per Minute, More Is Better)
Run 3: 1200; Run 1: 1198; Run 2: 1194
SE +/- 1.00, N = 3; SE +/- 2.91, N = 3; SE +/- 0.58, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33, Operation: Rotate (Iterations Per Minute, More Is Better)
Run 3: 524; Run 1: 520; Run 2: 508
SE +/- 0.67, N = 3; SE +/- 3.21, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33, Operation: Sharpen (Iterations Per Minute, More Is Better)
Run 3: 375; Run 2: 372; Run 1: 372
SE +/- 0.33, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33, Operation: Enhanced (Iterations Per Minute, More Is Better)
Run 3: 567; Run 1: 567; Run 2: 564
SE +/- 0.33, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33, Operation: Resizing (Iterations Per Minute, More Is Better)
Run 3: 1769; Run 1: 1767; Run 2: 1759
SE +/- 14.40, N = 3; SE +/- 4.04, N = 3; SE +/- 18.15, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
Run 1: 514; Run 3: 513; Run 2: 506
SE +/- 0.67, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
Run 3: 1133; Run 1: 1124; Run 2: 1101
SE +/- 0.67, N = 3; SE +/- 0.67, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
Run 2: 1129442.35; Run 3: 1128056.11; Run 1: 1128026.26
SE +/- 4150.99, N = 3; SE +/- 4608.87, N = 3; SE +/- 3711.65, N = 3
1. (CC) gcc options: -O2 -lrt" -lrt

Compile Bench

Test: Compile

Compile Bench 0.6, Test: Compile (MB/s, More Is Better)
Run 2: 1840.71; Run 3: 1831.66; Run 1: 1826.99
SE +/- 25.27, N = 3; SE +/- 6.63, N = 3; SE +/- 11.84, N = 3

Compile Bench

Test: Initial Create

Compile Bench 0.6, Test: Initial Create (MB/s, More Is Better)
Run 3: 544.25; Run 2: 543.80; Run 1: 529.46
SE +/- 2.36, N = 3; SE +/- 0.74, N = 3; SE +/- 9.07, N = 3

Compile Bench

Test: Read Compiled Tree

Compile Bench 0.6, Test: Read Compiled Tree (MB/s, More Is Better)
Run 3: 2783.90; Run 1: 2774.39; Run 2: 2757.12
SE +/- 9.50, N = 3; SE +/- 9.50, N = 3; SE +/- 9.58, N = 3

Apache Siege

Concurrent Users: 10

Apache Siege 2.4.29, Concurrent Users: 10 (Transactions Per Second, More Is Better)
Run 2: 20108.13; Run 1: 19503.99
SE +/- 94.53, N = 3; SE +/- 326.19, N = 3
1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 50

Apache Siege 2.4.29, Concurrent Users: 50 (Transactions Per Second, More Is Better)
Run 1: 29285.90; Run 2: 29257.88
SE +/- 82.82, N = 3; SE +/- 121.02, N = 3
1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 100

Apache Siege 2.4.29, Concurrent Users: 100 (Transactions Per Second, More Is Better)
Run 1: 30541.45; Run 2: 30529.88
SE +/- 116.40, N = 3; SE +/- 163.78, N = 3
1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 200

Apache Siege 2.4.29, Concurrent Users: 200 (Transactions Per Second, More Is Better)
Run 1: 33062.47; Run 2: 32895.05
SE +/- 127.12, N = 3; SE +/- 71.57, N = 3
1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 250

Apache Siege 2.4.29, Concurrent Users: 250 (Transactions Per Second, More Is Better)
Run 1: 37395.23; Run 2: 34655.33
SE +/- 489.59, N = 3; SE +/- 253.85, N = 3
1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
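
The two results at 250 concurrent users are further apart than at any lower concurrency level. As a reading aid, here is a minimal Python sketch of the relative gap, using only the two values reported above. The function name `percent_difference` is hypothetical and is not part of the Phoronix Test Suite.

```python
def percent_difference(a: float, b: float) -> float:
    """Return how much larger a is than b, as a percentage of b."""
    return (a - b) / b * 100.0


# Transactions per second at 250 concurrent users, copied from the result above.
run_1 = 37395.23
run_2 = 34655.33

gap = percent_difference(run_1, run_2)
print(f"Run 1 is {gap:.2f}% ahead of run 2")
```

The same helper can be applied to any other pair of values in this result file; for "Fewer Is Better" tests a positive gap means the first value is the slower one.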

Apache Siege

Concurrent Users: 500

Apache Siege 2.4.29, Concurrent Users: 500 (Transactions Per Second, More Is Better)
Run 1: 46298.75
SE +/- 1011.06, N = 15
1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
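
The result at 500 concurrent users is reported with N = 15 and a comparatively large standard error. One way to put that in context is to express the standard error as a percentage of the reported value. The following is a minimal Python sketch, assuming the reported value is the mean of the samples; the function name `relative_standard_error` is hypothetical.

```python
def relative_standard_error(mean: float, standard_error: float) -> float:
    """Return the standard error as a percentage of the mean."""
    return standard_error / mean * 100.0


# Values copied from the 500 concurrent users result above.
mean_tps = 46298.75
se_tps = 1011.06

rse = relative_standard_error(mean_tps, se_tps)
print(f"Relative standard error: {rse:.2f}%")
```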

BRL-CAD

VGR Performance Metric

BRL-CAD 7.30.8, VGR Performance Metric (VGR Performance Metric, More Is Better)
Run 1: 362291
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lGL -lGLdispatch -lX11 -lpthread -ldl -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 1.59488; Run 3: 1.59918; Run 2: 1.60043
SE +/- 0.00343, N = 3; SE +/- 0.00184, N = 3; SE +/- 0.00590, N = 3
MIN: 1.52; MIN: 1.52; MIN: 1.53
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 2.99084; Run 1: 2.99323; Run 2: 2.99882
SE +/- 0.00136, N = 3; SE +/- 0.00417, N = 3; SE +/- 0.00952, N = 3
MIN: 2.88; MIN: 2.91; MIN: 2.9
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 2: 1.18465; Run 3: 1.18670; Run 1: 1.18770
SE +/- 0.00481, N = 3; SE +/- 0.00216, N = 3; SE +/- 0.00307, N = 3
MIN: 1.13; MIN: 1.13; MIN: 1.13
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 2: 1.004974; Run 3: 1.008960; Run 1: 1.022320
SE +/- 0.007652, N = 3; SE +/- 0.003333, N = 3; SE +/- 0.009672, N = 3
MIN: 0.95; MIN: 0.95; MIN: 0.96
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 3.50153; Run 1: 3.50587; Run 2: 3.50799
SE +/- 0.02094, N = 3; SE +/- 0.01112, N = 3; SE +/- 0.01399, N = 3
MIN: 3.41; MIN: 3.43; MIN: 3.42
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 2: 2.12522; Run 1: 2.13223; Run 3: 2.13462
SE +/- 0.00581, N = 3; SE +/- 0.01107, N = 3; SE +/- 0.00581, N = 3
MIN: 2.06; MIN: 2.06; MIN: 2.07
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 3.68999; Run 2: 3.69580; Run 1: 3.69957
SE +/- 0.00980, N = 3; SE +/- 0.00747, N = 3; SE +/- 0.01637, N = 3
MIN: 3.51; MIN: 3.52; MIN: 3.51
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 2: 3.99404; Run 3: 4.00453; Run 1: 4.02660
SE +/- 0.02974, N = 3; SE +/- 0.03862, N = 3; SE +/- 0.01678, N = 3
MIN: 3.87; MIN: 3.87; MIN: 3.88
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 2: 2.20864; Run 1: 2.21053; Run 3: 2.21550
SE +/- 0.00333, N = 3; SE +/- 0.00442, N = 3; SE +/- 0.00830, N = 3
MIN: 2.07; MIN: 2.07; MIN: 2.07
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 1.95466; Run 2: 1.95672; Run 3: 1.96404
SE +/- 0.00337, N = 3; SE +/- 0.00252, N = 3; SE +/- 0.00205, N = 3
MIN: 1.82; MIN: 1.88; MIN: 1.88
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 2764.71; Run 1: 2780.11; Run 2: 2782.40
SE +/- 4.80, N = 3; SE +/- 6.08, N = 3; SE +/- 3.21, N = 3
MIN: 2746.18; MIN: 2761.69; MIN: 2757.63
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 1006.15; Run 3: 1011.41; Run 2: 1014.80
SE +/- 13.95, N = 3; SE +/- 11.63, N = 3; SE +/- 3.95, N = 3
MIN: 977.62; MIN: 980.15; MIN: 1001.6
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 2772.04; Run 2: 2787.42; Run 1: 2958.90
SE +/- 4.93, N = 3; SE +/- 7.52, N = 3; SE +/- 170.22, N = 15
MIN: 2750.97; MIN: 2769.22; MIN: 2768.54
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 2: 996.48; Run 1: 1002.24; Run 3: 1026.04
SE +/- 0.88, N = 3; SE +/- 10.79, N = 3; SE +/- 1.20, N = 3
MIN: 989.39; MIN: 978.04; MIN: 1017.38
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 0.538154; Run 2: 0.541569; Run 3: 0.541623
SE +/- 0.002054, N = 3; SE +/- 0.001887, N = 3; SE +/- 0.001884, N = 3
MIN: 0.51; MIN: 0.52; MIN: 0.52
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Run 3: 2772.14; Run 1: 2782.71; Run 2: 2790.87
SE +/- 9.53, N = 3; SE +/- 1.44, N = 3; SE +/- 2.00, N = 3
MIN: 2746.59; MIN: 2766.03; MIN: 2779.36
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Run 2: 1012.52; Run 3: 1022.60; Run 1: 1023.25
SE +/- 7.29, N = 3; SE +/- 0.49, N = 3; SE +/- 4.43, N = 3
MIN: 992.98; MIN: 1015.53; MIN: 1007.84
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 1.17332; Run 2: 1.17368; Run 1: 1.17432
SE +/- 0.00085, N = 3; SE +/- 0.00431, N = 3; SE +/- 0.00140, N = 3
MIN: 1.08; MIN: 1.08; MIN: 1.07
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.1, Pfam Database Search (Seconds, Fewer Is Better)
Run 3: 176.75; Run 1: 176.77; Run 2: 176.86
SE +/- 0.20, N = 3; SE +/- 0.39, N = 3; SE +/- 0.16, N = 3
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

Timed MAFFT Alignment 7.471, Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
Run 3: 10.26; Run 2: 10.32; Run 1: 10.41
SE +/- 0.01, N = 3; SE +/- 0.05, N = 3; SE +/- 0.03, N = 3
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.2.2, Time To Compile (Seconds, Fewer Is Better)
Run 1: 26.17; Run 3: 26.23; Run 2: 26.26
SE +/- 0.08, N = 3; SE +/- 0.11, N = 3; SE +/- 0.05, N = 3

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30, Timed Time - Size 1,000 (Seconds, Fewer Is Better)
Run 1: 82.97; Run 3: 83.41; Run 2: 84.92
SE +/- 0.14, N = 3; SE +/- 0.80, N = 3; SE +/- 0.33, N = 3
1. (CC) gcc options: -O2 -ldl -lz -lpthread


Phoronix Test Suite v10.8.4