EPYC 7502

AMD EPYC 7502 32-Core testing with a ASRockRack EPYCD8 (P2.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012126-HA-EPYC7502356.

EPYC 7502ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123AMD EPYC 7502 32-Core @ 2.50GHz (32 Cores / 64 Threads)ASRockRack EPYCD8 (P2.10 BIOS)AMD Starship/Matisse126GB280GB INTEL SSDPED1D280GAllvmpipeAMD Starship/MatisseVE2282 x Intel I350Ubuntu 20.105.8.0-31-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)GCC 10.2.0ext41024x7681920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x830101c Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7502compilebench: Compilecompilebench: Initial Createcompilebench: Read Compiled Treehmmer: Pfam Database Searchmafft: Multiple Sequence Alignment - LSU RNAgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spaceonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondbuild-ffmpeg: Time To Compilesqlite-speedtest: Timed Time - Size 1,000apache-siege: 10apache-siege: 50apache-siege: 100apache-siege: 200apache-siege: 250apache-siege: 500brl-cad: VGR Performance Metric1231826.99529.462774.39176.77010.4101198520372567176751411241.594882.993231.187701.022323.505872.132233.699574.026602.210531.954662780.111006.1482958.901002.2380.5381542782.711023.251.174321128026.25998726.17482.97119503.9929285.9030541.4533062.4737395.2346298.753622911840.71543.802757.12176.85710.3171194508372564175950611011.600432.998821.184651.0049743.507992.125223.695803.994042.208641.956722782.401014.802787.42996.4840.5415692790.871012.5151.173681129442.35026526.26184.91720108.1329257.8830529.8832895.0534655.331831.66544.252783.90176.75410.2561200524375567176951311331.599182.990841.186701.008963.501532.134623.689994.004532.215501.964042764.711011.4062772.041026.040.5416232772.141022.601.173321128056.11086326.23183.411OpenBenchmarking.org

Compile Bench

Test: Compile

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Compile123400800120016002000SE +/- 11.84, N = 3SE +/- 25.27, N = 3SE +/- 6.63, N = 31826.991840.711831.66

Compile Bench

Test: Initial Create

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Initial Create123120240360480600SE +/- 9.07, N = 3SE +/- 0.74, N = 3SE +/- 2.36, N = 3529.46543.80544.25

Compile Bench

Test: Read Compiled Tree

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Read Compiled Tree1236001200180024003000SE +/- 9.50, N = 3SE +/- 9.58, N = 3SE +/- 9.50, N = 32774.392757.122783.90

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search1234080120160200SE +/- 0.39, N = 3SE +/- 0.16, N = 3SE +/- 0.20, N = 3176.77176.86176.751. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 310.4110.3210.261. (CC) gcc options: -std=c99 -O3 -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirl12330060090012001500SE +/- 2.91, N = 3SE +/- 0.58, N = 3SE +/- 1.00, N = 31198119412001. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate123110220330440550SE +/- 0.67, N = 3SE +/- 3.21, N = 35205085241. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen12380160240320400SE +/- 0.33, N = 33723723751. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced123120240360480600SE +/- 0.33, N = 35675645671. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing123400800120016002000SE +/- 4.04, N = 3SE +/- 18.15, N = 3SE +/- 14.40, N = 31767175917691. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussian123110220330440550SE +/- 0.67, N = 35145065131. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Space1232004006008001000SE +/- 0.67, N = 3SE +/- 0.67, N = 31124110111331. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lX11 -lz -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.36010.72021.08031.44041.8005SE +/- 0.00343, N = 3SE +/- 0.00590, N = 3SE +/- 0.00184, N = 31.594881.600431.59918MIN: 1.52MIN: 1.53MIN: 1.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1230.67471.34942.02412.69883.3735SE +/- 0.00417, N = 3SE +/- 0.00952, N = 3SE +/- 0.00136, N = 32.993232.998822.99084MIN: 2.91MIN: 2.9MIN: 2.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.26720.53440.80161.06881.336SE +/- 0.00307, N = 3SE +/- 0.00481, N = 3SE +/- 0.00216, N = 31.187701.184651.18670MIN: 1.13MIN: 1.13MIN: 1.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.230.460.690.921.15SE +/- 0.009672, N = 3SE +/- 0.007652, N = 3SE +/- 0.003333, N = 31.0223201.0049741.008960MIN: 0.96MIN: 0.95MIN: 0.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1230.78931.57862.36793.15723.9465SE +/- 0.01112, N = 3SE +/- 0.01399, N = 3SE +/- 0.02094, N = 33.505873.507993.50153MIN: 3.43MIN: 3.42MIN: 3.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.48030.96061.44091.92122.4015SE +/- 0.01107, N = 3SE +/- 0.00581, N = 3SE +/- 0.00581, N = 32.132232.125222.13462MIN: 2.06MIN: 2.06MIN: 2.071. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1230.83241.66482.49723.32964.162SE +/- 0.01637, N = 3SE +/- 0.00747, N = 3SE +/- 0.00980, N = 33.699573.695803.68999MIN: 3.51MIN: 3.52MIN: 3.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1230.9061.8122.7183.6244.53SE +/- 0.01678, N = 3SE +/- 0.02974, N = 3SE +/- 0.03862, N = 34.026603.994044.00453MIN: 3.88MIN: 3.87MIN: 3.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.49850.9971.49551.9942.4925SE +/- 0.00442, N = 3SE +/- 0.00333, N = 3SE +/- 0.00830, N = 32.210532.208642.21550MIN: 2.07MIN: 2.07MIN: 2.071. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.44190.88381.32571.76762.2095SE +/- 0.00337, N = 3SE +/- 0.00252, N = 3SE +/- 0.00205, N = 31.954661.956721.96404MIN: 1.82MIN: 1.88MIN: 1.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1236001200180024003000SE +/- 6.08, N = 3SE +/- 3.21, N = 3SE +/- 4.80, N = 32780.112782.402764.71MIN: 2761.69MIN: 2757.63MIN: 2746.181. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1232004006008001000SE +/- 13.95, N = 3SE +/- 3.95, N = 3SE +/- 11.63, N = 31006.151014.801011.41MIN: 977.62MIN: 1001.6MIN: 980.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1236001200180024003000SE +/- 170.22, N = 15SE +/- 7.52, N = 3SE +/- 4.93, N = 32958.902787.422772.04MIN: 2768.54MIN: 2769.22MIN: 2750.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1232004006008001000SE +/- 10.79, N = 3SE +/- 0.88, N = 3SE +/- 1.20, N = 31002.24996.481026.04MIN: 978.04MIN: 989.39MIN: 1017.381. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.12190.24380.36570.48760.6095SE +/- 0.002054, N = 3SE +/- 0.001887, N = 3SE +/- 0.001884, N = 30.5381540.5415690.541623MIN: 0.51MIN: 0.52MIN: 0.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1236001200180024003000SE +/- 1.44, N = 3SE +/- 2.00, N = 3SE +/- 9.53, N = 32782.712790.872772.14MIN: 2766.03MIN: 2779.36MIN: 2746.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1232004006008001000SE +/- 4.43, N = 3SE +/- 7.29, N = 3SE +/- 0.49, N = 31023.251012.521022.60MIN: 1007.84MIN: 992.98MIN: 1015.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.26420.52840.79261.05681.321SE +/- 0.00140, N = 3SE +/- 0.00431, N = 3SE +/- 0.00085, N = 31.174321.173681.17332MIN: 1.07MIN: 1.08MIN: 1.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second123200K400K600K800K1000KSE +/- 3711.65, N = 3SE +/- 4150.99, N = 3SE +/- 4608.87, N = 31128026.261129442.351128056.111. (CC) gcc options: -O2 -lrt" -lrt

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123612182430SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 326.1726.2626.23

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.14, N = 3SE +/- 0.33, N = 3SE +/- 0.80, N = 382.9784.9283.411. (CC) gcc options: -O2 -ldl -lz -lpthread

Apache Siege

Concurrent Users: 10

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 10124K8K12K16K20KSE +/- 326.19, N = 3SE +/- 94.53, N = 319503.9920108.131. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 50

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 50126K12K18K24K30KSE +/- 82.82, N = 3SE +/- 121.02, N = 329285.9029257.881. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 100

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 100127K14K21K28K35KSE +/- 116.40, N = 3SE +/- 163.78, N = 330541.4530529.881. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 200127K14K21K28K35KSE +/- 127.12, N = 3SE +/- 71.57, N = 333062.4732895.051. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250128K16K24K32K40KSE +/- 489.59, N = 3SE +/- 253.85, N = 337395.2334655.331. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 500

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 500110K20K30K40K50KSE +/- 1011.06, N = 1546298.751. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric180K160K240K320K400K3622911. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lGL -lGLdispatch -lX11 -lpthread -ldl -lm


Phoronix Test Suite v10.8.4