AMD EPYC 7F72 24-Core testing with a ASRockRack EPYCD8 (P2.10 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2009304-FI-EPYC7F72E54 epyc-7f72-eo-september - Phoronix Test Suite epyc-7f72-eo-september AMD EPYC 7F72 24-Core testing with a ASRockRack EPYCD8 (P2.10 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009304-FI-EPYC7F72E54&export=pdf&gru&sor .
epyc-7f72-eo-september Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution EPYC 7F72 EPYC 7F72 x 2 3 AMD EPYC 7F72 24-Core @ 3.20GHz (24 Cores / 48 Threads) ASRockRack EPYCD8 (P2.10 BIOS) AMD Starship/Matisse 126GB 3841GB Micron_9300_MTFDHAL3T8TDP ASPEED AMD Starship/Matisse 2 x Intel I350 Ubuntu 20.04 5.9.0-050900rc6daily20200921-generic (x86_64) 20200920 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 GCC 9.3.0 ext4 1024x768 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101c Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
epyc-7f72-eo-september byte: Dhrystone 2 ffte: N=256, 3D Complex FFT Routine lczero: BLAS lczero: Eigen lczero: Rand keydb: hint: FLOAT caffe: AlexNet - CPU - 100 caffe: AlexNet - CPU - 200 caffe: AlexNet - CPU - 1000 caffe: GoogleNet - CPU - 100 caffe: GoogleNet - CPU - 200 caffe: GoogleNet - CPU - 1000 ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny dolfyn: Computational Fluid Dynamics hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA couchdb: 100 - 1000 - 24 mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression EPYC 7F72 EPYC 7F72 x 2 3 37756888.4 121809.43318954 2227 2073 164671 420383.50 18.627 142.413 9.465 103.685 38374014.4 122399.72540484 2256 2178 164311 421676.28 328822812.22518 75293 150373 751617 190013 379880 1902543 18.87 19.42 9.21 8.80 9.32 8.40 11.09 3.74 19.62 32.83 13.04 9.16 22.78 30.20 18.637 142.611 9.559 103.859 61.04 39.45 24.28 1.65 37972490.0 121264.37712651 2213 2165 166198 419756.03 328868294.68809 75164 150269 752070 189355 380086 1900487 19.38 19.93 9.26 8.81 9.33 8.45 11.03 3.77 19.74 32.85 13.05 9.42 22.71 30.14 18.540 142.595 9.644 104.448 60.77 39.30 24.28 1.65 37884312.2 122021.48057904 2228 2096 165528 419091.19 328901110.00869 75300 150626 751645 190069 379324 1900833 19.62 20.42 9.28 8.88 9.34 8.41 11.00 3.73 19.68 32.52 13.00 9.25 22.92 30.27 18.646 142.538 9.575 104.647 61.39 39.61 24.26 1.65 OpenBenchmarking.org
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 EPYC 7F72 x 2 3 EPYC 7F72 8M 16M 24M 32M 40M SE +/- 421072.96, N = 3 SE +/- 493714.96, N = 5 SE +/- 550044.12, N = 3 SE +/- 390690.78, N = 3 38374014.4 37972490.0 37884312.2 37756888.4
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine EPYC 7F72 x 3 EPYC 7F72 2 30K 60K 90K 120K 150K SE +/- 285.55, N = 3 SE +/- 254.91, N = 3 SE +/- 630.92, N = 3 SE +/- 807.96, N = 3 122399.73 122021.48 121809.43 121264.38 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS EPYC 7F72 x 3 EPYC 7F72 2 500 1000 1500 2000 2500 SE +/- 24.04, N = 7 SE +/- 9.53, N = 3 SE +/- 11.27, N = 3 SE +/- 28.46, N = 5 2256 2228 2227 2213 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen EPYC 7F72 x 2 3 EPYC 7F72 500 1000 1500 2000 2500 SE +/- 26.56, N = 3 SE +/- 11.22, N = 3 SE +/- 28.45, N = 9 SE +/- 23.67, N = 9 2178 2165 2096 2073 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Random OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Random 2 3 EPYC 7F72 EPYC 7F72 x 40K 80K 120K 160K 200K SE +/- 1148.31, N = 3 SE +/- 1873.00, N = 3 SE +/- 1317.43, N = 3 SE +/- 1500.11, N = 3 166198 165528 164671 164311 1. (CXX) g++ options: -flto -pthread
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 EPYC 7F72 x EPYC 7F72 2 3 90K 180K 270K 360K 450K SE +/- 2108.70, N = 3 SE +/- 250.94, N = 3 SE +/- 1309.71, N = 3 SE +/- 1744.78, N = 3 421676.28 420383.50 419756.03 419091.19 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 3 2 EPYC 7F72 x 70M 140M 210M 280M 350M SE +/- 377618.26, N = 3 SE +/- 40216.53, N = 3 SE +/- 53237.74, N = 3 328901110.01 328868294.69 328822812.23 1. (CC) gcc options: -O3 -march=native -lm
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 2 EPYC 7F72 x 3 16K 32K 48K 64K 80K SE +/- 171.98, N = 3 SE +/- 23.86, N = 3 SE +/- 215.05, N = 3 75164 75293 75300 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 2 EPYC 7F72 x 3 30K 60K 90K 120K 150K SE +/- 69.27, N = 3 SE +/- 211.31, N = 3 SE +/- 75.05, N = 3 150269 150373 150626 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 EPYC 7F72 x 3 2 160K 320K 480K 640K 800K SE +/- 670.58, N = 3 SE +/- 811.75, N = 3 SE +/- 654.75, N = 3 751617 751645 752070 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 2 EPYC 7F72 x 3 40K 80K 120K 160K 200K SE +/- 245.02, N = 3 SE +/- 464.46, N = 3 SE +/- 407.34, N = 3 189355 190013 190069 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 3 EPYC 7F72 x 2 80K 160K 240K 320K 400K SE +/- 266.02, N = 3 SE +/- 429.90, N = 3 SE +/- 518.16, N = 3 379324 379880 380086 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 2 3 EPYC 7F72 x 400K 800K 1200K 1600K 2000K SE +/- 1530.16, N = 3 SE +/- 909.25, N = 3 SE +/- 2432.39, N = 3 1900487 1900833 1902543 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet EPYC 7F72 x 2 3 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.20, N = 7 SE +/- 0.30, N = 3 18.87 19.38 19.62 MIN: 18.36 / MAX: 20.85 MIN: 18.46 / MAX: 118.13 MIN: 18.65 / MAX: 21.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet EPYC 7F72 x 2 3 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.28, N = 7 SE +/- 0.68, N = 3 19.42 19.93 20.42 MIN: 18.9 / MAX: 21.51 MIN: 18.87 / MAX: 22.76 MIN: 18.91 / MAX: 84.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 EPYC 7F72 x 2 3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.04, N = 7 SE +/- 0.12, N = 3 9.21 9.26 9.28 MIN: 8.89 / MAX: 10.51 MIN: 8.75 / MAX: 11.07 MIN: 8.89 / MAX: 10.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 EPYC 7F72 x 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.04, N = 7 SE +/- 0.07, N = 3 8.80 8.81 8.88 MIN: 8.56 / MAX: 10.78 MIN: 8.52 / MAX: 10.79 MIN: 8.49 / MAX: 76.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 EPYC 7F72 x 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.06, N = 6 SE +/- 0.06, N = 3 9.32 9.33 9.34 MIN: 9.09 / MAX: 14.74 MIN: 9.05 / MAX: 11.33 MIN: 9.14 / MAX: 10.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet EPYC 7F72 x 3 2 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 7 8.40 8.41 8.45 MIN: 8.11 / MAX: 10.16 MIN: 8.08 / MAX: 11.46 MIN: 8.05 / MAX: 10.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 3 2 EPYC 7F72 x 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.06, N = 7 SE +/- 0.09, N = 3 11.00 11.03 11.09 MIN: 10.66 / MAX: 13.36 MIN: 10.67 / MAX: 16.51 MIN: 10.74 / MAX: 13.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 3 EPYC 7F72 x 2 0.8483 1.6966 2.5449 3.3932 4.2415 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 7 3.73 3.74 3.77 MIN: 3.58 / MAX: 4.65 MIN: 3.46 / MAX: 4.73 MIN: 3.58 / MAX: 6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet EPYC 7F72 x 3 2 5 10 15 20 25 SE +/- 0.14, N = 3 SE +/- 0.17, N = 3 SE +/- 0.09, N = 7 19.62 19.68 19.74 MIN: 19.13 / MAX: 22.12 MIN: 19.15 / MAX: 21.9 MIN: 19.14 / MAX: 25.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 3 EPYC 7F72 x 2 8 16 24 32 40 SE +/- 0.22, N = 3 SE +/- 0.56, N = 3 SE +/- 0.30, N = 7 32.52 32.83 32.85 MIN: 31.78 / MAX: 34.87 MIN: 31.34 / MAX: 36.46 MIN: 31.1 / MAX: 98.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 3 EPYC 7F72 x 2 3 6 9 12 15 SE +/- 0.13, N = 3 SE +/- 0.27, N = 3 SE +/- 0.13, N = 7 13.00 13.04 13.05 MIN: 12.56 / MAX: 14.64 MIN: 12.34 / MAX: 38.34 MIN: 12.38 / MAX: 54.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet EPYC 7F72 x 3 2 3 6 9 12 15 SE +/- 0.26, N = 3 SE +/- 0.23, N = 3 SE +/- 0.19, N = 6 9.16 9.25 9.42 MIN: 8.54 / MAX: 10.41 MIN: 8.56 / MAX: 12.4 MIN: 8.51 / MAX: 10.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 2 EPYC 7F72 x 3 5 10 15 20 25 SE +/- 0.06, N = 7 SE +/- 0.16, N = 3 SE +/- 0.23, N = 3 22.71 22.78 22.92 MIN: 22.12 / MAX: 25.08 MIN: 22.17 / MAX: 85.73 MIN: 22.33 / MAX: 105.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 2 EPYC 7F72 x 3 7 14 21 28 35 SE +/- 0.11, N = 7 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 30.14 30.20 30.27 MIN: 29.27 / MAX: 65.59 MIN: 29.67 / MAX: 32.8 MIN: 29.38 / MAX: 33.06 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 2 EPYC 7F72 EPYC 7F72 x 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 18.54 18.63 18.64 18.65
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search EPYC 7F72 3 2 EPYC 7F72 x 30 60 90 120 150 SE +/- 0.22, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 142.41 142.54 142.60 142.61 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA EPYC 7F72 EPYC 7F72 x 3 2 3 6 9 12 15 SE +/- 0.055, N = 3 SE +/- 0.028, N = 3 SE +/- 0.057, N = 3 SE +/- 0.053, N = 3 9.465 9.559 9.575 9.644 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 EPYC 7F72 EPYC 7F72 x 2 3 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.06, N = 3 SE +/- 0.60, N = 3 SE +/- 0.60, N = 3 103.69 103.86 104.45 104.65 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 2 EPYC 7F72 x 3 14 28 42 56 70 SE +/- 0.99, N = 3 SE +/- 0.88, N = 3 SE +/- 0.19, N = 3 60.77 61.04 61.39
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 2 EPYC 7F72 x 3 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.13, N = 3 SE +/- 0.09, N = 3 39.30 39.45 39.61
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 3 EPYC 7F72 x 2 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 24.26 24.28 24.28
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression EPYC 7F72 x 2 3 0.3713 0.7426 1.1139 1.4852 1.8565 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 1.65 1.65 1.65
Phoronix Test Suite v10.8.4