epyc-7763-august

2 x AMD EPYC 7763 64-Core testing with an AMD DAYTONA_X (RYM1001D BIOS) and ASPEED on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2108304-TJ-EPYC7763A02&grr.

epyc-7763-august

Runs: 1, 1A, A

Processor: 2 x AMD EPYC 7763 64-Core @ 2.45GHz (128 Cores / 256 Threads)
Motherboard: AMD DAYTONA_X (RYM1001D BIOS)
Chipset: AMD Starship/Matisse
Memory: 16 x 32 GB DDR4-3200MT/s 36ASF4G72PZ-3G2E2
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 20.10
Kernel: 5.8.0-29-generic (x86_64)
Display Server: X Server 1.20.8
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa001119

Python Details
- Python 2.7.18 + Python 3.8.6

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

epyc-7763-august result overview ("-" means no result for that run)

Test | 1 | 1A | A
openvkl: vklBenchmark ISPC | - | 241 | -
openvkl: vklBenchmark Scalar | - | 157 | -
lczero: Eigen | 4569 | 4737 | 4631
mysqlslap: 4096 | - | 154 | 158
mysqlslap: 2048 | - | 159 | 158
ecp-candle: P3B1 | - | 923.897 | -
ecp-candle: P3B2 | - | 734.707 | -
build-gcc: Time To Compile | - | 702.2 | 703.426
mysqlslap: 1024 | - | 289 | 305
qe: AUSURF112 | 391.38 | 391.57 | 391.08
lczero: BLAS | 2333 | 2278 | 2369
mysqlslap: 256 | - | 473 | 446
mysqlslap: 512 | - | 481 | 476
ncnn: CPU - regnety_400m | - | 124.78 | 136.67
ncnn: CPU - squeezenet_ssd | - | 66.71 | 92.81
ncnn: CPU - yolov4-tiny | - | 50.18 | 46.54
ncnn: CPU - resnet50 | - | 124.4 | 162.18
ncnn: CPU - alexnet | - | 21 | 19.84
ncnn: CPU - resnet18 | - | 66.48 | 62.64
ncnn: CPU - vgg16 | - | 43.42 | 43.79
ncnn: CPU - googlenet | - | 97.44 | 108.38
ncnn: CPU - blazeface | - | 25.05 | 21.87
ncnn: CPU - efficientnet-b0 | - | 52.23 | 78.59
ncnn: CPU - mnasnet | - | 45.63 | 67.64
ncnn: CPU - shufflenet-v2 | - | 35.61 | 48.39
ncnn: CPU-v3-v3 - mobilenet-v3 | - | 64.19 | 58.12
ncnn: CPU-v2-v2 - mobilenet-v2 | - | 69.73 | 85.97
ncnn: CPU - mobilenet | - | 85.22 | 113.73
natron: Spaceship | - | 0.7 | -
onnx: bertsquad-10 - OpenMP CPU | - | 323 | 262
onnx: fcn-resnet101-11 - OpenMP CPU | - | 226 | 239
onnx: yolov4 - OpenMP CPU | - | 244 | 228
onnx: shufflenet-v2-10 - OpenMP CPU | - | 2957 | 4417
onnx: super-resolution-10 - OpenMP CPU | - | 3797 | 4414
pgbench: 100 - 250 - Read Only - Average Latency | - | 0.335 | 0.33
pgbench: 100 - 250 - Read Only | - | 746294 | 759118
pgbench: 100 - 250 - Read Write - Average Latency | - | 4.722 | 4.774
pgbench: 100 - 250 - Read Write | - | 52975 | 52409
apache: 1000 | - | 99235.71 | -
nginx: 1000 | - | 103561.4 | -
apache: 500 | - | 75345.28 | -
nginx: 200 | - | 101036.6 | -
apache: 200 | - | 80769.81 | -
nginx: 500 | - | 101706.25 | -
nginx: 20 | - | 207511.7 | -
nginx: 100 | - | 114598.28 | -
apache: 100 | - | 88589.86 | -
apache: 20 | - | 23300.66 | -
apache: 1 | - | 8006.53 | -
nginx: 1 | - | 59456.09 | -
yafaray: Total Time For Sample Scene | - | 52.443 | 54.367
ecp-candle: P1B2 | - | 42.214 | 41.979
synthmark: VoiceMark_100 | - | 764.681 | 764.227
build-linux-kernel: Time To Compile | - | 20.69 | 21.084
tachyon: Total Time | - | 8.8293 | 9.4203

OpenVKL

Benchmark: vklBenchmark ISPC

Items / Sec, More Is Better (OpenVKL 1.0)
1A: 241 (MIN: 20 / MAX: 3649)

OpenVKL

Benchmark: vklBenchmark Scalar

Items / Sec, More Is Better (OpenVKL 1.0)
1A: 157 (MIN: 14 / MAX: 3889)

LeelaChessZero

Backend: Eigen

Nodes Per Second, More Is Better (LeelaChessZero 0.28)
1: 4569
1A: 4737
A: 4631
SE +/- 46.03, N = 9
(CXX) g++ options: -flto -pthread

MariaDB

Clients: 4096

Queries Per Second, More Is Better (MariaDB 10.6.4)
1A: 154
A: 158
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 2048

Queries Per Second, More Is Better (MariaDB 10.6.4)
1A: 159
A: 158
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

ECP-CANDLE

Benchmark: P3B1

Seconds, Fewer Is Better (ECP-CANDLE 0.4)
1A: 923.90

ECP-CANDLE

Benchmark: P3B2

Seconds, Fewer Is Better (ECP-CANDLE 0.4)
1A: 734.71

Timed GCC Compilation

Time To Compile

Seconds, Fewer Is Better (Timed GCC Compilation 11.2.0)
1A: 702.20
A: 703.43

MariaDB

Clients: 1024

Queries Per Second, More Is Better (MariaDB 10.6.4)
1A: 289
A: 305
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Quantum ESPRESSO

Input: AUSURF112

Seconds, Fewer Is Better (Quantum ESPRESSO 6.8)
1: 391.38
1A: 391.57
A: 391.08
SE +/- 0.44, N = 3
(F9X) gfortran options: -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

LeelaChessZero

Backend: BLAS

Nodes Per Second, More Is Better (LeelaChessZero 0.28)
1: 2333
1A: 2278
A: 2369
SE +/- 15.19, N = 3
(CXX) g++ options: -flto -pthread

MariaDB

Clients: 256

Queries Per Second, More Is Better (MariaDB 10.6.4)
1A: 473
A: 446
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 512

Queries Per Second, More Is Better (MariaDB 10.6.4)
1A: 481
A: 476
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

NCNN

Target: CPU - Model: regnety_400m

ms, Fewer Is Better (NCNN 20210720)
1A: 124.78 (MIN: 116.95 / MAX: 228.04)
A: 136.67 (MIN: 125.25 / MAX: 303.67)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

ms, Fewer Is Better (NCNN 20210720)
1A: 66.71 (MIN: 44.2 / MAX: 181.84)
A: 92.81 (MIN: 48.28 / MAX: 387.26)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

ms, Fewer Is Better (NCNN 20210720)
1A: 50.18 (MIN: 37.51 / MAX: 193.66)
A: 46.54 (MIN: 39.98 / MAX: 77.54)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

ms, Fewer Is Better (NCNN 20210720)
1A: 124.40 (MIN: 98.76 / MAX: 475.79)
A: 162.18 (MIN: 52.14 / MAX: 4375.39)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

ms, Fewer Is Better (NCNN 20210720)
1A: 21.00 (MIN: 11.36 / MAX: 68.88)
A: 19.84 (MIN: 10.82 / MAX: 25.99)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

ms, Fewer Is Better (NCNN 20210720)
1A: 66.48 (MIN: 45.24 / MAX: 132.36)
A: 62.64 (MIN: 45.73 / MAX: 145.46)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

ms, Fewer Is Better (NCNN 20210720)
1A: 43.42 (MIN: 39.84 / MAX: 140.5)
A: 43.79 (MIN: 41.44 / MAX: 92.12)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

ms, Fewer Is Better (NCNN 20210720)
1A: 97.44 (MIN: 60.85 / MAX: 184.47)
A: 108.38 (MIN: 99.3 / MAX: 165.9)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

ms, Fewer Is Better (NCNN 20210720)
1A: 25.05 (MIN: 23.86 / MAX: 124.93)
A: 21.87 (MIN: 21.3 / MAX: 26.9)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

ms, Fewer Is Better (NCNN 20210720)
1A: 52.23 (MIN: 40.56 / MAX: 157.8)
A: 78.59 (MIN: 52.57 / MAX: 140.74)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

ms, Fewer Is Better (NCNN 20210720)
1A: 45.63 (MIN: 28.82 / MAX: 1357.2)
A: 67.64 (MIN: 48.15 / MAX: 115.93)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

ms, Fewer Is Better (NCNN 20210720)
1A: 35.61 (MIN: 28.97 / MAX: 58.66)
A: 48.39 (MIN: 46.18 / MAX: 68.32)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

ms, Fewer Is Better (NCNN 20210720)
1A: 64.19 (MIN: 28.21 / MAX: 3203.55)
A: 58.12 (MIN: 38.26 / MAX: 716.32)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

ms, Fewer Is Better (NCNN 20210720)
1A: 69.73 (MIN: 34.36 / MAX: 3978)
A: 85.97 (MIN: 36.96 / MAX: 1800.22)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mobilenet

ms, Fewer Is Better (NCNN 20210720)
1A: 85.22 (MIN: 67.31 / MAX: 288.4)
A: 113.73 (MIN: 55.24 / MAX: 449.93)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

Natron

Input: Spaceship

FPS, More Is Better (Natron 2.4)
1A: 0.7

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.8.2)
1A: 323
A: 262
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.8.2)
1A: 226
A: 239
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.8.2)
1A: 244
A: 228
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.8.2)
1A: 2957
A: 4417
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.8.2)
1A: 3797
A: 4414
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

ms, Fewer Is Better (PostgreSQL pgbench 13.0)
1A: 0.335
A: 0.330
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

TPS, More Is Better (PostgreSQL pgbench 13.0)
1A: 746294
A: 759118
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

ms, Fewer Is Better (PostgreSQL pgbench 13.0)
1A: 4.722
A: 4.774
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

TPS, More Is Better (PostgreSQL pgbench 13.0)
1A: 52975
A: 52409
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Apache HTTP Server

Concurrent Requests: 1000

Requests Per Second, More Is Better (Apache HTTP Server 2.4.48)
1A: 99235.71
(CC) gcc options: -shared -fPIC -O2 -pthread

nginx

Concurrent Requests: 1000

Requests Per Second, More Is Better (nginx 1.21.1)
1A: 103561.4
(CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 500

Requests Per Second, More Is Better (Apache HTTP Server 2.4.48)
1A: 75345.28
(CC) gcc options: -shared -fPIC -O2 -pthread

nginx

Concurrent Requests: 200

Requests Per Second, More Is Better (nginx 1.21.1)
1A: 101036.6
(CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 200

Requests Per Second, More Is Better (Apache HTTP Server 2.4.48)
1A: 80769.81
(CC) gcc options: -shared -fPIC -O2 -pthread

nginx

Concurrent Requests: 500

Requests Per Second, More Is Better (nginx 1.21.1)
1A: 101706.25
(CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 20

Requests Per Second, More Is Better (nginx 1.21.1)
1A: 207511.7
(CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 100

Requests Per Second, More Is Better (nginx 1.21.1)
1A: 114598.28
(CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 100

Requests Per Second, More Is Better (Apache HTTP Server 2.4.48)
1A: 88589.86
(CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 20

Requests Per Second, More Is Better (Apache HTTP Server 2.4.48)
1A: 23300.66
(CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 1

Requests Per Second, More Is Better (Apache HTTP Server 2.4.48)
1A: 8006.53
(CC) gcc options: -shared -fPIC -O2 -pthread

nginx

Concurrent Requests: 1

Requests Per Second, More Is Better (nginx 1.21.1)
1A: 59456.09
(CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

YafaRay

Total Time For Sample Scene

Seconds, Fewer Is Better (YafaRay 3.5.1)
1A: 52.44
A: 54.37
(CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype

ECP-CANDLE

Benchmark: P1B2

Seconds, Fewer Is Better (ECP-CANDLE 0.4)
1A: 42.21
A: 41.98

Google SynthMark

Test: VoiceMark_100

Voices, More Is Better (Google SynthMark 20201109)
1A: 764.68
A: 764.23
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Timed Linux Kernel Compilation

Time To Compile

Seconds, Fewer Is Better (Timed Linux Kernel Compilation 5.14)
1A: 20.69
A: 21.08

Tachyon

Total Time

Seconds, Fewer Is Better (Tachyon 0.99b6)
1A: 8.8293
A: 9.4203
(CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread


Phoronix Test Suite v10.8.4