epyc-7763-august

2 x AMD EPYC 7763 64-Core testing with a AMD DAYTONA_X (RYM1001D BIOS) and ASPEED on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2108304-TJ-EPYC7763A02&grs.

epyc-7763-august ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDisplay ServerCompilerFile-SystemScreen Resolution11AA2 x AMD EPYC 7763 64-Core @ 2.45GHz (128 Cores / 256 Threads)AMD DAYTONA_X (RYM1001D BIOS)AMD Starship/Matisse16 x 32 GB DDR4-3200MT/s 36ASF4G72PZ-3G2E23841GB Micron_9300_MTFDHAL3T8TDPASPEEDVE2282 x Mellanox MT27710Ubuntu 20.105.8.0-29-generic (x86_64)X Server 1.20.8GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001119 Python Details- Python 2.7.18 + Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

epyc-7763-august ncnn: CPU - efficientnet-b0onnx: shufflenet-v2-10 - OpenMP CPUncnn: CPU - mnasnetncnn: CPU - squeezenet_ssdncnn: CPU - shufflenet-v2ncnn: CPU - mobilenetncnn: CPU - resnet50ncnn: CPU-v2-v2 - mobilenet-v2onnx: bertsquad-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - regnety_400mncnn: CPU - yolov4-tinyonnx: yolov4 - OpenMP CPUtachyon: Total Timencnn: CPU - resnet18mysqlslap: 256ncnn: CPU - alexnetonnx: fcn-resnet101-11 - OpenMP CPUmysqlslap: 1024lczero: BLASlczero: Eigenyafaray: Total Time For Sample Scenemysqlslap: 4096build-linux-kernel: Time To Compilepgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Write - Average Latencypgbench: 100 - 250 - Read Writemysqlslap: 512ncnn: CPU - vgg16mysqlslap: 2048ecp-candle: P1B2build-gcc: Time To Compileqe: AUSURF112synthmark: VoiceMark_100apache: 1000apache: 500apache: 200apache: 100apache: 20apache: 1natron: Spaceshipnginx: 1000nginx: 500nginx: 200nginx: 100nginx: 20nginx: 1ecp-candle: P3B2ecp-candle: P3B1openvkl: vklBenchmark Scalaropenvkl: vklBenchmark ISPC11AA23334569391.3852.23295745.6366.7135.6185.22124.469.73323379725.0597.4464.19124.7850.182448.829366.48473212262892278473752.44315420.697462940.3354.7225297548143.4215942.214702.2391.57764.68199235.7175345.2880769.8188589.8623300.668006.530.7103561.4101706.25101036.6114598.28207511.759456.09734.707923.89715724178.59441767.6492.8148.39113.73162.1885.97262441421.87108.3858.12136.6746.542289.420362.6444619.842393052369463154.36715821.0847591180.334.7745240947643.7915841.979703.426391.08764.227OpenBenchmarking.org

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b01AA2040608010052.2378.59MIN: 40.56 / MAX: 157.8MIN: 52.57 / MAX: 140.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.8.2Model: shufflenet-v2-10 - Device: OpenMP CPU1AA9001800270036004500295744171. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnet1AA153045607545.6367.64MIN: 28.82 / MAX: 1357.2MIN: 48.15 / MAX: 115.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssd1AA2040608010066.7192.81MIN: 44.2 / MAX: 181.84MIN: 48.28 / MAX: 387.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v21AA112233445535.6148.39MIN: 28.97 / MAX: 58.66MIN: 46.18 / MAX: 68.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenet1AA30609012015085.22113.73MIN: 67.31 / MAX: 288.4MIN: 55.24 / MAX: 449.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet501AA4080120160200124.40162.18MIN: 98.76 / MAX: 475.79MIN: 52.14 / MAX: 4375.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v2-v2 - Model: mobilenet-v21AA2040608010069.7385.97MIN: 34.36 / MAX: 3978MIN: 36.96 / MAX: 1800.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.8.2Model: bertsquad-10 - Device: OpenMP CPU1AA701402102803503232621. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.8.2Model: super-resolution-10 - Device: OpenMP CPU1AA9001800270036004500379744141. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazeface1AA61218243025.0521.87MIN: 23.86 / MAX: 124.93MIN: 21.3 / MAX: 26.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenet1AA2040608010097.44108.38MIN: 60.85 / MAX: 184.47MIN: 99.3 / MAX: 165.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v3-v3 - Model: mobilenet-v31AA142842567064.1958.12MIN: 28.21 / MAX: 3203.55MIN: 38.26 / MAX: 716.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400m1AA306090120150124.78136.67MIN: 116.95 / MAX: 228.04MIN: 125.25 / MAX: 303.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tiny1AA112233445550.1846.54MIN: 37.51 / MAX: 193.66MIN: 39.98 / MAX: 77.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.8.2Model: yolov4 - Device: OpenMP CPU1AA501001502002502442281. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total Time1AA36912158.82939.42031. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet181AA153045607566.4862.64MIN: 45.24 / MAX: 132.36MIN: 45.73 / MAX: 145.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

MariaDB

Clients: 256

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 2561AA1002003004005004734461. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnet1AA51015202521.0019.84MIN: 11.36 / MAX: 68.88MIN: 10.82 / MAX: 25.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.8.2Model: fcn-resnet101-11 - Device: OpenMP CPU1AA501001502002502262391. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

MariaDB

Clients: 1024

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 10241AA701402102803502893051. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLAS11AA5001000150020002500SE +/- 15.19, N = 32333227823691. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigen11AA10002000300040005000SE +/- 46.03, N = 94569473746311. (CXX) g++ options: -flto -pthread

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.5.1Total Time For Sample Scene1AA122436486052.4454.371. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 40961AA3060901201501541581. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To Compile1AA51015202520.6921.08

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only1AA160K320K480K640K800K7462947591181. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency1AA0.07540.15080.22620.30160.3770.3350.3301. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency1AA1.07422.14843.22264.29685.3714.7224.7741. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write1AA11K22K33K44K55K52975524091. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

MariaDB

Clients: 512

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 5121AA1002003004005004814761. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: vgg161AA102030405043.4243.79MIN: 39.84 / MAX: 140.5MIN: 41.44 / MAX: 92.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 20481AA40801201602001591581. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -pthread -llzma -lbz2 -lsnappy -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

ECP-CANDLE

Benchmark: P1B2

OpenBenchmarking.orgSeconds, Fewer Is BetterECP-CANDLE 0.4Benchmark: P1B21AA102030405042.2141.98

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 11.2.0Time To Compile1AA150300450600750702.20703.43

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.8Input: AUSURF11211AA80160240320400SE +/- 0.44, N = 3391.38391.57391.081. (F9X) gfortran options: -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_1001AA160320480640800764.68764.231. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 10001A20K40K60K80K100K99235.711. (CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 5001A16K32K48K64K80K75345.281. (CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 2001A20K40K60K80K100K80769.811. (CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1001A20K40K60K80K100K88589.861. (CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 20

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 201A5K10K15K20K25K23300.661. (CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 1

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 11A2K4K6K8K10K8006.531. (CC) gcc options: -shared -fPIC -O2 -pthread

Natron

Input: Spaceship

OpenBenchmarking.orgFPS, More Is BetterNatron 2.4Input: Spaceship1A0.15750.3150.47250.630.78750.7

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 10001A20K40K60K80K100K103561.41. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 5001A20K40K60K80K100K101706.251. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 2001A20K40K60K80K100K101036.61. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1001A20K40K60K80K100K114598.281. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 20

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 201A40K80K120K160K200K207511.71. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 11A13K26K39K52K65K59456.091. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

ECP-CANDLE

Benchmark: P3B2

OpenBenchmarking.orgSeconds, Fewer Is BetterECP-CANDLE 0.4Benchmark: P3B21A160320480640800734.71

ECP-CANDLE

Benchmark: P3B1

OpenBenchmarking.orgSeconds, Fewer Is BetterECP-CANDLE 0.4Benchmark: P3B11A2004006008001000923.90

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark Scalar1A306090120150157MIN: 14 / MAX: 3889

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPC1A50100150200250241MIN: 20 / MAX: 3649


Phoronix Test Suite v10.8.4