Pauls EPYC Redo 77422

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSU-iN (2.1b BIOS) and llvmpipe 504GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2006295-NI-PAULSEPYC47.

Pauls EPYC Redo 77422: system details for 2 x AMD EPYC 7742 64-Core

Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)
Motherboard: Supermicro H11DSU-iN (2.1b BIOS)
Chipset: AMD Starship/Matisse
Memory: 504GB
Disk: 2 x 3841GB Micron_9200_MTFDHAL3T8TCT
Graphics: llvmpipe 504GB
Network: 4 x Intel I350
OS: Ubuntu 20.04
Kernel: 5.4.0-39-generic (x86_64)
Desktop: GNOME Shell 3.36.2
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 3.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1024x768

Notes:
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301038
- Python 3.8.2
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Pauls EPYC Redo 77422: result overview for 2 x AMD EPYC 7742 64-Core

stream: Copy | 184819.7
stream: Scale | 175933.3
stream: Triad | 199693.6
stream: Add | 193641.0
npb: BT.C | 241482.30
npb: CG.C | 49999.81
npb: EP.C | 8140.38
npb: EP.D | 8461.55
npb: FT.C | 108907.76
npb: IS.D | 4353.14
npb: LU.C | 235826.23
npb: MG.C | 99974.26
npb: SP.B | 116398.05
namd: ATPase Simulation - 327,506 Atoms | 0.26084
fftw: Stock - 1D FFT Size 32 | 8832.6
fftw: Stock - 1D FFT Size 64 | 8205.9
fftw: Stock - 2D FFT Size 32 | 8897.8
fftw: Stock - 2D FFT Size 64 | 7528.5
fftw: Stock - 1D FFT Size 128 | 6767.9
fftw: Stock - 1D FFT Size 256 | 7122.2
fftw: Stock - 1D FFT Size 512 | 7147.7
fftw: Stock - 2D FFT Size 128 | 6999.0
fftw: Stock - 2D FFT Size 256 | 6813.5
fftw: Stock - 2D FFT Size 512 | 6795.5
fftw: Stock - 1D FFT Size 1024 | 7312.3
fftw: Stock - 1D FFT Size 2048 | 7079.1
fftw: Stock - 1D FFT Size 4096 | 6891.8
fftw: Stock - 2D FFT Size 1024 | 6253.3
fftw: Stock - 2D FFT Size 2048 | 5570.4
fftw: Stock - 2D FFT Size 4096 | 5318.4
fftw: Float + SSE - 1D FFT Size 32 | 12117
fftw: Float + SSE - 1D FFT Size 64 | 16517
fftw: Float + SSE - 2D FFT Size 32 | 35928
fftw: Float + SSE - 2D FFT Size 64 | 36665
fftw: Float + SSE - 1D FFT Size 128 | 21129
fftw: Float + SSE - 1D FFT Size 256 | 30630
fftw: Float + SSE - 1D FFT Size 512 | 40273
fftw: Float + SSE - 2D FFT Size 128 | 36088
fftw: Float + SSE - 2D FFT Size 256 | 33906
fftw: Float + SSE - 2D FFT Size 512 | 33462
fftw: Float + SSE - 1D FFT Size 1024 | 44655
fftw: Float + SSE - 1D FFT Size 2048 | 47018
fftw: Float + SSE - 1D FFT Size 4096 | 44174
fftw: Float + SSE - 2D FFT Size 1024 | 34269
fftw: Float + SSE - 2D FFT Size 2048 | 20771
fftw: Float + SSE - 2D FFT Size 4096 | 15715
byte: Dhrystone 2 | 35242187.2
byte: Integer Arithmetic | 1
byte: Register Arithmetic | 1
byte: Floating-Point Arithmetic | 1
onednn: IP Batch 1D - f32 - CPU | 2.06958
onednn: IP Batch All - f32 - CPU | 21.8730
onednn: IP Batch 1D - u8s8f32 - CPU | 3.14757
onednn: IP Batch All - u8s8f32 - CPU | 10.2041
onednn: Convolution Batch Shapes Auto - f32 - CPU | 0.731886
onednn: Deconvolution Batch deconv_1d - f32 - CPU | 2.92704
onednn: Deconvolution Batch deconv_3d - f32 - CPU | 2.74425
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 2.74691
onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPU | 2.21301
onednn: Deconvolution Batch deconv_3d - u8s8f32 - CPU | 1.20007
onednn: Recurrent Neural Network Training - f32 - CPU | 1004.117
onednn: Recurrent Neural Network Inference - f32 - CPU | 376.463
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.763937
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.827013
svt-av1: Enc Mode 0 - 1080p | 0.105
svt-av1: Enc Mode 4 - 1080p | 9.009
svt-av1: Enc Mode 8 - 1080p | 86.955
stockfish: Total Time | 236757913
build-llvm: Time To Compile | 195.574
c-ray: Total Time - 4K, 16 Rays Per Pixel | 6.048
y-cruncher: Calculating 500M Pi Digits | 14.267
openssl: RSA 4096-bit Performance | 24890.0
gromacs: Water Benchmark | 8.334
redis: LPOP | 2331103.25
redis: SADD | 1696127.25
redis: LPUSH | 1299705.47
redis: GET | 2174159.67
redis: SET | 1534110.27
sysbench: Memory | 4622114.7571
sysbench: CPU | 214875.5076
blender: BMW27 - OpenCL | 400.66
blender: BMW27 - CPU-Only | 27.09
blender: Classroom - OpenCL | 231.48
blender: Fishy Cat - OpenCL | 1048.86
blender: Barbershop - OpenCL | 588.93
blender: Classroom - CPU-Only | 46.72
blender: Fishy Cat - CPU-Only | 44.66
blender: Barbershop - CPU-Only | 144.17
blender: Pabellon Barcelona - OpenCL | 1088.31
blender: Pabellon Barcelona - CPU-Only | 73.77
pybench: Total For Average Test Times | 1153
mlpack: scikit_svm | 28.40
mlpack: scikit_linearridgeregression | 228.59

Stream

Type: Copy

Stream 2013-01-17 (MB/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 184819.7 (SE +/- 419.54, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Scale

Stream 2013-01-17 (MB/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 175933.3 (SE +/- 246.48, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Triad

Stream 2013-01-17 (MB/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 199693.6 (SE +/- 426.56, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Add

Stream 2013-01-17 (MB/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 193641.0 (SE +/- 482.59, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp
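
The SE figures above are easier to compare across tests when expressed as a percentage of the mean. The following is a minimal Python sketch that uses only the four Stream values reported above; the helper name `relative_se_percent` is illustrative and is not part of the Phoronix Test Suite.

```python
# Stream results copied from the report above: (mean in MB/s, standard error).
stream_results = {
    "Copy": (184819.7, 419.54),
    "Scale": (175933.3, 246.48),
    "Triad": (199693.6, 426.56),
    "Add": (193641.0, 482.59),
}


def relative_se_percent(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


for name, (mean, se) in stream_results.items():
    print(f"{name}: {relative_se_percent(mean, se):.2f}% relative SE")
```

All four Stream runs come out well below 1% relative SE, which suggests the five runs per test were consistent with one another.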

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 241482.30 (SE +/- 724.30, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 49999.81 (SE +/- 204.92, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 8140.38 (SE +/- 25.93, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 8461.55 (SE +/- 22.78, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 108907.76 (SE +/- 1202.48, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 4353.14 (SE +/- 63.92, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 235826.23 (SE +/- 1756.51, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 99974.26 (SE +/- 761.18, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x AMD EPYC 7742 64-Core: 116398.05 (SE +/- 1045.74, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.13 (days/ns, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 0.26084 (SE +/- 0.00168, N = 3)
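
NAMD reports days/ns, where fewer is better, while many molecular dynamics results are quoted in ns/day. The two are reciprocals, so a short Python sketch of the conversion follows; the function name `days_per_ns_to_ns_per_day` is hypothetical and only the 0.26084 figure comes from this report.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Invert a days/ns figure into ns/day (where more is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


namd_days_per_ns = 0.26084  # NAMD ATPase result reported above
print(f"{days_per_ns_to_ns_per_day(namd_days_per_ns):.3f} ns/day")
```

The converted figure describes the ATPase workload only; it is not directly comparable with ns/day results from other simulation packages or inputs.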

FFTW

Build: Stock - Size: 1D FFT Size 32

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 8832.6 (SE +/- 10.99, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 64

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 8205.9 (SE +/- 2.97, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 8897.8 (SE +/- 1.42, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 7528.5 (SE +/- 6.03, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 128

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 6767.9 (SE +/- 58.26, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 256

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 7122.2 (SE +/- 23.45, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 512

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 7147.7 (SE +/- 42.17, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 128

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 6999.0 (SE +/- 28.47, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 256

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 6813.5 (SE +/- 19.12, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 6795.5 (SE +/- 4.84, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 7312.3 (SE +/- 8.17, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 7079.1 (SE +/- 15.92, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 6891.8 (SE +/- 9.70, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 6253.3 (SE +/- 18.31, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 5570.4 (SE +/- 34.05, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 5318.4 (SE +/- 9.37, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 12117 (SE +/- 111.25, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 16517 (SE +/- 91.83, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 35928 (SE +/- 39.55, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 36665 (SE +/- 174.39, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 21129 (SE +/- 72.95, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 30630 (SE +/- 368.26, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 40273 (SE +/- 353.76, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 36088 (SE +/- 142.06, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 33906 (SE +/- 48.79, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 33462 (SE +/- 136.33, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 44655 (SE +/- 730.08, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 47018 (SE +/- 191.44, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 44174 (SE +/- 456.79, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 34269 (SE +/- 260.64, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 20771 (SE +/- 268.11, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6 (Mflops, More Is Better)
2 x AMD EPYC 7742 64-Core: 15715 (SE +/- 193.22, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
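
The 32 FFTW results above can be paired by transform to see how the Float + SSE build compares with the Stock build. The following Python sketch computes that ratio from the Mflops values in this report; the dictionary layout and the `speedup` helper are illustrative choices. The build names suggest the two builds may differ in floating-point precision as well as SIMD use, so the ratio should not be read as a pure SIMD speedup.

```python
# Mflops values copied from the FFTW 3.3.6 results above,
# keyed by (dimension, transform size).
stock = {
    ("1D", 32): 8832.6, ("1D", 64): 8205.9, ("1D", 128): 6767.9,
    ("1D", 256): 7122.2, ("1D", 512): 7147.7, ("1D", 1024): 7312.3,
    ("1D", 2048): 7079.1, ("1D", 4096): 6891.8,
    ("2D", 32): 8897.8, ("2D", 64): 7528.5, ("2D", 128): 6999.0,
    ("2D", 256): 6813.5, ("2D", 512): 6795.5, ("2D", 1024): 6253.3,
    ("2D", 2048): 5570.4, ("2D", 4096): 5318.4,
}
float_sse = {
    ("1D", 32): 12117, ("1D", 64): 16517, ("1D", 128): 21129,
    ("1D", 256): 30630, ("1D", 512): 40273, ("1D", 1024): 44655,
    ("1D", 2048): 47018, ("1D", 4096): 44174,
    ("2D", 32): 35928, ("2D", 64): 36665, ("2D", 128): 36088,
    ("2D", 256): 33906, ("2D", 512): 33462, ("2D", 1024): 34269,
    ("2D", 2048): 20771, ("2D", 4096): 15715,
}


def speedup(key):
    """Ratio of Float + SSE throughput to Stock throughput for one transform."""
    return float_sse[key] / stock[key]


ratios = {key: speedup(key) for key in stock}
for (dim, size), ratio in sorted(ratios.items()):
    print(f"{dim} FFT size {size}: {ratio:.2f}x")
```

On these numbers the smallest gain is for the 1D size 32 transform and the largest for the 1D size 2048 transform.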

BYTE Unix Benchmark

Computational Test: Dhrystone 2

BYTE Unix Benchmark 3.6 (LPS, More Is Better)
2 x AMD EPYC 7742 64-Core: 35242187.2 (SE +/- 362953.65, N = 8)

BYTE Unix Benchmark

Computational Test: Integer Arithmetic

BYTE Unix Benchmark 3.6 (LPS, More Is Better)
2 x AMD EPYC 7742 64-Core: 1

BYTE Unix Benchmark

Computational Test: Register Arithmetic

BYTE Unix Benchmark 3.6 (LPS, More Is Better)
2 x AMD EPYC 7742 64-Core: 1

BYTE Unix Benchmark

Computational Test: Floating-Point Arithmetic

BYTE Unix Benchmark 3.6 (LPS, More Is Better)
2 x AMD EPYC 7742 64-Core: 1

oneDNN

Harness: IP Batch 1D - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 2.06958 (SE +/- 0.01313, N = 3, MIN: 1.84)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 21.87 (SE +/- 0.08, N = 3, MIN: 18.64)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 3.14757 (SE +/- 0.01409, N = 3, MIN: 2.84)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 10.20 (SE +/- 0.02, N = 3, MIN: 9.48)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 0.731886 (SE +/- 0.000571, N = 3, MIN: 0.67)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 2.92704 (SE +/- 0.00686, N = 3, MIN: 2.66)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 2.74425 (SE +/- 0.03237, N = 5, MIN: 2.43)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 2.74691 (SE +/- 0.19711, N = 12, MIN: 0.79)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 2.21301 (SE +/- 0.00661, N = 3, MIN: 2.04)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 1.20007 (SE +/- 0.01456, N = 3, MIN: 1.06)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 1004.12 (SE +/- 11.67, N = 3, MIN: 890.7)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 376.46 (SE +/- 0.53, N = 3, MIN: 349.42)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 0.763937 (SE +/- 0.010853, N = 3, MIN: 0.68)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 0.827013 (SE +/- 0.002264, N = 3, MIN: 0.75)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 0.105 (SE +/- 0.000, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 9.009 (SE +/- 0.025, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 86.96 (SE +/- 0.98, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Stockfish

Total Time

Stockfish 9 (Nodes Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 236757913 (SE +/- 1766557.68, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 10.0 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 195.57 (SE +/- 0.53, N = 3)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 6.048 (SE +/- 0.010, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

Y-Cruncher

Calculating 500M Pi Digits

Y-Cruncher 0.7.8.9503 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 14.27 (SE +/- 0.02, N = 3)

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1 (Signs Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 24890.0 (SE +/- 24.94, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

GROMACS

Water Benchmark

GROMACS 2020.1 (Ns Per Day, More Is Better)
2 x AMD EPYC 7742 64-Core: 8.334 (SE +/- 0.104, N = 5)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Redis

Test: LPOP

Redis 5.0.5 (Requests Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 2331103.25 (SE +/- 10817.25, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

Redis 5.0.5 (Requests Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 1696127.25 (SE +/- 24020.68, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

Redis 5.0.5 (Requests Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 1299705.47 (SE +/- 14918.14, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

Redis 5.0.5 (Requests Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 2174159.67 (SE +/- 16374.26, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

Redis 5.0.5 (Requests Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 1534110.27 (SE +/- 21774.01, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Sysbench

Test: Memory

Sysbench 2018-07-28 (Events Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 4622114.76 (SE +/- 2131.08, N = 3)
1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

Sysbench 2018-07-28 (Events Per Second, More Is Better)
2 x AMD EPYC 7742 64-Core: 214875.51 (SE +/- 234.83, N = 3)
1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Blender

Blend File: BMW27 - Compute: OpenCL

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 400.66 (SE +/- 1.38, N = 3)

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 27.09 (SE +/- 0.15, N = 3)

Blender

Blend File: Classroom - Compute: OpenCL

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 231.48 (SE +/- 1.50, N = 3)

Blender

Blend File: Fishy Cat - Compute: OpenCL

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 1048.86 (SE +/- 1.78, N = 3)

Blender

Blend File: Barbershop - Compute: OpenCL

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 588.93 (SE +/- 4.46, N = 3)

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 46.72 (SE +/- 0.10, N = 3)

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 44.66 (SE +/- 0.38, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 144.17 (SE +/- 0.47, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: OpenCL

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 1088.31 (SE +/- 4.64, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 73.77 (SE +/- 0.19, N = 3)
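
Each Blender scene above was rendered twice, once with OpenCL and once CPU-only, so the two times can be compared per scene. The Python sketch below computes how many times longer the OpenCL run took, using only the render times from this report; the variable names are illustrative. The system lists llvmpipe as its graphics adapter, so the OpenCL runs were presumably not executed on a discrete GPU, although the report does not say which OpenCL device was used.

```python
# Render times in seconds copied from the Blender 2.82 results above:
# scene -> (OpenCL time, CPU-only time).
blender_seconds = {
    "BMW27": (400.66, 27.09),
    "Classroom": (231.48, 46.72),
    "Fishy Cat": (1048.86, 44.66),
    "Barbershop": (588.93, 144.17),
    "Pabellon Barcelona": (1088.31, 73.77),
}

# Ratio > 1 means the OpenCL run was slower than the CPU-only run.
opencl_slowdown = {
    scene: opencl / cpu for scene, (opencl, cpu) in blender_seconds.items()
}

for scene, ratio in opencl_slowdown.items():
    print(f"{scene}: OpenCL run took {ratio:.1f}x the CPU-only time")
```

In this run every scene rendered faster CPU-only, with the gap ranging from roughly 4x (Barbershop) to more than 20x (Fishy Cat).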

PyBench

Total For Average Test Times

PyBench 2018-02-16 (Milliseconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 1153 (SE +/- 2.52, N = 3)

Mlpack Benchmark

Benchmark: scikit_svm

Mlpack Benchmark (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 28.40 (SE +/- 0.60, N = 15)

Mlpack Benchmark

Benchmark: scikit_linearridgeregression

Mlpack Benchmark (Seconds, Fewer Is Better)
2 x AMD EPYC 7742 64-Core: 228.59 (SE +/- 2.20, N = 3)


Phoronix Test Suite v10.8.4