EPYC 7572 again

2 x AMD EPYC 7F52 16-Core testing with a Supermicro H11DSU-iN (2.1b BIOS) and llvmpipe 504GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2006302-NI-EPYC7572A69.

EPYC 7572 againProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution2 x AMD EPYC 7F52 16-Core2 x AMD EPYC 7F52 16-Core @ 3.50GHz (32 Cores / 64 Threads)Supermicro H11DSU-iN (2.1b BIOS)AMD Starship/Matisse504GB2 x 3841GB Micron_9200_MTFDHAL3T8TCTllvmpipe 504GB4 x Intel I350Ubuntu 20.045.4.0-39-generic (x86_64)GNOME Shell 3.36.2X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits)GCC 9.3.0ext41024x768OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301038- Python 3.8.2- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7572 againstream: Copystream: Scalestream: Triadstream: Addnpb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnamd: ATPase Simulation - 327,506 Atomsfftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 64fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 128fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 512fftw: Stock - 2D FFT Size 128fftw: Stock - 2D FFT Size 256fftw: Stock - 2D FFT Size 512fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096byte: Dhrystone 2byte: Integer Arithmeticbyte: Register Arithmeticbyte: Floating-Point Arithmeticonednn: IP Batch 1D - f32 - CPUonednn: IP Batch All - f32 - CPUonednn: IP Batch 1D - u8s8f32 - CPUonednn: IP Batch All - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch deconv_1d - f32 - CPUonednn: Deconvolution Batch deconv_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch deconv_1d - u8s8f32 - CPUonednn: Deconvolution Batch deconv_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUsvt-av1: Enc Mode 0 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 8 - 1080pstockfish: Total Timebuild-llvm: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixely-cruncher: Calculating 500M Pi Digitsopenssl: RSA 4096-bit Performancegromacs: Water Benchmarkredis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETsysbench: Memorysysbench: CPUblender: BMW27 - OpenCLblender: BMW27 - CPU-Onlyblender: Classroom - OpenCLblender: Fishy Cat - OpenCLblender: Barbershop - OpenCLblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - OpenCLblender: Pabellon Barcelona - CPU-Onlypybench: Total For Average Test Timesgeekbench: CPU Multi Coregeekbench: CPU Multi Core - Gaussian Blurgeekbench: CPU Multi Core - Face Detectiongeekbench: CPU Multi Core - Horizon Detectiongeekbench: CPU Single Coregeekbench: CPU Single Core - Gaussian Blurgeekbench: CPU Single Core - Face Detectiongeekbench: CPU Single Core - Horizon Detectionmlpack: scikit_svmmlpack: scikit_linearridgeregression2 x AMD EPYC 7F52 16-Core188965.1187535.1201925.7198645.2118641.4345035.902804.162820.7276311.272322.00149391.65101748.74107910.880.55141101128133.810132.68638.17667.38166.98242.87999.97777.57795.18389.18126.47888.27237.56298.65816.21390518890413254186424196352624719041126386683806250863537445072738689235271671840571528.31111.6152516.17341.5567711.67741.087442.482913.511124.615232.016121.68715387.563143.1310.5633180.9051180.1177.60160.23382531491220.72616.51710.0029127.24.4581820952.001722705.421381357.81823659.701569169.372324848.423765234.8191354.5445.78208.39934.25481.97115.8071.29209.03957.25152.281010328441.46336.2889.4117973.79.6127.225.69198.64OpenBenchmarking.org

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copy2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 2294.95, N = 6188965.11. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scale2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 809.33, N = 5187535.11. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triad2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 2507.91, N = 5201925.71. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Add2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 1962.43, N = 5198645.21. (CC) gcc options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x AMD EPYC 7F52 16-Core30K60K90K120K150KSE +/- 1226.95, N = 3118641.431. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x AMD EPYC 7F52 16-Core10K20K30K40K50KSE +/- 745.22, N = 345035.901. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x AMD EPYC 7F52 16-Core6001200180024003000SE +/- 4.22, N = 32804.161. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x AMD EPYC 7F52 16-Core6001200180024003000SE +/- 0.48, N = 32820.721. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x AMD EPYC 7F52 16-Core16K32K48K64K80KSE +/- 866.26, N = 1576311.271. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x AMD EPYC 7F52 16-Core5001000150020002500SE +/- 0.97, N = 32322.001. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x AMD EPYC 7F52 16-Core30K60K90K120K150KSE +/- 1021.34, N = 3149391.651. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x AMD EPYC 7F52 16-Core20K40K60K80K100KSE +/- 1207.93, N = 3101748.741. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x AMD EPYC 7F52 16-Core20K40K60K80K100KSE +/- 268.84, N = 3107910.881. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13ATPase Simulation - 327,506 Atoms2 x AMD EPYC 7F52 16-Core0.12410.24820.37230.49640.6205SE +/- 0.00120, N = 30.55141

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 322 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 16.38, N = 3101121. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 642 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 206.06, N = 158133.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 322 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 76.91, N = 310132.61. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 642 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 6.64, N = 38638.11. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1282 x AMD EPYC 7F52 16-Core16003200480064008000SE +/- 9.78, N = 37667.31. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2562 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 27.08, N = 38166.91. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 5122 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 6.51, N = 38242.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1282 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 9.09, N = 37999.91. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2562 x AMD EPYC 7F52 16-Core17003400510068008500SE +/- 24.14, N = 37777.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 5122 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 13.54, N = 37795.11. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 10242 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 13.16, N = 38389.11. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 20482 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 9.33, N = 38126.41. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 40962 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 11.13, N = 37888.21. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 10242 x AMD EPYC 7F52 16-Core16003200480064008000SE +/- 82.79, N = 37237.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 20482 x AMD EPYC 7F52 16-Core14002800420056007000SE +/- 17.73, N = 36298.61. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 40962 x AMD EPYC 7F52 16-Core12002400360048006000SE +/- 49.51, N = 35816.21. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 322 x AMD EPYC 7F52 16-Core3K6K9K12K15KSE +/- 96.11, N = 3139051. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 642 x AMD EPYC 7F52 16-Core4K8K12K16K20KSE +/- 25.20, N = 3188901. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 322 x AMD EPYC 7F52 16-Core9K18K27K36K45KSE +/- 10.71, N = 3413251. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 642 x AMD EPYC 7F52 16-Core9K18K27K36K45KSE +/- 127.77, N = 3418641. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1282 x AMD EPYC 7F52 16-Core5K10K15K20K25KSE +/- 30.60, N = 3241961. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2562 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 408.77, N = 3352621. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 5122 x AMD EPYC 7F52 16-Core10K20K30K40K50KSE +/- 571.32, N = 3471901. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1282 x AMD EPYC 7F52 16-Core9K18K27K36K45KSE +/- 144.07, N = 3411261. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2562 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 53.41, N = 3386681. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 5122 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 95.19, N = 3380621. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 10242 x AMD EPYC 7F52 16-Core11K22K33K44K55KSE +/- 192.06, N = 3508631. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 20482 x AMD EPYC 7F52 16-Core12K24K36K48K60KSE +/- 113.59, N = 3537441. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 40962 x AMD EPYC 7F52 16-Core11K22K33K44K55KSE +/- 365.99, N = 3507271. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 10242 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 544.98, N = 3386891. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 20482 x AMD EPYC 7F52 16-Core5K10K15K20K25KSE +/- 892.19, N = 12235271. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 40962 x AMD EPYC 7F52 16-Core4K8K12K16K20KSE +/- 210.08, N = 3167181. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 22 x AMD EPYC 7F52 16-Core9M18M27M36M45MSE +/- 334012.94, N = 340571528.3

BYTE Unix Benchmark

Computational Test: Integer Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Integer Arithmetic2 x AMD EPYC 7F52 16-Core0.2250.450.6750.91.1251

BYTE Unix Benchmark

Computational Test: Register Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Register Arithmetic2 x AMD EPYC 7F52 16-Core0.2250.450.6750.91.1251

BYTE Unix Benchmark

Computational Test: Floating-Point Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Floating-Point Arithmetic2 x AMD EPYC 7F52 16-Core0.2250.450.6750.91.1251

oneDNN

Harness: IP Batch 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.36340.72681.09021.45361.817SE +/- 0.02509, N = 151.61525MIN: 1.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core48121620SE +/- 0.35, N = 1216.17MIN: 13.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.35030.70061.05091.40121.7515SE +/- 0.05202, N = 151.55677MIN: 1.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core3691215SE +/- 0.03, N = 311.68MIN: 10.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.24470.48940.73410.97881.2235SE +/- 0.01822, N = 31.08744MIN: 0.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.55871.11741.67612.23482.7935SE +/- 0.04147, N = 32.48291MIN: 1.961. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.791.582.373.163.95SE +/- 0.09360, N = 123.51112MIN: 2.721. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core1.03842.07683.11524.15365.192SE +/- 0.53429, N = 154.61523MIN: 1.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.45360.90721.36081.81442.268SE +/- 0.01808, N = 32.01612MIN: 1.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.37960.75921.13881.51841.898SE +/- 0.01380, N = 131.68715MIN: 1.551. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core80160240320400SE +/- 10.84, N = 12387.56MIN: 334.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core306090120150SE +/- 2.12, N = 15143.13MIN: 119.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.12670.25340.38010.50680.6335SE +/- 0.002289, N = 30.563318MIN: 0.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.20370.40740.61110.81481.0185SE +/- 0.006228, N = 30.905118MIN: 0.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080p2 x AMD EPYC 7F52 16-Core0.02630.05260.07890.10520.1315SE +/- 0.000, N = 30.1171. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080p2 x AMD EPYC 7F52 16-Core246810SE +/- 0.068, N = 117.6011. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080p2 x AMD EPYC 7F52 16-Core1326395265SE +/- 0.34, N = 360.231. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time2 x AMD EPYC 7F52 16-Core20M40M60M80M100MSE +/- 362360.61, N = 3825314911. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile2 x AMD EPYC 7F52 16-Core50100150200250SE +/- 3.45, N = 3220.73

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel2 x AMD EPYC 7F52 16-Core48121620SE +/- 0.02, N = 316.521. (CC) gcc options: -lm -lpthread -O3

Y-Cruncher

Calculating 500M Pi Digits

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.7.8.9503Calculating 500M Pi Digits2 x AMD EPYC 7F52 16-Core3691215SE +/- 0.06, N = 310.00

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance2 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 8.09, N = 39127.21. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.1Water Benchmark2 x AMD EPYC 7F52 16-Core1.00312.00623.00934.01245.0155SE +/- 0.008, N = 34.4581. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPOP2 x AMD EPYC 7F52 16-Core400K800K1200K1600K2000KSE +/- 94512.91, N = 121820952.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SADD2 x AMD EPYC 7F52 16-Core400K800K1200K1600K2000KSE +/- 18304.02, N = 151722705.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPUSH2 x AMD EPYC 7F52 16-Core300K600K900K1200K1500KSE +/- 9914.80, N = 31381357.81. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: GET2 x AMD EPYC 7F52 16-Core400K800K1200K1600K2000KSE +/- 30665.39, N = 151823659.701. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SET2 x AMD EPYC 7F52 16-Core300K600K900K1200K1500KSE +/- 27098.91, N = 31569169.371. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Sysbench

Test: Memory

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: Memory2 x AMD EPYC 7F52 16-Core500K1000K1500K2000K2500KSE +/- 65373.38, N = 152324848.421. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPU2 x AMD EPYC 7F52 16-Core14K28K42K56K70KSE +/- 8.08, N = 365234.821. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: OpenCL2 x AMD EPYC 7F52 16-Core80160240320400SE +/- 1.77, N = 3354.54

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core1020304050SE +/- 0.11, N = 345.78

Blender

Blend File: Classroom - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: OpenCL2 x AMD EPYC 7F52 16-Core50100150200250SE +/- 1.48, N = 3208.39

Blender

Blend File: Fishy Cat - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: OpenCL2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 2.04, N = 3934.25

Blender

Blend File: Barbershop - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: OpenCL2 x AMD EPYC 7F52 16-Core100200300400500SE +/- 2.22, N = 3481.97

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core306090120150SE +/- 0.15, N = 3115.80

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core1632486480SE +/- 0.38, N = 371.29

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core50100150200250SE +/- 0.14, N = 3209.03

Blender

Blend File: Pabellon Barcelona - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: OpenCL2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 3.68, N = 3957.25

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core306090120150SE +/- 0.17, N = 3152.28

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Times2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 3.48, N = 31010

Geekbench

Test: CPU Multi Core

OpenBenchmarking.orgScore, More Is BetterGeekbench 5.0Test: CPU Multi Core2 x AMD EPYC 7F52 16-Core7K14K21K28K35KSE +/- 166.82, N = 332844

Geekbench

Test: CPU Multi Core - Gaussian Blur

OpenBenchmarking.orgMpixels/sec, More Is BetterGeekbench 5.0Test: CPU Multi Core - Gaussian Blur2 x AMD EPYC 7F52 16-Core0.32850.6570.98551.3141.6425SE +/- 0.09, N = 31.46

Geekbench

Test: CPU Multi Core - Face Detection

OpenBenchmarking.orgimages/sec, More Is BetterGeekbench 5.0Test: CPU Multi Core - Face Detection2 x AMD EPYC 7F52 16-Core70140210280350SE +/- 1.32, N = 3336.2

Geekbench

Test: CPU Multi Core - Horizon Detection

OpenBenchmarking.orgGpixels/sec, More Is BetterGeekbench 5.0Test: CPU Multi Core - Horizon Detection2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 40.45, N = 3889.4

Geekbench

Test: CPU Single Core

OpenBenchmarking.orgScore, More Is BetterGeekbench 5.0Test: CPU Single Core2 x AMD EPYC 7F52 16-Core30060090012001500SE +/- 1.86, N = 31179

Geekbench

Test: CPU Single Core - Gaussian Blur

OpenBenchmarking.orgMpixels/sec, More Is BetterGeekbench 5.0Test: CPU Single Core - Gaussian Blur2 x AMD EPYC 7F52 16-Core1632486480SE +/- 0.06, N = 373.7

Geekbench

Test: CPU Single Core - Face Detection

OpenBenchmarking.orgimages/sec, More Is BetterGeekbench 5.0Test: CPU Single Core - Face Detection2 x AMD EPYC 7F52 16-Core3691215SE +/- 0.05, N = 39.61

Geekbench

Test: CPU Single Core - Horizon Detection

OpenBenchmarking.orgGpixels/sec, More Is BetterGeekbench 5.0Test: CPU Single Core - Horizon Detection2 x AMD EPYC 7F52 16-Core612182430SE +/- 0.03, N = 327.2

Mlpack Benchmark

Benchmark: scikit_svm

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svm2 x AMD EPYC 7F52 16-Core612182430SE +/- 0.63, N = 1525.69

Mlpack Benchmark

Benchmark: scikit_linearridgeregression

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregression2 x AMD EPYC 7F52 16-Core4080120160200SE +/- 0.18, N = 3198.64


Phoronix Test Suite v10.8.4