2 x AMD EPYC 7F52 16-Core testing with a Supermicro H11DSU-iN (2.1b BIOS) and llvmpipe 504GB on Ubuntu 20.04 via the Phoronix Test Suite.
2 x AMD EPYC 7F52 16-Core Processor: 2 x AMD EPYC 7F52 16-Core @ 3.50GHz (32 Cores / 64 Threads), Motherboard: Supermicro H11DSU-iN (2.1b BIOS), Chipset: AMD Starship/Matisse, Memory: 504GB, Disk: 2 x 3841GB Micron_9200_MTFDHAL3T8TCT, Graphics: llvmpipe 504GB, Network: 4 x Intel I350
OS: Ubuntu 20.04, Kernel: 5.4.0-39-generic (x86_64), Desktop: GNOME Shell 3.36.2, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 3.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits), Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1024x768
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301038Python Notes: Python 3.8.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.82 Blend File: Fishy Cat - Compute: OpenCL 2 x AMD EPYC 7F52 16-Core 200 400 600 800 1000 SE +/- 2.04, N = 3 934.25
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.82 Blend File: Barbershop - Compute: OpenCL 2 x AMD EPYC 7F52 16-Core 100 200 300 400 500 SE +/- 2.22, N = 3 481.97
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.82 Blend File: Barbershop - Compute: CPU-Only 2 x AMD EPYC 7F52 16-Core 50 100 150 200 250 SE +/- 0.14, N = 3 209.03
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.82 Blend File: Pabellon Barcelona - Compute: OpenCL 2 x AMD EPYC 7F52 16-Core 200 400 600 800 1000 SE +/- 3.68, N = 3 957.25
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.82 Blend File: Pabellon Barcelona - Compute: CPU-Only 2 x AMD EPYC 7F52 16-Core 30 60 90 120 150 SE +/- 0.17, N = 3 152.28
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel 2 x AMD EPYC 7F52 16-Core 4 8 12 16 20 SE +/- 0.02, N = 3 16.52 1. (CC) gcc options: -lm -lpthread -O3
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 16.38, N = 3 10112 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 64 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 206.06, N = 15 8133.8 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 76.91, N = 3 10132.6 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 64 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 6.64, N = 3 8638.1 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 128 2 x AMD EPYC 7F52 16-Core 1600 3200 4800 6400 8000 SE +/- 9.78, N = 3 7667.3 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 256 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 27.08, N = 3 8166.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 512 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 6.51, N = 3 8242.8 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 128 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 9.09, N = 3 7999.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 256 2 x AMD EPYC 7F52 16-Core 1700 3400 5100 6800 8500 SE +/- 24.14, N = 3 7777.5 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 512 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 13.54, N = 3 7795.1 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 13.16, N = 3 8389.1 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 2048 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 9.33, N = 3 8126.4 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 11.13, N = 3 7888.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 2 x AMD EPYC 7F52 16-Core 1600 3200 4800 6400 8000 SE +/- 82.79, N = 3 7237.5 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 2048 2 x AMD EPYC 7F52 16-Core 1400 2800 4200 5600 7000 SE +/- 17.73, N = 3 6298.6 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 2 x AMD EPYC 7F52 16-Core 1200 2400 3600 4800 6000 SE +/- 49.51, N = 3 5816.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 2 x AMD EPYC 7F52 16-Core 3K 6K 9K 12K 15K SE +/- 96.11, N = 3 13905 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 64 2 x AMD EPYC 7F52 16-Core 4K 8K 12K 16K 20K SE +/- 25.20, N = 3 18890 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 2 x AMD EPYC 7F52 16-Core 9K 18K 27K 36K 45K SE +/- 10.71, N = 3 41325 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 64 2 x AMD EPYC 7F52 16-Core 9K 18K 27K 36K 45K SE +/- 127.77, N = 3 41864 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 128 2 x AMD EPYC 7F52 16-Core 5K 10K 15K 20K 25K SE +/- 30.60, N = 3 24196 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 256 2 x AMD EPYC 7F52 16-Core 8K 16K 24K 32K 40K SE +/- 408.77, N = 3 35262 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 512 2 x AMD EPYC 7F52 16-Core 10K 20K 30K 40K 50K SE +/- 571.32, N = 3 47190 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 128 2 x AMD EPYC 7F52 16-Core 9K 18K 27K 36K 45K SE +/- 144.07, N = 3 41126 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 256 2 x AMD EPYC 7F52 16-Core 8K 16K 24K 32K 40K SE +/- 53.41, N = 3 38668 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 512 2 x AMD EPYC 7F52 16-Core 8K 16K 24K 32K 40K SE +/- 95.19, N = 3 38062 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 1024 2 x AMD EPYC 7F52 16-Core 11K 22K 33K 44K 55K SE +/- 192.06, N = 3 50863 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 2 x AMD EPYC 7F52 16-Core 12K 24K 36K 48K 60K SE +/- 113.59, N = 3 53744 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 2 x AMD EPYC 7F52 16-Core 11K 22K 33K 44K 55K SE +/- 365.99, N = 3 50727 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 1024 2 x AMD EPYC 7F52 16-Core 8K 16K 24K 32K 40K SE +/- 544.98, N = 3 38689 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 2 x AMD EPYC 7F52 16-Core 5K 10K 15K 20K 25K SE +/- 892.19, N = 12 23527 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 2 x AMD EPYC 7F52 16-Core 4K 8K 12K 16K 20K SE +/- 210.08, N = 3 16718 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Geekbench This is a benchmark of Geekbench 5 Pro. The test profile automates the execution of Geekbench 5 under the Phoronix Test Suite, assuming you have a valid license key for Geekbench 5 Pro. This test will not work without a valid license key for Geekbench Pro. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better Geekbench 5.0 Test: CPU Multi Core 2 x AMD EPYC 7F52 16-Core 7K 14K 21K 28K 35K SE +/- 166.82, N = 3 32844
OpenBenchmarking.org Mpixels/sec, More Is Better Geekbench 5.0 Test: CPU Multi Core - Gaussian Blur 2 x AMD EPYC 7F52 16-Core 0.3285 0.657 0.9855 1.314 1.6425 SE +/- 0.09, N = 3 1.46
OpenBenchmarking.org Gpixels/sec, More Is Better Geekbench 5.0 Test: CPU Multi Core - Horizon Detection 2 x AMD EPYC 7F52 16-Core 200 400 600 800 1000 SE +/- 40.45, N = 3 889.4
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.13 ATPase Simulation - 327,506 Atoms 2 x AMD EPYC 7F52 16-Core 0.1241 0.2482 0.3723 0.4964 0.6205 SE +/- 0.00120, N = 3 0.55141
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 x AMD EPYC 7F52 16-Core 30K 60K 90K 120K 150K SE +/- 1226.95, N = 3 118641.43 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 2 x AMD EPYC 7F52 16-Core 10K 20K 30K 40K 50K SE +/- 745.22, N = 3 45035.90 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 x AMD EPYC 7F52 16-Core 600 1200 1800 2400 3000 SE +/- 4.22, N = 3 2804.16 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 x AMD EPYC 7F52 16-Core 600 1200 1800 2400 3000 SE +/- 0.48, N = 3 2820.72 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 x AMD EPYC 7F52 16-Core 16K 32K 48K 64K 80K SE +/- 866.26, N = 15 76311.27 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 2 x AMD EPYC 7F52 16-Core 500 1000 1500 2000 2500 SE +/- 0.97, N = 3 2322.00 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 2 x AMD EPYC 7F52 16-Core 30K 60K 90K 120K 150K SE +/- 1021.34, N = 3 149391.65 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 x AMD EPYC 7F52 16-Core 20K 40K 60K 80K 100K SE +/- 1207.93, N = 3 101748.74 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 2 x AMD EPYC 7F52 16-Core 20K 40K 60K 80K 100K SE +/- 268.84, N = 3 107910.88 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.3634 0.7268 1.0902 1.4536 1.817 SE +/- 0.02509, N = 15 1.61525 MIN: 1.27 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 4 8 12 16 20 SE +/- 0.35, N = 12 16.17 MIN: 13.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.3503 0.7006 1.0509 1.4012 1.7515 SE +/- 0.05202, N = 15 1.55677 MIN: 1.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 3 6 9 12 15 SE +/- 0.03, N = 3 11.68 MIN: 10.41 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.2447 0.4894 0.7341 0.9788 1.2235 SE +/- 0.01822, N = 3 1.08744 MIN: 0.97 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.5587 1.1174 1.6761 2.2348 2.7935 SE +/- 0.04147, N = 3 2.48291 MIN: 1.96 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.79 1.58 2.37 3.16 3.95 SE +/- 0.09360, N = 12 3.51112 MIN: 2.72 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 1.0384 2.0768 3.1152 4.1536 5.192 SE +/- 0.53429, N = 15 4.61523 MIN: 1.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.4536 0.9072 1.3608 1.8144 2.268 SE +/- 0.01808, N = 3 2.01612 MIN: 1.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.3796 0.7592 1.1388 1.5184 1.898 SE +/- 0.01380, N = 13 1.68715 MIN: 1.55 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 80 160 240 320 400 SE +/- 10.84, N = 12 387.56 MIN: 334.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 30 60 90 120 150 SE +/- 2.12, N = 15 143.13 MIN: 119.52 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.1267 0.2534 0.3801 0.5068 0.6335 SE +/- 0.002289, N = 3 0.563318 MIN: 0.45 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 2 x AMD EPYC 7F52 16-Core 0.2037 0.4074 0.6111 0.8148 1.0185 SE +/- 0.006228, N = 3 0.905118 MIN: 0.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance 2 x AMD EPYC 7F52 16-Core 2K 4K 6K 8K 10K SE +/- 8.09, N = 3 9127.2 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times 2 x AMD EPYC 7F52 16-Core 200 400 600 800 1000 SE +/- 3.48, N = 3 1010
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: SADD 2 x AMD EPYC 7F52 16-Core 400K 800K 1200K 1600K 2000K SE +/- 18304.02, N = 15 1722705.42 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: LPUSH 2 x AMD EPYC 7F52 16-Core 300K 600K 900K 1200K 1500K SE +/- 9914.80, N = 3 1381357.8 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: GET 2 x AMD EPYC 7F52 16-Core 400K 800K 1200K 1600K 2000K SE +/- 30665.39, N = 15 1823659.70 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: SET 2 x AMD EPYC 7F52 16-Core 300K 600K 900K 1200K 1500K SE +/- 27098.91, N = 3 1569169.37 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time 2 x AMD EPYC 7F52 16-Core 20M 40M 60M 80M 100M SE +/- 362360.61, N = 3 82531491 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale 2 x AMD EPYC 7F52 16-Core 40K 80K 120K 160K 200K SE +/- 809.33, N = 5 187535.1 1. (CC) gcc options: -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad 2 x AMD EPYC 7F52 16-Core 40K 80K 120K 160K 200K SE +/- 2507.91, N = 5 201925.7 1. (CC) gcc options: -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add 2 x AMD EPYC 7F52 16-Core 40K 80K 120K 160K 200K SE +/- 1962.43, N = 5 198645.2 1. (CC) gcc options: -O3 -march=native -fopenmp
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 0 - Input: 1080p 2 x AMD EPYC 7F52 16-Core 0.0263 0.0526 0.0789 0.1052 0.1315 SE +/- 0.000, N = 3 0.117 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p 2 x AMD EPYC 7F52 16-Core 2 4 6 8 10 SE +/- 0.068, N = 11 7.601 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p 2 x AMD EPYC 7F52 16-Core 13 26 39 52 65 SE +/- 0.34, N = 3 60.23 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Events Per Second, More Is Better Sysbench 2018-07-28 Test: CPU 2 x AMD EPYC 7F52 16-Core 14K 28K 42K 56K 70K SE +/- 8.08, N = 3 65234.82 1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm
2 x AMD EPYC 7F52 16-Core Processor: 2 x AMD EPYC 7F52 16-Core @ 3.50GHz (32 Cores / 64 Threads), Motherboard: Supermicro H11DSU-iN (2.1b BIOS), Chipset: AMD Starship/Matisse, Memory: 504GB, Disk: 2 x 3841GB Micron_9200_MTFDHAL3T8TCT, Graphics: llvmpipe 504GB, Network: 4 x Intel I350
OS: Ubuntu 20.04, Kernel: 5.4.0-39-generic (x86_64), Desktop: GNOME Shell 3.36.2, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 3.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits), Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1024x768
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301038Python Notes: Python 3.8.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 30 June 2020 12:01 by user paul.