2 x AMD EPYC 7742 64-Core testing with an AMD DAYTONA_X (RDY1001C BIOS) and llvmpipe 504GB on Ubuntu 19.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1910270-HU-2XAMDEPYC21

2 x AMD EPYC 7642 48-Core Linux Yield
HTML result view exported from: https://openbenchmarking.org/result/1910270-HU-2XAMDEPYC21&rdt&grt .
2 x AMD EPYC 7642 48-Core Linux Yield - System Details

2 x AMD EPYC 7642 48-Core
Processor: 2 x AMD EPYC 7642 48-Core @ 2.30GHz (96 Cores / 192 Threads)
Motherboard: AMD DAYTONA_X (RDY1001C BIOS)
Chipset: AMD Starship/Matisse
Memory: 516096MB
Disk: 280GB INTEL SSDPED1D280GA + 256GB Micron_1100_MTFD
Graphics: llvmpipe 504GB
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 19.10
Kernel: 5.4.0-999-generic (x86_64) 20191017
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 3.3 Mesa 19.2.1 (LLVM 9.0 128 bits)
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 1920x1080

2 x AMD EPYC 7742 64-Core (only the Processor entry is listed)
Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)

Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details - NONE / errors=remount-ro,relatime,rw
Processor Details - Scaling Governor: acpi-cpufreq ondemand
Java Details - OpenJDK Runtime Environment (build 11.0.5-ea+10-post-Ubuntu-0ubuntu1)
Security Details - l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
2 x AMD EPYC 7642 48-Core Linux Yield - Results Overview
("-" marks a test with no result listed for that system)

Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
mt-dgemm: Sustained Floating-Point Rate | 20.04 | 29.37
cassandra: Reads | 135 | 82
cassandra: Writes | 184446 | -
cassandra: Mixed 1:1 | 178 | 162
cassandra: Mixed 1:3 | 111 | 100
appleseed: Emily | 176.31 | 175.34
appleseed: Disney Material | 58.91 | 57.00
appleseed: Material Tester | 297.42 | 296.57
askap: tConvolve MT - Gridding | 4495.10 | 4524.24
askap: tConvolve MT - Degridding | 6885.82 | 6637.56
askap: tConvolve MPI - Gridding | 9398.43 | 9466.88
askap: tConvolve MPI - Degridding | 12657.97 | 12135.60
askap: tConvolve OpenMP - Gridding | 7093.04 | 6751.29
askap: tConvolve OpenMP - Degridding | 3974.54 | 3848.79
darktable: Boat - CPU-only | 3.50 | 3.91
darktable: Masskrug - CPU-only | 4.79 | 5.28
darktable: Server Rack - CPU-only | 0.24 | 0.23
darktable: Server Room - CPU-only | 3.46 | 4.92
dav1d: Chimera 1080p | 573.92 | 720.53
dav1d: Summer Nature 4K | 317.43 | 315.60
dav1d: Summer Nature 1080p | 724.74 | 782.32
dav1d: Chimera 1080p 10-bit | 98.33 | 124.37
embree: Pathtracer - Crown | 68.68 | 82.55
embree: Pathtracer ISPC - Crown | 64.95 | 76.00
embree: Pathtracer - Asian Dragon | 56.73 | 58.06
embree: Pathtracer - Asian Dragon Obj | 50.03 | 50.52
embree: Pathtracer ISPC - Asian Dragon | 54.33 | 55.82
embree: Pathtracer ISPC - Asian Dragon Obj | 46.94 | 47.84
rocksdb: Rand Fill | 263536 | -
rocksdb: Rand Read | 364170524 | 463664567
rocksdb: Seq Fill | 252029 | -
rocksdb: Rand Fill Sync | 178224 | 188159
rocksdb: Read While Writing | 9887084 | -
glibc-bench: cos | 55.45 | 53.86
glibc-bench: exp | 6.40 | 6.21
glibc-bench: ffs | 2.32 | 2.25
glibc-bench: sin | 55.11 | 53.49
glibc-bench: log2 | 7.62 | 7.40
glibc-bench: modf | 2.92 | 2.84
glibc-bench: sinh | 10.19 | 9.86
glibc-bench: sqrt | 2.92 | 2.83
glibc-bench: tanh | 13.96 | 13.56
glibc-bench: asinh | 11.06 | 10.75
glibc-bench: atanh | 13.21 | 12.83
glibc-bench: ffsll | 2.32 | 2.25
glibc-bench: sincos | 15.94 | 15.47
glibc-bench: pthread_once | 2.32 | 2.25
graphics-magick: Swirl | 1992 | 1963
graphics-magick: Rotate | 420 | 440
graphics-magick: Sharpen | 791 | 889
graphics-magick: Enhanced | 1119 | 1248
graphics-magick: Resizing | 134 | 94
graphics-magick: Noise-Gaussian | 649 | 630
graphics-magick: HWB Color Space | 1091 | 943
himeno: Poisson Pressure Solver | 2862.27 | 2869.36
oidn: Memorial | 26.05 | 27.60
ior: Write Test | 1197.71 | 1160.81
ior: Read Test | 1818.81 | 1865.31
encode-mp3: WAV To MP3 | 9.41 | 9.14
libgav1: Chimera 1080p | 34.47 | 34.25
libgav1: Summer Nature 4K | 15.40 | 15.52
libgav1: Summer Nature 1080p | 47.09 | 47.92
libgav1: Chimera 1080p 10-bit | 14.51 | 14.79
libreoffice: 20 Documents To PDF | 9.19 | 8.97
luxcorerender: DLSC | 0.22 | 0.27
luxcorerender: Rainbow Colors and Prism | 0.18 | 0.15
minife: Small | 15128.13 | 15615.67
mkl-dnn: IP Batch 1D - f32 | 1.86 | 2.06
mkl-dnn: IP Batch All - f32 | 11.55 | 14.20
mkl-dnn: IP Batch 1D - u8s8f32 | 27.70 | 26.73
mkl-dnn: IP Batch All - u8s8f32 | 166.48 | 163.92
mkl-dnn: Convolution Batch conv_3d - f32 | 4.21 | 4.34
mkl-dnn: Convolution Batch conv_all - f32 | 485.82 | 408.68
mkl-dnn: Convolution Batch conv_3d - u8s8f32 | 1775.59 | 1393.12
mkl-dnn: Deconvolution Batch deconv_1d - f32 | 2.76 | 2.85
mkl-dnn: Deconvolution Batch deconv_3d - f32 | 2.81 | 2.99
mkl-dnn: Convolution Batch conv_alexnet - f32 | 59.19 | 48.92
mkl-dnn: Convolution Batch conv_all - u8s8f32 | 9665.14 | 9074.29
mkl-dnn: Deconvolution Batch deconv_all - f32 | 2213.19 | 844.26
mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32 | 658.74 | 516.79
mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32 | 1108.64 | 843.96
mkl-dnn: Recurrent Neural Network Training - f32 | 662.73 | 791.31
mkl-dnn: Convolution Batch conv_alexnet - u8s8f32 | 716.86 | 534.90
mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 | 31.70 | 29.02
mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32 | 846.64 | 1575.73
n-queens: Elapsed Time | 2.07 | 1.75
neatbench: CPU | 27.20 | 27.47
openssl: RSA 4096-bit Performance | 19813.40 | 24750.03
ospray: San Miguel - SciVis | 76.92 | 91.61
ospray: XFrog Forest - SciVis | 16.31 | 20.41
ospray: San Miguel - Path Tracer | 6.99 | 8.40
ospray: NASA Streamlines - SciVis | 100 | 125
ospray: XFrog Forest - Path Tracer | 8.87 | 10.99
ospray: Magnetic Reconnection - SciVis | 50 | 55.56
ospray: NASA Streamlines - Path Tracer | 24.39 | 30.01
ospray: Magnetic Reconnection - Path Tracer | 333.33 | 333.33
pennant: sedovbig | 188.60 | 231.28
pennant: leblancbig | 178.93 | 223.17
pgbench: Buffer Test - Normal Load - Read Only | 898293.99 | -
pgbench: Buffer Test - Normal Load - Read Write | 15444.23 | -
pgbench: Buffer Test - Single Thread - Read Only | 20042.30 | -
pgbench: Buffer Test - Single Thread - Read Write | 2163.45 | -
rav1e: 1080p To AV1 Video Encode | 0.82 | 0.68
redis: LPOP | 1268007.32 | 1321996.75
redis: SADD | 1597669.88 | 1719436.50
redis: LPUSH | 1237976.26 | 1292527.80
redis: GET | 1804165.17 | 1942483.35
redis: SET | 1430535.50 | 1461897.92
sqlite: 1 | 2.42 | 2.25
sqlite: 8 | 6.78 | 6.57
sqlite: 32 | 23.61 | 24.74
sqlite: 64 | 69.53 | 59.81
sqlite: 128 | 137.51 | 138.96
sqlite-speedtest: Timed Time - Size 1,000 | 78.42 | 80.56
sudokut: Total Time | 16.68 | 16.19
svt-av1: Enc Mode 0 - 1080p | 0.05 | 0.05
svt-av1: Enc Mode 4 - 1080p | 9.69 | 10.38
svt-av1: Enc Mode 8 - 1080p | 101.83 | 103.84
svt-vp9: VMAF Optimized - Bosphorus 1080p | 313.35 | 321.44
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 321.15 | 321.93
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 251.75 | 259.42
mrbayes: Primate Phylogeny Analysis | 106.63 | 109.87
vpxenc: vpxenc VP9 1080p Video Encode | 132.83 | 137.33
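Because the overview mixes "More Is Better" and "Fewer Is Better" metrics, a raw ratio of two columns can point the wrong way. The following is a minimal sketch of one way to normalize the comparison; the helper name `relative_gain` and its sign convention are assumptions made for illustration and are not part of the Phoronix Test Suite. The two sample calls use the mt-dgemm (GFLOP/s, more is better) and mkl-dnn deconv_all f32 (ms, fewer is better) values from this result file.

```python
def relative_gain(baseline: float, candidate: float, more_is_better: bool = True) -> float:
    """Percentage by which `candidate` beats `baseline` (negative if it is worse).

    Hypothetical helper: for "fewer is better" metrics the ratio is inverted
    so that a positive number always means the candidate system did better.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    ratio = candidate / baseline if more_is_better else baseline / candidate
    return (ratio - 1.0) * 100.0


# 2 x EPYC 7642 is treated as the baseline, 2 x EPYC 7742 as the candidate.
dgemm_gain = relative_gain(20.04, 29.37, more_is_better=True)
deconv_all_gain = relative_gain(2213.19, 844.26, more_is_better=False)

print(f"mt-dgemm gain: {dgemm_gain:.1f}%")
print(f"mkl-dnn deconv_all f32 gain: {deconv_all_gain:.1f}%")
```

Inverting the ratio, rather than negating the percentage, keeps the figure interpretable as a speedup factor minus one in both directions.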
ACES DGEMM 1.0 - Sustained Floating-Point Rate (GFLOP/s, More Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Sustained Floating-Point Rate | 20.04 (SE +/- 0.17, N = 3) | 29.37 (SE +/- 0.35, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp

Apache Cassandra 3.11.4 (Op/s, More Is Better; "-" marks no result listed)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Reads | 135 (SE +/- 15.04, N = 9) | 82 (SE +/- 4.50, N = 6)
Writes | 184446 (SE +/- 1320.16, N = 3) | -
Mixed 1:1 | 178 (SE +/- 20.52, N = 9) | 162 (SE +/- 6.73, N = 6)
Mixed 1:3 | 111 (SE +/- 6.23, N = 9) | 100 (SE +/- 4.69, N = 4)
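
The Cassandra runs carry large standard errors, so it can help to express the gap between two means in units of their combined standard error before reading much into it. This is a rough heuristic sketched under the assumption that the two systems' runs are independent; the helper name `gap_in_standard_errors` is hypothetical and this is not a calculation the Phoronix Test Suite itself reports.

```python
import math


def gap_in_standard_errors(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of two means divided by their combined standard error.

    Assumes independent samples, so the combined standard error is
    sqrt(se_a**2 + se_b**2). Small values suggest the gap is within run-to-run noise.
    """
    combined_se = math.hypot(se_a, se_b)
    if combined_se == 0:
        raise ValueError("combined standard error is zero")
    return abs(mean_a - mean_b) / combined_se


# Means and SE values copied from the Cassandra results in this file.
reads_gap = gap_in_standard_errors(135, 15.04, 82, 4.50)
mixed_1_1_gap = gap_in_standard_errors(178, 20.52, 162, 6.73)

print(f"Reads gap: {reads_gap:.2f} combined SE")
print(f"Mixed 1:1 gap: {mixed_1_1_gap:.2f} combined SE")
```

With these inputs the Reads gap is several combined standard errors wide, while the Mixed 1:1 gap is smaller than one.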

Appleseed 2.0 Beta (Seconds, Fewer Is Better)
Scene | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Emily | 176.31 | 175.34
Disney Material | 58.91 | 57.00
Material Tester | 297.42 | 296.57

ASKAP 2018-11-10 (Million Grid Points Per Second, More Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
tConvolve MT - Gridding | 4495.10 (SE +/- 72.76, N = 3) | 4524.24 (SE +/- 57.56, N = 3)
tConvolve MT - Degridding | 6885.82 (SE +/- 441.89, N = 3) | 6637.56 (SE +/- 144.38, N = 3)
tConvolve MPI - Gridding | 9398.43 (SE +/- 5.02, N = 3) | 9466.88 (SE +/- 4.01, N = 3)
tConvolve MPI - Degridding | 12657.97 (SE +/- 7.54, N = 3) | 12135.60 (SE +/- 6.27, N = 3)
tConvolve OpenMP - Gridding | 7093.04 (SE +/- 287.30, N = 15) | 6751.29 (SE +/- 399.45, N = 12)
tConvolve OpenMP - Degridding | 3974.54 (SE +/- 221.18, N = 15) | 3848.79 (SE +/- 144.06, N = 12)
1. (CXX) g++ options: -lpthread

Darktable 2.6.0 - Acceleration: CPU-only (Seconds, Fewer Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Boat | 3.50 (SE +/- 0.07, N = 12) | 3.91 (SE +/- 0.07, N = 15)
Masskrug | 4.79 (SE +/- 0.02, N = 3) | 5.28 (SE +/- 0.02, N = 3)
Server Rack | 0.24 (SE +/- 0.01, N = 15) | 0.23 (SE +/- 0.01, N = 15)
Server Room | 3.46 (SE +/- 0.24, N = 13) | 4.92 (SE +/- 0.29, N = 15)

dav1d 0.5.0 (FPS, More Is Better)
Video Input | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Chimera 1080p | 573.92 (SE +/- 1.60, N = 3; MIN: 301.49 / MAX: 719.11) | 720.53 (SE +/- 7.49, N = 3; MIN: 259.62 / MAX: 920.09)
Summer Nature 4K | 317.43 (SE +/- 1.28, N = 3; MIN: 72.4 / MAX: 391.45) | 315.60 (SE +/- 1.76, N = 3; MIN: 55.99 / MAX: 417.7)
Summer Nature 1080p | 724.74 (SE +/- 4.33, N = 3; MIN: 158.65 / MAX: 886.33) | 782.32 (SE +/- 7.44, N = 3; MIN: 132.3 / MAX: 1028.45)
Chimera 1080p 10-bit | 98.33 (SE +/- 0.30, N = 3; MIN: 66.13 / MAX: 165.92) | 124.37 (SE +/- 0.20, N = 3; MIN: 88.15 / MAX: 185.51)
1. (CC) gcc options: -pthread

Embree 3.6.1 (Frames Per Second, More Is Better)
Binary - Model | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Pathtracer - Crown | 68.68 (SE +/- 0.06, N = 3; MIN: 65.4 / MAX: 72.66) | 82.55 (SE +/- 0.21, N = 3; MIN: 76.03 / MAX: 89.61)
Pathtracer ISPC - Crown | 64.95 (SE +/- 0.27, N = 3; MIN: 61.27 / MAX: 69.08) | 76.00 (SE +/- 0.52, N = 3; MIN: 69.25 / MAX: 84.2)
Pathtracer - Asian Dragon | 56.73 (SE +/- 0.19, N = 3; MIN: 54.54 / MAX: 59.89) | 58.06 (SE +/- 0.15, N = 3; MIN: 54.63 / MAX: 61.04)
Pathtracer - Asian Dragon Obj | 50.03 (SE +/- 0.07, N = 3; MIN: 48.42 / MAX: 53.88) | 50.52 (SE +/- 0.05, N = 3; MIN: 47.78 / MAX: 53.87)
Pathtracer ISPC - Asian Dragon | 54.33 (SE +/- 0.06, N = 3; MIN: 52.36 / MAX: 57.15) | 55.82 (SE +/- 0.11, N = 3; MIN: 52.35 / MAX: 58.67)
Pathtracer ISPC - Asian Dragon Obj | 46.94 (SE +/- 0.23, N = 3; MIN: 45.38 / MAX: 49.91) | 47.84 (SE +/- 0.05, N = 3; MIN: 45.36 / MAX: 50.97)

Facebook RocksDB 6.3.6 (Op/s, More Is Better; "-" marks no result listed)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Random Fill | 263536 (SE +/- 3468.04, N = 15) | -
Random Read | 364170524 (SE +/- 1293979.10, N = 3) | 463664567 (SE +/- 1391185.52, N = 3)
Sequential Fill | 252029 (SE +/- 2908.66, N = 3) | -
Random Fill Sync | 178224 (SE +/- 1952.69, N = 3) | 188159 (SE +/- 3196.19, N = 3)
Read While Writing | 9887084 (SE +/- 173881.68, N = 15) | -
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

glibc bench 1.0 (nanoseconds, Fewer Is Better)
Benchmark | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
cos | 55.45 (SE +/- 0.01, N = 3) | 53.86 (SE +/- 0.03, N = 3)
exp | 6.40 (SE +/- 0.00, N = 3) | 6.21 (SE +/- 0.00, N = 3)
ffs | 2.32 (SE +/- 0.00, N = 6) | 2.25 (SE +/- 0.00, N = 15)
sin | 55.11 (SE +/- 0.02, N = 3) | 53.49 (SE +/- 0.01, N = 3)
log2 | 7.62 (SE +/- 0.00, N = 3) | 7.40 (SE +/- 0.01, N = 3)
modf | 2.92 (SE +/- 0.00, N = 12) | 2.84 (SE +/- 0.00, N = 3)
sinh | 10.19 (SE +/- 0.07, N = 3) | 9.86 (SE +/- 0.02, N = 3)
sqrt | 2.92 (SE +/- 0.00, N = 3) | 2.83 (SE +/- 0.00, N = 6)
tanh | 13.96 (SE +/- 0.00, N = 3) | 13.56 (SE +/- 0.00, N = 3)
asinh | 11.06 (SE +/- 0.00, N = 3) | 10.75 (SE +/- 0.02, N = 3)
atanh | 13.21 (SE +/- 0.00, N = 3) | 12.83 (SE +/- 0.00, N = 3)
ffsll | 2.32 (SE +/- 0.00, N = 5) | 2.25 (SE +/- 0.00, N = 15)
sincos | 15.94 (SE +/- 0.01, N = 3) | 15.47 (SE +/- 0.00, N = 3)
pthread_once | 2.32 (SE +/- 0.00, N = 3) | 2.25 (SE +/- 0.00, N = 12)

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Operation | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Swirl | 1992 (SE +/- 11.67, N = 3) | 1963 (SE +/- 10.74, N = 3)
Rotate | 420 (SE +/- 2.00, N = 3) | 440 (SE +/- 2.40, N = 3)
Sharpen | 791 (SE +/- 1.20, N = 3) | 889 (SE +/- 2.40, N = 3)
Enhanced | 1119 (SE +/- 2.03, N = 3) | 1248 (SE +/- 2.73, N = 3)
Resizing | 134 (SE +/- 1.45, N = 3) | 94 (SE +/- 0.79, N = 15)
Noise-Gaussian | 649 (SE +/- 6.21, N = 9) | 630 (SE +/- 4.51, N = 3)
HWB Color Space | 1091 (SE +/- 10.09, N = 3) | 943 (SE +/- 7.85, N = 15)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Himeno Benchmark 3.0 (MFLOPS, More Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Poisson Pressure Solver | 2862.27 (SE +/- 30.68, N = 15) | 2869.36 (SE +/- 37.11, N = 5)
1. (CC) gcc options: -O3 -mavx2

Intel Open Image Denoise 1.0.0 (Images / Sec, More Is Better)
Scene | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Memorial | 26.05 (SE +/- 0.22, N = 3) | 27.60 (SE +/- 0.27, N = 3)

IOR 3.2.1 (MB/s, More Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Write Test | 1197.71 (SE +/- 7.32, N = 3; MIN: 151.13 / MAX: 1470.06) | 1160.81 (SE +/- 17.41, N = 4; MIN: 67.9 / MAX: 1495.31)
Read Test | 1818.81 (SE +/- 34.38, N = 3; MIN: 149.65 / MAX: 2011.51) | 1865.31 (SE +/- 16.35, N = 4; MIN: 123.91 / MAX: 2062.88)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

LAME MP3 Encoding 3.100 (Seconds, Fewer Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
WAV To MP3 | 9.41 (SE +/- 0.02, N = 3) | 9.14 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

libgav1 2019-10-05 (FPS, More Is Better)
Video Input | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Chimera 1080p | 34.47 (SE +/- 0.18, N = 3) | 34.25 (SE +/- 0.05, N = 3)
Summer Nature 4K | 15.40 (SE +/- 0.04, N = 3) | 15.52 (SE +/- 0.05, N = 3)
Summer Nature 1080p | 47.09 (SE +/- 0.54, N = 3) | 47.92 (SE +/- 0.25, N = 3)
Chimera 1080p 10-bit | 14.51 (SE +/- 0.06, N = 3) | 14.79 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -lpthread

LibreOffice (Seconds, Fewer Is Better)
Test | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
20 Documents To PDF | 9.19 (SE +/- 0.07, N = 5) | 8.97 (SE +/- 0.05, N = 5)
1. LibreOffice 6.3.2.2 30(Build:2)

LuxCoreRender 2.2 (M samples/sec, More Is Better)
Scene | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
DLSC | 0.22 (SE +/- 0.03, N = 12; MIN: 0.1 / MAX: 0.4) | 0.27 (SE +/- 0.02, N = 15; MIN: 0.09 / MAX: 0.39)
Rainbow Colors and Prism | 0.18 (SE +/- 0.01, N = 15; MIN: 0.1 / MAX: 0.3) | 0.15 (SE +/- 0.01, N = 15; MIN: 0.09 / MAX: 0.25)

miniFE 2.2 (CG Mflops, More Is Better)
Problem Size | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
Small | 15128.13 (SE +/- 349.52, N = 15) | 15615.67 (SE +/- 274.76, N = 15)
1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

MKL-DNN DNNL 1.1 (ms, Fewer Is Better)
Harness - Data Type | 2 x AMD EPYC 7642 48-Core | 2 x AMD EPYC 7742 64-Core
IP Batch 1D - f32 | 1.86 (SE +/- 0.03, N = 3; MIN: 1.62) | 2.06 (SE +/- 0.03, N = 3; MIN: 1.85)
IP Batch All - f32 | 11.55 (SE +/- 0.05, N = 3; MIN: 10.82) | 14.20 (SE +/- 0.05, N = 3; MIN: 13.42)
IP Batch 1D - u8s8f32 | 27.70 (SE +/- 0.23, N = 3; MIN: 25.11) | 26.73 (SE +/- 0.11, N = 3; MIN: 24.24)
IP Batch All - u8s8f32 | 166.48 (SE +/- 0.14, N = 3; MIN: 160.31) | 163.92 (SE +/- 1.06, N = 3; MIN: 155.95)
Convolution Batch conv_3d - f32 | 4.21 (SE +/- 0.05, N = 3; MIN: 3.45) | 4.34 (SE +/- 0.01, N = 3; MIN: 3.68)
Convolution Batch conv_all - f32 | 485.82 (SE +/- 5.18, N = 3; MIN: 453.62) | 408.68 (SE +/- 4.19, N = 3; MIN: 377.44)
Convolution Batch conv_3d - u8s8f32 | 1775.59 (SE +/- 1.21, N = 3; MIN: 1748.36) | 1393.12 (SE +/- 1.67, N = 3; MIN: 1369)
Deconvolution Batch deconv_1d - f32 | 2.76 (SE +/- 0.02, N = 3; MIN: 2.37) | 2.85 (SE +/- 0.02, N = 3; MIN: 2.55)
Deconvolution Batch deconv_3d - f32 | 2.81 (SE +/- 0.11, N = 12; MIN: 2.32) | 2.99 (SE +/- 0.07, N = 15; MIN: 2.49)
Convolution Batch conv_alexnet - f32 | 59.19 (SE +/- 0.57, N = 15; MIN: 52.69) | 48.92 (SE +/- 0.40, N = 15; MIN: 43.91)
Convolution Batch conv_all - u8s8f32 | 9665.14 (SE +/- 13.30, N = 3; MIN: 9334.8) | 9074.29 (SE +/- 11.94, N = 3; MIN: 8770.74)
Deconvolution Batch deconv_all - f32 | 2213.19 (SE +/- 31.06, N = 3; MIN: 2049.6) | 844.26 (SE +/- 12.63, N = 3; MIN: 779.57)
Deconvolution Batch deconv_1d - u8s8f32 | 658.74 (SE +/- 0.63, N = 3; MIN: 652.2) | 516.79 (SE +/- 0.48, N = 3; MIN: 508.96)
Deconvolution Batch deconv_3d - u8s8f32 | 1108.64 (SE +/- 0.31, N = 3; MIN: 1105.65) | 843.96 (SE +/- 0.87, N = 3; MIN: 840.32)
Recurrent Neural Network Training - f32 | 662.73 (SE +/- 4.72, N = 3; MIN: 602.24) | 791.31 (SE +/- 3.21, N = 3; MIN: 757.5)
Convolution Batch conv_alexnet - u8s8f32 | 716.86 (SE +/- 2.19, N = 3; MIN: 699.77) | 534.90 (SE +/- 2.01, N = 3; MIN: 518.5)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
MKL-DNN DNNL Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 2 x AMD EPYC 7642 48-Core 2 x AMD EPYC 7742 64-Core 7 14 21 28 35 SE +/- 0.37, N = 5 SE +/- 0.27, N = 3 31.70 29.02 MIN: 27.69 MIN: 26.27 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
MKL-DNN DNNL Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32 2 x AMD EPYC 7642 48-Core 2 x AMD EPYC 7742 64-Core 300 600 900 1200 1500 SE +/- 11.71, N = 3 SE +/- 4.88, N = 3 846.64 1575.73 MIN: 759.39 MIN: 1476.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

N-Queens 1.0 - Elapsed Time (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 2.07 (SE +/- 0.06, N = 15)
  2 x AMD EPYC 7742 64-Core: 1.75 (SE +/- 0.04, N = 12)
  1. (CC) gcc options: -static -fopenmp -O3 -march=native

NeatBench 5 - Acceleration: CPU (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 27.20 (SE +/- 0.06, N = 3)
  2 x AMD EPYC 7742 64-Core: 27.47 (SE +/- 0.07, N = 3)

OpenSSL 1.1.1 - RSA 4096-bit Performance (Signs Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 19813.40 (SE +/- 155.67, N = 3)
  2 x AMD EPYC 7742 64-Core: 24750.03 (SE +/- 86.09, N = 3)
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OSPray 1.8.5 - Demo: San Miguel - Renderer: SciVis (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 76.92 (SE +/- 0.00, N = 12; MIN: 31.25 / MAX: 90.91)
  2 x AMD EPYC 7742 64-Core: 91.61 (SE +/- 0.70, N = 13; MIN: 28.57 / MAX: 111.11)

OSPray 1.8.5 - Demo: XFrog Forest - Renderer: SciVis (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 16.31 (SE +/- 0.09, N = 3; MIN: 12.35 / MAX: 16.67)
  2 x AMD EPYC 7742 64-Core: 20.41 (SE +/- 0.00, N = 12; MIN: 12.5 / MAX: 21.28)

OSPray 1.8.5 - Demo: San Miguel - Renderer: Path Tracer (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 6.99 (SE +/- 0.03, N = 3; MIN: 4.95 / MAX: 7.09)
  2 x AMD EPYC 7742 64-Core: 8.40 (SE +/- 0.04, N = 3; MIN: 5.18 / MAX: 8.7)

OSPray 1.8.5 - Demo: NASA Streamlines - Renderer: SciVis (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 100 (MIN: 15.63 / MAX: 111.11)
  2 x AMD EPYC 7742 64-Core: 125 (MIN: 16.95 / MAX: 142.86)

OSPray 1.8.5 - Demo: XFrog Forest - Renderer: Path Tracer (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 8.87 (SE +/- 0.02, N = 4; MIN: 7.25 / MAX: 9.01)
  2 x AMD EPYC 7742 64-Core: 10.99 (SE +/- 0.07, N = 3; MIN: 8.85 / MAX: 11.24)

OSPray 1.8.5 - Demo: Magnetic Reconnection - Renderer: SciVis (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 50.00 (MIN: 10.64 / MAX: 52.63)
  2 x AMD EPYC 7742 64-Core: 55.56 (MIN: 9.62 / MAX: 58.82)
  SE +/- 0.00, N = 15 (only one SE entry is listed for this test)

OSPray 1.8.5 - Demo: NASA Streamlines - Renderer: Path Tracer (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 24.39 (SE +/- 0.00, N = 12; MIN: 10.53 / MAX: 25)
  2 x AMD EPYC 7742 64-Core: 30.01 (SE +/- 0.30, N = 3; MIN: 11.76 / MAX: 31.25)

OSPray 1.8.5 - Demo: Magnetic Reconnection - Renderer: Path Tracer (FPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 333.33 (SE +/- 0.00, N = 12; MIN: 66.67 / MAX: 500)
  2 x AMD EPYC 7742 64-Core: 333.33 (SE +/- 0.00, N = 12; MIN: 47.62 / MAX: 500)

Pennant 1.0.1 - Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 188.60 (SE +/- 0.13, N = 3)
  2 x AMD EPYC 7742 64-Core: 231.28 (SE +/- 0.19, N = 3)

Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 178.93 (SE +/- 0.24, N = 3)
  2 x AMD EPYC 7742 64-Core: 223.17 (SE +/- 0.56, N = 3)

  1. (CXX) g++ options (both Pennant tests): -fopenmp -pthread -lmpi_cxx -lmpi

PostgreSQL pgbench 12.0 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 898293.99 (SE +/- 1549.57, N = 3)

PostgreSQL pgbench 12.0 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 15444.23 (SE +/- 468.31, N = 9)

PostgreSQL pgbench 12.0 - Scaling: Buffer Test - Test: Single Thread - Mode: Read Only (TPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 20042.30 (SE +/- 264.13, N = 3)

PostgreSQL pgbench 12.0 - Scaling: Buffer Test - Test: Single Thread - Mode: Read Write (TPS, More Is Better)
  2 x AMD EPYC 7642 48-Core: 2163.45 (SE +/- 9.65, N = 3)

  1. (CC) gcc options (all PostgreSQL pgbench tests): -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

rav1e 20191023 - 1080p To AV1 Video Encode (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 0.82 (SE +/- 0.01, N = 3)
  2 x AMD EPYC 7742 64-Core: 0.68 (SE +/- 0.01, N = 3)

Redis 5.0.5 - Test: LPOP (Requests Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 1268007.32 (SE +/- 16507.52, N = 15)
  2 x AMD EPYC 7742 64-Core: 1321996.75 (SE +/- 18633.50, N = 15)

Redis 5.0.5 - Test: SADD (Requests Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 1597669.88 (SE +/- 15943.28, N = 15)
  2 x AMD EPYC 7742 64-Core: 1719436.50 (SE +/- 24578.27, N = 15)

Redis 5.0.5 - Test: LPUSH (Requests Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 1237976.26 (SE +/- 18827.57, N = 15)
  2 x AMD EPYC 7742 64-Core: 1292527.80 (SE +/- 17074.67, N = 15)

Redis 5.0.5 - Test: GET (Requests Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 1804165.17 (SE +/- 13285.92, N = 3)
  2 x AMD EPYC 7742 64-Core: 1942483.35 (SE +/- 29954.73, N = 15)

Redis 5.0.5 - Test: SET (Requests Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 1430535.50 (SE +/- 20709.04, N = 3)
  2 x AMD EPYC 7742 64-Core: 1461897.92 (SE +/- 21192.42, N = 3)

  1. (CXX) g++ options (all Redis tests): -MM -MT -g3 -fvisibility=hidden -O3

SQLite 3.30.1 - Threads / Copies: 1 (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 2.42 (SE +/- 0.02, N = 9)
  2 x AMD EPYC 7742 64-Core: 2.25 (SE +/- 0.00, N = 3)

SQLite 3.30.1 - Threads / Copies: 8 (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 6.78 (SE +/- 0.28, N = 15)
  2 x AMD EPYC 7742 64-Core: 6.57 (SE +/- 0.23, N = 15)

SQLite 3.30.1 - Threads / Copies: 32 (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 23.61 (SE +/- 0.68, N = 12)
  2 x AMD EPYC 7742 64-Core: 24.74 (SE +/- 1.27, N = 12)

SQLite 3.30.1 - Threads / Copies: 64 (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 69.53 (SE +/- 0.80, N = 3)
  2 x AMD EPYC 7742 64-Core: 59.81 (SE +/- 1.40, N = 15)

SQLite 3.30.1 - Threads / Copies: 128 (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 137.51 (SE +/- 3.30, N = 12)
  2 x AMD EPYC 7742 64-Core: 138.96 (SE +/- 1.17, N = 3)

  1. (CC) gcc options (all SQLite tests): -O2 -lz -lm -ldl -lpthread

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 78.42 (SE +/- 0.27, N = 3)
  2 x AMD EPYC 7742 64-Core: 80.56 (SE +/- 0.25, N = 3)
  1. (CC) gcc options: -O2 -ldl -lz -lpthread

Sudokut 0.4 - Total Time (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 16.68 (SE +/- 0.01, N = 3)
  2 x AMD EPYC 7742 64-Core: 16.19 (SE +/- 0.01, N = 3)

SVT-AV1 0.7 - Encoder Mode: Enc Mode 0 - Input: 1080p (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 0.05 (SE +/- 0.00, N = 3)
  2 x AMD EPYC 7742 64-Core: 0.05 (SE +/- 0.00, N = 6)

SVT-AV1 0.7 - Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 9.69 (SE +/- 0.03, N = 3)
  2 x AMD EPYC 7742 64-Core: 10.38 (SE +/- 0.13, N = 3)

SVT-AV1 0.7 - Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 101.83 (SE +/- 0.96, N = 3)
  2 x AMD EPYC 7742 64-Core: 103.84 (SE +/- 0.50, N = 3)

  1. (CXX) g++ options (all SVT-AV1 tests): -fPIE -fPIC -pie

SVT-VP9 0.1 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 313.35 (SE +/- 2.76, N = 11)
  2 x AMD EPYC 7742 64-Core: 321.44 (SE +/- 3.30, N = 3)

SVT-VP9 0.1 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 321.15 (SE +/- 1.15, N = 3)
  2 x AMD EPYC 7742 64-Core: 321.93 (SE +/- 2.57, N = 3)

SVT-VP9 0.1 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 251.75 (SE +/- 3.61, N = 3)
  2 x AMD EPYC 7742 64-Core: 259.42 (SE +/- 2.59, N = 3)

  1. (CC) gcc options (all SVT-VP9 tests): -fPIE -fPIC -fvisibility=hidden -O3 -pie -rdynamic -lpthread -lrt -lm

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  2 x AMD EPYC 7642 48-Core: 106.63 (SE +/- 0.37, N = 3)
  2 x AMD EPYC 7742 64-Core: 109.87 (SE +/- 0.05, N = 3)
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

VP9 libvpx Encoding 1.8.1 - vpxenc VP9 1080p Video Encode (Frames Per Second, More Is Better)
  2 x AMD EPYC 7642 48-Core: 132.83 (SE +/- 1.36, N = 8)
  2 x AMD EPYC 7742 64-Core: 137.33 (SE +/- 0.42, N = 3)
  1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
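
The results above mix two directions: ms and seconds where fewer is better, and FPS, TPS, signs or requests per second where more is better. As a rough aid for comparing the two systems, the sketch below normalizes a pair of results into a single "times faster" ratio. The function name `speedup` and its parameters are illustrative assumptions and are not part of the Phoronix Test Suite; the sample values are copied from the OpenSSL and MKL-DNN DNNL results in this file, with the 7642 system treated as the baseline.

```python
def speedup(baseline, candidate, higher_is_better):
    """Return how many times faster `candidate` is than `baseline`.

    A value above 1.0 means the candidate system is faster; below 1.0
    means it is slower. Hypothetical helper, not a Phoronix Test Suite API.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    if higher_is_better:
        return candidate / baseline
    return baseline / candidate


# Baseline: 2 x AMD EPYC 7642 48-Core; candidate: 2 x AMD EPYC 7742 64-Core.
openssl = speedup(19813.40, 24750.03, higher_is_better=True)   # Signs Per Second
deconv_all = speedup(2213.19, 844.26, higher_is_better=False)  # ms
rnn_training = speedup(662.73, 791.31, higher_is_better=False) # ms

print(f"OpenSSL RSA 4096-bit: {openssl:.2f}x")
print(f"MKL-DNN deconv_all f32: {deconv_all:.2f}x")
print(f"MKL-DNN RNN Training f32: {rnn_training:.2f}x")
```

A ratio on its own ignores the reported standard errors, so small differences (for example the SVT-VP9 PSNR/SSIM result) should not be read as a meaningful win for either system.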
Phoronix Test Suite v10.8.4