2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2103122-HA-EPYCMARCH14
HTML result view exported from: https://openbenchmarking.org/result/2103122-HA-EPYCMARCH14 .
epyc-march

Configurations: EPYC 7742 2P, 2P, 2 x AMD EPYC 7742 64-Core, 7742 2P Repeat

Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)
Motherboard: Supermicro H11DSi-NT v2.00 (2.1 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Monitor: VGA HDMI
Network: 2 x Intel 10G X550T
OS: Ubuntu 20.04
Kernel: 5.8.0-44-generic (x86_64)
Display Server: X Server 1.20.8
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details - EPYC 7742 2P, 7742 2P Repeat: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034
Java Details - EPYC 7742 2P, 7742 2P Repeat: OpenJDK Runtime Environment (build 11.0.10+9-Ubuntu-0ubuntu1.20.04)
Python Details - EPYC 7742 2P, 2P, 7742 2P Repeat: Python 2.7.18 + Python 3.8.5
Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
epyc-march ior: 2MB - Default Test Directory ior: 4MB - Default Test Directory ior: 8MB - Default Test Directory ior: 16MB - Default Test Directory blosc: blosclz quantlib: etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering hpcg: hpl: npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: IS.D npb: LU.C npb: MG.C lczero: BLAS lczero: Eigen parboil: OpenMP LBM parboil: OpenMP CUTCP parboil: OpenMP Stencil parboil: OpenMP MRI Gridding minife: Small cloverleaf: Lagrangian-Eulerian Hydrodynamics rodinia: OpenMP LavaMD rodinia: OpenMP HotSpot3D rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster namd: ATPase Simulation - 327,506 Atoms dolfyn: Computational Fluid Dynamics neat: lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression lzbench: Libdeflate 1 - Decompression amg: ffte: N=256, 3D Complex FFT Routine fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 2048 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 2048 fftw: Float + SSE - 2D FFT Size 4096 pennant: sedovbig pennant: leblancbig mrbayes: Primate Phylogeny Analysis nwchem: C240 Buckyball qmcpack: simple-H2O hmmer: Pfam Database Search incompact3d: Cylinder mafft: Multiple Sequence Alignment - LSU RNA mocassin: Dust 2D tau100.0 openfoam: Motorbike 30M openfoam: Motorbike 60M qe: AUSURF112 relion: Basic - CPU lammps: 20k Atoms lammps: Rhodopsin Protein webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression libgav1: Summer Nature 4K libgav1: Summer Nature 1080p dacapobench: H2 
dacapobench: Jython dacapobench: Tradebeans compress-lz4: 1 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed jpegxl: PNG - 5 jpegxl: PNG - 7 jpegxl: PNG - 8 jpegxl: JPEG - 5 jpegxl: JPEG - 7 jpegxl: JPEG - 8 jpegxl-decode: All srslte: OFDM_Test srslte: PHY_DL_Test srslte: PHY_DL_Test luajit: Composite libraw: Post-Processing Benchmark crafty: Elapsed Time tscp: AI Chess Performance graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: 
Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU dav1d: Summer Nature 4K dav1d: Summer Nature 1080p ospray: San Miguel - SciVis ospray: XFrog Forest - SciVis ospray: San Miguel - Path Tracer ospray: NASA Streamlines - SciVis ospray: XFrog Forest - Path Tracer ospray: Magnetic Reconnection - SciVis ospray: NASA Streamlines - Path Tracer ospray: Magnetic Reconnection - Path Tracer ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping aom-av1: Speed 6 Realtime aom-av1: Speed 6 Two-Pass aom-av1: Speed 8 Realtime embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast kvazaar: Bosphorus 1080p - Ultra Fast rav1e: 6 rav1e: 10 svt-av1: Enc Mode 4 - 1080p svt-av1: Enc Mode 8 - 1080p svt-vp9: VMAF Optimized - Bosphorus 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-vp9: Visual Quality Optimized - Bosphorus 1080p vpxenc: Speed 5 x264: H.264 Video Encoding x265: Bosphorus 4K x265: Bosphorus 1080p mt-dgemm: Sustained Floating-Point Rate oidn: Memorial openvkl: vklBenchmark luxcorerender: DLSC luxcorerender: Rainbow Colors and Prism himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test stockfish: Total Time asmfish: 1024 Hash Memory, 26 Depth avifenc: 0 avifenc: 2 avifenc: 6 avifenc: 10 avifenc: 6, Lossless avifenc: 10, Lossless build-apache: Time To Compile build-ffmpeg: Time To Compile build-gcc: Time To Compile build-gdb: Time To Compile build-godot: Time To Compile build-imagemagick: Time To Compile build-linux-kernel: Time To Compile build-llvm: Time To Compile build-mplayer: Time To Compile build-php: Time To Compile build2: Time To Compile c-ray: Total 
Time - 4K, 16 Rays Per Pixel povray: Trace Time tungsten: Hair tungsten: Water Caustic tungsten: Non-Exponential tungsten: Volumetric Caustic yafaray: Total Time For Sample Scene rays1bench: Large Scene numpy: aobench: 2048 x 2048 - Total Time build-eigen: Time To Compile build-erlang: Time To Compile build-wasmer: Time To Compile compress-gzip: Linux Source Tree Archiving To .tar.gz compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 dcraw: RAW To PPM Image Conversion deepspeech: CPU encode-ape: WAV To APE encode-flac: WAV To FLAC encode-mp3: WAV To MP3 encode-ogg: WAV To Ogg encode-opus: WAV To Opus Encode espeak: Text-To-Speech Synthesis m-queens: Time To Solve montage: Mosaic of M17, K band, 1.5 deg x 1.5 deg n-queens: Elapsed Time ngspice: C2670 ngspice: C7552 radiance: SMP Parallel rnnoise: system-decompress-gzip: system-decompress-xz: tachyon: Total Time webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression synthmark: VoiceMark_100 system-decompress-zlib: liquid-dsp: 1 - 256 - 57 liquid-dsp: 2 - 256 - 57 liquid-dsp: 4 - 256 - 57 liquid-dsp: 8 - 256 - 57 liquid-dsp: 16 - 256 - 57 liquid-dsp: 32 - 256 - 57 liquid-dsp: 64 - 256 - 57 liquid-dsp: 128 - 256 - 57 liquid-dsp: 256 - 256 - 57 couchdb: 100 - 1000 - 24 financebench: Repo OpenMP financebench: Bonds OpenMP askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve MPI - Degridding askap: tConvolve MPI - Gridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding askap: Hogbom Clean OpenMP tjbench: Decompression Throughput luaradio: Five Back to Back FIR Filters luaradio: FM Deemphasis Filter luaradio: Hilbert Transform luaradio: Complex Phase gnuradio: Five Back to Back FIR Filters gnuradio: Signal Source (Cosine) gnuradio: FIR Filter gnuradio: IIR Filter gnuradio: FM Deemphasis Filter gnuradio: Hilbert Transform 
toybrot: TBB toybrot: OpenMP toybrot: C++ Tasks toybrot: C++ Threads gromacs: water_GMX50_bare blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only ior: 32MB - Default Test Directory libgav1: Chimera 1080p compress-zstd: 3 - Compression Speed compress-zstd: 3 - Decompression Speed jpegxl-decode: 1 luajit: Monte Carlo luajit: Fast Fourier Transform luajit: Sparse Matrix Multiply luajit: Dense LU Matrix Factorization luajit: Jacobi Successive Over-Relaxation EPYC 7742 2P 2P 2 x AMD EPYC 7742 64-Core 7742 2P Repeat 445.10 485.74 489.40 480.63 3491.5 2015.9 1039.813 236.722 139.501 224.164 25.6364 153.59 41060.19 8223.34 8426.04 76051.36 3268.82 194294.65 72806.59 3936 4198 51.018187 0.831875 5.389561 194.080744 11312.66 23.36 30.21 112.829 50.850 10.627 9.970 0.27952 20.206 62.752 32 98 435 1346 82 1497 89 381 415 482 165 571 205 959 1247427667 150712.96819699 6873.9 6160.7 5387.5 44939 26509 18663 5.872848 3.949192 108.673 1963.3 44.322 398.635 345.786885 10.266 239 14.12 112.80 1219.36 542.048 31.977 29.000 1.855 2.863 20.360 8.904 41.976 18.97 72.74 5947 5027 4866 9527.72 11001.5 45.36 10188.3 44.86 10418.3 1989.9 2975.7 70.7 2792.5 629.2 3092.0 587.6 3205.6 32.8 2828.5 63.73 9.77 0.70 51.71 51.36 22.91 99.54 98333333 197.7 83.0 1178.95 30.99 6757787 1031813 1721 543 833 1199 69 650 929 2.07347 1.54650 2.13329 3.08917 0.724395 2.86937 2.88437 4.68078 2.20539 1.21032 2948.06 1267.20 2910.88 1281.86 0.712936 2923.30 1245.81 0.812990 387.05 1245.91 83.33 19.74 6.76 125 10.10 45.45 30.30 333.33 581.879 18.47 3.36 31.89 67.5868 59.3323 44.9727 39.0474 42.1185 36.3283 22.70 64.43 40.15 44.45 136.84 181.14 1.460 3.102 7.456 85.786 340.02 363.96 274.56 20.85 204.01 18.77 61.36 28.494121 28.72 473 15.03 16.89 3962.214819 338315 190042987 236093113 60.112 32.717 12.103 4.228 34.934 7.587 24.677 19.748 715.113 91.309 61.407 15.667 21.500 200.816 10.269 41.668 64.213 7.754 
8.028 5.61696 23.6008 1.72256 4.46136 65.660 492.78 305.72 39.902 94.909 190.789 68.219 41.888 26.581 50.516 78.03710 14.369 9.830 9.103 23.699 9.150 35.079 7.083 93.077 1.770 169.462 130.441 213.866 23.142 3.600 4.311 9.8909 3.272 136.405 251.434 7.721 440.928 646.905 2025.984712 53594667 107143333 213280000 427203333 832276667 1616566667 2703933333 3135600000 5525100000 112.558 52054.593750 89585.742187 5224.98 7117.00 38291.8 37599.7 4826.83 3991.99 217.823 172.304423 643.9 346.8 84.4 532.7 400.8 3032.8 554.9 506.9 744.6 436.5 3910 5179 4295 4039 101100000 207.6 87.5 53579667 107166667 213286667 427276667 831613333 1618000000 2693766667 3218138462 5550733333 653.0 343.0 84.4 532.5 433.2 3040.5 555.8 505.0 751.2 436.2 2243.5 2982.6 69.2 2781.5 561.6 3199.6 33.0 2824.9 60.626 32.697 12.958 4.189 35.432 7.400 21.552 185.824 68.433 3886 5141 4307 4050 8.064 24.22 49.06 36.02 81.79 64.56 452.03 480.55 485.13 468.11 3788.1 2009.2 1037.460 237.245 139.561 224.599 25.9558 39810.96 8108.43 7885.52 71133.34 3313.12 176465.67 73254.92 3333 3512 74.068677 0.852529 5.544628 208.037842 8169.17 2994.27 32.511 150.965 55.861 219.378 96.160 0.28306 20.201 85.373 32 99 437 1334 83 1487 89 382 416 481 165 569 206 961 1246138667 148613.89146209 6890.6 6086.5 5306.8 44294 26795 17541 5.875714 3.926446 109.158 1933.8 46.283 408.898 348.516388 10.468 239 14.15 112.70 1225.26 541.471 31.833 28.183 1.854 2.864 20.432 8.864 41.936 6295 5096 4991 9289.07 10868.7 45.16 10373.0 43.49 10278.8 1992.5 2977.7 70.9 2791.9 620.5 3090.0 566.1 3205.8 33.9 2825.8 64.40 9.77 0.70 53.11 51.17 22.72 99.32 97960000 204.5 86.8 1200.41 29.78 6778608 1030655 1730 523 829 1203 68 654 880 2.06564 2.49866 2.13758 2.88822 1.95461 2.86260 2.70898 5.59055 2.19937 1.20163 3125.87 1355.40 3152.16 1285.17 0.714842 3209.71 643.8 347.1 84.6 534.5 423.3 3090.2 555.9 506.9 747.2 437.6 461.38 51.20 5053.7 2908.5 32.97 412.34 210.57 1008.92 2811.62 1644.11 OpenBenchmarking.org
IOR 3.3.0 - Disk Target: Default Test Directory (MB/s, More Is Better)

Block Size: 2MB
  EPYC 7742 2P: 445.10 (SE +/- 1.26, N = 3; MIN: 379.87 / MAX: 836.63)
  7742 2P Repeat: 452.03 (SE +/- 4.26, N = 7; MIN: 378.75 / MAX: 1032.92)
Block Size: 4MB
  EPYC 7742 2P: 485.74 (SE +/- 6.27, N = 3; MIN: 400.81 / MAX: 1055.7)
  7742 2P Repeat: 480.55 (SE +/- 2.94, N = 3; MIN: 405.25 / MAX: 1028.47)
Block Size: 8MB
  EPYC 7742 2P: 489.40 (SE +/- 1.12, N = 3; MIN: 410.57 / MAX: 920.3)
  7742 2P Repeat: 485.13 (SE +/- 3.76, N = 3; MIN: 411.08 / MAX: 1041.22)
Block Size: 16MB
  EPYC 7742 2P: 480.63 (SE +/- 1.21, N = 3; MIN: 410.37 / MAX: 1034.82)
  7742 2P Repeat: 468.11 (SE +/- 5.31, N = 3; MIN: 411 / MAX: 1031.92)

1. (CC) gcc options: -O2 -lm -pthread -lmpi
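When reading paired results like the IOR figures, it can help to check whether the gap between the first run and the repeat is larger than the reported run-to-run noise. The following is a minimal Python sketch of one way to do that, not part of the Phoronix Test Suite. The function name `compare_runs`, the quadrature combination of the two standard errors, and the threshold of two combined standard errors (`k=2.0`) are all illustrative assumptions, and with N as small as 3 the result is only a rough guide.

```python
from math import sqrt


def compare_runs(first, repeat, se_first, se_repeat, k=2.0):
    """Compare a repeat run against the first run.

    Returns (percent_change, exceeds_noise). The k-times-combined-SE
    threshold is an illustrative assumption, not a Phoronix Test Suite rule.
    """
    delta = repeat - first
    percent_change = 100.0 * delta / first
    # Combine the two standard errors in quadrature (assumes independent runs).
    combined_se = sqrt(se_first ** 2 + se_repeat ** 2)
    return percent_change, abs(delta) > k * combined_se


# IOR values in MB/s taken from the results above:
# (first run, repeat run, SE of first run, SE of repeat run)
ior = {
    "2MB": (445.10, 452.03, 1.26, 4.26),
    "16MB": (480.63, 468.11, 1.21, 5.31),
}

for block_size, values in ior.items():
    pct, exceeds = compare_runs(*values)
    print(f"{block_size}: {pct:+.2f}% exceeds_noise={exceeds}")
```

Under these assumptions the 2MB difference (about +1.6%) stays within the noise band, while the 16MB difference (about -2.6%) falls slightly outside it. For "Fewer Is Better" tests the sign of the percent change reads the other way round.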
C-Blosc 2.0 Beta 5 - Compressor: blosclz (MB/s, More Is Better)
  EPYC 7742 2P: 3491.5 (SE +/- 23.71, N = 3)
  7742 2P Repeat: 3788.1 (SE +/- 39.54, N = 3)

1. (CXX) g++ options: -rdynamic

QuantLib 1.21 (MFLOPS, More Is Better)
  EPYC 7742 2P: 2015.9 (SE +/- 14.66, N = 3)
  7742 2P Repeat: 2009.2 (SE +/- 13.17, N = 3)

1. (CXX) g++ options: -O3 -march=native -rdynamic
Etcpak 0.7 (Mpx/s, More Is Better)

Configuration: DXT1
  EPYC 7742 2P: 1039.81 (SE +/- 3.71, N = 3)
  7742 2P Repeat: 1037.46 (SE +/- 3.65, N = 3)
Configuration: ETC1
  EPYC 7742 2P: 236.72 (SE +/- 0.10, N = 3)
  7742 2P Repeat: 237.25 (SE +/- 0.11, N = 3)
Configuration: ETC2
  EPYC 7742 2P: 139.50 (SE +/- 0.00, N = 3)
  7742 2P Repeat: 139.56 (SE +/- 0.07, N = 3)
Configuration: ETC1 + Dithering
  EPYC 7742 2P: 224.16 (SE +/- 0.05, N = 3)
  7742 2P Repeat: 224.60 (SE +/- 0.03, N = 3)

1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
  EPYC 7742 2P: 25.64 (SE +/- 0.28, N = 5)
  7742 2P Repeat: 25.96 (SE +/- 0.20, N = 3)

1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

HPL Linpack 2.3 (GFLOPS, More Is Better)
  EPYC 7742 2P: 153.59 (SE +/- 0.42, N = 3)

1. (CC) gcc options: -O2 -lopenblas -lm -pthread -lmpi
NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)

Test / Class: CG.C
  EPYC 7742 2P: 41060.19 (SE +/- 380.32, N = 3)
  7742 2P Repeat: 39810.96 (SE +/- 428.23, N = 5)
Test / Class: EP.C
  EPYC 7742 2P: 8223.34 (SE +/- 21.18, N = 3)
  7742 2P Repeat: 8108.43 (SE +/- 9.74, N = 3)
Test / Class: EP.D
  EPYC 7742 2P: 8426.04 (SE +/- 14.28, N = 3)
  7742 2P Repeat: 7885.52 (SE +/- 8.01, N = 3)
Test / Class: FT.C
  EPYC 7742 2P: 76051.36 (SE +/- 839.98, N = 4)
  7742 2P Repeat: 71133.34 (SE +/- 262.47, N = 3)
Test / Class: IS.D
  EPYC 7742 2P: 3268.82 (SE +/- 30.48, N = 3)
  7742 2P Repeat: 3313.12 (SE +/- 10.37, N = 3)
Test / Class: LU.C
  EPYC 7742 2P: 194294.65 (SE +/- 770.77, N = 3)
  7742 2P Repeat: 176465.67 (SE +/- 1600.26, N = 3)
Test / Class: MG.C
  EPYC 7742 2P: 72806.59 (SE +/- 474.90, N = 3)
  7742 2P Repeat: 73254.92 (SE +/- 149.89, N = 3)

1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
LeelaChessZero 0.26 (Nodes Per Second, More Is Better)

Backend: BLAS
  EPYC 7742 2P: 3936 (SE +/- 49.55, N = 9)
  7742 2P Repeat: 3333 (SE +/- 37.92, N = 4)
Backend: Eigen
  EPYC 7742 2P: 4198 (SE +/- 41.70, N = 3)
  7742 2P Repeat: 3512 (SE +/- 49.56, N = 9)

1. (CXX) g++ options: -flto -pthread
Parboil 2.5 (Seconds, Fewer Is Better)

Test: OpenMP LBM
  EPYC 7742 2P: 51.02 (SE +/- 1.30, N = 15)
  7742 2P Repeat: 74.07 (SE +/- 0.77, N = 3)
Test: OpenMP CUTCP
  EPYC 7742 2P: 0.831875 (SE +/- 0.009850, N = 3)
  7742 2P Repeat: 0.852529 (SE +/- 0.008717, N = 15)
Test: OpenMP Stencil
  EPYC 7742 2P: 5.389561 (SE +/- 0.018548, N = 3)
  7742 2P Repeat: 5.544628 (SE +/- 0.042058, N = 3)
Test: OpenMP MRI Gridding
  EPYC 7742 2P: 194.08 (SE +/- 1.09, N = 3)
  7742 2P Repeat: 208.04 (SE +/- 1.80, N = 3)

1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
miniFE 2.2 - Problem Size: Small (CG Mflops, More Is Better)
  EPYC 7742 2P: 11312.66 (SE +/- 414.96, N = 15)
  7742 2P Repeat: 8169.17 (SE +/- 70.94, N = 3)

1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

CloverLeaf - Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  EPYC 7742 2P: 23.36 (SE +/- 0.38, N = 15)
  7742 2P Repeat: 2994.27 (SE +/- 0.03, N = 3)

1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Rodinia 3.1 (Seconds, Fewer Is Better)

Test: OpenMP LavaMD
  EPYC 7742 2P: 30.21 (SE +/- 0.27, N = 3)
  7742 2P Repeat: 32.51 (SE +/- 0.11, N = 3)
Test: OpenMP HotSpot3D
  EPYC 7742 2P: 112.83 (SE +/- 1.17, N = 15)
  7742 2P Repeat: 150.97 (SE +/- 0.67, N = 3)
Test: OpenMP Leukocyte
  EPYC 7742 2P: 50.85 (SE +/- 0.49, N = 3)
  7742 2P Repeat: 55.86 (SE +/- 0.43, N = 15)
Test: OpenMP CFD Solver
  EPYC 7742 2P: 10.63 (SE +/- 0.12, N = 3)
  7742 2P Repeat: 219.38 (SE +/- 0.14, N = 3)
Test: OpenMP Streamcluster
  EPYC 7742 2P: 9.970 (SE +/- 0.138, N = 15)
  7742 2P Repeat: 96.160 (SE +/- 4.228, N = 12)

1. (CXX) g++ options: -O2 -lOpenCL
NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  EPYC 7742 2P: 0.27952 (SE +/- 0.00196, N = 3)
  7742 2P Repeat: 0.28306 (SE +/- 0.00247, N = 3)

Dolfyn 0.527 - Computational Fluid Dynamics (Seconds, Fewer Is Better)
  EPYC 7742 2P: 20.21 (SE +/- 0.04, N = 3)
  7742 2P Repeat: 20.20 (SE +/- 0.04, N = 3)

Nebular Empirical Analysis Tool 2020-02-29 (Seconds, Fewer Is Better)
  EPYC 7742 2P: 62.75 (SE +/- 8.47, N = 15)
  7742 2P Repeat: 85.37 (SE +/- 6.74, N = 15)

1. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace
lzbench 1.8 (MB/s, More Is Better)

Where the export lists only one SE entry for a test, it does not identify which run that entry belongs to, so it is shown on its own line.

Test: XZ 0 - Process: Compression
  EPYC 7742 2P: 32
  7742 2P Repeat: 32
Test: XZ 0 - Process: Decompression
  EPYC 7742 2P: 98
  7742 2P Repeat: 99
  SE +/- 0.33, N = 3 (single entry)
Test: Zstd 1 - Process: Compression
  EPYC 7742 2P: 435 (SE +/- 0.88, N = 3)
  7742 2P Repeat: 437 (SE +/- 0.67, N = 3)
Test: Zstd 1 - Process: Decompression
  EPYC 7742 2P: 1346 (SE +/- 1.45, N = 3)
  7742 2P Repeat: 1334 (SE +/- 4.36, N = 3)
Test: Zstd 8 - Process: Compression
  EPYC 7742 2P: 82
  7742 2P Repeat: 83
  SE +/- 0.67, N = 3 (single entry)
Test: Zstd 8 - Process: Decompression
  EPYC 7742 2P: 1497 (SE +/- 0.67, N = 3)
  7742 2P Repeat: 1487 (SE +/- 0.88, N = 3)
Test: Crush 0 - Process: Compression
  EPYC 7742 2P: 89
  7742 2P Repeat: 89
  SE +/- 0.67, N = 3 (single entry)
Test: Crush 0 - Process: Decompression
  EPYC 7742 2P: 381
  7742 2P Repeat: 382
Test: Brotli 0 - Process: Compression
  EPYC 7742 2P: 415 (SE +/- 1.53, N = 3)
  7742 2P Repeat: 416 (SE +/- 0.67, N = 3)
Test: Brotli 0 - Process: Decompression
  EPYC 7742 2P: 482 (SE +/- 1.53, N = 3)
  7742 2P Repeat: 481 (SE +/- 1.86, N = 3)
Test: Brotli 2 - Process: Compression
  EPYC 7742 2P: 165 (SE +/- 0.58, N = 3)
  7742 2P Repeat: 165 (SE +/- 0.67, N = 3)
Test: Brotli 2 - Process: Decompression
  EPYC 7742 2P: 571
  7742 2P Repeat: 569
  SE +/- 1.67, N = 3 (single entry)
Test: Libdeflate 1 - Process: Compression
  EPYC 7742 2P: 205
  7742 2P Repeat: 206
  SE +/- 0.33, N = 3 (single entry)
Test: Libdeflate 1 - Process: Decompression
  EPYC 7742 2P: 959 (SE +/- 5.51, N = 3)
  7742 2P Repeat: 961 (SE +/- 0.67, N = 3)

1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  EPYC 7742 2P: 1247427667 (SE +/- 1663713.95, N = 3)
  7742 2P Repeat: 1246138667 (SE +/- 737641.36, N = 3)

1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

FFTE 7.0 - N=256, 3D Complex FFT Routine (MFLOPS, More Is Better)
  EPYC 7742 2P: 150712.97 (SE +/- 3136.68, N = 12)
  7742 2P Repeat: 148613.89 (SE +/- 3906.26, N = 15)

1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
FFTW 3.3.6 (Mflops, More Is Better)

Build: Stock - Size: 1D FFT Size 4096
  EPYC 7742 2P: 6873.9 (SE +/- 6.26, N = 3)
  7742 2P Repeat: 6890.6 (SE +/- 9.28, N = 3)
Build: Stock - Size: 2D FFT Size 2048
  EPYC 7742 2P: 6160.7 (SE +/- 3.97, N = 3)
  7742 2P Repeat: 6086.5 (SE +/- 47.11, N = 3)
Build: Stock - Size: 2D FFT Size 4096
  EPYC 7742 2P: 5387.5 (SE +/- 13.54, N = 3)
  7742 2P Repeat: 5306.8 (SE +/- 54.29, N = 3)
Build: Float + SSE - Size: 1D FFT Size 4096
  EPYC 7742 2P: 44939 (SE +/- 343.27, N = 3)
  7742 2P Repeat: 44294 (SE +/- 125.07, N = 3)
Build: Float + SSE - Size: 2D FFT Size 2048
  EPYC 7742 2P: 26509 (SE +/- 200.43, N = 12)
  7742 2P Repeat: 26795 (SE +/- 139.43, N = 3)
Build: Float + SSE - Size: 2D FFT Size 4096
  EPYC 7742 2P: 18663
  7742 2P Repeat: 17541
  SE +/- 230.17, N = 3 (single entry in the export; run not identified)

1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
  Test: sedovbig
    EPYC 7742 2P: 5.872848 (SE +/- 0.011805, N = 3)
    7742 2P Repeat: 5.875714 (SE +/- 0.025529, N = 3)
  Test: leblancbig
    EPYC 7742 2P: 3.949192 (SE +/- 0.028494, N = 3)
    7742 2P Repeat: 3.926446 (SE +/- 0.047128, N = 3)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Timed MrBayes Analysis 3.2.7 (Seconds, Fewer Is Better)
  Primate Phylogeny Analysis
    EPYC 7742 2P: 108.67 (SE +/- 0.20, N = 3)
    7742 2P Repeat: 109.16 (SE +/- 0.21, N = 3)
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm -lreadline
NWChem 7.0.2 (Seconds, Fewer Is Better)
  Input: C240 Buckyball
    EPYC 7742 2P: 1963.3
    7742 2P Repeat: 1933.8
  1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lcomex -lm -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
QMCPACK 3.10 (Total Execution Time - Seconds, Fewer Is Better)
  Input: simple-H2O
    EPYC 7742 2P: 44.32 (SE +/- 0.29, N = 3)
    7742 2P Repeat: 46.28 (SE +/- 1.61, N = 15)
  1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Timed HMMer Search 3.3.1 (Seconds, Fewer Is Better)
  Pfam Database Search
    EPYC 7742 2P: 398.64 (SE +/- 4.02, N = 3)
    7742 2P Repeat: 408.90 (SE +/- 5.75, N = 3)
  1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Incompact3D 2020-09-17 (Seconds, Fewer Is Better)
  Input: Cylinder
    EPYC 7742 2P: 345.79 (SE +/- 0.88, N = 3)
    7742 2P Repeat: 348.52 (SE +/- 1.44, N = 3)
  1. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Timed MAFFT Alignment 7.471 (Seconds, Fewer Is Better)
  Multiple Sequence Alignment - LSU RNA
    EPYC 7742 2P: 10.27 (SE +/- 0.14, N = 3)
    7742 2P Repeat: 10.47 (SE +/- 0.11, N = 15)
  1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Monte Carlo Simulations of Ionised Nebulae 2019-03-24 (Seconds, Fewer Is Better)
  Input: Dust 2D tau100.0
    EPYC 7742 2P: 239
    7742 2P Repeat: 239
    SE +/- 0.33, N = 3 (only one standard error is reported for this test)
  1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenFOAM 8 (Seconds, Fewer Is Better)
  Input: Motorbike 30M
    EPYC 7742 2P: 14.12 (SE +/- 0.07, N = 3)
    7742 2P Repeat: 14.15 (SE +/- 0.08, N = 3)
  Input: Motorbike 60M
    EPYC 7742 2P: 112.80 (SE +/- 0.06, N = 3)
    7742 2P Repeat: 112.70 (SE +/- 0.16, N = 3)
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
Quantum ESPRESSO 6.7 (Seconds, Fewer Is Better)
  Input: AUSURF112
    EPYC 7742 2P: 1219.36 (SE +/- 4.15, N = 3)
    7742 2P Repeat: 1225.26 (SE +/- 3.23, N = 3)
  1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
RELION 3.1.1 (Seconds, Fewer Is Better)
  Test: Basic - Device: CPU
    EPYC 7742 2P: 542.05 (SE +/- 4.40, N = 3)
    7742 2P Repeat: 541.47 (SE +/- 5.12, N = 6)
  1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
  Model: 20k Atoms
    EPYC 7742 2P: 31.98 (SE +/- 0.06, N = 3)
    7742 2P Repeat: 31.83 (SE +/- 0.06, N = 3)
  Model: Rhodopsin Protein
    EPYC 7742 2P: 29.00 (SE +/- 0.29, N = 6)
    7742 2P Repeat: 28.18 (SE +/- 0.20, N = 15)
  1. (CXX) g++ options: -O3 -pthread -lm
WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
  Encode Settings: Default
    EPYC 7742 2P: 1.855 (SE +/- 0.003, N = 3)
    7742 2P Repeat: 1.854 (SE +/- 0.002, N = 3)
  Encode Settings: Quality 100
    EPYC 7742 2P: 2.863 (SE +/- 0.001, N = 3)
    7742 2P Repeat: 2.864 (SE +/- 0.003, N = 3)
  Encode Settings: Quality 100, Lossless
    EPYC 7742 2P: 20.36 (SE +/- 0.02, N = 3)
    7742 2P Repeat: 20.43 (SE +/- 0.04, N = 3)
  Encode Settings: Quality 100, Highest Compression
    EPYC 7742 2P: 8.904 (SE +/- 0.002, N = 3)
    7742 2P Repeat: 8.864 (SE +/- 0.002, N = 3)
  Encode Settings: Quality 100, Lossless, Highest Compression
    EPYC 7742 2P: 41.98 (SE +/- 0.05, N = 3)
    7742 2P Repeat: 41.94 (SE +/- 0.03, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
libgav1 2019-10-05 (FPS, More Is Better)
  Video Input: Summer Nature 4K
    EPYC 7742 2P: 18.97 (SE +/- 0.04, N = 3)
  Video Input: Summer Nature 1080p
    EPYC 7742 2P: 72.74 (SE +/- 0.95, N = 15)
  1. (CXX) g++ options: -O3 -lpthread
DaCapo Benchmark 9.12-MR1 (msec, Fewer Is Better)
  Java Test: H2
    EPYC 7742 2P: 5947 (SE +/- 109.01, N = 20)
    7742 2P Repeat: 6295 (SE +/- 160.58, N = 20)
  Java Test: Jython
    EPYC 7742 2P: 5027 (SE +/- 60.94, N = 4)
    7742 2P Repeat: 5096 (SE +/- 52.30, N = 4)
  Java Test: Tradebeans
    EPYC 7742 2P: 4866 (SE +/- 98.30, N = 17)
    7742 2P Repeat: 4991 (SE +/- 116.87, N = 20)
LZ4 Compression 1.9.3 (MB/s, More Is Better)
  Compression Level: 1 - Compression Speed
    EPYC 7742 2P: 9527.72 (SE +/- 65.80, N = 3)
    7742 2P Repeat: 9289.07 (SE +/- 29.92, N = 3)
  Compression Level: 1 - Decompression Speed
    EPYC 7742 2P: 11001.5 (SE +/- 91.80, N = 3)
    7742 2P Repeat: 10868.7 (SE +/- 21.34, N = 3)
  Compression Level: 3 - Compression Speed
    EPYC 7742 2P: 45.36 (SE +/- 0.39, N = 3)
    7742 2P Repeat: 45.16 (SE +/- 0.26, N = 3)
  Compression Level: 3 - Decompression Speed
    EPYC 7742 2P: 10188.3 (SE +/- 80.01, N = 3)
    7742 2P Repeat: 10373.0 (SE +/- 28.45, N = 3)
  Compression Level: 9 - Compression Speed
    EPYC 7742 2P: 44.86 (SE +/- 0.35, N = 3)
    7742 2P Repeat: 43.49 (SE +/- 0.03, N = 3)
  Compression Level: 9 - Decompression Speed
    EPYC 7742 2P: 10418.3 (SE +/- 44.54, N = 3)
    7742 2P Repeat: 10278.8 (SE +/- 45.56, N = 3)
  1. (CC) gcc options: -O3
Zstd Compression 1.4.9 (MB/s, More Is Better)
  Compression Level: 8 - Compression Speed
    EPYC 7742 2P: 1989.9 (SE +/- 78.60, N = 12)
    2 x AMD EPYC 7742 64-Core: 2243.5 (SE +/- 63.80, N = 15)
    7742 2P Repeat: 1992.5 (SE +/- 73.14, N = 15)
  Compression Level: 8 - Decompression Speed
    EPYC 7742 2P: 2975.7 (SE +/- 3.84, N = 11)
    2 x AMD EPYC 7742 64-Core: 2982.6 (SE +/- 2.98, N = 15)
    7742 2P Repeat: 2977.7 (SE +/- 3.43, N = 15)
  Compression Level: 19 - Compression Speed
    EPYC 7742 2P: 70.7 (SE +/- 0.71, N = 3)
    2 x AMD EPYC 7742 64-Core: 69.2 (SE +/- 1.15, N = 15)
    7742 2P Repeat: 70.9 (SE +/- 1.10, N = 15)
  Compression Level: 19 - Decompression Speed
    EPYC 7742 2P: 2792.5 (SE +/- 6.97, N = 3)
    2 x AMD EPYC 7742 64-Core: 2781.5 (SE +/- 2.65, N = 15)
    7742 2P Repeat: 2791.9 (SE +/- 2.86, N = 15)
  Compression Level: 3, Long Mode - Compression Speed
    EPYC 7742 2P: 629.2 (SE +/- 13.86, N = 15)
    7742 2P Repeat: 620.5 (SE +/- 14.50, N = 13)
  Compression Level: 3, Long Mode - Decompression Speed
    EPYC 7742 2P: 3092.0 (SE +/- 2.93, N = 15)
    7742 2P Repeat: 3090.0 (SE +/- 2.35, N = 13)
  Compression Level: 8, Long Mode - Compression Speed
    EPYC 7742 2P: 587.6 (SE +/- 10.51, N = 15)
    2 x AMD EPYC 7742 64-Core: 561.6 (SE +/- 4.02, N = 3)
    7742 2P Repeat: 566.1 (SE +/- 6.00, N = 3)
  Compression Level: 8, Long Mode - Decompression Speed
    EPYC 7742 2P: 3205.6 (SE +/- 3.61, N = 15)
    2 x AMD EPYC 7742 64-Core: 3199.6 (SE +/- 9.37, N = 3)
    7742 2P Repeat: 3205.8 (SE +/- 10.95, N = 3)
  Compression Level: 19, Long Mode - Compression Speed
    EPYC 7742 2P: 32.8 (SE +/- 0.58, N = 15)
    2 x AMD EPYC 7742 64-Core: 33.0 (SE +/- 0.52, N = 12)
    7742 2P Repeat: 33.9 (SE +/- 0.66, N = 12)
  Compression Level: 19, Long Mode - Decompression Speed
    EPYC 7742 2P: 2828.5 (SE +/- 3.12, N = 15)
    2 x AMD EPYC 7742 64-Core: 2824.9 (SE +/- 2.77, N = 12)
    7742 2P Repeat: 2825.8 (SE +/- 4.52, N = 12)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
JPEG XL 0.3.1 (MP/s, More Is Better)
  Input: PNG - Encode Speed: 5
    EPYC 7742 2P: 63.73 (SE +/- 0.58, N = 15)
    7742 2P Repeat: 64.40 (SE +/- 0.52, N = 15)
  Input: PNG - Encode Speed: 7
    EPYC 7742 2P: 9.77 (SE +/- 0.02, N = 3)
    7742 2P Repeat: 9.77 (SE +/- 0.03, N = 3)
  Input: PNG - Encode Speed: 8
    EPYC 7742 2P: 0.70 (SE +/- 0.00, N = 3)
    7742 2P Repeat: 0.70 (SE +/- 0.00, N = 3)
  Input: JPEG - Encode Speed: 5
    EPYC 7742 2P: 51.71 (SE +/- 0.39, N = 11)
    7742 2P Repeat: 53.11 (SE +/- 0.58, N = 3)
  Input: JPEG - Encode Speed: 7
    EPYC 7742 2P: 51.36 (SE +/- 0.17, N = 3)
    7742 2P Repeat: 51.17 (SE +/- 0.33, N = 15)
  Input: JPEG - Encode Speed: 8
    EPYC 7742 2P: 22.91 (SE +/- 0.17, N = 15)
    7742 2P Repeat: 22.72 (SE +/- 0.26, N = 15)
  1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Decoding 0.3.1 (MP/s, More Is Better)
  CPU Threads: All
    EPYC 7742 2P: 99.54 (SE +/- 0.77, N = 3)
    7742 2P Repeat: 99.32 (SE +/- 0.25, N = 3)
srsLTE 20.10.1
  Test: OFDM_Test (Samples / Second, More Is Better)
    EPYC 7742 2P: 98333333 (SE +/- 520683.31, N = 3)
    2P: 101100000 (SE +/- 404145.19, N = 3)
    7742 2P Repeat: 97960000 (SE +/- 1084711.94, N = 5)
  Test: PHY_DL_Test (eNb Mb/s, More Is Better)
    EPYC 7742 2P: 197.7 (SE +/- 0.38, N = 3)
    2P: 207.6 (SE +/- 0.19, N = 3)
    7742 2P Repeat: 204.5 (SE +/- 0.92, N = 3)
  Test: PHY_DL_Test (UE Mb/s, More Is Better)
    EPYC 7742 2P: 83.0 (SE +/- 0.33, N = 3)
    2P: 87.5 (SE +/- 0.03, N = 3)
    7742 2P Repeat: 86.8 (SE +/- 0.21, N = 3)
  1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
LuaJIT 2.1-git (Mflops, More Is Better)
  Test: Composite
    EPYC 7742 2P: 1178.95 (SE +/- 14.63, N = 4)
    7742 2P Repeat: 1200.41 (SE +/- 9.48, N = 15)
  1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector
LibRaw 0.20 (Mpix/sec, More Is Better)
  Post-Processing Benchmark
    EPYC 7742 2P: 30.99 (SE +/- 0.37, N = 4)
    7742 2P Repeat: 29.78 (SE +/- 0.10, N = 3)
  1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
Crafty 25.2 (Nodes Per Second, More Is Better)
  Elapsed Time
    EPYC 7742 2P: 6757787 (SE +/- 11969.14, N = 3)
    7742 2P Repeat: 6778608 (SE +/- 3617.62, N = 3)
  1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
TSCP 1.81 (Nodes Per Second, More Is Better)
  AI Chess Performance
    EPYC 7742 2P: 1031813 (SE +/- 1419.07, N = 5)
    7742 2P Repeat: 1030655 (SE +/- 1440.27, N = 5)
  1. (CC) gcc options: -O3 -march=native
GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
  Operation: Swirl
    EPYC 7742 2P: 1721 (SE +/- 11.05, N = 3)
    7742 2P Repeat: 1730 (SE +/- 14.38, N = 3)
  Operation: Rotate
    EPYC 7742 2P: 543 (SE +/- 4.66, N = 8)
    7742 2P Repeat: 523 (SE +/- 4.25, N = 9)
  Operation: Sharpen
    EPYC 7742 2P: 833
    7742 2P Repeat: 829
    SE +/- 5.33, N = 3 (only one standard error is reported for this test)
  Operation: Enhanced
    EPYC 7742 2P: 1199 (SE +/- 2.40, N = 3)
    7742 2P Repeat: 1203 (SE +/- 0.88, N = 3)
  Operation: Resizing
    EPYC 7742 2P: 69 (SE +/- 0.88, N = 3)
    7742 2P Repeat: 68 (SE +/- 0.88, N = 3)
  Operation: Noise-Gaussian
    EPYC 7742 2P: 650 (SE +/- 6.04, N = 15)
    7742 2P Repeat: 654 (SE +/- 4.16, N = 3)
  Operation: HWB Color Space
    EPYC 7742 2P: 929 (SE +/- 13.26, N = 15)
    7742 2P Repeat: 880 (SE +/- 4.06, N = 3)
  1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
oneDNN 2.0 (ms, Fewer Is Better)
  Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 2.07347 (SE +/- 0.00986, N = 3; MIN: 1.86)
    7742 2P Repeat: 2.06564 (SE +/- 0.00722, N = 3; MIN: 1.87)
  Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 1.54650 (SE +/- 0.02717, N = 15; MIN: 1.18)
    7742 2P Repeat: 2.49866 (SE +/- 0.13368, N = 12; MIN: 1.59)
  Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 2.13329 (SE +/- 0.01302, N = 3; MIN: 1.93)
    7742 2P Repeat: 2.13758 (SE +/- 0.00603, N = 3; MIN: 1.93)
  Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 3.08917 (SE +/- 0.03597, N = 3; MIN: 2.76)
    7742 2P Repeat: 2.88822 (SE +/- 0.01047, N = 3; MIN: 2.66)
  Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 0.724395 (SE +/- 0.008019, N = 3; MIN: 0.67)
    7742 2P Repeat: 1.954610 (SE +/- 0.027365, N = 3; MIN: 1.85)
  Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 2.86937 (SE +/- 0.01849, N = 3; MIN: 2.62)
    7742 2P Repeat: 2.86260 (SE +/- 0.01034, N = 3; MIN: 2.65)
  Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 2.88437 (SE +/- 0.04670, N = 12; MIN: 2.52)
    7742 2P Repeat: 2.70898 (SE +/- 0.00982, N = 3; MIN: 2.55)
  Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 4.68078 (SE +/- 0.15852, N = 13; MIN: 2.73)
    7742 2P Repeat: 5.59055 (SE +/- 0.12818, N = 15; MIN: 4.05)
  Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 2.20539 (SE +/- 0.00840, N = 3; MIN: 2.03)
    7742 2P Repeat: 2.19937 (SE +/- 0.00319, N = 3; MIN: 2.03)
  Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 1.21032 (SE +/- 0.00941, N = 15; MIN: 1.07)
    7742 2P Repeat: 1.20163 (SE +/- 0.01177, N = 6; MIN: 1.06)
  Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 2948.06 (SE +/- 36.76, N = 3; MIN: 2599.44)
    7742 2P Repeat: 3125.87 (SE +/- 60.50, N = 15; MIN: 2374.92)
  Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 1267.20 (SE +/- 15.39, N = 15; MIN: 1101.26)
    7742 2P Repeat: 1355.40 (SE +/- 34.94, N = 15; MIN: 1136.7)
  Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 2910.88 (SE +/- 63.06, N = 15; MIN: 2232.96)
    7742 2P Repeat: 3152.16 (SE +/- 92.93, N = 12; MIN: 2386.06)
  Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 1281.86 (SE +/- 19.98, N = 15; MIN: 1116.9)
    7742 2P Repeat: 1285.17 (SE +/- 14.75, N = 3; MIN: 1199.95)
  Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
    EPYC 7742 2P: 0.712936 (SE +/- 0.003926, N = 3; MIN: 0.65)
    7742 2P Repeat: 0.714842 (SE +/- 0.008252, N = 3; MIN: 0.65)
  Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU
    EPYC 7742 2P: 2923.30 (SE +/- 22.66, N = 3; MIN: 2522.76)
    7742 2P Repeat: 3209.71 (SE +/- 91.94, N = 15; MIN: 2404.69)
  Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU
    EPYC 7742 2P: 1245.81 (SE +/- 17.20, N = 15; MIN: 1111.62)
  Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU
    EPYC 7742 2P: 0.812990 (SE +/- 0.002167, N = 3; MIN: 0.76)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
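Several oneDNN results differ noticeably between the two runs, so it can help to check whether a gap is larger than the reported measurement noise. The following is a minimal sketch, not part of the Phoronix Test Suite: it assumes the two runs are independent, that each SE is listed in the same order as the values it belongs to, and that dividing the difference by the combined standard error is an acceptable rough yardstick. The function name `compare_runs` is hypothetical. The inputs are the f32 Convolution Batch Shapes Auto and Deconvolution Batch shapes_1d figures.

```python
import math

def compare_runs(first, se_first, second, se_second):
    """Compare two reported means.

    Returns (percent change of second relative to first,
             absolute difference divided by the combined standard error).
    Assumes independent runs, so the standard errors add in quadrature.
    """
    diff = second - first
    combined_se = math.sqrt(se_first ** 2 + se_second ** 2)
    return 100.0 * diff / first, abs(diff) / combined_se

# Figures copied from the oneDNN f32 results (ms, fewer is better).
conv_pct, conv_ratio = compare_runs(0.724395, 0.008019, 1.954610, 0.027365)
deconv_pct, deconv_ratio = compare_runs(2.86937, 0.01849, 2.86260, 0.01034)

print(f"Convolution Batch Shapes Auto: {conv_pct:+.1f}%, {conv_ratio:.1f} combined SEs")
print(f"Deconvolution Batch shapes_1d: {deconv_pct:+.1f}%, {deconv_ratio:.1f} combined SEs")
```

Under these assumptions the convolution gap is many times larger than the combined standard error, while the deconvolution gap is well within it. With N = 3 trials per run this is only a coarse indication, not a formal significance test.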
dav1d 0.8.2 (FPS, More Is Better)
  Video Input: Summer Nature 4K
    EPYC 7742 2P: 387.05 (SE +/- 2.34, N = 3; MIN: 84.5 / MAX: 472.33)
  Video Input: Summer Nature 1080p
    EPYC 7742 2P: 1245.91 (SE +/- 12.08, N = 14; MIN: 153.88 / MAX: 1618.67)
  1. (CC) gcc options: -pthread -lm
OSPray 1.8.5 (FPS, More Is Better)
  Demo: San Miguel - Renderer: SciVis
    EPYC 7742 2P: 83.33 (SE +/- 0.00, N = 3; MIN: 26.32 / MAX: 90.91)
  Demo: XFrog Forest - Renderer: SciVis
    EPYC 7742 2P: 19.74 (SE +/- 0.13, N = 3; MIN: 12.82 / MAX: 20.83)
  Demo: San Miguel - Renderer: Path Tracer
    EPYC 7742 2P: 6.76 (SE +/- 0.07, N = 5; MIN: 3.98 / MAX: 7.19)
  Demo: NASA Streamlines - Renderer: SciVis
    EPYC 7742 2P: 125 (MIN: 16.13)
  Demo: XFrog Forest - Renderer: Path Tracer
    EPYC 7742 2P: 10.10 (SE +/- 0.06, N = 3; MIN: 7.81 / MAX: 10.64)
  Demo: Magnetic Reconnection - Renderer: SciVis
    EPYC 7742 2P: 45.45 (SE +/- 0.00, N = 3; MIN: 9.09 / MAX: 47.62)
  Demo: NASA Streamlines - Renderer: Path Tracer
    EPYC 7742 2P: 30.30 (SE +/- 0.00, N = 3; MIN: 11.24 / MAX: 31.25)
  Demo: Magnetic Reconnection - Renderer: Path Tracer
    EPYC 7742 2P: 333.33 (SE +/- 0.00, N = 3; MIN: 37.04 / MAX: 500)
TTSIOD 3D Renderer 2.3b (FPS, More Is Better)
  Phong Rendering With Soft-Shadow Mapping
    EPYC 7742 2P: 581.88 (SE +/- 9.68, N = 15)
  1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
AOM AV1 2.0 (Frames Per Second, More Is Better)
  Encoder Mode: Speed 6 Realtime
    EPYC 7742 2P: 18.47 (SE +/- 0.12, N = 3)
  Encoder Mode: Speed 6 Two-Pass
    EPYC 7742 2P: 3.36 (SE +/- 0.01, N = 3)
  Encoder Mode: Speed 8 Realtime
    EPYC 7742 2P: 31.89 (SE +/- 0.37, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Embree 3.9.0 - Binary: Pathtracer - Model: Crown [Frames Per Second, More Is Better] EPYC 7742 2P: 67.59 (SE +/- 0.29, N = 3; MIN: 62.78 / MAX: 72.69)
Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Crown [Frames Per Second, More Is Better] EPYC 7742 2P: 59.33 (SE +/- 0.51, N = 3; MIN: 55.07 / MAX: 65.52)
Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon [Frames Per Second, More Is Better] EPYC 7742 2P: 44.97 (SE +/- 0.32, N = 3; MIN: 41.97 / MAX: 48.41)
Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon Obj [Frames Per Second, More Is Better] EPYC 7742 2P: 39.05 (SE +/- 0.11, N = 3; MIN: 37.1 / MAX: 42.47)
Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon [Frames Per Second, More Is Better] EPYC 7742 2P: 42.12 (SE +/- 0.14, N = 3; MIN: 39.85 / MAX: 44.87)
Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj [Frames Per Second, More Is Better] EPYC 7742 2P: 36.33 (SE +/- 0.31, N = 3; MIN: 34.33 / MAX: 39.39)
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Medium [Frames Per Second, More Is Better] EPYC 7742 2P: 22.70 (SE +/- 0.19, N = 15)
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Medium [Frames Per Second, More Is Better] EPYC 7742 2P: 64.43 (SE +/- 0.46, N = 3)
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Very Fast [Frames Per Second, More Is Better] EPYC 7742 2P: 40.15 (SE +/- 0.92, N = 15)
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast [Frames Per Second, More Is Better] EPYC 7742 2P: 44.45 (SE +/- 1.77, N = 12)
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Very Fast [Frames Per Second, More Is Better] EPYC 7742 2P: 136.84 (SE +/- 0.43, N = 3)
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast [Frames Per Second, More Is Better] EPYC 7742 2P: 181.14 (SE +/- 2.71, N = 15)
Note 1 (all Kvazaar results): (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
rav1e 0.4 - Speed: 6 [Frames Per Second, More Is Better] EPYC 7742 2P: 1.460 (SE +/- 0.019, N = 3)
rav1e 0.4 - Speed: 10 [Frames Per Second, More Is Better] EPYC 7742 2P: 3.102 (SE +/- 0.019, N = 3)
SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p [Frames Per Second, More Is Better] EPYC 7742 2P: 7.456 (SE +/- 0.065, N = 3)
SVT-AV1 0.8 - Encoder Mode: Enc Mode 8 - Input: 1080p [Frames Per Second, More Is Better] EPYC 7742 2P: 85.79 (SE +/- 0.35, N = 3)
Note 1 (both SVT-AV1 results): (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-VP9 0.1 - Tuning: VMAF Optimized - Input: Bosphorus 1080p [Frames Per Second, More Is Better] EPYC 7742 2P: 340.02 (SE +/- 14.04, N = 12)
SVT-VP9 0.1 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p [Frames Per Second, More Is Better] EPYC 7742 2P: 363.96 (SE +/- 4.38, N = 4)
SVT-VP9 0.1 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p [Frames Per Second, More Is Better] EPYC 7742 2P: 274.56 (SE +/- 0.09, N = 3)
Note 1 (all SVT-VP9 results): (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding 1.8.2 - Speed: Speed 5 [Frames Per Second, More Is Better] EPYC 7742 2P: 20.85 (SE +/- 0.23, N = 15). Note 1: (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
x264 2019-12-17 - H.264 Video Encoding [Frames Per Second, More Is Better] EPYC 7742 2P: 204.01 (SE +/- 2.66, N = 15). Note 1: (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 3.4 - Video Input: Bosphorus 4K [Frames Per Second, More Is Better] EPYC 7742 2P: 18.77 (SE +/- 0.09, N = 3)
x265 3.4 - Video Input: Bosphorus 1080p [Frames Per Second, More Is Better] EPYC 7742 2P: 61.36 (SE +/- 0.62, N = 15)
Note 1 (both x265 results): (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
ACES DGEMM 1.0 - Sustained Floating-Point Rate [GFLOP/s, More Is Better] EPYC 7742 2P: 28.49 (SE +/- 0.14, N = 3). Note 1: (CC) gcc options: -O3 -march=native -fopenmp
Intel Open Image Denoise 1.2.0 - Scene: Memorial [Images / Sec, More Is Better] EPYC 7742 2P: 28.72 (SE +/- 0.20, N = 3)
OpenVKL 0.9 - Benchmark: vklBenchmark [Items / Sec, More Is Better] EPYC 7742 2P: 473 (MIN: 1 / MAX: 1361)
LuxCoreRender 2.3 - Scene: DLSC [M samples/sec, More Is Better] EPYC 7742 2P: 15.03 (SE +/- 0.04, N = 3; MIN: 14.9 / MAX: 15.86)
LuxCoreRender 2.3 - Scene: Rainbow Colors and Prism [M samples/sec, More Is Better] EPYC 7742 2P: 16.89 (SE +/- 0.05, N = 3; MIN: 16.01 / MAX: 17.19)
Himeno Benchmark 3.0 - Poisson Pressure Solver [MFLOPS, More Is Better] EPYC 7742 2P: 3962.21 (SE +/- 19.77, N = 3). Note 1: (CC) gcc options: -O3 -mavx2
7-Zip Compression 16.02 - Compress Speed Test [MIPS, More Is Better] EPYC 7742 2P: 338315 (SE +/- 3983.57, N = 15). Note 1: (CXX) g++ options: -pipe -lpthread
Stockfish 12 - Total Time [Nodes Per Second, More Is Better] EPYC 7742 2P: 190042987 (SE +/- 2342263.11, N = 3). Note 1: (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth [Nodes/second, More Is Better] EPYC 7742 2P: 236093113 (SE +/- 1860963.69, N = 3)
libavif avifenc 0.9.0 - Encoder Speed: 0 [Seconds, Fewer Is Better] EPYC 7742 2P: 60.11 (SE +/- 0.20, N = 3); 2 x AMD EPYC 7742 64-Core: 60.63 (SE +/- 0.76, N = 3)
libavif avifenc 0.9.0 - Encoder Speed: 2 [Seconds, Fewer Is Better] EPYC 7742 2P: 32.72 (SE +/- 0.28, N = 8); 2 x AMD EPYC 7742 64-Core: 32.70 (SE +/- 0.22, N = 15)
libavif avifenc 0.9.0 - Encoder Speed: 6 [Seconds, Fewer Is Better] EPYC 7742 2P: 12.10 (SE +/- 0.15, N = 3); 2 x AMD EPYC 7742 64-Core: 12.96 (SE +/- 0.26, N = 15)
libavif avifenc 0.9.0 - Encoder Speed: 10 [Seconds, Fewer Is Better] EPYC 7742 2P: 4.228 (SE +/- 0.028, N = 3); 2 x AMD EPYC 7742 64-Core: 4.189 (SE +/- 0.048, N = 3)
libavif avifenc 0.9.0 - Encoder Speed: 6, Lossless [Seconds, Fewer Is Better] EPYC 7742 2P: 34.93 (SE +/- 0.07, N = 3); 2 x AMD EPYC 7742 64-Core: 35.43 (SE +/- 0.42, N = 3)
libavif avifenc 0.9.0 - Encoder Speed: 10, Lossless [Seconds, Fewer Is Better] EPYC 7742 2P: 7.587 (SE +/- 0.073, N = 15); 2 x AMD EPYC 7742 64-Core: 7.400 (SE +/- 0.037, N = 3)
Note 1 (all libavif avifenc results): (CXX) g++ options: -O3 -fPIC -lm
Timed Apache Compilation 2.4.41 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 24.68 (SE +/- 0.02, N = 3)
Timed FFmpeg Compilation 4.2.2 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 19.75 (SE +/- 0.09, N = 3)
Timed GCC Compilation 9.3.0 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 715.11 (SE +/- 0.46, N = 3)
Timed GDB GNU Debugger Compilation 9.1 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 91.31 (SE +/- 0.21, N = 3)
Timed Godot Game Engine Compilation 3.2.3 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 61.41 (SE +/- 0.42, N = 3)
Timed ImageMagick Compilation 6.9.0 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 15.67 (SE +/- 0.12, N = 3)
Timed Linux Kernel Compilation 5.10.20 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 21.50 (SE +/- 0.15, N = 14); 2 x AMD EPYC 7742 64-Core: 21.55 (SE +/- 0.13, N = 14)
Timed LLVM Compilation 10.0 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 200.82 (SE +/- 1.00, N = 3)
Timed MPlayer Compilation 1.4 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 10.27 (SE +/- 0.02, N = 3)
Timed PHP Compilation 7.4.2 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 41.67 (SE +/- 0.19, N = 3)
Build2 0.13 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 64.21 (SE +/- 0.18, N = 3)
C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel [Seconds, Fewer Is Better] EPYC 7742 2P: 7.754 (SE +/- 0.165, N = 12). Note 1: (CC) gcc options: -lm -lpthread -O3
POV-Ray 3.7.0.7 - Trace Time [Seconds, Fewer Is Better] EPYC 7742 2P: 8.028 (SE +/- 0.028, N = 3). Note 1: (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Tungsten Renderer 0.2.2 - Scene: Hair [Seconds, Fewer Is Better] EPYC 7742 2P: 5.61696 (SE +/- 0.05055, N = 15)
Tungsten Renderer 0.2.2 - Scene: Water Caustic [Seconds, Fewer Is Better] EPYC 7742 2P: 23.60 (SE +/- 0.32, N = 15)
Tungsten Renderer 0.2.2 - Scene: Non-Exponential [Seconds, Fewer Is Better] EPYC 7742 2P: 1.72256 (SE +/- 0.03014, N = 15)
Tungsten Renderer 0.2.2 - Scene: Volumetric Caustic [Seconds, Fewer Is Better] EPYC 7742 2P: 4.46136 (SE +/- 0.00391, N = 3)
Note 1 (all Tungsten Renderer results): (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
YafaRay 3.4.1 - Total Time For Sample Scene [Seconds, Fewer Is Better] EPYC 7742 2P: 65.66 (SE +/- 0.61, N = 15). Note 1: (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
rays1bench 2020-01-09 - Large Scene [mrays/s, More Is Better] EPYC 7742 2P: 492.78 (SE +/- 2.26, N = 3)
Numpy Benchmark [Score, More Is Better] EPYC 7742 2P: 305.72 (SE +/- 0.81, N = 3)
AOBench - Size: 2048 x 2048 - Total Time [Seconds, Fewer Is Better] EPYC 7742 2P: 39.90 (SE +/- 0.01, N = 3). Note 1: (CC) gcc options: -lm -O3
Timed Eigen Compilation 3.3.9 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 94.91 (SE +/- 0.01, N = 3)
Timed Erlang/OTP Compilation 23.2 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 190.79 (SE +/- 1.54, N = 9); 2 x AMD EPYC 7742 64-Core: 185.82 (SE +/- 0.13, N = 3)
Timed Wasmer Compilation 1.0.2 - Time To Compile [Seconds, Fewer Is Better] EPYC 7742 2P: 68.22 (SE +/- 0.38, N = 3); 2 x AMD EPYC 7742 64-Core: 68.43 (SE +/- 0.18, N = 3). Note 1: (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
Gzip Compression - Linux Source Tree Archiving To .tar.gz [Seconds, Fewer Is Better] EPYC 7742 2P: 41.89 (SE +/- 0.07, N = 3)
XZ Compression 5.2.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 [Seconds, Fewer Is Better] EPYC 7742 2P: 26.58 (SE +/- 0.32, N = 15). Note 1: (CC) gcc options: -pthread -fvisibility=hidden -O2
dcraw - RAW To PPM Image Conversion [Seconds, Fewer Is Better] EPYC 7742 2P: 50.52 (SE +/- 0.03, N = 3). Note 1: (CC) gcc options: -lm
DeepSpeech 0.6 - Acceleration: CPU [Seconds, Fewer Is Better] EPYC 7742 2P: 78.04 (SE +/- 2.69, N = 15)
Monkey Audio Encoding 3.99.6 - WAV To APE [Seconds, Fewer Is Better] EPYC 7742 2P: 14.37 (SE +/- 0.02, N = 5). Note 1: (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
FLAC Audio Encoding 1.3.2 - WAV To FLAC [Seconds, Fewer Is Better] EPYC 7742 2P: 9.830 (SE +/- 0.009, N = 5). Note 1: (CXX) g++ options: -O2 -fvisibility=hidden -logg -lm
LAME MP3 Encoding 3.100 - WAV To MP3 [Seconds, Fewer Is Better] EPYC 7742 2P: 9.103 (SE +/- 0.009, N = 3). Note 1: (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
Ogg Audio Encoding 1.3.4 - WAV To Ogg [Seconds, Fewer Is Better] EPYC 7742 2P: 23.70 (SE +/- 0.04, N = 3). Note 1: (CC) gcc options: -O2 -ffast-math -fsigned-char
Opus Codec Encoding 1.3.1 - WAV To Opus Encode [Seconds, Fewer Is Better] EPYC 7742 2P: 9.150 (SE +/- 0.019, N = 5). Note 1: (CXX) g++ options: -fvisibility=hidden -logg -lm
eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis [Seconds, Fewer Is Better] EPYC 7742 2P: 35.08 (SE +/- 0.10, N = 4). Note 1: (CC) gcc options: -O2 -std=c99
m-queens 1.2 - Time To Solve [Seconds, Fewer Is Better] EPYC 7742 2P: 7.083 (SE +/- 0.092, N = 3). Note 1: (CXX) g++ options: -fopenmp -O2 -march=native
Montage Astronomical Image Mosaic Engine 6.0 - Mosaic of M17, K band, 1.5 deg x 1.5 deg [Seconds, Fewer Is Better] EPYC 7742 2P: 93.08 (SE +/- 0.05, N = 3). Note 1: (CC) gcc options: -std=gnu99 -lcfitsio -lm -O2
N-Queens 1.0 - Elapsed Time [Seconds, Fewer Is Better] EPYC 7742 2P: 1.770 (SE +/- 0.068, N = 15). Note 1: (CC) gcc options: -static -fopenmp -O3 -march=native
Ngspice 34 - Circuit: C2670 [Seconds, Fewer Is Better] EPYC 7742 2P: 169.46 (SE +/- 0.82, N = 3)
Ngspice 34 - Circuit: C7552 [Seconds, Fewer Is Better] EPYC 7742 2P: 130.44 (SE +/- 0.03, N = 3)
Note 1 (both Ngspice results): (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Radiance Benchmark 5.0 - Test: SMP Parallel [Seconds, Fewer Is Better] EPYC 7742 2P: 213.87
RNNoise 2020-06-28 [Seconds, Fewer Is Better] EPYC 7742 2P: 23.14 (SE +/- 0.04, N = 3). Note 1: (CC) gcc options: -O2 -pedantic -fvisibility=hidden
System GZIP Decompression [Seconds, Fewer Is Better] EPYC 7742 2P: 3.600 (SE +/- 0.034, N = 6)
System XZ Decompression [Seconds, Fewer Is Better] EPYC 7742 2P: 4.311 (SE +/- 0.004, N = 3)
Tachyon 0.99b6 - Total Time [Seconds, Fewer Is Better] EPYC 7742 2P: 9.8909 (SE +/- 0.0989, N = 15). Note 1: (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
WebP2 Image Encode 20210126 - Encode Settings: Default [Seconds, Fewer Is Better] EPYC 7742 2P: 3.272 (SE +/- 0.035, N = 15)
WebP2 Image Encode 20210126 - Encode Settings: Quality 75, Compression Effort 7 [Seconds, Fewer Is Better] EPYC 7742 2P: 136.41 (SE +/- 0.15, N = 3)
WebP2 Image Encode 20210126 - Encode Settings: Quality 95, Compression Effort 7 [Seconds, Fewer Is Better] EPYC 7742 2P: 251.43 (SE +/- 0.06, N = 3)
WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Compression Effort 5 [Seconds, Fewer Is Better] EPYC 7742 2P: 7.721 (SE +/- 0.027, N = 3)
WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Lossless Compression [Seconds, Fewer Is Better] EPYC 7742 2P: 440.93 (SE +/- 0.44, N = 3)
Note 1 (all WebP2 Image Encode results): (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif
Google SynthMark 20201109 - Test: VoiceMark_100 [Voices, More Is Better] EPYC 7742 2P: 646.91 (SE +/- 0.61, N = 3). Note 1: (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
System ZLIB Decompression 1.2.7 [ms, Fewer Is Better] EPYC 7742 2P: 2025.98 (SE +/- 6.97, N = 10)
Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 53594667 (SE +/- 22980.67, N = 3); 2P: 53579667 (SE +/- 10477.49, N = 3)
Liquid-DSP 2021.01.31 - Threads: 2 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 107143333 (SE +/- 61191.87, N = 3); 2P: 107166667 (SE +/- 18559.21, N = 3)
Liquid-DSP 2021.01.31 - Threads: 4 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 213280000 (SE +/- 80829.04, N = 3); 2P: 213286667 (SE +/- 107445.08, N = 3)
Liquid-DSP 2021.01.31 - Threads: 8 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 427203333 (SE +/- 101707.64, N = 3); 2P: 427276667 (SE +/- 127322.86, N = 3)
Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 832276667 (SE +/- 1082240.47, N = 3); 2P: 831613333 (SE +/- 620358.32, N = 3)
Liquid-DSP 2021.01.31 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 1616566667 (SE +/- 1822391.59, N = 3); 2P: 1618000000 (SE +/- 1153256.26, N = 3)
Liquid-DSP 2021.01.31 - Threads: 64 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 2703933333 (SE +/- 8434123.81, N = 3); 2P: 2693766667 (SE +/- 16574813.56, N = 3)
Liquid-DSP 2021.01.31 - Threads: 128 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 3135600000 (SE +/- 13159153.97, N = 3); 2P: 3218138462 (SE +/- 79162966.07, N = 13)
Liquid-DSP 2021.01.31 - Threads: 256 - Buffer Length: 256 - Filter Length: 57 [samples/s, More Is Better] EPYC 7742 2P: 5525100000 (SE +/- 29512765.60, N = 3); 2P: 5550733333 (SE +/- 16339556.64, N = 3)
Note 1 (all Liquid-DSP results): (CC) gcc options: -O3 -pthread -lm -lc -lliquid
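The Liquid-DSP results run from 1 to 256 threads, so thread scaling can be read straight off them. The Python sketch below is not part of the result file. It computes speedup and per-thread efficiency relative to the 1-thread result, using the "EPYC 7742 2P" values and assuming the values appear in the same order as the system labels. The names `THROUGHPUT` and `scaling` are hypothetical helpers introduced only for this illustration.

```python
# Liquid-DSP throughput in samples/s for the "EPYC 7742 2P" run, copied from
# the results above (assumption: the first value on each line belongs to the
# first system label on that line).
THROUGHPUT = {
    1: 53594667,
    2: 107143333,
    4: 213280000,
    8: 427203333,
    16: 832276667,
    32: 1616566667,
    64: 2703933333,
    128: 3135600000,
    256: 5525100000,
}


def scaling(throughput):
    """Return (threads, speedup, efficiency) tuples relative to the 1-thread result."""
    base = throughput[1]
    rows = []
    for threads in sorted(throughput):
        speedup = throughput[threads] / base
        # Efficiency is the speedup divided by the thread count (1.0 = ideal).
        rows.append((threads, speedup, speedup / threads))
    return rows


if __name__ == "__main__":
    for threads, speedup, efficiency in scaling(THROUGHPUT):
        print(f"{threads:>4} threads: {speedup:7.2f}x speedup, {efficiency:6.1%} efficiency")
```

The system is listed as 128 cores / 256 threads, so the 256-thread row measures SMT threads rather than additional physical cores, and its per-thread efficiency should be read with that in mind.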
Apache CouchDB 3.1.1 - Bulk Size: 100 - Inserts: 1000 - Rounds: 24 [Seconds, Fewer Is Better] EPYC 7742 2P: 112.56 (SE +/- 0.97, N = 3). Note 1: (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
FinanceBench 2016-07-25 - Benchmark: Repo OpenMP [ms, Fewer Is Better] EPYC 7742 2P: 52054.59 (SE +/- 187.65, N = 3)
FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP [ms, Fewer Is Better] EPYC 7742 2P: 89585.74 (SE +/- 634.52, N = 3)
Note 1 (both FinanceBench results): (CXX) g++ options: -O3 -march=native -fopenmp
ASKAP 1.0 - Test: tConvolve MT - Gridding [Million Grid Points Per Second, More Is Better] EPYC 7742 2P: 5224.98 (SE +/- 48.52, N = 9)
ASKAP 1.0 - Test: tConvolve MT - Degridding [Million Grid Points Per Second, More Is Better] EPYC 7742 2P: 7117.00 (SE +/- 208.57, N = 9)
ASKAP 1.0 - Test: tConvolve MPI - Degridding [Mpix/sec, More Is Better] EPYC 7742 2P: 38291.8 (SE +/- 416.26, N = 3)
ASKAP 1.0 - Test: tConvolve MPI - Gridding [Mpix/sec, More Is Better] EPYC 7742 2P: 37599.7 (SE +/- 223.13, N = 3)
ASKAP 1.0 - Test: tConvolve OpenMP - Gridding [Million Grid Points Per Second, More Is Better] EPYC 7742 2P: 4826.83 (SE +/- 78.34, N = 12)
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding [Million Grid Points Per Second, More Is Better] EPYC 7742 2P: 3991.99 (SE +/- 55.12, N = 12)
ASKAP 1.0 - Test: Hogbom Clean OpenMP [Iterations Per Second, More Is Better] EPYC 7742 2P: 217.82 (SE +/- 2.90, N = 15)
Note 1 (all ASKAP results): (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
libjpeg-turbo tjbench 2.0.2 - Test: Decompression Throughput [Megapixels/sec, More Is Better] EPYC 7742 2P: 172.30 (SE +/- 0.03, N = 3). Note 1: (CC) gcc options: -O3 -rdynamic
LuaRadio 0.9.1 - Test: Five Back to Back FIR Filters [MiB/s, More Is Better] 2P: 653.0 (SE +/- 7.57, N = 4); EPYC 7742 2P: 643.9 (SE +/- 4.40, N = 3); 7742 2P Repeat: 643.8 (SE +/- 3.58, N = 3)
LuaRadio 0.9.1 - Test: FM Deemphasis Filter [MiB/s, More Is Better] 2P: 343.0 (SE +/- 2.83, N = 4); EPYC 7742 2P: 346.8 (SE +/- 0.15, N = 3); 7742 2P Repeat: 347.1 (SE +/- 0.12, N = 3)
LuaRadio 0.9.1 - Test: Hilbert Transform [MiB/s, More Is Better] 2P: 84.4 (SE +/- 0.11, N = 4); EPYC 7742 2P: 84.4 (SE +/- 0.00, N = 3); 7742 2P Repeat: 84.6 (SE +/- 0.03, N = 3)
LuaRadio 0.9.1 - Test: Complex Phase [MiB/s, More Is Better] 2P: 532.5 (SE +/- 0.72, N = 4); EPYC 7742 2P: 532.7 (SE +/- 0.59, N = 3); 7742 2P Repeat: 534.5 (SE +/- 0.76, N = 3)
GNU Radio - Test: Five Back to Back FIR Filters [MiB/s, More Is Better] 2P: 433.2 (SE +/- 12.19, N = 9); EPYC 7742 2P: 400.8 (SE +/- 11.37, N = 9); 7742 2P Repeat: 423.3 (SE +/- 4.29, N = 5)
GNU Radio - Test: Signal Source (Cosine) [MiB/s, More Is Better] 2P: 3040.5 (SE +/- 19.01, N = 9); EPYC 7742 2P: 3032.8 (SE +/- 23.25, N = 9); 7742 2P Repeat: 3090.2 (SE +/- 29.08, N = 5)
GNU Radio - Test: FIR Filter [MiB/s, More Is Better] 2P: 555.8 (SE +/- 1.19, N = 9); EPYC 7742 2P: 554.9 (SE +/- 0.84, N = 9); 7742 2P Repeat: 555.9 (SE +/- 0.41, N = 5)
GNU Radio - Test: IIR Filter [MiB/s, More Is Better] 2P: 505.0 (SE +/- 1.12, N = 9); EPYC 7742 2P: 506.9 (SE +/- 0.97, N = 9); 7742 2P Repeat: 506.9 (SE +/- 0.31, N = 5)
GNU Radio - Test: FM Deemphasis Filter [MiB/s, More Is Better] 2P: 751.2 (SE +/- 8.43, N = 9); EPYC 7742 2P: 744.6 (SE +/- 8.87, N = 9); 7742 2P Repeat: 747.2 (SE +/- 17.28, N = 5)
GNU Radio - Test: Hilbert Transform [MiB/s, More Is Better] 2P: 436.2 (SE +/- 0.53, N = 9); EPYC 7742 2P: 436.5 (SE +/- 0.61, N = 9); 7742 2P Repeat: 437.6 (SE +/- 0.91, N = 5)
Note 1 (all GNU Radio results): 3.8.1.0
toyBrot Fractal Generator 2020-11-18 - Implementation: TBB [ms, Fewer Is Better] 2 x AMD EPYC 7742 64-Core: 3886 (SE +/- 31.51, N = 3); EPYC 7742 2P: 3910 (SE +/- 27.56, N = 12)
toyBrot Fractal Generator 2020-11-18 - Implementation: OpenMP [ms, Fewer Is Better] 2 x AMD EPYC 7742 64-Core: 5141 (SE +/- 99.81, N = 15); EPYC 7742 2P: 5179 (SE +/- 104.40, N = 15)
toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Tasks [ms, Fewer Is Better] 2 x AMD EPYC 7742 64-Core: 4307 (SE +/- 28.75, N = 3); EPYC 7742 2P: 4295 (SE +/- 24.69, N = 3)
toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Threads [ms, Fewer Is Better] 2 x AMD EPYC 7742 64-Core: 4050 (SE +/- 23.63, N = 3); EPYC 7742 2P: 4039 (SE +/- 25.44, N = 3)
Note 1 (all toyBrot results): (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
GROMACS 2021 (OpenBenchmarking.org), Ns Per Day, More Is Better

Input: water_GMX50_bare
  2 x AMD EPYC 7742 64-Core: 8.064 (SE +/- 0.019, N = 3)

1. (CXX) g++ options: -O3 -pthread
Blender 2.92 (OpenBenchmarking.org), Seconds, Fewer Is Better

Blend File: BMW27 - Compute: CPU-Only
  2 x AMD EPYC 7742 64-Core: 24.22 (SE +/- 0.22, N = 3)

Blend File: Classroom - Compute: CPU-Only
  2 x AMD EPYC 7742 64-Core: 49.06 (SE +/- 0.29, N = 3)

Blend File: Fishy Cat - Compute: CPU-Only
  2 x AMD EPYC 7742 64-Core: 36.02 (SE +/- 0.30, N = 3)

Blend File: Barbershop - Compute: CPU-Only
  2 x AMD EPYC 7742 64-Core: 81.79 (SE +/- 0.90, N = 15)

Blend File: Pabellon Barcelona - Compute: CPU-Only
  2 x AMD EPYC 7742 64-Core: 64.56 (SE +/- 0.17, N = 3)
IOR 3.3.0 (OpenBenchmarking.org), MB/s, More Is Better

Block Size: 32MB - Disk Target: Default Test Directory
  7742 2P Repeat: 461.38 (SE +/- 2.10, N = 3; MIN: 406.14 / MAX: 1020.59)

1. (CC) gcc options: -O2 -lm -pthread -lmpi
libgav1 2019-10-05 (OpenBenchmarking.org), FPS, More Is Better

Video Input: Chimera 1080p
  7742 2P Repeat: 51.20 (SE +/- 0.37, N = 11)

1. (CXX) g++ options: -O3 -lpthread
Zstd Compression 1.4.9 (OpenBenchmarking.org), MB/s, More Is Better

Compression Level: 3 - Compression Speed
  7742 2P Repeat: 5053.7 (SE +/- 62.74, N = 3)

Compression Level: 3 - Decompression Speed
  7742 2P Repeat: 2908.5 (SE +/- 3.38, N = 3)

1. (CC) gcc options: -O3 -pthread -lz -llzma
JPEG XL Decoding 0.3.1 (OpenBenchmarking.org), MP/s, More Is Better

CPU Threads: 1
  7742 2P Repeat: 32.97 (SE +/- 0.11, N = 3)
LuaJIT 2.1-git (OpenBenchmarking.org), Mflops, More Is Better

Test: Monte Carlo
  7742 2P Repeat: 412.34 (SE +/- 0.31, N = 3)

Test: Fast Fourier Transform
  7742 2P Repeat: 210.57 (SE +/- 0.69, N = 3)

Test: Sparse Matrix Multiply
  7742 2P Repeat: 1008.92 (SE +/- 6.13, N = 3)

Test: Dense LU Matrix Factorization
  7742 2P Repeat: 2811.62 (SE +/- 173.31, N = 3)

Test: Jacobi Successive Over-Relaxation
  7742 2P Repeat: 1644.11 (SE +/- 0.26, N = 3)

1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector
Phoronix Test Suite v10.8.5