epyc-march

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2103122-HA-EPYCMARCH14
Test categories in this result file:

Audio Encoding 5 Tests
AV1 6 Tests
Bioinformatics 5 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 5 Tests
C++ Boost Tests 6 Tests
Chess Test Suite 7 Tests
Timed Code Compilation 14 Tests
C/C++ Compiler Tests 36 Tests
Compression Tests 10 Tests
CPU Massive 46 Tests
Creator Workloads 49 Tests
Encoding 16 Tests
Finance 2 Tests
Fortran Tests 11 Tests
Game Development 6 Tests
HPC - High Performance Computing 34 Tests
Imaging 10 Tests
LAPACK (Linear Algebra Pack) Tests 4 Tests
Linear Algebra 2 Tests
Machine Learning 5 Tests
Molecular Dynamics 10 Tests
MPI Benchmarks 11 Tests
Multi-Core 58 Tests
NVIDIA GPU Compute 6 Tests
Intel oneAPI 6 Tests
OpenCL 2 Tests
OpenMPI Tests 21 Tests
Programmer / Developer System Benchmarks 18 Tests
Python Tests 7 Tests
Raytracing 6 Tests
Renderers 12 Tests
Rust Tests 2 Tests
Scientific Computing 22 Tests
Software Defined Radio 4 Tests
Server CPU Tests 26 Tests
Single-Threaded 12 Tests
Speech 4 Tests
Telephony 4 Tests
Video Encoding 11 Tests
Common Workstation Benchmarks 5 Tests

Run Management

Result Identifier            Date            Test Duration
EPYC 7742 2P                 March 09 2021   1 Day, 11 Hours, 43 Minutes
2P                           March 11 2021   6 Hours, 49 Minutes
2 x AMD EPYC 7742 64-Core    March 11 2021   2 Hours, 16 Minutes
7742 2P Repeat               March 11 2021   22 Hours, 50 Minutes
Average                                      16 Hours, 55 Minutes
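The 16 Hours, 55 Minutes figure that closes the run list above is consistent with the arithmetic mean of the four test durations. The short Python sketch below checks that arithmetic; the dictionary and the `mean_duration` helper are illustrative names, not part of the Phoronix Test Suite.

```python
from datetime import timedelta

# Test durations copied from the run list above.
durations = {
    "EPYC 7742 2P": timedelta(days=1, hours=11, minutes=43),
    "2P": timedelta(hours=6, minutes=49),
    "2 x AMD EPYC 7742 64-Core": timedelta(hours=2, minutes=16),
    "7742 2P Repeat": timedelta(hours=22, minutes=50),
}


def mean_duration(values):
    """Arithmetic mean of a collection of timedelta objects."""
    values = list(values)
    return sum(values, timedelta()) / len(values)


average = mean_duration(durations.values())
total_minutes = average.total_seconds() / 60
hours, minutes = divmod(total_minutes, 60)
print(f"mean duration: {int(hours)} h {minutes:.1f} min")
```

The mean lands half a minute below 16 h 55 min, so the listed figure looks like a rounded average.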


epyc-march system details (OpenBenchmarking.org, Phoronix Test Suite)

Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)
Motherboard: Supermicro H11DSi-NT v2.00 (2.1 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Monitor: VGA HDMI
Network: 2 x Intel 10G X550T
OS: Ubuntu 20.04
Kernel: 5.8.0-44-generic (x86_64)
Display Server: X Server 1.20.8
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs
- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- EPYC 7742 2P, 7742 2P Repeat: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0x8301034
- EPYC 7742 2P, 7742 2P Repeat: OpenJDK Runtime Environment (build 11.0.10+9-Ubuntu-0ubuntu1.20.04)
- EPYC 7742 2P, 2P, 7742 2P Repeat: Python 2.7.18 + Python 3.8.5
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

[Table: epyc-march result overview for all tests, with columns EPYC 7742 2P, 2P, 2 x AMD EPYC 7742 64-Core, and 7742 2P Repeat. Per-test results follow.]

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
  EPYC 7742 2P:   10.63 (SE +/- 0.12, N = 3; Min: 10.39 / Avg: 10.63 / Max: 10.8)
  7742 2P Repeat: 219.38 (SE +/- 0.14, N = 3; Min: 219.13 / Avg: 219.38 / Max: 219.6)
  1. (CXX) g++ options: -O2 -lOpenCL
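The two runs differ sharply on the OpenMP CFD Solver (10.63 s versus 219.38 s, fewer is better). As a rough aid for reading results like this one, the Python sketch below expresses one run relative to another while respecting the "Fewer Is Better" / "More Is Better" direction. The function name `relative_performance` is hypothetical and not part of the Phoronix Test Suite.

```python
def relative_performance(baseline, other, higher_is_better):
    """Return the performance of `other` relative to `baseline`.

    A value above 1.0 means `other` performed better than `baseline`;
    a value below 1.0 means it performed worse.
    """
    if baseline <= 0 or other <= 0:
        raise ValueError("results must be positive")
    if higher_is_better:
        return other / baseline
    # For time-like metrics, a smaller number is the better result.
    return baseline / other


# Rodinia OpenMP CFD Solver, seconds (fewer is better).
cfd = relative_performance(10.63, 219.38, higher_is_better=False)
print(f"7742 2P Repeat ran at {cfd:.3f}x the speed of EPYC 7742 2P")
print(f"that is about {1 / cfd:.1f}x slower")
```

A gap of this size is far larger than the differences on the other tests in this file, so it may point to a configuration or run anomaly rather than ordinary run-to-run variation.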

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: Convolution Batch Shapes Auto, Data Type: f32, Engine: CPU (ms, Fewer Is Better)
  EPYC 7742 2P:   0.724395 (SE +/- 0.008019, N = 3; MIN: 0.67; Min: 0.71 / Avg: 0.72 / Max: 0.73)
  7742 2P Repeat: 1.954610 (SE +/- 0.027365, N = 3; MIN: 1.85; Min: 1.92 / Avg: 1.95 / Max: 2.01)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Rodinia

Rodinia 3.1, Test: OpenMP HotSpot3D (Seconds, Fewer Is Better)
  EPYC 7742 2P:   112.83 (SE +/- 1.17, N = 15; Min: 103.77 / Avg: 112.83 / Max: 116.54)
  7742 2P Repeat: 150.97 (SE +/- 0.67, N = 3; Min: 149.69 / Avg: 150.97 / Max: 151.98)
  1. (CXX) g++ options: -O2 -lOpenCL

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.26, Backend: Eigen (Nodes Per Second, More Is Better)
  EPYC 7742 2P:   4198 (SE +/- 41.70, N = 3; Min: 4116 / Avg: 4198.33 / Max: 4251)
  7742 2P Repeat: 3512 (SE +/- 49.56, N = 9; Min: 3302 / Avg: 3512.11 / Max: 3740)
  1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.26, Backend: BLAS (Nodes Per Second, More Is Better)
  EPYC 7742 2P:   3936 (SE +/- 49.55, N = 9; Min: 3628 / Avg: 3936.44 / Max: 4170)
  7742 2P Repeat: 3333 (SE +/- 37.92, N = 4; Min: 3250 / Avg: 3333 / Max: 3413)
  1. (CXX) g++ options: -flto -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
  EPYC 7742 2P:   194294.65 (SE +/- 770.77, N = 3; Min: 193269.3 / Avg: 194294.65 / Max: 195804.2)
  7742 2P Repeat: 176465.67 (SE +/- 1600.26, N = 3; Min: 174555.81 / Avg: 176465.67 / Max: 179644.75)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
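One way to judge whether the LU.C gap between the two runs (194294.65 versus 176465.67 Mop/s) exceeds run-to-run noise is to express the difference in units of the combined standard error. The Python sketch below does that under two assumptions that the result file does not state: the runs are independent, and the reported SE values can be combined in quadrature. With only N = 3 samples per run it is a rough indication, not a formal significance test. The function name is illustrative.

```python
import math


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error.

    Assumes independent measurements, so the standard errors add in
    quadrature: sqrt(se_a**2 + se_b**2).
    """
    combined_se = math.hypot(se_a, se_b)
    if combined_se == 0:
        raise ValueError("combined standard error is zero")
    return (mean_a - mean_b) / combined_se


# NPB LU.C, Total Mop/s (more is better), values from the result above.
z = difference_in_standard_errors(194294.65, 770.77, 176465.67, 1600.26)
print(f"difference = {z:.1f} combined standard errors")
```

A value around 10, as here, suggests the gap is well outside the scatter of the individual runs.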

Rodinia

Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
  EPYC 7742 2P:   50.85 (SE +/- 0.49, N = 3; Min: 49.91 / Avg: 50.85 / Max: 51.55)
  7742 2P Repeat: 55.86 (SE +/- 0.43, N = 15; Min: 53.64 / Avg: 55.86 / Max: 59.16)
  1. (CXX) g++ options: -O2 -lOpenCL

C-Blosc

A simple, compressed, fast and persistent data store library for C. Learn more via the OpenBenchmarking.org test page.

C-Blosc 2.0 Beta 5, Compressor: blosclz (MB/s, More Is Better)
  EPYC 7742 2P:   3491.5 (SE +/- 23.71, N = 3; Min: 3445.9 / Avg: 3491.47 / Max: 3525.6)
  7742 2P Repeat: 3788.1 (SE +/- 39.54, N = 3; Min: 3709.4 / Avg: 3788.07 / Max: 3834.4)
  1. (CXX) g++ options: -rdynamic

Rodinia

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
  EPYC 7742 2P:   30.21 (SE +/- 0.27, N = 3; Min: 29.92 / Avg: 30.21 / Max: 30.74)
  7742 2P Repeat: 32.51 (SE +/- 0.11, N = 3; Min: 32.29 / Avg: 32.51 / Max: 32.65)
  1. (CXX) g++ options: -O2 -lOpenCL

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5, Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)
  EPYC 7742 2P:   194.08 (SE +/- 1.09, N = 3; Min: 192 / Avg: 194.08 / Max: 195.68)
  7742 2P Repeat: 208.04 (SE +/- 1.80, N = 3; Min: 204.49 / Avg: 208.04 / Max: 210.32)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

oneDNN

oneDNN 2.0, Harness: IP Shapes 3D, Data Type: u8s8f32, Engine: CPU (ms, Fewer Is Better)
  EPYC 7742 2P:   3.08917 (SE +/- 0.03597, N = 3; MIN: 2.76; Min: 3.02 / Avg: 3.09 / Max: 3.14)
  7742 2P Repeat: 2.88822 (SE +/- 0.01047, N = 3; MIN: 2.66; Min: 2.87 / Avg: 2.89 / Max: 2.9)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
  EPYC 7742 2P:   76051.36 (SE +/- 839.98, N = 4; Min: 73899.67 / Avg: 76051.36 / Max: 77952.09)
  7742 2P Repeat: 71133.34 (SE +/- 262.47, N = 3; Min: 70608.85 / Avg: 71133.34 / Max: 71414.32)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
  EPYC 7742 2P:   8426.04 (SE +/- 14.28, N = 3; Min: 8401.91 / Avg: 8426.04 / Max: 8451.33)
  7742 2P Repeat: 7885.52 (SE +/- 8.01, N = 3; Min: 7871.33 / Avg: 7885.52 / Max: 7899.04)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3

oneDNN

oneDNN 2.0, Harness: Deconvolution Batch shapes_3d, Data Type: f32, Engine: CPU (ms, Fewer Is Better)
  EPYC 7742 2P:   2.88437 (SE +/- 0.04670, N = 12; MIN: 2.52; Min: 2.7 / Avg: 2.88 / Max: 3.3)
  7742 2P Repeat: 2.70898 (SE +/- 0.00982, N = 3; MIN: 2.55; Min: 2.7 / Avg: 2.71 / Max: 2.73)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6, Build: Float + SSE, Size: 2D FFT Size 4096 (Mflops, More Is Better)
  EPYC 7742 2P:   18663
  7742 2P Repeat: 17541 (SE +/- 230.17, N = 3; Min: 17085 / Avg: 17541.33 / Max: 17822)
  1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
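This FFTW result carries only one SE figure (230.17, N = 3) and one Min / Avg / Max triple (17085 / 17541.33 / 17822). With three samples the middle value is fixed by the mean, so the standard error can be recomputed as a consistency check. The Python sketch below assumes that SE here means the sample standard deviation divided by the square root of N; the result file does not spell out the definition, and the helper names are illustrative.

```python
import math
import statistics


def third_sample(minimum, average, maximum):
    """With exactly three samples, recover the middle one from the mean."""
    return 3 * average - minimum - maximum


def standard_error(samples):
    """Sample standard deviation (n - 1 denominator) divided by sqrt(n)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Min / Avg / Max as reported for the run averaging 17541 Mflops.
low, avg, high = 17085, 17541.33, 17822
samples = [low, third_sample(low, avg, high), high]

se = standard_error(samples)
print(f"reconstructed samples: {samples}")
print(f"standard error: {se:.2f}")
```

The recomputed value comes out at about 230.17, which supports reading the SE as belonging to the 7742 2P Repeat run, whose average is 17541.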

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
  EPYC 7742 2P:   929 (SE +/- 13.26, N = 15; Min: 868 / Avg: 929.47 / Max: 997)
  7742 2P Repeat: 880 (SE +/- 4.06, N = 3; Min: 873 / Avg: 879.67 / Max: 887)
  1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

srsLTE 20.10.1, Test: PHY_DL_Test (UE Mb/s, More Is Better)
  EPYC 7742 2P:   83.0 (SE +/- 0.33, N = 3; Min: 82.7 / Avg: 83.03 / Max: 83.7)
  2P:             87.5 (SE +/- 0.03, N = 3; Min: 87.4 / Avg: 87.47 / Max: 87.5)
  7742 2P Repeat: 86.8 (SE +/- 0.21, N = 3; Min: 86.4 / Avg: 86.8 / Max: 87.1)
  1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

srsLTE 20.10.1, Test: PHY_DL_Test (eNb Mb/s, More Is Better)
  EPYC 7742 2P:   197.7 (SE +/- 0.38, N = 3; Min: 197.1 / Avg: 197.7 / Max: 198.4)
  2P:             207.6 (SE +/- 0.19, N = 3; Min: 207.4 / Avg: 207.63 / Max: 208)
  7742 2P Repeat: 204.5 (SE +/- 0.92, N = 3; Min: 202.9 / Avg: 204.53 / Max: 206.1)
  1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

LibRaw 0.20, Post-Processing Benchmark (Mpix/sec, More Is Better)
  EPYC 7742 2P:   30.99 (SE +/- 0.37, N = 4; Min: 30.44 / Avg: 30.99 / Max: 32.07)
  7742 2P Repeat: 29.78 (SE +/- 0.10, N = 3; Min: 29.59 / Avg: 29.78 / Max: 29.91)
  1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

GraphicsMagick

GraphicsMagick 1.3.33, Operation: Rotate (Iterations Per Minute, More Is Better)
  EPYC 7742 2P:   543 (SE +/- 4.66, N = 8; Min: 511 / Avg: 543 / Max: 551)
  7742 2P Repeat: 523 (SE +/- 4.25, N = 9; Min: 490 / Avg: 523 / Max: 532)
  1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software-defined radio (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

srsLTE 20.10.1 - Test: OFDM_Test (Samples / Second, More Is Better)
EPYC 7742 2P: 98333333 (SE +/- 520683.31, N = 3; Min: 97400000 / Avg: 98333333.33 / Max: 99200000)
2P: 101100000 (SE +/- 404145.19, N = 3; Min: 100600000 / Avg: 101100000 / Max: 101900000)
7742 2P Repeat: 97960000 (SE +/- 1084711.94, N = 5; Min: 95700000 / Avg: 97960000 / Max: 101800000)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
EPYC 7742 2P: 44.86 (SE +/- 0.35, N = 3; Min: 44.17 / Avg: 44.86 / Max: 45.22)
7742 2P Repeat: 43.49 (SE +/- 0.03, N = 3; Min: 43.45 / Avg: 43.49 / Max: 43.54)
1. (CC) gcc options: -O3
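
The "SE +/-" figure shown with each result appears to be the standard error of the mean over the N runs. As a rough check, assuming the usual definition (sample standard deviation divided by the square root of N), the three LZ4 level 9 compression runs for EPYC 7742 2P can be approximately reconstructed from the reported Min, Avg, and Max: with N = 3, the middle run is 3 x Avg - Min - Max. The function name below is illustrative only, and the reconstructed middle value is approximate because the reported average is rounded.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Reported for EPYC 7742 2P (MB/s): Min 44.17 / Avg 44.86 / Max 45.22, N = 3.
lo, avg, hi = 44.17, 44.86, 45.22
middle = 3 * avg - lo - hi  # approximate, since the reported average is rounded
runs = [lo, middle, hi]

se = standard_error(runs)
print(f"reconstructed runs: {[round(r, 2) for r in runs]}")
print(f"standard error: {se:.2f}")
```

Under these assumptions the computed value lands close to the reported SE +/- 0.35. The reconstruction trick only works for N = 3; for larger N the individual runs cannot be recovered from Min, Avg, and Max alone.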

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
EPYC 7742 2P: 41060.19 (SE +/- 380.32, N = 3; Min: 40299.57 / Avg: 41060.19 / Max: 41443.93)
7742 2P Repeat: 39810.96 (SE +/- 428.23, N = 5; Min: 38468.72 / Avg: 39810.96 / Max: 41087.7)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
EPYC 7742 2P: 29.00 (SE +/- 0.29, N = 6; Min: 27.96 / Avg: 29 / Max: 29.71)
7742 2P Repeat: 28.18 (SE +/- 0.20, N = 15; Min: 26.71 / Avg: 28.18 / Max: 29.7)
1. (CXX) g++ options: -O3 -pthread -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP Stencil (Seconds, Fewer Is Better)
EPYC 7742 2P: 5.389561 (SE +/- 0.018548, N = 3; Min: 5.35 / Avg: 5.39 / Max: 5.42)
7742 2P Repeat: 5.544628 (SE +/- 0.042058, N = 3; Min: 5.46 / Avg: 5.54 / Max: 5.6)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 5 (MP/s, More Is Better)
EPYC 7742 2P: 51.71 (SE +/- 0.39, N = 11; Min: 50.18 / Avg: 51.71 / Max: 54.21)
7742 2P Repeat: 53.11 (SE +/- 0.58, N = 3; Min: 52.33 / Avg: 53.11 / Max: 54.25)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 - Block Size: 16MB - Disk Target: Default Test Directory (MB/s, More Is Better)
EPYC 7742 2P: 480.63 (SE +/- 1.21, N = 3; MIN: 410.37 / MAX: 1034.82; Min: 478.62 / Avg: 480.63 / Max: 482.81)
7742 2P Repeat: 468.11 (SE +/- 5.31, N = 3; MIN: 411 / MAX: 1031.92; Min: 458.5 / Avg: 468.11 / Max: 476.84)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

Timed Erlang/OTP Compilation

This test times how long it takes to compile Erlang/OTP. Erlang is a programming language and run-time for massively scalable soft real-time systems with high availability requirements. Learn more via the OpenBenchmarking.org test page.

Timed Erlang/OTP Compilation 23.2 - Time To Compile (Seconds, Fewer Is Better)
EPYC 7742 2P: 190.79 (SE +/- 1.54, N = 9; Min: 186.01 / Avg: 190.79 / Max: 199.06)
2 x AMD EPYC 7742 64-Core: 185.82 (SE +/- 0.13, N = 3; Min: 185.67 / Avg: 185.82 / Max: 186.08)

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1 - Pfam Database Search (Seconds, Fewer Is Better)
EPYC 7742 2P: 398.64 (SE +/- 4.02, N = 3; Min: 390.6 / Avg: 398.64 / Max: 402.72)
7742 2P Repeat: 408.90 (SE +/- 5.75, N = 3; Min: 397.54 / Avg: 408.9 / Max: 416.18)
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
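
The identifiers in this result file suggest an original run paired with a repeat, so a natural question is whether a gap such as 398.64 vs 408.90 seconds exceeds run-to-run noise. The following is a minimal sketch, assuming the two runs are independent so that their standard errors combine in quadrature. The helper names are illustrative, and this is a rule-of-thumb check rather than anything the Phoronix Test Suite itself reports.

```python
import math


def relative_change(baseline, repeat):
    """Percent change of the repeat run relative to the baseline."""
    return (repeat - baseline) / baseline * 100.0


def difference_in_se_units(a, se_a, b, se_b):
    """Gap between two means divided by their combined standard error."""
    return abs(a - b) / math.sqrt(se_a ** 2 + se_b ** 2)


# Timed HMMer Search, seconds (fewer is better), as reported in this file.
first, first_se = 398.64, 4.02
repeat, repeat_se = 408.90, 5.75

change = relative_change(first, repeat)
gap = difference_in_se_units(first, first_se, repeat, repeat_se)
print(f"repeat is {change:+.2f}% versus the first run")
print(f"gap is {gap:.2f} combined standard errors")
```

For "Fewer Is Better" tests a positive change means the repeat was slower; for "More Is Better" tests the sign reads the other way. With only three samples per run, a gap of one or two combined standard errors is weak evidence of a real difference.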

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, More Is Better)
EPYC 7742 2P: 9527.72 (SE +/- 65.80, N = 3; Min: 9396.7 / Avg: 9527.72 / Max: 9603.95)
7742 2P Repeat: 9289.07 (SE +/- 29.92, N = 3; Min: 9249.26 / Avg: 9289.07 / Max: 9347.66)
1. (CC) gcc options: -O3

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.9.0 - Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
EPYC 7742 2P: 7.587 (SE +/- 0.073, N = 15; Min: 7.25 / Avg: 7.59 / Max: 8.01)
2 x AMD EPYC 7742 64-Core: 7.400 (SE +/- 0.037, N = 3; Min: 7.34 / Avg: 7.4 / Max: 7.46)
1. (CXX) g++ options: -O3 -fPIC -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP CUTCP (Seconds, Fewer Is Better)
EPYC 7742 2P: 0.831875 (SE +/- 0.009850, N = 3; Min: 0.82 / Avg: 0.83 / Max: 0.85)
7742 2P Repeat: 0.852529 (SE +/- 0.008717, N = 15; Min: 0.78 / Avg: 0.85 / Max: 0.92)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
EPYC 7742 2P: 10.27 (SE +/- 0.14, N = 3; Min: 10 / Avg: 10.27 / Max: 10.45)
7742 2P Repeat: 10.47 (SE +/- 0.11, N = 15; Min: 9.96 / Avg: 10.47 / Max: 11.59)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio - Test: Signal Source (Cosine) (MiB/s, More Is Better)
EPYC 7742 2P: 3032.8 (SE +/- 23.25, N = 9; Min: 2903.6 / Avg: 3032.81 / Max: 3150.3)
2P: 3040.5 (SE +/- 19.01, N = 9; Min: 2941.8 / Avg: 3040.48 / Max: 3109.2)
7742 2P Repeat: 3090.2 (SE +/- 29.08, N = 5; Min: 2974.2 / Avg: 3090.24 / Max: 3126.7)
1. 3.8.1.0

LuaJIT

This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.

LuaJIT 2.1-git - Test: Composite (Mflops, More Is Better)
EPYC 7742 2P: 1178.95 (SE +/- 14.63, N = 4; Min: 1153.31 / Avg: 1178.95 / Max: 1220.64)
7742 2P Repeat: 1200.41 (SE +/- 9.48, N = 15; Min: 1181.61 / Avg: 1200.41 / Max: 1295.07)
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 10188.3 (SE +/- 80.01, N = 3; Min: 10054 / Avg: 10188.27 / Max: 10330.8)
7742 2P Repeat: 10373.0 (SE +/- 28.45, N = 3; Min: 10344.1 / Avg: 10373 / Max: 10429.9)
1. (CC) gcc options: -O3

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 - Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better)
EPYC 7742 2P: 445.10 (SE +/- 1.26, N = 3; MIN: 379.87 / MAX: 836.63; Min: 442.85 / Avg: 445.1 / Max: 447.19)
7742 2P Repeat: 452.03 (SE +/- 4.26, N = 7; MIN: 378.75 / MAX: 1032.92; Min: 438.2 / Avg: 452.03 / Max: 468.25)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

NWChem

NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.

NWChem 7.0.2 - Input: C240 Buckyball (Seconds, Fewer Is Better)
EPYC 7742 2P: 1963.3
7742 2P Repeat: 1933.8
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lcomex -lm -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
EPYC 7742 2P: 5387.5 (SE +/- 13.54, N = 3; Min: 5373.1 / Avg: 5387.53 / Max: 5414.6)
7742 2P Repeat: 5306.8 (SE +/- 54.29, N = 3; Min: 5198.9 / Avg: 5306.77 / Max: 5371.5)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33 - Operation: Resizing (Iterations Per Minute, More Is Better)
EPYC 7742 2P: 69 (SE +/- 0.88, N = 3; Min: 68 / Avg: 69.33 / Max: 71)
7742 2P Repeat: 68 (SE +/- 0.88, N = 3; Min: 67 / Avg: 68.33 / Max: 70)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 4096 (Mflops, More Is Better)
EPYC 7742 2P: 44939 (SE +/- 343.27, N = 3; Min: 44255 / Avg: 44939 / Max: 45332)
7742 2P Repeat: 44294 (SE +/- 125.07, N = 3; Min: 44073 / Avg: 44293.67 / Max: 44506)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

LuaRadio 0.9.1 - Test: Five Back to Back FIR Filters (MiB/s, More Is Better)
EPYC 7742 2P: 643.9 (SE +/- 4.40, N = 3; Min: 635.2 / Avg: 643.9 / Max: 649.4)
2P: 653.0 (SE +/- 7.57, N = 4; Min: 638.4 / Avg: 653 / Max: 671.3)
7742 2P Repeat: 643.8 (SE +/- 3.58, N = 3; Min: 638.1 / Avg: 643.83 / Max: 650.4)

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.9.0 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
EPYC 7742 2P: 34.93 (SE +/- 0.07, N = 3; Min: 34.85 / Avg: 34.93 / Max: 35.08)
2 x AMD EPYC 7742 64-Core: 35.43 (SE +/- 0.42, N = 3; Min: 34.8 / Avg: 35.43 / Max: 36.21)
1. (CXX) g++ options: -O3 -fPIC -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
EPYC 7742 2P: 8223.34 (SE +/- 21.18, N = 3; Min: 8183.48 / Avg: 8223.34 / Max: 8255.69)
7742 2P Repeat: 8108.43 (SE +/- 9.74, N = 3; Min: 8096.22 / Avg: 8108.43 / Max: 8127.68)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1 - Java Test: Jython (msec, Fewer Is Better)
EPYC 7742 2P: 5027 (SE +/- 60.94, N = 4; Min: 4900 / Avg: 5026.5 / Max: 5183)
7742 2P Repeat: 5096 (SE +/- 52.30, N = 4; Min: 5002 / Avg: 5096 / Max: 5200)

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 10418.3 (SE +/- 44.54, N = 3; Min: 10329.3 / Avg: 10418.33 / Max: 10465.5)
7742 2P Repeat: 10278.8 (SE +/- 45.56, N = 3; Min: 10225.9 / Avg: 10278.8 / Max: 10369.5)
1. (CC) gcc options: -O3

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: IS.D (Total Mop/s, More Is Better)
EPYC 7742 2P: 3268.82 (SE +/- 30.48, N = 3; Min: 3232.27 / Avg: 3268.82 / Max: 3329.35)
7742 2P Repeat: 3313.12 (SE +/- 10.37, N = 3; Min: 3295.98 / Avg: 3313.12 / Max: 3331.79)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
EPYC 7742 2P: 0.27952 (SE +/- 0.00196, N = 3; Min: 0.28 / Avg: 0.28 / Max: 0.28)
7742 2P Repeat: 0.28306 (SE +/- 0.00247, N = 3; Min: 0.28 / Avg: 0.28 / Max: 0.29)
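
NAMD reports days/ns (fewer is better), while the LAMMPS result earlier in this file uses the reciprocal unit, ns/day (more is better). A small sketch of the conversion follows; the function name is illustrative. The two programs simulate different systems, so converting units makes the NAMD figure easier to read but does not make the NAMD and LAMMPS numbers comparable.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Invert a days/ns figure to get simulated nanoseconds per day."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# NAMD ATPase results as reported in this file (days/ns).
for label, value in [("EPYC 7742 2P", 0.27952), ("7742 2P Repeat", 0.28306)]:
    print(f"{label}: {days_per_ns_to_ns_per_day(value):.3f} ns/day")
```

Both runs work out to roughly 3.5 ns/day.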

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a new scientific benchmark from Sandia National Labs aimed at supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
EPYC 7742 2P: 25.64 (SE +/- 0.28, N = 5; Min: 24.94 / Avg: 25.64 / Max: 26.66)
7742 2P Repeat: 25.96 (SE +/- 0.20, N = 3; Min: 25.56 / Avg: 25.96 / Max: 26.23)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 11001.5 (SE +/- 91.80, N = 3; Min: 10893.1 / Avg: 11001.47 / Max: 11184)
7742 2P Repeat: 10868.7 (SE +/- 21.34, N = 3; Min: 10834.6 / Avg: 10868.73 / Max: 10908)
1. (CC) gcc options: -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
EPYC 7742 2P: 82
7742 2P Repeat: 83 (SE +/- 0.67, N = 3; Min: 82 / Avg: 82.67 / Max: 84)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 2048 (Mflops, More Is Better)
EPYC 7742 2P: 6160.7 (SE +/- 3.97, N = 3; Min: 6152.8 / Avg: 6160.7 / Max: 6165.3)
7742 2P Repeat: 6086.5 (SE +/- 47.11, N = 3; Min: 5992.5 / Avg: 6086.47 / Max: 6139.5)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

LuaRadio 0.9.1 - Test: FM Deemphasis Filter (MiB/s, More Is Better)
EPYC 7742 2P: 346.8 (SE +/- 0.15, N = 3; Min: 346.5 / Avg: 346.77 / Max: 347)
2P: 343.0 (SE +/- 2.83, N = 4; Min: 334.9 / Avg: 342.95 / Max: 347.2)
7742 2P Repeat: 347.1 (SE +/- 0.12, N = 3; Min: 346.9 / Avg: 347.07 / Max: 347.3)

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 - Block Size: 4MB - Disk Target: Default Test Directory (MB/s, More Is Better)
EPYC 7742 2P: 485.74 (SE +/- 6.27, N = 3; MIN: 400.81 / MAX: 1055.7; Min: 473.98 / Avg: 485.74 / Max: 495.41)
7742 2P Repeat: 480.55 (SE +/- 2.94, N = 3; MIN: 405.25 / MAX: 1028.47; Min: 474.68 / Avg: 480.55 / Max: 483.77)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 2048 (Mflops, More Is Better)
EPYC 7742 2P: 26509 (SE +/- 200.43, N = 12; Min: 24506 / Avg: 26509.17 / Max: 27307)
7742 2P Repeat: 26795 (SE +/- 139.43, N = 3; Min: 26536 / Avg: 26795 / Max: 27014)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL 0.3.1 - Input: PNG - Encode Speed: 5 (MP/s, More Is Better)
EPYC 7742 2P: 63.73 (SE +/- 0.58, N = 15; Min: 59.87 / Avg: 63.73 / Max: 67.2)
7742 2P Repeat: 64.40 (SE +/- 0.52, N = 15; Min: 60.19 / Avg: 64.4 / Max: 67.71)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 98 (SE +/- 0.33, N = 3; Min: 98 / Avg: 98.33 / Max: 99)
7742 2P Repeat: 99
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.9.0 - Encoder Speed: 10 (Seconds, Fewer Is Better)
EPYC 7742 2P: 4.228 (SE +/- 0.028, N = 3; Min: 4.17 / Avg: 4.23 / Max: 4.26)
2 x AMD EPYC 7742 64-Core: 4.189 (SE +/- 0.048, N = 3; Min: 4.12 / Avg: 4.19 / Max: 4.28)
1. (CXX) g++ options: -O3 -fPIC -lm

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 1346 (SE +/- 1.45, N = 3; Min: 1343 / Avg: 1345.67 / Max: 1348)
7742 2P Repeat: 1334 (SE +/- 4.36, N = 3; Min: 1326 / Avg: 1334 / Max: 1341)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio - Test: FM Deemphasis Filter (MiB/s, More Is Better)
EPYC 7742 2P: 744.6 (SE +/- 8.87, N = 9; Min: 699.2 / Avg: 744.58 / Max: 769.1)
2P: 751.2 (SE +/- 8.43, N = 9; Min: 686 / Avg: 751.19 / Max: 770.6)
7742 2P Repeat: 747.2 (SE +/- 17.28, N = 5; Min: 678.3 / Avg: 747.24 / Max: 768.7)
1. 3.8.1.0

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 - Block Size: 8MB - Disk Target: Default Test Directory (MB/s, More Is Better)
EPYC 7742 2P: 489.40 (SE +/- 1.12, N = 3; MIN: 410.57 / MAX: 920.3; Min: 487.16 / Avg: 489.4 / Max: 490.59)
7742 2P Repeat: 485.13 (SE +/- 3.76, N = 3; MIN: 411.08 / MAX: 1041.22; Min: 478.63 / Avg: 485.13 / Max: 491.65)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.9.0 - Encoder Speed: 0 (Seconds, Fewer Is Better)
EPYC 7742 2P: 60.11 (SE +/- 0.20, N = 3; Min: 59.76 / Avg: 60.11 / Max: 60.45)
2 x AMD EPYC 7742 64-Core: 60.63 (SE +/- 0.76, N = 3; Min: 59.12 / Avg: 60.63 / Max: 61.56)
1. (CXX) g++ options: -O3 -fPIC -lm

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL 0.3.1, Input: JPEG - Encode Speed: 8 (MP/s, More Is Better)
EPYC 7742 2P: 22.91 (SE +/- 0.17, N = 15; Min: 21.91 / Avg: 22.91 / Max: 23.99)
7742 2P Repeat: 22.72 (SE +/- 0.26, N = 15; Min: 20.58 / Avg: 22.72 / Max: 24.16)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Incompact3D

Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equations together with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Incompact3D 2020-09-17, Input: Cylinder (Seconds, Fewer Is Better)
EPYC 7742 2P: 345.79 (SE +/- 0.88, N = 3; Min: 344.21 / Avg: 345.79 / Max: 347.24)
7742 2P Repeat: 348.52 (SE +/- 1.44, N = 3; Min: 346.98 / Avg: 348.52 / Max: 351.39)
1. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7742 2P: 1.21032 (SE +/- 0.00941, N = 15; Min: 1.17 / Avg: 1.21 / Max: 1.31; MIN: 1.07)
7742 2P Repeat: 1.20163 (SE +/- 0.01177, N = 6; Min: 1.17 / Avg: 1.2 / Max: 1.25; MIN: 1.06)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8, Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 1497 (SE +/- 0.67, N = 3; Min: 1496 / Avg: 1497.33 / Max: 1498)
7742 2P Repeat: 1487 (SE +/- 0.88, N = 3; Min: 1485 / Avg: 1486.67 / Max: 1488)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
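
As a rough cross-check of how the SE figures relate to the Min / Avg / Max figures: when N = 3, the middle sample can be recovered from the minimum, average, and maximum, and a standard error can then be recomputed. The sketch below does this for the "EPYC 7742 2P" Zstd 8 decompression result. It assumes SE means the sample standard deviation (n - 1 denominator) divided by the square root of N, which this result page does not confirm; the function names are hypothetical.

```python
import math
import statistics

def reconstruct_three(minimum, average, maximum):
    """Recover the three samples of an N = 3 run from its min, avg, and max."""
    middle = 3 * average - minimum - maximum
    return [minimum, middle, maximum]

def standard_error(samples):
    """Assumed definition: sample standard deviation divided by sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))

# lzbench 1.8, Zstd 8 decompression, "EPYC 7742 2P":
# Min: 1496 / Avg: 1497.33 / Max: 1498
samples = reconstruct_three(1496, 1497.33, 1498)
se = standard_error(samples)
print([round(s, 2) for s in samples], round(se, 3))
```

Under that assumption the recomputed value lands close to the reported SE of 0.67. Because the average is only given to two decimals, the reconstruction is approximate.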

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threading Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

toyBrot Fractal Generator 2020-11-18, Implementation: TBB (ms, Fewer Is Better)
EPYC 7742 2P: 3910 (SE +/- 27.56, N = 12; Min: 3805 / Avg: 3910.08 / Max: 4099)
2 x AMD EPYC 7742 64-Core: 3886 (SE +/- 31.51, N = 3; Min: 3823 / Avg: 3886 / Max: 3919)
1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
EPYC 7742 2P: 72806.59 (SE +/- 474.90, N = 3; Min: 72129.29 / Avg: 72806.59 / Max: 73721.9)
7742 2P Repeat: 73254.92 (SE +/- 149.89, N = 3; Min: 72958.71 / Avg: 73254.92 / Max: 73442.98)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
EPYC 7742 2P: 650 (SE +/- 6.04, N = 15; Min: 622 / Avg: 649.87 / Max: 687)
7742 2P Repeat: 654 (SE +/- 4.16, N = 3; Min: 648 / Avg: 654 / Max: 662)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
EPYC 7742 2P: 3.949192 (SE +/- 0.028494, N = 3; Min: 3.91 / Avg: 3.95 / Max: 4)
7742 2P Repeat: 3.926446 (SE +/- 0.047128, N = 3; Min: 3.84 / Avg: 3.93 / Max: 4)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33, Operation: Swirl (Iterations Per Minute, More Is Better)
EPYC 7742 2P: 1721 (SE +/- 11.05, N = 3; Min: 1705 / Avg: 1720.67 / Max: 1742)
7742 2P Repeat: 1730 (SE +/- 14.38, N = 3; Min: 1701 / Avg: 1729.67 / Max: 1746)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
EPYC 7742 2P: 205
7742 2P Repeat: 206 (SE +/- 0.33, N = 3; Min: 206 / Avg: 206.33 / Max: 207)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

Quantum ESPRESSO 6.7, Input: AUSURF112 (Seconds, Fewer Is Better)
EPYC 7742 2P: 1219.36 (SE +/- 4.15, N = 3; Min: 1212.69 / Avg: 1219.36 / Max: 1226.96)
7742 2P Repeat: 1225.26 (SE +/- 3.23, N = 3; Min: 1220.48 / Avg: 1225.26 / Max: 1231.41)
1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33, Operation: Sharpen (Iterations Per Minute, More Is Better)
EPYC 7742 2P: 833
7742 2P Repeat: 829 (SE +/- 5.33, N = 3; Min: 818 / Avg: 828.67 / Max: 834)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31, Threads: 256 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
EPYC 7742 2P: 5525100000 (SE +/- 29512765.60, N = 3; Min: 5468000000 / Avg: 5525100000 / Max: 5566600000)
2P: 5550733333 (SE +/- 16339556.64, N = 3; Min: 5522300000 / Avg: 5550733333.33 / Max: 5578900000)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8, Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
EPYC 7742 2P: 435 (SE +/- 0.88, N = 3; Min: 434 / Avg: 435.33 / Max: 437)
7742 2P Repeat: 437 (SE +/- 0.67, N = 3; Min: 436 / Avg: 437.33 / Max: 438)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: 20k Atoms (ns/day, More Is Better)
EPYC 7742 2P: 31.98 (SE +/- 0.06, N = 3; Min: 31.92 / Avg: 31.98 / Max: 32.09)
7742 2P Repeat: 31.83 (SE +/- 0.06, N = 3; Min: 31.72 / Avg: 31.83 / Max: 31.93)
1. (CXX) g++ options: -O3 -pthread -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1, Encode Settings: Quality 100, Highest Compression (Encode Time - Seconds, Fewer Is Better)
EPYC 7742 2P: 8.904 (SE +/- 0.002, N = 3; Min: 8.9 / Avg: 8.9 / Max: 8.91)
7742 2P Repeat: 8.864 (SE +/- 0.002, N = 3; Min: 8.86 / Avg: 8.86 / Max: 8.87)
1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better)
EPYC 7742 2P: 108.67 (SE +/- 0.20, N = 3; Min: 108.28 / Avg: 108.67 / Max: 108.91)
7742 2P Repeat: 109.16 (SE +/- 0.21, N = 3; Min: 108.91 / Avg: 109.16 / Max: 109.57)
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm -lreadline

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
EPYC 7742 2P: 45.36 (SE +/- 0.39, N = 3; Min: 44.88 / Avg: 45.36 / Max: 46.14)
7742 2P Repeat: 45.16 (SE +/- 0.26, N = 3; Min: 44.64 / Avg: 45.16 / Max: 45.43)
1. (CC) gcc options: -O3

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 2792.5 (SE +/- 6.97, N = 3; Min: 2781.6 / Avg: 2792.53 / Max: 2805.5)
2 x AMD EPYC 7742 64-Core: 2781.5 (SE +/- 2.65, N = 15; Min: 2768.1 / Avg: 2781.47 / Max: 2796)
7742 2P Repeat: 2791.9 (SE +/- 2.86, N = 15; Min: 2767.5 / Avg: 2791.92 / Max: 2811.9)
1. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7742 2P: 2.07347 (SE +/- 0.00986, N = 3; Min: 2.06 / Avg: 2.07 / Max: 2.09; MIN: 1.86)
7742 2P Repeat: 2.06564 (SE +/- 0.00722, N = 3; Min: 2.05 / Avg: 2.07 / Max: 2.08; MIN: 1.87)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31, Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
EPYC 7742 2P: 2703933333 (SE +/- 8434123.81, N = 3; Min: 2695300000 / Avg: 2703933333.33 / Max: 2720800000)
2P: 2693766667 (SE +/- 16574813.56, N = 3; Min: 2676300000 / Avg: 2693766666.67 / Max: 2726900000)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio, Test: IIR Filter (MiB/s, More Is Better)
EPYC 7742 2P: 506.9 (SE +/- 0.97, N = 9; Min: 500.6 / Avg: 506.89 / Max: 510.5)
2P: 505.0 (SE +/- 1.12, N = 9; Min: 501.3 / Avg: 505 / Max: 510.8)
7742 2P Repeat: 506.9 (SE +/- 0.31, N = 5; Min: 506.2 / Avg: 506.88 / Max: 507.9)
1. 3.8.1.0

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

LuaRadio 0.9.1, Test: Complex Phase (MiB/s, More Is Better)
EPYC 7742 2P: 532.7 (SE +/- 0.59, N = 3; Min: 531.6 / Avg: 532.73 / Max: 533.6)
2P: 532.5 (SE +/- 0.72, N = 4; Min: 531.5 / Avg: 532.48 / Max: 534.6)
7742 2P Repeat: 534.5 (SE +/- 0.76, N = 3; Min: 533.1 / Avg: 534.53 / Max: 535.7)

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL 0.3.1, Input: JPEG - Encode Speed: 7 (MP/s, More Is Better)
EPYC 7742 2P: 51.36 (SE +/- 0.17, N = 3; Min: 51.02 / Avg: 51.36 / Max: 51.54)
7742 2P Repeat: 51.17 (SE +/- 0.33, N = 15; Min: 49.61 / Avg: 51.17 / Max: 53.46)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1, Encode Settings: Quality 100, Lossless (Encode Time - Seconds, Fewer Is Better)
EPYC 7742 2P: 20.36 (SE +/- 0.02, N = 3; Min: 20.34 / Avg: 20.36 / Max: 20.4)
7742 2P Repeat: 20.43 (SE +/- 0.04, N = 3; Min: 20.37 / Avg: 20.43 / Max: 20.49)
1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8, Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 571 (SE +/- 1.67, N = 3; Min: 568 / Avg: 571.33 / Max: 573)
7742 2P Repeat: 569
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33, Operation: Enhanced (Iterations Per Minute, More Is Better)
EPYC 7742 2P: 1199 (SE +/- 2.40, N = 3; Min: 1196 / Avg: 1199.33 / Max: 1204)
7742 2P Repeat: 1203 (SE +/- 0.88, N = 3; Min: 1202 / Avg: 1203.33 / Max: 1205)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.21 (MFLOPS, More Is Better)
EPYC 7742 2P: 2015.9 (SE +/- 14.66, N = 3; Min: 1986.7 / Avg: 2015.87 / Max: 2033)
7742 2P Repeat: 2009.2 (SE +/- 13.17, N = 3; Min: 1983.1 / Avg: 2009.2 / Max: 2025.3)
1. (CXX) g++ options: -O3 -march=native -rdynamic

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio, Test: Hilbert Transform (MiB/s, More Is Better)
EPYC 7742 2P: 436.5 (SE +/- 0.61, N = 9; Min: 433.4 / Avg: 436.46 / Max: 438.9)
2P: 436.2 (SE +/- 0.53, N = 9; Min: 434 / Avg: 436.21 / Max: 438.2)
7742 2P Repeat: 437.6 (SE +/- 0.91, N = 5; Min: 434.9 / Avg: 437.62 / Max: 440.2)
1. 3.8.1.0

Timed Wasmer Compilation

This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and Emscripten. This test profile builds Wasmer with the Cranelift and Singlepass compiler features enabled. Learn more via the OpenBenchmarking.org test page.

Timed Wasmer Compilation 1.0.2, Time To Compile (Seconds, Fewer Is Better)
EPYC 7742 2P: 68.22 (SE +/- 0.38, N = 3; Min: 67.51 / Avg: 68.22 / Max: 68.79)
2 x AMD EPYC 7742 64-Core: 68.43 (SE +/- 0.18, N = 3; Min: 68.13 / Avg: 68.43 / Max: 68.75)
1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

Crafty 25.2, Elapsed Time (Nodes Per Second, More Is Better)
EPYC 7742 2P: 6757787 (SE +/- 11969.14, N = 3; Min: 6733857 / Avg: 6757787 / Max: 6770297)
7742 2P Repeat: 6778608 (SE +/- 3617.62, N = 3; Min: 6771408 / Avg: 6778607.67 / Max: 6782828)
1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threading Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

toyBrot Fractal Generator 2020-11-18, Implementation: C++ Tasks (ms, Fewer Is Better)
EPYC 7742 2P: 4295 (SE +/- 24.69, N = 3; Min: 4258 / Avg: 4295.33 / Max: 4342)
2 x AMD EPYC 7742 64-Core: 4307 (SE +/- 28.75, N = 3; Min: 4254 / Avg: 4306.67 / Max: 4353)
1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7742 2P: 2.20539 (SE +/- 0.00840, N = 3; Min: 2.19 / Avg: 2.21 / Max: 2.22; MIN: 2.03)
7742 2P Repeat: 2.19937 (SE +/- 0.00319, N = 3; Min: 2.19 / Avg: 2.2 / Max: 2.2; MIN: 2.03)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threading Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

toyBrot Fractal Generator 2020-11-18, Implementation: C++ Threads (ms, Fewer Is Better)
EPYC 7742 2P: 4039 (SE +/- 25.44, N = 3; Min: 3996 / Avg: 4038.67 / Max: 4084)
2 x AMD EPYC 7742 64-Core: 4050 (SE +/- 23.63, N = 3; Min: 4005 / Avg: 4050 / Max: 4085)
1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7742 2P: 0.712936 (SE +/- 0.003926, N = 3; Min: 0.71 / Avg: 0.71 / Max: 0.72; MIN: 0.65)
7742 2P Repeat: 0.714842 (SE +/- 0.008252, N = 3; Min: 0.7 / Avg: 0.71 / Max: 0.73; MIN: 0.65)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8, Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 381
7742 2P Repeat: 382
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 4096 (Mflops, More Is Better)
EPYC 7742 2P: 6873.9 (SE +/- 6.26, N = 3; Min: 6866.9 / Avg: 6873.9 / Max: 6886.4)
7742 2P Repeat: 6890.6 (SE +/- 9.28, N = 3; Min: 6872.5 / Avg: 6890.63 / Max: 6903.1)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.10.20, Time To Compile (Seconds, Fewer Is Better)
EPYC 7742 2P: 21.50 (SE +/- 0.15, N = 14; Min: 21.17 / Avg: 21.5 / Max: 23.46)
2 x AMD EPYC 7742 64-Core: 21.55 (SE +/- 0.13, N = 14; Min: 21.27 / Avg: 21.55 / Max: 23.27)

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8, Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
EPYC 7742 2P: 415 (SE +/- 1.53, N = 3; Min: 413 / Avg: 415 / Max: 418)
7742 2P Repeat: 416 (SE +/- 0.67, N = 3; Min: 415 / Avg: 415.67 / Max: 417)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

LuaRadio 0.9.1, Test: Hilbert Transform (MiB/s, More Is Better)
EPYC 7742 2P: 84.4 (SE +/- 0.00, N = 3; Min: 84.4 / Avg: 84.4 / Max: 84.4)
2P: 84.4 (SE +/- 0.11, N = 4; Min: 84.1 / Avg: 84.4 / Max: 84.6)
7742 2P Repeat: 84.6 (SE +/- 0.03, N = 3; Min: 84.5 / Avg: 84.57 / Max: 84.6)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7742 2P: 2.86937 (SE +/- 0.01849, N = 3; MIN: 2.62; Min: 2.84 / Avg: 2.87 / Max: 2.91)
7742 2P Repeat: 2.86260 (SE +/- 0.01034, N = 3; MIN: 2.65; Min: 2.85 / Avg: 2.86 / Max: 2.88)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9 - Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 2975.7 (SE +/- 3.84, N = 11; Min: 2958 / Avg: 2975.7 / Max: 2998.6)
2 x AMD EPYC 7742 64-Core: 2982.6 (SE +/- 2.98, N = 15; Min: 2961.4 / Avg: 2982.61 / Max: 3000.9)
7742 2P Repeat: 2977.7 (SE +/- 3.43, N = 15; Min: 2960.3 / Avg: 2977.73 / Max: 2999)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: DXT1 (Mpx/s, More Is Better)
EPYC 7742 2P: 1039.81 (SE +/- 3.71, N = 3; Min: 1032.39 / Avg: 1039.81 / Max: 1043.57)
7742 2P Repeat: 1037.46 (SE +/- 3.65, N = 3; Min: 1033.77 / Avg: 1037.46 / Max: 1044.75)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities, offering better image quality and compression than legacy JPEG. This test profile measures JPEG XL decode performance to a PNG output file; the pts/jpexl test covers encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL Decoding 0.3.1 - CPU Threads: All (MP/s, More Is Better)
EPYC 7742 2P: 99.54 (SE +/- 0.77, N = 3; Min: 98 / Avg: 99.54 / Max: 100.42)
7742 2P Repeat: 99.32 (SE +/- 0.25, N = 3; Min: 99.02 / Avg: 99.32 / Max: 99.83)

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: ETC1 (Mpx/s, More Is Better)
EPYC 7742 2P: 236.72 (SE +/- 0.10, N = 3; Min: 236.52 / Avg: 236.72 / Max: 236.87)
7742 2P Repeat: 237.25 (SE +/- 0.11, N = 3; Min: 237.04 / Avg: 237.24 / Max: 237.41)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better)
EPYC 7742 2P: 14.12 (SE +/- 0.07, N = 3; Min: 14.01 / Avg: 14.12 / Max: 14.26)
7742 2P Repeat: 14.15 (SE +/- 0.08, N = 3; Min: 14.05 / Avg: 14.15 / Max: 14.31)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

lzbench 1.8 - Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 959 (SE +/- 5.51, N = 3; Min: 953 / Avg: 959 / Max: 970)
7742 2P Repeat: 961 (SE +/- 0.67, N = 3; Min: 960 / Avg: 961.33 / Max: 962)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8 - Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
EPYC 7742 2P: 482 (SE +/- 1.53, N = 3; Min: 479 / Avg: 482 / Max: 484)
7742 2P Repeat: 481 (SE +/- 1.86, N = 3; Min: 479 / Avg: 481.33 / Max: 485)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7742 2P: 2.13329 (SE +/- 0.01302, N = 3; MIN: 1.93; Min: 2.12 / Avg: 2.13 / Max: 2.16)
7742 2P Repeat: 2.13758 (SE +/- 0.00603, N = 3; MIN: 1.93; Min: 2.13 / Avg: 2.14 / Max: 2.14)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
EPYC 7742 2P: 224.16 (SE +/- 0.05, N = 3; Min: 224.09 / Avg: 224.16 / Max: 224.26)
7742 2P Repeat: 224.60 (SE +/- 0.03, N = 3; Min: 224.55 / Avg: 224.6 / Max: 224.64)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 3205.6 (SE +/- 3.61, N = 15; Min: 3180.9 / Avg: 3205.6 / Max: 3221.1)
2 x AMD EPYC 7742 64-Core: 3199.6 (SE +/- 9.37, N = 3; Min: 3182.9 / Avg: 3199.63 / Max: 3215.3)
7742 2P Repeat: 3205.8 (SE +/- 10.95, N = 3; Min: 3186 / Avg: 3205.77 / Max: 3223.8)
1. (CC) gcc options: -O3 -pthread -lz -llzma

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio - Test: FIR Filter (MiB/s, More Is Better)
EPYC 7742 2P: 554.9 (SE +/- 0.84, N = 9; Min: 549.3 / Avg: 554.89 / Max: 557.5)
2P: 555.8 (SE +/- 1.19, N = 9; Min: 551.7 / Avg: 555.8 / Max: 561.6)
7742 2P Repeat: 555.9 (SE +/- 0.41, N = 5; Min: 554.4 / Avg: 555.94 / Max: 556.9)
1. 3.8.1.0

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
EPYC 7742 2P: 2828.5 (SE +/- 3.12, N = 15; Min: 2802.3 / Avg: 2828.51 / Max: 2848.3)
2 x AMD EPYC 7742 64-Core: 2824.9 (SE +/- 2.77, N = 12; Min: 2804.7 / Avg: 2824.91 / Max: 2834.8)
7742 2P Repeat: 2825.8 (SE +/- 4.52, N = 12; Min: 2792.5 / Avg: 2825.84 / Max: 2842.4)
1. (CC) gcc options: -O3 -pthread -lz -llzma

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81 - AI Chess Performance (Nodes Per Second, More Is Better)
EPYC 7742 2P: 1031813 (SE +/- 1419.07, N = 5; Min: 1027570 / Avg: 1031813 / Max: 1035296)
7742 2P Repeat: 1030655 (SE +/- 1440.27, N = 5; Min: 1025657 / Avg: 1030655 / Max: 1033354)
1. (CC) gcc options: -O3 -march=native

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

RELION 3.1.1 - Test: Basic - Device: CPU (Seconds, Fewer Is Better)
EPYC 7742 2P: 542.05 (SE +/- 4.40, N = 3; Min: 537.48 / Avg: 542.05 / Max: 550.85)
7742 2P Repeat: 541.47 (SE +/- 5.12, N = 6; Min: 535.98 / Avg: 541.47 / Max: 567.04)
1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
EPYC 7742 2P: 1247427667 (SE +/- 1663713.95, N = 3; Min: 1244398000 / Avg: 1247427666.67 / Max: 1250134000)
7742 2P Repeat: 1246138667 (SE +/- 737641.36, N = 3; Min: 1244946000 / Avg: 1246138666.67 / Max: 1247487000)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility, using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless, Highest Compression (Encode Time - Seconds, Fewer Is Better)
EPYC 7742 2P: 41.98 (SE +/- 0.05, N = 3; Min: 41.91 / Avg: 41.98 / Max: 42.07)
7742 2P Repeat: 41.94 (SE +/- 0.03, N = 3; Min: 41.88 / Avg: 41.94 / Max: 41.98)
1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenFOAM 8 - Input: Motorbike 60M (Seconds, Fewer Is Better)
EPYC 7742 2P: 112.80 (SE +/- 0.06, N = 3; Min: 112.69 / Avg: 112.8 / Max: 112.9)
7742 2P Repeat: 112.70 (SE +/- 0.16, N = 3; Min: 112.45 / Avg: 112.7 / Max: 113)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm