epyc-march

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2103122-HA-EPYCMARCH14
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

Audio Encoding 5 Tests
AV1 6 Tests
Bioinformatics 5 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 5 Tests
C++ Boost Tests 6 Tests
Chess Test Suite 7 Tests
Timed Code Compilation 14 Tests
C/C++ Compiler Tests 36 Tests
Compression Tests 10 Tests
CPU Massive 46 Tests
Creator Workloads 49 Tests
Encoding 16 Tests
Finance 2 Tests
Fortran Tests 11 Tests
Game Development 6 Tests
HPC - High Performance Computing 34 Tests
Imaging 10 Tests
LAPACK (Linear Algebra Pack) Tests 4 Tests
Linear Algebra 2 Tests
Machine Learning 5 Tests
Molecular Dynamics 10 Tests
MPI Benchmarks 11 Tests
Multi-Core 58 Tests
NVIDIA GPU Compute 6 Tests
Intel oneAPI 6 Tests
OpenCL 2 Tests
OpenMPI Tests 21 Tests
Programmer / Developer System Benchmarks 18 Tests
Python Tests 7 Tests
Raytracing 6 Tests
Renderers 12 Tests
Rust Tests 2 Tests
Scientific Computing 22 Tests
Software Defined Radio 4 Tests
Server CPU Tests 26 Tests
Single-Threaded 12 Tests
Speech 4 Tests
Telephony 4 Tests
Video Encoding 11 Tests
Common Workstation Benchmarks 5 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
EPYC 7742 2P
March 09 2021
  1 Day, 11 Hours, 43 Minutes
2P
March 11 2021
  6 Hours, 49 Minutes
2 x AMD EPYC 7742 64-Core
March 11 2021
  2 Hours, 16 Minutes
7742 2P Repeat
March 11 2021
  22 Hours, 50 Minutes
Invert Hiding All Results Option
  16 Hours, 55 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


epyc-march OpenBenchmarking.orgPhoronix Test Suite2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN3841GB Micron_9300_MTFDHAL3T8TDPASPEEDVGA HDMI2 x Intel 10G X550TUbuntu 20.045.8.0-44-generic (x86_64)X Server 1.20.8GCC 9.3.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDisplay ServerCompilerFile-SystemScreen ResolutionEpyc-march BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - EPYC 7742 2P, 7742 2P Repeat: NONE / errors=remount-ro,relatime,rw / Block Size: 4096 - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 - EPYC 7742 2P, 7742 2P Repeat: OpenJDK Runtime Environment (build 11.0.10+9-Ubuntu-0ubuntu1.20.04) - EPYC 7742 2P, 2P, 7742 2P Repeat: Python 2.7.18 + Python 3.8.5- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

epyc-march hpl: askap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingcloverleaf: Lagrangian-Eulerian Hydrodynamicsqe: AUSURF112blosc: blosclzior: 32MB - Default Test Directorygnuradio: Hilbert Transformgnuradio: FM Deemphasis Filtergnuradio: IIR Filtergnuradio: FIR Filtergnuradio: Signal Source (Cosine)gnuradio: Five Back to Back FIR Filtersrelion: Basic - CPUlczero: BLASlczero: Eigenbuild-gcc: Time To Compilelibgav1: Chimera 1080pnwchem: C240 Buckyballopenvkl: vklBenchmarkcompress-7zip: Compress Speed Testhpcg: ior: 16MB - Default Test Directoryblender: Barbershop - CPU-Onlywebp2: Quality 100, Lossless Compressionhmmer: Pfam Database Searchonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUbuild-erlang: Time To Compileonednn: Recurrent Neural Network Training - u8s8f32 - CPUneat: fftw: Float + SSE - 2D FFT Size 4096rodinia: OpenMP HotSpot3Dincompact3d: Cylinderjpegxl: PNG - 8yafaray: Total Time For Sample Scenedeepspeech: CPUnumpy: lammps: 20k Atomsluaradio: Complex Phaseluaradio: Hilbert Transformluaradio: FM Deemphasis Filterluaradio: Five Back to Back FIR Filterslibgav1: Summer Nature 1080pwebp2: Quality 95, Compression Effort 7onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUmocassin: Dust 2D tau100.0fftw: Float + SSE - 2D FFT Size 2048ior: 8MB - Default Test Directoryonednn: Recurrent Neural Network Inference - u8s8f32 - CPUcompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedminife: Smallrodinia: OpenMP Streamclusterfftw: Stock - 2D FFT Size 4096parboil: OpenMP MRI Griddinglibgav1: Summer Nature 4Kbuild-llvm: Time To Compileopenfoam: Motorbike 60Mcompress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedgraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spaceparboil: OpenMP LBMcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedngspice: C2670graphics-magick: Rotatetungsten: Water Causticrodinia: OpenMP Leukocytecompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedvpxenc: Speed 5asmfish: 1024 Hash Memory, 26 Depthqmcpack: simple-H2Ottsiod-renderer: Phong Rendering With Soft-Shadow Mappingwebp2: Quality 75, Compression Effort 7ngspice: C7552compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9kvazaar: Bosphorus 4K - Mediumavifenc: 2ior: 4MB - Default Test Directoryaskap: Hogbom Clean OpenMProdinia: OpenMP CFD Solvercouchdb: 100 - 1000 - 24jpegxl: PNG - 5mrbayes: Primate Phylogeny Analysisior: 2MB - Default Test Directoryospray: San Miguel - Path Tracerfinancebench: Bonds OpenMPbuild-linux-kernel: Time To Compilejpegxl: PNG - 7build-eigen: Time To Compilemontage: Mosaic of M17, K band, 1.5 deg x 1.5 degaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingbuild-gdb: Time To Compilecompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedluajit: Compositejpegxl-decode: 1jpegxl-decode: Alldacapobench: H2kvazaar: Bosphorus 4K - Very Fasttachyon: Total Timeaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingradiance: SMP Paralleldacapobench: Tradebeansbuild-wasmer: Time To Compilecompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedblender: Pabellon Barcelona - CPU-Onlybuild2: Time To Compileluxcorerender: DLSCbuild-godot: Time To Compileluxcorerender: Rainbow Colors and Prismfinancebench: Repo OpenMPgraphics-magick: Resizingavifenc: 0jpegxl: JPEG - 5himeno: Poisson Pressure Solvergraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Swirljpegxl: JPEG - 7amg: srslte: OFDM_Testkvazaar: Bosphorus 4K - Ultra Fastjpegxl: JPEG - 8liquid-dsp: 128 - 256 - 57srslte: PHY_DL_Testsrslte: PHY_DL_Testfftw: Stock - 2D FFT Size 2048dcraw: RAW To PPM Image Conversionblender: Classroom - CPU-Onlyx265: Bosphorus 1080plzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionlibraw: Post-Processing Benchmarkespeak: Text-To-Speech Synthesiswebp: Quality 100, Lossless, Highest Compressiononednn: IP Shapes 3D - f32 - CPUcompress-gzip: Linux Source Tree Archiving To .tar.gzbuild-php: Time To Compilerav1e: 6aobench: 2048 x 2048 - Total Timeavifenc: 6quantlib: blender: Fishy Cat - CPU-Onlycompress-zstd: 3 - Decompression Speedcompress-zstd: 3 - Compression Speedgromacs: water_GMX50_bareavifenc: 6, Losslessetcpak: ETC2aom-av1: Speed 6 Realtimeospray: San Miguel - SciVisc-ray: Total Time - 4K, 16 Rays Per Pixelaom-av1: Speed 6 Two-Passx265: Bosphorus 4Ktungsten: Haircompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer - Asian Dragon Objrodinia: OpenMP LavaMDmafft: Multiple Sequence Alignment - LSU RNAstockfish: Total Timesynthmark: VoiceMark_100onednn: Convolution Batch Shapes Auto - u8s8f32 - CPUrav1e: 10crafty: Elapsed Timeospray: XFrog Forest - Path Traceropenfoam: Motorbike 30Mtoybrot: OpenMPlzbench: Libdeflate 1 - Decompressionlzbench: Libdeflate 1 - Compressionblender: BMW27 - CPU-Onlybuild-apache: Time To Compileencode-ape: WAV To APElzbench: Brotli 2 - Decompressionlzbench: Brotli 2 - Compressionencode-ogg: WAV To Ogglzbench: Brotli 0 - Decompressionlzbench: Brotli 0 - Compressionrnnoise: lzbench: Zstd 8 - Decompressionlzbench: Zstd 8 - Compressionavifenc: 10, Losslesslzbench: Zstd 1 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Crush 0 - Decompressionlzbench: Crush 0 - Compressionetcpak: ETC1 + Ditheringonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUsvt-av1: Enc Mode 4 - 1080petcpak: ETC1webp: Quality 100, Losslessnamd: ATPase Simulation - 327,506 Atomsdolfyn: Computational Fluid Dynamicsliquid-dsp: 256 - 256 - 57liquid-dsp: 8 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 1 - 256 - 57build-ffmpeg: Time To Compileaom-av1: Speed 8 Realtimeffte: N=256, 3D Complex FFT Routinenpb: EP.Dwebp2: Defaultkvazaar: Bosphorus 1080p - Ultra Fastencode-flac: WAV To FLACsvt-vp9: VMAF Optimized - Bosphorus 1080pospray: XFrog Forest - SciVisbuild-imagemagick: Time To Compileembree: Pathtracer ISPC - Asian Dragonencode-opus: WAV To Opus Encodex264: H.264 Video Encodingonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUembree: Pathtracer - Asian Dragonsvt-av1: Enc Mode 8 - 1080pdav1d: Summer Nature 1080ponednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUnpb: LU.Ctungsten: Non-Exponentialonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUembree: Pathtracer ISPC - Crownnpb: IS.Dpovray: Trace Timeospray: NASA Streamlines - Path Tracertoybrot: TBBbuild-mplayer: Time To Compileembree: Pathtracer - Crownkvazaar: Bosphorus 1080p - Mediumonednn: IP Shapes 3D - u8s8f32 - CPUwebp: Quality 100, Highest Compressiondacapobench: Jythonencode-mp3: WAV To MP3n-queens: Elapsed Timedav1d: Summer Nature 4Kmt-dgemm: Sustained Floating-Point Ratewebp2: Quality 100, Compression Effort 5npb: FT.Conednn: Deconvolution Batch shapes_3d - f32 - CPUparboil: OpenMP Stencilfftw: Float + SSE - 1D FFT Size 4096system-decompress-gzip: oidn: Memorialm-queens: Time To Solvesystem-decompress-zlib: pennant: sedovbigfftw: Stock - 1D FFT Size 4096onednn: Convolution Batch Shapes Auto - f32 - CPUlammps: Rhodopsin Proteinnpb: CG.Ctjbench: Decompression Throughputospray: Magnetic Reconnection - SciVissvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080petcpak: DXT1ospray: NASA Streamlines - SciVistungsten: Volumetric Causticpennant: leblancbigkvazaar: Bosphorus 1080p - Very Fastsvt-vp9: Visual Quality Optimized - Bosphorus 1080ptoybrot: C++ Taskssystem-decompress-xz: avifenc: 10toybrot: C++ Threadsrays1bench: Large Scenenpb: MG.Cwebp: Quality 100parboil: OpenMP CUTCPtscp: AI Chess Performancewebp: Defaultospray: Magnetic Reconnection - Path Tracernpb: EP.Cluajit: Jacobi Successive Over-Relaxationluajit: Dense LU Matrix Factorizationluajit: Sparse Matrix Multiplyluajit: Fast Fourier Transformluajit: Monte CarloEPYC 7742 2P2P2 x AMD EPYC 7742 64-Core7742 2P Repeat153.597117.005224.9823.361219.363491.5436.5744.6506.9554.93032.8400.8542.04839364198715.1131963.347333831525.6364480.63440.928398.6351245.811267.20190.7892910.8862.75218663112.829345.7868850.7065.66078.03710305.7231.977532.784.4346.8643.972.74251.4342923.302948.0623926509489.401281.862828.532.811312.669.9705387.5194.08074418.97200.816112.802975.71989.965092951.0181873092.0629.2169.46254323.600850.8502792.570.720.8523609311344.322581.879136.405130.44126.58122.7032.717485.74217.82310.627112.55863.73108.673445.106.7689585.74218721.5009.7794.90993.07737599.738291.891.3093205.6587.61178.9599.54594740.159.89093991.994826.83213.866486668.21910418.344.8610188.345.3664.21315.0361.40716.8952054.5937506960.11251.713962.2148198331199172151.3612474276679833333344.4522.91313560000083.0197.76160.750.51661.36983230.9935.07941.9761.5465041.88841.6681.46039.90212.1032015.934.934139.50118.4783.337.7543.3618.775.6169611001.59527.7236.328339.047430.2110.266190042987646.9054.680783.102675778710.1014.12517995920524.67714.36957116523.69948241523.1421497827.587134643538189224.1642.205392.869377.456236.72220.3600.2795220.2065525100000427203333270393333316165666678322766672132800001071433335359466719.74831.89150712.968196998426.043.272181.149.830340.0219.7415.66742.11859.150204.012.073472.1332944.972785.7861245.910.7129360.812990194294.651.722561.2103259.33233268.828.02830.30391010.26967.586864.433.089178.90450279.1031.770387.0528.4941217.72176051.362.884375.389561449393.60028.727.0832025.9847125.8728486873.90.72439529.00041060.19172.30442345.45363.961039.8131254.461363.949192136.84274.5642954.3114.2284039492.7872806.592.8630.83187510318131.855333.338223.34436.2751.2505.0555.83040.5433.2532.584.4343.0653.0101100000321813846287.5207.65550733333427276667269376666716180000008316133332132866671071666675357966781.79185.8242824.933.02982.62243.52781.569.232.69721.5523199.6561.668.43364.5660.62649.0612.95836.028.06435.432514124.227.400388643074.18940502994.271225.263788.1461.38437.6747.2506.9555.93090.2423.3541.4713333351251.201933.825.9558468.11408.8981355.403152.1685.37317541150.965348.5163880.7031.833534.584.6347.1643.83209.713125.8723926795485.131285.172825.833.98169.1796.1605306.8208.037842112.702977.71992.565488074.0686773090.0620.552355.8612791.970.946.283480.55219.37864.40109.158452.039.773205.8566.11200.4132.9799.326295499110278.843.4910373.045.166853.118291203173051.1712461386679796000022.7286.8204.56086.5993229.7841.9362.498662009.22908.55053.7139.56110868.79289.0732.51110.4685.59055677860814.15961206569165481416148783133443738289224.5992.199372.86260237.24520.4320.2830620.201148613.891462097885.522.065642.137580.714842176465.671.201633313.122.888228.864509671133.342.708985.544628442945.8757146890.61.9546128.18339810.961037.4603.92644673254.922.8640.85252910306551.8548108.431644.112811.621008.92210.57412.34OpenBenchmarking.org

HPL Linpack

HPL is a well known portable Linpack implementation for distributed memory systems. This test profile is testing HPL upstream directly, outside the scope of the HPC Challenge test profile also available through the Phoronix Test Suite (hpcc). The test profile attempts to generate an optimized HPL.dat input file based on the CPU/memory under test. The automated HPL.dat input generation is still being tuned and thus for now this test profile remains "experimental". Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPL Linpack 2.3EPYC 7742 2P306090120150SE +/- 0.42, N = 3153.591. (CC) gcc options: -O2 -lopenblas -lm -pthread -lmpi

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - DegriddingEPYC 7742 2P15003000450060007500SE +/- 208.57, N = 97117.001. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - GriddingEPYC 7742 2P11002200330044005500SE +/- 48.52, N = 95224.981. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsEPYC 7742 2P7742 2P Repeat6001200180024003000SE +/- 0.38, N = 15SE +/- 0.03, N = 323.362994.271. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112EPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 4.15, N = 3SE +/- 3.23, N = 31219.361225.261. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

C-Blosc

A simple, compressed, fast and persistent data store library for C. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterC-Blosc 2.0 Beta 5Compressor: blosclzEPYC 7742 2P7742 2P Repeat8001600240032004000SE +/- 23.71, N = 3SE +/- 39.54, N = 33491.53788.11. (CXX) g++ options: -rdynamic

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 32MB - Disk Target: Default Test Directory7742 2P Repeat100200300400500SE +/- 2.10, N = 3461.38MIN: 406.14 / MAX: 1020.591. (CC) gcc options: -O2 -lm -pthread -lmpi

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform2PEPYC 7742 2P7742 2P Repeat90180270360450SE +/- 0.53, N = 9SE +/- 0.61, N = 9SE +/- 0.91, N = 5436.2436.5437.61. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter2PEPYC 7742 2P7742 2P Repeat160320480640800SE +/- 8.43, N = 9SE +/- 8.87, N = 9SE +/- 17.28, N = 5751.2744.6747.21. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter2PEPYC 7742 2P7742 2P Repeat110220330440550SE +/- 1.12, N = 9SE +/- 0.97, N = 9SE +/- 0.31, N = 5505.0506.9506.91. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter2PEPYC 7742 2P7742 2P Repeat120240360480600SE +/- 1.19, N = 9SE +/- 0.84, N = 9SE +/- 0.41, N = 5555.8554.9555.91. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)2PEPYC 7742 2P7742 2P Repeat7001400210028003500SE +/- 19.01, N = 9SE +/- 23.25, N = 9SE +/- 29.08, N = 53040.53032.83090.21. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters2PEPYC 7742 2P7742 2P Repeat90180270360450SE +/- 12.19, N = 9SE +/- 11.37, N = 9SE +/- 4.29, N = 5433.2400.8423.31. 3.8.1.0

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPUEPYC 7742 2P7742 2P Repeat120240360480600SE +/- 4.40, N = 3SE +/- 5.12, N = 6542.05541.471. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLASEPYC 7742 2P7742 2P Repeat8001600240032004000SE +/- 49.55, N = 9SE +/- 37.92, N = 4393633331. (CXX) g++ options: -flto -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: EigenEPYC 7742 2P7742 2P Repeat9001800270036004500SE +/- 41.70, N = 3SE +/- 49.56, N = 9419835121. (CXX) g++ options: -flto -pthread

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 9.3.0Time To CompileEPYC 7742 2P150300450600750SE +/- 0.46, N = 3715.11

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080p7742 2P Repeat1224364860SE +/- 0.37, N = 1151.201. (CXX) g++ options: -O3 -lpthread

NWChem

NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballEPYC 7742 2P7742 2P Repeat4008001200160020001963.31933.81. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lcomex -lm -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkEPYC 7742 2P100200300400500473MIN: 1 / MAX: 1361

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestEPYC 7742 2P70K140K210K280K350KSE +/- 3983.57, N = 153383151. (CXX) g++ options: -pipe -lpthread

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1EPYC 7742 2P7742 2P Repeat612182430SE +/- 0.28, N = 5SE +/- 0.20, N = 325.6425.961. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 16MB - Disk Target: Default Test DirectoryEPYC 7742 2P7742 2P Repeat100200300400500SE +/- 1.21, N = 3SE +/- 5.31, N = 3480.63468.11MIN: 410.37 / MAX: 1034.82MIN: 411 / MAX: 1031.921. (CC) gcc options: -O2 -lm -pthread -lmpi

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CPU-Only2 x AMD EPYC 7742 64-Core20406080100SE +/- 0.90, N = 1581.79

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionEPYC 7742 2P100200300400500SE +/- 0.44, N = 3440.931. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchEPYC 7742 2P7742 2P Repeat90180270360450SE +/- 4.02, N = 3SE +/- 5.75, N = 3398.64408.901. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUEPYC 7742 2P30060090012001500SE +/- 17.20, N = 151245.81MIN: 1111.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 15.39, N = 15SE +/- 34.94, N = 151267.201355.40MIN: 1101.26MIN: 1136.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Erlang/OTP Compilation

This test times how long it takes to compile Erlang/OTP. Erlang is a programming language and run-time for massively scalable soft real-time systems with high availability requirements. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To CompileEPYC 7742 2P2 x AMD EPYC 7742 64-Core4080120160200SE +/- 1.54, N = 9SE +/- 0.13, N = 3190.79185.82

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat7001400210028003500SE +/- 63.06, N = 15SE +/- 92.93, N = 122910.883152.16MIN: 2232.96MIN: 2386.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29EPYC 7742 2P7742 2P Repeat20406080100SE +/- 8.47, N = 15SE +/- 6.74, N = 1562.7585.371. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096EPYC 7742 2P7742 2P Repeat4K8K12K16K20KSE +/- 230.17, N = 318663175411. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DEPYC 7742 2P7742 2P Repeat306090120150SE +/- 1.17, N = 15SE +/- 0.67, N = 3112.83150.971. (CXX) g++ options: -O2 -lOpenCL

Incompact3D

Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterIncompact3D 2020-09-17Input: CylinderEPYC 7742 2P7742 2P Repeat80160240320400SE +/- 0.88, N = 3SE +/- 1.44, N = 3345.79348.521. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 8EPYC 7742 2P7742 2P Repeat0.15750.3150.47250.630.7875SE +/- 0.00, N = 3SE +/- 0.00, N = 30.700.701. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

YafaRay

YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneEPYC 7742 2P1530456075SE +/- 0.61, N = 1565.661. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDeepSpeech 0.6Acceleration: CPUEPYC 7742 2P20406080100SE +/- 2.69, N = 1578.04

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkEPYC 7742 2P70140210280350SE +/- 0.81, N = 3305.72

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsEPYC 7742 2P7742 2P Repeat714212835SE +/- 0.06, N = 3SE +/- 0.06, N = 331.9831.831. (CXX) g++ options: -O3 -pthread -lm

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase2PEPYC 7742 2P7742 2P Repeat120240360480600SE +/- 0.72, N = 4SE +/- 0.59, N = 3SE +/- 0.76, N = 3532.5532.7534.5

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform2PEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.11, N = 4SE +/- 0.00, N = 3SE +/- 0.03, N = 384.484.484.6

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter2PEPYC 7742 2P7742 2P Repeat80160240320400SE +/- 2.83, N = 4SE +/- 0.15, N = 3SE +/- 0.12, N = 3343.0346.8347.1

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters2PEPYC 7742 2P7742 2P Repeat140280420560700SE +/- 7.57, N = 4SE +/- 4.40, N = 3SE +/- 3.58, N = 3653.0643.9643.8

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 1080pEPYC 7742 2P1632486480SE +/- 0.95, N = 1572.741. (CXX) g++ options: -O3 -lpthread

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7EPYC 7742 2P50100150200250SE +/- 0.06, N = 3251.431. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUEPYC 7742 2P7742 2P Repeat7001400210028003500SE +/- 22.66, N = 3SE +/- 91.94, N = 152923.303209.71MIN: 2522.76MIN: 2404.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat7001400210028003500SE +/- 36.76, N = 3SE +/- 60.50, N = 152948.063125.87MIN: 2599.44MIN: 2374.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Monte Carlo Simulations of Ionised Nebulae

Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0EPYC 7742 2P7742 2P Repeat50100150200250SE +/- 0.33, N = 32392391. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048EPYC 7742 2P7742 2P Repeat6K12K18K24K30KSE +/- 200.43, N = 12SE +/- 139.43, N = 326509267951. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectoryEPYC 7742 2P7742 2P Repeat110220330440550SE +/- 1.12, N = 3SE +/- 3.76, N = 3489.40485.13MIN: 410.57 / MAX: 920.3MIN: 411.08 / MAX: 1041.221. (CC) gcc options: -O2 -lm -pthread -lmpi

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 19.98, N = 15SE +/- 14.75, N = 31281.861285.17MIN: 1116.9MIN: 1199.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat6001200180024003000SE +/- 3.12, N = 15SE +/- 2.77, N = 12SE +/- 4.52, N = 122828.52824.92825.81. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat816243240SE +/- 0.58, N = 15SE +/- 0.52, N = 12SE +/- 0.66, N = 1232.833.033.91. (CC) gcc options: -O3 -pthread -lz -llzma

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 414.96, N = 15SE +/- 70.94, N = 311312.668169.171. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.138, N = 15SE +/- 4.228, N = 129.97096.1601. (CXX) g++ options: -O2 -lOpenCL

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096EPYC 7742 2P7742 2P Repeat12002400360048006000SE +/- 13.54, N = 3SE +/- 54.29, N = 35387.55306.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingEPYC 7742 2P7742 2P Repeat50100150200250SE +/- 1.09, N = 3SE +/- 1.80, N = 3194.08208.041. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 4KEPYC 7742 2P510152025SE +/- 0.04, N = 318.971. (CXX) g++ options: -O3 -lpthread

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileEPYC 7742 2P4080120160200SE +/- 1.00, N = 3200.82

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MEPYC 7742 2P7742 2P Repeat306090120150SE +/- 0.06, N = 3SE +/- 0.16, N = 3112.80112.701. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat6001200180024003000SE +/- 3.84, N = 11SE +/- 2.98, N = 15SE +/- 3.43, N = 152975.72982.62977.71. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat5001000150020002500SE +/- 78.60, N = 12SE +/- 63.80, N = 15SE +/- 73.14, N = 151989.92243.51992.51. (CC) gcc options: -O3 -pthread -lz -llzma

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianEPYC 7742 2P7742 2P Repeat140280420560700SE +/- 6.04, N = 15SE +/- 4.16, N = 36506541. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceEPYC 7742 2P7742 2P Repeat2004006008001000SE +/- 13.26, N = 15SE +/- 4.06, N = 39298801. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMEPYC 7742 2P7742 2P Repeat1632486480SE +/- 1.30, N = 15SE +/- 0.77, N = 351.0274.071. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression SpeedEPYC 7742 2P7742 2P Repeat7001400210028003500SE +/- 2.93, N = 15SE +/- 2.35, N = 133092.03090.01. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression SpeedEPYC 7742 2P7742 2P Repeat140280420560700SE +/- 13.86, N = 15SE +/- 14.50, N = 13629.2620.51. (CC) gcc options: -O3 -pthread -lz -llzma

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670EPYC 7742 2P4080120160200SE +/- 0.82, N = 3169.461. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateEPYC 7742 2P7742 2P Repeat120240360480600SE +/- 4.66, N = 8SE +/- 4.25, N = 95435231. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticEPYC 7742 2P612182430SE +/- 0.32, N = 1523.601. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteEPYC 7742 2P7742 2P Repeat1326395265SE +/- 0.49, N = 3SE +/- 0.43, N = 1550.8555.861. (CXX) g++ options: -O2 -lOpenCL

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat6001200180024003000SE +/- 6.97, N = 3SE +/- 2.65, N = 15SE +/- 2.86, N = 152792.52781.52791.91. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat1632486480SE +/- 0.71, N = 3SE +/- 1.15, N = 15SE +/- 1.10, N = 1570.769.270.91. (CC) gcc options: -O3 -pthread -lz -llzma

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 5EPYC 7742 2P510152025SE +/- 0.23, N = 1520.851. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthEPYC 7742 2P50M100M150M200M250MSE +/- 1860963.69, N = 3236093113

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2OEPYC 7742 2P7742 2P Repeat1020304050SE +/- 0.29, N = 3SE +/- 1.61, N = 1544.3246.281. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingEPYC 7742 2P130260390520650SE +/- 9.68, N = 15581.881. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7EPYC 7742 2P306090120150SE +/- 0.15, N = 3136.411. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552EPYC 7742 2P306090120150SE +/- 0.03, N = 3130.441. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9EPYC 7742 2P612182430SE +/- 0.32, N = 1526.581. (CC) gcc options: -pthread -fvisibility=hidden -O2

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: MediumEPYC 7742 2P510152025SE +/- 0.19, N = 1522.701. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2EPYC 7742 2P2 x AMD EPYC 7742 64-Core816243240SE +/- 0.28, N = 8SE +/- 0.22, N = 1532.7232.701. (CXX) g++ options: -O3 -fPIC -lm

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 4MB - Disk Target: Default Test DirectoryEPYC 7742 2P7742 2P Repeat110220330440550SE +/- 6.27, N = 3SE +/- 2.94, N = 3485.74480.55MIN: 400.81 / MAX: 1055.7MIN: 405.25 / MAX: 1028.471. (CC) gcc options: -O2 -lm -pthread -lmpi

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPEPYC 7742 2P50100150200250SE +/- 2.90, N = 15217.821. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverEPYC 7742 2P7742 2P Repeat50100150200250SE +/- 0.12, N = 3SE +/- 0.14, N = 310.63219.381. (CXX) g++ options: -O2 -lOpenCL

Apache CouchDB

This is a bulk insertion benchmark of Apache CouchDB. CouchDB is a document-oriented NoSQL database implemented in Erlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.1.1Bulk Size: 100 - Inserts: 1000 - Rounds: 24EPYC 7742 2P306090120150SE +/- 0.97, N = 3112.561. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 5EPYC 7742 2P7742 2P Repeat1428425670SE +/- 0.58, N = 15SE +/- 0.52, N = 1563.7364.401. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.20, N = 3SE +/- 0.21, N = 3108.67109.161. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm -lreadline

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectoryEPYC 7742 2P7742 2P Repeat100200300400500SE +/- 1.26, N = 3SE +/- 4.26, N = 7445.10452.03MIN: 379.87 / MAX: 836.63MIN: 378.75 / MAX: 1032.921. (CC) gcc options: -O2 -lm -pthread -lmpi

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerEPYC 7742 2P246810SE +/- 0.07, N = 56.76MIN: 3.98 / MAX: 7.19

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPEPYC 7742 2P20K40K60K80K100KSE +/- 634.52, N = 389585.741. (CXX) g++ options: -O3 -march=native -fopenmp

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To CompileEPYC 7742 2P2 x AMD EPYC 7742 64-Core510152025SE +/- 0.15, N = 14SE +/- 0.13, N = 1421.5021.55

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 7EPYC 7742 2P7742 2P Repeat3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 39.779.771. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To CompileEPYC 7742 2P20406080100SE +/- 0.01, N = 394.91

Montage Astronomical Image Mosaic Engine

Montage is an open-source astronomical image mosaic engine. This BSD-licensed astronomy software is developed by the California Institute of Technology, Pasadena. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMontage Astronomical Image Mosaic Engine 6.0Mosaic of M17, K band, 1.5 deg x 1.5 degEPYC 7742 2P20406080100SE +/- 0.05, N = 393.081. (CC) gcc options: -std=gnu99 -lcfitsio -lm -O2

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingEPYC 7742 2P8K16K24K32K40KSE +/- 223.13, N = 337599.71. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingEPYC 7742 2P8K16K24K32K40KSE +/- 416.26, N = 338291.81. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileEPYC 7742 2P20406080100SE +/- 0.21, N = 391.31

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat7001400210028003500SE +/- 3.61, N = 15SE +/- 9.37, N = 3SE +/- 10.95, N = 33205.63199.63205.81. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression SpeedEPYC 7742 2P2 x AMD EPYC 7742 64-Core7742 2P Repeat130260390520650SE +/- 10.51, N = 15SE +/- 4.02, N = 3SE +/- 6.00, N = 3587.6561.6566.11. (CC) gcc options: -O3 -pthread -lz -llzma

LuaJIT

This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: CompositeEPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 14.63, N = 4SE +/- 9.48, N = 151178.951200.411. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: 17742 2P Repeat816243240SE +/- 0.11, N = 332.97

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: AllEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.77, N = 3SE +/- 0.25, N = 399.5499.32

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2EPYC 7742 2P7742 2P Repeat13002600390052006500SE +/- 109.01, N = 20SE +/- 160.58, N = 2059476295

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastEPYC 7742 2P918273645SE +/- 0.92, N = 1540.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeEPYC 7742 2P3691215SE +/- 0.0989, N = 159.89091. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - DegriddingEPYC 7742 2P9001800270036004500SE +/- 55.12, N = 123991.991. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - GriddingEPYC 7742 2P10002000300040005000SE +/- 78.34, N = 124826.831. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Radiance Benchmark

This is a benchmark of NREL Radiance, a synthetic imaging system that is open-source and developed by the Lawrence Berkeley National Laboratory in California. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SMP ParallelEPYC 7742 2P50100150200250213.87

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: TradebeansEPYC 7742 2P7742 2P Repeat11002200330044005500SE +/- 98.30, N = 17SE +/- 116.87, N = 2048664991

Timed Wasmer Compilation

This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and EmScripten. This test profile builds Wasmer with the Cranelift and Singlepast compiler features enabled. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Wasmer Compilation 1.0.2Time To CompileEPYC 7742 2P2 x AMD EPYC 7742 64-Core1530456075SE +/- 0.38, N = 3SE +/- 0.18, N = 368.2268.431. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 44.54, N = 3SE +/- 45.56, N = 310418.310278.81. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedEPYC 7742 2P7742 2P Repeat1020304050SE +/- 0.35, N = 3SE +/- 0.03, N = 344.8643.491. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 80.01, N = 3SE +/- 28.45, N = 310188.310373.01. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedEPYC 7742 2P7742 2P Repeat1020304050SE +/- 0.39, N = 3SE +/- 0.26, N = 345.3645.161. (CC) gcc options: -O3

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CPU-Only2 x AMD EPYC 7742 64-Core1428425670SE +/- 0.17, N = 364.56

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileEPYC 7742 2P1428425670SE +/- 0.18, N = 364.21

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCEPYC 7742 2P48121620SE +/- 0.04, N = 315.03MIN: 14.9 / MAX: 15.86

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileEPYC 7742 2P1428425670SE +/- 0.42, N = 361.41

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismEPYC 7742 2P48121620SE +/- 0.05, N = 316.89MIN: 16.01 / MAX: 17.19

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPEPYC 7742 2P11K22K33K44K55KSE +/- 187.65, N = 352054.591. (CXX) g++ options: -O3 -march=native -fopenmp

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingEPYC 7742 2P7742 2P Repeat1530456075SE +/- 0.88, N = 3SE +/- 0.88, N = 369681. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0EPYC 7742 2P2 x AMD EPYC 7742 64-Core1428425670SE +/- 0.20, N = 3SE +/- 0.76, N = 360.1160.631. (CXX) g++ options: -O3 -fPIC -lm

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 5EPYC 7742 2P7742 2P Repeat1224364860SE +/- 0.39, N = 11SE +/- 0.58, N = 351.7153.111. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverEPYC 7742 2P8001600240032004000SE +/- 19.77, N = 33962.211. (CC) gcc options: -O3 -mavx2

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenEPYC 7742 2P7742 2P Repeat2004006008001000SE +/- 5.33, N = 38338291. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedEPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 2.40, N = 3SE +/- 0.88, N = 3119912031. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlEPYC 7742 2P7742 2P Repeat400800120016002000SE +/- 11.05, N = 3SE +/- 14.38, N = 3172117301. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 7EPYC 7742 2P7742 2P Repeat1224364860SE +/- 0.17, N = 3SE +/- 0.33, N = 1551.3651.171. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2EPYC 7742 2P7742 2P Repeat300M600M900M1200M1500MSE +/- 1663713.95, N = 3SE +/- 737641.36, N = 3124742766712461386671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_TestEPYC 7742 2P2P7742 2P Repeat20M40M60M80M100MSE +/- 520683.31, N = 3SE +/- 404145.19, N = 3SE +/- 1084711.94, N = 598333333101100000979600001. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastEPYC 7742 2P1020304050SE +/- 1.77, N = 1244.451. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 8EPYC 7742 2P7742 2P Repeat510152025SE +/- 0.17, N = 15SE +/- 0.26, N = 1522.9122.721. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P700M1400M2100M2800M3500MSE +/- 13159153.97, N = 3SE +/- 79162966.07, N = 13313560000032181384621. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_TestEPYC 7742 2P2P7742 2P Repeat20406080100SE +/- 0.33, N = 3SE +/- 0.03, N = 3SE +/- 0.21, N = 383.087.586.81. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_TestEPYC 7742 2P2P7742 2P Repeat50100150200250SE +/- 0.38, N = 3SE +/- 0.19, N = 3SE +/- 0.92, N = 3197.7207.6204.51. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048EPYC 7742 2P7742 2P Repeat13002600390052006500SE +/- 3.97, N = 3SE +/- 47.11, N = 36160.76086.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

dcraw

This test times how long it takes to convert several high-resolution RAW NEF image files to PPM image format using dcraw. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterdcrawRAW To PPM Image ConversionEPYC 7742 2P1122334455SE +/- 0.03, N = 350.521. (CC) gcc options: -lm

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CPU-Only2 x AMD EPYC 7742 64-Core1122334455SE +/- 0.29, N = 349.06

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pEPYC 7742 2P1428425670SE +/- 0.62, N = 1561.361. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: DecompressionEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.33, N = 398991. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: CompressionEPYC 7742 2P7742 2P Repeat71421283532321. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkEPYC 7742 2P7742 2P Repeat714212835SE +/- 0.37, N = 4SE +/- 0.10, N = 330.9929.781. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisEPYC 7742 2P816243240SE +/- 0.10, N = 435.081. (CC) gcc options: -O2 -std=c99

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionEPYC 7742 2P7742 2P Repeat1020304050SE +/- 0.05, N = 3SE +/- 0.03, N = 341.9841.941. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.56221.12441.68662.24882.811SE +/- 0.02717, N = 15SE +/- 0.13368, N = 121.546502.49866MIN: 1.18MIN: 1.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Gzip Compression

This test measures the time needed to archive/compress two copies of the Linux 4.13 kernel source tree using Gzip compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGzip CompressionLinux Source Tree Archiving To .tar.gzEPYC 7742 2P1020304050SE +/- 0.07, N = 341.89

Timed PHP Compilation

This test times how long it takes to build PHP 7. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To CompileEPYC 7742 2P1020304050SE +/- 0.19, N = 341.67

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6EPYC 7742 2P0.32850.6570.98551.3141.6425SE +/- 0.019, N = 31.460

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeEPYC 7742 2P918273645SE +/- 0.01, N = 339.901. (CC) gcc options: -lm -O3

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6EPYC 7742 2P2 x AMD EPYC 7742 64-Core3691215SE +/- 0.15, N = 3SE +/- 0.26, N = 1512.1012.961. (CXX) g++ options: -O3 -fPIC -lm

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21EPYC 7742 2P7742 2P Repeat400800120016002000SE +/- 14.66, N = 3SE +/- 13.17, N = 32015.92009.21. (CXX) g++ options: -O3 -march=native -rdynamic

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-Only2 x AMD EPYC 7742 64-Core816243240SE +/- 0.30, N = 336.02

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Decompression Speed7742 2P Repeat6001200180024003000SE +/- 3.38, N = 32908.51. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression Speed7742 2P Repeat11002200330044005500SE +/- 62.74, N = 35053.71. (CC) gcc options: -O3 -pthread -lz -llzma

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare2 x AMD EPYC 7742 64-Core246810SE +/- 0.019, N = 38.0641. (CXX) g++ options: -O3 -pthread

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessEPYC 7742 2P2 x AMD EPYC 7742 64-Core816243240SE +/- 0.07, N = 3SE +/- 0.42, N = 334.9335.431. (CXX) g++ options: -O3 -fPIC -lm

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2EPYC 7742 2P7742 2P Repeat306090120150SE +/- 0.00, N = 3SE +/- 0.07, N = 3139.50139.561. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 RealtimeEPYC 7742 2P510152025SE +/- 0.12, N = 318.471. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisEPYC 7742 2P20406080100SE +/- 0.00, N = 383.33MIN: 26.32 / MAX: 90.91

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelEPYC 7742 2P246810SE +/- 0.165, N = 127.7541. (CC) gcc options: -lm -lpthread -O3

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-PassEPYC 7742 2P0.7561.5122.2683.0243.78SE +/- 0.01, N = 33.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KEPYC 7742 2P510152025SE +/- 0.09, N = 318.771. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairEPYC 7742 2P1.26382.52763.79145.05526.319SE +/- 0.05055, N = 155.616961. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 91.80, N = 3SE +/- 21.34, N = 311001.510868.71. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 65.80, N = 3SE +/- 29.92, N = 39527.729289.071. (CC) gcc options: -O3

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon ObjEPYC 7742 2P816243240SE +/- 0.31, N = 336.33MIN: 34.33 / MAX: 39.39

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon ObjEPYC 7742 2P918273645SE +/- 0.11, N = 339.05MIN: 37.1 / MAX: 42.47

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDEPYC 7742 2P7742 2P Repeat816243240SE +/- 0.27, N = 3SE +/- 0.11, N = 330.2132.511. (CXX) g++ options: -O2 -lOpenCL

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAEPYC 7742 2P7742 2P Repeat3691215SE +/- 0.14, N = 3SE +/- 0.11, N = 1510.2710.471. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeEPYC 7742 2P40M80M120M160M200MSE +/- 2342263.11, N = 31900429871. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100EPYC 7742 2P140280420560700SE +/- 0.61, N = 3646.911. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat1.25792.51583.77375.03166.2895SE +/- 0.15852, N = 13SE +/- 0.12818, N = 154.680785.59055MIN: 2.73MIN: 4.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10EPYC 7742 2P0.6981.3962.0942.7923.49SE +/- 0.019, N = 33.102

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeEPYC 7742 2P7742 2P Repeat1.5M3M4.5M6M7.5MSE +/- 11969.14, N = 3SE +/- 3617.62, N = 3675778767786081. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerEPYC 7742 2P3691215SE +/- 0.06, N = 310.10MIN: 7.81 / MAX: 10.64

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MEPYC 7742 2P7742 2P Repeat48121620SE +/- 0.07, N = 3SE +/- 0.08, N = 314.1214.151. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP2 x AMD EPYC 7742 64-CoreEPYC 7742 2P11002200330044005500SE +/- 99.81, N = 15SE +/- 104.40, N = 15514151791. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: DecompressionEPYC 7742 2P7742 2P Repeat2004006008001000SE +/- 5.51, N = 3SE +/- 0.67, N = 39599611. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: CompressionEPYC 7742 2P7742 2P Repeat50100150200250SE +/- 0.33, N = 32052061. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-Only2 x AMD EPYC 7742 64-Core612182430SE +/- 0.22, N = 324.22

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To CompileEPYC 7742 2P612182430SE +/- 0.02, N = 324.68

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APEEPYC 7742 2P48121620SE +/- 0.02, N = 514.371. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: DecompressionEPYC 7742 2P7742 2P Repeat120240360480600SE +/- 1.67, N = 35715691. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: CompressionEPYC 7742 2P7742 2P Repeat4080120160200SE +/- 0.58, N = 3SE +/- 0.67, N = 31651651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Ogg Audio Encoding

This test times how long it takes to encode a sample WAV file to Ogg format using the reference Xiph.org tools/libraries. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To OggEPYC 7742 2P612182430SE +/- 0.04, N = 323.701. (CC) gcc options: -O2 -ffast-math -fsigned-char

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: DecompressionEPYC 7742 2P7742 2P Repeat100200300400500SE +/- 1.53, N = 3SE +/- 1.86, N = 34824811. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: CompressionEPYC 7742 2P7742 2P Repeat90180270360450SE +/- 1.53, N = 3SE +/- 0.67, N = 34154161. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28EPYC 7742 2P612182430SE +/- 0.04, N = 323.141. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: DecompressionEPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 0.67, N = 3SE +/- 0.88, N = 3149714871. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: CompressionEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.67, N = 382831. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessEPYC 7742 2P2 x AMD EPYC 7742 64-Core246810SE +/- 0.073, N = 15SE +/- 0.037, N = 37.5877.4001. (CXX) g++ options: -O3 -fPIC -lm

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: DecompressionEPYC 7742 2P7742 2P Repeat30060090012001500SE +/- 1.45, N = 3SE +/- 4.36, N = 3134613341. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: CompressionEPYC 7742 2P7742 2P Repeat90180270360450SE +/- 0.88, N = 3SE +/- 0.67, N = 34354371. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: DecompressionEPYC 7742 2P7742 2P Repeat801602403204003813821. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: CompressionEPYC 7742 2P7742 2P Repeat20406080100SE +/- 0.67, N = 389891. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringEPYC 7742 2P7742 2P Repeat50100150200250SE +/- 0.05, N = 3SE +/- 0.03, N = 3224.16224.601. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.49620.99241.48861.98482.481SE +/- 0.00840, N = 3SE +/- 0.00319, N = 32.205392.19937MIN: 2.03MIN: 2.031. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.64561.29121.93682.58243.228SE +/- 0.01849, N = 3SE +/- 0.01034, N = 32.869372.86260MIN: 2.62MIN: 2.651. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pEPYC 7742 2P246810SE +/- 0.065, N = 37.4561. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1EPYC 7742 2P7742 2P Repeat50100150200250SE +/- 0.10, N = 3SE +/- 0.11, N = 3236.72237.251. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessEPYC 7742 2P7742 2P Repeat510152025SE +/- 0.02, N = 3SE +/- 0.04, N = 320.3620.431. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsEPYC 7742 2P7742 2P Repeat0.06370.12740.19110.25480.3185SE +/- 0.00196, N = 3SE +/- 0.00247, N = 30.279520.28306

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsEPYC 7742 2P7742 2P Repeat510152025SE +/- 0.04, N = 3SE +/- 0.04, N = 320.2120.20

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P1200M2400M3600M4800M6000MSE +/- 29512765.60, N = 3SE +/- 16339556.64, N = 3552510000055507333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P90M180M270M360M450MSE +/- 101707.64, N = 3SE +/- 127322.86, N = 34272033334272766671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P600M1200M1800M2400M3000MSE +/- 8434123.81, N = 3SE +/- 16574813.56, N = 3270393333326937666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P300M600M900M1200M1500MSE +/- 1822391.59, N = 3SE +/- 1153256.26, N = 3161656666716180000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P200M400M600M800M1000MSE +/- 1082240.47, N = 3SE +/- 620358.32, N = 38322766678316133331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P50M100M150M200M250MSE +/- 80829.04, N = 3SE +/- 107445.08, N = 32132800002132866671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P20M40M60M80M100MSE +/- 61191.87, N = 3SE +/- 18559.21, N = 31071433331071666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57EPYC 7742 2P2P11M22M33M44M55MSE +/- 22980.67, N = 3SE +/- 10477.49, N = 353594667535796671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To CompileEPYC 7742 2P510152025SE +/- 0.09, N = 319.75

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 RealtimeEPYC 7742 2P714212835SE +/- 0.37, N = 331.891. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineEPYC 7742 2P7742 2P Repeat30K60K90K120K150KSE +/- 3136.68, N = 12SE +/- 3906.26, N = 15150712.97148613.891. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 14.28, N = 3SE +/- 8.01, N = 38426.047885.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: DefaultEPYC 7742 2P0.73621.47242.20862.94483.681SE +/- 0.035, N = 153.2721. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastEPYC 7742 2P4080120160200SE +/- 2.71, N = 15181.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACEPYC 7742 2P3691215SE +/- 0.009, N = 59.8301. (CXX) g++ options: -O2 -fvisibility=hidden -logg -lm

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pEPYC 7742 2P70140210280350SE +/- 14.04, N = 12340.021. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVisEPYC 7742 2P510152025SE +/- 0.13, N = 319.74MIN: 12.82 / MAX: 20.83

Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileEPYC 7742 2P48121620SE +/- 0.12, N = 315.67

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian DragonEPYC 7742 2P1020304050SE +/- 0.14, N = 342.12MIN: 39.85 / MAX: 44.87

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeEPYC 7742 2P3691215SE +/- 0.019, N = 59.1501. (CXX) g++ options: -fvisibility=hidden -logg -lm

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingEPYC 7742 2P4080120160200SE +/- 2.66, N = 15204.011. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.46650.9331.39951.8662.3325SE +/- 0.00986, N = 3SE +/- 0.00722, N = 32.073472.06564MIN: 1.86MIN: 1.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.4810.9621.4431.9242.405SE +/- 0.01302, N = 3SE +/- 0.00603, N = 32.133292.13758MIN: 1.93MIN: 1.931. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian DragonEPYC 7742 2P1020304050SE +/- 0.32, N = 344.97MIN: 41.97 / MAX: 48.41

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pEPYC 7742 2P20406080100SE +/- 0.35, N = 385.791. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pEPYC 7742 2P30060090012001500SE +/- 12.08, N = 141245.91MIN: 153.88 / MAX: 1618.671. (CC) gcc options: -pthread -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.16080.32160.48240.64320.804SE +/- 0.003926, N = 3SE +/- 0.008252, N = 30.7129360.714842MIN: 0.65MIN: 0.651. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P0.18290.36580.54870.73160.9145SE +/- 0.002167, N = 30.812990MIN: 0.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CEPYC 7742 2P7742 2P Repeat40K80K120K160K200KSE +/- 770.77, N = 3SE +/- 1600.26, N = 3194294.65176465.671. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Non-ExponentialEPYC 7742 2P0.38760.77521.16281.55041.938SE +/- 0.03014, N = 151.722561. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.27230.54460.81691.08921.3615SE +/- 0.00941, N = 15SE +/- 0.01177, N = 61.210321.20163MIN: 1.07MIN: 1.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: CrownEPYC 7742 2P1326395265SE +/- 0.51, N = 359.33MIN: 55.07 / MAX: 65.52

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DEPYC 7742 2P7742 2P Repeat7001400210028003500SE +/- 30.48, N = 3SE +/- 10.37, N = 33268.823313.121. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeEPYC 7742 2P246810SE +/- 0.028, N = 38.0281. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerEPYC 7742 2P714212835SE +/- 0.00, N = 330.30MIN: 11.24 / MAX: 31.25

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBB2 x AMD EPYC 7742 64-CoreEPYC 7742 2P8001600240032004000SE +/- 31.51, N = 3SE +/- 27.56, N = 12388639101. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

Timed MPlayer Compilation

This test times how long it takes to build the MPlayer open-source media player program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.4Time To CompileEPYC 7742 2P3691215SE +/- 0.02, N = 310.27

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: CrownEPYC 7742 2P1530456075SE +/- 0.29, N = 367.59MIN: 62.78 / MAX: 72.69

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: MediumEPYC 7742 2P1428425670SE +/- 0.46, N = 364.431. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.69511.39022.08532.78043.4755SE +/- 0.03597, N = 3SE +/- 0.01047, N = 33.089172.88822MIN: 2.76MIN: 2.661. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionEPYC 7742 2P7742 2P Repeat246810SE +/- 0.002, N = 3SE +/- 0.002, N = 38.9048.8641. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonEPYC 7742 2P7742 2P Repeat11002200330044005500SE +/- 60.94, N = 4SE +/- 52.30, N = 450275096

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3EPYC 7742 2P3691215SE +/- 0.009, N = 39.1031. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm

N-Queens

This is a test of the OpenMP version of a test that solves the N-queens problem. The board problem size is 18. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeEPYC 7742 2P0.39830.79661.19491.59321.9915SE +/- 0.068, N = 151.7701. (CC) gcc options: -static -fopenmp -O3 -march=native

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KEPYC 7742 2P80160240320400SE +/- 2.34, N = 3387.05MIN: 84.5 / MAX: 472.331. (CC) gcc options: -pthread -lm

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateEPYC 7742 2P714212835SE +/- 0.14, N = 328.491. (CC) gcc options: -O3 -march=native -fopenmp

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5EPYC 7742 2P246810SE +/- 0.027, N = 37.7211. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CEPYC 7742 2P7742 2P Repeat16K32K48K64K80KSE +/- 839.98, N = 4SE +/- 262.47, N = 376051.3671133.341. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.6491.2981.9472.5963.245SE +/- 0.04670, N = 12SE +/- 0.00982, N = 32.884372.70898MIN: 2.52MIN: 2.551. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilEPYC 7742 2P7742 2P Repeat1.24752.4953.74254.996.2375SE +/- 0.018548, N = 3SE +/- 0.042058, N = 35.3895615.5446281. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096EPYC 7742 2P7742 2P Repeat10K20K30K40K50KSE +/- 343.27, N = 3SE +/- 125.07, N = 344939442941. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

System GZIP Decompression

This simple test measures the time to decompress a gzipped tarball (the Qt5 toolkit source package). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSystem GZIP DecompressionEPYC 7742 2P0.811.622.433.244.05SE +/- 0.034, N = 63.600

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialEPYC 7742 2P714212835SE +/- 0.20, N = 328.72

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveEPYC 7742 2P246810SE +/- 0.092, N = 37.0831. (CXX) g++ options: -fopenmp -O2 -march=native

System ZLIB Decompression

This test measures the time to decompress a Linux kernel tarball using ZLIB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterSystem ZLIB Decompression 1.2.7EPYC 7742 2P400800120016002000SE +/- 6.97, N = 102025.98

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigEPYC 7742 2P7742 2P Repeat1.3222.6443.9665.2886.61SE +/- 0.011805, N = 3SE +/- 0.025529, N = 35.8728485.8757141. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096EPYC 7742 2P7742 2P Repeat15003000450060007500SE +/- 6.26, N = 3SE +/- 9.28, N = 36873.96890.61. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUEPYC 7742 2P7742 2P Repeat0.43980.87961.31941.75922.199SE +/- 0.008019, N = 3SE +/- 0.027365, N = 30.7243951.954610MIN: 0.67MIN: 1.851. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinEPYC 7742 2P7742 2P Repeat714212835SE +/- 0.29, N = 6SE +/- 0.20, N = 1529.0028.181. (CXX) g++ options: -O3 -pthread -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CEPYC 7742 2P7742 2P Repeat9K18K27K36K45KSE +/- 380.32, N = 3SE +/- 428.23, N = 541060.1939810.961. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputEPYC 7742 2P4080120160200SE +/- 0.03, N = 3172.301. (CC) gcc options: -O3 -rdynamic

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVisEPYC 7742 2P1020304050SE +/- 0.00, N = 345.45MIN: 9.09 / MAX: 47.62

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pEPYC 7742 2P80160240320400SE +/- 4.38, N = 4363.961. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1EPYC 7742 2P7742 2P Repeat2004006008001000SE +/- 3.71, N = 3SE +/- 3.65, N = 31039.811037.461. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisEPYC 7742 2P306090120150125MIN: 16.13

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticEPYC 7742 2P1.00382.00763.01144.01525.019SE +/- 0.00391, N = 34.461361. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigEPYC 7742 2P7742 2P Repeat0.88861.77722.66583.55444.443SE +/- 0.028494, N = 3SE +/- 0.047128, N = 33.9491923.9264461. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastEPYC 7742 2P306090120150SE +/- 0.43, N = 3136.841. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pEPYC 7742 2P60120180240300SE +/- 0.09, N = 3274.561. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks2 x AMD EPYC 7742 64-CoreEPYC 7742 2P9001800270036004500SE +/- 28.75, N = 3SE +/- 24.69, N = 3430742951. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

System XZ Decompression

This test measures the time to decompress a Linux kernel tarball using XZ. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSystem XZ DecompressionEPYC 7742 2P0.971.942.913.884.85SE +/- 0.004, N = 34.311

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10EPYC 7742 2P2 x AMD EPYC 7742 64-Core0.95131.90262.85393.80524.7565SE +/- 0.028, N = 3SE +/- 0.048, N = 34.2284.1891. (CXX) g++ options: -O3 -fPIC -lm

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads2 x AMD EPYC 7742 64-CoreEPYC 7742 2P9001800270036004500SE +/- 23.63, N = 3SE +/- 25.44, N = 3405040391. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

rays1bench

This is a test of rays1bench, a simple path-tracer / ray-tracing that supports SSE and AVX instructions, multi-threading, and other features. This test profile is measuring the performance of the "large scene" in rays1bench. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmrays/s, More Is Betterrays1bench 2020-01-09Large SceneEPYC 7742 2P110220330440550SE +/- 2.26, N = 3492.78

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CEPYC 7742 2P7742 2P Repeat16K32K48K64K80KSE +/- 474.90, N = 3SE +/- 149.89, N = 372806.5973254.921. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100EPYC 7742 2P7742 2P Repeat0.64441.28881.93322.57763.222SE +/- 0.001, N = 3SE +/- 0.003, N = 32.8632.8641. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPEPYC 7742 2P7742 2P Repeat0.19180.38360.57540.76720.959SE +/- 0.009850, N = 3SE +/- 0.008717, N = 150.8318750.8525291. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceEPYC 7742 2P7742 2P Repeat200K400K600K800K1000KSE +/- 1419.07, N = 5SE +/- 1440.27, N = 5103181310306551. (CC) gcc options: -O3 -march=native

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultEPYC 7742 2P7742 2P Repeat0.41740.83481.25221.66962.087SE +/- 0.003, N = 3SE +/- 0.002, N = 31.8551.8541. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerEPYC 7742 2P70140210280350SE +/- 0.00, N = 3333.33MIN: 37.04 / MAX: 500

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CEPYC 7742 2P7742 2P Repeat2K4K6K8K10KSE +/- 21.18, N = 3SE +/- 9.74, N = 38223.348108.431. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

LuaJIT

This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Jacobi Successive Over-Relaxation7742 2P Repeat400800120016002000SE +/- 0.26, N = 31644.111. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Dense LU Matrix Factorization7742 2P Repeat6001200180024003000SE +/- 173.31, N = 32811.621. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Sparse Matrix Multiply7742 2P Repeat2004006008001000SE +/- 6.13, N = 31008.921. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Fast Fourier Transform7742 2P Repeat50100150200250SE +/- 0.69, N = 3210.571. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Monte Carlo7742 2P Repeat90180270360450SE +/- 0.31, N = 3412.341. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

290 Results Shown

HPL Linpack
ASKAP:
  tConvolve MT - Degridding
  tConvolve MT - Gridding
CloverLeaf
Quantum ESPRESSO
C-Blosc
IOR
GNU Radio:
  Hilbert Transform
  FM Deemphasis Filter
  IIR Filter
  FIR Filter
  Signal Source (Cosine)
  Five Back to Back FIR Filters
RELION
LeelaChessZero:
  BLAS
  Eigen
Timed GCC Compilation
libgav1
NWChem
OpenVKL
7-Zip Compression
High Performance Conjugate Gradient
IOR
Blender
WebP2 Image Encode
Timed HMMer Search
oneDNN:
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - f32 - CPU
Timed Erlang/OTP Compilation
oneDNN
Nebular Empirical Analysis Tool
FFTW
Rodinia
Incompact3D
JPEG XL
YafaRay
DeepSpeech
Numpy Benchmark
LAMMPS Molecular Dynamics Simulator
LuaRadio:
  Complex Phase
  Hilbert Transform
  FM Deemphasis Filter
  Five Back to Back FIR Filters
libgav1
WebP2 Image Encode
oneDNN:
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Training - f32 - CPU
Monte Carlo Simulations of Ionised Nebulae
FFTW
IOR
oneDNN
Zstd Compression:
  19, Long Mode - Decompression Speed
  19, Long Mode - Compression Speed
miniFE
Rodinia
FFTW
Parboil
libgav1
Timed LLVM Compilation
OpenFOAM
Zstd Compression:
  8 - Decompression Speed
  8 - Compression Speed
GraphicsMagick:
  Noise-Gaussian
  HWB Color Space
Parboil
Zstd Compression:
  3, Long Mode - Decompression Speed
  3, Long Mode - Compression Speed
Ngspice
GraphicsMagick
Tungsten Renderer
Rodinia
Zstd Compression:
  19 - Decompression Speed
  19 - Compression Speed
VP9 libvpx Encoding
asmFish
QMCPACK
TTSIOD 3D Renderer
WebP2 Image Encode
Ngspice
XZ Compression
Kvazaar
libavif avifenc
IOR
ASKAP
Rodinia
Apache CouchDB
JPEG XL
Timed MrBayes Analysis
IOR
OSPray
FinanceBench
Timed Linux Kernel Compilation
JPEG XL
Timed Eigen Compilation
Montage Astronomical Image Mosaic Engine
ASKAP:
  tConvolve MPI - Gridding
  tConvolve MPI - Degridding
Timed GDB GNU Debugger Compilation
Zstd Compression:
  8, Long Mode - Decompression Speed
  8, Long Mode - Compression Speed
LuaJIT
JPEG XL Decoding:
  1
  All
DaCapo Benchmark
Kvazaar
Tachyon
ASKAP:
  tConvolve OpenMP - Degridding
  tConvolve OpenMP - Gridding
Radiance Benchmark
DaCapo Benchmark
Timed Wasmer Compilation
LZ4 Compression:
  9 - Decompression Speed
  9 - Compression Speed
  3 - Decompression Speed
  3 - Compression Speed
Blender
Build2
LuxCoreRender
Timed Godot Game Engine Compilation
LuxCoreRender
FinanceBench
GraphicsMagick
libavif avifenc
JPEG XL
Himeno Benchmark
GraphicsMagick:
  Sharpen
  Enhanced
  Swirl
JPEG XL
Algebraic Multi-Grid Benchmark
srsLTE
Kvazaar
JPEG XL
Liquid-DSP
srsLTE:
  PHY_DL_Test:
    UE Mb/s
    eNb Mb/s
FFTW
dcraw
Blender
x265
lzbench:
  XZ 0 - Decompression
  XZ 0 - Compression
LibRaw
eSpeak-NG Speech Engine
WebP Image Encode
oneDNN
Gzip Compression
Timed PHP Compilation
rav1e
AOBench
libavif avifenc
QuantLib
Blender
Zstd Compression:
  3 - Decompression Speed
  3 - Compression Speed
GROMACS
libavif avifenc
Etcpak
AOM AV1
OSPray
C-Ray
AOM AV1
x265
Tungsten Renderer
LZ4 Compression:
  1 - Decompression Speed
  1 - Compression Speed
Embree:
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer - Asian Dragon Obj
Rodinia
Timed MAFFT Alignment
Stockfish
Google SynthMark
oneDNN
rav1e
Crafty
OSPray
OpenFOAM
toyBrot Fractal Generator
lzbench:
  Libdeflate 1 - Decompression
  Libdeflate 1 - Compression
Blender
Timed Apache Compilation
Monkey Audio Encoding
lzbench:
  Brotli 2 - Decompression
  Brotli 2 - Compression
Ogg Audio Encoding
lzbench:
  Brotli 0 - Decompression
  Brotli 0 - Compression
RNNoise
lzbench:
  Zstd 8 - Decompression
  Zstd 8 - Compression
libavif avifenc
lzbench:
  Zstd 1 - Decompression
  Zstd 1 - Compression
  Crush 0 - Decompression
  Crush 0 - Compression
Etcpak
oneDNN:
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
SVT-AV1
Etcpak
WebP Image Encode
NAMD
Dolfyn
Liquid-DSP:
  256 - 256 - 57
  8 - 256 - 57
  64 - 256 - 57
  32 - 256 - 57
  16 - 256 - 57
  4 - 256 - 57
  2 - 256 - 57
  1 - 256 - 57
Timed FFmpeg Compilation
AOM AV1
FFTE
NAS Parallel Benchmarks
WebP2 Image Encode
Kvazaar
FLAC Audio Encoding
SVT-VP9
OSPray
Timed ImageMagick Compilation
Embree
Opus Codec Encoding
x264
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
Embree
SVT-AV1
dav1d
oneDNN:
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
NAS Parallel Benchmarks
Tungsten Renderer
oneDNN
Embree
NAS Parallel Benchmarks
POV-Ray
OSPray
toyBrot Fractal Generator
Timed MPlayer Compilation
Embree
Kvazaar
oneDNN
WebP Image Encode
DaCapo Benchmark
LAME MP3 Encoding
N-Queens
dav1d
ACES DGEMM
WebP2 Image Encode
NAS Parallel Benchmarks
oneDNN
Parboil
FFTW
System GZIP Decompression
Intel Open Image Denoise
m-queens
System ZLIB Decompression
Pennant
FFTW
oneDNN
LAMMPS Molecular Dynamics Simulator
NAS Parallel Benchmarks
libjpeg-turbo tjbench
OSPray
SVT-VP9
Etcpak
OSPray
Tungsten Renderer
Pennant
Kvazaar
SVT-VP9
toyBrot Fractal Generator
System XZ Decompression
libavif avifenc
toyBrot Fractal Generator
rays1bench
NAS Parallel Benchmarks
WebP Image Encode
Parboil
TSCP
WebP Image Encode
OSPray
NAS Parallel Benchmarks
LuaJIT:
  Jacobi Successive Over-Relaxation
  Dense LU Matrix Factorization
  Sparse Matrix Multiply
  Fast Fourier Transform
  Monte Carlo