testeded

2 x AMD EPYC 7F32 8-Core testing with a Supermicro H11DSU-iN (2.1b BIOS) and llvmpipe 504GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2007298-NI-TESTEDED558

Result Identifier: 2 x AMD EPYC 7F32 8-Core
Date Run: July 28 2020
Test Duration: 9 Hours, 32 Minutes

System Details (2 x AMD EPYC 7F32 8-Core)

Processor: 2 x AMD EPYC 7F32 8-Core @ 3.70GHz (16 Cores / 32 Threads)
Motherboard: Supermicro H11DSU-iN (2.1b BIOS)
Chipset: AMD Starship/Matisse
Memory: 504GB
Disk: 2 x 3841GB Micron_9200_MTFDHAL3T8TCT
Graphics: llvmpipe 504GB
Network: 4 x Intel I350
OS: Ubuntu 20.04
Kernel: 5.4.0-42-generic (x86_64)
Desktop: GNOME Shell 3.36.2
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 3.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1024x768

System Logs

- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand
- CPU Microcode: 0x8301038
- Python 3.8.2
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Results Overview

Test | 2 x AMD EPYC 7F32 8-Core
blender: Pabellon Barcelona - OpenCL | 956.82
blender: Fishy Cat - OpenCL | 927.59
build-gcc: Time To Compile | 788.861
blender: Barbershop - OpenCL | 542.30
fftw: Float + SSE - 2D FFT Size 2048 | 29372
blender: Barbershop - CPU-Only | 389.22
blender: BMW27 - OpenCL | 352.48
fftw: Float + SSE - 2D FFT Size 4096 | 19579
build-llvm: Time To Compile | 327.092
onednn: IP Batch All - f32 - CPU | 35.7924
onednn: IP Batch All - u8s8f32 - CPU | 19.1361
blender: Pabellon Barcelona - CPU-Only | 284.15
compress-lzma: 256MB File Compression | 259.075
mlpack: scikit_linearridgeregression | 198.24
blender: Classroom - OpenCL | 253.79
numpy | 356.33
blender: Classroom - CPU-Only | 221.68
fftw: Stock - 2D FFT Size 4096 | 5998.4
svt-av1: Enc Mode 0 - 1080p | 0.115
onednn: Recurrent Neural Network Training - f32 - CPU | 336.681
blender: Fishy Cat - CPU-Only | 126.05
byte: Floating-Point Arithmetic | 1
byte: Integer Arithmetic | 1
byte: Register Arithmetic | 1
byte: Dhrystone 2 | 40414443.0
onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPU | 8.09874
onednn: Deconvolution Batch deconv_1d - f32 - CPU | 2.85156
npb: EP.D | 1408.93
vpxenc: Speed 0 | 6.80
blender: BMW27 - CPU-Only | 83.92
geekbench: CPU Multi Core - Horizon Detection | 471.3
geekbench: CPU Multi Core - Face Detection | 175.4
geekbench: CPU Multi Core - Gaussian Blur | 867.1
geekbench: CPU Multi Core | 19322
gromacs: Water Benchmark | 2.426
onednn: IP Batch 1D - f32 - CPU | 2.08308
stockfish: Total Time | 43016978
namd: ATPase Simulation - 327,506 Atoms | 1.06302
v-ray: CPU | 23833
vpxenc: Speed 5 | 20.88
redis: LPUSH | 1299310.66
redis: LPOP | 1361460.82
geekbench: CPU Single Core - Horizon Detection | 27.2
geekbench: CPU Single Core - Face Detection | 9.70
geekbench: CPU Single Core - Gaussian Blur | 74.6
geekbench: CPU Single Core | 1182
redis: SET | 1526482.90
redis: GET | 1790562.89
embree: Pathtracer ISPC - Asian Dragon Obj | 15.2446
embree: Pathtracer - Asian Dragon Obj | 16.0063
fftw: Stock - 2D FFT Size 2048 | 6689.5
build-linux-kernel: Time To Compile | 44.923
redis: SADD | 1680591.25
embree: Pathtracer ISPC - Crown | 15.6428
compress-7zip: Compress Speed Test | 90798
onednn: Recurrent Neural Network Inference - f32 - CPU | 91.9720
compress-gzip: Linux Source Tree Archiving To .tar.gz | 36.828
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 | 26.839
embree: Pathtracer - Crown | 17.2442
embree: Pathtracer ISPC - Asian Dragon | 17.4599
embree: Pathtracer - Asian Dragon | 17.4898
npb: BT.C | 83965.15
c-ray: Total Time - 4K, 16 Rays Per Pixel | 32.610
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 8.35057
npb: SP.B | 68265.88
mlpack: scikit_svm | 23.49
povray: Trace Time | 25.018
npb: IS.D | 1993.90
fftw: Float + SSE - 2D FFT Size 1024 | 39476
npb: LU.C | 83479.03
pybench: Total For Average Test Times | 1005

Remaining tests, in order:
fftw: Float + SSE - 1D FFT Size 256
onednn: IP Batch 1D - u8s8f32 - CPU
openssl: RSA 4096-bit Performance
y-cruncher: Calculating 500M Pi Digits
fftw: Stock - 2D FFT Size 1024
x265: H.265 1080p Video Encoding
svt-av1: Enc Mode 4 - 1080p
svt-av1: Enc Mode 8 - 1080p
sysbench: Memory
sysbench: CPU
onednn: Convolution Batch Shapes Auto - f32 - CPU
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
fftw: Stock - 1D FFT Size 64
fftw: Float + SSE - 2D FFT Size 512
onednn: Deconvolution Batch deconv_3d - f32 - CPU
npb: CG.C
npb: FT.C
fftw: Float + SSE - 1D FFT Size 4096
fftw: Float + SSE - 1D FFT Size 2048
npb: EP.C
stream: Copy
fftw: Float + SSE - 1D FFT Size 1024
fftw: Stock - 1D FFT Size 4096
fftw: Float + SSE - 2D FFT Size 256
fftw: Stock - 2D FFT Size 512
fftw: Float + SSE - 1D FFT Size 512
fftw: Stock - 1D FFT Size 2048
fftw: Stock - 2D FFT Size 128
lammps: Rhodopsin Protein
fftw: Stock - 1D FFT Size 128
x264: H.264 Video Encoding
fftw: Float + SSE - 2D FFT Size 128
svt-vp9: Visual Quality Optimized - Bosphorus 1080p
svt-vp9: VMAF Optimized - Bosphorus 1080p
system-decompress-xz
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p
fftw: Stock - 1D FFT Size 1024
onednn: Deconvolution Batch deconv_3d - u8s8f32 - CPU
fftw: Float + SSE - 1D FFT Size 64
fftw: Float + SSE - 1D FFT Size 32
fftw: Stock - 2D FFT Size 64
fftw: Stock - 1D FFT Size 512
fftw: Stock - 2D FFT Size 256
fftw: Float + SSE - 1D FFT Size 128
fftw: Float + SSE - 2D FFT Size 64
npb: MG.C
fftw: Stock - 2D FFT Size 32
fftw: Float + SSE - 2D FFT Size 32
fftw: Stock - 1D FFT Size 32
fftw: Stock - 1D FFT Size 256
stream: Triad
stream: Add
stream: Scale

Values for the remaining tests, in the same order and unseparated:
348891.654244584.713.2727165.655.204.91635.2802072953.454432646.87751.995730.7122361.550158594.2381704.0361020608.8349966.9350933533821405.11171077.6516787894.2388127771.2465538111.38001.311.6667740.4154.1440936183.91217.923.792226.088394.72.8846418925140078560.28215.17789.2241074180185350.579979.341107101298174.8185154.0183823.4170838.7

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.82 - Blend File: Pabellon Barcelona - Compute: OpenCL (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 956.82 (SE +/- 5.16, N = 3)

Blender 2.82 - Blend File: Fishy Cat - Compute: OpenCL (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 927.59 (SE +/- 5.11, N = 3)
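
Each result in this file is reported as an average together with "SE +/- x, N = y", where N is the number of runs. As a minimal sketch, assuming SE is the usual standard error of the mean (sample standard deviation divided by the square root of N; the source does not state the exact formula), the figure could be reproduced as follows. The function name `standard_error` and the run times are hypothetical and are not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Return (mean, standard error of the mean) for a list of measurements."""
    if len(samples) < 2:
        raise ValueError("need at least two samples to estimate the standard error")
    mean = statistics.mean(samples)
    # Sample standard deviation (n - 1 denominator) divided by sqrt(n).
    se = statistics.stdev(samples) / math.sqrt(len(samples))
    return mean, se


# Hypothetical run times in seconds (illustrative only).
runs = [950.0, 955.0, 966.0]
mean, se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the average suggests the runs were consistent; results with N = 12 or N = 15 in this file presumably needed extra runs to settle.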

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC). Learn more via the OpenBenchmarking.org test page.

Timed GCC Compilation 9.3.0 - Time To Compile (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 788.86 (SE +/- 0.39, N = 3)

Blender

Blender 2.82 - Blend File: Barbershop - Compute: OpenCL (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 542.30 (SE +/- 3.66, N = 3)

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 2048 (Mflops, More Is Better)
2 x AMD EPYC 7F32 8-Core: 29372 (SE +/- 342.00, N = 15)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
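
For readers unfamiliar with the transform being benchmarked, the following is a minimal sketch of the discrete Fourier transform computed directly from its definition. It is not how FFTW works internally (FFTW uses fast O(n log n) algorithms); the function name `dft` and the sample signal are illustrative assumptions.

```python
import cmath


def dft(x):
    """Naive O(n^2) discrete Fourier transform of a sequence of numbers."""
    n = len(x)
    # X[k] = sum over t of x[t] * exp(-2*pi*i*k*t/n)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


# Illustrative four-point signal.
signal = [1.0, 0.0, -1.0, 0.0]
spectrum = dft(signal)
for k, value in enumerate(spectrum):
    print(k, round(value.real, 6), round(value.imag, 6))
```

The quadratic cost of this direct form is the reason fast Fourier transform libraries exist, and why their throughput is worth benchmarking across transform sizes.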

Blender

Blender 2.82 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 389.22 (SE +/- 0.11, N = 3)

Blender 2.82 - Blend File: BMW27 - Compute: OpenCL (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 352.48 (SE +/- 1.80, N = 3)

FFTW

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)
2 x AMD EPYC 7F32 8-Core: 19579 (SE +/- 254.89, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 10.0 - Time To Compile (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 327.09 (SE +/- 4.69, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 - Harness: IP Batch All - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 35.79 (SE +/- 0.50, N = 15, MIN: 32.48)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5 - Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 19.14 (SE +/- 0.34, N = 15, MIN: 17.68)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Blender

Blender 2.82 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 284.15 (SE +/- 0.31, N = 3)

LZMA Compression

This test measures the time needed to compress a file using LZMA compression. Learn more via the OpenBenchmarking.org test page.

LZMA Compression - 256MB File Compression (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 259.08 (SE +/- 0.18, N = 3)
1. (CXX) g++ options: -O2

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark - Benchmark: scikit_linearridgeregression (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 198.24 (SE +/- 0.03, N = 3)

Blender

Blender 2.82 - Blend File: Classroom - Compute: OpenCL (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 253.79 (SE +/- 0.55, N = 3)

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
2 x AMD EPYC 7F32 8-Core: 356.33 (SE +/- 0.76, N = 3)

Blender

Blender 2.82 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 221.68 (SE +/- 0.34, N = 3)

FFTW

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
2 x AMD EPYC 7F32 8-Core: 5998.4 (SE +/- 8.31, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8 - Encoder Mode: Enc Mode 0 - Input: 1080p (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 0.115 (SE +/- 0.000, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

oneDNN

oneDNN 1.5 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 336.68 (SE +/- 9.78, N = 12, MIN: 277.98)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Blender

Blender 2.82 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 126.05 (SE +/- 0.06, N = 3)

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

BYTE Unix Benchmark 3.6 - Computational Test: Floating-Point Arithmetic (LPS, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1

BYTE Unix Benchmark 3.6 - Computational Test: Integer Arithmetic (LPS, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1

BYTE Unix Benchmark 3.6 - Computational Test: Register Arithmetic (LPS, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1

BYTE Unix Benchmark 3.6 - Computational Test: Dhrystone 2 (LPS, More Is Better)
2 x AMD EPYC 7F32 8-Core: 40414443.0 (SE +/- 233444.02, N = 3)

oneDNN

oneDNN 1.5 - Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 8.09874 (SE +/- 0.16350, N = 15, MIN: 6.83)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5 - Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 2.85156 (SE +/- 0.03655, N = 15, MIN: 2.41)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1408.93 (SE +/- 0.28, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding 1.8.2 - Speed: Speed 0 (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 6.80 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

Blender

Blender 2.82 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 83.92 (SE +/- 0.12, N = 3)

Geekbench

This is a benchmark of Geekbench 5 Pro. The test profile automates the execution of Geekbench 5 under the Phoronix Test Suite, assuming you have a valid license key for Geekbench 5 Pro. This test will not work without a valid license key for Geekbench Pro. Learn more via the OpenBenchmarking.org test page.

Geekbench 5.0 - Test: CPU Multi Core - Horizon Detection (Gpixels/sec, More Is Better)
2 x AMD EPYC 7F32 8-Core: 471.3 (SE +/- 9.09, N = 3)

Geekbench 5.0 - Test: CPU Multi Core - Face Detection (images/sec, More Is Better)
2 x AMD EPYC 7F32 8-Core: 175.4 (SE +/- 1.19, N = 3)

Geekbench 5.0 - Test: CPU Multi Core - Gaussian Blur (Mpixels/sec, More Is Better)
2 x AMD EPYC 7F32 8-Core: 867.1 (SE +/- 6.45, N = 3)

Geekbench 5.0 - Test: CPU Multi Core (Score, More Is Better)
2 x AMD EPYC 7F32 8-Core: 19322 (SE +/- 125.69, N = 3)

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.1 - Water Benchmark (Ns Per Day, More Is Better)
2 x AMD EPYC 7F32 8-Core: 2.426 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

oneDNN

oneDNN 1.5 - Harness: IP Batch 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 2.08308 (SE +/- 0.02455, N = 15, MIN: 1.86)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 9 - Total Time (Nodes Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 43016978 (SE +/- 75746.74, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.13 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 1.06302 (SE +/- 0.00273, N = 3)
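
NAMD reports its result in days/ns (fewer is better), while GROMACS in this file reports Ns Per Day (more is better). The two are reciprocals of each other, so the units can be converted as sketched below. The helper name `days_per_ns_to_ns_per_day` is a hypothetical illustration; note that the two tests simulate different systems, so the converted figure still should not be compared against the GROMACS result.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days-per-nanosecond figure into nanoseconds per day."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    # The two units are reciprocals of each other.
    return 1.0 / days_per_ns


# NAMD result reported above for the ATPase simulation.
namd_days_per_ns = 1.06302
print(f"{days_per_ns_to_ns_per_day(namd_days_per_ns):.4f} ns/day")
```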

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY 4.10.07 - Mode: CPU (Ksamples, More Is Better)
2 x AMD EPYC 7F32 8-Core: 23833 (SE +/- 49.03, N = 3)

VP9 libvpx Encoding

VP9 libvpx Encoding 1.8.2 - Speed: Speed 5 (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 20.88 (SE +/- 0.25, N = 6)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 5.0.5 - Test: LPUSH (Requests Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1299310.66 (SE +/- 16811.35, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 5.0.5 - Test: LPOP (Requests Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1361460.82 (SE +/- 15911.70, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Geekbench

Geekbench 5.0 - Test: CPU Single Core - Horizon Detection (Gpixels/sec, More Is Better)
2 x AMD EPYC 7F32 8-Core: 27.2 (SE +/- 0.07, N = 3)

Geekbench 5.0 - Test: CPU Single Core - Face Detection (images/sec, More Is Better)
2 x AMD EPYC 7F32 8-Core: 9.70 (SE +/- 0.01, N = 3)

Geekbench 5.0 - Test: CPU Single Core - Gaussian Blur (Mpixels/sec, More Is Better)
2 x AMD EPYC 7F32 8-Core: 74.6 (SE +/- 0.20, N = 3)

Geekbench 5.0 - Test: CPU Single Core (Score, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1182 (SE +/- 2.00, N = 3)

Redis

Redis 5.0.5 - Test: SET (Requests Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1526482.90 (SE +/- 26865.59, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 5.0.5 - Test: GET (Requests Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1790562.89 (SE +/- 43725.99, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 15.24 (SE +/- 0.11, N = 3, MIN: 14.84 / MAX: 15.57)

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 16.01 (SE +/- 0.08, N = 3, MIN: 15.78 / MAX: 16.29)

FFTW

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 2048 (Mflops, More Is Better)
2 x AMD EPYC 7F32 8-Core: 6689.5 (SE +/- 16.23, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.4 - Time To Compile (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 44.92 (SE +/- 0.60, N = 3)

Redis

Redis 5.0.5 - Test: SADD (Requests Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 1680591.25 (SE +/- 28488.23, N = 12)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Embree

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 15.64 (SE +/- 0.15, N = 3, MIN: 15.05 / MAX: 16.48)

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 16.02 - Compress Speed Test (MIPS, More Is Better)
2 x AMD EPYC 7F32 8-Core: 90798 (SE +/- 1347.80, N = 3)
1. (CXX) g++ options: -pipe -lpthread

oneDNN

oneDNN 1.5 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 91.97 (SE +/- 1.55, N = 3, MIN: 81.45)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Gzip Compression

This test measures the time needed to archive/compress two copies of the Linux 4.13 kernel source tree using Gzip compression. Learn more via the OpenBenchmarking.org test page.

Gzip Compression - Linux Source Tree Archiving To .tar.gz (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 36.83 (SE +/- 0.05, N = 3)

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

XZ Compression 5.2.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 26.84 (SE +/- 0.35, N = 4)
1. (CC) gcc options: -pthread -fvisibility=hidden -O2

Embree

Embree 3.9.0 - Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 17.24 (SE +/- 0.05, N = 3, MIN: 16.78 / MAX: 17.64)

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 17.46 (SE +/- 0.11, N = 3, MIN: 17.15 / MAX: 17.8)

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
2 x AMD EPYC 7F32 8-Core: 17.49 (SE +/- 0.14, N = 3, MIN: 17.14 / MAX: 17.85)

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better)
2 x AMD EPYC 7F32 8-Core: 83965.15 (SE +/- 355.13, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 32.61 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

oneDNN

oneDNN 1.5 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 8.35057 (SE +/- 0.22169, N = 15, MIN: 5.85)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better)
2 x AMD EPYC 7F32 8-Core: 68265.88 (SE +/- 841.03, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Mlpack Benchmark

Mlpack Benchmark - Benchmark: scikit_svm (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 23.49 (SE +/- 0.28, N = 3)

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7 - Trace Time (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 25.02 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: IS.D
2 x AMD EPYC 7F32 8-Core: 1993.90 (SE +/- 1.02, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 2D FFT Size 1024
2 x AMD EPYC 7F32 8-Core: 39476 (SE +/- 160.30, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
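
The FFTW results are reported in Mflops. As a rough guide to what that unit means, the sketch below assumes the conventional FFTW benchmark definition for complex transforms, in which a transform of N total points is counted as 5 N log2(N) floating-point operations regardless of how many the library actually performs. Whether this test profile uses exactly that convention is an assumption, and `fft_mflops` and `implied_seconds` are hypothetical helper names.

```python
import math


def fft_mflops(total_points: int, seconds_per_transform: float) -> float:
    """Nominal Mflops for one complex FFT, assuming 5*N*log2(N) operations."""
    nominal_flops = 5.0 * total_points * math.log2(total_points)
    return nominal_flops / (seconds_per_transform * 1e6)


def implied_seconds(total_points: int, mflops: float) -> float:
    """Invert fft_mflops: time per transform implied by a reported Mflops value."""
    nominal_flops = 5.0 * total_points * math.log2(total_points)
    return nominal_flops / (mflops * 1e6)


# A 2D FFT of size 1024 has 1024 * 1024 total points (assumed square here).
points_2d = 1024 * 1024
print(implied_seconds(points_2d, 39476))
```

Under these assumptions, a higher Mflops figure corresponds directly to a shorter time per transform for a given size, so results are only comparable between runs of the same size and dimensionality.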

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: LU.C
2 x AMD EPYC 7F32 8-Core: 83479.03 (SE +/- 167.18, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

PyBench

This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.

PyBench 2018-02-16 (Milliseconds, Fewer Is Better)
Total For Average Test Times
2 x AMD EPYC 7F32 8-Core: 1005 (SE +/- 1.20, N = 3)

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 256
2 x AMD EPYC 7F32 8-Core: 34889 (SE +/- 366.85, N = 15)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 (ms, Fewer Is Better)
Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU
2 x AMD EPYC 7F32 8-Core: 1.65424 (SE +/- 0.02470, N = 4, MIN: 1.56)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenSSL 1.1.1 (Signs Per Second, More Is Better)
RSA 4096-bit Performance
2 x AMD EPYC 7F32 8-Core: 4584.7 (SE +/- 0.78, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Y-Cruncher

Y-Cruncher is a multi-threaded Pi benchmark. Learn more via the OpenBenchmarking.org test page.

Y-Cruncher 0.7.8.9503 (Seconds, Fewer Is Better)
Calculating 500M Pi Digits
2 x AMD EPYC 7F32 8-Core: 13.27 (SE +/- 0.02, N = 4)

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 2D FFT Size 1024
2 x AMD EPYC 7F32 8-Core: 7165.6 (SE +/- 34.20, N = 5)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

x265 3.1.2 (Frames Per Second, More Is Better)
H.265 1080p Video Encoding
2 x AMD EPYC 7F32 8-Core: 55.20 (SE +/- 0.17, N = 5)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Encoder Mode: Enc Mode 4 - Input: 1080p
2 x AMD EPYC 7F32 8-Core: 4.916 (SE +/- 0.027, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Encoder Mode: Enc Mode 8 - Input: 1080p
2 x AMD EPYC 7F32 8-Core: 35.28 (SE +/- 0.17, N = 5)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Sysbench

This is a benchmark of Sysbench with CPU and memory sub-tests. Learn more via the OpenBenchmarking.org test page.

Sysbench 2018-07-28 (Events Per Second, More Is Better)
Test: Memory
2 x AMD EPYC 7F32 8-Core: 2072953.45 (SE +/- 26381.22, N = 5)
1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Sysbench 2018-07-28 (Events Per Second, More Is Better)
Test: CPU
2 x AMD EPYC 7F32 8-Core: 32646.88 (SE +/- 5.12, N = 5)
1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 (ms, Fewer Is Better)
Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
2 x AMD EPYC 7F32 8-Core: 1.99573 (SE +/- 0.02020, N = 8, MIN: 1.84)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5 (ms, Fewer Is Better)
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
2 x AMD EPYC 7F32 8-Core: 0.712236 (SE +/- 0.002845, N = 4, MIN: 0.65)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5 (ms, Fewer Is Better)
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU
2 x AMD EPYC 7F32 8-Core: 1.55015 (SE +/- 0.00557, N = 4, MIN: 1.47)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 64
2 x AMD EPYC 7F32 8-Core: 8594.2 (SE +/- 234.24, N = 15)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 2D FFT Size 512
2 x AMD EPYC 7F32 8-Core: 38170 (SE +/- 102.47, N = 6)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 (ms, Fewer Is Better)
Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU
2 x AMD EPYC 7F32 8-Core: 4.03610 (SE +/- 0.04386, N = 15, MIN: 3.63)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: CG.C
2 x AMD EPYC 7F32 8-Core: 20608.83 (SE +/- 93.77, N = 6)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: FT.C
2 x AMD EPYC 7F32 8-Core: 49966.93 (SE +/- 420.09, N = 5)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 4096
2 x AMD EPYC 7F32 8-Core: 50933 (SE +/- 295.42, N = 6)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 2048
2 x AMD EPYC 7F32 8-Core: 53382 (SE +/- 129.54, N = 7)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: EP.C
2 x AMD EPYC 7F32 8-Core: 1405.11 (SE +/- 3.02, N = 6)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Stream

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Stream 2013-01-17 (MB/s, More Is Better)
Type: Copy
2 x AMD EPYC 7F32 8-Core: 171077.6 (SE +/- 576.21, N = 7)
1. (CC) gcc options: -O3 -march=native -fopenmp

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 1024
2 x AMD EPYC 7F32 8-Core: 51678 (SE +/- 399.68, N = 8)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 4096
2 x AMD EPYC 7F32 8-Core: 7894.2 (SE +/- 7.53, N = 7)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 2D FFT Size 256
2 x AMD EPYC 7F32 8-Core: 38812 (SE +/- 239.24, N = 7)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 2D FFT Size 512
2 x AMD EPYC 7F32 8-Core: 7771.2 (SE +/- 4.57, N = 8)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 512
2 x AMD EPYC 7F32 8-Core: 46553 (SE +/- 232.47, N = 8)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 2048
2 x AMD EPYC 7F32 8-Core: 8111.3 (SE +/- 12.70, N = 8)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 2D FFT Size 128
2 x AMD EPYC 7F32 8-Core: 8001.3 (SE +/- 13.35, N = 8)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 9Jan2020 (ns/day, More Is Better)
Model: Rhodopsin Protein
2 x AMD EPYC 7F32 8-Core: 11.67 (SE +/- 0.11, N = 15)
1. (CXX) g++ options: -O3 -rdynamic -ljpeg -lpng -lz -lfftw3 -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 128
2 x AMD EPYC 7F32 8-Core: 7740.4 (SE +/- 37.14, N = 8)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

x264 2019-12-17 (Frames Per Second, More Is Better)
H.264 Video Encoding
2 x AMD EPYC 7F32 8-Core: 154.14 (SE +/- 0.54, N = 8)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 2D FFT Size 128
2 x AMD EPYC 7F32 8-Core: 40936 (SE +/- 147.59, N = 9)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1 (Frames Per Second, More Is Better)
Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
2 x AMD EPYC 7F32 8-Core: 183.91 (SE +/- 0.74, N = 8)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9 0.1 (Frames Per Second, More Is Better)
Tuning: VMAF Optimized - Input: Bosphorus 1080p
2 x AMD EPYC 7F32 8-Core: 217.92 (SE +/- 0.60, N = 9)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

System XZ Decompression

This test measures the time to decompress a Linux kernel tarball using XZ. Learn more via the OpenBenchmarking.org test page.

System XZ Decompression (Seconds, Fewer Is Better)
2 x AMD EPYC 7F32 8-Core: 3.792 (SE +/- 0.002, N = 8)

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1 (Frames Per Second, More Is Better)
Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
2 x AMD EPYC 7F32 8-Core: 226.08 (SE +/- 0.63, N = 9)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 1024
2 x AMD EPYC 7F32 8-Core: 8394.7 (SE +/- 7.18, N = 9)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 (ms, Fewer Is Better)
Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU
2 x AMD EPYC 7F32 8-Core: 2.88464 (SE +/- 0.00307, N = 9, MIN: 2.78)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 64
2 x AMD EPYC 7F32 8-Core: 18925 (SE +/- 28.05, N = 10)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 32
2 x AMD EPYC 7F32 8-Core: 14007 (SE +/- 11.75, N = 9)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 2D FFT Size 64
2 x AMD EPYC 7F32 8-Core: 8560.2 (SE +/- 46.72, N = 9)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 512
2 x AMD EPYC 7F32 8-Core: 8215.1 (SE +/- 10.82, N = 10)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 2D FFT Size 256
2 x AMD EPYC 7F32 8-Core: 7789.2 (SE +/- 6.41, N = 9)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 1D FFT Size 128
2 x AMD EPYC 7F32 8-Core: 24107 (SE +/- 30.81, N = 10)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 2D FFT Size 64
2 x AMD EPYC 7F32 8-Core: 41801 (SE +/- 95.25, N = 10)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: MG.C
2 x AMD EPYC 7F32 8-Core: 85350.57 (SE +/- 96.59, N = 10)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 2D FFT Size 32
2 x AMD EPYC 7F32 8-Core: 9979.3 (SE +/- 0.74, N = 11)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Float + SSE - Size: 2D FFT Size 32
2 x AMD EPYC 7F32 8-Core: 41107 (SE +/- 53.58, N = 11)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 32
2 x AMD EPYC 7F32 8-Core: 10129 (SE +/- 4.57, N = 11)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW 3.3.6 (Mflops, More Is Better)
Build: Stock - Size: 1D FFT Size 256
2 x AMD EPYC 7F32 8-Core: 8174.8 (SE +/- 12.64, N = 10)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

System Power Consumption Monitor

System Power Consumption Monitor (Watts)
Phoronix Test Suite System Monitoring
2 x AMD EPYC 7F32 8-Core: Min: 215.5 / Avg: 329.79 / Max: 542.8

CPU Temperature Monitor

CPU Temperature Monitor (Celsius)
Phoronix Test Suite System Monitoring
2 x AMD EPYC 7F32 8-Core: Min: 27.25 / Avg: 51.26 / Max: 78.75

CPU Peak Freq (Highest CPU Core Frequency) Monitor

CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz)
Phoronix Test Suite System Monitoring
2 x AMD EPYC 7F32 8-Core: Min: 1800 / Avg: 3602.5 / Max: 4184

Meta Performance Per Watts

Meta Performance Per Watts (Performance Per Watts, More Is Better)
Performance Per Watts
2 x AMD EPYC 7F32 8-Core: 3685.14

Stream

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Stream 2013-01-17 (MB/s, More Is Better)
Type: Triad
2 x AMD EPYC 7F32 8-Core: 185154.0 (SE +/- 1143.35, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp

Stream 2013-01-17 (MB/s, More Is Better)
Type: Add
2 x AMD EPYC 7F32 8-Core: 183823.4 (SE +/- 571.46, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp

Stream 2013-01-17 (MB/s, More Is Better)
Type: Scale
2 x AMD EPYC 7F32 8-Core: 170838.7 (SE +/- 30.23, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp
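
For readers unfamiliar with the four Stream types reported on this page (Copy, Scale, Add, Triad), the sketch below shows what each kernel computes and how a bandwidth figure follows from the bytes moved. It is a plain-Python illustration, not the benchmark itself: the real test is compiled C with OpenMP, and the function names, the `scalar` value, and the 8-byte element size are assumptions made for this example.

```python
BYTES_PER_ELEMENT = 8  # assumes double-precision array elements

# Number of arrays each kernel reads or writes per element.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}


def run_stream_kernels(a, b, c, scalar=3.0):
    """Apply the four kernels in sequence, updating the lists in place."""
    n = len(a)
    for j in range(n):          # Copy:  c = a
        c[j] = a[j]
    for j in range(n):          # Scale: b = scalar * c
        b[j] = scalar * c[j]
    for j in range(n):          # Add:   c = a + b
        c[j] = a[j] + b[j]
    for j in range(n):          # Triad: a = b + scalar * c
        a[j] = b[j] + scalar * c[j]


def stream_mb_per_s(kernel: str, n: int, seconds: float) -> float:
    """Bandwidth in MB/s (10^6 bytes) for one pass of a kernel over n elements."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEMENT * n
    return bytes_moved / seconds / 1e6


a, b, c = [1.0, 2.0], [0.0, 0.0], [0.0, 0.0]
run_stream_kernels(a, b, c)
print(a, b, c)
print(stream_mb_per_s("Triad", 1_000_000, 1.0))
```

Because Add and Triad touch three arrays per element while Copy and Scale touch two, the byte count per pass differs between kernels, which is one reason the four reported MB/s figures are not identical.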

129 Results Shown
