Cascadelake Xeon Platinum

2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and ASPEED Family on Ubuntu 18.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1904197-HV-CASCADELA21
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
ASPEED Family - 2 x Intel Xeon Platinum 8280
April 18 2019
  6 Hours, 45 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Cascadelake Xeon PlatinumOpenBenchmarking.orgPhoronix Test Suite2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)Intel Sky Lake-E DMI3 Registers386048MBSamsung SSD 970 PRO 512GBASPEED FamilyVE2282 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbEUbuntu 18.045.1.0-999-generic (x86_64) 20190416GNOME Shell 3.28.3X Server 1.20.1modesetting 1.20.1GCC 7.3.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionCascadelake Xeon Platinum BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave- Python 2.7.15rc1 + Python 3.6.7- __user pointer sanitization + Enhanced IBRS IBPB: conditional RSB filling + SSB disabled via prctl and seccomp

Cascadelake Xeon Platinumnpb: BT.Anpb: EP.Cnpb: FT.Anpb: FT.Bnpb: LU.Anpb: LU.Cnpb: SP.Aparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingcloverleaf: Lagrangian-Eulerian Hydrodynamicsrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterhpcg: namd: ATPase Simulation - 327,506 Atomsmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: IP Batch 1D - u8s8u8s32mkl-dnn: IP Batch 1D - u8s8f32s32mkl-dnn: IP Batch All - u8s8u8s32mkl-dnn: IP Batch All - u8s8f32s32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_3d - u8s8u8s32mkl-dnn: Convolution Batch conv_3d - u8s8f32s32mkl-dnn: Convolution Batch conv_all - u8s8u8s32mkl-dnn: Convolution Batch conv_all - u8s8f32s32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32mkl-dnn: Deconvolution Batch deconv_1d - u8s8u8s32mkl-dnn: Deconvolution Batch deconv_3d - u8s8u8s32mkl-dnn: Convolution Batch conv_alexnet - u8s8u8s32mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32s32mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32s32mkl-dnn: Deconvolution Batch deconv_all - u8s8u8s32mkl-dnn: Convolution Batch conv_alexnet - u8s8f32s32mkl-dnn: Deconvolution Batch deconv_all - u8s8f32s32mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8u8s32mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32s32x264: H.264 Video Encodingx265: H.265 1080p Video Encodingstockfish: Total Timeswet: Averagebuild-gcc: Time To Compilebuild-linux-kernel: Time To Compilebuild-llvm: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelllvm-test-suite: Time To Runsmallpt: Global Illumination Renderer; 128 Samplesy-cruncher: Calculating 500M Pi Digitssysbench: Memorysysbench: CPUxsbench: ASPEED Family - 2 x Intel Xeon Platinum 82803268.852697.582485.592687.17126730.07107075.953132.3321.391.432.73498.760.7315.417.6412.030.930.358739.0568.587.999.2171.8769.694.493901.372.5853.278264.584.54390390.7022.861.352.7555.571.362.63805.1655.01801.7522.7222.85114.5832.47126270130739899268877.6922.15116.5516.310.303.4210.5311390960.911018079759381OpenBenchmarking.org

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.AASPEED Family - 2 x Intel Xeon Platinum 82807001400210028003500SE +/- 7.63, N = 33268.851. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.CASPEED Family - 2 x Intel Xeon Platinum 82806001200180024003000SE +/- 4.82, N = 32697.581. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.AASPEED Family - 2 x Intel Xeon Platinum 82805001000150020002500SE +/- 3.83, N = 32485.591. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.BASPEED Family - 2 x Intel Xeon Platinum 82806001200180024003000SE +/- 8.07, N = 32687.171. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.AASPEED Family - 2 x Intel Xeon Platinum 828030K60K90K120K150KSE +/- 1874.61, N = 4126730.071. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.CASPEED Family - 2 x Intel Xeon Platinum 828020K40K60K80K100KSE +/- 171.60, N = 3107075.951. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.AASPEED Family - 2 x Intel Xeon Platinum 82807001400210028003500SE +/- 11.80, N = 33132.331. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMASPEED Family - 2 x Intel Xeon Platinum 8280510152025SE +/- 0.53, N = 1521.391. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPASPEED Family - 2 x Intel Xeon Platinum 82800.32180.64360.96541.28721.609SE +/- 0.02, N = 31.431. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilASPEED Family - 2 x Intel Xeon Platinum 82800.61431.22861.84292.45723.0715SE +/- 0.03, N = 152.731. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingASPEED Family - 2 x Intel Xeon Platinum 8280110220330440550SE +/- 8.53, N = 3498.761. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsASPEED Family - 2 x Intel Xeon Platinum 82800.16430.32860.49290.65720.8215SE +/- 0.01, N = 150.731. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDASPEED Family - 2 x Intel Xeon Platinum 828048121620SE +/- 0.06, N = 315.411. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverASPEED Family - 2 x Intel Xeon Platinum 8280246810SE +/- 0.06, N = 137.641. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterASPEED Family - 2 x Intel Xeon Platinum 82803691215SE +/- 0.07, N = 312.031. (CXX) g++ options: -O2 -lOpenCL

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0ASPEED Family - 2 x Intel Xeon Platinum 82800.20930.41860.62790.83721.0465SE +/- 0.01, N = 150.93

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 AtomsASPEED Family - 2 x Intel Xeon Platinum 82800.08070.16140.24210.32280.4035SE +/- 0.00051, N = 30.35873

MKL-DNN

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82803691215SE +/- 0.40, N = 159.05MIN: 6.211. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82801530456075SE +/- 2.05, N = 1568.58MIN: 46.761. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 8280246810SE +/- 0.27, N = 127.99MIN: 6.051. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82803691215SE +/- 0.58, N = 159.21MIN: 5.811. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 82801632486480SE +/- 1.29, N = 1271.87MIN: 48.791. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82801632486480SE +/- 1.24, N = 1569.69MIN: 50.521. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82801.01032.02063.03094.04125.0515SE +/- 0.03, N = 34.49MIN: 3.951. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 828080160240320400SE +/- 1.36, N = 3390MIN: 381.051. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82800.30830.61660.92491.23321.5415SE +/- 0.03, N = 151.37MIN: 1.031. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82800.58051.1611.74152.3222.9025SE +/- 0.10, N = 122.58MIN: 1.081. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82801224364860SE +/- 0.63, N = 653.27MIN: 48.791. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 82802004006008001000SE +/- 26.64, N = 9826MIN: 779.051. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 82801.03052.0613.09154.1225.1525SE +/- 0.10, N = 154.58MIN: 3.51. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82801.02152.0433.06454.0865.1075SE +/- 0.09, N = 154.54MIN: 3.71. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 828080160240320400SE +/- 0.30, N = 3390MIN: 382.931. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 828080160240320400SE +/- 0.45, N = 3390.70MIN: 381.251. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32ASPEED Family - 2 x Intel Xeon Platinum 8280510152025SE +/- 0.15, N = 322.86MIN: 21.321. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 82800.30380.60760.91141.21521.519SE +/- 0.03, N = 151.35MIN: 1.021. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 82800.61881.23761.85642.47523.094SE +/- 0.10, N = 122.75MIN: 1.141. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 82801224364860SE +/- 0.77, N = 355.57MIN: 48.341. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82800.3060.6120.9181.2241.53SE +/- 0.03, N = 151.36MIN: 1.011. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82800.59181.18361.77542.36722.959SE +/- 0.10, N = 152.63MIN: 1.151. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 82802004006008001000SE +/- 4.61, N = 3805.16MIN: 783.271. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82801224364860SE +/- 1.09, N = 1555.01MIN: 48.611. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 82802004006008001000SE +/- 11.84, N = 3801.75MIN: 777.541. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8u8s32ASPEED Family - 2 x Intel Xeon Platinum 8280510152025SE +/- 0.34, N = 322.72MIN: 21.291. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32s32ASPEED Family - 2 x Intel Xeon Platinum 8280510152025SE +/- 0.20, N = 1122.85MIN: 21.241. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingASPEED Family - 2 x Intel Xeon Platinum 8280306090120150SE +/- 0.97, N = 3114.581. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingASPEED Family - 2 x Intel Xeon Platinum 8280816243240SE +/- 0.05, N = 332.471. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeASPEED Family - 2 x Intel Xeon Platinum 828030M60M90M120M150MSE +/- 687138.57, N = 31262701301. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Swet

Swet is a synthetic CPU/RAM benchmark, includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageASPEED Family - 2 x Intel Xeon Platinum 8280160M320M480M640M800MSE +/- 9571654.10, N = 47398992681. (CC) gcc options: -lm -lpthread -lcurses -lrt

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 8.2Time To CompileASPEED Family - 2 x Intel Xeon Platinum 82802004006008001000877.69

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To CompileASPEED Family - 2 x Intel Xeon Platinum 8280510152025SE +/- 0.33, N = 1422.15

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileASPEED Family - 2 x Intel Xeon Platinum 8280306090120150116.55

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelASPEED Family - 2 x Intel Xeon Platinum 828048121620SE +/- 0.08, N = 316.311. (CC) gcc options: -lm -lpthread -O3

LLVM Test Suite

This test times how long it takes to run the LLVM Test Suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLLVM Test Suite 6.0.0Time To RunASPEED Family - 2 x Intel Xeon Platinum 82800.06750.1350.20250.270.3375SE +/- 0.00, N = 30.30

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesASPEED Family - 2 x Intel Xeon Platinum 82800.76951.5392.30853.0783.8475SE +/- 0.11, N = 123.421. (CXX) g++ options: -fopenmp -O3

Y-Cruncher

Y-Cruncher is a multi-threaded Pi benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.7.5.9481Calculating 500M Pi DigitsASPEED Family - 2 x Intel Xeon Platinum 82803691215SE +/- 0.03, N = 310.53

Sysbench

This is a benchmark of Sysbench with CPU and memory sub-tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: MemoryASPEED Family - 2 x Intel Xeon Platinum 82802M4M6M8M10MSE +/- 119460.10, N = 1511390960.911. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=core2 -rdynamic -ldl -laio -lm

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPUASPEED Family - 2 x Intel Xeon Platinum 828020K40K60K80K100KSE +/- 750.97, N = 31018071. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=core2 -rdynamic -ldl -laio -lm

Xsbench

XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06ASPEED Family - 2 x Intel Xeon Platinum 82802M4M6M8M10MSE +/- 89386.61, N = 1597593811. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

58 Results Shown

NAS Parallel Benchmarks:
  BT.A
  EP.C
  FT.A
  FT.B
  LU.A
  LU.C
  SP.A
Parboil:
  OpenMP LBM
  OpenMP CUTCP
  OpenMP Stencil
  OpenMP MRI Gridding
CloverLeaf
Rodinia:
  OpenMP LavaMD
  OpenMP CFD Solver
  OpenMP Streamcluster
High Performance Conjugate Gradient
NAMD
MKL-DNN:
  IP Batch 1D - f32
  IP Batch All - f32
  IP Batch 1D - u8s8u8s32
  IP Batch 1D - u8s8f32s32
  IP Batch All - u8s8u8s32
  IP Batch All - u8s8f32s32
  Convolution Batch conv_3d - f32
  Convolution Batch conv_all - f32
  Deconvolution Batch deconv_1d - f32
  Deconvolution Batch deconv_3d - f32
  Convolution Batch conv_alexnet - f32
  Deconvolution Batch deconv_all - f32
  Convolution Batch conv_3d - u8s8u8s32
  Convolution Batch conv_3d - u8s8f32s32
  Convolution Batch conv_all - u8s8u8s32
  Convolution Batch conv_all - u8s8f32s32
  Convolution Batch conv_googlenet_v3 - f32
  Deconvolution Batch deconv_1d - u8s8u8s32
  Deconvolution Batch deconv_3d - u8s8u8s32
  Convolution Batch conv_alexnet - u8s8u8s32
  Deconvolution Batch deconv_1d - u8s8f32s32
  Deconvolution Batch deconv_3d - u8s8f32s32
  Deconvolution Batch deconv_all - u8s8u8s32
  Convolution Batch conv_alexnet - u8s8f32s32
  Deconvolution Batch deconv_all - u8s8f32s32
  Convolution Batch conv_googlenet_v3 - u8s8u8s32
  Convolution Batch conv_googlenet_v3 - u8s8f32s32
x264
x265
Stockfish
Swet
Timed GCC Compilation
Timed Linux Kernel Compilation
Timed LLVM Compilation
C-Ray
LLVM Test Suite
Smallpt
Y-Cruncher
Sysbench:
  Memory
  CPU
Xsbench