Talos II Raptor POWER9 System

AMD EPYC vs. Intel Xeon vs. POWER9 CPUs. Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1805190-FO-1804049AR57&sro.

Talos II Raptor POWER9 SystemProcessorMotherboardMemoryDiskGraphicsAudioNetworkChipsetMonitorOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionDesktopOpenGLRaptor Talos II2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551amd7750AMD7750_cmpPOWER9 altivec supported @ 3.80GHz (64 Cores)PowerNV T2P9D01 REV 1.00262144MB500GB MAXTOR STM350063AMD Radeon Pro WX 7100 8192MBAMD EllesmereBroadcom Limited NetXtreme BCM5719 Gigabit PCIeDebian testing4.16.0-rc4 (ppc64le) 20180307amdgpu 1.4.0GCC 7.3.0ext41024x7682 x Intel Xeon Gold 6138 @ 3.70GHz (40 Cores / 80 Threads)TYAN S7106 (V1.00 BIOS)Intel Sky Lake-E DMI3 Registers12 x 8192 MB DDR4-2666MT/s Micron 9ASF1G72PZ-2G6B1256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150ASPEED ASPEED FamilyVE228Intel I210 Gigabit Connection4.16.0-041600-generic (x86_64)GNOME Shell 3.28.01920x1080AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HR (V0.05.B10 BIOS)AMD Family 17h8 x 16384 MB DDR4-2666MT/s Samsung M393A2K40BB2-CTD280GB INTEL SSDPE21D280GABroadcom Limited NetXtreme BCM5720 Gigabit PCIeAMD EPYC 7551 32-Core @ 2.00GHz (32 Cores / 64 Threads)GIGABYTE MZ31-AR0-00 v01010101 (F03 BIOS)28672MB2 x Samsung SSD 960 EVO 500GBllvmpipe 28032MBNVIDIA GM204 HD AudioASUS PB278Realtek RTL8111/8168/8411AMD Athlon 7750 @ 2.70GHz (2 Cores)ASUS M3A78-EM (2701 BIOS)AMD RS780 + SB7x0/SB8x0/SB9x08192MB256GB SAMSUNG SSD 830ASUS AMD Radeon HD 3200 256MBRealtek ALC1200L19-54.16.0-1-amd64 (x86_64)GNOME Shell 3.28.1modesetting 1.19.63.3 Mesa 18.0.3 (LLVM 6.0.0)1280x1024OpenBenchmarking.orgCompiler Details- Raptor Talos II: --build=powerpc64le-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-multilib --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-secureplt --enable-shared --enable-targets=powerpcle-linux --enable-threads=posix --host=powerpc64le-linux-gnu --program-prefix=powerpc64le-linux-gnu- --target=powerpc64le-linux-gnu --with-cpu=power8 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-long-double-128 -v - 2 x Intel Xeon Gold 6138: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - AMD EPYC 7601: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - AMD EPYC 7551: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - amd7750: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - AMD7750_cmp: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- Raptor Talos II: Scaling Governor: powernv-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate performance- AMD EPYC 7601: Scaling Governor: acpi-cpufreq performance- AMD EPYC 7551: Scaling Governor: acpi-cpufreq performance- amd7750: Scaling Governor: acpi-cpufreq ondemand- AMD7750_cmp: Scaling Governor: acpi-cpufreq ondemandPython Details- Raptor Talos II: Python 2.7.14+ + Python 3.6.4+- 2 x Intel Xeon Gold 6138: Python 2.7.14+ + Python 3.6.5rc1- AMD EPYC 7601: Python 2.7.14+ + Python 3.6.5rc1- AMD EPYC 7551: Python 2.7.14+ + Python 3.6.5rc1- amd7750: Python 2.7.15 + Python 3.6.5rc1- AMD7750_cmp: Python 2.7.15 + Python 3.6.5Security Details- 2 x Intel Xeon Gold 6138: KPTI + __user pointer sanitization + Full generic retpoline Protection- AMD EPYC 7601: __user pointer sanitization + Full AMD retpoline Protection- AMD EPYC 7551: __user pointer sanitization + Full AMD retpoline Protection- amd7750: __user pointer sanitization + Full AMD retpoline Protection- AMD7750_cmp: __user pointer sanitization + Full AMD retpoline Protection

Talos II Raptor POWER9 Systemparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterx264: H.264 Video Encodingcompress-7zip: Compress Speed Testbuild-gcc: Time To Compilebuild-linux-kernel: Time To Compilec-ray: Total Timestockfish: Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3openssl: RSA 4096-bit Performanceredis: GETredis: SETpybench: Total For Average Test Timesphpbench: PHP Benchmark Suiteosbench: Create Threadsosbench: Create Processesosbench: Memory AllocationsRaptor Talos II2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551amd7750AMD7750_cmp66.089.6910.51817.4438.5819.0927.8043.72824471070.7072.614.65491551.7975.272104.871049994.41606874.10485916320827.1729.7783.0330.182.286.01397.4327.1010.0014.51127.28143505591.3227.803.15334310.2733.207965.402515784.921744256.79139560246723.0742.9596.0537.392.6114.26256.1532.5910.7117.46128.1799574707.3435.663.46447411.7943.574598.471703353.371195935.33208639365930.7159.6195.1471.882.7617.35355.6334.7912.2728.79101.5279708926.0839.303.59503212.7145.694387.471639124.131161953.33221636576738.2557.9596.32995.6482.59146.1454.401209.54359.76119.076.8845225184.37740.22120.48603427.6482.10155.971211124.58819555.58328224459127.8993.55207.02OpenBenchmarking.org

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBM2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II2004006008001000SE +/- 0.75, N = 6SE +/- 1.94, N = 6SE +/- 0.38, N = 3SE +/- 2.54, N = 3SE +/- 0.20, N = 330.1871.8837.39995.6466.081. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II20406080100SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.50, N = 3SE +/- 0.14, N = 32.282.762.6182.599.691. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II306090120150SE +/- 0.06, N = 3SE +/- 0.49, N = 6SE +/- 0.88, N = 6SE +/- 0.30, N = 3SE +/- 0.04, N = 36.0117.3514.26146.1410.511. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI Gridding2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II2004006008001000SE +/- 8.35, N = 6SE +/- 1.42, N = 3SE +/- 2.31, N = 3SE +/- 0.12, N = 3SE +/- 0.64, N = 3397.43355.63256.1554.40817.441. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMD2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II30060090012001500SE +/- 0.22, N = 3SE +/- 0.29, N = 3SE +/- 0.14, N = 3SE +/- 4.22, N = 3SE +/- 0.05, N = 327.1034.7932.591209.5438.581. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD Solver2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II80160240320400SE +/- 0.20, N = 6SE +/- 0.31, N = 6SE +/- 0.14, N = 3SE +/- 0.70, N = 3SE +/- 0.16, N = 310.0012.2710.71359.7619.091. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP Streamcluster2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II306090120150SE +/- 0.50, N = 6SE +/- 3.79, N = 6SE +/- 0.53, N = 6SE +/- 0.02, N = 3SE +/- 0.08, N = 314.5128.7917.46119.0727.801. (CXX) g++ options: -O2 -lOpenCL

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-02-05H.264 Video Encoding2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II306090120150SE +/- 5.54, N = 6SE +/- 1.55, N = 3SE +/- 0.67, N = 3SE +/- 0.12, N = 4SE +/- 0.14, N = 3127.28101.52128.176.8843.72-m64-m64-m64-maltivec -mabi=altivec -mvsx1. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II30K60K90K120K150KSE +/- 2372.97, N = 3SE +/- 2630.45, N = 6SE +/- 669.88, N = 3SE +/- 11.17, N = 3SE +/- 390.52, N = 314350579708995744522824471. (CXX) g++ options: -pipe -lpthread

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 7.2Time To Compile2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II11002200330044005500SE +/- 0.65, N = 3SE +/- 5.33, N = 3SE +/- 5.02, N = 3SE +/- 12.12, N = 3SE +/- 2.17, N = 3591.32926.08707.345184.371070.70

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.13Time To Compile2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II160320480640800SE +/- 0.40, N = 5SE +/- 0.75, N = 6SE +/- 0.47, N = 6SE +/- 12.78, N = 3SE +/- 1.20, N = 427.8039.3035.66740.2272.61

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II306090120150SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 33.153.593.46120.484.651. (CC) gcc options: -lm -lpthread -O3

Stockfish

Total Time

OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total Time2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II13002600390052006500SE +/- 167.95, N = 6SE +/- 10.04, N = 3SE +/- 5.04, N = 3SE +/- 1.33, N = 333435032447460344915-msse -msse3 -mpopcnt-msse -msse3 -mpopcnt-msse -msse3 -mpopcnt-msse -msse3 -mpopcnt1. (CXX) g++ options: -lpthread -fno-exceptions -fno-rtti -ansi -pedantic -O3 -flto

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II1224364860SE +/- 0.02, N = 5SE +/- 0.09, N = 5SE +/- 0.01, N = 5SE +/- 0.03, N = 5SE +/- 0.25, N = 510.2712.7111.7927.6451.79-logg1. (CXX) g++ options: -O2 -fvisibility=hidden -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP32 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II20406080100SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.52, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 333.2045.6943.5782.1075.27-lncurses1. (CC) gcc options: -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit Performance2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II2K4K6K8K10KSE +/- 50.05, N = 3SE +/- 0.85, N = 3SE +/- 22.34, N = 3SE +/- 0.19, N = 3SE +/- 1.27, N = 37965.404387.474598.47155.972104.871. (CC) gcc options: -O3 -pthread -m64 -lssl -lcrypto -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II500K1000K1500K2000K2500KSE +/- 108856.52, N = 6SE +/- 41782.46, N = 6SE +/- 25113.28, N = 3SE +/- 16775.79, N = 3SE +/- 12692.76, N = 32515784.921639124.131703353.371211124.581049994.411. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II400K800K1200K1600K2000KSE +/- 62786.06, N = 6SE +/- 6077.00, N = 3SE +/- 23759.98, N = 6SE +/- 11660.13, N = 3SE +/- 11534.54, N = 31744256.791161953.331195935.33819555.58606874.101. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Times2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II10002000300040005000SE +/- 2.60, N = 3SE +/- 3.61, N = 3SE +/- 1.45, N = 3SE +/- 9.54, N = 3SE +/- 4.84, N = 313952216208632824859

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II130K260K390K520K650KSE +/- 1392.88, N = 3SE +/- 2983.18, N = 3SE +/- 604.68, N = 3SE +/- 2269.14, N = 3SE +/- 55.73, N = 3602467365767393659244591163208

OSBench

Test: Create Threads

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Threads2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II918273645SE +/- 0.28, N = 3SE +/- 0.61, N = 6SE +/- 0.51, N = 3SE +/- 0.08, N = 3SE +/- 0.45, N = 423.0738.2530.7127.8927.17-lm-lm-lm-lm1. (CC) gcc options:

OSBench

Test: Create Processes

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Processes2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II20406080100SE +/- 0.89, N = 6SE +/- 0.23, N = 3SE +/- 0.87, N = 4SE +/- 1.31, N = 3SE +/- 0.19, N = 342.9557.9559.6193.5529.77-lm-lm-lm-lm1. (CC) gcc options:

OSBench

Test: Memory Allocations

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory Allocations2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 7601AMD7750_cmpRaptor Talos II50100150200250SE +/- 0.42, N = 3SE +/- 1.87, N = 6SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.67, N = 396.0596.3295.14207.0283.03-lm-lm-lm-lm1. (CC) gcc options:


Phoronix Test Suite v10.8.4