Talos II Raptor POWER9 System

AMD EPYC vs. Intel Xeon vs. POWER9 CPUs. Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1903271-KH-1804049AR64&sor.

Talos II Raptor POWER9 SystemProcessorMotherboardMemoryDiskGraphicsAudioNetworkChipsetMonitorOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionDesktopRaptor Talos II2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Intel Xeon E5-2620 v4POWER9 altivec supported @ 3.80GHz (64 Cores)PowerNV T2P9D01 REV 1.00262144MB500GB MAXTOR STM350063AMD Radeon Pro WX 7100 8192MBAMD EllesmereBroadcom Limited NetXtreme BCM5719 Gigabit PCIeDebian testing4.16.0-rc4 (ppc64le) 20180307amdgpu 1.4.0GCC 7.3.0ext41024x7682 x Intel Xeon Gold 6138 @ 3.70GHz (40 Cores / 80 Threads)TYAN S7106 (V1.00 BIOS)Intel Sky Lake-E DMI3 Registers12 x 8192 MB DDR4-2666MT/s Micron 9ASF1G72PZ-2G6B1256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150ASPEED ASPEED FamilyVE228Intel I210 Gigabit Connection4.16.0-041600-generic (x86_64)GNOME Shell 3.28.01920x1080AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HR (V0.05.B10 BIOS)AMD Family 17h8 x 16384 MB DDR4-2666MT/s Samsung M393A2K40BB2-CTD280GB INTEL SSDPE21D280GABroadcom Limited NetXtreme BCM5720 Gigabit PCIeAMD EPYC 7551 32-Core @ 2.00GHz (32 Cores / 64 Threads)GIGABYTE MZ31-AR0-00 v01010101 (F03 BIOS)28672MB2 x Samsung SSD 960 EVO 500GBllvmpipe 28032MBNVIDIA GM204 HD AudioASUS PB278Realtek RTL8111/8168/84112 x Intel Xeon E5-2620 v4 @ 3.00GHz (16 Cores)Supermicro X10DAI v1.02Intel Xeon E7 v4/Xeon8 x 8192 MB DDR4-2400MT/s Samsung240GB BIWIN SSDNVIDIA GP102 [TITAN Xp]Realtek ALC888-VDIntel I210 Gigabit ConnectionUbuntu 18.044.15.0-45-generic (x86_64)modesetting 1.19.6CUDA 9.1OpenBenchmarking.orgCompiler Details- Raptor Talos II: --build=powerpc64le-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-multilib --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-secureplt --enable-shared --enable-targets=powerpcle-linux --enable-threads=posix --host=powerpc64le-linux-gnu --program-prefix=powerpc64le-linux-gnu- --target=powerpc64le-linux-gnu --with-cpu=power8 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-long-double-128 -v - 2 x Intel Xeon Gold 6138: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - AMD EPYC 7601: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - AMD EPYC 7551: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Intel Xeon E5-2620 v4: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- Raptor Talos II: Scaling Governor: powernv-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate performance- AMD EPYC 7601: Scaling Governor: acpi-cpufreq performance- AMD EPYC 7551: Scaling Governor: acpi-cpufreq performance- Intel Xeon E5-2620 v4: Scaling Governor: intel_pstate powersavePython Details- Raptor Talos II: Python 2.7.14+ + Python 3.6.4+- 2 x Intel Xeon Gold 6138: Python 2.7.14+ + Python 3.6.5rc1- AMD EPYC 7601: Python 2.7.14+ + Python 3.6.5rc1- AMD EPYC 7551: Python 2.7.14+ + Python 3.6.5rc1Security Details- 2 x Intel Xeon Gold 6138: KPTI + __user pointer sanitization + Full generic retpoline Protection- AMD EPYC 7601: __user pointer sanitization + Full AMD retpoline Protection- AMD EPYC 7551: __user pointer sanitization + Full AMD retpoline ProtectionSystem Details- Intel Xeon E5-2620 v4: Python 2.7.15rc1.

Talos II Raptor POWER9 Systemparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterx264: H.264 Video Encodingcompress-7zip: Compress Speed Testbuild-gcc: Time To Compilebuild-linux-kernel: Time To Compilec-ray: Total Timestockfish: Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3openssl: RSA 4096-bit Performanceredis: GETredis: SETpybench: Total For Average Test Timesphpbench: PHP Benchmark Suiteosbench: Create Threadsosbench: Create Processesosbench: Memory AllocationsRaptor Talos II2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Intel Xeon E5-2620 v466.089.6910.51817.4438.5819.0927.8043.72824471070.7072.614.65491551.7975.272104.871049994.41606874.10485916320827.1729.7783.0330.182.286.01397.4327.1010.0014.51127.28143505591.3227.803.15334310.2733.207965.402515784.921744256.79139560246723.0742.9596.0537.392.6114.26256.1532.5910.7117.46128.1799574707.3435.663.46447411.7943.574598.471703353.371195935.33208639365930.7159.6195.1471.882.7617.35355.6334.7912.2728.79101.5279708926.0839.303.59503212.7145.694387.471639124.131161953.33221636576738.2557.9596.3270.046.6915.71179.96108.5321.9421.0060.21382831078.5076.1810.43441113.6540.812608.631664374.631185182.451539448353OpenBenchmarking.org

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBM2 x Intel Xeon Gold 6138AMD EPYC 7601Raptor Talos IIIntel Xeon E5-2620 v4AMD EPYC 75511632486480SE +/- 0.75, N = 6SE +/- 0.38, N = 3SE +/- 0.20, N = 3SE +/- 0.19, N = 3SE +/- 1.94, N = 630.1837.3966.0870.0471.881. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Intel Xeon E5-2620 v4Raptor Talos II3691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.14, N = 32.282.612.766.699.691. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil2 x Intel Xeon Gold 6138Raptor Talos IIAMD EPYC 7601Intel Xeon E5-2620 v4AMD EPYC 755148121620SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.88, N = 6SE +/- 0.27, N = 3SE +/- 0.49, N = 66.0110.5114.2615.7117.351. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingIntel Xeon E5-2620 v4AMD EPYC 7601AMD EPYC 75512 x Intel Xeon Gold 6138Raptor Talos II2004006008001000SE +/- 2.07, N = 3SE +/- 2.31, N = 3SE +/- 1.42, N = 3SE +/- 8.35, N = 6SE +/- 0.64, N = 3179.96256.15355.63397.43817.441. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMD2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Raptor Talos IIIntel Xeon E5-2620 v420406080100SE +/- 0.22, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 327.1032.5934.7938.58108.531. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD Solver2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Raptor Talos IIIntel Xeon E5-2620 v4510152025SE +/- 0.20, N = 6SE +/- 0.14, N = 3SE +/- 0.31, N = 6SE +/- 0.16, N = 3SE +/- 0.08, N = 310.0010.7112.2719.0921.941. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP Streamcluster2 x Intel Xeon Gold 6138AMD EPYC 7601Intel Xeon E5-2620 v4Raptor Talos IIAMD EPYC 7551714212835SE +/- 0.50, N = 6SE +/- 0.53, N = 6SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 3.79, N = 614.5117.4621.0027.8028.791. (CXX) g++ options: -O2 -lOpenCL

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-02-05H.264 Video EncodingAMD EPYC 76012 x Intel Xeon Gold 6138AMD EPYC 7551Intel Xeon E5-2620 v4Raptor Talos II306090120150SE +/- 0.67, N = 3SE +/- 5.54, N = 6SE +/- 1.55, N = 3SE +/- 1.19, N = 6SE +/- 0.14, N = 3128.17127.28101.5260.2143.72-m64-m64-m64-m64-maltivec -mabi=altivec -mvsx1. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test2 x Intel Xeon Gold 6138AMD EPYC 7601Raptor Talos IIAMD EPYC 7551Intel Xeon E5-2620 v430K60K90K120K150KSE +/- 2372.97, N = 3SE +/- 669.88, N = 3SE +/- 390.52, N = 3SE +/- 2630.45, N = 6SE +/- 271.98, N = 3143505995748244779708382831. (CXX) g++ options: -pipe -lpthread

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 7.2Time To Compile2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Raptor Talos IIIntel Xeon E5-2620 v42004006008001000SE +/- 0.65, N = 3SE +/- 5.02, N = 3SE +/- 5.33, N = 3SE +/- 2.17, N = 3SE +/- 0.58, N = 3591.32707.34926.081070.701078.50

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.13Time To Compile2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Raptor Talos IIIntel Xeon E5-2620 v420406080100SE +/- 0.40, N = 5SE +/- 0.47, N = 6SE +/- 0.75, N = 6SE +/- 1.20, N = 4SE +/- 1.49, N = 327.8035.6639.3072.6176.18

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Raptor Talos IIIntel Xeon E5-2620 v43691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 33.153.463.594.6510.431. (CC) gcc options: -lm -lpthread -O3

Stockfish

Total Time

OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total Time2 x Intel Xeon Gold 6138Intel Xeon E5-2620 v4AMD EPYC 7601Raptor Talos IIAMD EPYC 755111002200330044005500SE +/- 80.25, N = 3SE +/- 10.04, N = 3SE +/- 1.33, N = 3SE +/- 167.95, N = 633434411447449155032-msse -msse3 -mpopcnt-msse -msse3 -mpopcnt-msse -msse3 -mpopcnt-msse -msse3 -mpopcnt1. (CXX) g++ options: -lpthread -fno-exceptions -fno-rtti -ansi -pedantic -O3 -flto

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Intel Xeon E5-2620 v4Raptor Talos II1224364860SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.09, N = 5SE +/- 0.19, N = 5SE +/- 0.25, N = 510.2711.7912.7113.6551.79-logg1. (CXX) g++ options: -O2 -fvisibility=hidden -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP32 x Intel Xeon Gold 6138Intel Xeon E5-2620 v4AMD EPYC 7601AMD EPYC 7551Raptor Talos II20406080100SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.52, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 333.2040.8143.5745.6975.27-lncurses1. (CC) gcc options: -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit Performance2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7551Intel Xeon E5-2620 v4Raptor Talos II2K4K6K8K10KSE +/- 50.05, N = 3SE +/- 22.34, N = 3SE +/- 0.85, N = 3SE +/- 48.05, N = 3SE +/- 1.27, N = 37965.404598.474387.472608.632104.871. (CC) gcc options: -O3 -pthread -m64 -lssl -lcrypto -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET2 x Intel Xeon Gold 6138AMD EPYC 7601Intel Xeon E5-2620 v4AMD EPYC 7551Raptor Talos II500K1000K1500K2000K2500KSE +/- 108856.52, N = 6SE +/- 25113.28, N = 3SE +/- 19988.56, N = 3SE +/- 41782.46, N = 6SE +/- 12692.76, N = 32515784.921703353.371664374.631639124.131049994.411. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET2 x Intel Xeon Gold 6138AMD EPYC 7601Intel Xeon E5-2620 v4AMD EPYC 7551Raptor Talos II400K800K1200K1600K2000KSE +/- 62786.06, N = 6SE +/- 23759.98, N = 6SE +/- 16254.82, N = 5SE +/- 6077.00, N = 3SE +/- 11534.54, N = 31744256.791195935.331185182.451161953.33606874.101. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Times2 x Intel Xeon Gold 6138Intel Xeon E5-2620 v4AMD EPYC 7601AMD EPYC 7551Raptor Talos II10002000300040005000SE +/- 2.60, N = 3SE +/- 2.73, N = 3SE +/- 1.45, N = 3SE +/- 3.61, N = 3SE +/- 4.84, N = 313951539208622164859

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite2 x Intel Xeon Gold 6138Intel Xeon E5-2620 v4AMD EPYC 7601AMD EPYC 7551Raptor Talos II130K260K390K520K650KSE +/- 1392.88, N = 3SE +/- 7811.58, N = 6SE +/- 604.68, N = 3SE +/- 2983.18, N = 3SE +/- 55.73, N = 3602467448353393659365767163208

OSBench

Test: Create Threads

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create Threads2 x Intel Xeon Gold 6138Raptor Talos IIAMD EPYC 7601AMD EPYC 7551918273645SE +/- 0.28, N = 3SE +/- 0.45, N = 4SE +/- 0.51, N = 3SE +/- 0.61, N = 623.0727.1730.7138.25-lm-lm-lm1. (CC) gcc options:

OSBench

Test: Create Processes

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ProcessesRaptor Talos II2 x Intel Xeon Gold 6138AMD EPYC 7551AMD EPYC 76011326395265SE +/- 0.19, N = 3SE +/- 0.89, N = 6SE +/- 0.23, N = 3SE +/- 0.87, N = 429.7742.9557.9559.61-lm-lm-lm1. (CC) gcc options:

OSBench

Test: Memory Allocations

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory AllocationsRaptor Talos IIAMD EPYC 76012 x Intel Xeon Gold 6138AMD EPYC 755120406080100SE +/- 0.67, N = 3SE +/- 0.05, N = 3SE +/- 0.42, N = 3SE +/- 1.87, N = 683.0395.1496.0596.32-lm-lm-lm1. (CC) gcc options:


Phoronix Test Suite v10.8.4