Google TauVM AMD EPYC vs. M6g Graviton2 Benchmarks

Initial Google Tau VM AMD EPYC Milan benchmarking compared to Amazon EC2 M6g instances for a future article on Phoronix.com by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2110179-TJ-TAUVMEC2779&sor.

Fields not listed for a system are not repeated in the export.

t2d-standard-8
- Processor: AMD EPYC 7B13 (8 Cores)
- Motherboard: Google Compute Engine t2d-standard-8
- Memory: 32GB
- Disk: 11GB PersistentDisk + 107GB PersistentDisk
- OS: Debian 10
- Kernel: 4.19.0-17-cloud-amd64 (x86_64)
- Compiler: GCC 8.3.0
- File-System: ext4
- System Layer: KVM

t2d-standard-32
- Processor: AMD EPYC 7B13 (32 Cores)
- Motherboard: Google Compute Engine t2d-standard-32
- Memory: 124GB

m6g.2xlarge
- Processor: ARMv8 Neoverse-N1 (8 Cores)
- Motherboard: Amazon EC2 m6g.2xlarge (1.0 BIOS)
- Chipset: Amazon Device 0200
- Memory: 32GB
- Disk: 107GB Amazon Elastic Block Store
- Network: Amazon Elastic
- Kernel: 4.19.0-14-arm64 (aarch64)

m6g.8xlarge
- Processor: ARMv8 Neoverse-N1 (32 Cores)
- Motherboard: Amazon EC2 m6g.8xlarge (1.0 BIOS)
- Memory: 124GB

Kernel Details
- t2d-standard-8, t2d-standard-32: scsi_mod.use_blk_mq=Y - Transparent Huge Pages: always
- m6g.2xlarge, m6g.8xlarge: Transparent Huge Pages: always

Compiler Details
- t2d-standard-8, t2d-standard-32: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
- m6g.2xlarge, m6g.8xlarge: --build=aarch64-linux-gnu --disable-libphobos --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -v

Processor Details
- t2d-standard-8, t2d-standard-32: CPU Microcode: 0x1000065

Java Details
- t2d-standard-8, t2d-standard-32: OpenJDK Runtime Environment (build 11.0.12+7-post-Debian-2deb10u1)

Python Details
- t2d-standard-8, t2d-standard-32: Python 3.7.3
- m6g.2xlarge, m6g.8xlarge: Python 2.7.16 + Python 3.7.3

Security Details
- t2d-standard-8, t2d-standard-32: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
- m6g.2xlarge, m6g.8xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Results Overview

Test                                               t2d-standard-8  t2d-standard-32  m6g.2xlarge  m6g.8xlarge
john-the-ripper: Blowfish                                   10796            42732         5183        20648
john-the-ripper: MD5                                       812530          3210333       185258       721783
graphics-magick: Swirl                                        340             1248          269         1006
graphics-magick: Sharpen                                       89              346           74          267
graphics-magick: Enhanced                                     149              573          103          401
dav1d: Summer Nature 4K                                    136.96           326.48        78.21       198.26
compress-7zip: Compress Speed Test                          38830           133228        27799       109417
build-ffmpeg: Time To Compile                              71.044           30.708       84.158       32.671
build-gdb: Time To Compile                                 71.620           43.817      100.206       63.432
build-nodejs: Time To Compile                             554.723          165.283      761.417      234.102
build2: Time To Compile                                   139.925           57.698      183.076       78.061
openssl: SHA256                                        6659560627      25739835653   5527759170  22078459950
pgbench: 100 - 250 - Read Only                             152289           617961        87873       406665
pgbench: 100 - 250 - Read Only - Average Latency            1.642            0.405        2.845        0.615
pgbench: 100 - 500 - Read Only                             143763           605373        82474       347544
pgbench: 100 - 500 - Read Only - Average Latency            3.488            0.826        6.073        1.439
basis: UASTC Level 2                                       37.681           14.965       50.229       19.177
basis: UASTC Level 3                                       74.771           24.761       99.644       31.592
stress-ng: Crypto                                         1650.02          6218.86      1257.57      5030.58
stress-ng: Vector Math                                   20807.29         78739.94     17255.17     69034.73
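One way to read the overview is as ratios between the two 32-core instances. The sketch below is only an illustration and is not part of the exported result: it copies a handful of rows from the overview, inverts the "fewer is better" ones, and takes a geometric mean. The names `RESULTS` and `relative_performance` are made up for this example, and the selection of tests is arbitrary.

```python
from math import prod

# (t2d-standard-32, m6g.8xlarge, higher_is_better), copied from the overview.
# The subset of tests is an arbitrary choice for illustration.
RESULTS = {
    "john-the-ripper: Blowfish": (42732, 20648, True),
    "dav1d: Summer Nature 4K": (326.48, 198.26, True),
    "compress-7zip: Compress Speed Test": (133228, 109417, True),
    "build-ffmpeg: Time To Compile": (30.708, 32.671, False),
    "openssl: SHA256": (25739835653, 22078459950, True),
    "pgbench: 100 - 250 - Read Only": (617961, 406665, True),
}


def relative_performance(t2d, m6g, higher_is_better):
    """Return how many times faster t2d is than m6g (above 1 means t2d leads)."""
    return t2d / m6g if higher_is_better else m6g / t2d


ratios = {name: relative_performance(*row) for name, row in RESULTS.items()}

# Geometric mean, so that no single test with a large ratio dominates.
geomean = prod(ratios.values()) ** (1 / len(ratios))

for name, ratio in ratios.items():
    print(f"{name:40s} {ratio:5.2f}x")
print(f"{'geometric mean':40s} {geomean:5.2f}x")
```

A geometric mean is used rather than an arithmetic one because the inputs are ratios; adding further rows from the overview only requires marking whether the test is "more is better".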

John The Ripper

Test: Blowfish

John The Ripper 1.9.0-jumbo-1, Real C/S (More Is Better)

- t2d-standard-32: 42732 (SE +/- 31.94, N = 3)
- m6g.8xlarge: 20648 (SE +/- 2.19, N = 3)
- t2d-standard-8: 10796 (SE +/- 5.36, N = 3)
- m6g.2xlarge: 5183 (SE +/- 0.67, N = 3)

1. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
Additional options shown for individual systems: -m64 (twice)

John The Ripper

Test: MD5

John The Ripper 1.9.0-jumbo-1, Real C/S (More Is Better)

- t2d-standard-32: 3210333 (SE +/- 1201.85, N = 3)
- t2d-standard-8: 812530 (SE +/- 1883.13, N = 3)
- m6g.8xlarge: 721783 (SE +/- 670.59, N = 3)
- m6g.2xlarge: 185258 (SE +/- 37.20, N = 3)

1. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt
Additional options shown for individual systems: -m64 (twice)

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)

- t2d-standard-32: 1248
- m6g.8xlarge: 1006
- t2d-standard-8: 340
- m6g.2xlarge: 269

SE values shown in the export (system attribution not preserved): +/- 2.67, +/- 2.33, +/- 1.53 (N = 3 each)

1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
Additional options shown for individual systems: -lSM -lICE -lX11 -lxml2; -lSM -lICE -lX11 -lxml2; -lxml2

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)

- t2d-standard-32: 346
- m6g.8xlarge: 267
- t2d-standard-8: 89
- m6g.2xlarge: 74

1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
Additional options shown for individual systems: -lSM -lICE -lX11 -lxml2; -lSM -lICE -lX11 -lxml2; -lxml2

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)

- t2d-standard-32: 573
- m6g.8xlarge: 401
- t2d-standard-8: 149
- m6g.2xlarge: 103

SE value shown in the export (system attribution not preserved): +/- 0.33, N = 3

1. (CC) gcc options: -fopenmp -O2 -pthread -ljpeg -lz -lm -lpthread
Additional options shown for individual systems: -lSM -lICE -lX11 -lxml2; -lSM -lICE -lX11 -lxml2; -lxml2

dav1d

Video Input: Summer Nature 4K

dav1d 0.9.2, FPS (More Is Better)

- t2d-standard-32: 326.48 (SE +/- 1.06, N = 3; MIN: 254.49 / MAX: 365.12)
- m6g.8xlarge: 198.26 (SE +/- 0.03, N = 3; MIN: 135.94 / MAX: 210.43)
- t2d-standard-8: 136.96 (SE +/- 0.15, N = 3; MIN: 128.38 / MAX: 155.91)
- m6g.2xlarge: 78.21 (SE +/- 0.07, N = 3; MIN: 72.84 / MAX: 84.32)

1. (CC) gcc options: -lm -pthread

7-Zip Compression

Compress Speed Test

7-Zip Compression 16.02, MIPS (More Is Better)

- t2d-standard-32: 133228 (SE +/- 1357.41, N = 3)
- m6g.8xlarge: 109417 (SE +/- 186.53, N = 3)
- t2d-standard-8: 38830 (SE +/- 581.53, N = 15)
- m6g.2xlarge: 27799 (SE +/- 77.93, N = 3)

1. (CXX) g++ options: -pipe -lpthread

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.4, Seconds (Fewer Is Better)

- t2d-standard-32: 30.71 (SE +/- 0.24, N = 3)
- m6g.8xlarge: 32.67 (SE +/- 0.03, N = 3)
- t2d-standard-8: 71.04 (SE +/- 0.62, N = 15)
- m6g.2xlarge: 84.16 (SE +/- 0.17, N = 3)

Timed GDB GNU Debugger Compilation

Time To Compile

Timed GDB GNU Debugger Compilation 10.2, Seconds (Fewer Is Better)

- t2d-standard-32: 43.82 (SE +/- 0.09, N = 3)
- m6g.8xlarge: 63.43 (SE +/- 0.04, N = 3)
- t2d-standard-8: 71.62 (SE +/- 0.43, N = 3)
- m6g.2xlarge: 100.21 (SE +/- 0.13, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11, Seconds (Fewer Is Better)

- t2d-standard-32: 165.28 (SE +/- 0.89, N = 3)
- m6g.8xlarge: 234.10 (SE +/- 0.29, N = 3)
- t2d-standard-8: 554.72 (SE +/- 1.86, N = 3)
- m6g.2xlarge: 761.42 (SE +/- 3.14, N = 3)

Build2

Time To Compile

Build2 0.13, Seconds (Fewer Is Better)

- t2d-standard-32: 57.70 (SE +/- 0.06, N = 3)
- m6g.8xlarge: 78.06 (SE +/- 0.06, N = 3)
- t2d-standard-8: 139.93 (SE +/- 0.54, N = 3)
- m6g.2xlarge: 183.08 (SE +/- 1.20, N = 3)

OpenSSL

Algorithm: SHA256

OpenSSL 3.0, byte/s (More Is Better)

- t2d-standard-32: 25739835653 (SE +/- 4017105.90, N = 3)
- m6g.8xlarge: 22078459950 (SE +/- 55812357.58, N = 3)
- t2d-standard-8: 6659560627 (SE +/- 51724089.12, N = 3)
- m6g.2xlarge: 5527759170 (SE +/- 13414421.31, N = 3)

1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Additional options shown for individual systems: -m64 (twice)

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

PostgreSQL pgbench 14.0, TPS (More Is Better)

- t2d-standard-32: 617961 (SE +/- 3588.30, N = 3)
- m6g.8xlarge: 406665 (SE +/- 1401.95, N = 3)
- t2d-standard-8: 152289 (SE +/- 453.87, N = 3)
- m6g.2xlarge: 87873 (SE +/- 593.35, N = 3)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

PostgreSQL pgbench 14.0, ms (Fewer Is Better)

- t2d-standard-32: 0.405 (SE +/- 0.002, N = 3)
- m6g.8xlarge: 0.615 (SE +/- 0.002, N = 3)
- t2d-standard-8: 1.642 (SE +/- 0.005, N = 3)
- m6g.2xlarge: 2.845 (SE +/- 0.019, N = 3)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only

PostgreSQL pgbench 14.0, TPS (More Is Better)

- t2d-standard-32: 605373 (SE +/- 3614.22, N = 3)
- m6g.8xlarge: 347544 (SE +/- 2376.63, N = 3)
- t2d-standard-8: 143763 (SE +/- 2355.50, N = 11)
- m6g.2xlarge: 82474 (SE +/- 997.61, N = 12)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency

PostgreSQL pgbench 14.0, ms (Fewer Is Better)

- t2d-standard-32: 0.826 (SE +/- 0.005, N = 3)
- m6g.8xlarge: 1.439 (SE +/- 0.010, N = 3)
- t2d-standard-8: 3.488 (SE +/- 0.061, N = 11)
- m6g.2xlarge: 6.073 (SE +/- 0.075, N = 12)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
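The pgbench latency figures track the throughput figures closely: when every client is kept busy, average latency is roughly the client count divided by TPS. The sketch below checks that relationship against the reported read-only numbers. It is an illustration only; `PGBENCH` and `implied_latency_ms` are names made up here. Small gaps are to be expected, presumably because the TPS and latency averages are each taken over several runs.

```python
# (system, clients, reported TPS, reported average latency in ms),
# copied from the pgbench read-only results in this export.
PGBENCH = [
    ("t2d-standard-32", 250, 617961, 0.405),
    ("m6g.8xlarge", 250, 406665, 0.615),
    ("t2d-standard-8", 250, 152289, 1.642),
    ("m6g.2xlarge", 250, 87873, 2.845),
    ("t2d-standard-32", 500, 605373, 0.826),
    ("m6g.8xlarge", 500, 347544, 1.439),
    ("t2d-standard-8", 500, 143763, 3.488),
    ("m6g.2xlarge", 500, 82474, 6.073),
]


def implied_latency_ms(clients, tps):
    """Average latency implied by throughput, assuming all clients stay busy."""
    return clients / tps * 1000.0


for system, clients, tps, reported_ms in PGBENCH:
    implied = implied_latency_ms(clients, tps)
    gap_pct = abs(implied - reported_ms) / reported_ms * 100.0
    print(
        f"{system:16s} {clients:3d} clients: "
        f"implied {implied:.3f} ms, reported {reported_ms:.3f} ms "
        f"({gap_pct:.2f}% apart)"
    )
```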

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13, Seconds (Fewer Is Better)

- t2d-standard-32: 14.97 (SE +/- 0.03, N = 3)
- m6g.8xlarge: 19.18 (SE +/- 0.01, N = 3)
- t2d-standard-8: 37.68 (SE +/- 0.06, N = 3)
- m6g.2xlarge: 50.23 (SE +/- 0.01, N = 3)

1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13, Seconds (Fewer Is Better)

- t2d-standard-32: 24.76 (SE +/- 0.01, N = 3)
- m6g.8xlarge: 31.59 (SE +/- 0.01, N = 3)
- t2d-standard-8: 74.77 (SE +/- 0.24, N = 3)
- m6g.2xlarge: 99.64 (SE +/- 0.01, N = 3)

1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Stress-NG

Test: Crypto

Stress-NG 0.13.02, Bogo Ops/s (More Is Better)

- t2d-standard-32: 6218.86 (SE +/- 2.08, N = 3)
- m6g.8xlarge: 5030.58 (SE +/- 2.69, N = 3)
- t2d-standard-8: 1650.02 (SE +/- 5.68, N = 3)
- m6g.2xlarge: 1257.57 (SE +/- 0.71, N = 3)

1. (CC) gcc options: -O2 -std=gnu99 -lm -lcrypt -lrt -lz -ldl -pthread -lc -latomic

Stress-NG

Test: Vector Math

Stress-NG 0.13.02, Bogo Ops/s (More Is Better)

- t2d-standard-32: 78739.94 (SE +/- 12.58, N = 3)
- m6g.8xlarge: 69034.73 (SE +/- 28.36, N = 3)
- t2d-standard-8: 20807.29 (SE +/- 58.31, N = 3)
- m6g.2xlarge: 17255.17 (SE +/- 7.25, N = 3)

1. (CC) gcc options: -O2 -std=gnu99 -lm -lcrypt -lrt -lz -ldl -pthread -lc -latomic
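Because each instance family was tested at 8 and 32 cores, the results can also be read as scaling figures. The sketch below is only an illustration: it takes three tests from this export and computes the speedup from the smaller to the larger instance, along with the fraction of the ideal 4x that was reached. The names `SCALING`, `speedup` and `IDEAL` are invented for this example, and the three tests are an arbitrary selection.

```python
# (8-core result, 32-core result, higher_is_better), copied from this export.
SCALING = {
    "t2d-standard (AMD EPYC 7B13)": {
        "7-Zip Compression": (38830, 133228, True),
        "Stress-NG Crypto": (1650.02, 6218.86, True),
        "Timed FFmpeg Compilation": (71.04, 30.71, False),
    },
    "m6g (ARMv8 Neoverse-N1)": {
        "7-Zip Compression": (27799, 109417, True),
        "Stress-NG Crypto": (1257.57, 5030.58, True),
        "Timed FFmpeg Compilation": (84.16, 32.67, False),
    },
}

IDEAL = 32 / 8  # four times the cores


def speedup(small, large, higher_is_better):
    """Return how many times faster the 32-core result is than the 8-core one."""
    return large / small if higher_is_better else small / large


for family, tests in SCALING.items():
    print(family)
    for test, row in tests.items():
        s = speedup(*row)
        print(f"  {test:26s} {s:4.2f}x ({s / IDEAL:.0%} of ideal)")
```

Compilation scales less than the synthetic tests on both families in this selection, which fits a workload with serial phases; the sketch itself does not establish the cause.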


Phoronix Test Suite v10.8.4