AMD EPYC 7601 2P 2021

2 x AMD EPYC 7601 32-Core testing with a Dell 02MJ3T (1.2.5 BIOS) and llvmpipe on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101214-HA-AMDEPYC7645&grr.

AMD EPYC 7601 2P 2021ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1232 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads)Dell 02MJ3T (1.2.5 BIOS)AMD 17h504GB280GB INTEL SSDPED1D280GA + 12 x 500GB Samsung SSD 860 + 120GB INTEL SSDSCKJB120G7RllvmpipeVE2282 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA + 2 x Broadcom NetXtreme BCM5720 2-port PCIeUbuntu 19.105.9.0-050900rc6daily20200922-generic (x86_64) 20200921GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.53.3 Mesa 19.2.8 (LLVM 9.0 128 bits)GCC 9.2.1 20191008ext41600x1200OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- CPU Microcode: 0x8001227Python Details- Python 2.7.17rc1 + Python 3.7.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC 7601 2P 2021qe: AUSURF112relion: Basic - CPUonnx: yolov4 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUkripke: openfoam: Motorbike 60Monednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUlammps: 20k Atomsonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0qmcpack: simple-H2Ocloverleaf: Lagrangian-Eulerian Hydrodynamicsonnx: super-resolution-10 - OpenMP CPUopenfoam: Motorbike 30Monednn: Deconvolution Batch shapes_1d - f32 - CPUbuild-godot: Time To Compiledav1d: Chimera 1080p 10-bitrav1e: 5rav1e: 1rav1e: 6cryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: AES-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpoolcryptsetup: PBKDF2-sha512amg: dav1d: Summer Nature 4Ketcpak: ETC2rav1e: 10unpack-firefox: firefox-84.0.source.tar.xzencode-ape: WAV To APEsynthmark: VoiceMark_100encode-wavpack: WAV To WavPacketcpak: ETC1 + Ditheringlulesh: encode-ogg: WAV To Oggetcpak: ETC1onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1dav1d: Summer Nature 1080pdav1d: Chimera 1080pencode-opus: WAV To Opus Encodeonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUetcpak: DXT1lammps: Rhodopsin Protein1231796.32548.3797621884466.5037882537338.713698.784707.164554.323940.0723.3823557.56535868.5856.76110.74154.12114.95541.83929.54208434.623.54557107.098138.450.7800.2581.016316.0317.7306.7308.61276.71279.4316.5317.4306.7308.11445.41444.35100081170727709699800243.31118.0492.32225.90718.344512.10717.296174.28316092.92526.603184.7473.54912369.108333.403629.60637.0310.2133.867412.902990.9199091.378816.454622.8202819.866123.241017.71533.168801296.78723.3111754.21548.4277123184580.2135226890338.373877.714615.364515.793865.5823.3993580.43525471.5757.01210.90952.86014.81246.24929.24207834.233.42396104.004138.900.7780.2621.031316.6318.1306.9308.81285.01286.7316.8318.3307.0308.41442.61456.95077511157453709739033251.57118.0492.37025.75418.322512.07217.285174.30016073.08626.627184.6903.49511369.669333.272669.24659.1910.2073.819102.525000.9028051.388896.562372.6788419.217322.740416.48793.181181323.79823.3821808.78547.9394661.89340.044137.524885.175020.044050.3723.2014199.6750.65529.8735.323.76272103.914139.130.7750.2611.030316.7318.03073091284.31285.4316.9318.3307.1308.91453.11454.55060731168547708880233248.58118.0702.33618.336512.073174.16615990.72026.665184.7573.71634634.70634.7710.2383.963693.811182.603661.405547.091304.5013021.117225.159520.45283.183921322.33923.083OpenBenchmarking.org

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112123400800120016002000SE +/- 19.90, N = 9SE +/- 5.15, N = 3SE +/- 18.18, N = 91796.321754.211808.781. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

RELION

Test: Basic - Device: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPU123120240360480600SE +/- 0.24, N = 3SE +/- 0.25, N = 3SE +/- 0.25, N = 3548.38548.43547.941. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU1220406080100SE +/- 2.66, N = 12SE +/- 2.07, N = 1276711. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU125001000150020002500SE +/- 142.39, N = 12SE +/- 112.64, N = 12218823181. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12310002000300040005000SE +/- 216.78, N = 15SE +/- 178.79, N = 15SE +/- 148.69, N = 154466.504580.214661.89MIN: 2939.4MIN: 3020.84MIN: 3208.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4128M16M24M32M40MSE +/- 1848270.09, N = 15SE +/- 1684001.44, N = 1237882537352268901. (CXX) g++ options: -O3 -fopenmp

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60M12370140210280350SE +/- 0.27, N = 3SE +/- 0.73, N = 3SE +/- 0.68, N = 3338.71338.37340.041. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1239001800270036004500SE +/- 128.96, N = 15SE +/- 146.91, N = 15SE +/- 85.43, N = 153698.783877.714137.52MIN: 2872.13MIN: 2904.51MIN: 3448.071. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 128.63, N = 12SE +/- 149.33, N = 15SE +/- 125.34, N = 154707.164615.364885.17MIN: 3518.69MIN: 3327.21MIN: 3744.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12311002200330044005500SE +/- 120.82, N = 15SE +/- 153.50, N = 12SE +/- 112.92, N = 154554.324515.795020.04MIN: 3273.38MIN: 2781.43MIN: 3390.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1239001800270036004500SE +/- 64.36, N = 15SE +/- 125.32, N = 15SE +/- 94.46, N = 123940.073865.584050.37MIN: 3412.94MIN: 3206.95MIN: 3493.321. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms123612182430SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 323.3823.4023.201. (CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1239001800270036004500SE +/- 35.53, N = 3SE +/- 114.13, N = 15SE +/- 89.60, N = 153557.563580.434199.67MIN: 3305.84MIN: 3052.37MIN: 3498.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU121224364860SE +/- 0.88, N = 3SE +/- 1.02, N = 1253521. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU121326395265SE +/- 0.44, N = 3SE +/- 2.87, N = 958541. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v3121632486480SE +/- 0.67, N = 3SE +/- 1.22, N = 368.5971.58MIN: 62.84 / MAX: 229.88MIN: 64.39 / MAX: 186.821. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.012246810SE +/- 0.103, N = 3SE +/- 0.613, N = 36.7617.012MIN: 6.2 / MAX: 8.27MIN: 6 / MAX: 24.381. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_224123691215SE +/- 0.19, N = 3SE +/- 0.30, N = 310.7410.91MIN: 10.16 / MAX: 11.79MIN: 10.26 / MAX: 12.21. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-50121224364860SE +/- 1.07, N = 3SE +/- 2.27, N = 354.1252.86MIN: 46.9 / MAX: 742.65MIN: 46.53 / MAX: 819.731. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.01248121620SE +/- 0.10, N = 3SE +/- 0.20, N = 314.9614.81MIN: 13.84 / MAX: 36.38MIN: 13.46 / MAX: 30.981. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1231122334455SE +/- 0.22, N = 3SE +/- 1.39, N = 15SE +/- 1.69, N = 1241.8446.2550.661. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -lm -pthread

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics123714212835SE +/- 0.32, N = 15SE +/- 0.23, N = 15SE +/- 0.49, N = 1529.5429.2429.871. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU12400800120016002000SE +/- 13.94, N = 3SE +/- 32.31, N = 3208420781. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M123816243240SE +/- 0.10, N = 3SE +/- 0.14, N = 3SE +/- 0.33, N = 1534.6234.2335.321. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.84661.69322.53983.38644.233SE +/- 0.03944, N = 15SE +/- 0.03472, N = 15SE +/- 0.04485, N = 153.545573.423963.76272MIN: 2.98MIN: 2.94MIN: 3.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile12320406080100SE +/- 1.83, N = 3SE +/- 1.42, N = 3SE +/- 1.34, N = 3107.10104.00103.91

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit123306090120150SE +/- 0.41, N = 3SE +/- 0.30, N = 3SE +/- 0.14, N = 3138.45138.90139.13MIN: 95.91 / MAX: 217.11MIN: 96.19 / MAX: 217.56MIN: 96.19 / MAX: 219.51. (CC) gcc options: -pthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 51230.17550.3510.52650.7020.8775SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.005, N = 30.7800.7780.775

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 11230.0590.1180.1770.2360.295SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.2580.2620.261

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61230.2320.4640.6960.9281.16SE +/- 0.009, N = 3SE +/- 0.005, N = 3SE +/- 0.014, N = 31.0161.0311.030

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption12370140210280350SE +/- 0.05, N = 2SE +/- 0.06, N = 7SE +/- 0.12, N = 3316.0316.6316.7

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption12370140210280350SE +/- 0.19, N = 3SE +/- 0.06, N = 7SE +/- 0.20, N = 2317.7318.1318.0

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption12370140210280350SE +/- 0.05, N = 2SE +/- 0.17, N = 4306.7306.9307.0

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption12370140210280350SE +/- 0.10, N = 2SE +/- 0.09, N = 6308.6308.8309.0

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption12330060090012001500SE +/- 2.31, N = 3SE +/- 1.94, N = 7SE +/- 1.76, N = 31276.71285.01284.3

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption12330060090012001500SE +/- 1.32, N = 3SE +/- 1.46, N = 7SE +/- 1.42, N = 31279.41286.71285.4

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption12370140210280350SE +/- 0.06, N = 3SE +/- 0.07, N = 7SE +/- 0.10, N = 3316.5316.8316.9

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption12370140210280350SE +/- 0.37, N = 3SE +/- 0.08, N = 7SE +/- 0.03, N = 3317.4318.3318.3

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption12370140210280350SE +/- 0.12, N = 3SE +/- 0.10, N = 7SE +/- 0.00, N = 3306.7307.0307.1

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption12370140210280350SE +/- 0.58, N = 3SE +/- 0.53, N = 7SE +/- 0.03, N = 3308.1308.4308.9

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption12330060090012001500SE +/- 2.14, N = 3SE +/- 12.89, N = 7SE +/- 2.05, N = 31445.41442.61453.1

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption12330060090012001500SE +/- 3.56, N = 3SE +/- 2.27, N = 7SE +/- 1.65, N = 31444.31456.91454.5

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool123110K220K330K440K550KSE +/- 572.73, N = 3SE +/- 280.29, N = 7SE +/- 975.00, N = 3510008507751506073

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512123300K600K900K1200K1500KSE +/- 1900.25, N = 3SE +/- 12198.12, N = 7SE +/- 434.00, N = 3117072711574531168547

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2123150M300M450M600M750MSE +/- 1313948.59, N = 3SE +/- 201948.45, N = 3SE +/- 658905.95, N = 37096998007097390337088802331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K12350100150200250SE +/- 4.09, N = 12SE +/- 1.22, N = 3SE +/- 4.61, N = 12243.31251.57248.58MIN: 81.19 / MAX: 282.97MIN: 91.04 / MAX: 277.22MIN: 85.73 / MAX: 286.181. (CC) gcc options: -pthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123306090120150SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3118.05118.05118.071. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 101230.53331.06661.59992.13322.6665SE +/- 0.021, N = 3SE +/- 0.016, N = 3SE +/- 0.011, N = 32.3222.3702.336

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz12612182430SE +/- 0.08, N = 4SE +/- 0.03, N = 425.9125.75

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE123510152025SE +/- 0.01, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 518.3418.3218.341. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100123110220330440550SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3512.11512.07512.071. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack1248121620SE +/- 0.00, N = 5SE +/- 0.01, N = 517.3017.291. (CXX) g++ options: -rdynamic

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering1234080120160200SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3174.28174.30174.171. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.31233K6K9K12K15KSE +/- 42.67, N = 3SE +/- 42.52, N = 3SE +/- 65.00, N = 316092.9316073.0915990.721. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg123612182430SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 326.6026.6326.671. (CC) gcc options: -O2 -ffast-math -fsigned-char

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC11234080120160200SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3184.75184.69184.761. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.83621.67242.50863.34484.181SE +/- 0.03910, N = 3SE +/- 0.04588, N = 5SE +/- 0.01698, N = 33.549123.495113.71634MIN: 3MIN: 3MIN: 3.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v21280160240320400SE +/- 1.37, N = 3SE +/- 0.16, N = 3369.11369.67MIN: 357.24 / MAX: 557.3MIN: 358.63 / MAX: 519.861. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.11270140210280350SE +/- 0.06, N = 3SE +/- 0.15, N = 3333.40333.27MIN: 332.68 / MAX: 338.81MIN: 332.42 / MAX: 334.081. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p123140280420560700SE +/- 8.65, N = 15SE +/- 4.94, N = 3SE +/- 9.48, N = 15629.60669.24634.70MIN: 194.05 / MAX: 739.75MIN: 231.81 / MAX: 754.48MIN: 194.36 / MAX: 755.241. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p123140280420560700SE +/- 9.95, N = 3SE +/- 1.84, N = 3SE +/- 9.40, N = 4637.03659.19634.77MIN: 344.69 / MAX: 796.13MIN: 348.24 / MAX: 815.29MIN: 349.39 / MAX: 815.271. (CC) gcc options: -pthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 510.2110.2110.241. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.89181.78362.67543.56724.459SE +/- 0.04801, N = 4SE +/- 0.05005, N = 3SE +/- 0.03574, N = 33.867413.819103.96369MIN: 3.27MIN: 3.31MIN: 3.381. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.85751.7152.57253.434.2875SE +/- 0.04679, N = 3SE +/- 0.02862, N = 3SE +/- 0.05747, N = 32.902992.525003.81118MIN: 2.24MIN: 2.08MIN: 3.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.58581.17161.75742.34322.929SE +/- 0.007779, N = 3SE +/- 0.002458, N = 3SE +/- 0.009404, N = 30.9199090.9028052.603660MIN: 0.77MIN: 0.77MIN: 2.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.31620.63240.94861.26481.581SE +/- 0.00740, N = 3SE +/- 0.00510, N = 3SE +/- 0.00370, N = 31.378811.388891.40554MIN: 1.12MIN: 1.21MIN: 1.261. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123246810SE +/- 0.15837, N = 15SE +/- 0.12641, N = 15SE +/- 0.10920, N = 36.454626.562377.09130MIN: 5.22MIN: 5.14MIN: 6.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1231.01282.02563.03844.05125.064SE +/- 0.03873, N = 3SE +/- 0.02321, N = 3SE +/- 0.01074, N = 32.820282.678844.50130MIN: 2.33MIN: 2.27MIN: 4.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU123510152025SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 319.8719.2221.12MIN: 18.97MIN: 18.41MIN: 20.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123612182430SE +/- 0.11, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 323.2422.7425.16MIN: 20.59MIN: 20.43MIN: 22.781. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 317.7216.4920.45MIN: 16.71MIN: 15.75MIN: 19.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.71641.43282.14922.86563.582SE +/- 0.03490, N = 6SE +/- 0.01629, N = 3SE +/- 0.01545, N = 33.168803.181183.18392MIN: 2.83MIN: 2.88MIN: 2.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT112330060090012001500SE +/- 1.47, N = 3SE +/- 0.97, N = 3SE +/- 0.70, N = 31296.791323.801322.341. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein123612182430SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.23, N = 323.3123.3823.081. (CXX) g++ options: -O3 -pthread -lm


Phoronix Test Suite v10.8.4