AMD EPYC 7601 2P 2021

2 x AMD EPYC 7601 32-Core testing with a Dell 02MJ3T (1.2.5 BIOS) and llvmpipe on Ubuntu 19.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101214-HA-AMDEPYC7645
Run Management

Run 1: January 19 2021, Test Duration 11 Hours, 17 Minutes
Run 2: January 20 2021, Test Duration 9 Hours, 38 Minutes
Run 3: January 20 2021, Test Duration 9 Hours, 36 Minutes
Average Test Duration: 10 Hours, 10 Minutes
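The 10 Hours, 10 Minutes figure is consistent with the arithmetic mean of the three run durations. A minimal sketch of that check, assuming the mean is simply truncated to whole minutes (the page does not say how it rounds):

```python
from datetime import timedelta

# Test durations of runs 1-3 as listed in the run table above.
durations = [
    timedelta(hours=11, minutes=17),
    timedelta(hours=9, minutes=38),
    timedelta(hours=9, minutes=36),
]

# Arithmetic mean of the three durations.
average = sum(durations, timedelta()) / len(durations)

# Assumption: truncate to whole minutes for display.
total_minutes = int(average.total_seconds() // 60)
hours, minutes = divmod(total_minutes, 60)
print(f"{hours} Hours, {minutes} Minutes")
```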


AMD EPYC 7601 2P 2021 - System Details (runs 1, 2, 3)

Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads)
Motherboard: Dell 02MJ3T (1.2.5 BIOS)
Chipset: AMD 17h
Memory: 504GB
Disk: 280GB INTEL SSDPED1D280GA + 12 x 500GB Samsung SSD 860 + 120GB INTEL SSDSCKJB120G7R
Graphics: llvmpipe
Monitor: VE228
Network: 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA + 2 x Broadcom NetXtreme BCM5720 2-port PCIe
OS: Ubuntu 19.10
Kernel: 5.9.0-050900rc6daily20200922-generic (x86_64) 20200921
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 3.3 Mesa 19.2.8 (LLVM 9.0 128 bits)
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 1600x1200

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: CPU Microcode: 0x8001227
Python Details: Python 2.7.17rc1 + Python 3.7.5
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

[Figure: Result Overview, Phoronix Test Suite. Runs 1, 2, 3; axis from 100% to 116%. Tests shown: QMCPACK, oneDNN, Quantum ESPRESSO, dav1d, Timed Godot Game Engine Compilation, CloverLeaf, OpenFOAM, rav1e, LAMMPS Molecular Dynamics Simulator, LULESH, RELION, Etcpak, Opus Codec Encoding, Ogg Audio Encoding, Cryptsetup, Monkey Audio Encoding, Algebraic Multi-Grid Benchmark, Google SynthMark.]

AMD EPYC 7601 2P 2021

Test | 1 | 2 | 3
amg: | 709699800 | 709739033 | 708880233
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 29.54 | 29.24 | 29.87
cryptsetup: PBKDF2-sha512 | 1170727 | 1157453 | 1168547
cryptsetup: PBKDF2-whirlpool | 510008 | 507751 | 506073
cryptsetup: AES-XTS 256b Encryption | 1444.3 | 1456.9 | 1454.5
cryptsetup: AES-XTS 256b Decryption | 1445.4 | 1442.6 | 1453.1
cryptsetup: Serpent-XTS 256b Encryption | 308.1 | 308.4 | 308.9
cryptsetup: Serpent-XTS 256b Decryption | 306.7 | 307.0 | 307.1
cryptsetup: Twofish-XTS 256b Encryption | 317.4 | 318.3 | 318.3
cryptsetup: Twofish-XTS 256b Decryption | 316.5 | 316.8 | 316.9
cryptsetup: AES-XTS 512b Encryption | 1279.4 | 1286.7 | 1285.4
cryptsetup: AES-XTS 512b Decryption | 1276.7 | 1285.0 | 1284.3
cryptsetup: Serpent-XTS 512b Encryption | 308.6 | 308.8 | 309
cryptsetup: Serpent-XTS 512b Decryption | 306.7 | 306.9 | 307
cryptsetup: Twofish-XTS 512b Encryption | 317.7 | 318.1 | 318.0
cryptsetup: Twofish-XTS 512b Decryption | 316.0 | 316.6 | 316.7
dav1d: Chimera 1080p | 637.03 | 659.19 | 634.77
dav1d: Summer Nature 4K | 243.31 | 251.57 | 248.58
dav1d: Summer Nature 1080p | 629.60 | 669.24 | 634.70
dav1d: Chimera 1080p 10-bit | 138.45 | 138.90 | 139.13
etcpak: DXT1 | 1296.787 | 1323.798 | 1322.339
etcpak: ETC1 | 184.747 | 184.690 | 184.757
etcpak: ETC2 | 118.049 | 118.049 | 118.070
etcpak: ETC1 + Dithering | 174.283 | 174.300 | 174.166
synthmark: VoiceMark_100 | 512.107 | 512.072 | 512.073
kripke: | 37882537 | 35226890 | -
lammps: 20k Atoms | 23.382 | 23.399 | 23.201
lammps: Rhodopsin Protein | 23.311 | 23.382 | 23.083
lulesh: | 16092.925 | 16073.086 | 15990.720
mnn: SqueezeNetV1.0 | 14.955 | 14.812 | -
mnn: resnet-v2-50 | 54.121 | 52.860 | -
mnn: MobileNetV2_224 | 10.741 | 10.909 | -
mnn: mobilenet-v1-1.0 | 6.761 | 7.012 | -
mnn: inception-v3 | 68.585 | 71.575 | -
encode-ape: WAV To APE | 18.344 | 18.322 | 18.336
encode-ogg: WAV To Ogg | 26.603 | 26.627 | 26.665
onednn: IP Shapes 1D - f32 - CPU | 2.90299 | 2.52500 | 3.81118
onednn: IP Shapes 3D - f32 - CPU | 19.8661 | 19.2173 | 21.1172
onednn: IP Shapes 1D - u8s8f32 - CPU | 3.86741 | 3.81910 | 3.96369
onednn: IP Shapes 3D - u8s8f32 - CPU | 2.82028 | 2.67884 | 4.50130
onednn: Convolution Batch Shapes Auto - f32 - CPU | 17.7153 | 16.4879 | 20.4528
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 3.54557 | 3.42396 | 3.76272
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 6.45462 | 6.56237 | 7.09130
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 23.2410 | 22.7404 | 25.1595
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 3.54912 | 3.49511 | 3.71634
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 3.16880 | 3.18118 | 3.18392
onednn: Recurrent Neural Network Training - f32 - CPU | 4554.32 | 4515.79 | 5020.04
onednn: Recurrent Neural Network Inference - f32 - CPU | 3940.07 | 3865.58 | 4050.37
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 4466.50 | 4580.21 | 4661.89
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 3557.56 | 3580.43 | 4199.67
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.919909 | 0.902805 | 2.60366
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 4707.16 | 4615.36 | 4885.17
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 3698.78 | 3877.71 | 4137.52
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 1.37881 | 1.38889 | 1.40554
onnx: yolov4, bertsquad-10, fcn-resnet101-11, shufflenet-v2-10, super-resolution-10 - OpenMP CPU (five values run together in the source) | 76585321882084 | 71545223182078 | -
openfoam: Motorbike 30M | 34.62 | 34.23 | 35.32
openfoam: Motorbike 60M | 338.71 | 338.37 | 340.04
encode-opus: WAV To Opus Encode | 10.213 | 10.207 | 10.238
qmcpack: simple-H2O | 41.839 | 46.249 | 50.655
qe: AUSURF112 | 1796.32 | 1754.21 | 1808.78
rav1e: 1 | 0.258 | 0.262 | 0.261
rav1e: 5 | 0.780 | 0.778 | 0.775
rav1e: 6 | 1.016 | 1.031 | 1.030
rav1e: 10 | 2.322 | 2.370 | 2.336
relion: Basic - CPU | 548.379 | 548.427 | 547.939
build-godot: Time To Compile | 107.098 | 104.004 | 103.914
tnn: CPU - MobileNet v2 | 369.108 | 369.669 | -
tnn: CPU - SqueezeNet v1.1 | 333.403 | 333.272 | -
unpack-firefox: firefox-84.0.source.tar.xz | 25.907 | 25.754 | -
encode-wavpack: WAV To WavPack | 17.296 | 17.285 | -

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  Run 1: 709699800 (SE +/- 1313948.59, N = 3; Min: 708250800 / Avg: 709699800 / Max: 712322900)
  Run 2: 709739033 (SE +/- 201948.45, N = 3; Min: 709396000 / Avg: 709739033.33 / Max: 710095200)
  Run 3: 708880233 (SE +/- 658905.95, N = 3; Min: 707917000 / Avg: 708880233.33 / Max: 710140700)
(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
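The "SE +/-" figures can be sanity-checked from the Min / Avg / Max values whenever N = 3, because the one unlisted sample follows from the average. The sketch below does this for run 1 of the AMG result, under the assumption that SE means the sample standard deviation divided by the square root of N; the page itself does not define it.

```python
import math
import statistics

# Run 1 figures from the AMG result above (Figure Of Merit, N = 3).
minimum, average, maximum = 708250800, 709699800, 712322900
n = 3

# With three samples, the unlisted middle sample follows from the mean.
middle = n * average - minimum - maximum
samples = [minimum, middle, maximum]

# Assumption: SE = sample standard deviation / sqrt(N).
standard_error = statistics.stdev(samples) / math.sqrt(n)
print(f"SE +/- {standard_error:.2f}")
```

Under that assumption the computed value agrees with the reported SE +/- 1313948.59 to within rounding, which suggests the definition used here is the one the page uses.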

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently uses CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  Run 1: 29.54 (SE +/- 0.32, N = 15; Min: 27.97 / Avg: 29.54 / Max: 31.99)
  Run 2: 29.24 (SE +/- 0.23, N = 15; Min: 27.18 / Avg: 29.24 / Max: 30.56)
  Run 3: 29.87 (SE +/- 0.49, N = 15; Min: 26.75 / Avg: 29.87 / Max: 33.12)
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup, PBKDF2-sha512 (Iterations Per Second, More Is Better)
  Run 1: 1170727 (SE +/- 1900.25, N = 3; Min: 1167679 / Avg: 1170727 / Max: 1174217)
  Run 2: 1157453 (SE +/- 12198.12, N = 7; Min: 1084359 / Avg: 1157452.71 / Max: 1172903)
  Run 3: 1168547 (SE +/- 434.00, N = 3; Min: 1167679 / Avg: 1168547 / Max: 1168981)

Cryptsetup, PBKDF2-whirlpool (Iterations Per Second, More Is Better)
  Run 1: 510008 (SE +/- 572.73, N = 3; Min: 509017 / Avg: 510008.33 / Max: 511001)
  Run 2: 507751 (SE +/- 280.29, N = 7; Min: 506069 / Avg: 507750.71 / Max: 508031)
  Run 3: 506073 (SE +/- 975.00, N = 3; Min: 504123 / Avg: 506073 / Max: 507048)

Cryptsetup, AES-XTS 256b Encryption (MiB/s, More Is Better)
  Run 1: 1444.3 (SE +/- 3.56, N = 3; Min: 1438.5 / Avg: 1444.33 / Max: 1450.8)
  Run 2: 1456.9 (SE +/- 2.27, N = 7; Min: 1450.2 / Avg: 1456.89 / Max: 1467.4)
  Run 3: 1454.5 (SE +/- 1.65, N = 3; Min: 1451.2 / Avg: 1454.47 / Max: 1456.5)

Cryptsetup, AES-XTS 256b Decryption (MiB/s, More Is Better)
  Run 1: 1445.4 (SE +/- 2.14, N = 3; Min: 1442.1 / Avg: 1445.37 / Max: 1449.4)
  Run 2: 1442.6 (SE +/- 12.89, N = 7; Min: 1366.5 / Avg: 1442.61 / Max: 1467.3)
  Run 3: 1453.1 (SE +/- 2.05, N = 3; Min: 1449.1 / Avg: 1453.1 / Max: 1455.9)

Cryptsetup, Serpent-XTS 256b Encryption (MiB/s, More Is Better)
  Run 1: 308.1 (SE +/- 0.58, N = 3; Min: 306.9 / Avg: 308.07 / Max: 308.7)
  Run 2: 308.4 (SE +/- 0.53, N = 7; Min: 305.3 / Avg: 308.36 / Max: 309.2)
  Run 3: 308.9 (SE +/- 0.03, N = 3; Min: 308.9 / Avg: 308.93 / Max: 309)

Cryptsetup, Serpent-XTS 256b Decryption (MiB/s, More Is Better)
  Run 1: 306.7 (SE +/- 0.12, N = 3; Min: 306.5 / Avg: 306.67 / Max: 306.9)
  Run 2: 307.0 (SE +/- 0.10, N = 7; Min: 306.4 / Avg: 306.96 / Max: 307.2)
  Run 3: 307.1 (SE +/- 0.00, N = 3; Min: 307.1 / Avg: 307.1 / Max: 307.1)

Cryptsetup, Twofish-XTS 256b Encryption (MiB/s, More Is Better)
  Run 1: 317.4 (SE +/- 0.37, N = 3; Min: 316.7 / Avg: 317.43 / Max: 317.8)
  Run 2: 318.3 (SE +/- 0.08, N = 7; Min: 318.1 / Avg: 318.3 / Max: 318.6)
  Run 3: 318.3 (SE +/- 0.03, N = 3; Min: 318.2 / Avg: 318.27 / Max: 318.3)

Cryptsetup, Twofish-XTS 256b Decryption (MiB/s, More Is Better)
  Run 1: 316.5 (SE +/- 0.06, N = 3; Min: 316.4 / Avg: 316.5 / Max: 316.6)
  Run 2: 316.8 (SE +/- 0.07, N = 7; Min: 316.6 / Avg: 316.81 / Max: 317.1)
  Run 3: 316.9 (SE +/- 0.10, N = 3; Min: 316.7 / Avg: 316.9 / Max: 317)

Cryptsetup, AES-XTS 512b Encryption (MiB/s, More Is Better)
  Run 1: 1279.4 (SE +/- 1.32, N = 3; Min: 1277.7 / Avg: 1279.4 / Max: 1282)
  Run 2: 1286.7 (SE +/- 1.46, N = 7; Min: 1283.3 / Avg: 1286.7 / Max: 1294.1)
  Run 3: 1285.4 (SE +/- 1.42, N = 3; Min: 1282.6 / Avg: 1285.37 / Max: 1287.3)

Cryptsetup, AES-XTS 512b Decryption (MiB/s, More Is Better)
  Run 1: 1276.7 (SE +/- 2.31, N = 3; Min: 1273.1 / Avg: 1276.67 / Max: 1281)
  Run 2: 1285.0 (SE +/- 1.94, N = 7; Min: 1278.5 / Avg: 1284.97 / Max: 1294.9)
  Run 3: 1284.3 (SE +/- 1.76, N = 3; Min: 1280.9 / Avg: 1284.3 / Max: 1286.8)

Cryptsetup, Serpent-XTS 512b Encryption (MiB/s, More Is Better)
  Run 1: 308.6 (SE +/- 0.10, N = 2; Min: 308.5 / Avg: 308.6 / Max: 308.7)
  Run 2: 308.8 (SE +/- 0.09, N = 6; Min: 308.4 / Avg: 308.78 / Max: 309)
  Run 3: 309.0

Cryptsetup, Serpent-XTS 512b Decryption (MiB/s, More Is Better)
  Run 1: 306.7 (SE +/- 0.05, N = 2; Min: 306.6 / Avg: 306.65 / Max: 306.7)
  Run 2: 306.9 (SE +/- 0.17, N = 4; Min: 306.5 / Avg: 306.93 / Max: 307.3)
  Run 3: 307.0

Cryptsetup, Twofish-XTS 512b Encryption (MiB/s, More Is Better)
  Run 1: 317.7 (SE +/- 0.19, N = 3; Min: 317.5 / Avg: 317.73 / Max: 318.1)
  Run 2: 318.1 (SE +/- 0.06, N = 7; Min: 317.9 / Avg: 318.11 / Max: 318.3)
  Run 3: 318.0 (SE +/- 0.20, N = 2; Min: 317.8 / Avg: 318 / Max: 318.2)

Cryptsetup, Twofish-XTS 512b Decryption (MiB/s, More Is Better)
  Run 1: 316.0 (SE +/- 0.05, N = 2; Min: 315.9 / Avg: 315.95 / Max: 316)
  Run 2: 316.6 (SE +/- 0.06, N = 7; Min: 316.5 / Avg: 316.64 / Max: 316.9)
  Run 3: 316.7 (SE +/- 0.12, N = 3; Min: 316.5 / Avg: 316.73 / Max: 316.9)

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1, Video Input: Chimera 1080p (FPS, More Is Better)
  Run 1: 637.03 (SE +/- 9.95, N = 3; MIN: 344.69 / MAX: 796.13; Min: 617.14 / Avg: 637.03 / Max: 647.71)
  Run 2: 659.19 (SE +/- 1.84, N = 3; MIN: 348.24 / MAX: 815.29; Min: 655.53 / Avg: 659.19 / Max: 661.44)
  Run 3: 634.77 (SE +/- 9.40, N = 4; MIN: 349.39 / MAX: 815.27; Min: 620.8 / Avg: 634.77 / Max: 662.16)

dav1d 0.8.1, Video Input: Summer Nature 4K (FPS, More Is Better)
  Run 1: 243.31 (SE +/- 4.09, N = 12; MIN: 81.19 / MAX: 282.97; Min: 209.32 / Avg: 243.31 / Max: 258.5)
  Run 2: 251.57 (SE +/- 1.22, N = 3; MIN: 91.04 / MAX: 277.22; Min: 249.26 / Avg: 251.57 / Max: 253.4)
  Run 3: 248.58 (SE +/- 4.61, N = 12; MIN: 85.73 / MAX: 286.18; Min: 199.11 / Avg: 248.58 / Max: 261.58)

dav1d 0.8.1, Video Input: Summer Nature 1080p (FPS, More Is Better)
  Run 1: 629.60 (SE +/- 8.65, N = 15; MIN: 194.05 / MAX: 739.75; Min: 549.8 / Avg: 629.6 / Max: 664.85)
  Run 2: 669.24 (SE +/- 4.94, N = 3; MIN: 231.81 / MAX: 754.48; Min: 659.42 / Avg: 669.24 / Max: 675)
  Run 3: 634.70 (SE +/- 9.48, N = 15; MIN: 194.36 / MAX: 755.24; Min: 535.77 / Avg: 634.7 / Max: 673.87)

dav1d 0.8.1, Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
  Run 1: 138.45 (SE +/- 0.41, N = 3; MIN: 95.91 / MAX: 217.11; Min: 137.68 / Avg: 138.45 / Max: 139.1)
  Run 2: 138.90 (SE +/- 0.30, N = 3; MIN: 96.19 / MAX: 217.56; Min: 138.48 / Avg: 138.9 / Max: 139.48)
  Run 3: 139.13 (SE +/- 0.14, N = 3; MIN: 96.19 / MAX: 219.5; Min: 138.86 / Avg: 139.13 / Max: 139.34)
(CC) gcc options: -pthread

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better)
  Run 1: 1296.79 (SE +/- 1.47, N = 3; Min: 1293.94 / Avg: 1296.79 / Max: 1298.84)
  Run 2: 1323.80 (SE +/- 0.97, N = 3; Min: 1322.17 / Avg: 1323.8 / Max: 1325.53)
  Run 3: 1322.34 (SE +/- 0.70, N = 3; Min: 1321.08 / Avg: 1322.34 / Max: 1323.48)

Etcpak 0.7, Configuration: ETC1 (Mpx/s, More Is Better)
  Run 1: 184.75 (SE +/- 0.02, N = 3; Min: 184.72 / Avg: 184.75 / Max: 184.79)
  Run 2: 184.69 (SE +/- 0.02, N = 3; Min: 184.66 / Avg: 184.69 / Max: 184.73)
  Run 3: 184.76 (SE +/- 0.02, N = 3; Min: 184.72 / Avg: 184.76 / Max: 184.78)

Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better)
  Run 1: 118.05 (SE +/- 0.01, N = 3; Min: 118.04 / Avg: 118.05 / Max: 118.07)
  Run 2: 118.05 (SE +/- 0.01, N = 3; Min: 118.04 / Avg: 118.05 / Max: 118.06)
  Run 3: 118.07 (SE +/- 0.00, N = 3; Min: 118.07 / Avg: 118.07 / Max: 118.08)

Etcpak 0.7, Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  Run 1: 174.28 (SE +/- 0.02, N = 3; Min: 174.26 / Avg: 174.28 / Max: 174.32)
  Run 2: 174.30 (SE +/- 0.02, N = 3; Min: 174.27 / Avg: 174.3 / Max: 174.35)
  Run 3: 174.17 (SE +/- 0.05, N = 3; Min: 174.06 / Avg: 174.17 / Max: 174.23)
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Google SynthMark 20201109, Test: VoiceMark_100 (Voices, More Is Better)
  Run 1: 512.11 (SE +/- 0.00, N = 3; Min: 512.11 / Avg: 512.11 / Max: 512.11)
  Run 2: 512.07 (SE +/- 0.02, N = 3; Min: 512.06 / Avg: 512.07 / Max: 512.11)
  Run 3: 512.07 (SE +/- 0.02, N = 3; Min: 512.06 / Avg: 512.07 / Max: 512.11)
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Kripke 1.2.4 (Throughput FoM, More Is Better)
  Run 1: 37882537 (SE +/- 1848270.09, N = 15; Min: 25581770 / Avg: 37882537.33 / Max: 52034560)
  Run 2: 35226890 (SE +/- 1684001.44, N = 12; Min: 24563190 / Avg: 35226890 / Max: 46374140)
(CXX) g++ options: -O3 -fopenmp
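Kripke shows the largest spread of the results so far, so it is worth asking whether the gap between runs 1 and 2 exceeds the measurement noise. The sketch below is a rough heuristic, not something the Phoronix Test Suite reports: it assumes the two runs are independent and combines their standard errors in quadrature.

```python
import math

# Averages and standard errors from the Kripke result above (Throughput FoM).
run1_avg, run1_se = 37882537.33, 1848270.09
run2_avg, run2_se = 35226890.0, 1684001.44

# Assumption: the runs are independent, so their standard errors
# combine in quadrature.
combined_se = math.hypot(run1_se, run2_se)
difference = run1_avg - run2_avg
ratio = difference / combined_se

print(f"difference = {difference:.0f}")
print(f"combined SE = {combined_se:.0f}")
print(f"difference / combined SE = {ratio:.2f}")
```

The ratio comes out at roughly 1.06, so under these assumptions the difference between the two runs is of the same order as the noise.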

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: 20k Atoms (ns/day, More Is Better)
  Run 1: 23.38 (SE +/- 0.08, N = 3; Min: 23.23 / Avg: 23.38 / Max: 23.48)
  Run 2: 23.40 (SE +/- 0.09, N = 3; Min: 23.25 / Avg: 23.4 / Max: 23.58)
  Run 3: 23.20 (SE +/- 0.05, N = 3; Min: 23.1 / Avg: 23.2 / Max: 23.27)

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day, More Is Better)
  Run 1: 23.31 (SE +/- 0.04, N = 3; Min: 23.24 / Avg: 23.31 / Max: 23.35)
  Run 2: 23.38 (SE +/- 0.06, N = 3; Min: 23.27 / Avg: 23.38 / Max: 23.49)
  Run 3: 23.08 (SE +/- 0.23, N = 3; Min: 22.74 / Avg: 23.08 / Max: 23.53)
(CXX) g++ options: -O3 -pthread -lm

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, More Is Better)
  Run 1: 16092.93 (SE +/- 42.67, N = 3; Min: 16035.61 / Avg: 16092.93 / Max: 16176.33)
  Run 2: 16073.09 (SE +/- 42.52, N = 3; Min: 15996.32 / Avg: 16073.09 / Max: 16143.15)
  Run 3: 15990.72 (SE +/- 65.00, N = 3; Min: 15884.61 / Avg: 15990.72 / Max: 16108.81)
(CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.1, Model: SqueezeNetV1.0 (ms, Fewer Is Better)
  Run 1: 14.96 (SE +/- 0.10, N = 3; MIN: 13.84 / MAX: 36.38; Min: 14.77 / Avg: 14.96 / Max: 15.12)
  Run 2: 14.81 (SE +/- 0.20, N = 3; MIN: 13.46 / MAX: 30.98; Min: 14.48 / Avg: 14.81 / Max: 15.16)

Mobile Neural Network 1.1.1, Model: resnet-v2-50 (ms, Fewer Is Better)
  Run 1: 54.12 (SE +/- 1.07, N = 3; MIN: 46.9 / MAX: 742.65; Min: 52.95 / Avg: 54.12 / Max: 56.25)
  Run 2: 52.86 (SE +/- 2.27, N = 3; MIN: 46.53 / MAX: 819.73; Min: 50.06 / Avg: 52.86 / Max: 57.35)

Mobile Neural Network 1.1.1, Model: MobileNetV2_224 (ms, Fewer Is Better)
  Run 1: 10.74 (SE +/- 0.19, N = 3; MIN: 10.16 / MAX: 11.79; Min: 10.47 / Avg: 10.74 / Max: 11.1)
  Run 2: 10.91 (SE +/- 0.30, N = 3; MIN: 10.26 / MAX: 12.2; Min: 10.4 / Avg: 10.91 / Max: 11.43)

Mobile Neural Network 1.1.1, Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
  Run 1: 6.761 (SE +/- 0.103, N = 3; MIN: 6.2 / MAX: 8.27; Min: 6.56 / Avg: 6.76 / Max: 6.91)
  Run 2: 7.012 (SE +/- 0.613, N = 3; MIN: 6 / MAX: 24.38; Min: 6.11 / Avg: 7.01 / Max: 8.18)

Mobile Neural Network 1.1.1, Model: inception-v3 (ms, Fewer Is Better)
  Run 1: 68.59 (SE +/- 0.67, N = 3; MIN: 62.84 / MAX: 229.88; Min: 67.36 / Avg: 68.59 / Max: 69.69)
  Run 2: 71.58 (SE +/- 1.22, N = 3; MIN: 64.39 / MAX: 186.82; Min: 69.19 / Avg: 71.57 / Max: 73.23)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

Monkey Audio Encoding 3.99.6, WAV To APE (Seconds, Fewer Is Better)
  Run 1: 18.34 (SE +/- 0.01, N = 5; Min: 18.31 / Avg: 18.34 / Max: 18.38)
  Run 2: 18.32 (SE +/- 0.00, N = 5; Min: 18.32 / Avg: 18.32 / Max: 18.33)
  Run 3: 18.34 (SE +/- 0.00, N = 5; Min: 18.33 / Avg: 18.34 / Max: 18.34)
(CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Ogg Audio Encoding

This test times how long it takes to encode a sample WAV file to Ogg format using the reference Xiph.org tools/libraries. Learn more via the OpenBenchmarking.org test page.

Ogg Audio Encoding 1.3.4, WAV To Ogg (Seconds, Fewer Is Better)
  Run 1: 26.60 (SE +/- 0.02, N = 3; Min: 26.58 / Avg: 26.6 / Max: 26.64)
  Run 2: 26.63 (SE +/- 0.00, N = 3; Min: 26.62 / Avg: 26.63 / Max: 26.63)
  Run 3: 26.67 (SE +/- 0.01, N = 3; Min: 26.65 / Avg: 26.66 / Max: 26.68)
(CC) gcc options: -O2 -ffast-math -fsigned-char

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 2.90299 (SE +/- 0.04679, N = 3; MIN: 2.24; Min: 2.82 / Avg: 2.9 / Max: 2.98)
  Run 2: 2.52500 (SE +/- 0.02862, N = 3; MIN: 2.08; Min: 2.47 / Avg: 2.52 / Max: 2.56)
  Run 3: 3.81118 (SE +/- 0.05747, N = 3; MIN: 3.25; Min: 3.72 / Avg: 3.81 / Max: 3.92)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
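The oneDNN runs differ far more than the other tests do, so it helps to express each run relative to the fastest one. A minimal sketch for the IP Shapes 1D / f32 result; `percent_slower` is a hypothetical helper name, not part of the Phoronix Test Suite.

```python
def percent_slower(baseline_ms: float, candidate_ms: float) -> float:
    """Return how much longer candidate took than baseline, in percent."""
    return (candidate_ms - baseline_ms) / baseline_ms * 100.0


# Averages from the IP Shapes 1D / f32 result above (ms, fewer is better).
run_ms = {1: 2.90299, 2: 2.52500, 3: 3.81118}

# The fastest run is the one with the smallest time.
fastest = min(run_ms, key=run_ms.get)
for run, ms in run_ms.items():
    slowdown = percent_slower(run_ms[fastest], ms)
    print(f"run {run}: {slowdown:.1f}% slower than run {fastest}")
```

Because the metric is "fewer is better", the baseline is the minimum; for a "more is better" metric the comparison would need to be inverted.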

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU123510152025SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 319.8719.2221.12MIN: 18.97MIN: 18.41MIN: 20.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU123510152025Min: 19.69 / Avg: 19.87 / Max: 20.15Min: 19.09 / Avg: 19.22 / Max: 19.38Min: 20.91 / Avg: 21.12 / Max: 21.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.89181.78362.67543.56724.459SE +/- 0.04801, N = 4SE +/- 0.05005, N = 3SE +/- 0.03574, N = 33.867413.819103.96369MIN: 3.27MIN: 3.31MIN: 3.381. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU123246810Min: 3.73 / Avg: 3.87 / Max: 3.94Min: 3.73 / Avg: 3.82 / Max: 3.9Min: 3.89 / Avg: 3.96 / Max: 4.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1231.01282.02563.03844.05125.064SE +/- 0.03873, N = 3SE +/- 0.02321, N = 3SE +/- 0.01074, N = 32.820282.678844.50130MIN: 2.33MIN: 2.27MIN: 4.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU123246810Min: 2.75 / Avg: 2.82 / Max: 2.89Min: 2.65 / Avg: 2.68 / Max: 2.72Min: 4.48 / Avg: 4.5 / Max: 4.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 317.7216.4920.45MIN: 16.71MIN: 15.75MIN: 19.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025Min: 17.69 / Avg: 17.72 / Max: 17.76Min: 16.41 / Avg: 16.49 / Max: 16.54Min: 20.39 / Avg: 20.45 / Max: 20.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.84661.69322.53983.38644.233SE +/- 0.03944, N = 15SE +/- 0.03472, N = 15SE +/- 0.04485, N = 153.545573.423963.76272MIN: 2.98MIN: 2.94MIN: 3.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU123246810Min: 3.3 / Avg: 3.55 / Max: 3.76Min: 3.25 / Avg: 3.42 / Max: 3.76Min: 3.49 / Avg: 3.76 / Max: 4.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 6.45462 (SE +/- 0.15837, N = 15; MIN: 5.22; Min: 5.47 / Avg: 6.45 / Max: 7.27)
  Run 2: 6.56237 (SE +/- 0.12641, N = 15; MIN: 5.14; Min: 5.61 / Avg: 6.56 / Max: 7.41)
  Run 3: 7.09130 (SE +/- 0.10920, N = 3; MIN: 6.56; Min: 6.92 / Avg: 7.09 / Max: 7.29)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 23.24 (SE +/- 0.11, N = 3; MIN: 20.59; Min: 23.12 / Avg: 23.24 / Max: 23.45)
  Run 2: 22.74 (SE +/- 0.24, N = 3; MIN: 20.43; Min: 22.39 / Avg: 22.74 / Max: 23.21)
  Run 3: 25.16 (SE +/- 0.09, N = 3; MIN: 22.78; Min: 24.98 / Avg: 25.16 / Max: 25.3)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 3.54912 (SE +/- 0.03910, N = 3; MIN: 3; Min: 3.48 / Avg: 3.55 / Max: 3.61)
  Run 2: 3.49511 (SE +/- 0.04588, N = 5; MIN: 3; Min: 3.42 / Avg: 3.5 / Max: 3.67)
  Run 3: 3.71634 (SE +/- 0.01698, N = 3; MIN: 3.23; Min: 3.69 / Avg: 3.72 / Max: 3.75)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 3.16880 (SE +/- 0.03490, N = 6; MIN: 2.83; Min: 3.03 / Avg: 3.17 / Max: 3.27)
  Run 2: 3.18118 (SE +/- 0.01629, N = 3; MIN: 2.88; Min: 3.15 / Avg: 3.18 / Max: 3.21)
  Run 3: 3.18392 (SE +/- 0.01545, N = 3; MIN: 2.91; Min: 3.16 / Avg: 3.18 / Max: 3.21)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 4554.32 (SE +/- 120.82, N = 15; MIN: 3273.38; Min: 3684.02 / Avg: 4554.32 / Max: 5305.34)
  Run 2: 4515.79 (SE +/- 153.50, N = 12; MIN: 2781.43; Min: 3764.53 / Avg: 4515.79 / Max: 5416.72)
  Run 3: 5020.04 (SE +/- 112.92, N = 15; MIN: 3390.91; Min: 4410.15 / Avg: 5020.04 / Max: 5715.16)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 3940.07 (SE +/- 64.36, N = 15; MIN: 3412.94; Min: 3478.78 / Avg: 3940.07 / Max: 4287.38)
  Run 2: 3865.58 (SE +/- 125.32, N = 15; MIN: 3206.95; Min: 3355.02 / Avg: 3865.58 / Max: 4626.94)
  Run 3: 4050.37 (SE +/- 94.46, N = 12; MIN: 3493.32; Min: 3550.3 / Avg: 4050.37 / Max: 4688)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 4466.50 (SE +/- 216.78, N = 15; MIN: 2939.4; Min: 3120.4 / Avg: 4466.5 / Max: 5728.89)
  Run 2: 4580.21 (SE +/- 178.79, N = 15; MIN: 3020.84; Min: 3215.97 / Avg: 4580.21 / Max: 5600.15)
  Run 3: 4661.89 (SE +/- 148.69, N = 15; MIN: 3208.79; Min: 3650.55 / Avg: 4661.89 / Max: 5442.91)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 3557.56 (SE +/- 35.53, N = 3; MIN: 3305.84; Min: 3517.06 / Avg: 3557.56 / Max: 3628.37)
  Run 2: 3580.43 (SE +/- 114.13, N = 15; MIN: 3052.37; Min: 3166.44 / Avg: 3580.43 / Max: 4676.29)
  Run 3: 4199.67 (SE +/- 89.60, N = 15; MIN: 3498.58; Min: 3555.05 / Avg: 4199.67 / Max: 4788.9)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 0.919909 (SE +/- 0.007779, N = 3; MIN: 0.77; Min: 0.91 / Avg: 0.92 / Max: 0.93)
  Run 2: 0.902805 (SE +/- 0.002458, N = 3; MIN: 0.77; Min: 0.9 / Avg: 0.9 / Max: 0.91)
  Run 3: 2.603660 (SE +/- 0.009404, N = 3; MIN: 2.01; Min: 2.58 / Avg: 2.6 / Max: 2.61)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 4707.16 (SE +/- 128.63, N = 12; MIN: 3518.69; Min: 4115.45 / Avg: 4707.16 / Max: 5576.88)
  Run 2: 4615.36 (SE +/- 149.33, N = 15; MIN: 3327.21; Min: 3589.05 / Avg: 4615.36 / Max: 5551.98)
  Run 3: 4885.17 (SE +/- 125.34, N = 15; MIN: 3744.2; Min: 4108.9 / Avg: 4885.17 / Max: 5519.73)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 3698.78 (SE +/- 128.96, N = 15; MIN: 2872.13; Min: 3036.06 / Avg: 3698.78 / Max: 4742.94)
  Run 2: 3877.71 (SE +/- 146.91, N = 15; MIN: 2904.51; Min: 3276.6 / Avg: 3877.71 / Max: 4808.88)
  Run 3: 4137.52 (SE +/- 85.43, N = 15; MIN: 3448.07; Min: 3533.92 / Avg: 4137.52 / Max: 4714.75)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Run 1: 1.37881 (SE +/- 0.00740, N = 3; MIN: 1.12; Min: 1.37 / Avg: 1.38 / Max: 1.39)
  Run 2: 1.38889 (SE +/- 0.00510, N = 3; MIN: 1.21; Min: 1.38 / Avg: 1.39 / Max: 1.4)
  Run 3: 1.40554 (SE +/- 0.00370, N = 3; MIN: 1.26; Min: 1.4 / Avg: 1.41 / Max: 1.41)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
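
Each result above is reported with an "SE +/-" figure over N trial runs. As a rough sketch, assuming SE means the conventional standard error of the mean (sample standard deviation divided by the square root of N; this page does not state the exact formula the Phoronix Test Suite uses), the statistic could be computed as below. The function name and the sample timings are hypothetical and are not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Return the standard error of the mean: sample stdev / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-trial timings in ms (illustrative only).
trials = [10.0, 12.0, 14.0]
avg = statistics.mean(trials)
se = standard_error(trials)
print(f"Avg: {avg:.2f}, SE +/- {se:.4f}, N = {len(trials)}")
```

A larger N shrinks the standard error for the same spread, which is one plausible reason the noisier tests above were run 12 or 15 times rather than 3.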

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.6 - Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 76 (SE +/- 2.66, N = 12; Min: 62 / Avg: 75.88 / Max: 87.5)
  Run 2: 71 (SE +/- 2.07, N = 12; Min: 59.5 / Avg: 71.17 / Max: 78.5)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 58 (SE +/- 0.44, N = 3; Min: 57 / Avg: 57.67 / Max: 58.5)
  Run 2: 54 (SE +/- 2.87, N = 9; Min: 47 / Avg: 54.11 / Max: 71)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6 - Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 53 (SE +/- 0.88, N = 3; Min: 52 / Avg: 53.33 / Max: 55)
  Run 2: 52 (SE +/- 1.02, N = 12; Min: 46 / Avg: 51.71 / Max: 55.5)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 2188 (SE +/- 142.39, N = 12; Min: 1275.5 / Avg: 2187.67 / Max: 2618)
  Run 2: 2318 (SE +/- 112.64, N = 12; Min: 1598.5 / Avg: 2317.58 / Max: 2816.5)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 2084 (SE +/- 13.94, N = 3; Min: 2056 / Avg: 2083.83 / Max: 2099)
  Run 2: 2078 (SE +/- 32.31, N = 3; Min: 2032 / Avg: 2077.5 / Max: 2140)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better)
  Run 1: 34.62 (SE +/- 0.10, N = 3; Min: 34.45 / Avg: 34.62 / Max: 34.78)
  Run 2: 34.23 (SE +/- 0.14, N = 3; Min: 34.09 / Avg: 34.23 / Max: 34.5)
  Run 3: 35.32 (SE +/- 0.33, N = 15; Min: 34.24 / Avg: 35.32 / Max: 38.11)
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

OpenFOAM 8 - Input: Motorbike 60M (Seconds, Fewer Is Better)
  Run 1: 338.71 (SE +/- 0.27, N = 3; Min: 338.17 / Avg: 338.71 / Max: 339.03)
  Run 2: 338.37 (SE +/- 0.73, N = 3; Min: 337.61 / Avg: 338.37 / Max: 339.82)
  Run 3: 340.04 (SE +/- 0.68, N = 3; Min: 339.2 / Avg: 340.04 / Max: 341.38)
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.
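
A timing test of this kind can be approximated by running the encoder several times and recording the wall-clock time of each run. The sketch below is only an illustration, not the test profile's actual harness: `time_command` is a hypothetical helper, and the `opusenc` invocation in the comment assumes Opus-Tools is installed and uses made-up file names. A no-op Python command stands in so the example runs anywhere.

```python
import subprocess
import sys
import time


def time_command(cmd, runs=5):
    """Run cmd `runs` times and return the wall-clock seconds of each run."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        subprocess.run(
            cmd,
            check=True,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        timings.append(time.perf_counter() - start)
    return timings


if __name__ == "__main__":
    # Stand-in command; for a real measurement one might pass
    # ["opusenc", "sample.wav", "sample.opus"] (file names are hypothetical).
    timings = time_command([sys.executable, "-c", "pass"], runs=5)
    print(f"Avg: {sum(timings) / len(timings):.3f} s over N = {len(timings)}")
```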

Opus Codec Encoding 1.3.1 - WAV To Opus Encode (Seconds, Fewer Is Better)
  Run 1: 10.21 (SE +/- 0.00, N = 5; Min: 10.21 / Avg: 10.21 / Max: 10.23)
  Run 2: 10.21 (SE +/- 0.00, N = 5; Min: 10.2 / Avg: 10.21 / Max: 10.22)
  Run 3: 10.24 (SE +/- 0.00, N = 5; Min: 10.23 / Avg: 10.24 / Max: 10.25)
  1. (CXX) g++ options: -fvisibility=hidden -logg -lm

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is a production-level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

QMCPACK 3.10 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
  Run 1: 41.84 (SE +/- 0.22, N = 3; Min: 41.52 / Avg: 41.84 / Max: 42.25)
  Run 2: 46.25 (SE +/- 1.39, N = 15; Min: 41.56 / Avg: 46.25 / Max: 56.81)
  Run 3: 50.66 (SE +/- 1.69, N = 12; Min: 41.58 / Avg: 50.66 / Max: 60.77)
  1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -lm -pthread

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

Quantum ESPRESSO 6.7 - Input: AUSURF112 (Seconds, Fewer Is Better)
  Run 1: 1796.32 (SE +/- 19.90, N = 9; Min: 1723.76 / Avg: 1796.32 / Max: 1871.01)
  Run 2: 1754.21 (SE +/- 5.15, N = 3; Min: 1746.21 / Avg: 1754.21 / Max: 1763.84)
  Run 3: 1808.78 (SE +/- 18.18, N = 9; Min: 1716.03 / Avg: 1808.78 / Max: 1900.47)
  1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 - Speed: 1 (Frames Per Second, More Is Better)
  Run 1: 0.258 (SE +/- 0.002, N = 3; Min: 0.26 / Avg: 0.26 / Max: 0.26)
  Run 2: 0.262 (SE +/- 0.001, N = 3; Min: 0.26 / Avg: 0.26 / Max: 0.26)
  Run 3: 0.261 (SE +/- 0.001, N = 3; Min: 0.26 / Avg: 0.26 / Max: 0.26)

rav1e 0.4 - Speed: 5 (Frames Per Second, More Is Better)
  Run 1: 0.780 (SE +/- 0.006, N = 3; Min: 0.77 / Avg: 0.78 / Max: 0.79)
  Run 2: 0.778 (SE +/- 0.005, N = 3; Min: 0.77 / Avg: 0.78 / Max: 0.79)
  Run 3: 0.775 (SE +/- 0.005, N = 3; Min: 0.77 / Avg: 0.78 / Max: 0.79)

rav1e 0.4 - Speed: 6 (Frames Per Second, More Is Better)
  Run 1: 1.016 (SE +/- 0.009, N = 3; Min: 1 / Avg: 1.02 / Max: 1.03)
  Run 2: 1.031 (SE +/- 0.005, N = 3; Min: 1.02 / Avg: 1.03 / Max: 1.04)
  Run 3: 1.030 (SE +/- 0.014, N = 3; Min: 1 / Avg: 1.03 / Max: 1.05)

rav1e 0.4 - Speed: 10 (Frames Per Second, More Is Better)
  Run 1: 2.322 (SE +/- 0.021, N = 3; Min: 2.28 / Avg: 2.32 / Max: 2.35)
  Run 2: 2.370 (SE +/- 0.016, N = 3; Min: 2.35 / Avg: 2.37 / Max: 2.4)
  Run 3: 2.336 (SE +/- 0.011, N = 3; Min: 2.32 / Avg: 2.34 / Max: 2.36)

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

RELION 3.1.1 - Test: Basic - Device: CPU (Seconds, Fewer Is Better)
  Run 1: 548.38 (SE +/- 0.24, N = 3; Min: 547.92 / Avg: 548.38 / Max: 548.74)
  Run 2: 548.43 (SE +/- 0.25, N = 3; Min: 547.94 / Avg: 548.43 / Max: 548.8)
  Run 3: 547.94 (SE +/- 0.25, N = 3; Min: 547.67 / Avg: 547.94 / Max: 548.45)
  1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine. It is built here using the SCons build system, targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 3.2.3 - Time To Compile (Seconds, Fewer Is Better)
  Run 1: 107.10 (SE +/- 1.83, N = 3; Min: 104.44 / Avg: 107.1 / Max: 110.59)
  Run 2: 104.00 (SE +/- 1.42, N = 3; Min: 102.28 / Avg: 104 / Max: 106.82)
  Run 3: 103.91 (SE +/- 1.34, N = 3; Min: 101.96 / Avg: 103.91 / Max: 106.49)

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
  Run 1: 369.11 (SE +/- 1.37, N = 3; MIN: 357.24 / MAX: 557.3; Min: 366.81 / Avg: 369.11 / Max: 371.54)
  Run 2: 369.67 (SE +/- 0.16, N = 3; MIN: 358.63 / MAX: 519.86; Min: 369.44 / Avg: 369.67 / Max: 369.98)
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  Run 1: 333.40 (SE +/- 0.06, N = 3; MIN: 332.68 / MAX: 338.81; Min: 333.33 / Avg: 333.4 / Max: 333.53)
  Run 2: 333.27 (SE +/- 0.15, N = 3; MIN: 332.42 / MAX: 334.08; Min: 333 / Avg: 333.27 / Max: 333.5)
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Unpacking Firefox

This simple test profile measures how long it takes to extract the .tar.xz source package of the Mozilla Firefox Web Browser. Learn more via the OpenBenchmarking.org test page.
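
The measurement described here amounts to timing one archive extraction. A minimal sketch using Python's standard `tarfile` module follows. It is an assumption-laden illustration rather than the test profile's own script: `time_extract` is a hypothetical helper, and a tiny generated archive stands in for the Firefox source package so the example is self-contained.

```python
import tarfile
import tempfile
import time
from pathlib import Path


def time_extract(archive, dest):
    """Extract a .tar.xz archive into dest and return elapsed seconds."""
    start = time.perf_counter()
    with tarfile.open(archive, mode="r:xz") as tar:
        tar.extractall(dest)
    return time.perf_counter() - start


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Build a small stand-in archive (not the real Firefox sources).
        src = Path(tmp) / "hello.txt"
        src.write_text("hello\n")
        archive = Path(tmp) / "sample.tar.xz"
        with tarfile.open(archive, mode="w:xz") as tar:
            tar.add(src, arcname="hello.txt")
        out = Path(tmp) / "out"
        out.mkdir()
        print(f"Extracted in {time_extract(archive, out):.4f} s")
```

Only trusted archives should be extracted this way, since `extractall` writes whatever paths the archive contains.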

Unpacking Firefox 84.0 - Extracting: firefox-84.0.source.tar.xz (Seconds, Fewer Is Better)
  Run 1: 25.91 (SE +/- 0.08, N = 4; Min: 25.75 / Avg: 25.91 / Max: 26.14)
  Run 2: 25.75 (SE +/- 0.03, N = 4; Min: 25.66 / Avg: 25.75 / Max: 25.8)

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

WavPack Audio Encoding 5.3 - WAV To WavPack (Seconds, Fewer Is Better)
  Run 1: 17.30 (SE +/- 0.00, N = 5; Min: 17.29 / Avg: 17.3 / Max: 17.31)
  Run 2: 17.29 (SE +/- 0.01, N = 5; Min: 17.28 / Avg: 17.29 / Max: 17.31)
  1. (CXX) g++ options: -rdynamic

74 Results Shown

Algebraic Multi-Grid Benchmark
CloverLeaf
Cryptsetup:
  PBKDF2-sha512
  PBKDF2-whirlpool
  AES-XTS 256b Encryption
  AES-XTS 256b Decryption
  Serpent-XTS 256b Encryption
  Serpent-XTS 256b Decryption
  Twofish-XTS 256b Encryption
  Twofish-XTS 256b Decryption
  AES-XTS 512b Encryption
  AES-XTS 512b Decryption
  Serpent-XTS 512b Encryption
  Serpent-XTS 512b Decryption
  Twofish-XTS 512b Encryption
  Twofish-XTS 512b Decryption
dav1d:
  Chimera 1080p
  Summer Nature 4K
  Summer Nature 1080p
  Chimera 1080p 10-bit
Etcpak:
  DXT1
  ETC1
  ETC2
  ETC1 + Dithering
Google SynthMark
Kripke
LAMMPS Molecular Dynamics Simulator:
  20k Atoms
  Rhodopsin Protein
LULESH
Mobile Neural Network:
  SqueezeNetV1.0
  resnet-v2-50
  MobileNetV2_224
  mobilenet-v1-1.0
  inception-v3
Monkey Audio Encoding
Ogg Audio Encoding
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
ONNX Runtime:
  yolov4 - OpenMP CPU
  bertsquad-10 - OpenMP CPU
  fcn-resnet101-11 - OpenMP CPU
  shufflenet-v2-10 - OpenMP CPU
  super-resolution-10 - OpenMP CPU
OpenFOAM:
  Motorbike 30M
  Motorbike 60M
Opus Codec Encoding
QMCPACK
Quantum ESPRESSO
rav1e:
  1
  5
  6
  10
RELION
Timed Godot Game Engine Compilation
TNN:
  CPU - MobileNet v2
  CPU - SqueezeNet v1.1
Unpacking Firefox
WavPack Audio Encoding