Microsoft Azure EPYC 7003 HBv3 Benchmarks

Azure HBv3 vs. Azure HBv2 benchmarks.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2104116-PTS-AZURE71063
The tests in this comparison fall within the following categories:

Bioinformatics 2 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 7 Tests
CPU Massive 16 Tests
Creator Workloads 4 Tests
Encoding 3 Tests
Finance 2 Tests
Fortran Tests 4 Tests
HPC - High Performance Computing 19 Tests
Machine Learning 6 Tests
Molecular Dynamics 7 Tests
MPI Benchmarks 6 Tests
Multi-Core 14 Tests
NVIDIA GPU Compute 5 Tests
OpenMPI Tests 9 Tests
Programmer / Developer System Benchmarks 4 Tests
Python Tests 3 Tests
Scientific Computing 10 Tests
Server CPU Tests 10 Tests
Single-Threaded 2 Tests
Video Encoding 3 Tests

Test Runs

Result Identifier    Date Run          Test Duration
Azure HBv3           April 09 2021     11 Hours, 37 Minutes
Azure HBv2           April 10 2021     14 Hours, 14 Minutes


Microsoft Azure EPYC 7003 HBv3 Benchmarks - System Details

Azure HBv3:
  Processor: 2 x AMD EPYC 7V13 64-Core (120 Cores)
  Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.1 BIOS)
  Memory: 442GB
  Disk: 2 x 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk
  Graphics: hyperv_fb
  OS: CentOS Linux 8
  Kernel: 4.18.0-147.8.1.el8_1.x86_64 (x86_64)
  Compiler: GCC 8.3.1 20190507
  File-System: nfs
  Screen Resolution: 1152x864
  System Layer: microsoft

Azure HBv2:
  Processor: 2 x AMD EPYC 7V12 64-Core (120 Cores)
  Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.0 BIOS)
  Memory: 450GB
  Disk: 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk
  (Remaining components identical to Azure HBv3)

Kernel Details: Transparent Huge Pages: always
Compiler Details: --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
Processor Details: CPU Microcode: 0xffffffff
Python Details: Python 3.6.8
Security Details:
  Azure HBv3: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected
  Azure HBv2: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected

Azure HBv3 vs. Azure HBv2 Comparison (Phoronix Test Suite percentage-advantage chart): Azure HBv3 leads in every test shown, with per-test advantages ranging from roughly +3.3% (Pennant sedovbig) up to +201.5% (oneDNN IP Shapes 3D - f32 - CPU). Other large gains include NCNN CPU vgg16 (+180.6%), Mobile Neural Network MobileNetV2_224 (+162%), and SVT-VP9 VMAF Optimized - Bosphorus 1080p (+154.5%).

Microsoft Azure EPYC 7003 HBv3 Benchmarks result overview: a condensed side-by-side table of every test result for Azure HBv3 and Azure HBv2 (SVT-VP9, oneDNN, Mobile Neural Network, PlaidML, Zstd, Botan, QuantLib, Rodinia, SVT-AV1, FinanceBench, SVT-HEVC, LULESH, TNN, Timed HMMer Search, GNU GMP GMPbench, Timed MAFFT Alignment, Xcompact3d Incompact3d, NAMD, timed code compilation, GROMACS, NAS Parallel Benchmarks, HPCG, Pennant, Kripke, NCNN, TensorFlow Lite, CloverLeaf, miniFE, and others). The individual results with error bars follow below.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3, Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, more is better)
Azure HBv3: 337.71 (SE +/- 4.36, N = 3; min 329.01 / max 342.57)
Azure HBv2: 140.59 (SE +/- 0.20, N = 3; min 140.32 / max 140.98)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
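
The tests below drive oneDNN through its benchdnn harness; as a rough illustration of the library layer being exercised, here is a minimal sketch of the oneDNN 2.x C++ primitive API running a ReLU on the CPU engine. This is not the benchmark's own code, and the tensor shape and values are arbitrary.

```cpp
// Minimal oneDNN 2.x sketch: CPU engine, one eltwise (ReLU) primitive.
// Build (assumed): g++ -std=c++11 relu.cpp -ldnnl
#include <dnnl.hpp>
#include <vector>
#include <iostream>

int main() {
    dnnl::engine eng(dnnl::engine::kind::cpu, 0);   // CPU engine, as in these tests
    dnnl::stream strm(eng);

    // A tiny 1x8 f32 tensor in plain "nc" layout.
    dnnl::memory::desc md({1, 8}, dnnl::memory::data_type::f32,
                          dnnl::memory::format_tag::nc);
    std::vector<float> data = {-3, -2, -1, 0, 1, 2, 3, 4};
    dnnl::memory src(md, eng, data.data());
    dnnl::memory dst(md, eng);

    // Describe, create, and execute a forward-inference ReLU primitive.
    dnnl::eltwise_forward::desc d(dnnl::prop_kind::forward_inference,
                                  dnnl::algorithm::eltwise_relu, md, 0.f);
    dnnl::eltwise_forward::primitive_desc pd(d, eng);
    dnnl::eltwise_forward(pd).execute(strm, {{DNNL_ARG_SRC, src},
                                             {DNNL_ARG_DST, dst}});
    strm.wait();

    const float *out = static_cast<const float *>(dst.get_data_handle());
    for (int i = 0; i < 8; ++i) std::cout << out[i] << " ";
    std::cout << "\n";
    return 0;
}
```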

oneDNN 2.1.2, Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.455682 (SE +/- 0.002442, N = 3; min 0.45 / max 0.46; MIN: 0.38)
Azure HBv2: 0.732130 (SE +/- 0.002363, N = 3; min 0.73 / max 0.74; MIN: 0.65)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
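
As context for the MNN model results below, this is a hedged sketch of how a converted MNN model (e.g. inception-v3) is loaded and run through the C++ Interpreter API; "model.mnn" is a placeholder path, preprocessing and error handling are omitted, and the thread count is illustrative.

```cpp
// Rough MNN inference sketch; not the test profile's own harness.
#include <MNN/Interpreter.hpp>
#include <memory>

int main() {
    std::shared_ptr<MNN::Interpreter> net(
        MNN::Interpreter::createFromFile("model.mnn"));  // converted .mnn model (placeholder)

    MNN::ScheduleConfig config;
    config.numThread = 4;                  // CPU threads used for inference
    MNN::Session *session = net->createSession(config);

    MNN::Tensor *input = net->getSessionInput(session, nullptr);
    (void)input;                           // ... fill with preprocessed image data ...

    net->runSession(session);              // one timed inference pass

    MNN::Tensor *output = net->getSessionOutput(session, nullptr);
    (void)output;                          // post-process scores as needed
    return 0;
}
```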

Mobile Neural Network 1.1.3, Model: inception-v3 (ms, fewer is better)
Azure HBv3: 34.71 (SE +/- 0.24, N = 15; min 33.47 / max 36.67; MIN: 31.09 / MAX: 427.77)
Azure HBv2: 55.36 (SE +/- 0.95, N = 12; min 50 / max 62.27; MIN: 47.56 / MAX: 509.74)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: No - Mode: Inference - Network: VGG16 - Device: CPU (FPS, more is better)
Azure HBv3: 38.39 (SE +/- 0.39, N = 3; min 37.7 / max 39.06)
Azure HBv2: 25.57 (SE +/- 0.21, N = 3; min 25.32 / max 25.98)

PlaidML, FP16: No - Mode: Inference - Network: VGG19 - Device: CPU (FPS, more is better)
Azure HBv3: 34.53 (SE +/- 0.42, N = 4; min 33.43 / max 35.48)
Azure HBv2: 23.12 (SE +/- 0.31, N = 15; min 20.74 / max 25.41)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, fewer is better)
Azure HBv3: 540.19 (SE +/- 8.06, N = 15; min 480.13 / max 601.19; MIN: 462.7)
Azure HBv2: 796.82 (SE +/- 6.35, N = 15; min 756.99 / max 825.98; MIN: 726.18)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
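
The compression-level results below correspond to zstd's one-shot and long-distance-matching modes; here is a hedged sketch of the zstd C API at level 19, one of the levels tested. The 1 MiB input buffer is a stand-in for the FreeBSD image the test profile actually uses, and the "long mode" runs would instead use the advanced ZSTD_CCtx_setParameter API.

```cpp
// Minimal zstd one-shot compress/decompress sketch at level 19.
// Build (assumed): g++ zstd_demo.cpp -lzstd
#include <zstd.h>
#include <vector>
#include <string>
#include <cstdio>

int main() {
    std::string input(1 << 20, 'A');                       // 1 MiB of sample data
    std::vector<char> dst(ZSTD_compressBound(input.size()));

    size_t csize = ZSTD_compress(dst.data(), dst.size(),
                                 input.data(), input.size(),
                                 19 /* compression level */);
    if (ZSTD_isError(csize)) {
        std::fprintf(stderr, "zstd error: %s\n", ZSTD_getErrorName(csize));
        return 1;
    }
    std::printf("compressed %zu -> %zu bytes\n", input.size(), csize);

    std::vector<char> out(input.size());
    size_t dsize = ZSTD_decompress(out.data(), out.size(), dst.data(), csize);
    std::printf("decompressed back to %zu bytes\n", dsize);
    return 0;
}
```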

Zstd Compression 1.4.9, Compression Level: 19 - Decompression Speed (MB/s, more is better)
Azure HBv3: 3717.5 (SE +/- 6.14, N = 15; min 3679.6 / max 3752)
Azure HBv2: 2631.0 (SE +/- 1.17, N = 15; min 2621.5 / max 2637.8)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression 1.4.9, Compression Level: 8, Long Mode - Decompression Speed (MB/s, more is better)
Azure HBv3: 4256.8 (SE +/- 4.89, N = 7; min 4238.7 / max 4275.7)
Azure HBv2: 3037.5 (SE +/- 4.57, N = 3; min 3028.9 / max 3044.5)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression 1.4.9, Compression Level: 19, Long Mode - Decompression Speed (MB/s, more is better)
Azure HBv3: 3727.5 (SE +/- 12.70, N = 15; min 3565.1 / max 3764.5)
Azure HBv2: 2674.2 (SE +/- 2.70, N = 15; min 2640.1 / max 2684.6)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.
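
The cipher throughput results below exercise Botan's block-cipher interface; as an illustrative, hedged sketch (not the benchmark's own code), this shows the Botan 2 API for AES-256 with placeholder all-zero key and block data.

```cpp
// Botan 2 block-cipher sketch. Build (assumed): g++ aes.cpp -lbotan-2
#include <botan/block_cipher.h>
#include <memory>
#include <vector>
#include <iostream>

int main() {
    std::unique_ptr<Botan::BlockCipher> aes = Botan::BlockCipher::create("AES-256");
    if (!aes) return 1;                        // algorithm not available in this build

    std::vector<uint8_t> key(32, 0x00);        // 256-bit key (placeholder value)
    aes->set_key(key.data(), key.size());

    std::vector<uint8_t> block(aes->block_size(), 0x00);   // one 16-byte block
    aes->encrypt(block.data());                // in-place single-block encryption
    aes->decrypt(block.data());                // and the matching decryption

    std::cout << "block size: " << aes->block_size() << " bytes\n";
    return 0;
}
```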

Botan 2.17.3, Test: AES-256 (MiB/s, more is better)
Azure HBv3: 5412.13 (SE +/- 3.73, N = 3; min 5405.03 / max 5417.66)
Azure HBv2: 3934.40 (SE +/- 2.68, N = 3; min 3929.32 / max 3938.41)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: AES-256 - Decrypt (MiB/s, more is better)
Azure HBv3: 5407.48 (SE +/- 7.85, N = 3; min 5393.16 / max 5420.23)
Azure HBv2: 3938.44 (SE +/- 3.26, N = 3; min 3931.93 / max 3941.77)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

QuantLib

QuantLib is an open-source library/framework for quantitative finance, covering modeling, trading, and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
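
To make the kind of work QuantLib performs concrete, here is a hedged sketch of pricing a European call with the library's analytic Black-Scholes engine. The dates, rates, and volatility are arbitrary illustration values and this is not the QuantLib benchmark's own code.

```cpp
// QuantLib European-call pricing sketch. Build (assumed): g++ option.cpp -lQuantLib
#include <ql/quantlib.hpp>
#include <iostream>

using namespace QuantLib;

int main() {
    Calendar calendar = TARGET();
    Date today(9, April, 2021);
    Settings::instance().evaluationDate() = today;

    // Flat market data: spot 100, 1% risk-free rate, no dividends, 20% vol.
    Handle<Quote> spot(ext::shared_ptr<Quote>(new SimpleQuote(100.0)));
    Handle<YieldTermStructure> riskFree(ext::shared_ptr<YieldTermStructure>(
        new FlatForward(today, 0.01, Actual365Fixed())));
    Handle<YieldTermStructure> dividends(ext::shared_ptr<YieldTermStructure>(
        new FlatForward(today, 0.00, Actual365Fixed())));
    Handle<BlackVolTermStructure> vol(ext::shared_ptr<BlackVolTermStructure>(
        new BlackConstantVol(today, calendar, 0.20, Actual365Fixed())));

    // One-year European call struck at 100.
    ext::shared_ptr<StrikedTypePayoff> payoff(new PlainVanillaPayoff(Option::Call, 100.0));
    ext::shared_ptr<Exercise> exercise(new EuropeanExercise(Date(9, April, 2022)));
    VanillaOption option(payoff, exercise);

    // Black-Scholes-Merton process priced with the analytic European engine.
    ext::shared_ptr<BlackScholesMertonProcess> process(
        new BlackScholesMertonProcess(spot, dividends, riskFree, vol));
    option.setPricingEngine(ext::shared_ptr<PricingEngine>(
        new AnalyticEuropeanEngine(process)));

    std::cout << "European call NPV: " << option.NPV() << std::endl;
    return 0;
}
```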

QuantLib 1.21 (MFLOPS, more is better)
Azure HBv3: 2299.3 (SE +/- 4.80, N = 3; min 2290.1 / max 2306.3)
Azure HBv2: 1725.2 (SE +/- 0.77, N = 3; min 1724.1 / max 1726.7)
1. (CXX) g++ options: -O3 -march=native -O2 -rdynamic -lboost_timer -lboost_system -lboost_chrono

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 8, Long Mode - Compression Speed (MB/s, more is better)
Azure HBv3: 771.9 (SE +/- 6.99, N = 7; min 748.6 / max 801.2)
Azure HBv2: 588.0 (SE +/- 0.81, N = 3; min 586.7 / max 589.5)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP HotSpot3D (Seconds, fewer is better)
Azure HBv3: 82.49 (SE +/- 1.18, N = 15; min 77.17 / max 91.09)
Azure HBv2: 107.98 (SE +/- 0.89, N = 3; min 106.46 / max 109.52)
1. (CXX) g++ options: -O2 -lOpenCL

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 0 - Input: 1080p (Frames Per Second, more is better)
Azure HBv3: 0.161 (SE +/- 0.000, N = 3; min 0.16 / max 0.16)
Azure HBv2: 0.124 (SE +/- 0.001, N = 3; min 0.12 / max 0.13)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.881307 (SE +/- 0.012358, N = 3; min 0.86 / max 0.9; MIN: 0.76)
Azure HBv2: 1.124840 (SE +/- 0.011740, N = 15; min 1.07 / max 1.23; MIN: 1.01)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte-Carlo method (equity option example), fixed-rate bonds with a flat forward curve, and repo securities repurchase agreements. FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.
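
The results shown here are the Bonds and Repo OpenMP cases, but since the suite's first test case is built around Black-Scholes-Merton analytic pricing, here is the closed-form European call formula it relies on, as a small self-contained illustration (not FinanceBench code; the sample parameters are arbitrary).

```cpp
// Black-Scholes-Merton closed-form European call price.
#include <cmath>
#include <cstdio>

// Standard normal CDF via the complementary error function.
static double norm_cdf(double x) { return 0.5 * std::erfc(-x / std::sqrt(2.0)); }

// S: spot, K: strike, r: risk-free rate, q: dividend yield,
// sigma: volatility, T: time to expiry in years.
static double bsm_call(double S, double K, double r, double q,
                       double sigma, double T) {
    double d1 = (std::log(S / K) + (r - q + 0.5 * sigma * sigma) * T)
                / (sigma * std::sqrt(T));
    double d2 = d1 - sigma * std::sqrt(T);
    return S * std::exp(-q * T) * norm_cdf(d1)
         - K * std::exp(-r * T) * norm_cdf(d2);
}

int main() {
    // An at-the-money one-year call, 1% rate, 20% vol -> roughly 8.4.
    std::printf("call price: %.4f\n", bsm_call(100, 100, 0.01, 0.0, 0.20, 1.0));
    return 0;
}
```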

FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, fewer is better)
Azure HBv3: 104884.29 (SE +/- 454.05, N = 3; min 104078.39 / max 105649.7)
Azure HBv2: 133719.33 (SE +/- 476.22, N = 3; min 132784.52 / max 134344.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 9.50445 (SE +/- 0.09389, N = 3; min 9.33 / max 9.66; MIN: 4.19)
Azure HBv2: 12.10510 (SE +/- 0.11811, N = 6; min 11.71 / max 12.52; MIN: 5.8)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte-Carlo method (equity option example), fixed-rate bonds with a flat forward curve, and repo securities repurchase agreements. FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, fewer is better)
Azure HBv3: 59196.55 (SE +/- 228.72, N = 3; min 58815.6 / max 59606.33)
Azure HBv2: 75154.26 (SE +/- 166.77, N = 3; min 74919.02 / max 75476.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0, Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, more is better)
Azure HBv3: 45.39 (SE +/- 0.60, N = 3; min 44.65 / max 46.57)
Azure HBv2: 36.12 (SE +/- 0.13, N = 3; min 35.86 / max 36.3)
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: Twofish (MiB/s, more is better)
Azure HBv3: 346.32 (SE +/- 1.10, N = 3; min 344.13 / max 347.56)
Azure HBv2: 280.90 (SE +/- 0.15, N = 3; min 280.75 / max 281.2)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: Twofish - Decrypt (MiB/s, more is better)
Azure HBv3: 345.78 (SE +/- 0.77, N = 3; min 344.23 / max 346.56)
Azure HBv2: 283.26 (SE +/- 0.22, N = 3; min 282.85 / max 283.6)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: Blowfish (MiB/s, more is better)
Azure HBv3: 424.74 (SE +/- 0.47, N = 3; min 423.81 / max 425.22)
Azure HBv2: 348.43 (SE +/- 0.42, N = 3; min 347.64 / max 349.09)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: Blowfish - Decrypt (MiB/s, more is better)
Azure HBv3: 425.16 (SE +/- 0.42, N = 3; min 424.32 / max 425.59)
Azure HBv2: 349.15 (SE +/- 0.56, N = 3; min 348.3 / max 350.2)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, more is better)
Azure HBv3: 94.03 (SE +/- 0.34, N = 3; min 93.43 / max 94.59)
Azure HBv2: 78.10 (SE +/- 0.65, N = 3; min 76.81 / max 78.78)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 8 - Compression Speed (MB/s, more is better)
Azure HBv3: 3184.2 (SE +/- 45.10, N = 3; min 3127 / max 3273.2)
Azure HBv2: 2653.1 (SE +/- 33.03, N = 15; min 2446.8 / max 2850.1)
1. (CC) gcc options: -O3 -pthread -lz -llzma

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, more is better)
Azure HBv3: 41636.60 (SE +/- 476.10, N = 3; min 40950.33 / max 42551.38)
Azure HBv2: 34803.54 (SE +/- 44.77, N = 3; min 34743.07 / max 34890.96)
1. (CXX) g++ options: -O3 -fopenmp -lm -fexceptions -pthread -lmpi_cxx -lmpi

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: KASUMI (MiB/s, more is better)
Azure HBv3: 87.48 (SE +/- 0.04, N = 3; min 87.41 / max 87.53)
Azure HBv2: 73.18 (SE +/- 0.07, N = 3; min 73.08 / max 73.31)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 (ms, fewer is better)
Azure HBv3: 272.61 (SE +/- 0.15, N = 3; min 272.47 / max 272.91; MIN: 272.1 / MAX: 273.5)
Azure HBv2: 323.62 (SE +/- 0.19, N = 3; min 323.35 / max 323.97; MIN: 322.04 / MAX: 326.05)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1, Pfam Database Search (Seconds, fewer is better)
Azure HBv3: 175.24 (SE +/- 1.00, N = 3; min 173.26 / max 176.52)
Azure HBv2: 207.87 (SE +/- 1.08, N = 3; min 205.71 / max 209.04)
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: KASUMI - Decrypt (MiB/s, more is better)
Azure HBv3: 84.06 (SE +/- 0.02, N = 3; min 84.03 / max 84.08)
Azure HBv2: 71.10 (SE +/- 0.04, N = 3; min 71.04 / max 71.18)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.445223 (SE +/- 0.004912, N = 4; min 0.43 / max 0.45; MIN: 0.39)
Azure HBv2: 0.524737 (SE +/- 0.004967, N = 3; min 0.52 / max 0.53; MIN: 0.48)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.
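
As a small, hedged illustration of the kind of work GMPbench stresses, this sketch multiplies two large integers with the mpz_* API, producing a result wider than either operand. The operand sizes are arbitrary and far smaller than the benchmark's own.

```cpp
// GMP widening-multiplication sketch. Build (assumed): g++ gmp_demo.cpp -lgmp
#include <gmp.h>
#include <cstdio>

int main() {
    mpz_t a, b, product;
    mpz_inits(a, b, product, NULL);

    mpz_ui_pow_ui(a, 2, 4096);        // a = 2^4096
    mpz_ui_pow_ui(b, 3, 2048);        // b = 3^2048
    mpz_mul(product, a, b);           // widening multiply: result has ~7300 bits

    std::printf("product has %zu bits\n", mpz_sizeinbase(product, 2));

    mpz_clears(a, b, product, NULL);
    return 0;
}
```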

GNU GMP GMPbench 6.2.1, Total Time (GMPbench Score, more is better)
Azure HBv3: 4893.4
Azure HBv2: 4155.5
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: CAST-256 (MiB/s, more is better)
Azure HBv3: 133.69 (SE +/- 0.03, N = 3; min 133.65 / max 133.74)
Azure HBv2: 113.60 (SE +/- 0.07, N = 3; min 113.46 / max 113.69)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: CAST-256 - Decrypt (MiB/s, more is better)
Azure HBv3: 133.67 (SE +/- 0.02, N = 3; min 133.63 / max 133.69)
Azure HBv2: 113.72 (SE +/- 0.01, N = 3; min 113.71 / max 113.73)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, fewer is better)
Azure HBv3: 49.35 (SE +/- 0.43, N = 8; min 47.97 / max 52.04)
Azure HBv2: 57.04 (SE +/- 0.57, N = 3; min 56.01 / max 57.97)
1. (CXX) g++ options: -O2 -lOpenCL

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471, Multiple Sequence Alignment - LSU RNA (Seconds, fewer is better)
Azure HBv3: 14.33 (SE +/- 0.10, N = 15; min 13.68 / max 15.05)
Azure HBv2: 16.06 (SE +/- 0.06, N = 3; min 16 / max 16.19)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference, high-performance code for solving the incompressible Navier-Stokes equations along with any number of scalar transport equations. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11, Input: X3D-benchmarking input.i3d (Seconds, fewer is better)
Azure HBv3: 287.60 (SE +/- 0.17, N = 3; min 287.31 / max 287.89)
Azure HBv2: 318.91 (SE +/- 0.07, N = 3; min 318.81 / max 319.04)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, fewer is better)
Azure HBv3: 0.27566 (SE +/- 0.00027, N = 3; min 0.28 / max 0.28)
Azure HBv2: 0.30056 (SE +/- 0.00059, N = 3; min 0.3 / max 0.3)

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 19, Long Mode - Compression Speed (MB/s, more is better)
Azure HBv3: 36.3 (SE +/- 0.34, N = 15; min 34 / max 38.5)
Azure HBv2: 33.4 (SE +/- 0.38, N = 15; min 31.2 / max 35.5)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: ChaCha20Poly1305 (MiB/s, more is better)
Azure HBv3: 667.77 (SE +/- 0.27, N = 3; min 667.33 / max 668.25)
Azure HBv2: 615.80 (SE +/- 0.90, N = 3; min 614.01 / max 616.82)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: ChaCha20Poly1305 - Decrypt (MiB/s, more is better)
Azure HBv3: 658.37 (SE +/- 0.06, N = 3; min 658.25 / max 658.44)
Azure HBv2: 610.86 (SE +/- 0.78, N = 3; min 609.48 / max 612.17)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.10.20, Time To Compile (Seconds, fewer is better)
Azure HBv3: 42.01 (SE +/- 0.58, N = 15; min 39.43 / max 49.12)
Azure HBv2: 44.92 (SE +/- 0.58, N = 13; min 43.11 / max 50.84)

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data set. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3, Water Benchmark (Ns Per Day, more is better)
Azure HBv3: 9.041 (SE +/- 0.009, N = 3; min 9.03 / max 9.06)
Azure HBv2: 8.458 (SE +/- 0.009, N = 3; min 8.45 / max 8.48)
1. (CXX) g++ options: -O2 -pthread -lrt -lpthread -lm

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 15.11, Time To Compile (Seconds, fewer is better)
Azure HBv3: 111.33 (SE +/- 1.09, N = 3; min 110.23 / max 113.5)
Azure HBv2: 118.97 (SE +/- 0.83, N = 3; min 118.09 / max 120.62)

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 10.0, Time To Compile (Seconds, fewer is better)
Azure HBv3: 163.76 (SE +/- 1.89, N = 3; min 160.99 / max 167.37)
Azure HBv2: 174.74 (SE +/- 1.99, N = 3; min 172.31 / max 178.69)

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, more is better)
Azure HBv3: 56682.82 (SE +/- 428.57, N = 14; min 51213.56 / max 57398.81)
Azure HBv2: 53829.34 (SE +/- 21.24, N = 3; min 53787.12 / max 53854.58)
1. (F9X) gfortran options: -O3 -march=native -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a scientific benchmark from Sandia National Labs focused on supercomputer testing with modern, real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, more is better)
Azure HBv3: 39.06 (SE +/- 0.06, N = 3; min 38.95 / max 39.14)
Azure HBv2: 37.27 (SE +/- 0.06, N = 3; min 37.16 / max 37.36)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -fexceptions -pthread -lmpi_cxx -lmpi

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, fewer is better)
Azure HBv3: 3.337201 (SE +/- 0.017738, N = 3; min 3.3 / max 3.36)
Azure HBv2: 3.486503 (SE +/- 0.009939, N = 3; min 3.47 / max 3.51)
1. (CXX) g++ options: -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, fewer is better)
Azure HBv3: 37.80 (SE +/- 0.23, N = 3; min 37.37 / max 38.12)
Azure HBv2: 39.20 (SE +/- 0.20, N = 3; min 38.79 / max 39.41)
1. (CXX) g++ options: -O2 -lOpenCL

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, fewer is better)
Azure HBv3: 5.833419 (SE +/- 0.003905, N = 3; min 5.83 / max 5.84)
Azure HBv2: 6.026449 (SE +/- 0.012548, N = 3; min 6 / max 6.05)
1. (CXX) g++ options: -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.575238 (SE +/- 0.005232, N = 3; min 0.57 / max 0.59; MIN: 0.5)
Azure HBv2: 0.581981 (SE +/- 0.000573, N = 3; min 0.58 / max 0.58; MIN: 0.53)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2, Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 1.58182 (SE +/- 0.00509, N = 3; min 1.57 / max 1.59; MIN: 1.49)
Azure HBv2: 1.59557 (SE +/- 0.01470, N = 3; min 1.57 / max 1.62; MIN: 1.5)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms, and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Kripke 1.2.4 (Throughput FoM, more is better)
Azure HBv3: 82341865 (SE +/- 1839362.01, N = 15; min 69796040 / max 94657260)
Azure HBv2: 43855550 (SE +/- 812916.11, N = 15; min 40401240 / max 51792940)
1. (CXX) g++ options: -O2 -fopenmp

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU (FPS, more is better)
Azure HBv3: 6.27 (SE +/- 0.16, N = 9; min 6.01 / max 7.53)
Azure HBv2: 5.60 (SE +/- 0.03, N = 3; min 5.56 / max 5.65)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
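As background for the CPU model results below, this is a hedged sketch of ncnn's C++ inference API; the "squeezenet.param"/"squeezenet.bin" files and the "data"/"prob" blob names are placeholders that depend on the converted model, and the include path may differ by install.

```cpp
// Rough ncnn inference sketch; not the benchmark harness itself.
#include "ncnn/net.h"    // header location is install-dependent
#include <vector>

int main() {
    ncnn::Net net;
    net.opt.num_threads = 4;                  // CPU threads for inference
    net.load_param("squeezenet.param");       // network structure (placeholder file)
    net.load_model("squeezenet.bin");         // weights (placeholder file)

    ncnn::Mat in(227, 227, 3);                // placeholder input tensor
    in.fill(0.5f);

    ncnn::Extractor ex = net.create_extractor();
    ex.input("data", in);                     // feed the input blob

    ncnn::Mat out;
    ex.extract("prob", out);                  // run the net and fetch the output blob

    std::vector<float> scores(out.w);
    for (int i = 0; i < out.w; ++i) scores[i] = out[i];
    return 0;
}
```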

NCNN 20201218, Target: CPU - Model: resnet50 (ms, fewer is better)
Azure HBv3: 30.03 (SE +/- 0.90, N = 9; min 26.33 / max 34.39; MIN: 25.77 / MAX: 962.03)
Azure HBv2: 64.80 (SE +/- 2.16, N = 12; min 53.36 / max 81.5; MIN: 44.36 / MAX: 2196.77)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: alexnet (ms, fewer is better)
Azure HBv3: 6.89 (SE +/- 0.57, N = 9; min 5.94 / max 11.35; MIN: 5.85 / MAX: 759.55)
Azure HBv2: 10.80 (SE +/- 0.21, N = 12; min 9.91 / max 11.94; MIN: 9.2 / MAX: 35.44)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: vgg16 (ms, fewer is better)
Azure HBv3: 47.67 (SE +/- 2.09, N = 9; min 41.22 / max 61.33; MIN: 40.11 / MAX: 1444.13)
Azure HBv2: 133.78 (SE +/- 6.58, N = 12; min 102.77 / max 180.34; MIN: 82.15 / MAX: 2364.48)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: efficientnet-b0 (ms, fewer is better)
Azure HBv3: 21.04 (SE +/- 3.62, N = 9; min 13.44 / max 49.11; MIN: 12.75 / MAX: 4928.42)
Azure HBv2: 32.42 (SE +/- 1.94, N = 12; min 25.07 / max 46.08; MIN: 20.32 / MAX: 786.11)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: shufflenet-v2 (ms, fewer is better)
Azure HBv3: 13.71 (SE +/- 0.83, N = 9; min 10 / max 17.91; MIN: 9.5 / MAX: 286.31)
Azure HBv2: 21.39 (SE +/- 1.32, N = 12; min 15.42 / max 31.92; MIN: 14.34 / MAX: 1188.39)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, fewer is better)
Azure HBv3: 23.24 (SE +/- 3.10, N = 9; min 12.73 / max 37.48; MIN: 10.66 / MAX: 3825.91)
Azure HBv2: 37.20 (SE +/- 2.17, N = 12; min 25.01 / max 52.55; MIN: 13.35 / MAX: 4343.96)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: mobilenet (ms, fewer is better)
Azure HBv3: 27.87 (SE +/- 1.74, N = 9; min 22.17 / max 38.48; MIN: 21.41 / MAX: 63.52)
Azure HBv2: 52.56 (SE +/- 2.25, N = 12; min 44.28 / max 66.78; MIN: 40.97 / MAX: 511.3)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.3, Model: mobilenet-v1-1.0 (ms, fewer is better)
Azure HBv3: 3.950 (SE +/- 0.151, N = 15; min 3.02 / max 4.71; MIN: 2.55 / MAX: 5.41)
Azure HBv2: 8.030 (SE +/- 0.500, N = 12; min 6.88 / max 13.38; MIN: 4.5 / MAX: 15.23)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224Azure HBv3Azure HBv23691215SE +/- 0.085, N = 15SE +/- 0.587, N = 124.78212.527MIN: 3.71 / MAX: 61.38MIN: 7.24 / MAX: 67.341. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224Azure HBv3Azure HBv248121620Min: 4.33 / Avg: 4.78 / Max: 5.5Min: 10.07 / Avg: 12.53 / Max: 16.11. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50Azure HBv3Azure HBv21224364860SE +/- 0.25, N = 15SE +/- 1.18, N = 1229.0952.53MIN: 25.79 / MAX: 252.29MIN: 41.27 / MAX: 326.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50Azure HBv3Azure HBv21122334455Min: 27.82 / Avg: 29.09 / Max: 31.32Min: 44.92 / Avg: 52.53 / Max: 56.921. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0Azure HBv3Azure HBv248121620SE +/- 0.113, N = 15SE +/- 0.657, N = 128.22414.136MIN: 5.66 / MAX: 54.17MIN: 10.98 / MAX: 78.481. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0Azure HBv3Azure HBv248121620Min: 7.01 / Avg: 8.22 / Max: 8.85Min: 11.71 / Avg: 14.14 / Max: 19.711. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. Current Linux support is limited to running on the CPU. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.
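For reference, the measurement this profile reports can be approximated with TensorFlow's Python API: load a .tflite model, run a warm-up pass, and time repeated invocations. This is only a minimal sketch under assumptions, not how the Phoronix Test Suite harness runs the test; the model path and iteration count below are hypothetical.

import time
import numpy as np
import tensorflow as tf

# Hypothetical model path; the actual test profile ships its own SqueezeNet model.
interpreter = tf.lite.Interpreter(model_path="squeezenet.tflite")
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]

# Feed random data of the expected shape and dtype.
interpreter.set_tensor(inp["index"], np.random.random_sample(inp["shape"]).astype(inp["dtype"]))
interpreter.invoke()  # warm-up run

runs = []
for _ in range(50):  # iteration count chosen arbitrarily for this sketch
    start = time.perf_counter()
    interpreter.invoke()
    runs.append((time.perf_counter() - start) * 1e6)  # microseconds, matching the table's unit

print(f"Avg: {np.mean(runs):.1f} us  Min: {np.min(runs):.1f}  Max: {np.max(runs):.1f}")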

TensorFlow Lite 2020-08-23 - Model: SqueezeNet - Microseconds, Fewer Is Better
  Azure HBv3: 66320.4 (SE +/- 2854.14, N = 15; Min: 48475.2 / Avg: 66320.38 / Max: 78669)
  Azure HBv2: 74885.4 (SE +/- 1151.94, N = 15; Min: 69961.9 / Avg: 74885.4 / Max: 83330.7)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 843.10 (SE +/- 14.29, N = 15; Min: 743.86 / Avg: 843.1 / Max: 928.84; MIN: 711.66)
  Azure HBv2: 1294.53 (SE +/- 11.10, N = 15; Min: 1239.94 / Avg: 1294.53 / Max: 1401.9; MIN: 1179.19)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 0.255501 (SE +/- 0.000278, N = 3; Min: 0.26 / Avg: 0.26 / Max: 0.26; MIN: 0.22)
  Azure HBv2: 0.350347 (SE +/- 0.006544, N = 12; Min: 0.33 / Avg: 0.35 / Max: 0.4; MIN: 0.3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 561.77 (SE +/- 8.80, N = 15; Min: 490.11 / Avg: 561.77 / Max: 612.81; MIN: 471.83)
  Azure HBv2: 791.46 (SE +/- 5.73, N = 15; Min: 746 / Avg: 791.46 / Max: 823.74; MIN: 718.74)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 962.34 (SE +/- 102.39, N = 14; Min: 741.04 / Avg: 962.34 / Max: 2245.6; MIN: 722.13)
  Azure HBv2: 1287.03 (SE +/- 13.52, N = 15; Min: 1207.12 / Avg: 1287.03 / Max: 1368.08; MIN: 1149.25)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 530.90 (SE +/- 8.62, N = 15; Min: 476.45 / Avg: 530.9 / Max: 573.73; MIN: 458.96)
  Azure HBv2: 778.31 (SE +/- 8.13, N = 3; Min: 765.12 / Avg: 778.31 / Max: 793.12; MIN: 738.78)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 875.17 (SE +/- 25.25, N = 12; Min: 755.33 / Avg: 875.17 / Max: 1009.77; MIN: 727.38)
  Azure HBv2: 1314.00 (SE +/- 16.54, N = 15; Min: 1182.01 / Avg: 1314 / Max: 1419.56; MIN: 1140.59)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 0.406686 (SE +/- 0.004961, N = 3; Min: 0.4 / Avg: 0.41 / Max: 0.41; MIN: 0.35)
  Azure HBv2: 1.031942 (SE +/- 0.030825, N = 15; Min: 0.92 / Avg: 1.03 / Max: 1.21; MIN: 0.83)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 0.383855 (SE +/- 0.009558, N = 12; Min: 0.35 / Avg: 0.38 / Max: 0.47; MIN: 0.32)
  Azure HBv2: 0.441012 (SE +/- 0.003995, N = 7; Min: 0.43 / Avg: 0.44 / Max: 0.46; MIN: 0.4)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 3.21204 (SE +/- 0.02162, N = 3; Min: 3.17 / Avg: 3.21 / Max: 3.25; MIN: 2.58)
  Azure HBv2: 9.68321 (SE +/- 0.28898, N = 15; Min: 9.08 / Avg: 9.68 / Max: 12.42; MIN: 5.56)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-VP9

This is a test of SVT-VP9, the Intel Open Visual Cloud Scalable Video Technology CPU-based multi-threaded video encoder for the VP9 video format, using a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 376.91 (SE +/- 5.29, N = 15; Min: 356.6 / Avg: 376.91 / Max: 425.74)
  Azure HBv2: 163.51 (SE +/- 9.44, N = 12; Min: 140.75 / Avg: 163.51 / Max: 260.45)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 357.06 (SE +/- 23.14, N = 12; Min: 106.25 / Avg: 357.06 / Max: 406.73)
  Azure HBv2: 140.30 (SE +/- 5.31, N = 15; Min: 97.49 / Avg: 140.3 / Max: 193.3)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

This is a test of SVT-HEVC, the Intel Open Visual Cloud Scalable Video Technology CPU-based multi-threaded video encoder for the HEVC / H.265 video format, using a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 548.33 (SE +/- 7.76, N = 3; Min: 532.86 / Avg: 548.33 / Max: 557.1)
  Azure HBv2: 379.91 (SE +/- 21.86, N = 15; Min: 151.29 / Avg: 379.91 / Max: 460.12)
  1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 378.01 (SE +/- 4.22, N = 3; Min: 369.69 / Avg: 378.01 / Max: 383.39)
  Azure HBv2: 166.90 (SE +/- 6.53, N = 12; Min: 129.12 / Avg: 166.9 / Max: 202.77)
  1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-AV1

This is a test of SVT-AV1, the Intel Open Visual Cloud Scalable Video Technology CPU-based multi-threaded video encoder for the AV1 video format, using a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
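As a rough point of reference for reproducing this kind of encoder run outside the Phoronix Test Suite, the sketch below drives FFmpeg's libsvtav1 wrapper from Python and times a 1080p encode. This is only an approximation under stated assumptions: the test profile uses the standalone SVT-AV1 encoder application rather than FFmpeg, the input file name is hypothetical, and preset 4 is chosen simply to mirror the "Enc Mode 4" setting.

import subprocess
import time

# Hypothetical 1080p source clip; any YUV4MPEG file of known resolution works.
SOURCE = "Bosphorus_1920x1080.y4m"

cmd = [
    "ffmpeg", "-y",
    "-i", SOURCE,
    "-c:v", "libsvtav1",   # FFmpeg's SVT-AV1 wrapper (requires a build with SVT-AV1 enabled)
    "-preset", "4",        # roughly comparable to the test's "Enc Mode 4"
    "out.ivf",
]

start = time.perf_counter()
subprocess.run(cmd, check=True)
elapsed = time.perf_counter() - start

# The benchmark reports frames per second; dividing the clip's frame count by the
# elapsed wall time yields a comparable figure.
print(f"encode took {elapsed:.1f} s")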

SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p - Frames Per Second, More Is Better
  Azure HBv3: 12.275 (SE +/- 0.031, N = 3; Min: 12.22 / Avg: 12.28 / Max: 12.32)
  Azure HBv2: 9.512 (SE +/- 0.218, N = 15; Min: 7.69 / Avg: 9.51 / Max: 10.39)
  1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
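As a rough illustration of what the compression-speed figures mean, the snippet below times a single level-19 compression pass in Python using the third-party zstandard bindings and converts it to MB/s. It is a minimal sketch, not the Phoronix Test Suite harness; the input path assumes a local copy of the sample image, and the thread setting simply mirrors the multi-threaded nature of the test.

import time
import zstandard as zstd  # third-party python-zstandard bindings

SAMPLE = "FreeBSD-12.2-RELEASE-amd64-memstick.img"  # assumed local copy of the sample file

with open(SAMPLE, "rb") as f:
    data = f.read()

# Level 19 matches the "Compression Level: 19" result; a negative thread count asks
# zstandard to use all logical CPUs (verify against the installed version's docs).
cctx = zstd.ZstdCompressor(level=19, threads=-1)

start = time.perf_counter()
compressed = cctx.compress(data)
elapsed = time.perf_counter() - start

print(f"{len(data) / 1e6 / elapsed:.1f} MB/s, ratio {len(data) / len(compressed):.2f}")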

Zstd Compression 1.4.9 - Compression Level: 19 - Compression Speed - MB/s, More Is Better
  Azure HBv3: 78.2 (SE +/- 0.71, N = 15; Min: 72.9 / Avg: 78.15 / Max: 83.1)
  Azure HBv2: 69.5 (SE +/- 1.18, N = 15; Min: 61.4 / Avg: 69.51 / Max: 77.6)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP Streamcluster - Seconds, Fewer Is Better
  Azure HBv3: 7.379 (SE +/- 0.196, N = 15; Min: 6.73 / Avg: 7.38 / Max: 8.97)
  Azure HBv2: 12.912 (SE +/- 0.453, N = 15; Min: 10.48 / Avg: 12.91 / Max: 15.91)
  1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP CFD Solver - Seconds, Fewer Is Better
  Azure HBv3: 8.460 (SE +/- 0.801, N = 12; Min: 6.63 / Avg: 8.46 / Max: 15.47)
  Azure HBv2: 13.042 (SE +/- 0.748, N = 12; Min: 10.79 / Avg: 13.04 / Max: 19.76)
  1. (CXX) g++ options: -O2 -lOpenCL

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

CloverLeaf - Lagrangian-Eulerian Hydrodynamics - Seconds, Fewer Is Better
  Azure HBv3: 16.66 (SE +/- 0.83, N = 15; Min: 13.14 / Avg: 16.66 / Max: 24.43)
  Azure HBv2: 23.78 (SE +/- 0.45, N = 12; Min: 21.27 / Avg: 23.78 / Max: 26.48)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

miniFE

MiniFE is a finite element mini-application representative of unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

miniFE 2.2 - Problem Size: Small - CG Mflops, More Is Better
  Azure HBv3: 13785.30 (SE +/- 387.39, N = 15; Min: 11739.8 / Avg: 13785.25 / Max: 17284.4)
  Azure HBv2: 13165.09 (SE +/- 608.74, N = 12; Min: 8616.53 / Avg: 13165.09 / Max: 15973.2)
  1. (CXX) g++ options: -O3 -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

86 Results Shown

SVT-VP9
oneDNN
Mobile Neural Network
PlaidML:
  No - Inference - VGG16 - CPU
  No - Inference - VGG19 - CPU
oneDNN
Zstd Compression:
  19 - Decompression Speed
  8, Long Mode - Decompression Speed
  19, Long Mode - Decompression Speed
Botan:
  AES-256
  AES-256 - Decrypt
QuantLib
Zstd Compression
Rodinia
SVT-AV1
oneDNN
FinanceBench
oneDNN
FinanceBench
SVT-HEVC
Botan:
  Twofish
  Twofish - Decrypt
  Blowfish
  Blowfish - Decrypt
SVT-AV1
Zstd Compression
LULESH
Botan
TNN
Timed HMMer Search
Botan
oneDNN
GNU GMP GMPbench
Botan:
  CAST-256
  CAST-256 - Decrypt
Rodinia
Timed MAFFT Alignment
Xcompact3d Incompact3d
NAMD
Zstd Compression
Botan:
  ChaCha20Poly1305
  ChaCha20Poly1305 - Decrypt
Timed Linux Kernel Compilation
GROMACS
Timed Node.js Compilation
Timed LLVM Compilation
NAS Parallel Benchmarks
High Performance Conjugate Gradient
Pennant
Rodinia
Pennant
oneDNN:
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
Kripke
PlaidML
NCNN:
  CPU - resnet50
  CPU - alexnet
  CPU - vgg16
  CPU - efficientnet-b0
  CPU - shufflenet-v2
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
Mobile Neural Network:
  mobilenet-v1-1.0
  MobileNetV2_224
  resnet-v2-50
  SqueezeNetV1.0
TensorFlow Lite
oneDNN:
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  IP Shapes 3D - f32 - CPU
SVT-VP9:
  PSNR/SSIM Optimized - Bosphorus 1080p
  VMAF Optimized - Bosphorus 1080p
SVT-HEVC:
  10 - Bosphorus 1080p
  7 - Bosphorus 1080p
SVT-AV1
Zstd Compression
Rodinia:
  OpenMP Streamcluster
  OpenMP CFD Solver
CloverLeaf
miniFE