m7g.8xlarge

amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Neural Magic DeepSparse

Mlpack Benchmark

Mlpack Benchmark

Mlpack Benchmark

Mlpack Benchmark

oneDNN

oneDNN

oneDNN

oneDNN

oneDNN

oneDNN

OpenCV

OpenCV

OpenCV

OpenCV

OpenCV

OpenCV

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

OpenVINO

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

Whisper.cpp

Whisper.cpp

Whisper.cpp

Llama.cpp

oneDNN

OpenCV

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

ONNX Runtime

Phoronix Test Suite v10.8.5

m7g.8xlarge

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: Llama2 Chat 7b Quantized - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: Llama2 Chat 7b Quantized - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: Llama2 Chat 7b Quantized - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: Llama2 Chat 7b Quantized - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse