Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/).

Neural Magic DeepSparse 1.7

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.org metrics for this test profile configuration based on 72 public results since 15 March 2024 with the latest data as of 20 August 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component

Details

Percentile Rank

# Compatible Public Results

items/sec (Average)

Intel Core i9-14900K

Raptor Lake [24 Cores / 32 Threads]

97th

327 ^{+/- 1}

2 x INTEL XEON PLATINUM 8592

Emerald Rapids [128 Cores / 256 Threads]

95th