Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/).


Neural Magic DeepSparse 1.7

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream
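
In a synchronous single-stream run, requests are processed one at a time, so the reported items/sec figure is roughly the reciprocal of the average per-item latency. The Python sketch below shows that conversion. It assumes a batch size of 1 and back-to-back requests with no idle time, and the helper name mean_latency_ms is hypothetical (it is not part of DeepSparse).

```python
def mean_latency_ms(items_per_sec: float, batch_size: int = 1) -> float:
    """Approximate mean latency per batch, in milliseconds.

    Assumes a serial (single-stream) run with no idle time between
    requests; batch_size=1 is an assumption, not taken from the page.
    """
    if items_per_sec <= 0:
        raise ValueError("items_per_sec must be positive")
    return 1000.0 * batch_size / items_per_sec


# Example: a system averaging 327 items/sec
print(round(mean_latency_ms(327.0), 2))  # 3.06
```

Under these assumptions, 327 items/sec corresponds to about 3.06 ms per item.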

OpenBenchmarking.org metrics for this test profile configuration, based on 72 public results since 15 March 2024, with the latest data as of 20 August 2024.

Below is an overview of generalized performance for components with enough user-uploaded results to be statistically significant. Keep in mind that OS configurations can differ widely, particularly in the Linux/open-source space, so this overview is intended only as general guidance on performance expectations.

Component       Details                    Percentile Rank  # Compatible Public Results  items/sec (Average)
Raptor Lake     [24 Cores / 32 Threads]    97th             5                            327 +/- 1
Emerald Rapids  [128 Cores / 256 Threads]  95th             4                            305
Zen 4           [16 Cores / 32 Threads]    85th             5                            301 +/- 1
Raptor Lake     [8 Cores / 16 Threads]     79th             7                            261 +/- 2
Mid-Tier                                   75th                                          < 260
Zen 4           [64 Cores / 128 Threads]   70th             4                            252 +/- 1
Zen 4           [8 Cores / 16 Threads]     64th             3                            215 +/- 1
Zen 4           [64 Cores / 128 Threads]   58th             3                            202 +/- 1
Zen 3           [24 Cores / 48 Threads]    51st             4                            193 +/- 1
Median                                     50th                                          193
Zen 4           [192 Cores / 384 Threads]  49th             4                            192 +/- 1
Zen 2           [32 Cores / 64 Threads]    42nd             3                            170 +/- 1
Zen 4           [8 Cores / 16 Threads]     38th             4                            160 +/- 2
Cascade Lake    [18 Cores / 36 Threads]    31st             4                            142 +/- 2
Low-Tier                                   25th                                          < 133
Alder Lake      [14 Cores / 20 Threads]    20th             3                            127 +/- 5
Zen 2           [8 Cores / 16 Threads]     15th             4                            118
Meteor Lake     [16 Cores / 22 Threads]    7th              4                            98 +/- 3
Zen 2           [6 Cores / 12 Threads]     3rd              3                            86
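
The Mid-Tier and Low-Tier rows can be read as cutoffs for placing a new result in this distribution. The Python sketch below does that under one interpretation: "< 260" at the 75th percentile is taken as the upper bound of the mid tier, and "< 133" at the 25th percentile as the upper bound of the low tier. The function name classify_result and the label "top" are hypothetical and do not come from OpenBenchmarking.org.

```python
# Cutoffs taken from the Mid-Tier (75th percentile) and Low-Tier
# (25th percentile) rows of the overview, in items/sec.
MID_TIER_CUTOFF = 260.0
LOW_TIER_CUTOFF = 133.0


def classify_result(items_per_sec: float) -> str:
    """Place an items/sec average into a tier.

    Assumed interpretation: values below 133 are low tier, values
    below 260 are mid tier, and everything else is labeled "top"
    (a hypothetical name for results at or above the 75th percentile).
    """
    if items_per_sec < LOW_TIER_CUTOFF:
        return "low"
    if items_per_sec < MID_TIER_CUTOFF:
        return "mid"
    return "top"


# A few averages from the overview
for value in (327.0, 193.0, 127.0):
    print(value, classify_result(value))
```

With these cutoffs, 327 items/sec lands in the top tier, the median of 193 in the mid tier, and 127 in the low tier.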