Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/).

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark deepsparse.

Project Site

neuralmagic.com

Source Repository

github.com

Test Created

13 October 2022

Last Updated

15 March 2024

Test Maintainer

Michael Larabel

Test Type

System

Average Install Time

14 Minutes, 19 Seconds

Average Run Time

3 Minutes, 24 Seconds

Test Dependencies

Python

Accolades

40k+ Downloads

Supported Platforms

* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data updated weekly as of 17 January 2025.

Revision History

pts/deepsparse-1.7.0 [View Source] Fri, 15 Mar 2024 12:35:17 GMT
Update against DeepSparse 1.7 upstream, add Llama 2 chat test.

pts/deepsparse-1.6.0 [View Source] Mon, 11 Dec 2023 16:59:10 GMT
Update against deepsparse 1.6 upstream.

pts/deepsparse-1.5.2 [View Source] Wed, 26 Jul 2023 15:52:28 GMT
Update against 1.5.2 point release, add more models.

pts/deepsparse-1.5.0 [View Source] Wed, 07 Jun 2023 07:51:58 GMT
Update against Deepsparse 1.5 upstream.

pts/deepsparse-1.3.2 [View Source] Sun, 22 Jan 2023 19:05:03 GMT
Update against DeepSparse 1.3.2 upstream.

pts/deepsparse-1.0.1 [View Source] Thu, 13 Oct 2022 13:47:39 GMT
Initial commit of DeepSparse benchmark.

Suites Using This Test

Machine Learning

HPC - High Performance Computing

Performance Metrics

Analyze Test Configuration:

Neural Magic DeepSparse 1.7

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.org metrics for this test profile configuration based on 72 public results since 15 March 2024 with the latest data as of 20 August 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component

Percentile Rank

# Compatible Public Results

items/sec (Average)

Intel Core i9-14900K

97th

327 ^{+/- 1}

2 x INTEL XEON PLATINUM 8592

95th

305

AMD Ryzen 9 7950X 16-Core

85th

301 ^{+/- 1}

Intel Xeon E E-2488

79th

261 ^{+/- 2}

Mid-Tier

75th

< 260

AMD Ryzen Threadripper 7980X 64-Cores

70th

252 ^{+/- 1}

AMD Ryzen 7 7840HS

64th

215 ^{+/- 1}

AMD EPYC 8534P 64-Core

58th

202 ^{+/- 1}

AMD Ryzen Threadripper PRO 5965WX 24-Cores

51st

193 ^{+/- 1}

Median

50th

193

2 x AMD EPYC 9684X 96-Core

49th

192 ^{+/- 1}

AMD Ryzen Threadripper 3970X 32-Core

42nd

170 ^{+/- 1}

AMD Ryzen 7 7840U

38th

160 ^{+/- 2}

Intel Core i9-10980XE

31st

142 ^{+/- 2}

Low-Tier

25th

< 133

ARMv8 Neoverse-N1 128-Core

24th

132 ^{+/- 1}

Intel Core i7-1280P

20th

127 ^{+/- 5}

AMD Ryzen 7 3800XT 8-Core

15th

118

Intel Core Ultra 7 155H

7th

98 ^{+/- 3}

AMD Ryzen 5 5500U

3rd

Detailed Performance Overview

Based on OpenBenchmarking.org data, the selected test / test configuration (Neural Magic DeepSparse 1.7 - Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream) has an average run-time of 3 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture

Kernel Identifier

Verified On

Intel / AMD x86 64-bit

x86_64

(Many Processors)

ARMv8 64-bit

aarch64

ARMv8 Neoverse-N1 128-Core, ARMv8 Neoverse-V1