2024 year

AMD Ryzen Threadripper PRO 5965WX 24-Cores testing with an ASUS Pro WS WRX80E-SAGE SE WIFI (1201 BIOS) and ASUS NVIDIA NV106 2GB on Ubuntu 23.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2402040-NE-2024YEAR116&grr.

LeelaChessZero

Backend: BLAS

LeelaChessZero

Backend: Eigen

Quicksilver

Input: CTS2

Llama.cpp

Model: llama-2-70b-chat.Q5_0.gguf

TensorFlow

Device: CPU - Batch Size: 16 - Model: VGG-16

Quicksilver

Input: CORAL2 P2

CacheBench

Test: Read

CacheBench

Test: Read / Modify / Write

CacheBench

Test: Write

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

Llamafile

Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU

Speedb

Test: Sequential Fill

rav1e

Speed: 1

rav1e

Speed: 10

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

Llamafile

Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

PyTorch

Device: CPU - Batch Size: 16 - Model: ResNet-50

Speedb

Test: Random Fill Sync

Speedb

Test: Random Fill

Speedb

Test: Update Random

Speedb

Test: Read While Writing

Speedb

Test: Read Random Write Random

Speedb

Test: Random Read

rav1e

Speed: 5

Quicksilver

Input: CORAL2 P1

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

TensorFlow

Device: CPU - Batch Size: 1 - Model: VGG-16

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Llama.cpp

Model: llama-2-13b.Q4_0.gguf

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

rav1e

Speed: 6

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

TensorFlow

Device: CPU - Batch Size: 16 - Model: GoogLeNet

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression

Compression Level: 9 - Compression Speed

Llama.cpp

Model: llama-2-7b.Q4_0.gguf

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression

Compression Level: 1 - Decompression Speed

LZ4 Compression

Compression Level: 1 - Compression Speed

TensorFlow

Device: CPU - Batch Size: 16 - Model: AlexNet

TensorFlow

Device: CPU - Batch Size: 1 - Model: AlexNet

Y-Cruncher

Pi Digits To Calculate: 1B

Llamafile

Test: llava-v1.5-7b-q4 - Acceleration: CPU

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

TensorFlow

Device: CPU - Batch Size: 1 - Model: GoogLeNet

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 1080p

Y-Cruncher

Pi Digits To Calculate: 500M

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 1080p

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 1080p

Phoronix Test Suite v10.8.5