12700k HPC+OpenCL AVX512 performance profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2112125-TJ-12700KHPC62&grs&sro.

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

CP2K Molecular Dynamics

Input: Fayalite-FIST

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 1000

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

LeelaChessZero

Backend: BLAS

Pennant

Test: leblancbig

Darktable

Test: Boat - Acceleration: OpenCL

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 1000

RNNoise

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

Pennant

Test: sedovbig

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

HPL Linpack

Intel MPI Benchmarks

Test: IMB-MPI1 PingPong

ACES DGEMM

Sustained Floating-Point Rate

cl-mem

Benchmark: Write

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

ASKAP

Test: tConvolve OpenMP - Gridding

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW

Build: Stock - Size: 2D FFT Size 4096

RELION

Test: Basic - Device: CPU

cl-mem

Benchmark: Copy

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenFOAM

Input: Motorbike 30M

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

NAMD

ATPase Simulation - 327,506 Atoms

Intel MPI Benchmarks

Test: IMB-P2P PingPong

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

FFTW

Build: Stock - Size: 1D FFT Size 4096

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

Numpy Benchmark

QMCPACK

Input: simple-H2O

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

cl-mem

Benchmark: Read

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

Himeno Benchmark

Poisson Pressure Solver

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

Timed MrBayes Analysis

Primate Phylogeny Analysis

ASKAP

Test: Hogbom Clean OpenMP

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

ASKAP

Test: tConvolve MPI - Degridding

R Benchmark

FFTW

Build: Stock - Size: 2D FFT Size 32

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

Algebraic Multi-Grid Benchmark

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

Darktable

Test: Server Room - Acceleration: OpenCL

ASKAP

Test: tConvolve OpenMP - Degridding

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

TensorFlow Lite

Model: SqueezeNet

OpenFOAM

Input: Motorbike 60M

GNU Octave Benchmark

TensorFlow Lite

Model: Mobilenet Float

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

ASKAP

Test: tConvolve MT - Gridding

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

Darktable

Test: Masskrug - Acceleration: OpenCL

ArrayFire

Test: BLAS CPU

DeepSpeech

Acceleration: CPU

Parboil

Test: OpenMP Stencil

Parboil

Test: OpenMP CUTCP

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

TensorFlow Lite

Model: Inception ResNet V2

Timed HMMer Search

Pfam Database Search

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

TensorFlow Lite

Model: Inception V4

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU