12700k HPC+OpenCL AVX512 performance profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2112125-TJ-12700KHPC62&gru&sor.

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

Intel MPI Benchmarks

Test: IMB-MPI1 PingPong

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

Intel MPI Benchmarks

Test: IMB-P2P PingPong

miniFE

Problem Size: Small

Algebraic Multi-Grid Benchmark

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

cl-mem

Benchmark: Copy

cl-mem

Benchmark: Read

cl-mem

Benchmark: Write

ACES DGEMM

Sustained Floating-Point Rate

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

HPL Linpack

ArrayFire

Test: BLAS CPU

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

ASKAP

Test: Hogbom Clean OpenMP

FFTW

Build: Stock - Size: 1D FFT Size 32

FFTW

Build: Stock - Size: 2D FFT Size 32

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

Himeno Benchmark

Poisson Pressure Solver

ASKAP

Test: tConvolve MT - Gridding

ASKAP

Test: tConvolve MT - Degridding

ASKAP

Test: tConvolve OpenMP - Gridding

ASKAP

Test: tConvolve OpenMP - Degridding

ASKAP

Test: tConvolve MPI - Degridding

ASKAP

Test: tConvolve MPI - Gridding

LeelaChessZero

Backend: BLAS

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

Numpy Benchmark

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

LULESH

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

NAMD

ATPase Simulation - 327,506 Atoms

Pennant

Test: sedovbig

Pennant

Test: leblancbig

TensorFlow Lite

Model: SqueezeNet

TensorFlow Lite

Model: Inception V4

TensorFlow Lite

Model: NASNet Mobile

TensorFlow Lite

Model: Mobilenet Float

TensorFlow Lite

Model: Mobilenet Quant

TensorFlow Lite

Model: Inception ResNet V2

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 1000

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 1000

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU