Granite Rapids MRDIMM vs. DDR5 Benchmarks

Benchmarks for a future article. 2 x Intel Xeon 6980P testing with a Intel AvenueCity v0.01 (BHSDCRB1.IPC.0035.D44.2408292336 BIOS) and ASPEED on Ubuntu 24.04 via the Phoronix Test Suite.

24 x DDR5-6400

Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel AvenueCity v0.01 (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07 + 2 x 3201GB KIOXIA KCMYXVUG3T20, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2

OS: Ubuntu 24.04, Kernel: 6.8.0-22-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0
Java Notes: OpenJDK Runtime Environment (build 21.0.3-ea+7-Ubuntu-1build1)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

24 x MRDIMM 88800

Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel AvenueCity v0.01 (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 2 x 1920GB KIOXIA KCD8XPUG1T92 + 960GB SAMSUNG MZ1L2960HCJR-00A07, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2

OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0
Java Notes: OpenJDK Runtime Environment (build 21.0.3-ea+7-Ubuntu-1build1)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

PostgreSQL

Result

System Power Consumption

Result

System Power Consumption

Apache Cassandra

Result

System Power Consumption

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Stream

This is a benchmark of Stream, the popular system memory (RAM) benchmark. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Tinymembench

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

High Performance Conjugate Gradient

Result

System Power Consumption

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

PostgreSQL

Result

System Power Consumption

Result

System Power Consumption

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Result

System Power Consumption

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

Result

System Power Consumption

Result

System Power Consumption

Result

System Power Consumption

Result

System Power Consumption

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

OpenFOAM

Result

System Power Consumption

NAS Parallel Benchmarks

Result

System Power Consumption

Result

System Power Consumption

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Result

System Power Consumption

NAS Parallel Benchmarks

Result

System Power Consumption

Result

System Power Consumption

Xcompact3d Incompact3d

Result

System Power Consumption

NAS Parallel Benchmarks

Result

System Power Consumption

Result

System Power Consumption

libxsmm

Result

System Power Consumption

System Power Consumption Monitor

QuantLib

Size: S

24 x MRDIMM 88800: The test run did not produce a result. E: sh: 1: exec: ./quantlib: not found

Stream

This is a benchmark of Stream, the popular system memory (RAM) benchmark. Learn more via the OpenBenchmarking.org test page.

52 Results Shown

GROMACS
PostgreSQL:
100 - 1000 - Read Only - Average Latency
100 - 1000 - Read Only
100 - 800 - Read Only - Average Latency
100 - 800 - Read Only
Apache Cassandra
libxsmm
High Performance Conjugate Gradient
Stream
Timed LLVM Compilation
Tinymembench:
Standard Memset
Standard Memcpy
High Performance Conjugate Gradient
Java JMH
LULESH
Timed Node.js Compilation
PostgreSQL:
100 - 800 - Read Write - Average Latency
100 - 800 - Read Write
100 - 1000 - Read Write - Average Latency
100 - 1000 - Read Write
Timed Linux Kernel Compilation:
allmodconfig
defconfig
OpenRadioss
Pennant
MBW
Xcompact3d Incompact3d
MBW
OpenFOAM:
drivaerFastback, Medium Mesh Size - Execution Time
drivaerFastback, Medium Mesh Size - Mesh Time
Timed LLVM Compilation
NAS Parallel Benchmarks
SPECFEM3D:
Water-layered Halfspace
Layered Halfspace
Homogeneous Halfspace
Tomographic Model
Mount St. Helens
Algebraic Multi-Grid Benchmark
OpenFOAM:
drivaerFastback, Small Mesh Size - Execution Time
drivaerFastback, Small Mesh Size - Mesh Time
NAS Parallel Benchmarks:
SP.B
BT.C
Pennant
NAS Parallel Benchmarks:
IS.D
LU.C
Xcompact3d Incompact3d
NAS Parallel Benchmarks:
CG.C
MG.C
libxsmm
System Power Consumption Monitor
Stream:
Triad
Add
Scale

24 x DDR5-6400

OS: Ubuntu 24.04, Kernel: 6.8.0-22-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

Testing initiated at 3 October 2024 10:26 by user phoronix.

24 x MRDIMM 88800

Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel AvenueCity v0.01 (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 2 x 1920GB KIOXIA KCD8XPUG1T92 + 960GB SAMSUNG MZ1L2960HCJR-00A07, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2

OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0
Java Notes: OpenJDK Runtime Environment (build 21.0.3-ea+7-Ubuntu-1build1)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 30 September 2024 14:18 by user phoronix.

Granite Rapids MRDIMM vs. DDR5 Benchmarks

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

24 x DDR5-6400

24 x MRDIMM 88800

GROMACS

PostgreSQL

Apache Cassandra

libxsmm

High Performance Conjugate Gradient

Stream

Timed LLVM Compilation

Tinymembench

High Performance Conjugate Gradient

Java JMH

LULESH

Timed Node.js Compilation

PostgreSQL

Timed Linux Kernel Compilation

OpenRadioss

Pennant

MBW

Xcompact3d Incompact3d

MBW

OpenFOAM

Timed LLVM Compilation

NAS Parallel Benchmarks

SPECFEM3D

Algebraic Multi-Grid Benchmark

OpenFOAM

NAS Parallel Benchmarks

Pennant

NAS Parallel Benchmarks

Xcompact3d Incompact3d

NAS Parallel Benchmarks

libxsmm

System Power Consumption Monitor

QuantLib

Stream

52 Results Shown

24 x DDR5-6400

24 x MRDIMM 88800