Ryzen 9 9950X Memory Performance

AMD Ryzen 9 9950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS) and AMD Radeon RX 7900 GRE 16GB on Ubuntu 24.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2408144-NE-RYZEN999520&grr.

Configuration: 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G

Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G
Disk: 2000GB Corsair MP700 PRO
Graphics: AMD Radeon RX 7900 GRE 16GB
Audio: AMD Navi 31 HDMI/DP
Monitor: DELL U2723QE
Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04
Kernel: 6.10.0-phx (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a
- Python 3.12.3
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Test | 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G
llamafile: Meta-Llama-3-8B-Instruct.F16 - CPU | 4.68
openfoam: drivaerFastback, Medium Mesh Size - Execution Time | 2096.5664
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time | 181.56905
openradioss: Chrysler Neon 1M | 705.55
build-linux-kernel: allmodconfig | 590.495
blender: Barbershop - CPU-Only | 448.96
lczero: Eigen | 220
lczero: BLAS | 226
build-nodejs: Time To Compile | 345.246
xnnpack: QU8MobileNetV3Small | 756
xnnpack: QU8MobileNetV3Large | 1118
xnnpack: QU8MobileNetV2 | 759
xnnpack: FP16MobileNetV3Small | 770
xnnpack: FP16MobileNetV3Large | 1291
xnnpack: FP16MobileNetV2 | 985
xnnpack: FP32MobileNetV3Small | 812
xnnpack: FP32MobileNetV3Large | 1516
xnnpack: FP32MobileNetV2 | 1264
build-llvm: Ninja | 301.629
stockfish: Chess Benchmark | 51377778
openradioss: INIVOL and Fluid Structure Interaction Drop Container | 247.06
brl-cad: VGR Performance Metric | 498966
build-gem5: Time To Compile | 221.139
java-jmh: Throughput | 91554717298.426
hpcg: 104 104 104 - 60 | 8.93560
openfoam: motorBike - Execution Time | 58.5695
openfoam: motorBike - Mesh Time | 84.9949
tensorflow: CPU - 64 - ResNet-50 | 52.27
openradioss: Bird Strike on Windshield | 128.76
npb: SP.C | 14528.26
gromacs: water_GMX50_bare | 1.927
openradioss: Bumper Beam | 76.90
build2: Time To Compile | 76.003
specfem3d: Layered Halfspace | 73.138797084
specfem3d: Water-layered Halfspace | 72.197260512
minibude: OpenMP - BM2 (Billion Interactions/s) | 77.385
minibude: OpenMP - BM2 (GFInst/s) | 1934.613
memcached: 1:100 | 7507615.73
memcached: 1:10 | 5915791.34
namd: STMV with 1,066,628 Atoms | 0.98084
incompact3d: input.i3d 193 Cells Per Direction | 64.1130218
luxcorerender: Orange Juice - CPU | 8.64
llama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.gguf | 8.36
blender: Junkshop - CPU-Only | 62.09
luxcorerender: LuxCore Benchmark - CPU | 5.15
luxcorerender: Danish Mood - CPU | 4.62
luxcorerender: DLSC - CPU | 5.40
openfoam: drivaerFastback, Small Mesh Size - Execution Time | 157.14898
openfoam: drivaerFastback, Small Mesh Size - Mesh Time | 22.579849
gromacs: MPI CPU - water_GMX50_bare | 3.141
openradioss: Rubber O-Ring Seal Installation | 53.16
npb: BT.C | 55287.37
build-linux-kernel: defconfig | 47.755
openradioss: Cell Phone Drop Test | 44.90
blender: BMW27 - CPU-Only | 46.38
mbw: Memory Copy, Fixed Block Size - 8192 MiB | 19467.394
quicksilver: CORAL2 P1 | 25676667
mbw: Memory Copy - 8192 MiB | 22247.784
npb: EP.D | 3823.80
npb: LU.C | 59920.96
pytorch: CPU - 256 - ResNet-50 | 57.54
specfem3d: Homogeneous Halfspace | 30.677918457
namd: ATPase with 327,506 Atoms | 3.41007
stress-ng: Memory Copying | 10752.81
ramspeed: Add - Integer | 61079.93
ramspeed: Copy - Integer | 65797.05
ramspeed: Triad - Integer | 62428.70
npb: IS.D | 1505.47
ramspeed: Average - Integer | 65503.93
ramspeed: Scale - Integer | 69141.08
specfem3d: Mount St. Helens | 25.690986644
specfem3d: Tomographic Model | 23.359092920
mbw: Memory Copy, Fixed Block Size - 4096 MiB | 19534.159
compress-7zip: Decompression Rating | 158133
compress-7zip: Compression Rating | 195585
etcpak: Multi-Threaded - ETC2 | 741.784
embree: Pathtracer ISPC - Crown | 36.5532
mbw: Memory Copy - 4096 MiB | 22774.947
embree: Pathtracer ISPC - Asian Dragon | 41.4914
y-cruncher: 1B | 16.452
povray: Trace Time | 16.101
npb: SP.B | 21328.28
x265: Bosphorus 4K | 38.27
npb: FT.C | 27106.18
pytorch: CPU - 1 - ResNet-50 | 84.89
incompact3d: input.i3d 129 Cells Per Direction | 14.0905552
npb: CG.C | 11144.88
minibude: OpenMP - BM1 (Billion Interactions/s) | 76.885
minibude: OpenMP - BM1 (GFInst/s) | 1922.125
luxcorerender: Rainbow Colors and Prism - CPU | 19.99
tensorflow: CPU - 1 - ResNet-50 | 16.92
y-cruncher: 500M | 7.587
npb: MG.C | 24236.21
lulesh: | 9945.5309
x265: Bosphorus 1080p | 134.52
npb: EP.C | 3678.46
llamafile: llava-v1.6-mistral-7b.Q8_0 - CPU | (no value listed)

Llamafile

Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU

Llamafile 0.8.6 (Tokens Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 4.68

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 2096.57
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 181.57
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 705.55 (SE +/- 0.28, N = 3)

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 590.50 (SE +/- 0.15, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 448.96 (SE +/- 0.20, N = 3)

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.31.1 (Nodes Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 220 (SE +/- 1.45, N = 3)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

LeelaChessZero 0.31.1 (Nodes Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 226 (SE +/- 2.65, N = 3)
1. (CXX) g++ options: -flto -pthread

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 345.25 (SE +/- 0.34, N = 3)

XNNPACK

Model: QU8MobileNetV3Small

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 756
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Large

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1118 (SE +/- 1.20, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV2

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 759 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 770 (SE +/- 1.76, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1291 (SE +/- 2.89, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 985 (SE +/- 2.31, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 812 (SE +/- 2.31, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1516 (SE +/- 19.19, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1264 (SE +/- 5.21, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 301.63 (SE +/- 0.12, N = 3)

Stockfish

Chess Benchmark

Stockfish (Nodes Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 51377778 (SE +/- 491754.88, N = 15)
1. Stockfish 16 by the Stockfish developers (see AUTHORS file)

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 247.06 (SE +/- 0.74, N = 3)

BRL-CAD

VGR Performance Metric

BRL-CAD 7.38.2 (VGR Performance Metric, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 498966
1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 221.14 (SE +/- 0.11, N = 3)

Java JMH

Throughput

Java JMH (Ops/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 91554717298.43

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.93560 (SE +/- 0.00142, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenFOAM

Input: motorBike - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 58.57
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: motorBike - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 84.99
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.16.1 (images/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 52.27 (SE +/- 0.02, N = 3)

OpenRadioss

Model: Bird Strike on Windshield

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 128.76 (SE +/- 0.28, N = 3)

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 14528.26 (SE +/- 18.27, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

GROMACS

Input: water_GMX50_bare

GROMACS (Ns Per Day, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1.927 (SE +/- 0.002, N = 3)
1. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

OpenRadioss

Model: Bumper Beam

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.90 (SE +/- 0.07, N = 3)

Build2

Time To Compile

Build2 0.17 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.00 (SE +/- 0.16, N = 3)

SPECFEM3D

Model: Layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 73.14 (SE +/- 0.31, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Water-layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 72.20 (SE +/- 0.34, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (Billion Interactions/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 77.39 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (GFInst/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1934.61 (SE +/- 0.64, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 7507615.73 (SE +/- 1150.91, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.19 (Ops/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 5915791.34 (SE +/- 15831.56, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

NAMD

Input: STMV with 1,066,628 Atoms

NAMD 3.0b6 (ns/day, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 0.98084 (SE +/- 0.00027, N = 3)

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 64.11 (SE +/- 0.09, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.64 (SE +/- 0.01, N = 3; MIN: 7.64 / MAX: 9.25)

Llama.cpp

Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf

Llama.cpp b3067 (Tokens Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.36 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

Blender

Blend File: Junkshop - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 62.09 (SE +/- 0.12, N = 3)

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 5.15 (SE +/- 0.01, N = 3; MIN: 2.38 / MAX: 5.77)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 4.62 (SE +/- 0.01, N = 3; MIN: 2.07 / MAX: 5.22)

LuxCoreRender

Scene: DLSC - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 5.40 (SE +/- 0.01, N = 3; MIN: 5.27 / MAX: 5.72)

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 157.15
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22.58
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 (Ns Per Day, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3.141 (SE +/- 0.001, N = 3)
1. (CXX) g++ options: -O3 -lm

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 53.16 (SE +/- 0.10, N = 3)

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 55287.37 (SE +/- 54.79, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 47.76 (SE +/- 0.30, N = 3)

OpenRadioss

Model: Cell Phone Drop Test

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 44.90 (SE +/- 0.18, N = 3)

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 46.38 (SE +/- 0.01, N = 3)

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19467.39 (SE +/- 10.56, N = 3)
1. (CC) gcc options: -O3 -march=native

Quicksilver

Input: CORAL2 P1

Quicksilver 20230818 (Figure Of Merit, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 25676667 (SE +/- 21858.13, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22247.78 (SE +/- 49.10, N = 3)
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3823.80 (SE +/- 2.85, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 59920.96 (SE +/- 150.93, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

PyTorch 2.2.1 (batches/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 57.54 (SE +/- 0.34, N = 3; MIN: 53.14 / MAX: 58.34)

SPECFEM3D

Model: Homogeneous Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 30.68 (SE +/- 0.28, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAMD

Input: ATPase with 327,506 Atoms

NAMD 3.0b6 (ns/day, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3.41007 (SE +/- 0.02586, N = 3)

Stress-NG

Test: Memory Copying

Stress-NG 0.17.08 (Bogo Ops/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 10752.81 (SE +/- 44.94, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

RAMspeed SMP

Type: Add - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 61079.93 (SE +/- 426.98, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 65797.05 (SE +/- 820.93, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 62428.70 (SE +/- 156.92, N = 3)
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1505.47 (SE +/- 5.33, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

RAMspeed SMP

Type: Average - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 65503.93 (SE +/- 119.74, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 69141.08 (SE +/- 220.54, N = 3)
1. (CC) gcc options: -O3 -march=native
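
For context on the RAMspeed figures above, the theoretical peak of this memory configuration can be estimated with simple arithmetic. The sketch below is only a rough estimate: it assumes both DIMMs run in dual-channel mode at the listed 6000 MT/s with a 64-bit (8-byte) data path per channel, and that RAMspeed's MB/s means 10^6 bytes per second. Neither assumption is stated in the result file, and the function names are illustrative.

```python
# Rough estimate only: channel count, bus width and the MB definition are assumptions,
# not values taken from the result file.

def peak_bandwidth_mb_s(transfers_mt_s: float, channels: int = 2, bus_bytes: int = 8) -> float:
    """Theoretical peak in MB/s: mega-transfers per second x bytes per transfer x channels."""
    return transfers_mt_s * bus_bytes * channels


def fraction_of_peak(measured_mb_s: float, peak_mb_s: float) -> float:
    """Share of the theoretical peak that a measured result reaches."""
    return measured_mb_s / peak_mb_s


peak = peak_bandwidth_mb_s(6000)      # DDR5-6000, assumed dual channel
ramspeed_average = 65503.93           # RAMspeed SMP, Average - Integer, from this result file
share = fraction_of_peak(ramspeed_average, peak)

print(f"theoretical peak: {peak:.0f} MB/s")
print(f"RAMspeed average: {share:.1%} of peak")
```

The same helper can be applied to the Add, Copy, Triad and Scale results; the MBW numbers are reported in MiB/s and would need converting before a like-for-like comparison.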

SPECFEM3D

Model: Mount St. Helens

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 25.69 (SE +/- 0.17, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 23.36 (SE +/- 0.16, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19534.16 (SE +/- 24.77, N = 3)
1. (CC) gcc options: -O3 -march=native

7-Zip Compression

Test: Decompression Rating

7-Zip Compression (MIPS, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 158133 (SE +/- 14.15, N = 3)
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

7-Zip Compression

Test: Compression Rating

7-Zip Compression (MIPS, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 195585 (SE +/- 322.74, N = 3)
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

Etcpak 2.0 (Mpx/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 741.78 (SE +/- 1.25, N = 3)
1. (CXX) g++ options: -flto -pthread

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 4.3 (Frames Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 36.55 (SE +/- 0.04, N = 3; MIN: 36.13 / MAX: 37.34)

MBW

Test: Memory Copy - Array Size: 4096 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22774.95 (SE +/- 72.38, N = 3)
1. (CC) gcc options: -O3 -march=native

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 4.3 (Frames Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 41.49 (SE +/- 0.03, N = 3; MIN: 41.24 / MAX: 42.02)

Y-Cruncher

Pi Digits To Calculate: 1B

Y-Cruncher 0.8.5 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.45 (SE +/- 0.04, N = 3)

POV-Ray

Trace Time

POV-Ray (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.10 (SE +/- 0.03, N = 3)
1. POV-Ray 3.7.0.10.unofficial

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 21328.28 (SE +/- 21.50, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

x265

Video Input: Bosphorus 4K

x265 (Frames Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 38.27 (SE +/- 0.04, N = 3)
1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 27106.18 (SE +/- 141.73, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

PyTorch 2.2.1 (batches/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 84.89 (SE +/- 0.27, N = 3; MIN: 77.17 / MAX: 85.87)

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 14.09 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 11144.88 (SE +/- 21.24, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901 (Billion Interactions/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.89 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901 (GFInst/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1922.13 (SE +/- 6.84, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19.99 (SE +/- 0.07, N = 3; MIN: 18.05 / MAX: 20.42)

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

TensorFlow 2.16.1 (images/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.92 (SE +/- 0.03, N = 3)

Y-Cruncher

Pi Digits To Calculate: 500M

Y-Cruncher 0.8.5 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 7.587 (SE +/- 0.006, N = 3)

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 24236.21 (SE +/- 6.88, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

LULESH

LULESH 2.0.3 (z/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 9945.53 (SE +/- 19.00, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

x265

Video Input: Bosphorus 1080p

x265 (Frames Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 134.52 (SE +/- 0.27, N = 3)
1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3678.46 (SE +/- 25.12, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6


Phoronix Test Suite v10.8.5