AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2408225-NE-RYZEN999510
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C++ Boost Tests 2 Tests
Timed Code Compilation 5 Tests
C/C++ Compiler Tests 2 Tests
CPU Massive 13 Tests
Creator Workloads 5 Tests
Fortran Tests 4 Tests
Game Development 2 Tests
HPC - High Performance Computing 16 Tests
Common Kernel Benchmarks 2 Tests
Large Language Models 2 Tests
Machine Learning 7 Tests
Memory Test Suite 2 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 4 Tests
Multi-Core 12 Tests
NVIDIA GPU Compute 4 Tests
OpenMPI Tests 9 Tests
Programmer / Developer System Benchmarks 6 Tests
Python Tests 6 Tests
Renderers 2 Tests
Scientific Computing 5 Tests
Server 2 Tests
Server CPU Tests 8 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G
August 14
  13 Hours, 26 Minutes
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32
August 22
  10 Hours, 21 Minutes
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36
August 20
  10 Hours, 59 Minutes
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38
August 21
  7 Hours, 49 Minutes
Invert Hiding All Results Option
  10 Hours, 39 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD Ryzen 9 9950X DDR5 Memory PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)AMD Device 14d82 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C322 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C362 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C382000GB Corsair MP700 PROAMD Radeon RX 7900 GRE 16GBAMD Navi 31 HDMI/DPDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.10.0-phx (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)GCC 13.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionAMD Ryzen 9 9950X DDR5 Memory Performance BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a- Python 3.12.3- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32, 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36, 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)

2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38Result OverviewPhoronix Test Suite100%123%145%168%191%OpenFOAMOpenRadiossLeelaChessZeroXcompact3d Incompact3dlibxsmmHigh Performance Conjugate GradientLlama.cppLULESHNAS Parallel BenchmarksRAMspeed SMPBuild2PyTorchNAMDStockfishSPECFEM3DTensorFlowMBWMemcachedEmbreeJava JMHXNNPACKsimdjsonx265BlenderBRL-CADXmrigNumpy BenchmarkLuxCoreRenderQuicksilverPOV-RayminiBUDEEtcpakStress-NG

AMD Ryzen 9 9950X DDR5 Memory Performancellamafile: Meta-Llama-3-8B-Instruct.F16 - CPUopenradioss: Chrysler Neon 1Mopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timexmrig: GhostRider - 1Mblender: Barbershop - CPU-Onlybuild-linux-kernel: allmodconfiglczero: Eigenopenradioss: Bird Strike on Windshieldlczero: BLASxnnpack: QU8MobileNetV3Smallxnnpack: QU8MobileNetV3Largexnnpack: QU8MobileNetV2xnnpack: FP16MobileNetV3Smallxnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV2xnnpack: FP32MobileNetV3Smallxnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV2build-llvm: Ninjabuild-nodejs: Time To Compileopenradioss: INIVOL and Fluid Structure Interaction Drop Containerbrl-cad: VGR Performance Metricstockfish: Chess Benchmarkjava-jmh: Throughputhpcg: 104 104 104 - 60build-gem5: Time To Compileopenfoam: motorBike - Execution Timeopenfoam: motorBike - Mesh Timetensorflow: CPU - 64 - ResNet-50npb: SP.Csimdjson: Kostyambw: Memory Copy, Fixed Block Size - 8192 MiBbuild2: Time To Compilegromacs: water_GMX50_barenumpy: openradioss: Bumper Beamspecfem3d: Layered Halfspacesimdjson: LargeRandspecfem3d: Water-layered Halfspacesimdjson: TopTweetmbw: Memory Copy - 8192 MiBpytorch: CPU - 256 - ResNet-50libxsmm: 128minibude: OpenMP - BM2minibude: OpenMP - BM2memcached: 1:10memcached: 1:100blender: Junkshop - CPU-Onlyluxcorerender: Orange Juice - CPUluxcorerender: Danish Mood - CPUluxcorerender: LuxCore Benchmark - CPUllama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.ggufluxcorerender: DLSC - CPUincompact3d: input.i3d 193 Cells Per Directionopenradioss: Rubber O-Ring Seal Installationgromacs: MPI CPU - water_GMX50_bareopenfoam: drivaerFastback, Small Mesh Size - Mesh Timenpb: BT.Cnamd: STMV with 1,066,628 Atomsspecfem3d: Tomographic Modelopenfoam: drivaerFastback, Small Mesh Size - Execution Timeblender: BMW27 - CPU-Onlyopenradioss: Cell Phone Drop Testlibxsmm: 64libxsmm: 32build-linux-kernel: defconfigquicksilver: CORAL2 P1mbw: Memory Copy - 4096 MiBnamd: ATPase with 327,506 Atomsnpb: EP.Dnpb: LU.Cramspeed: Copy - Integerspecfem3d: Homogeneous Halfspacestress-ng: Memory Copyingramspeed: Average - Integerspecfem3d: Mount St. Helensramspeed: Add - Integernpb: IS.Dramspeed: Triad - Integerramspeed: Scale - Integermbw: Memory Copy, Fixed Block Size - 4096 MiBetcpak: Multi-Threaded - ETC2embree: Pathtracer ISPC - Crownpytorch: CPU - 1 - ResNet-50embree: Pathtracer ISPC - Asian Dragonpovray: Trace Timecompress-7zip: Compression Ratingy-cruncher: 1Bcompress-7zip: Decompression Ratingnpb: SP.Bx265: Bosphorus 4Knpb: FT.Cincompact3d: input.i3d 129 Cells Per Directionnpb: CG.Cminibude: OpenMP - BM1minibude: OpenMP - BM1luxcorerender: Rainbow Colors and Prism - CPUtensorflow: CPU - 1 - ResNet-50y-cruncher: 500Mnpb: MG.Clulesh: x265: Bosphorus 1080pnpb: EP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C384.68705.55181.569052096.56643616.6448.96590.495220128.762267561118759770129198581215161264301.629345.246247.064989665137777891554717298.4268.93560221.13958.569584.994952.2714528.267.7119467.39476.0031.9271068.3676.9073.1387970842.1272.19726051213.5922247.78457.54472.077.3851934.6135915791.347507615.7362.098.644.625.158.365.4064.113021853.163.14122.57984955287.370.9808423.359092920157.1489846.3844.90237.1121.047.7552567666722774.9473.410073823.8059920.9665797.0530.67791845710752.8165503.9325.69098664461079.931505.4762428.7069141.0819534.159741.78436.553284.8941.491416.10119558516.45215813321328.2838.2727106.1814.090555211144.8876.8851922.12519.9916.927.58724236.219945.5309134.523678.46668.16177.78941994.7473638.6450.71589.331236128.012447611115765776130399681515271273298.049341.679239.224944374919415592045826799.548.79737219.26857.245284.980254.3015610.307.6320850.88377.0061.9961070.6177.1869.4393872992.1169.56164246713.3822294.06258.89505.977.2461931.1415898565.347647187.4862.218.644.695.148.325.4260.799243953.233.22822.82138958570.771.0160223.000501341148.7584546.5643.42253.3128.047.7832566000022510.4643.604643714.6662084.7270747.0029.81843126810720.3168588.8424.95654992863967.701584.9365845.2171754.5820817.569739.73636.109985.5240.944716.13320306515.42615751222908.5838.3028891.8613.402805311374.5177.0761926.89120.0817.347.24725622.9310235.722134.483731.97573.33174.125721909.66553699.6459.12247128.322647641124765773130299781815531272304.621242.004869544997421089012581592.6539.5160155.573888.080755.1916097.007.5720349.41477.9232.0091046.9177.4569.8174223112.1068.92093565113.2422817.90058.49508.976.1981904.9535874747.737498825.1963.418.554.655.049.005.3658.198459653.663.29823.14363160124.081.0242623.287517388145.8832347.7243.25258.6132.248.2582566666722539.7203.577553739.1664031.3170345.5330.54298448710608.9669491.4225.19465363065113.521635.3466963.0373724.8820923.989734.53336.254686.7941.723816.39520081015.60615700523254.4038.0729713.5512.974287011908.4776.1221903.04019.8417.337.23626596.4610578.516134.193741.863676.5459.19227124.37786115478379713341018836159012854897845010139690340365706.7929.1659085.474553.5415531.767.5020124.94380.1041064.1777.8172.43834892.1172.07947665812.8821444.00256.19491.175.9301898.2585694123.527350932.8864.088.464.605.058.665.3460.480583254.2758570.030.9985223.86945318147.51249.7127.02513666722093.9853.509583712.4562375.4366545.2431.98171567710608.8266822.4925.45331906062771.001586.5964832.8670445.7419747.584731.22935.182881.8940.069616.31922436.8137.0928743.4913.259573311662.4475.9921899.80419.3316.8125715.349832.6385131.283722.56OpenBenchmarking.org

Llamafile

OpenBenchmarking.orgTokens Per Second, More Is BetterLlamafile 0.8.6Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.0532.1063.1594.2125.2654.68

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G150300450600750SE +/- 49.36, N = 9SE +/- 1.02, N = 3SE +/- 0.28, N = 3573.33668.16705.55

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G4080120160200174.13177.79181.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Medium Mesh Size - Mesh Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50010001500200025001909.671994.752096.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Medium Mesh Size - Execution Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] --> FOAM FATAL ERROR:

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1M2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 30.71, N = 9SE +/- 36.12, N = 6SE +/- 37.04, N = 3SE +/- 6.72, N = 33699.63676.53638.63616.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-Only2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38100200300400500SE +/- 0.20, N = 3SE +/- 0.26, N = 3SE +/- 0.27, N = 3SE +/- 0.20, N = 3448.96450.71459.12459.19

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G130260390520650SE +/- 0.52, N = 3SE +/- 0.15, N = 3589.33590.50

Build: allmodconfig

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: gcc: internal compiler error: Segmentation fault signal terminated program as

LeelaChessZero

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigen2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50100150200250SE +/- 1.15, N = 3SE +/- 2.85, N = 3SE +/- 1.45, N = 32472362272201. (CXX) g++ options: -flto -pthread

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshield2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 5.81, N = 15SE +/- 0.35, N = 3SE +/- 0.28, N = 3SE +/- 0.28, N = 3124.37128.01128.32128.76

LeelaChessZero

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLAS2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G60120180240300SE +/- 3.79, N = 3SE +/- 2.91, N = 3SE +/- 2.65, N = 32642442261. (CXX) g++ options: -flto -pthread

Backend: BLAS

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

XNNPACK

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382004006008001000SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 1.33, N = 37567617647861. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Large2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382004006008001000SE +/- 4.91, N = 3SE +/- 1.20, N = 3SE +/- 2.73, N = 3SE +/- 3.21, N = 311151118112411541. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382004006008001000SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 2.96, N = 3SE +/- 3.18, N = 37597657657831. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382004006008001000SE +/- 1.76, N = 3SE +/- 1.20, N = 3SE +/- 1.86, N = 3SE +/- 0.58, N = 37707737767971. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Large2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3830060090012001500SE +/- 2.89, N = 3SE +/- 4.58, N = 3SE +/- 3.53, N = 3SE +/- 8.11, N = 312911302130313341. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382004006008001000SE +/- 2.31, N = 3SE +/- 3.38, N = 3SE +/- 1.20, N = 3SE +/- 2.31, N = 398599699710181. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382004006008001000SE +/- 2.31, N = 3SE +/- 2.31, N = 3SE +/- 5.00, N = 3SE +/- 3.18, N = 38128158188361. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Large2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3830060090012001500SE +/- 19.19, N = 3SE +/- 9.64, N = 3SE +/- 2.08, N = 3SE +/- 3.18, N = 315161527155315901. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3830060090012001500SE +/- 5.21, N = 3SE +/- 4.98, N = 3SE +/- 8.84, N = 3SE +/- 9.00, N = 312641272127312851. (CXX) g++ options: -O3 -lrt -lm

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3670140210280350SE +/- 0.22, N = 3SE +/- 0.12, N = 3SE +/- 0.38, N = 2298.05301.63304.62

Build System: Ninja

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: /usr/include/c++/13/bits/vector.tcc:445:7: internal compiler error: Segmentation fault

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compile2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G80160240320400SE +/- 0.24, N = 3SE +/- 0.34, N = 3341.68345.25

Time To Compile

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop Container2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50100150200250SE +/- 0.99, N = 3SE +/- 0.76, N = 3SE +/- 0.74, N = 3239.22242.00247.06

Model: INIVOL and Fluid Structure Interaction Drop Container

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test run did not produce a result. E: ** ERROR: FILE fsi_drop_container_0000_0001.rst NOT FOUND

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.38.2VGR Performance Metric2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36110K220K330K440K550K4989664944374897844869541. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Stockfish

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmark2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3211M22M33M44M55MSE +/- 491754.88, N = 15SE +/- 166041.67, N = 3SE +/- 515794.89, N = 15SE +/- 410305.51, N = 3513777785010139649974210491941551. Stockfish 16 by the Stockfish developers (see AUTHORS file)

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughput2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3620000M40000M60000M80000M100000M92045826799.5491554717298.4390340365706.7989012581592.65

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 602 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323691215SE +/- 0.00741, N = 3SE +/- 0.00584, N = 3SE +/- 0.00142, N = 3SE +/- 0.00125, N = 39.516019.165908.935608.797371. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To Compile2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50100150200250SE +/- 0.19, N = 3SE +/- 0.11, N = 3219.27221.14

Time To Compile

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Warning: Protocol buffer compiler (protoc) not found.

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G132639526555.5757.2558.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Mesh Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362040608010084.9884.9985.4788.081. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 64 - Model: ResNet-502 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1224364860SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 355.1954.3053.5452.27

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3K6K9K12K15KSE +/- 15.11, N = 3SE +/- 16.93, N = 3SE +/- 42.19, N = 3SE +/- 18.27, N = 316097.0015610.3015531.7614528.261. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

simdjson

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostya2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38246810SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 37.717.637.577.501. (CXX) g++ options: -O3 -lrt

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G4K8K12K16K20KSE +/- 58.10, N = 3SE +/- 144.90, N = 15SE +/- 239.88, N = 3SE +/- 10.56, N = 320850.8820349.4120124.9419467.391. (CC) gcc options: -O3 -march=native

Build2

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3820406080100SE +/- 0.16, N = 3SE +/- 0.52, N = 3SE +/- 0.40, N = 3SE +/- 0.99, N = 276.0077.0177.9280.10

GROMACS

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bare2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.4520.9041.3561.8082.26SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.002, N = 32.0091.9961.9271. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

Input: water_GMX50_bare

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Fatal error:

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362004006008001000SE +/- 2.01, N = 3SE +/- 8.61, N = 3SE +/- 4.91, N = 3SE +/- 1.37, N = 31070.611068.361064.171046.91

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beam2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3820406080100SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 376.9077.1877.4577.81

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1632486480SE +/- 0.59, N = 3SE +/- 0.59, N = 3SE +/- 0.31, N = 369.4469.8272.4473.141. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

simdjson

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandom2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.4770.9541.4311.9082.385SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 32.122.112.112.101. (CXX) g++ options: -O3 -lrt

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1632486480SE +/- 0.24, N = 3SE +/- 0.34, N = 3SE +/- 0.16, N = 2SE +/- 0.34, N = 368.9269.5672.0872.201. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

simdjson

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweet2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C383691215SE +/- 0.02, N = 3SE +/- 0.17, N = 3SE +/- 0.15, N = 3SE +/- 0.14, N = 313.5913.3813.2412.881. (CXX) g++ options: -O3 -lrt

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C385K10K15K20K25KSE +/- 204.03, N = 3SE +/- 188.50, N = 12SE +/- 49.10, N = 3SE +/- 56.11, N = 322817.9022294.0622247.7821444.001. (CC) gcc options: -O3 -march=native

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-502 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381326395265SE +/- 0.40, N = 15SE +/- 0.36, N = 3SE +/- 0.34, N = 3SE +/- 0.25, N = 358.8958.4957.5456.19MIN: 34.79 / MAX: 62.62MIN: 34.6 / MAX: 60.01MIN: 53.14 / MAX: 58.34MIN: 39.35 / MAX: 57.3

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 1282 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G110220330440550SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.33, N = 3SE +/- 0.38, N = 3508.9505.9491.1472.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3820406080100SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 377.3977.2576.2075.931. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38400800120016002000SE +/- 0.64, N = 3SE +/- 0.32, N = 3SE +/- 1.03, N = 3SE +/- 2.32, N = 31934.611931.141904.951898.261. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:102 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381.3M2.6M3.9M5.2M6.5MSE +/- 15831.56, N = 3SE +/- 11599.22, N = 3SE +/- 11157.27, N = 3SE +/- 4372.40, N = 35915791.345898565.345874747.735694123.521. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1002 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381.6M3.2M4.8M6.4M8MSE +/- 87376.76, N = 3SE +/- 1150.91, N = 3SE +/- 43634.73, N = 3SE +/- 71150.32, N = 37647187.487507615.737498825.197350932.881. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-Only2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381428425670SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.17, N = 362.0962.2163.4164.08

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.648.648.558.46MIN: 7.67 / MAX: 9.25MIN: 7.64 / MAX: 9.25MIN: 7.48 / MAX: 9.15MIN: 7.49 / MAX: 9.04

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381.05532.11063.16594.22125.2765SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 34.694.654.624.60MIN: 2.15 / MAX: 5.26MIN: 2.02 / MAX: 5.24MIN: 2.07 / MAX: 5.22MIN: 2.13 / MAX: 5.16

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361.15882.31763.47644.63525.794SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.155.145.055.04MIN: 2.38 / MAX: 5.77MIN: 2.39 / MAX: 5.75MIN: 2.28 / MAX: 5.66MIN: 2.28 / MAX: 5.66

Llama.cpp

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b3067Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 39.008.668.368.321. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381.21952.4393.65854.8786.0975SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.425.405.365.34MIN: 5.31 / MAX: 5.77MIN: 5.27 / MAX: 5.72MIN: 5.25 / MAX: 5.7MIN: 5.23 / MAX: 5.69

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1428425670SE +/- 0.14, N = 3SE +/- 0.01, N = 2SE +/- 0.20, N = 3SE +/- 0.09, N = 358.2060.4860.8064.111. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installation2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381224364860SE +/- 0.10, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 3SE +/- 0.06, N = 353.1653.2353.6654.27

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.74211.48422.22632.96843.7105SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 33.2983.2283.1411. (CXX) g++ options: -O3 -lm

Implementation: MPI CPU - Input: water_GMX50_bare

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-System-Product-Name exited on signal 11 (Segmentation fault).

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3661218243022.5822.8223.141. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Small Mesh Size - Mesh Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] #0 Foam::error::printStack(Foam::Ostream&) at ??:?

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G13K26K39K52K65KSE +/- 85.26, N = 3SE +/- 8.10, N = 3SE +/- 74.71, N = 3SE +/- 54.79, N = 360124.0858570.7758570.0355287.371. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: STMV with 1,066,628 Atoms2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.23050.4610.69150.9221.1525SE +/- 0.00057, N = 3SE +/- 0.00200, N = 3SE +/- 0.00137, N = 2SE +/- 0.00027, N = 31.024261.016020.998520.98084

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38612182430SE +/- 0.20, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.16, N = 1523.0023.2923.3623.871. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150145.88148.76157.151. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Small Mesh Size - Execution Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-Only2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361122334455SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 346.3846.5647.5147.72

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Test2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1020304050SE +/- 0.15, N = 3SE +/- 0.29, N = 3SE +/- 0.18, N = 343.2543.4244.90

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 642 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G60120180240300SE +/- 0.37, N = 3SE +/- 0.03, N = 3SE +/- 0.53, N = 3SE +/- 0.20, N = 3258.6253.3249.7237.11. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.26, N = 3SE +/- 0.07, N = 3132.2128.0127.0121.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361122334455SE +/- 0.30, N = 3SE +/- 0.30, N = 3SE +/- 0.40, N = 347.7647.7848.26

Build: defconfig

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: : internal compiler error: Segmentation fault

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P12 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C385M10M15M20M25MSE +/- 21858.13, N = 3SE +/- 92074.85, N = 3SE +/- 41633.32, N = 3SE +/- 16666.67, N = 3256766672566666725660000251366671. (CXX) g++ options: -fopenmp -O3 -march=native

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C385K10K15K20K25KSE +/- 72.38, N = 3SE +/- 31.03, N = 3SE +/- 265.89, N = 3SE +/- 227.86, N = 1522774.9522539.7222510.4622093.991. (CC) gcc options: -O3 -march=native

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: ATPase with 327,506 Atoms2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.8111.6222.4333.2444.055SE +/- 0.03856, N = 3SE +/- 0.02618, N = 15SE +/- 0.02740, N = 9SE +/- 0.02586, N = 33.604643.577553.509583.41007

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C388001600240032004000SE +/- 2.85, N = 3SE +/- 40.74, N = 3SE +/- 36.74, N = 3SE +/- 51.16, N = 33823.803739.163714.663712.451. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G14K28K42K56K70KSE +/- 72.30, N = 3SE +/- 210.02, N = 3SE +/- 130.21, N = 3SE +/- 150.93, N = 364031.3162375.4362084.7259920.961. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G15K30K45K60K75KSE +/- 453.50, N = 3SE +/- 762.95, N = 5SE +/- 299.96, N = 3SE +/- 820.93, N = 370747.0070345.5366545.2465797.051. (CC) gcc options: -O3 -march=native

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38714212835SE +/- 0.39, N = 3SE +/- 0.38, N = 3SE +/- 0.28, N = 329.8230.5430.6831.981. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory Copying2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382K4K6K8K10KSE +/- 44.94, N = 3SE +/- 72.54, N = 3SE +/- 39.56, N = 3SE +/- 25.18, N = 310752.8110720.3110608.9610608.821. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G15K30K45K60K75KSE +/- 315.00, N = 3SE +/- 204.48, N = 3SE +/- 805.54, N = 4SE +/- 119.74, N = 369491.4268588.8466822.4965503.931. (CC) gcc options: -O3 -march=native

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G612182430SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.17, N = 324.9625.1925.4525.691. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G14K28K42K56K70KSE +/- 108.61, N = 3SE +/- 382.92, N = 3SE +/- 64.38, N = 3SE +/- 426.98, N = 365113.5263967.7062771.0061079.931. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G400800120016002000SE +/- 4.49, N = 3SE +/- 5.23, N = 3SE +/- 2.77, N = 3SE +/- 5.33, N = 31635.341586.591584.931505.471. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G14K28K42K56K70KSE +/- 439.59, N = 3SE +/- 33.42, N = 3SE +/- 130.09, N = 3SE +/- 156.92, N = 366963.0365845.2164832.8662428.701. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G16K32K48K64K80KSE +/- 364.59, N = 3SE +/- 266.14, N = 3SE +/- 513.22, N = 3SE +/- 220.54, N = 373724.8871754.5870445.7469141.081. (CC) gcc options: -O3 -march=native

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G4K8K12K16K20KSE +/- 211.85, N = 3SE +/- 32.81, N = 3SE +/- 36.42, N = 3SE +/- 24.77, N = 320923.9920817.5719747.5819534.161. (CC) gcc options: -O3 -march=native

Etcpak

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38160320480640800SE +/- 1.25, N = 3SE +/- 0.38, N = 3SE +/- 1.36, N = 3SE +/- 0.82, N = 3741.78739.74734.53731.231. (CXX) g++ options: -flto -pthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Crown2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38816243240SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 336.5536.2536.1135.18MIN: 36.13 / MAX: 37.34MIN: 35.89 / MAX: 36.94MIN: 35.69 / MAX: 36.98MIN: 34.83 / MAX: 36.01

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3820406080100SE +/- 0.63, N = 3SE +/- 0.96, N = 4SE +/- 0.27, N = 3SE +/- 0.82, N = 586.7985.5284.8981.89MIN: 78.82 / MAX: 88.23MIN: 68.54 / MAX: 88.53MIN: 77.17 / MAX: 85.87MIN: 67.48 / MAX: 84.5

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C381020304050SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 341.7241.4940.9440.07MIN: 41.48 / MAX: 42.26MIN: 41.24 / MAX: 42.02MIN: 40.73 / MAX: 41.6MIN: 39.86 / MAX: 40.55

POV-Ray

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3648121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 316.1016.1316.3216.401. POV-Ray 3.7.0.10.unofficial

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Compression Rating2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G40K80K120K160K200KSE +/- 467.61, N = 3SE +/- 94.03, N = 3SE +/- 322.74, N = 32030652008101955851. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Test: Compression Rating

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

Y-Cruncher

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1B2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 315.4315.6116.45

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Decompression Rating2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3630K60K90K120K150KSE +/- 14.15, N = 3SE +/- 69.90, N = 3SE +/- 33.91, N = 31581331575121570051. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Test: Decompression Rating

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5K10K15K20K25KSE +/- 10.38, N = 3SE +/- 26.91, N = 3SE +/- 40.85, N = 3SE +/- 21.50, N = 323254.4022908.5822436.8121328.281. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

x265

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4K2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38918273645SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 338.3038.2738.0737.091. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G6K12K18K24K30KSE +/- 95.45, N = 3SE +/- 30.67, N = 3SE +/- 113.15, N = 3SE +/- 141.73, N = 329713.5528891.8628743.4927106.181. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 312.9713.2613.4014.091. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3K6K9K12K15KSE +/- 10.26, N = 3SE +/- 41.76, N = 3SE +/- 36.86, N = 3SE +/- 21.24, N = 311908.4711662.4411374.5111144.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3820406080100SE +/- 0.02, N = 3SE +/- 0.27, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 377.0876.8976.1275.991. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38400800120016002000SE +/- 0.54, N = 3SE +/- 6.84, N = 3SE +/- 0.31, N = 3SE +/- 0.87, N = 31926.891922.131903.041899.801. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38510152025SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 320.0819.9919.8419.33MIN: 17.96 / MAX: 20.55MIN: 18.05 / MAX: 20.42MIN: 17.86 / MAX: 20.21MIN: 17.37 / MAX: 19.62

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3848121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 317.3417.3316.9216.81

Y-Cruncher

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500M2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G246810SE +/- 0.023, N = 3SE +/- 0.016, N = 3SE +/- 0.006, N = 37.2367.2477.587

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G6K12K18K24K30KSE +/- 34.50, N = 3SE +/- 10.24, N = 3SE +/- 20.77, N = 3SE +/- 6.88, N = 326596.4625715.3425622.9324236.211. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.32 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382K4K6K8K10KSE +/- 13.71, N = 3SE +/- 67.88, N = 3SE +/- 19.00, N = 3SE +/- 38.48, N = 310578.5210235.729945.539832.641. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

x265

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080p2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38306090120150SE +/- 0.27, N = 3SE +/- 0.45, N = 3SE +/- 0.13, N = 3SE +/- 0.23, N = 3134.52134.48134.19131.281. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 17.86, N = 3SE +/- 53.14, N = 3SE +/- 21.03, N = 3SE +/- 25.12, N = 33741.863731.973722.563678.461. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

104 Results Shown

Llamafile
OpenRadioss
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
Xmrig
Blender
Timed Linux Kernel Compilation
LeelaChessZero
OpenRadioss
LeelaChessZero
XNNPACK:
  QU8MobileNetV3Small
  QU8MobileNetV3Large
  QU8MobileNetV2
  FP16MobileNetV3Small
  FP16MobileNetV3Large
  FP16MobileNetV2
  FP32MobileNetV3Small
  FP32MobileNetV3Large
  FP32MobileNetV2
Timed LLVM Compilation
Timed Node.js Compilation
OpenRadioss
BRL-CAD
Stockfish
Java JMH
High Performance Conjugate Gradient
Timed Gem5 Compilation
OpenFOAM:
  motorBike - Execution Time
  motorBike - Mesh Time
TensorFlow
NAS Parallel Benchmarks
simdjson
MBW
Build2
GROMACS
Numpy Benchmark
OpenRadioss
SPECFEM3D
simdjson
SPECFEM3D
simdjson
MBW
PyTorch
libxsmm
miniBUDE:
  OpenMP - BM2:
    Billion Interactions/s
    GFInst/s
Memcached:
  1:10
  1:100
Blender
LuxCoreRender:
  Orange Juice - CPU
  Danish Mood - CPU
  LuxCore Benchmark - CPU
Llama.cpp
LuxCoreRender
Xcompact3d Incompact3d
OpenRadioss
GROMACS
OpenFOAM
NAS Parallel Benchmarks
NAMD
SPECFEM3D
OpenFOAM
Blender
OpenRadioss
libxsmm:
  64
  32
Timed Linux Kernel Compilation
Quicksilver
MBW
NAMD
NAS Parallel Benchmarks:
  EP.D
  LU.C
RAMspeed SMP
SPECFEM3D
Stress-NG
RAMspeed SMP
SPECFEM3D
RAMspeed SMP
NAS Parallel Benchmarks
RAMspeed SMP:
  Triad - Integer
  Scale - Integer
MBW
Etcpak
Embree
PyTorch
Embree
POV-Ray
7-Zip Compression
Y-Cruncher
7-Zip Compression
NAS Parallel Benchmarks
x265
NAS Parallel Benchmarks
Xcompact3d Incompact3d
NAS Parallel Benchmarks
miniBUDE:
  OpenMP - BM1:
    Billion Interactions/s
    GFInst/s
LuxCoreRender
TensorFlow
Y-Cruncher
NAS Parallel Benchmarks
LULESH
x265
NAS Parallel Benchmarks