AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2408225-NE-RYZEN999510&grr&sor.

System configuration (2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G)

Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G
Disk: 2000GB Corsair MP700 PRO
Graphics: AMD Radeon RX 7900 GRE 16GB
Audio: AMD Navi 31 HDMI/DP
Monitor: DELL U2723QE
Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04
Kernel: 6.10.0-phx (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

Memory in the other tested configurations

2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a

Python Details: Python 3.12.3

Security Details:
gather_data_sampling: Not affected
itlb_multihit: Not affected
l1tf: Not affected
mds: Not affected
meltdown: Not affected
mmio_stale_data: Not affected
reg_file_data_sampling: Not affected
retbleed: Not affected
spec_rstack_overflow: Not affected
spec_store_bypass: Mitigation of SSB disabled via prctl
spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
srbds: Not affected
tsx_async_abort: Not affected

Java Details (2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32, 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36, 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38): OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
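Each kit label in this result file pairs a data rate (MT/s) with a CAS latency in clock cycles, so the CL figures are not directly comparable across speeds. As a rough aid, the sketch below converts the nominal ratings to nanoseconds with the common approximation CL x 2000 / data rate. The function name and dictionary layout are illustrative and not part of the export, and the inputs are the advertised kit ratings, not timings read back from the test system.

```python
def cas_latency_ns(data_rate_mts: int, cas_cycles: int) -> float:
    """Approximate CAS latency in nanoseconds.

    DDR memory transfers twice per clock, so one clock cycle lasts
    2000 / data_rate_mts nanoseconds.
    """
    return cas_cycles * 2000.0 / data_rate_mts


# Nominal (data rate, CL) pairs taken from the kit labels in this result file.
KITS = {
    "2 x 16GB DDR5-6000 CL30": (6000, 30),
    "2 x 32GB DDR5-6400 CL32": (6400, 32),
    "2 x 16GB DDR5-8000 CL36": (8000, 36),
    "2 x 24GB DDR5-8000 CL38": (8000, 38),
}

for label, (rate, cl) in KITS.items():
    print(f"{label}: {cas_latency_ns(rate, cl):.2f} ns")
```

This covers only the nominal CAS figure; secondary timings and the memory controller ratio are outside what the labels state.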


Llamafile

Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU

Llamafile 0.8.6 (Tokens Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 4.68

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 573.33 (SE +/- 49.36, N = 9)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 668.16 (SE +/- 1.02, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 705.55 (SE +/- 0.28, N = 3)

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 174.13
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 177.79
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 181.57
(CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1909.67
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1994.75
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 2096.57
(CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
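The results in this file mix "Fewer Is Better" and "More Is Better" metrics, so comparing kits means taking the ratio in the right direction. The following is a minimal sketch of that arithmetic using the OpenFOAM drivaerFastback Medium Mesh Size execution times reported here; the function name, the shortened kit labels, and the choice of the DDR5-6000 kit as the baseline are illustrative assumptions, not something the export defines.

```python
def percent_faster(baseline: float, candidate: float, fewer_is_better: bool) -> float:
    """Return how much faster `candidate` is than `baseline`, in percent.

    For time-like metrics (fewer is better) the speedup is baseline / candidate;
    for rate-like metrics (more is better) it is candidate / baseline.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    ratio = baseline / candidate if fewer_is_better else candidate / baseline
    return (ratio - 1.0) * 100.0


# OpenFOAM 10, drivaerFastback, Medium Mesh Size, Execution Time (seconds).
openfoam_medium_exec = {
    "DDR5-6000 CL30": 2096.57,
    "DDR5-6400 CL32": 1994.75,
    "DDR5-8000 CL36": 1909.67,
}

baseline = openfoam_medium_exec["DDR5-6000 CL30"]
for kit, seconds in openfoam_medium_exec.items():
    delta = percent_faster(baseline, seconds, fewer_is_better=True)
    print(f"{kit}: {delta:+.1f}% vs DDR5-6000 CL30")
```

For a "More Is Better" result such as nodes per second, pass `fewer_is_better=False` so that a higher value counts as the improvement.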

Xmrig

Variant: GhostRider - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3699.6 (SE +/- 30.71, N = 9)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 3676.5 (SE +/- 36.12, N = 6)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3638.6 (SE +/- 37.04, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3616.6 (SE +/- 6.72, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 448.96 (SE +/- 0.20, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 450.71 (SE +/- 0.26, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 459.12 (SE +/- 0.27, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 459.19 (SE +/- 0.20, N = 3)

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 589.33 (SE +/- 0.52, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 590.50 (SE +/- 0.15, N = 3)

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.31.1 (Nodes Per Second, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 247
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 236
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 227
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 220
Standard errors reported: SE +/- 1.15, N = 3; SE +/- 2.85, N = 3; SE +/- 1.45, N = 3
(CXX) g++ options: -flto -pthread

OpenRadioss

Model: Bird Strike on Windshield

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 124.37 (SE +/- 5.81, N = 15)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 128.01 (SE +/- 0.35, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 128.32 (SE +/- 0.28, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 128.76 (SE +/- 0.28, N = 3)

LeelaChessZero

Backend: BLAS

LeelaChessZero 0.31.1 (Nodes Per Second, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 264 (SE +/- 3.79, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 244 (SE +/- 2.91, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 226 (SE +/- 2.65, N = 3)
(CXX) g++ options: -flto -pthread

XNNPACK

Model: QU8MobileNetV3Small

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 756
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 761
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 764
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 786
Standard errors reported: SE +/- 0.67, N = 3; SE +/- 0.88, N = 3; SE +/- 1.33, N = 3
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Large

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1115 (SE +/- 4.91, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1118 (SE +/- 1.20, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1124 (SE +/- 2.73, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1154 (SE +/- 3.21, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV2

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 759 (SE +/- 0.33, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 765 (SE +/- 0.88, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 765 (SE +/- 2.96, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 783 (SE +/- 3.18, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 770 (SE +/- 1.76, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 773 (SE +/- 1.20, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 776 (SE +/- 1.86, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 797 (SE +/- 0.58, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1291 (SE +/- 2.89, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1302 (SE +/- 4.58, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1303 (SE +/- 3.53, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1334 (SE +/- 8.11, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 985 (SE +/- 2.31, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 996 (SE +/- 3.38, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 997 (SE +/- 1.20, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1018 (SE +/- 2.31, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 812 (SE +/- 2.31, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 815 (SE +/- 2.31, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 818 (SE +/- 5.00, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 836 (SE +/- 3.18, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1516 (SE +/- 19.19, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1527 (SE +/- 9.64, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1553 (SE +/- 2.08, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1590 (SE +/- 3.18, N = 3)
(CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

XNNPACK 2cd86b (us, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1264 (SE +/- 5.21, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1272 (SE +/- 4.98, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1273 (SE +/- 8.84, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1285 (SE +/- 9.00, N = 3)
(CXX) g++ options: -O3 -lrt -lm

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 298.05 (SE +/- 0.22, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 301.63 (SE +/- 0.12, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 304.62 (SE +/- 0.38, N = 2)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 341.68 (SE +/- 0.24, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 345.25 (SE +/- 0.34, N = 3)

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 239.22 (SE +/- 0.99, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 242.00 (SE +/- 0.76, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 247.06 (SE +/- 0.74, N = 3)

BRL-CAD

VGR Performance Metric

BRL-CAD 7.38.2 (VGR Performance Metric, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 498966
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 494437
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 489784
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 486954
(CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Stockfish

Chess Benchmark

Stockfish (Nodes Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 51377778 (SE +/- 491754.88, N = 15)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 50101396 (SE +/- 166041.67, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 49974210 (SE +/- 515794.89, N = 15)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 49194155 (SE +/- 410305.51, N = 3)
Stockfish 16 by the Stockfish developers (see AUTHORS file)

Java JMH

Throughput

Java JMH (Ops/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 92045826799.54
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 91554717298.43
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 90340365706.79
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 89012581592.65

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 9.51601 (SE +/- 0.00741, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 9.16590 (SE +/- 0.00584, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.93560 (SE +/- 0.00142, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 8.79737 (SE +/- 0.00125, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 219.27 (SE +/- 0.19, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 221.14 (SE +/- 0.11, N = 3)

OpenFOAM

Input: motorBike - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 55.57
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 57.25
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 58.57
(CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: motorBike - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 84.98
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 84.99
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 85.47
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 88.08
(CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.16.1 (images/sec, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 55.19 (SE +/- 0.01, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 54.30 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 53.54 (SE +/- 0.09, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 52.27 (SE +/- 0.02, N = 3)

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 16097.00 (SE +/- 15.11, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 15610.30 (SE +/- 16.93, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 15531.76 (SE +/- 42.19, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 14528.26 (SE +/- 18.27, N = 3)
(F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Open MPI 4.1.6

simdjson

Throughput Test: Kostya

simdjson 3.10 (GB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 7.71 (SE +/- 0.05, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 7.63 (SE +/- 0.06, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 7.57 (SE +/- 0.04, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 7.50 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -lrt

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 20850.88 (SE +/- 58.10, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 20349.41 (SE +/- 144.90, N = 15)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 20124.94 (SE +/- 239.88, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19467.39 (SE +/- 10.56, N = 3)
(CC) gcc options: -O3 -march=native

Build2

Time To Compile

Build2 0.17 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.00 (SE +/- 0.16, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 77.01 (SE +/- 0.52, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 77.92 (SE +/- 0.40, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 80.10 (SE +/- 0.99, N = 2)

GROMACS

Input: water_GMX50_bare

GROMACS (Ns Per Day, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 2.009 (SE +/- 0.001, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1.996 (SE +/- 0.000, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1.927 (SE +/- 0.002, N = 3)
GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

Numpy Benchmark

Numpy Benchmark (Score, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1070.61 (SE +/- 2.01, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1068.36 (SE +/- 8.61, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1064.17 (SE +/- 4.91, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1046.91 (SE +/- 1.37, N = 3)

OpenRadioss

Model: Bumper Beam

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.90 (SE +/- 0.07, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 77.18 (SE +/- 0.14, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 77.45 (SE +/- 0.12, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 77.81 (SE +/- 0.02, N = 3)

SPECFEM3D

Model: Layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 69.44
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 69.82
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 72.44
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 73.14
Standard errors reported: SE +/- 0.59, N = 3; SE +/- 0.59, N = 3; SE +/- 0.31, N = 3
(F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

simdjson

Throughput Test: LargeRandom

simdjson 3.10 (GB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 2.12 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 2.11 (SE +/- 0.00, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 2.11 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 2.10 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3 -lrt

SPECFEM3D

Model: Water-layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 68.92 (SE +/- 0.24, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 69.56 (SE +/- 0.34, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 72.08 (SE +/- 0.16, N = 2)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 72.20 (SE +/- 0.34, N = 3)
(F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

simdjson

Throughput Test: TopTweet

simdjson 3.10 (GB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 13.59 (SE +/- 0.02, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 13.38 (SE +/- 0.17, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 13.24 (SE +/- 0.15, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 12.88 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -lrt

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 22817.90 (SE +/- 204.03, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 22294.06 (SE +/- 188.50, N = 12)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22247.78 (SE +/- 49.10, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 21444.00 (SE +/- 56.11, N = 3)
(CC) gcc options: -O3 -march=native

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

PyTorch 2.2.1 (batches/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 58.89 (SE +/- 0.40, N = 15; MIN: 34.79 / MAX: 62.62)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 58.49 (SE +/- 0.36, N = 3; MIN: 34.6 / MAX: 60.01)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 57.54 (SE +/- 0.34, N = 3; MIN: 53.14 / MAX: 58.34)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 56.19 (SE +/- 0.25, N = 3; MIN: 39.35 / MAX: 57.3)

libxsmm

M N K: 128

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 508.9 (SE +/- 0.09, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 505.9 (SE +/- 0.10, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 491.1 (SE +/- 0.33, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 472.0 (SE +/- 0.38, N = 3)
(CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (Billion Interactions/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 77.39 (SE +/- 0.03, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 77.25 (SE +/- 0.01, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 76.20 (SE +/- 0.04, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 75.93 (SE +/- 0.09, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (GFInst/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1934.61 (SE +/- 0.64, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1931.14 (SE +/- 0.32, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1904.95 (SE +/- 1.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1898.26 (SE +/- 2.32, N = 3)
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.19 (Ops/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 5915791.34 (SE +/- 15831.56, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 5898565.34 (SE +/- 11599.22, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 5874747.73 (SE +/- 11157.27, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 5694123.52 (SE +/- 4372.40, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 7647187.48 (SE +/- 87376.76, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 7507615.73 (SE +/- 1150.91, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 7498825.19 (SE +/- 43634.73, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 7350932.88 (SE +/- 71150.32, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Blender

Blend File: Junkshop - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 62.09 (SE +/- 0.12, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 62.21 (SE +/- 0.03, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 63.41 (SE +/- 0.08, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 64.08 (SE +/- 0.17, N = 3)

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 8.64 (SE +/- 0.01, N = 3; MIN: 7.67 / MAX: 9.25)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.64 (SE +/- 0.01, N = 3; MIN: 7.64 / MAX: 9.25)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 8.55 (SE +/- 0.01, N = 3; MIN: 7.48 / MAX: 9.15)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 8.46 (SE +/- 0.01, N = 3; MIN: 7.49 / MAX: 9.04)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 4.69 (SE +/- 0.01, N = 3; MIN: 2.15 / MAX: 5.26)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 4.65 (SE +/- 0.06, N = 3; MIN: 2.02 / MAX: 5.24)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 4.62 (SE +/- 0.01, N = 3; MIN: 2.07 / MAX: 5.22)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 4.60 (SE +/- 0.02, N = 3; MIN: 2.13 / MAX: 5.16)

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 5.15 (SE +/- 0.01, N = 3; MIN: 2.38 / MAX: 5.77)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 5.14 (SE +/- 0.00, N = 3; MIN: 2.39 / MAX: 5.75)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 5.05 (SE +/- 0.02, N = 3; MIN: 2.28 / MAX: 5.66)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 5.04 (SE +/- 0.01, N = 3; MIN: 2.28 / MAX: 5.66)

Llama.cpp

Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf

Llama.cpp b3067, Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf (Tokens Per Second, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 9.00 (SE +/- 0.01, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 8.66 (SE +/- 0.01, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.36 (SE +/- 0.01, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 8.32 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
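To put the Llama.cpp spread in perspective, the following is a minimal Python sketch (not part of the Phoronix Test Suite) that expresses each kit's throughput relative to the slowest result listed above. The shortened dictionary keys and the helper name `percent_over_slowest` are illustrative choices made here, not names used by the export.

```python
# Tokens per second copied from the Llama.cpp b3067 result above.
# The short keys are illustrative labels, not names used by the export.
results = {
    "DDR5-8000 CL36 2x16GB": 9.00,
    "DDR5-8000 CL38 2x24GB": 8.66,
    "DDR5-6000 CL30 2x16GB": 8.36,
    "DDR5-6400 CL32 2x32GB": 8.32,
}


def percent_over_slowest(values):
    """Return each entry's percentage advantage over the lowest value."""
    baseline = min(values.values())
    return {name: (v / baseline - 1.0) * 100.0 for name, v in values.items()}


uplift = percent_over_slowest(results)
for name, pct in uplift.items():
    print(f"{name}: +{pct:.1f}%")
```

The same helper only makes sense for "More Is Better" results; for "Fewer Is Better" results the ratio would need to be inverted.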

LuxCoreRender

Scene: DLSC - Acceleration: CPU

LuxCoreRender 2.6, Scene: DLSC - Acceleration: CPU (M samples/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 5.42 (SE +/- 0.01, N = 3; MIN: 5.31 / MAX: 5.77)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 5.40 (SE +/- 0.01, N = 3; MIN: 5.27 / MAX: 5.72)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 5.36 (SE +/- 0.01, N = 3; MIN: 5.25 / MAX: 5.7)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 5.34 (SE +/- 0.00, N = 3; MIN: 5.23 / MAX: 5.69)

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 58.20 (SE +/- 0.14, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 60.48 (SE +/- 0.01, N = 2)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 60.80 (SE +/- 0.20, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 64.11 (SE +/- 0.09, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenRadioss 2023.09.15, Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 53.16 (SE +/- 0.10, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 53.23 (SE +/- 0.15, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 53.66 (SE +/- 0.16, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 54.27 (SE +/- 0.06, N = 3)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3.298 (SE +/- 0.002, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3.228 (SE +/- 0.004, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3.141 (SE +/- 0.001, N = 3)
1. (CXX) g++ options: -O3 -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22.58
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 22.82
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 23.14
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 60124.08 (SE +/- 85.26, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 58570.77 (SE +/- 8.10, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 58570.03 (SE +/- 74.71, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 55287.37 (SE +/- 54.79, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAMD

Input: STMV with 1,066,628 Atoms

NAMD 3.0b6, Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1.02426 (SE +/- 0.00057, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1.01602 (SE +/- 0.00200, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 0.99852 (SE +/- 0.00137, N = 2)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 0.98084 (SE +/- 0.00027, N = 3)

SPECFEM3D

Model: Tomographic Model

SPECFEM3D 4.1.1, Model: Tomographic Model (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 23.00 (SE +/- 0.20, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 23.29 (SE +/- 0.14, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 23.36 (SE +/- 0.16, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 23.87 (SE +/- 0.16, N = 15)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 145.88
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 148.76
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 157.15
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.2, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 46.38 (SE +/- 0.01, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 46.56 (SE +/- 0.09, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 47.51 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 47.72 (SE +/- 0.10, N = 3)

OpenRadioss

Model: Cell Phone Drop Test

OpenRadioss 2023.09.15, Model: Cell Phone Drop Test (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 43.25 (SE +/- 0.15, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 43.42 (SE +/- 0.29, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 44.90 (SE +/- 0.18, N = 3)

libxsmm

M N K: 64

libxsmm 2-1.17-3645, M N K: 64 (GFLOPS/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 258.6 (SE +/- 0.37, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 253.3 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 249.7 (SE +/- 0.53, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 237.1 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 32

libxsmm 2-1.17-3645, M N K: 32 (GFLOPS/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 132.2 (SE +/- 0.19, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 128.0 (SE +/- 0.07, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 127.0 (SE +/- 0.26, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 121.0 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.8, Build: defconfig (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 47.76 (SE +/- 0.30, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 47.78 (SE +/- 0.30, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 48.26 (SE +/- 0.40, N = 3)
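When results sit this close together, it can help to compare the gap between two means with their reported standard errors. The sketch below is a rough heuristic written for this note, not a formal significance test and not something the Phoronix Test Suite reports; it assumes the two sets of runs are independent, and the function name `gap_in_standard_errors` is hypothetical.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error.

    Assumes the two measurements are independent, so the standard
    errors add in quadrature.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# Fastest and slowest kernel build times (seconds) with their SE values,
# copied from the Timed Linux Kernel Compilation result above.
ratio = gap_in_standard_errors(47.76, 0.30, 48.26, 0.40)
print(f"gap = {ratio:.2f} combined standard errors")
```

A gap of roughly one combined standard error, as here, suggests the difference between the fastest and slowest kit in this test is within run-to-run noise.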

Quicksilver

Input: CORAL2 P1

Quicksilver 20230818, Input: CORAL2 P1 (Figure Of Merit, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 25676667 (SE +/- 21858.13, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 25666667 (SE +/- 92074.85, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 25660000 (SE +/- 41633.32, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 25136667 (SE +/- 16666.67, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native

MBW

Test: Memory Copy - Array Size: 4096 MiB

MBW 2018-09-08, Test: Memory Copy - Array Size: 4096 MiB (MiB/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22774.95 (SE +/- 72.38, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 22539.72 (SE +/- 31.03, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 22510.46 (SE +/- 265.89, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 22093.99 (SE +/- 227.86, N = 15)
1. (CC) gcc options: -O3 -march=native

NAMD

Input: ATPase with 327,506 Atoms

NAMD 3.0b6, Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3.60464 (SE +/- 0.03856, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3.57755 (SE +/- 0.02618, N = 15)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 3.50958 (SE +/- 0.02740, N = 9)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3.41007 (SE +/- 0.02586, N = 3)

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3823.80 (SE +/- 2.85, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3739.16 (SE +/- 40.74, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3714.66 (SE +/- 36.74, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 3712.45 (SE +/- 51.16, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 64031.31 (SE +/- 72.30, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 62375.43 (SE +/- 210.02, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 62084.72 (SE +/- 130.21, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 59920.96 (SE +/- 150.93, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

RAMspeed SMP

Type: Copy - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Copy - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 70747.00 (SE +/- 453.50, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 70345.53 (SE +/- 762.95, N = 5)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 66545.24 (SE +/- 299.96, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 65797.05 (SE +/- 820.93, N = 3)
1. (CC) gcc options: -O3 -march=native
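For context, the measured copy rates can be set against each kit's nominal peak transfer rate. The sketch below is a back-of-the-envelope calculation and rests on assumptions not stated in the export: that both DIMMs ran at their rated MT/s, that the two DIMMs together present a 128-bit (16-byte) data bus, and that RAMspeed's "MB/s" is close enough to 10^6 bytes per second for a rough comparison. The names `BUS_BYTES` and `peak_mb_per_s` are hypothetical.

```python
# Assumption: two DIMMs, each with 64 data bits, for 16 bytes per transfer.
BUS_BYTES = 16


def peak_mb_per_s(mega_transfers_per_s, bus_bytes=BUS_BYTES):
    """Nominal peak bandwidth in MB/s for a given MT/s rating."""
    return mega_transfers_per_s * bus_bytes


# (rated MT/s, measured RAMspeed SMP Integer Copy MB/s) from the result above.
measured = {
    "DDR5-6400 CL32": (6400, 70747.00),
    "DDR5-8000 CL36": (8000, 70345.53),
    "DDR5-8000 CL38": (8000, 66545.24),
    "DDR5-6000 CL30": (6000, 65797.05),
}

# Share of the nominal peak that the copy test actually reached.
fraction = {
    name: rate / peak_mb_per_s(mts) for name, (mts, rate) in measured.items()
}
for name, share in fraction.items():
    print(f"{name}: {share:.0%} of nominal peak")
```

Under these assumptions the two DDR5-8000 kits reach a smaller share of their nominal peak in this test than the DDR5-6000 and DDR5-6400 kits do, which is consistent with the narrow spread in the measured figures.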

SPECFEM3D

Model: Homogeneous Halfspace

SPECFEM3D 4.1.1, Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 29.82
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 30.54
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 30.68
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 31.98
Standard errors as listed in the export (three entries for four configurations, so the mapping to configurations is not recoverable): SE +/- 0.39, N = 3; SE +/- 0.38, N = 3; SE +/- 0.28, N = 3
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stress-NG

Test: Memory Copying

Stress-NG 0.17.08, Test: Memory Copying (Bogo Ops/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 10752.81 (SE +/- 44.94, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 10720.31 (SE +/- 72.54, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 10608.96 (SE +/- 39.56, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 10608.82 (SE +/- 25.18, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

RAMspeed SMP

Type: Average - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Average - Benchmark: Integer (MB/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 69491.42 (SE +/- 315.00, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 68588.84 (SE +/- 204.48, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 66822.49 (SE +/- 805.54, N = 4)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 65503.93 (SE +/- 119.74, N = 3)
1. (CC) gcc options: -O3 -march=native

SPECFEM3D

Model: Mount St. Helens

SPECFEM3D 4.1.1, Model: Mount St. Helens (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 24.96 (SE +/- 0.12, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 25.19 (SE +/- 0.05, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 25.45 (SE +/- 0.10, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 25.69 (SE +/- 0.17, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

RAMspeed SMP

Type: Add - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Add - Benchmark: Integer (MB/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 65113.52 (SE +/- 108.61, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 63967.70 (SE +/- 382.92, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 62771.00 (SE +/- 64.38, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 61079.93 (SE +/- 426.98, N = 3)
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1635.34 (SE +/- 4.49, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1586.59 (SE +/- 5.23, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1584.93 (SE +/- 2.77, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1505.47 (SE +/- 5.33, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

RAMspeed SMP

Type: Triad - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Triad - Benchmark: Integer (MB/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 66963.03 (SE +/- 439.59, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 65845.21 (SE +/- 33.42, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 64832.86 (SE +/- 130.09, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 62428.70 (SE +/- 156.92, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Scale - Benchmark: Integer (MB/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 73724.88 (SE +/- 364.59, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 71754.58 (SE +/- 266.14, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 70445.74 (SE +/- 513.22, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 69141.08 (SE +/- 220.54, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB (MiB/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 20923.99 (SE +/- 211.85, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 20817.57 (SE +/- 32.81, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 19747.58 (SE +/- 36.42, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19534.16 (SE +/- 24.77, N = 3)
1. (CC) gcc options: -O3 -march=native

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

Etcpak 2.0, Benchmark: Multi-Threaded - Configuration: ETC2 (Mpx/s, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 741.78 (SE +/- 1.25, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 739.74 (SE +/- 0.38, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 734.53 (SE +/- 1.36, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 731.23 (SE +/- 0.82, N = 3)
1. (CXX) g++ options: -flto -pthread

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 4.3, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 36.55 (SE +/- 0.04, N = 3; MIN: 36.13 / MAX: 37.34)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 36.25 (SE +/- 0.03, N = 3; MIN: 35.89 / MAX: 36.94)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 36.11 (SE +/- 0.02, N = 3; MIN: 35.69 / MAX: 36.98)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 35.18 (SE +/- 0.00, N = 3; MIN: 34.83 / MAX: 36.01)

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

PyTorch 2.2.1, Device: CPU - Batch Size: 1 - Model: ResNet-50 (batches/sec, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 86.79 (SE +/- 0.63, N = 3; MIN: 78.82 / MAX: 88.23)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 85.52 (SE +/- 0.96, N = 4; MIN: 68.54 / MAX: 88.53)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 84.89 (SE +/- 0.27, N = 3; MIN: 77.17 / MAX: 85.87)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 81.89 (SE +/- 0.82, N = 5; MIN: 67.48 / MAX: 84.5)

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 4.3, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 41.72 (SE +/- 0.03, N = 3; MIN: 41.48 / MAX: 42.26)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 41.49 (SE +/- 0.03, N = 3; MIN: 41.24 / MAX: 42.02)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 40.94 (SE +/- 0.01, N = 3; MIN: 40.73 / MAX: 41.6)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 40.07 (SE +/- 0.02, N = 3; MIN: 39.86 / MAX: 40.55)

POV-Ray

Trace Time

POV-Ray, Trace Time (Seconds, Fewer Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.10 (SE +/- 0.03, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 16.13 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 16.32 (SE +/- 0.01, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 16.40 (SE +/- 0.01, N = 3)
1. POV-Ray 3.7.0.10.unofficial

7-Zip Compression

Test: Compression Rating

7-Zip Compression, Test: Compression Rating (MIPS, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 203065 (SE +/- 467.61, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 200810 (SE +/- 94.03, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 195585 (SE +/- 322.74, N = 3)
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Y-Cruncher

Pi Digits To Calculate: 1B

Y-Cruncher 0.8.5, Pi Digits To Calculate: 1B (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 15.43 (SE +/- 0.03, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 15.61 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.45 (SE +/- 0.04, N = 3)

7-Zip Compression

Test: Decompression Rating

7-Zip Compression, Test: Decompression Rating (MIPS, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 158133 (SE +/- 14.15, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 157512 (SE +/- 69.90, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 157005 (SE +/- 33.91, N = 3)
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 23254.40 (SE +/- 10.38, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 22908.58 (SE +/- 26.91, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 22436.81 (SE +/- 40.85, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 21328.28 (SE +/- 21.50, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

x265

Video Input: Bosphorus 4K

x265, Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 38.30 (SE +/- 0.15, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 38.27 (SE +/- 0.04, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 38.07 (SE +/- 0.05, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 37.09 (SE +/- 0.15, N = 3)
1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 29713.55 (SE +/- 95.45, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 28891.86 (SE +/- 30.67, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 28743.49 (SE +/- 113.15, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 27106.18 (SE +/- 141.73, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 12.97 (SE +/- 0.08, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 13.26 (SE +/- 0.02, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 13.40 (SE +/- 0.13, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 14.09 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 11908.47 (SE +/- 10.26, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 11662.44 (SE +/- 41.76, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 11374.51 (SE +/- 36.86, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 11144.88 (SE +/- 21.24, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 (Billion Interactions/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 77.08 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.89 (SE +/- 0.27, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 76.12 (SE +/- 0.01, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 75.99 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 (GFInst/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1926.89 (SE +/- 0.54, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1922.13 (SE +/- 6.84, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1903.04 (SE +/- 0.31, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1899.80 (SE +/- 0.87, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

LuxCoreRender 2.6, Scene: Rainbow Colors and Prism - Acceleration: CPU (M samples/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 20.08 (SE +/- 0.15, N = 3; MIN: 17.96 / MAX: 20.55)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19.99 (SE +/- 0.07, N = 3; MIN: 18.05 / MAX: 20.42)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 19.84 (SE +/- 0.08, N = 3; MIN: 17.86 / MAX: 20.21)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 19.33 (SE +/- 0.13, N = 3; MIN: 17.37 / MAX: 19.62)

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

TensorFlow 2.16.1, Device: CPU - Batch Size: 1 - Model: ResNet-50 (images/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 17.34 (SE +/- 0.03, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 17.33 (SE +/- 0.03, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.92 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 16.81 (SE +/- 0.03, N = 3)

Y-Cruncher

Pi Digits To Calculate: 500M

Y-Cruncher 0.8.5, Pi Digits To Calculate: 500M (Seconds, Fewer Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 7.236 (SE +/- 0.023, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 7.247 (SE +/- 0.016, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 7.587 (SE +/- 0.006, N = 3)

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 26596.46 (SE +/- 34.50, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 25715.34 (SE +/- 10.24, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 25622.93 (SE +/- 20.77, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 24236.21 (SE +/- 6.88, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

LULESH

LULESH 2.0.3 (z/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 10578.52 (SE +/- 13.71, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 10235.72 (SE +/- 67.88, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 9945.53 (SE +/- 19.00, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 9832.64 (SE +/- 38.48, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

x265

Video Input: Bosphorus 1080p

x265, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 134.52 (SE +/- 0.27, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 134.48 (SE +/- 0.45, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 134.19 (SE +/- 0.13, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 131.28 (SE +/- 0.23, N = 3)
1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3741.86 (SE +/- 17.86, N = 3)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3731.97 (SE +/- 53.14, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 3722.56 (SE +/- 21.03, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3678.46 (SE +/- 25.12, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6


Phoronix Test Suite v10.8.5