AMD Ryzen 9700X DDR5-8000

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2408072-NE-RYZEN970047&rdt&grr.

AMD Ryzen 9700X DDR5-8000ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionDDR5-6000DDR5-8000AMD Ryzen 7 9700X 8-Core @ 5.58GHz (8 Cores / 16 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2124 BIOS)AMD Device 14d82 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G2000GB Corsair MP700 PROAMD Radeon RX 7900 GRE 16GBAMD Navi 31 HDMI/DPDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.10.0-phx (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)GCC 13.2.0ext43840x21602 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Details- Python 3.12.3Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD Ryzen 9700X DDR5-8000hpcg: 144 144 144 - 60build-linux-kernel: allmodconfigblender: Barbershop - CPU-Onlyopenradioss: Chrysler Neon 1Mbuild-llvm: Ninjamt-dgemm: Sustained Floating-Point Ratestockfish: Chess Benchmarkluxcorerender: Danish Mood - CPUtensorflow: CPU - 64 - ResNet-50mbw: Memory Copy - 8192 MiBblender: Junkshop - CPU-Onlyspecfem3d: Layered Halfspacespecfem3d: Water-layered Halfspacehpcg: 104 104 104 - 60blender: BMW27 - CPU-Onlygromacs: MPI CPU - water_GMX50_barenginx: 1000openfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timebuild-linux-kernel: defconfignpb: BT.Cincompact3d: input.i3d 193 Cells Per Directionnamd: STMV with 1,066,628 Atomsllama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.ggufquicksilver: CORAL2 P1memcached: 1:10memcached: 1:5memcached: 1:100luxcorerender: DLSC - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUspecfem3d: Homogeneous Halfspaceramspeed: Scale - Floating Pointspecfem3d: Tomographic Modelspecfem3d: Mount St. Helensnpb: LU.Cmbw: Memory Copy, Fixed Block Size - 8192 MiBembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonramspeed: Add - Integernpb: IS.Dramspeed: Copy - Integerramspeed: Copy - Floating Pointramspeed: Average - Floating Pointramspeed: Scale - Integerramspeed: Add - Floating Pointramspeed: Triad - Integerramspeed: Triad - Floating Pointramspeed: Average - Integerpytorch: CPU - 64 - ResNet-50npb: SP.Bnamd: ATPase with 327,506 Atomsmbw: Memory Copy - 4096 MiBmbw: Memory Copy, Fixed Block Size - 4096 MiBcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingincompact3d: input.i3d 129 Cells Per Directionnpb: FT.Cnpb: CG.Cnpb: MG.Cnpb: EP.CDDR5-6000DDR5-80008.055891174.619946.72820.56564.344147.577416223595822.1542.0823116.231129.45117.514713520114.3301536338.1154398.261.934102714.96242.7052329.03850587.64035029.1175.07513430.634617.47168333334332006.374397076.544257935.592.7336.57109.1912.42321.506.861149.004.02986.313.301200.322.871357.3117.17462.632.313325.8453.00852867850741.8442.12102583144.32332829245578.1019878.50817.897022.020051853.361630.9849895.0549909.0451389.7051251.1551926.9451920.5151965.1351564.6464.8212746.112.1220123198.28019855.4187800111591018.137507124413.1210109.0024478.401815.828.009971204.486975.11796.19576.190147.012600217017982.0742.5023634.834132.47117.784535379112.5307851618.24192101.691.962103496.12229.4744529.70570789.65335463.3270.59066010.628307.5163000004217383.954277609.214147073.232.6537.17107.4412.39322.484.17952.523.451151.312.403210.0854.45391527154705.7143.21305332744.04437603147073.6221108.64117.445821.538655167.971738.4554740.6853371.4754688.2955739.2555122.6754724.6955302.8955009.7862.4913487.992.0686523002.40821162.8517605311465717.240215325410.2910099.7526178.561771.18OpenBenchmarking.org

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60DDR5-6000DDR5-8000246810SE +/- 0.00263, N = 38.055898.009971. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfigDDR5-6000DDR5-800030060090012001500SE +/- 0.11, N = 3SE +/- 0.40, N = 31174.621204.49

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-OnlyDDR5-6000DDR5-80002004006008001000SE +/- 0.39, N = 3SE +/- 1.06, N = 3946.72975.11

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MDDR5-6000DDR5-80002004006008001000SE +/- 1.69, N = 3SE +/- 0.56, N = 3820.56796.19

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaDDR5-6000DDR5-8000120240360480600SE +/- 0.25, N = 3SE +/- 0.28, N = 2564.34576.19

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateDDR5-6000DDR5-8000306090120150SE +/- 0.07, N = 3SE +/- 0.13, N = 3147.58147.011. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 16.1Chess BenchmarkDDR5-6000DDR5-80005M10M15M20M25MSE +/- 190312.42, N = 15SE +/- 206180.75, N = 322359582217017981. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUDDR5-6000DDR5-80000.48380.96761.45141.93522.419SE +/- 0.02, N = 3SE +/- 0.01, N = 142.152.07MIN: 0.67 / MAX: 2.57MIN: 0.63 / MAX: 2.52

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 64 - Model: ResNet-50DDR5-6000DDR5-80001020304050SE +/- 0.02, N = 3SE +/- 0.02, N = 342.0842.50

MBW

Test: Memory Copy - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiBDDR5-6000DDR5-80005K10K15K20K25KSE +/- 196.39, N = 8SE +/- 214.05, N = 1523116.2323634.831. (CC) gcc options: -O3 -march=native

Blender

Blend File: Junkshop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-OnlyDDR5-6000DDR5-8000306090120150SE +/- 0.21, N = 3SE +/- 0.04, N = 3129.45132.47

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered HalfspaceDDR5-6000DDR5-8000306090120150SE +/- 0.60, N = 3SE +/- 1.01, N = 3117.51117.781. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered HalfspaceDDR5-6000DDR5-8000306090120150SE +/- 0.19, N = 3SE +/- 0.32, N = 3114.33112.531. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60DDR5-6000DDR5-8000246810SE +/- 0.00215, N = 3SE +/- 0.00041, N = 38.115438.241921. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-OnlyDDR5-6000DDR5-800020406080100SE +/- 0.11, N = 3SE +/- 0.13, N = 398.26101.69

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareDDR5-6000DDR5-80000.44150.8831.32451.7662.2075SE +/- 0.003, N = 3SE +/- 0.003, N = 31.9341.9621. (CXX) g++ options: -O3 -lm

nginx

Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000DDR5-6000DDR5-800020K40K60K80K100KSE +/- 469.67, N = 3SE +/- 288.35, N = 3102714.96103496.121. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeDDR5-6000DDR5-800050100150200250242.71229.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeDDR5-6000DDR5-800071421283529.0429.711. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfigDDR5-6000DDR5-800020406080100SE +/- 0.34, N = 3SE +/- 0.28, N = 387.6489.65

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CDDR5-6000DDR5-80008K16K24K32K40KSE +/- 12.17, N = 3SE +/- 4.26, N = 335029.1135463.321. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionDDR5-6000DDR5-800020406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 375.0870.591. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAMD

Input: STMV with 1,066,628 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: STMV with 1,066,628 AtomsDDR5-6000DDR5-80000.14280.28560.42840.57120.714SE +/- 0.00062, N = 3SE +/- 0.00026, N = 30.634610.62830

Llama.cpp

Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b3067Model: Meta-Llama-3-8B-Instruct-Q8_0.ggufDDR5-6000DDR5-8000246810SE +/- 0.00, N = 3SE +/- 0.00, N = 37.477.501. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

Quicksilver

Input: CORAL2 P1

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P1DDR5-6000DDR5-80004M8M12M16M20MSE +/- 26034.17, N = 3SE +/- 26457.51, N = 316833333163000001. (CXX) g++ options: -fopenmp -O3 -march=native

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10DDR5-6000DDR5-8000900K1800K2700K3600K4500KSE +/- 6374.80, N = 3SE +/- 8054.06, N = 34332006.374217383.951. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:5DDR5-6000DDR5-8000900K1800K2700K3600K4500KSE +/- 9637.55, N = 3SE +/- 2275.96, N = 34397076.544277609.211. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100DDR5-6000DDR5-8000900K1800K2700K3600K4500KSE +/- 7416.92, N = 3SE +/- 6230.21, N = 34257935.594147073.231. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUDDR5-6000DDR5-80000.61431.22861.84292.45723.0715SE +/- 0.00, N = 3SE +/- 0.00, N = 32.732.65MIN: 2.65 / MAX: 2.84MIN: 2.57 / MAX: 2.75

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Machine Translation EN To DE FP16 - Device: CPUDDR5-6000DDR5-8000918273645SE +/- 0.14, N = 3SE +/- 0.08, N = 336.5737.17MIN: 19.07 / MAX: 49.49MIN: 19.51 / MAX: 50.831. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Machine Translation EN To DE FP16 - Device: CPUDDR5-6000DDR5-800020406080100SE +/- 0.40, N = 3SE +/- 0.23, N = 3109.19107.441. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Road Segmentation ADAS FP16-INT8 - Device: CPUDDR5-6000DDR5-80003691215SE +/- 0.05, N = 3SE +/- 0.06, N = 312.4212.39MIN: 5.34 / MAX: 17.16MIN: 6.1 / MAX: 16.961. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Road Segmentation ADAS FP16-INT8 - Device: CPUDDR5-6000DDR5-800070140210280350SE +/- 1.20, N = 3SE +/- 1.52, N = 3321.50322.481. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Noise Suppression Poconet-Like FP16 - Device: CPUDDR5-6000246810SE +/- 0.01, N = 36.86MIN: 3.89 / MAX: 13.091. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Noise Suppression Poconet-Like FP16 - Device: CPUDDR5-60002004006008001000SE +/- 1.31, N = 31149.001. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Vehicle Bike Detection FP16 - Device: CPUDDR5-6000DDR5-80000.93831.87662.81493.75324.6915SE +/- 0.02, N = 3SE +/- 0.00, N = 34.024.17MIN: 2.41 / MAX: 6.15MIN: 2.83 / MAX: 10.641. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Vehicle Bike Detection FP16 - Device: CPUDDR5-6000DDR5-80002004006008001000SE +/- 4.21, N = 3SE +/- 0.74, N = 3986.31952.521. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Vehicle Detection FP16-INT8 - Device: CPUDDR5-6000DDR5-80000.77631.55262.32893.10523.8815SE +/- 0.01, N = 3SE +/- 0.01, N = 33.303.45MIN: 1.86 / MAX: 13.69MIN: 1.92 / MAX: 10.981. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Vehicle Detection FP16-INT8 - Device: CPUDDR5-6000DDR5-800030060090012001500SE +/- 1.73, N = 3SE +/- 2.17, N = 31200.321151.311. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Re-Identification Retail FP16 - Device: CPUDDR5-60000.64581.29161.93742.58323.229SE +/- 0.01, N = 32.87MIN: 1.57 / MAX: 13.041. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Re-Identification Retail FP16 - Device: CPUDDR5-600030060090012001500SE +/- 2.17, N = 31357.311. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Handwritten English Recognition FP16-INT8 - Device: CPUDDR5-600048121620SE +/- 0.04, N = 317.17MIN: 9.75 / MAX: 30.651. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Handwritten English Recognition FP16-INT8 - Device: CPUDDR5-6000100200300400500SE +/- 0.83, N = 3462.631. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Face Detection Retail FP16-INT8 - Device: CPUDDR5-6000DDR5-80000.541.081.622.162.7SE +/- 0.01, N = 3SE +/- 0.01, N = 32.312.40MIN: 1.23 / MAX: 5.68MIN: 1.24 / MAX: 5.331. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Face Detection Retail FP16-INT8 - Device: CPUDDR5-6000DDR5-80007001400210028003500SE +/- 11.14, N = 3SE +/- 13.93, N = 33325.843210.081. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous HalfspaceDDR5-6000DDR5-80001224364860SE +/- 0.41, N = 3SE +/- 0.34, N = 353.0154.451. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating PointDDR5-6000DDR5-800012K24K36K48K60KSE +/- 316.23, N = 3SE +/- 487.24, N = 750741.8454705.711. (CC) gcc options: -O3 -march=native

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic ModelDDR5-6000DDR5-80001020304050SE +/- 0.48, N = 4SE +/- 0.22, N = 342.1243.211. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. HelensDDR5-6000DDR5-80001020304050SE +/- 0.09, N = 3SE +/- 0.11, N = 344.3244.041. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CDDR5-6000DDR5-800010K20K30K40K50KSE +/- 75.43, N = 3SE +/- 70.13, N = 345578.1047073.621. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiBDDR5-6000DDR5-80005K10K15K20K25KSE +/- 46.08, N = 3SE +/- 124.22, N = 319878.5121108.641. (CC) gcc options: -O3 -march=native

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: CrownDDR5-6000DDR5-800048121620SE +/- 0.00, N = 3SE +/- 0.01, N = 317.9017.45MIN: 17.78 / MAX: 18.23MIN: 17.32 / MAX: 17.78

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian DragonDDR5-6000DDR5-8000510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 322.0221.54MIN: 21.91 / MAX: 22.24MIN: 21.44 / MAX: 21.8

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: IntegerDDR5-6000DDR5-800012K24K36K48K60KSE +/- 356.38, N = 3SE +/- 106.94, N = 351853.3655167.971. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DDDR5-6000DDR5-8000400800120016002000SE +/- 1.50, N = 3SE +/- 2.47, N = 31630.981738.451. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: IntegerDDR5-6000DDR5-800012K24K36K48K60KSE +/- 470.33, N = 3SE +/- 634.58, N = 349895.0554740.681. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating PointDDR5-6000DDR5-800011K22K33K44K55KSE +/- 427.15, N = 3SE +/- 201.79, N = 349909.0453371.471. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating PointDDR5-6000DDR5-800012K24K36K48K60KSE +/- 142.31, N = 3SE +/- 407.90, N = 351389.7054688.291. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: IntegerDDR5-6000DDR5-800012K24K36K48K60KSE +/- 415.28, N = 3SE +/- 605.29, N = 351251.1555739.251. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating PointDDR5-6000DDR5-800012K24K36K48K60KSE +/- 181.75, N = 3SE +/- 101.15, N = 351926.9455122.671. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: IntegerDDR5-6000DDR5-800012K24K36K48K60KSE +/- 165.42, N = 3SE +/- 230.27, N = 351920.5154724.691. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Floating PointDDR5-6000DDR5-800012K24K36K48K60KSE +/- 99.55, N = 3SE +/- 191.15, N = 351965.1355302.891. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: IntegerDDR5-6000DDR5-800012K24K36K48K60KSE +/- 540.49, N = 3SE +/- 254.36, N = 351564.6455009.781. (CC) gcc options: -O3 -march=native

PyTorch

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 64 - Model: ResNet-50DDR5-6000DDR5-80001428425670SE +/- 0.55, N = 3SE +/- 0.78, N = 364.8262.49MIN: 62.33 / MAX: 66.72MIN: 44.17 / MAX: 64.68

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BDDR5-6000DDR5-80003K6K9K12K15KSE +/- 11.87, N = 3SE +/- 19.82, N = 312746.1113487.991. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

Input: ATPase with 327,506 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: ATPase with 327,506 AtomsDDR5-6000DDR5-80000.47750.9551.43251.912.3875SE +/- 0.00481, N = 3SE +/- 0.00569, N = 32.122012.06865

MBW

Test: Memory Copy - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiBDDR5-6000DDR5-80005K10K15K20K25KSE +/- 270.25, N = 4SE +/- 6.86, N = 323198.2823002.411. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiBDDR5-6000DDR5-80005K10K15K20K25KSE +/- 16.48, N = 3SE +/- 128.78, N = 319855.4221162.851. (CC) gcc options: -O3 -march=native

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingDDR5-6000DDR5-800020K40K60K80K100KSE +/- 76.48, N = 3SE +/- 63.01, N = 378001760531. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingDDR5-6000DDR5-800020K40K60K80K100KSE +/- 63.84, N = 3SE +/- 87.39, N = 31159101146571. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per DirectionDDR5-6000DDR5-800048121620SE +/- 0.02, N = 3SE +/- 0.07, N = 318.1417.241. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CDDR5-6000DDR5-80005K10K15K20K25KSE +/- 48.52, N = 3SE +/- 117.06, N = 324413.1225410.291. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CDDR5-6000DDR5-80002K4K6K8K10KSE +/- 19.77, N = 3SE +/- 14.03, N = 310109.0010099.751. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CDDR5-6000DDR5-80006K12K18K24K30KSE +/- 6.50, N = 3SE +/- 6.15, N = 324478.4026178.561. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CDDR5-6000DDR5-8000400800120016002000SE +/- 24.35, N = 3SE +/- 19.13, N = 31815.821771.181. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6


Phoronix Test Suite v10.8.5