Benchmarks by Michael Larabel for a future article.
DDR5-6000 Processor: AMD Ryzen 7 9700X 8-Core @ 5.58GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2124 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
DDR5-8000 Changed Memory to 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36 .
AMD Ryzen 9700X DDR5-8000 OpenBenchmarking.org Phoronix Test Suite AMD Ryzen 7 9700X 8-Core @ 5.58GHz (8 Cores / 16 Threads) ASUS ROG STRIX X670E-E GAMING WIFI (2124 BIOS) AMD Device 14d8 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36 2000GB Corsair MP700 PRO AMD Radeon RX 7900 GRE 16GB AMD Navi 31 HDMI/DP DELL U2723QE Intel I225-V + Intel Wi-Fi 6E Ubuntu 24.04 6.10.0-phx (x86_64) GNOME Shell 46.0 X Server + Wayland 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57) GCC 13.2.0 ext4 3840x2160 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution AMD Ryzen 9700X DDR5-8000 Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a - Python 3.12.3 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
DDR5-6000 vs. DDR5-8000 Comparison Phoronix Test Suite Baseline +2.4% +2.4% +4.8% +4.8% +7.2% +7.2% +9.6% +9.6% 9.7% 8.8% 7.8% 6.9% 6.9% 6.7% 6.6% 6.6% 6.4% 6.4% 6.4% 6.4% 6.2% 6.2% 5.8% 5.8% 5.4% 5.2% 4.1% 3.3% 3.1% 2.2% Copy - Integer Scale - Integer Scale - Floating Point MG.C Copy - Floating Point Average - Integer IS.D M.C.F.B.S - 4096 MiB Triad - Floating Point Average - Floating Point Add - Integer i.i.1.C.P.D M.C.F.B.S - 8192 MiB Add - Floating Point SP.B d.S.M.S - Execution Time Triad - Integer i.i.1.C.P.D V.D.F.I - CPU 4.5% V.D.F.I - CPU 4.3% FT.C F.D.R.F.I - CPU 3.9% Danish Mood - CPU 3.9% P.V.B.D.F - CPU 3.7% CPU - 64 - ResNet-50 3.7% F.D.R.F.I - CPU 3.6% P.V.B.D.F - CPU 3.5% BMW27 - CPU-Only 3.5% LU.C CORAL2 P1 3.3% Chrysler Neon 1M Chess Benchmark 3% DLSC - CPU 3% Barbershop - CPU-Only 3% 1:5 2.8% H.H 2.7% 1:10 2.7% 1:100 2.7% Tomographic Model 2.6% Pathtracer ISPC - Crown 2.6% A.w.3.5.A 2.6% D.R 2.6% allmodconfig 2.5% EP.C 2.5% Junkshop - CPU-Only 2.3% d.S.M.S - Mesh Time 2.3% defconfig 2.3% Memory Copy - 8192 MiB Pathtracer ISPC - Asian Dragon 2.2% Ninja 2.1% RAMspeed SMP RAMspeed SMP RAMspeed SMP NAS Parallel Benchmarks RAMspeed SMP RAMspeed SMP NAS Parallel Benchmarks MBW RAMspeed SMP RAMspeed SMP RAMspeed SMP Xcompact3d Incompact3d MBW RAMspeed SMP NAS Parallel Benchmarks OpenFOAM RAMspeed SMP Xcompact3d Incompact3d OpenVINO OpenVINO NAS Parallel Benchmarks OpenVINO LuxCoreRender OpenVINO PyTorch OpenVINO OpenVINO Blender NAS Parallel Benchmarks Quicksilver OpenRadioss Stockfish LuxCoreRender Blender Memcached SPECFEM3D Memcached Memcached SPECFEM3D Embree NAMD 7-Zip Compression Timed Linux Kernel Compilation NAS Parallel Benchmarks Blender OpenFOAM Timed Linux Kernel Compilation MBW Embree Timed LLVM Compilation DDR5-6000 DDR5-8000
AMD Ryzen 9700X DDR5-8000 mbw: Memory Copy - 4096 MiB mbw: Memory Copy - 8192 MiB mbw: Memory Copy, Fixed Block Size - 4096 MiB mbw: Memory Copy, Fixed Block Size - 8192 MiB ramspeed: Add - Integer ramspeed: Copy - Integer ramspeed: Scale - Integer ramspeed: Triad - Integer ramspeed: Average - Integer ramspeed: Add - Floating Point ramspeed: Copy - Floating Point ramspeed: Scale - Floating Point ramspeed: Triad - Floating Point ramspeed: Average - Floating Point openradioss: Chrysler Neon 1M tensorflow: CPU - 64 - ResNet-50 pytorch: CPU - 64 - ResNet-50 specfem3d: Homogeneous Halfspace llama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.gguf quicksilver: CORAL2 P1 gromacs: MPI CPU - water_GMX50_bare hpcg: 104 104 104 - 60 hpcg: 144 144 144 - 60 npb: BT.C npb: CG.C npb: EP.C npb: FT.C npb: IS.D npb: SP.B specfem3d: Water-layered Halfspace npb: LU.C specfem3d: Tomographic Model specfem3d: Layered Halfspace specfem3d: Mount St. Helens npb: MG.C namd: ATPase with 327,506 Atoms namd: STMV with 1,066,628 Atoms openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Noise Suppression Poconet-Like FP16 - CPU openvino: Noise Suppression Poconet-Like FP16 - CPU openvino: Person Re-Identification Retail FP16 - CPU openvino: Person Re-Identification Retail FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU mt-dgemm: Sustained Floating-Point Rate openfoam: drivaerFastback, Small Mesh Size - Mesh Time openfoam: drivaerFastback, Small Mesh Size - Execution Time incompact3d: input.i3d 129 Cells Per Direction incompact3d: input.i3d 193 Cells Per Direction stockfish: Chess Benchmark compress-7zip: Compression Rating compress-7zip: Decompression Rating build-llvm: Ninja build-linux-kernel: defconfig build-linux-kernel: allmodconfig blender: BMW27 - CPU-Only blender: Junkshop - CPU-Only blender: Barbershop - CPU-Only embree: Pathtracer ISPC - Crown embree: Pathtracer ISPC - Asian Dragon luxcorerender: DLSC - CPU luxcorerender: Danish Mood - CPU nginx: 1000 memcached: 1:5 memcached: 1:10 memcached: 1:100 DDR5-6000 DDR5-8000 23198.280 23116.231 19855.418 19878.508 51853.36 49895.05 51251.15 51920.51 51564.64 51926.94 49909.04 50741.84 51965.13 51389.70 820.56 42.08 64.82 53.008528678 7.47 16833333 1.934 8.11543 8.05589 35029.11 10109.00 1815.82 24413.12 1630.98 12746.11 114.330153633 45578.10 42.121025831 117.514713520 44.323328292 24478.40 2.12201 0.63461 1200.32 3.30 3325.84 2.31 321.50 12.42 109.19 36.57 986.31 4.02 1149.00 6.86 1357.31 2.87 462.63 17.17 147.577416 29.038505 242.70523 18.1375071 75.0751343 22359582 115910 78001 564.344 87.640 1174.619 98.26 129.45 946.72 17.8970 22.0200 2.73 2.15 102714.96 4397076.54 4332006.37 4257935.59 23002.408 23634.834 21162.851 21108.641 55167.97 54740.68 55739.25 54724.69 55009.78 55122.67 53371.47 54705.71 55302.89 54688.29 796.19 42.50 62.49 54.453915271 7.5 16300000 1.962 8.24192 8.00997 35463.32 10099.75 1771.18 25410.29 1738.45 13487.99 112.530785161 47073.62 43.213053327 117.784535379 44.044376031 26178.56 2.06865 0.62830 1151.31 3.45 3210.08 2.40 322.48 12.39 107.44 37.17 952.52 4.17 147.012600 29.705707 229.47445 17.2402153 70.5906601 21701798 114657 76053 576.190 89.653 1204.486 101.69 132.47 975.11 17.4458 21.5386 2.65 2.07 103496.12 4277609.21 4217383.95 4147073.23 OpenBenchmarking.org
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB DDR5-8000 DDR5-6000 5K 10K 15K 20K 25K SE +/- 214.05, N = 15 SE +/- 196.39, N = 8 23634.83 23116.23 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB DDR5-8000 DDR5-6000 5K 10K 15K 20K 25K SE +/- 128.78, N = 3 SE +/- 16.48, N = 3 21162.85 19855.42 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB DDR5-8000 DDR5-6000 5K 10K 15K 20K 25K SE +/- 124.22, N = 3 SE +/- 46.08, N = 3 21108.64 19878.51 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 634.58, N = 3 SE +/- 470.33, N = 3 54740.68 49895.05 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 605.29, N = 3 SE +/- 415.28, N = 3 55739.25 51251.15 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 230.27, N = 3 SE +/- 165.42, N = 3 54724.69 51920.51 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 254.36, N = 3 SE +/- 540.49, N = 3 55009.78 51564.64 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Floating Point DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 101.15, N = 3 SE +/- 181.75, N = 3 55122.67 51926.94 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Floating Point DDR5-8000 DDR5-6000 11K 22K 33K 44K 55K SE +/- 201.79, N = 3 SE +/- 427.15, N = 3 53371.47 49909.04 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Floating Point DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 487.24, N = 7 SE +/- 316.23, N = 3 54705.71 50741.84 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Floating Point DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 191.15, N = 3 SE +/- 99.55, N = 3 55302.89 51965.13 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Floating Point DDR5-8000 DDR5-6000 12K 24K 36K 48K 60K SE +/- 407.90, N = 3 SE +/- 142.31, N = 3 54688.29 51389.70 1. (CC) gcc options: -O3 -march=native
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
Scale: 26
DDR5-6000: The test quit with a non-zero exit status. E: mpirun noticed that process rank 3 with PID 0 on node phoronix-System-Product-Name exited on signal 9 (Killed).
DDR5-8000: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node phoronix-System-Product-Name exited on signal 9 (Killed).
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M DDR5-8000 DDR5-6000 200 400 600 800 1000 SE +/- 0.56, N = 3 SE +/- 1.69, N = 3 796.19 820.56
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 DDR5-8000 DDR5-6000 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 42.50 42.08
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace DDR5-6000 DDR5-8000 12 24 36 48 60 SE +/- 0.41, N = 3 SE +/- 0.34, N = 3 53.01 54.45 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Llama.cpp OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b3067 Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf DDR5-8000 DDR5-6000 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 7.50 7.47 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 DDR5-6000 DDR5-8000 4M 8M 12M 16M 20M SE +/- 26034.17, N = 3 SE +/- 26457.51, N = 3 16833333 16300000 1. (CXX) g++ options: -fopenmp -O3 -march=native
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare DDR5-8000 DDR5-6000 0.4415 0.883 1.3245 1.766 2.2075 SE +/- 0.003, N = 3 SE +/- 0.003, N = 3 1.962 1.934 1. (CXX) g++ options: -O3 -lm
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C DDR5-8000 DDR5-6000 8K 16K 24K 32K 40K SE +/- 4.26, N = 3 SE +/- 12.17, N = 3 35463.32 35029.11 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C DDR5-6000 DDR5-8000 2K 4K 6K 8K 10K SE +/- 19.77, N = 3 SE +/- 14.03, N = 3 10109.00 10099.75 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C DDR5-6000 DDR5-8000 400 800 1200 1600 2000 SE +/- 24.35, N = 3 SE +/- 19.13, N = 3 1815.82 1771.18 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C DDR5-8000 DDR5-6000 5K 10K 15K 20K 25K SE +/- 117.06, N = 3 SE +/- 48.52, N = 3 25410.29 24413.12 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D DDR5-8000 DDR5-6000 400 800 1200 1600 2000 SE +/- 2.47, N = 3 SE +/- 1.50, N = 3 1738.45 1630.98 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B DDR5-8000 DDR5-6000 3K 6K 9K 12K 15K SE +/- 19.82, N = 3 SE +/- 11.87, N = 3 13487.99 12746.11 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace DDR5-8000 DDR5-6000 30 60 90 120 150 SE +/- 0.32, N = 3 SE +/- 0.19, N = 3 112.53 114.33 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C DDR5-8000 DDR5-6000 10K 20K 30K 40K 50K SE +/- 70.13, N = 3 SE +/- 75.43, N = 3 47073.62 45578.10 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model DDR5-6000 DDR5-8000 10 20 30 40 50 SE +/- 0.48, N = 4 SE +/- 0.22, N = 3 42.12 43.21 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace DDR5-6000 DDR5-8000 30 60 90 120 150 SE +/- 0.60, N = 3 SE +/- 1.01, N = 3 117.51 117.78 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens DDR5-8000 DDR5-6000 10 20 30 40 50 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 44.04 44.32 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C DDR5-8000 DDR5-6000 6K 12K 18K 24K 30K SE +/- 6.15, N = 3 SE +/- 6.50, N = 3 26178.56 24478.40 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
NAMD OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms DDR5-6000 DDR5-8000 0.4775 0.955 1.4325 1.91 2.3875 SE +/- 0.00481, N = 3 SE +/- 0.00569, N = 3 2.12201 2.06865
OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms DDR5-6000 DDR5-8000 0.1428 0.2856 0.4284 0.5712 0.714 SE +/- 0.00062, N = 3 SE +/- 0.00026, N = 3 0.63461 0.62830
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU DDR5-6000 DDR5-8000 300 600 900 1200 1500 SE +/- 1.73, N = 3 SE +/- 2.17, N = 3 1200.32 1151.31 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU DDR5-6000 DDR5-8000 0.7763 1.5526 2.3289 3.1052 3.8815 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.30 3.45 MIN: 1.86 / MAX: 13.69 MIN: 1.92 / MAX: 10.98 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU DDR5-6000 DDR5-8000 700 1400 2100 2800 3500 SE +/- 11.14, N = 3 SE +/- 13.93, N = 3 3325.84 3210.08 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU DDR5-6000 DDR5-8000 0.54 1.08 1.62 2.16 2.7 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.31 2.40 MIN: 1.23 / MAX: 5.68 MIN: 1.24 / MAX: 5.33 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU DDR5-8000 DDR5-6000 70 140 210 280 350 SE +/- 1.52, N = 3 SE +/- 1.20, N = 3 322.48 321.50 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU DDR5-8000 DDR5-6000 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 12.39 12.42 MIN: 6.1 / MAX: 16.96 MIN: 5.34 / MAX: 17.16 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU DDR5-6000 DDR5-8000 20 40 60 80 100 SE +/- 0.40, N = 3 SE +/- 0.23, N = 3 109.19 107.44 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU DDR5-6000 DDR5-8000 9 18 27 36 45 SE +/- 0.14, N = 3 SE +/- 0.08, N = 3 36.57 37.17 MIN: 19.07 / MAX: 49.49 MIN: 19.51 / MAX: 50.83 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU DDR5-6000 DDR5-8000 200 400 600 800 1000 SE +/- 4.21, N = 3 SE +/- 0.74, N = 3 986.31 952.52 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU DDR5-6000 DDR5-8000 0.9383 1.8766 2.8149 3.7532 4.6915 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 4.02 4.17 MIN: 2.41 / MAX: 6.15 MIN: 2.83 / MAX: 10.64 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU DDR5-6000 200 400 600 800 1000 SE +/- 1.31, N = 3 1149.00 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU DDR5-6000 2 4 6 8 10 SE +/- 0.01, N = 3 6.86 MIN: 3.89 / MAX: 13.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU DDR5-6000 300 600 900 1200 1500 SE +/- 2.17, N = 3 1357.31 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU DDR5-6000 0.6458 1.2916 1.9374 2.5832 3.229 SE +/- 0.01, N = 3 2.87 MIN: 1.57 / MAX: 13.04 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU DDR5-6000 100 200 300 400 500 SE +/- 0.83, N = 3 462.63 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU DDR5-6000 4 8 12 16 20 SE +/- 0.04, N = 3 17.17 MIN: 9.75 / MAX: 30.65 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
ACES DGEMM OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate DDR5-6000 DDR5-8000 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 147.58 147.01 1. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time DDR5-6000 DDR5-8000 7 14 21 28 35 29.04 29.71 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time DDR5-8000 DDR5-6000 50 100 150 200 250 229.47 242.71 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction DDR5-8000 DDR5-6000 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 17.24 18.14 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction DDR5-8000 DDR5-6000 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 70.59 75.08 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 16.1 Chess Benchmark DDR5-6000 DDR5-8000 5M 10M 15M 20M 25M SE +/- 190312.42, N = 15 SE +/- 206180.75, N = 3 22359582 21701798 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 24.05 Test: Compression Rating DDR5-6000 DDR5-8000 20K 40K 60K 80K 100K SE +/- 63.84, N = 3 SE +/- 87.39, N = 3 115910 114657 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 24.05 Test: Decompression Rating DDR5-6000 DDR5-8000 20K 40K 60K 80K 100K SE +/- 76.48, N = 3 SE +/- 63.01, N = 3 78001 76053 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: BMW27 - Compute: CPU-Only DDR5-6000 DDR5-8000 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.13, N = 3 98.26 101.69
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Junkshop - Compute: CPU-Only DDR5-6000 DDR5-8000 30 60 90 120 150 SE +/- 0.21, N = 3 SE +/- 0.04, N = 3 129.45 132.47
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Barbershop - Compute: CPU-Only DDR5-6000 DDR5-8000 200 400 600 800 1000 SE +/- 0.39, N = 3 SE +/- 1.06, N = 3 946.72 975.11
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown DDR5-6000 DDR5-8000 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 17.90 17.45 MIN: 17.78 / MAX: 18.23 MIN: 17.32 / MAX: 17.78
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon DDR5-6000 DDR5-8000 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 22.02 21.54 MIN: 21.91 / MAX: 22.24 MIN: 21.44 / MAX: 21.8
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU DDR5-6000 DDR5-8000 0.6143 1.2286 1.8429 2.4572 3.0715 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.73 2.65 MIN: 2.65 / MAX: 2.84 MIN: 2.57 / MAX: 2.75
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU DDR5-6000 DDR5-8000 0.4838 0.9676 1.4514 1.9352 2.419 SE +/- 0.02, N = 3 SE +/- 0.01, N = 14 2.15 2.07 MIN: 0.67 / MAX: 2.57 MIN: 0.63 / MAX: 2.52
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 DDR5-8000 DDR5-6000 20K 40K 60K 80K 100K SE +/- 288.35, N = 3 SE +/- 469.67, N = 3 103496.12 102714.96 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:5 DDR5-6000 DDR5-8000 900K 1800K 2700K 3600K 4500K SE +/- 9637.55, N = 3 SE +/- 2275.96, N = 3 4397076.54 4277609.21 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 DDR5-6000 DDR5-8000 900K 1800K 2700K 3600K 4500K SE +/- 6374.80, N = 3 SE +/- 8054.06, N = 3 4332006.37 4217383.95 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 DDR5-6000 DDR5-8000 900K 1800K 2700K 3600K 4500K SE +/- 7416.92, N = 3 SE +/- 6230.21, N = 3 4257935.59 4147073.23 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
DDR5-6000 Processor: AMD Ryzen 7 9700X 8-Core @ 5.58GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2124 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 4 August 2024 12:40 by user phoronix.
DDR5-8000 Processor: AMD Ryzen 7 9700X 8-Core @ 5.58GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2124 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 4 August 2024 19:57 by user phoronix.