Benchmarks in preparation for a future article by Michael Larabel.
LAMMPS Molecular Dynamics Simulator LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
LAMMPS Molecular Dynamics Simulator 23Jun2022 - Model: 20k Atoms (ns/day, More Is Better)
  EPYC 4364P @ DDR5-5200: 8.215 (SE +/- 0.174, N = 9)
  EPYC 4564P @ DDR5-5200: 13.691 (SE +/- 0.297, N = 9)
  EPYC 4564P @ DDR5-4800: 14.471 (SE +/- 0.342, N = 7)
  Xeon E-2488 @ DDR5-4800: 7.579 (SE +/- 0.177, N = 9)
  EPYC 4364P @ DDR5-4800: 8.031 (SE +/- 0.120, N = 9)
  (CXX) g++ options: -O3 -lm -ldl
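Each result carries an "SE +/-" figure and a run count N. As a rough illustration of what such a figure conveys, here is a minimal Python sketch that assumes the conventional definition of the standard error of the mean (sample standard deviation divided by the square root of N). The source does not state the exact formula used by the Phoronix Test Suite, and the per-run values below are hypothetical, not taken from the result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: this is the conventional definition, not necessarily the
    exact formula used to produce the SE figures in the result file.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run ns/day values (illustrative only).
runs = [8.1, 8.4, 7.9, 8.3, 8.2, 8.0, 8.5, 8.3, 8.2]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.3f} ns/day, SE +/- {se:.3f}, N = {len(runs)}")
```

A small SE relative to the mean suggests the runs were consistent; a large one suggests the average should be read with more caution.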
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
TensorFlow 2.16.1 - Device: CPU - Batch Size: 512 - Model: ResNet-50 (images/sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 30.70 (SE +/- 0.00, N = 3)
  EPYC 4564P @ DDR5-5200: 41.02 (SE +/- 0.01, N = 3)
  EPYC 4564P @ DDR5-4800: 39.73 (SE +/- 0.01, N = 3)
  Xeon E-2488 @ DDR5-4800: 22.98 (SE +/- 0.01, N = 3)
  EPYC 4364P @ DDR5-4800: 30.41 (SE +/- 0.01, N = 3)
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenRadioss 2023.09.15 - Model: Chrysler Neon 1M (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 1003.59 (SE +/- 10.68, N = 9)
  EPYC 4564P @ DDR5-5200: 891.84 (SE +/- 4.49, N = 3)
  EPYC 4564P @ DDR5-4800: 915.82 (SE +/- 5.48, N = 3)
  Xeon E-2488 @ DDR5-4800: 983.54 (SE +/- 17.37, N = 7)
  EPYC 4364P @ DDR5-4800: 1060.53 (SE +/- 2.88, N = 3)
OpenRadioss 2023.09.15 - Model: INIVOL and Fluid Structure Interaction Drop Container (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 585.75 (SE +/- 5.87, N = 9)
  EPYC 4564P @ DDR5-5200: 322.67 (SE +/- 2.52, N = 3)
  EPYC 4564P @ DDR5-4800: 349.91 (SE +/- 4.48, N = 9)
  Xeon E-2488 @ DDR5-4800: 538.42 (SE +/- 8.95, N = 9)
  EPYC 4364P @ DDR5-4800: 582.32 (SE +/- 10.02, N = 7)
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
Timed Linux Kernel Compilation 6.8 - Build: allmodconfig (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 1310.96 (SE +/- 0.32, N = 3)
  EPYC 4564P @ DDR5-5200: 714.38 (SE +/- 0.38, N = 3)
  EPYC 4564P @ DDR5-4800: 717.71 (SE +/- 0.17, N = 3)
  Xeon E-2488 @ DDR5-4800: 1508.04 (SE +/- 0.63, N = 3)
  EPYC 4364P @ DDR5-4800: 1312.35 (SE +/- 0.19, N = 3)
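For "Fewer Is Better" timings such as the kernel build, a relative speedup is the baseline time divided by the candidate time. The following Python sketch applies that to the allmodconfig results above; the choice of the Xeon E-2488 as baseline and the helper name speedup are illustrative assumptions, not part of the result file.

```python
def speedup(baseline_seconds, candidate_seconds):
    """Speedup of a candidate over a baseline for a fewer-is-better timing."""
    return baseline_seconds / candidate_seconds


# Build times in seconds, copied from the allmodconfig result above.
kernel_allmodconfig = {
    "EPYC 4364P @ DDR5-5200": 1310.96,
    "EPYC 4564P @ DDR5-5200": 714.38,
    "EPYC 4564P @ DDR5-4800": 717.71,
    "Xeon E-2488 @ DDR5-4800": 1508.04,
    "EPYC 4364P @ DDR5-4800": 1312.35,
}

# Illustrative choice: use the Xeon E-2488 as the reference system.
baseline = kernel_allmodconfig["Xeon E-2488 @ DDR5-4800"]
for name, seconds in kernel_allmodconfig.items():
    print(f"{name}: {speedup(baseline, seconds):.2f}x")
```

A value above 1.00x means the system finished the build faster than the baseline.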
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 2827.88
  EPYC 4564P @ DDR5-5200: 2351.51
  EPYC 4564P @ DDR5-4800: 2445.68
  Xeon E-2488 @ DDR5-4800: 2260.99
  EPYC 4364P @ DDR5-4800: 2926.98
  (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 296.21
  EPYC 4564P @ DDR5-5200: 226.01
  EPYC 4564P @ DDR5-4800: 245.10
  Xeon E-2488 @ DDR5-4800: 225.95
  EPYC 4364P @ DDR5-4800: 276.71
  (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
TensorFlow 2.16.1 - Device: CPU - Batch Size: 256 - Model: ResNet-50 (images/sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 30.63 (SE +/- 0.01, N = 3)
  EPYC 4564P @ DDR5-5200: 40.68 (SE +/- 0.01, N = 3)
  EPYC 4564P @ DDR5-4800: 39.37 (SE +/- 0.01, N = 3)
  Xeon E-2488 @ DDR5-4800: 23.07 (SE +/- 0.01, N = 3)
  EPYC 4364P @ DDR5-4800: 30.33 (SE +/- 0.00, N = 3)
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
Blender 4.1 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 925.38 (SE +/- 0.87, N = 3)
  EPYC 4564P @ DDR5-5200: 481.15 (SE +/- 0.46, N = 3)
  EPYC 4564P @ DDR5-4800: 484.46 (SE +/- 0.08, N = 3)
  Xeon E-2488 @ DDR5-4800: 1281.54 (SE +/- 4.06, N = 3)
  EPYC 4364P @ DDR5-4800: 928.10 (SE +/- 0.87, N = 3)
OpenRadioss 2023.09.15 - Model: Bird Strike on Windshield (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 266.65 (SE +/- 7.66, N = 9)
  EPYC 4564P @ DDR5-5200: 183.64 (SE +/- 2.93, N = 12)
  EPYC 4564P @ DDR5-4800: 177.58 (SE +/- 2.98, N = 12)
  Xeon E-2488 @ DDR5-4800: 259.88 (SE +/- 3.59, N = 9)
  EPYC 4364P @ DDR5-4800: 276.24 (SE +/- 6.27, N = 9)
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript runtime built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.
Timed Node.js Compilation 21.7.2 - Time To Compile (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 739.34 (SE +/- 0.38, N = 3)
  EPYC 4564P @ DDR5-5200: 421.43 (SE +/- 0.68, N = 3)
  EPYC 4564P @ DDR5-4800: 427.64 (SE +/- 0.94, N = 3)
  Xeon E-2488 @ DDR5-4800: 915.94 (SE +/- 1.08, N = 3)
  EPYC 4364P @ DDR5-4800: 737.29 (SE +/- 0.37, N = 3)
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenVKL 2.0.0 - Benchmark: vklBenchmarkCPU ISPC (Items / Sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 331 (SE +/- 0.33, N = 3; MIN: 25 / MAX: 4820)
  EPYC 4564P @ DDR5-5200: 603 (SE +/- 0.67, N = 3; MIN: 48 / MAX: 8450)
  EPYC 4564P @ DDR5-4800: 596 (SE +/- 0.58, N = 3; MIN: 47 / MAX: 8430)
  Xeon E-2488 @ DDR5-4800: 288 (SE +/- 0.33, N = 3; MIN: 20 / MAX: 4338)
  EPYC 4364P @ DDR5-4800: 327 (SE +/- 2.52, N = 3; MIN: 25 / MAX: 4775)
OpenRadioss 2023.09.15 - Model: Bumper Beam (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 145.32 (SE +/- 3.91, N = 12)
  EPYC 4564P @ DDR5-5200: 108.64 (SE +/- 1.49, N = 15)
  EPYC 4564P @ DDR5-4800: 107.17 (SE +/- 1.80, N = 15)
  Xeon E-2488 @ DDR5-4800: 153.90 (SE +/- 2.49, N = 12)
  EPYC 4364P @ DDR5-4800: 140.26 (SE +/- 2.19, N = 12)
TensorFlow 2.16.1 - Device: CPU - Batch Size: 512 - Model: GoogLeNet (images/sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 91.90 (SE +/- 0.01, N = 3)
  EPYC 4564P @ DDR5-5200: 132.62 (SE +/- 0.02, N = 3)
  EPYC 4564P @ DDR5-4800: 129.06 (SE +/- 0.03, N = 3)
  Xeon E-2488 @ DDR5-4800: 76.13 (SE +/- 0.00, N = 3)
  EPYC 4364P @ DDR5-4800: 91.07 (SE +/- 0.00, N = 3)
Timed LLVM Compilation This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.
Timed LLVM Compilation 16.0 - Build System: Unix Makefiles (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 662.52 (SE +/- 0.38, N = 3)
  EPYC 4564P @ DDR5-5200: 387.01 (SE +/- 1.00, N = 3)
  EPYC 4564P @ DDR5-4800: 395.19 (SE +/- 1.22, N = 3)
  Xeon E-2488 @ DDR5-4800: 719.39 (SE +/- 0.50, N = 3)
  EPYC 4364P @ DDR5-4800: 670.27 (SE +/- 0.98, N = 3)
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
GPAW 23.6 - Input: Carbon Nanotube (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 205.87 (SE +/- 2.92, N = 3)
  EPYC 4564P @ DDR5-5200: 154.99 (SE +/- 1.88, N = 3)
  EPYC 4564P @ DDR5-4800: 166.51 (SE +/- 2.33, N = 12)
  Xeon E-2488 @ DDR5-4800: 228.90 (SE +/- 8.24, N = 12)
  EPYC 4364P @ DDR5-4800: 217.50 (SE +/- 4.83, N = 12)
  (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Xmrig Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
Xmrig 6.21 - Variant: GhostRider - Hash Count: 1M (H/s, More Is Better)
  EPYC 4364P @ DDR5-5200: 1699.3 (SE +/- 10.83, N = 3)
  EPYC 4564P @ DDR5-5200: 3266.9 (SE +/- 22.46, N = 3)
  EPYC 4564P @ DDR5-4800: 3239.4 (SE +/- 4.40, N = 3)
  Xeon E-2488 @ DDR5-4800: 1742.9 (SE +/- 3.35, N = 3)
  EPYC 4364P @ DDR5-4800: 1690.9 (SE +/- 5.04, N = 3)
  (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenRadioss 2023.09.15 - Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 162.03 (SE +/- 8.68, N = 9)
  EPYC 4564P @ DDR5-5200: 110.05 (SE +/- 1.00, N = 3)
  EPYC 4564P @ DDR5-4800: 101.62 (SE +/- 3.52, N = 15)
  Xeon E-2488 @ DDR5-4800: 177.74 (SE +/- 5.45, N = 12)
  EPYC 4364P @ DDR5-4800: 175.46 (SE +/- 9.50, N = 9)
Llama.cpp Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.
Llama.cpp b1808 - Model: llama-2-70b-chat.Q5_0.gguf (Tokens Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 1.06 (SE +/- 0.00, N = 3)
  EPYC 4564P @ DDR5-5200: 1.21 (SE +/- 0.00, N = 3)
  EPYC 4564P @ DDR5-4800: 1.16 (SE +/- 0.00, N = 3)
  Xeon E-2488 @ DDR5-4800: 1.36 (SE +/- 0.00, N = 3)
  EPYC 4364P @ DDR5-4800: 1.09 (SE +/- 0.00, N = 3)
  (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
OpenRadioss 2023.09.15 - Model: Cell Phone Drop Test (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 95.48 (SE +/- 3.06, N = 15)
  EPYC 4564P @ DDR5-5200: 67.71 (SE +/- 1.68, N = 12)
  EPYC 4564P @ DDR5-4800: 68.68 (SE +/- 1.27, N = 15)
  Xeon E-2488 @ DDR5-4800: 93.26 (SE +/- 3.30, N = 15)
  EPYC 4364P @ DDR5-4800: 100.19 (SE +/- 3.05, N = 12)
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
Timed Gem5 Compilation 23.0.1 - Time To Compile (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 432.22 (SE +/- 0.34, N = 3)
  EPYC 4564P @ DDR5-5200: 273.49 (SE +/- 0.80, N = 3)
  EPYC 4564P @ DDR5-4800: 277.73 (SE +/- 0.46, N = 3)
  Xeon E-2488 @ DDR5-4800: 468.38 (SE +/- 0.89, N = 3)
  EPYC 4364P @ DDR5-4800: 436.05 (SE +/- 0.18, N = 3)
High Performance Conjugate Gradient HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.
High Performance Conjugate Gradient 3.1 - X Y Z: 104 104 104 - RT: 60 (GFLOP/s, More Is Better)
  EPYC 4364P @ DDR5-5200: 6.73761 (SE +/- 0.04118, N = 3)
  EPYC 4564P @ DDR5-5200: 7.22104 (SE +/- 0.05325, N = 3)
  EPYC 4564P @ DDR5-4800: 6.94846 (SE +/- 0.01035, N = 3)
  Xeon E-2488 @ DDR5-4800: 7.58350 (SE +/- 0.12782, N = 12)
  EPYC 4364P @ DDR5-4800: 6.67485 (SE +/- 0.06329, N = 12)
  (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
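Because each EPYC part was tested at both DDR5-4800 and DDR5-5200, the effect of memory speed can be expressed as a percentage change. The Python sketch below does this for the HPCG figures above; the helper name percent_gain and the nested dictionary layout are illustrative assumptions.

```python
def percent_gain(slower, faster):
    """Percentage improvement of `faster` over `slower` (more-is-better metric)."""
    return (faster / slower - 1.0) * 100.0


# HPCG results in GFLOP/s, copied from the result above.
hpcg_gflops = {
    "EPYC 4364P": {"DDR5-4800": 6.67485, "DDR5-5200": 6.73761},
    "EPYC 4564P": {"DDR5-4800": 6.94846, "DDR5-5200": 7.22104},
}

for cpu, result in hpcg_gflops.items():
    gain = percent_gain(result["DDR5-4800"], result["DDR5-5200"])
    print(f"{cpu}: {gain:+.2f}% from DDR5-4800 to DDR5-5200")
```

For the EPYC 4364P the difference (about 0.06 GFLOP/s) is of the same order as the reported standard errors, so it is best read with caution.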
Helsing Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.
Helsing 1.0-beta - Digit Range: 14 digit (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 375.92 (SE +/- 0.18, N = 3)
  EPYC 4564P @ DDR5-5200: 200.77 (SE +/- 0.37, N = 3)
  EPYC 4564P @ DDR5-4800: 199.83 (SE +/- 0.45, N = 3)
  Xeon E-2488 @ DDR5-4800: 437.41 (SE +/- 0.23, N = 3)
  EPYC 4364P @ DDR5-4800: 372.10 (SE +/- 0.35, N = 3)
  (CC) gcc options: -O2 -pthread
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.
Stockfish 16.1 - Chess Benchmark (Nodes Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 23143858 (SE +/- 160295.89, N = 3)
  EPYC 4564P @ DDR5-5200: 44270858 (SE +/- 574948.55, N = 15)
  EPYC 4564P @ DDR5-4800: 42791247 (SE +/- 511472.18, N = 4)
  Xeon E-2488 @ DDR5-4800: 16898291 (SE +/- 121681.53, N = 15)
  EPYC 4364P @ DDR5-4800: 22631087 (SE +/- 150504.97, N = 15)
  Per-configuration flags (listed for four configurations): -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl
  (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
Apache IoTDB Apache IoTDB is a time series database and this benchmark is facilitated using the IoT Benchmark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 (Average Latency, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 462.14 (SE +/- 4.56, N = 3; MAX: 27920.53)
  EPYC 4564P @ DDR5-5200: 399.56 (SE +/- 4.31, N = 3; MAX: 27980.82)
  EPYC 4564P @ DDR5-4800: 415.04 (SE +/- 7.45, N = 3; MAX: 27057.75)
  Xeon E-2488 @ DDR5-4800: 524.13 (SE +/- 1.92, N = 3; MAX: 26588.63)
  EPYC 4364P @ DDR5-4800: 460.21 (SE +/- 4.57, N = 3; MAX: 27836.55)
Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 (point/sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 64182563 (SE +/- 441409.11, N = 3)
  EPYC 4564P @ DDR5-5200: 73716025 (SE +/- 597457.28, N = 3)
  EPYC 4564P @ DDR5-4800: 71135797 (SE +/- 669845.56, N = 3)
  Xeon E-2488 @ DDR5-4800: 56494499 (SE +/- 207001.08, N = 3)
  EPYC 4364P @ DDR5-4800: 64354682 (SE +/- 766998.72, N = 3)
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.3 - Algorithm: RSA4096 (verify/s, More Is Better)
  EPYC 4364P @ DDR5-5200: 194210.3 (SE +/- 20.61, N = 3)
  EPYC 4564P @ DDR5-5200: 373632.3 (SE +/- 143.01, N = 3)
  EPYC 4564P @ DDR5-4800: 372775.0 (SE +/- 28.66, N = 3)
  Xeon E-2488 @ DDR5-4800: 195618.8 (SE +/- 1626.68, N = 3)
  EPYC 4364P @ DDR5-4800: 193565.9 (SE +/- 1102.39, N = 3)
  (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL 3.3 - Algorithm: RSA4096 (sign/s, More Is Better)
  EPYC 4364P @ DDR5-5200: 7697.3 (SE +/- 2.36, N = 3)
  EPYC 4564P @ DDR5-5200: 14289.7 (SE +/- 5.90, N = 3)
  EPYC 4564P @ DDR5-4800: 14222.8 (SE +/- 10.46, N = 3)
  Xeon E-2488 @ DDR5-4800: 3000.2 (SE +/- 28.09, N = 3)
  EPYC 4364P @ DDR5-4800: 7692.3 (SE +/- 6.93, N = 3)
  (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 (Average Latency, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 122.55 (SE +/- 0.48, N = 3; MAX: 23938.16)
  EPYC 4564P @ DDR5-5200: 103.37 (SE +/- 0.21, N = 3; MAX: 23852.59)
  EPYC 4564P @ DDR5-4800: 106.72 (SE +/- 0.16, N = 3; MAX: 23939.91)
  Xeon E-2488 @ DDR5-4800: 137.92 (SE +/- 0.51, N = 3; MAX: 23917.2)
  EPYC 4364P @ DDR5-4800: 123.74 (SE +/- 0.94, N = 3; MAX: 23948.92)
Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 (point/sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 62951458 (SE +/- 268932.10, N = 3)
  EPYC 4564P @ DDR5-5200: 74075244 (SE +/- 205604.00, N = 3)
  EPYC 4564P @ DDR5-4800: 71922519 (SE +/- 109473.22, N = 3)
  Xeon E-2488 @ DDR5-4800: 56035278 (SE +/- 209117.79, N = 3)
  EPYC 4364P @ DDR5-4800: 62420035 (SE +/- 503428.59, N = 3)
TensorFlow 2.16.1 - Device: CPU - Batch Size: 256 - Model: GoogLeNet (images/sec, More Is Better)
  EPYC 4364P @ DDR5-5200: 91.77 (SE +/- 0.04, N = 3)
  EPYC 4564P @ DDR5-5200: 132.13 (SE +/- 0.08, N = 3)
  EPYC 4564P @ DDR5-4800: 128.54 (SE +/- 0.02, N = 3)
  Xeon E-2488 @ DDR5-4800: 76.84 (SE +/- 0.05, N = 3)
  EPYC 4364P @ DDR5-4800: 90.98 (SE +/- 0.00, N = 3)
MariaDB This is a MariaDB MySQL database server benchmark making use of sysbench rather than the existing pts/mysqlslap test profile that uses MariaDB with mysqlslap/mariadb-slap as the benchmark driver. Learn more via the OpenBenchmarking.org test page.
MariaDB 11.5 - Test: oltp_read_write - Threads: 32 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 158499 (SE +/- 73.28, N = 3)
  EPYC 4564P @ DDR5-5200: 181519 (SE +/- 158.42, N = 3)
  EPYC 4564P @ DDR5-4800: 184015 (SE +/- 135.64, N = 3)
  Xeon E-2488 @ DDR5-4800: 180020 (SE +/- 554.89, N = 3)
  EPYC 4364P @ DDR5-4800: 159448 (SE +/- 199.70, N = 3)
  Per-configuration flags: -shared -lrt -lstdc++ (three configurations); -lnuma -lpcre2-8 -lcrypt -laio -lssl -lcrypto (two configurations)
  (CXX) g++ options: -fPIC -pie -fstack-protector -O3 -lpthread -lz -ldl -lm
SecureMark SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.
SecureMark 1.0.4 - Benchmark: SecureMark-TLS (marks, More Is Better)
  EPYC 4364P @ DDR5-5200: 430421 (SE +/- 1282.82, N = 3)
  EPYC 4564P @ DDR5-5200: 429779 (SE +/- 1954.24, N = 3)
  EPYC 4564P @ DDR5-4800: 435859 (SE +/- 1244.71, N = 3)
  Xeon E-2488 @ DDR5-4800: 426706 (SE +/- 739.32, N = 3)
  EPYC 4364P @ DDR5-4800: 437475 (SE +/- 1137.99, N = 3)
  (CC) gcc options: -pedantic -O3
MariaDB 11.5 - Test: oltp_read_write - Threads: 64 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 168593 (SE +/- 211.83, N = 3)
  EPYC 4564P @ DDR5-5200: 201182 (SE +/- 389.87, N = 3)
  EPYC 4564P @ DDR5-4800: 201579 (SE +/- 108.10, N = 3)
  Xeon E-2488 @ DDR5-4800: 186418 (SE +/- 562.40, N = 3)
  EPYC 4364P @ DDR5-4800: 168388 (SE +/- 60.79, N = 3)
MariaDB 11.5 - Test: oltp_write_only - Threads: 64 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 265119 (SE +/- 419.19, N = 3)
  EPYC 4564P @ DDR5-5200: 348099 (SE +/- 56.85, N = 3)
  EPYC 4564P @ DDR5-4800: 349473 (SE +/- 559.43, N = 3)
  Xeon E-2488 @ DDR5-4800: 296034 (SE +/- 798.38, N = 3)
  EPYC 4364P @ DDR5-4800: 265093 (SE +/- 526.67, N = 3)
MariaDB 11.5 - Test: oltp_update_non_index - Threads: 64 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 201401 (SE +/- 295.88, N = 3)
  EPYC 4564P @ DDR5-5200: 233313 (SE +/- 290.99, N = 3)
  EPYC 4564P @ DDR5-4800: 234051 (SE +/- 437.08, N = 3)
  Xeon E-2488 @ DDR5-4800: 220068 (SE +/- 516.32, N = 3)
  EPYC 4364P @ DDR5-4800: 200246 (SE +/- 410.68, N = 3)
MariaDB 11.5 - Test: oltp_update_index - Threads: 32 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 198779 (SE +/- 331.57, N = 3)
  EPYC 4564P @ DDR5-5200: 207212 (SE +/- 307.26, N = 3)
  EPYC 4564P @ DDR5-4800: 206240 (SE +/- 113.22, N = 3)
  Xeon E-2488 @ DDR5-4800: 210974 (SE +/- 290.25, N = 3)
  EPYC 4364P @ DDR5-4800: 198305 (SE +/- 249.27, N = 3)
MariaDB 11.5 - Test: oltp_update_index - Threads: 64 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 177579 (SE +/- 308.54, N = 3)
  EPYC 4564P @ DDR5-5200: 198538 (SE +/- 282.52, N = 3)
  EPYC 4564P @ DDR5-4800: 200156 (SE +/- 130.01, N = 3)
  Xeon E-2488 @ DDR5-4800: 187089 (SE +/- 250.58, N = 3)
  EPYC 4364P @ DDR5-4800: 177842 (SE +/- 125.38, N = 3)
MariaDB 11.5 - Test: oltp_write_only - Threads: 32 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 306926 (SE +/- 471.57, N = 3)
  EPYC 4564P @ DDR5-5200: 346114 (SE +/- 441.31, N = 3)
  EPYC 4564P @ DDR5-4800: 352931 (SE +/- 101.24, N = 3)
  Xeon E-2488 @ DDR5-4800: 304169 (SE +/- 574.53, N = 3)
  EPYC 4364P @ DDR5-4800: 310088 (SE +/- 133.55, N = 3)
MariaDB 11.5 - Test: oltp_update_non_index - Threads: 32 (Queries Per Second, More Is Better)
  EPYC 4364P @ DDR5-5200: 224118 (SE +/- 399.51, N = 3)
  EPYC 4564P @ DDR5-5200: 239192 (SE +/- 861.02, N = 3)
  EPYC 4564P @ DDR5-4800: 239542 (SE +/- 466.56, N = 3)
  Xeon E-2488 @ DDR5-4800: 253110 (SE +/- 223.46, N = 3)
  EPYC 4364P @ DDR5-4800: 224636 (SE +/- 181.91, N = 3)
For all MariaDB results above:
  Per-configuration flags: -shared -lrt -lstdc++ (three configurations); -lnuma -lpcre2-8 -lcrypt -laio -lssl -lcrypto (two configurations)
  (CXX) g++ options: -fPIC -pie -fstack-protector -O3 -lpthread -lz -ldl -lm
Timed Godot Game Engine Compilation This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.
Timed Godot Game Engine Compilation 4.0 - Time To Compile (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 326.16 (SE +/- 0.15, N = 3)
  EPYC 4564P @ DDR5-5200: 188.53 (SE +/- 0.40, N = 3)
  EPYC 4564P @ DDR5-4800: 191.08 (SE +/- 0.56, N = 3)
  Xeon E-2488 @ DDR5-4800: 364.42 (SE +/- 0.34, N = 3)
  EPYC 4364P @ DDR5-4800: 329.18 (SE +/- 0.20, N = 3)
Blender 4.1 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
  EPYC 4364P @ DDR5-5200: 315.13 (SE +/- 0.40, N = 3)
  EPYC 4564P @ DDR5-5200: 165.35 (SE +/- 0.18, N = 3)
  EPYC 4564P @ DDR5-4800: 166.82 (SE +/- 0.09, N = 3)
  Xeon E-2488 @ DDR5-4800: 411.15 (SE +/- 1.05, N = 3)
  EPYC 4364P @ DDR5-4800: 316.02 (SE +/- 0.09, N = 3)
ClickHouse ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Third Run (Queries Per Minute, Geo Mean, More Is Better)
  EPYC 4364P @ DDR5-5200: 232.47 (SE +/- 4.93, N = 3; MIN: 10.67 / MAX: 12000)
  EPYC 4564P @ DDR5-5200: 251.80 (SE +/- 2.29, N = 5; MIN: 15.75 / MAX: 10000)
  EPYC 4564P @ DDR5-4800: 241.90 (SE +/- 1.20, N = 3; MIN: 15.08 / MAX: 7500)
  Xeon E-2488 @ DDR5-4800: 249.90 (SE +/- 1.74, N = 3; MIN: 9.25 / MAX: 12000)
  EPYC 4364P @ DDR5-4800: 228.79 (SE +/- 1.72, N = 3; MIN: 10.54 / MAX: 10000)
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Second Run (Queries Per Minute, Geo Mean, More Is Better)
  EPYC 4364P @ DDR5-5200: 236.47 (SE +/- 1.00, N = 3; MIN: 10.66 / MAX: 10000)
  EPYC 4564P @ DDR5-5200: 248.89 (SE +/- 1.51, N = 5; MIN: 15.92 / MAX: 8571.43)
  EPYC 4564P @ DDR5-4800: 240.37 (SE +/- 1.60, N = 3; MIN: 15.16 / MAX: 10000)
  Xeon E-2488 @ DDR5-4800: 249.50 (SE +/- 3.87, N = 3; MIN: 9.27 / MAX: 12000)
  EPYC 4364P @ DDR5-4800: 226.35 (SE +/- 1.92, N = 3; MIN: 10.69 / MAX: 10000)
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, First Run / Cold Cache (Queries Per Minute, Geo Mean, More Is Better)
  EPYC 4364P @ DDR5-5200: 222.47 (SE +/- 1.84, N = 3; MIN: 10.49 / MAX: 12000)
  EPYC 4564P @ DDR5-5200: 227.39 (SE +/- 2.37, N = 5; MIN: 13.31 / MAX: 10000)
  EPYC 4564P @ DDR5-4800: 222.80 (SE +/- 2.08, N = 3; MIN: 12.9 / MAX: 7500)
  Xeon E-2488 @ DDR5-4800: 234.39 (SE +/- 1.98, N = 3; MIN: 9.14 / MAX: 12000)
  EPYC 4364P @ DDR5-4800: 211.59 (SE +/- 2.84, N = 3; MIN: 10.43 / MAX: 10000)
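The ClickHouse description notes that the reported value is a geometric mean across the separate queries. A minimal Python sketch of that aggregation follows; the per-query rates are hypothetical (the result file only exposes the aggregate plus MIN/MAX), and the exact aggregation code used by the test profile is not shown in the source.

```python
import math


def geometric_mean(values):
    """Geometric mean via the mean of logarithms (values must be positive)."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs a non-empty list of positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Hypothetical per-query rates in queries per minute (illustrative only).
per_query_qpm = [12000.0, 7500.0, 300.0, 45.0, 10.67]

print(f"Geometric mean: {geometric_mean(per_query_qpm):.2f} queries per minute")
print(f"Arithmetic mean: {sum(per_query_qpm) / len(per_query_qpm):.2f} queries per minute")
```

The geometric mean keeps a few very fast queries from dominating the aggregate, which is why it can sit far below the MAX values listed with each result.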
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  EPYC 4364P @ DDR5-5200: 37149.88 (SE +/- 613.23, N = 15)
  EPYC 4564P @ DDR5-5200: 47556.69 (SE +/- 521.71, N = 15)
  EPYC 4564P @ DDR5-4800: 46710.34 (SE +/- 545.16, N = 15)
  Xeon E-2488 @ DDR5-4800: 40491.56 (SE +/- 714.33, N = 15)
  EPYC 4364P @ DDR5-4800: 37048.80 (SE +/- 662.65, N = 12)
  (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  Open MPI 4.1.6
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 30.53 40.24 39.07 23.10 30.22
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Classroom - Compute: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 1.57, N = 3 SE +/- 0.38, N = 3 259.46 136.11 136.92 336.94 259.85
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 512 - Model: AlexNet EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 90 180 270 360 450 SE +/- 0.55, N = 3 SE +/- 1.74, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.11, N = 3 240.23 425.47 422.88 192.28 240.74
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 12.91, N = 15 SE +/- 1.90, N = 3 SE +/- 4.73, N = 3 SE +/- 11.74, N = 14 SE +/- 8.12, N = 15 864.56 567.07 567.02 960.86 889.17 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.3969 0.7938 1.1907 1.5876 1.9845 SE +/- 0.018640, N = 15 SE +/- 0.005924, N = 3 SE +/- 0.014643, N = 3 SE +/- 0.013081, N = 14 SE +/- 0.010436, N = 15 1.160540 1.763490 1.763840 1.042812 1.125970 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.0039 2.0078 3.0117 4.0156 5.0195 SE +/- 0.00310, N = 3 SE +/- 0.05300, N = 15 SE +/- 0.03044, N = 15 SE +/- 0.03583, N = 3 SE +/- 0.02531, N = 15 3.09150 2.38299 2.49599 4.46170 3.05542 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 90 180 270 360 450 SE +/- 0.32, N = 3 SE +/- 9.29, N = 15 SE +/- 4.78, N = 15 SE +/- 1.81, N = 3 SE +/- 2.89, N = 15 323.38 422.42 401.35 224.11 327.53 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
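The ONNX Runtime results are given both as inference time in milliseconds and as inferences per second. The sketch below converts the reported ResNet50 v1-12-int8 latencies into a rate and prints them next to the reported throughput. The derived and reported figures are close but not identical, possibly because each metric is averaged over the runs separately, though the source does not say so.

```python
def inferences_per_second(latency_ms):
    """Convert a per-inference latency in milliseconds to a rate per second."""
    return 1000.0 / latency_ms


# (latency in ms, reported inferences/s) copied from the ResNet50 v1-12-int8
# Standard executor charts above.
reported = {
    "EPYC 4364P @ DDR5-5200": (3.09150, 323.38),
    "EPYC 4564P @ DDR5-5200": (2.38299, 422.42),
    "Xeon E-2488 @ DDR5-4800": (4.46170, 224.11),
}

for system, (latency_ms, reported_rate) in reported.items():
    derived = inferences_per_second(latency_ms)
    print(f"{system}: derived {derived:.2f}/s, reported {reported_rate:.2f}/s")
```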
miniBUDE MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 26.36 49.99 49.92 12.55 26.49 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 0.13, N = 3 SE +/- 1.54, N = 3 SE +/- 0.90, N = 3 SE +/- 0.15, N = 3 SE +/- 0.43, N = 3 658.93 1249.62 1248.08 313.74 662.34 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.38.2 VGR Performance Metric EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 90K 180K 270K 360K 450K 226198 397138 394575 194127 229365 1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
Apache IoTDB Apache IoTDB is a time series database and this benchmark is facilitated using the IoT Benchmark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 60 120 180 240 300 SE +/- 1.82, N = 3 SE +/- 2.35, N = 3 SE +/- 2.23, N = 3 SE +/- 4.53, N = 3 SE +/- 1.83, N = 3 268.42 235.16 238.34 290.14 267.99 MAX: 27440.82 MAX: 26882.16 MAX: 26718.64 MAX: 26535.45 MAX: 27513.85
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 17M 34M 51M 68M 85M SE +/- 261270.67, N = 3 SE +/- 360981.97, N = 3 SE +/- 362879.93, N = 3 SE +/- 787042.09, N = 3 SE +/- 239601.11, N = 3 69144164 77658914 75893047 62655549 68897602
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.29, N = 3 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 SE +/- 0.20, N = 3 SE +/- 0.06, N = 3 68.16 60.66 62.09 77.24 69.05 MAX: 23889.9 MAX: 23861.25 MAX: 23874.44 MAX: 23891.04 MAX: 23877.52
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 17M 34M 51M 68M 85M SE +/- 296062.71, N = 3 SE +/- 390613.00, N = 3 SE +/- 113056.05, N = 3 SE +/- 232749.13, N = 3 SE +/- 75302.81, N = 3 69561449 77605627 75706552 61842388 68672756
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 80 160 240 320 400 SE +/- 6.12, N = 3 SE +/- 5.40, N = 3 SE +/- 8.45, N = 3 SE +/- 10.20, N = 3 SE +/- 1.80, N = 3 347.97 321.29 322.24 382.48 359.26 MAX: 27085.8 MAX: 27126.33 MAX: 27055.68 MAX: 27376.22 MAX: 27346.14
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20M 40M 60M 80M 100M SE +/- 255843.12, N = 3 SE +/- 818556.17, N = 3 SE +/- 919852.08, N = 3 SE +/- 922667.78, N = 3 SE +/- 357039.39, N = 3 71338740 78454316 77609975 64886364 69856949
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.82, N = 3 SE +/- 0.76, N = 3 SE +/- 0.54, N = 3 SE +/- 0.47, N = 3 105.58 92.08 95.72 119.49 106.44 MAX: 15103.61 MAX: 11356.99 MAX: 11335.36 MAX: 13845.12 MAX: 13870.13
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20M 40M 60M 80M 100M SE +/- 282897.65, N = 3 SE +/- 368630.91, N = 3 SE +/- 224737.25, N = 3 SE +/- 262085.32, N = 3 SE +/- 351738.51, N = 3 70305285 79315319 76780782 62558098 69450850
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: ChaCha20 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30000M 60000M 90000M 120000M 150000M SE +/- 32786296.71, N = 3 SE +/- 13732325.68, N = 3 SE +/- 23437434.86, N = 3 SE +/- 7700475.42, N = 3 SE +/- 85172787.78, N = 3 68366108810 129853526473 129320991313 34651855800 68362354417 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: AES-128-GCM EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50000M 100000M 150000M 200000M 250000M SE +/- 48206635.82, N = 3 SE +/- 197196146.33, N = 3 SE +/- 105284888.52, N = 3 SE +/- 45687236.99, N = 3 SE +/- 53677368.01, N = 3 133134074673 251224402513 250497225070 64812725680 132890512590 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: ChaCha20-Poly1305 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20000M 40000M 60000M 80000M 100000M SE +/- 11858939.41, N = 3 SE +/- 63328398.60, N = 3 SE +/- 14328313.13, N = 3 SE +/- 5622113.45, N = 3 SE +/- 17581304.10, N = 3 48542739663 92129622150 91743101110 23800263020 48632082067 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: AES-256-GCM EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50000M 100000M 150000M 200000M 250000M SE +/- 101608115.02, N = 3 SE +/- 27189836.88, N = 3 SE +/- 47970633.04, N = 3 SE +/- 5611535.27, N = 3 SE +/- 93170961.59, N = 3 113941258877 215336847677 214525919597 56193249190 113843625530 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: SHA512 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2000M 4000M 6000M 8000M 10000M SE +/- 749962.39, N = 3 SE +/- 2951460.39, N = 3 SE +/- 1453158.01, N = 3 SE +/- 2787035.99, N = 3 SE +/- 1105217.67, N = 3 5794405307 11041579923 11004843967 4429397160 5813555230 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: SHA256 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 7000M 14000M 21000M 28000M 35000M SE +/- 8102567.82, N = 3 SE +/- 7892274.19, N = 3 SE +/- 8523911.43, N = 3 SE +/- 27097230.48, N = 3 SE +/- 18056187.30, N = 3 17876358337 33989652273 33924881420 12336557263 17965660883 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.27, N = 3 SE +/- 0.58, N = 10 SE +/- 1.01, N = 3 SE +/- 0.87, N = 15 SE +/- 0.76, N = 3 87.77 73.88 79.60 73.72 90.69 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 4K EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.305 2.61 3.915 5.22 6.525 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.04 5.80 5.74 2.26 3.04 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
MariaDB mariadb-slap This is a MariaDB MySQL database server benchmark making use of mysqlslap (mariadb-slap). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB mariadb-slap 11.5 Clients: 128 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 130 260 390 520 650 SE +/- 0.74, N = 3 SE +/- 0.54, N = 3 SE +/- 0.55, N = 3 SE +/- 1.05, N = 3 SE +/- 0.57, N = 3 580 612 614 574 587 -shared -lrt -lstdc++ -shared -lrt -lstdc++ -shared -lrt -lstdc++ -lnuma -lpcre2-8 -lcrypt -laio -lssl -lcrypto -lnuma -lpcre2-8 -lcrypt -laio -lssl -lcrypto 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O3 -lpthread -lz -ldl -lm
Pennant Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 14 28 42 56 70 SE +/- 0.59, N = 6 SE +/- 0.35, N = 8 SE +/- 0.31, N = 15 SE +/- 0.67, N = 4 SE +/- 0.86, N = 15 61.97 42.23 44.43 55.90 61.28 1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm64_short EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 172.36 171.30 178.81 138.32 177.83 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Java JMH This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 16000M 32000M 48000M 64000M 80000M 38776376503.47 75134161573.04 75186159786.82 36765943472.27 38717919571.11
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/scivis/real_time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.00347, N = 3 SE +/- 0.00296, N = 3 SE +/- 0.00801, N = 3 SE +/- 0.00749, N = 3 SE +/- 0.00354, N = 3 4.61871 8.98195 8.81722 5.88640 4.64920
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: yolov4 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 1.14, N = 15 SE +/- 1.04, N = 4 SE +/- 0.47, N = 3 SE +/- 1.78, N = 3 SE +/- 1.34, N = 15 107.79 86.84 85.63 123.59 107.51 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: yolov4 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.10193, N = 15 SE +/- 0.13812, N = 4 SE +/- 0.06318, N = 3 SE +/- 0.11600, N = 3 SE +/- 0.12476, N = 15 9.29179 11.51990 11.67770 8.09421 9.32307 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: GPT-2 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.01376, N = 3 SE +/- 0.08068, N = 15 SE +/- 0.09699, N = 3 SE +/- 0.02440, N = 3 SE +/- 0.07551, N = 15 8.06389 7.51744 7.60699 7.89080 7.81820 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: GPT-2 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.21, N = 3 SE +/- 1.39, N = 15 SE +/- 1.70, N = 3 SE +/- 0.39, N = 3 SE +/- 1.28, N = 15 123.95 133.17 131.44 126.67 128.01 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
ScyllaDB This is a benchmark of ScyllaDB and is making use of Apache Cassandra's cassandra-stress for conducting the benchmark. ScyllaDB is an open-source distributed NoSQL data store that is compatible with Apache Cassandra while focusing on higher throughput and lower latency. ScyllaDB uses a sharded design on each node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better ScyllaDB 5.2.9 Test: Writes EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 60.93, N = 3 SE +/- 255.51, N = 3 SE +/- 78.75, N = 3 SE +/- 395.91, N = 3 SE +/- 446.24, N = 3 94217 151729 151849 104832 94899
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: yolov4 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 5.00, N = 15 SE +/- 0.70, N = 3 SE +/- 0.60, N = 3 SE +/- 1.27, N = 3 SE +/- 5.32, N = 15 97.12 97.94 96.99 127.67 100.03 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: yolov4 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.57269, N = 15 SE +/- 0.07376, N = 3 SE +/- 0.06449, N = 3 SE +/- 0.07886, N = 3 SE +/- 0.63014, N = 15 10.70547 10.21150 10.31040 7.83432 10.46448 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
Appleseed Appleseed is an open-source production renderer focused on physically-based global illumination, primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Emily EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 263.51 152.30 153.37 323.27 264.55
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Write - Average Latency EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.078, N = 3 SE +/- 0.109, N = 3 SE +/- 0.094, N = 4 SE +/- 0.101, N = 3 SE +/- 0.141, N = 4 12.355 8.850 8.696 11.791 12.454 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Write EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 12K 24K 36K 48K 60K SE +/- 256.18, N = 3 SE +/- 699.92, N = 3 SE +/- 617.95, N = 4 SE +/- 362.37, N = 3 SE +/- 453.73, N = 4 40473 56517 57520 42412 40163 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
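The two PostgreSQL charts above appear to describe the same runs from two angles. With a fixed number of clients, each waiting on one transaction at a time, the average latency is approximately the client count divided by the throughput. The sketch below applies that relation to the reported TPS figures. That pgbench derived its latency figure this way in these runs is an assumption; the source does not state it.

```python
def average_latency_ms(clients, tps):
    """Approximate mean latency for a closed loop of `clients` connections."""
    return clients / tps * 1000.0


CLIENTS = 500  # from the test configuration above

# (reported TPS, reported average latency in ms) copied from the charts.
reported = {
    "EPYC 4364P @ DDR5-5200": (40473, 12.355),
    "EPYC 4564P @ DDR5-5200": (56517, 8.850),
    "EPYC 4564P @ DDR5-4800": (57520, 8.696),
    "Xeon E-2488 @ DDR5-4800": (42412, 11.791),
    "EPYC 4364P @ DDR5-4800": (40163, 12.454),
}

for system, (tps, latency_ms) in reported.items():
    derived = average_latency_ms(CLIENTS, tps)
    print(f"{system}: derived {derived:.3f} ms, reported {latency_ms:.3f} ms")
```

The derived values land within a few thousandths of a millisecond of the reported ones, so the latency chart carries essentially the same ranking as the TPS chart.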
Apache IoTDB Apache IoTDB is a time series database and this benchmark is facilitated using the IoT Benchmark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 60 120 180 240 300 SE +/- 2.74, N = 3 SE +/- 3.67, N = 3 SE +/- 3.26, N = 3 SE +/- 1.91, N = 3 SE +/- 3.81, N = 3 243.61 230.41 235.20 275.14 248.62 MAX: 26975.13 MAX: 26735.87 MAX: 26852.2 MAX: 27185.57 MAX: 26892.13
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 15M 30M 45M 60M 75M SE +/- 418677.10, N = 3 SE +/- 489754.87, N = 3 SE +/- 438509.29, N = 3 SE +/- 245819.57, N = 3 SE +/- 382550.66, N = 3 64655502 69731166 67794681 57760197 63296864
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.9405 1.881 2.8215 3.762 4.7025 SE +/- 0.02, N = 11 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 15 SE +/- 0.02, N = 3 2.10 4.18 4.14 1.45 2.06 MIN: 0.64 / MAX: 2.54 MIN: 1.81 / MAX: 4.75 MIN: 1.71 / MAX: 4.75 MIN: 0.39 / MAX: 1.82 MIN: 0.66 / MAX: 2.48
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/pathtracer/real_time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50 100 150 200 250 SE +/- 0.61, N = 3 SE +/- 0.68, N = 3 SE +/- 0.83, N = 3 SE +/- 0.42, N = 3 SE +/- 0.86, N = 3 170.39 220.91 219.74 157.97 168.64
easyWave The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.21, N = 3 SE +/- 0.25, N = 3 SE +/- 0.20, N = 3 SE +/- 0.20, N = 3 SE +/- 0.14, N = 3 152.79 127.02 132.13 145.99 159.04 1. (CXX) g++ options: -O3 -fopenmp
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3K 6K 9K 12K 15K SE +/- 1.44, N = 3 SE +/- 146.76, N = 5 SE +/- 175.17, N = 3 SE +/- 4.20, N = 3 SE +/- 2.24, N = 3 10027.11 13374.35 12630.55 11387.05 9707.08 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
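Since each EPYC processor is tested at both DDR5-4800 and DDR5-5200, the memory speed effect can be read off as a percentage. The helper below is a small sketch for doing that with the SP.C figures above. The function name and the `higher_is_better` flag are illustrative choices, with the flag covering charts marked "Fewer Is Better".

```python
def percent_gain(new, base, higher_is_better=True):
    """Percentage improvement of `new` over `base` in the chart's direction."""
    ratio = new / base if higher_is_better else base / new
    return (ratio - 1.0) * 100.0


# NAS Parallel Benchmarks SP.C, Total Mop/s (more is better), from the chart.
sp_c = {
    "EPYC 4564P": {"DDR5-5200": 13374.35, "DDR5-4800": 12630.55},
    "EPYC 4364P": {"DDR5-5200": 10027.11, "DDR5-4800": 9707.08},
}

for cpu, scores in sp_c.items():
    gain = percent_gain(scores["DDR5-5200"], scores["DDR5-4800"])
    print(f"{cpu}: DDR5-5200 is {gain:.1f}% faster than DDR5-4800")

# Nominal transfer-rate increase from DDR5-4800 to DDR5-5200, for reference.
print(f"nominal memory speed increase: {percent_gain(5200, 4800):.1f}%")
```

On these numbers the gains are roughly 5.9% and 3.3%, below the roughly 8.3% nominal increase in memory transfer rate.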
ACES DGEMM This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.144416, N = 15 SE +/- 0.258971, N = 12 SE +/- 0.166167, N = 15 SE +/- 0.054700, N = 3 SE +/- 0.133403, N = 15 5.330500 9.577753 9.536711 4.204110 5.160364 1. (CC) gcc options: -O3 -march=native -fopenmp
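For readers unfamiliar with how a DGEMM result becomes a GFLOP/s figure, here is a minimal single-threaded sketch. It computes C = alpha*A*B + beta*C on small square matrices and counts 2*N^3 floating-point operations, which is the usual leading-term estimate. The exact operation count used by the ACES DGEMM benchmark is not given in the source, and a pure-Python loop will be orders of magnitude slower than the compiled OpenMP code measured above.

```python
import time


def dgemm(alpha, a, b, beta, c):
    """Naive in-place C = alpha * A * B + beta * C for square list-of-list matrices."""
    n = len(a)
    for i in range(n):
        for j in range(n):
            acc = 0.0
            for k in range(n):
                acc += a[i][k] * b[k][j]
            c[i][j] = alpha * acc + beta * c[i][j]


def gflops(n, seconds):
    """GFLOP/s assuming 2*N^3 operations (one multiply and one add per inner step)."""
    return 2.0 * n ** 3 / seconds / 1e9


n = 64  # small illustrative size, chosen so the pure-Python loop finishes quickly
a = [[1.0] * n for _ in range(n)]
b = [[2.0] * n for _ in range(n)]
c = [[0.0] * n for _ in range(n)]

start = time.perf_counter()
dgemm(1.0, a, b, 0.0, c)
elapsed = time.perf_counter() - start

print(f"N = {n}: {elapsed:.4f} s, {gflops(n, elapsed):.6f} GFLOP/s")
```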
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 12.1 Length: 1e13 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50 100 150 200 250 SE +/- 0.15, N = 3 SE +/- 0.20, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 153.60 82.47 82.12 245.67 152.68 1. (CXX) g++ options: -O3
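The Primesieve description mentions the sieve of Eratosthenes. As a rough illustration of the underlying idea only, the sketch below is a plain, unsegmented sieve. It is not Primesieve's implementation, which is heavily optimized, and it would be impractical at the 1e13 length used in this test.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a basic sieve of Eratosthenes."""
    if limit < 2:
        return []
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Multiples below p*p were already crossed out by smaller primes.
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytearray(count)
        p += 1
    return [n for n, flag in enumerate(is_prime) if flag]


print(primes_up_to(30))  # [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```

The working array here grows with the limit, which is why the description's note about L1/L2 cache performance matters: keeping the sieve's working set cache-resident is what makes large limits tractable.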
Apache IoTDB Apache IoTDB is a time series database, and this benchmark is facilitated using the IoT Benchmark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.31, N = 3 SE +/- 0.10, N = 3 SE +/- 0.42, N = 3 SE +/- 0.51, N = 3 SE +/- 0.41, N = 3 70.70 65.63 67.07 80.56 72.28 MAX: 13822.83 MAX: 12576.01 MAX: 12570.5 MAX: 15084.51 MAX: 13851.96
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 15M 30M 45M 60M 75M SE +/- 42980.57, N = 3 SE +/- 41672.18, N = 3 SE +/- 313014.56, N = 3 SE +/- 205982.51, N = 3 SE +/- 161254.73, N = 3 64781285 69857519 68166248 57028398 63627945
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.7 Preset: Very Thorough EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.6039 1.2078 1.8117 2.4156 3.0195 SE +/- 0.0010, N = 3 SE +/- 0.0093, N = 3 SE +/- 0.0070, N = 3 SE +/- 0.0116, N = 3 SE +/- 0.0023, N = 3 1.3979 2.6841 2.6608 1.4639 1.4109 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.7 Preset: Exhaustive EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.3688 0.7376 1.1064 1.4752 1.844 SE +/- 0.0000, N = 3 SE +/- 0.0020, N = 3 SE +/- 0.0018, N = 3 SE +/- 0.0051, N = 3 SE +/- 0.0007, N = 3 0.8557 1.6391 1.6262 0.8741 0.8633 1. (CXX) g++ options: -O3 -flto -pthread
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.4673 0.9346 1.4019 1.8692 2.3365 SE +/- 0.023, N = 3 SE +/- 0.006, N = 3 SE +/- 0.009, N = 3 SE +/- 0.005, N = 3 SE +/- 0.030, N = 3 2.051 0.722 0.709 1.852 2.077 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200K 400K 600K 800K 1000K SE +/- 4300.20, N = 3 SE +/- 8757.92, N = 3 SE +/- 14172.11, N = 3 SE +/- 1154.98, N = 3 SE +/- 5557.21, N = 3 390018 1108243 1128893 431962 385411 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.2162 0.4324 0.6486 0.8648 1.081 SE +/- 0.012, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.010, N = 3 0.961 0.361 0.360 0.899 0.940 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300K 600K 900K 1200K 1500K SE +/- 6448.13, N = 3 SE +/- 3948.07, N = 3 SE +/- 3056.15, N = 3 SE +/- 1617.01, N = 3 SE +/- 5499.26, N = 3 520502 1384166 1388157 555873 532015 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.16, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.26, N = 3 24.50 16.16 16.10 23.20 24.76 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11K 22K 33K 44K 55K SE +/- 123.75, N = 3 SE +/- 481.81, N = 3 SE +/- 155.76, N = 3 SE +/- 85.58, N = 3 SE +/- 337.25, N = 3 32657 49522 49683 34486 32316 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
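The latency and TPS charts for each pgbench configuration describe the same runs from two angles. Assuming every client keeps exactly one transaction in flight at all times (an assumption for illustration, not something the source states), the average latency is approximately clients / TPS. The sketch below checks that relation against a few of the reported values; `latency_ms` is a hypothetical helper name.

```python
def latency_ms(clients, tps):
    """Average latency in ms, assuming each client always has one transaction in flight."""
    return clients / tps * 1000.0

# (clients, reported TPS, reported average latency in ms), copied from the charts above.
reported = [
    (800, 390018, 2.051),   # read only
    (800, 1108243, 0.722),  # read only
    (500, 520502, 0.961),   # read only
    (800, 32657, 24.50),    # read write
]

for clients, tps, latency in reported:
    estimate = latency_ms(clients, tps)
    print(f"{clients} clients, {tps} TPS: estimated {estimate:.3f} ms, reported {latency} ms")
```

Because the two metrics move together this closely, a system that leads on TPS in these charts will also lead on average latency.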
Pennant Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10 20 30 40 50 SE +/- 0.88, N = 15 SE +/- 0.27, N = 3 SE +/- 0.33, N = 5 SE +/- 1.09, N = 12 SE +/- 0.82, N = 15 42.94 29.55 30.24 32.87 43.80 1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
DuckDB DuckDB is an in-progress SQL OLAP database management system optimized for analytics and features a vectorized and parallel engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: IMDB EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 0.10, N = 3 100.63 82.38 83.01 99.58 101.94 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
FFmpeg This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Platform EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 16 32 48 64 80 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.22, N = 3 SE +/- 0.05, N = 3 65.26 69.80 69.24 58.75 65.45 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Video On Demand EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 16 32 48 64 80 SE +/- 0.09, N = 3 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 65.35 69.81 69.52 58.75 65.27 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 400 800 1200 1600 2000 SE +/- 9.44, N = 3 SE +/- 20.05, N = 3 SE +/- 6.90, N = 15 SE +/- 9.70, N = 3 SE +/- 97.47, N = 3 1926.9 1908.3 1890.5 1906.1 1836.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 10.20 10.70 11.50 8.91 10.30 1. (CC) gcc options: -O3 -pthread -lz -llzma
Llamafile Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.6 Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 6.28 6.79 6.53 7.91 6.34
PyTorch This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10 20 30 40 50 SE +/- 0.16, N = 3 SE +/- 0.94, N = 15 SE +/- 0.27, N = 3 SE +/- 1.58, N = 15 SE +/- 0.35, N = 3 45.11 43.38 45.76 32.25 45.20 MIN: 43.61 / MAX: 45.86 MIN: 34.95 / MAX: 46.71 MIN: 34.75 / MAX: 46.5 MIN: 23.14 / MAX: 40.93 MIN: 29.05 / MAX: 46.24
FFmpeg This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Upload EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 31.95 34.21 34.06 28.89 32.03 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/ao/real_time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.00129, N = 3 SE +/- 0.00600, N = 3 SE +/- 0.00268, N = 3 SE +/- 0.00839, N = 3 SE +/- 0.05544, N = 4 4.63828 8.98399 8.81078 5.93161 4.58353
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Junkshop - Compute: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.43, N = 3 SE +/- 0.17, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.19, N = 3 139.48 72.75 73.73 173.17 139.94
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 400 800 1200 1600 2000 SE +/- 18.77, N = 3 SE +/- 9.65, N = 3 SE +/- 7.95, N = 15 SE +/- 7.69, N = 3 SE +/- 20.68, N = 3 2066.7 2010.8 1998.3 1972.1 2026.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.23, N = 15 SE +/- 0.03, N = 3 SE +/- 0.15, N = 3 18.0 18.8 19.5 16.3 17.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.49, N = 3 SE +/- 0.14, N = 3 SE +/- 0.17, N = 3 SE +/- 1.26, N = 3 SE +/- 0.89, N = 3 133.31 83.97 86.00 144.54 130.53 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 16.77, N = 15 SE +/- 4.80, N = 3 SE +/- 10.32, N = 15 SE +/- 16.64, N = 3 SE +/- 18.32, N = 12 1269.37 1278.44 1223.22 1604.85 1228.58 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.42, N = 3 SE +/- 0.06, N = 3 SE +/- 0.26, N = 3 SE +/- 0.21, N = 3 SE +/- 0.40, N = 3 129.33 85.25 87.86 135.42 129.55 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 32 - Model: ResNet-50 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 30.79 39.99 38.96 23.19 30.52
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 12 24 36 48 60 SE +/- 0.25, N = 3 SE +/- 0.29, N = 15 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 SE +/- 0.33, N = 15 51.29 33.13 34.80 53.29 49.54 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Llamafile Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.6 Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.495 0.99 1.485 1.98 2.475 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.82 2.01 1.94 2.20 1.84
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Decompression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 4.31, N = 3 SE +/- 24.36, N = 9 SE +/- 5.41, N = 9 SE +/- 2.75, N = 3 SE +/- 3.55, N = 3 2322.3 2245.7 2279.9 2375.6 2313.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Compression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 1.88, N = 3 SE +/- 6.27, N = 9 SE +/- 6.44, N = 9 SE +/- 3.15, N = 3 SE +/- 1.37, N = 3 520.3 793.8 778.5 485.6 513.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Intel Open Image Denoise Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.2 Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.0923 0.1846 0.2769 0.3692 0.4615 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.23 0.41 0.40 0.21 0.23
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Fishy Cat - Compute: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.39, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.24, N = 3 SE +/- 0.38, N = 3 127.01 66.78 67.34 165.00 127.12
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.1702 0.3404 0.5106 0.6808 0.851 SE +/- 0.00059, N = 3 SE +/- 0.00230, N = 3 SE +/- 0.00230, N = 3 SE +/- 0.00016, N = 3 SE +/- 0.00121, N = 3 0.49345 0.75646 0.74072 0.20565 0.49007
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.41, N = 15 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 19.26 18.66 16.78 23.61 19.10 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 13 26 39 52 65 SE +/- 0.12, N = 3 SE +/- 0.09, N = 3 SE +/- 1.44, N = 15 SE +/- 0.04, N = 3 SE +/- 0.20, N = 3 51.91 53.58 60.06 42.35 52.34 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
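The two ONNX Runtime metrics are reciprocal views of the same measurement: inferences per second is roughly 1000 divided by the inference time in milliseconds. The match is not always exact. One plausible reason, offered here as an assumption rather than a documented fact, is that each metric is averaged over runs separately, and the mean of per-run rates is not the same as the rate of the mean time. A minimal sketch with hypothetical run times:

```python
from statistics import mean

def per_second(ms):
    """Convert a per-inference time in milliseconds to inferences per second."""
    return 1000.0 / ms

# Hypothetical per-run inference times in ms (illustrative, not from the results).
runs_ms = [15.0, 16.0, 20.0]

rate_of_mean = per_second(mean(runs_ms))                # reciprocal of the mean time
mean_of_rates = mean([per_second(t) for t in runs_ms])  # mean of the per-run rates

print(f"rate of mean time: {rate_of_mean:.2f} inferences/sec")
print(f"mean of run rates: {mean_of_rates:.2f} inferences/sec")
```

When the runs vary, the mean of the rates comes out higher than the rate of the mean time, so the gap between the two charts tends to grow with run-to-run variance.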
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: bertsquad-12 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 4.77, N = 15 SE +/- 0.03, N = 3 SE +/- 0.32, N = 3 SE +/- 0.14, N = 3 SE +/- 0.63, N = 3 70.83 66.03 65.48 91.52 85.12 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: bertsquad-12 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 1.17, N = 15 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 15.22 15.14 15.27 10.93 11.75 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: T5 Encoder - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.03666, N = 3 SE +/- 0.08220, N = 3 SE +/- 0.08219, N = 3 SE +/- 0.07186, N = 15 SE +/- 0.04051, N = 3 7.16751 6.16145 6.36984 5.90786 7.03702 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: T5 Encoder - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.71, N = 3 SE +/- 2.20, N = 3 SE +/- 2.01, N = 3 SE +/- 2.25, N = 15 SE +/- 0.82, N = 3 139.48 162.33 157.01 169.62 142.06 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data-intensive applications. This test profile uses a server-less CockroachDB configuration to test various Cockroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 60% Reads - Concurrency: 128 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 47.09, N = 3 SE +/- 156.58, N = 3 SE +/- 137.27, N = 3 SE +/- 198.29, N = 3 SE +/- 204.39, N = 3 67506.7 95084.8 93864.1 63021.4 67444.0
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 256 - Model: AlexNet EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 90 180 270 360 450 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 SE +/- 0.13, N = 3 SE +/- 0.06, N = 3 SE +/- 0.15, N = 3 237.57 409.96 405.64 189.14 237.76
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data-intensive applications. This test profile uses a server-less CockroachDB configuration to test various Cockroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 128 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 264.18, N = 3 SE +/- 831.25, N = 3 SE +/- 764.38, N = 3 SE +/- 189.35, N = 3 SE +/- 103.24, N = 3 64052.7 92195.2 90528.3 59866.6 64458.4
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 128 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 302.28, N = 3 SE +/- 765.52, N = 3 SE +/- 716.09, N = 3 SE +/- 237.27, N = 3 SE +/- 258.97, N = 3 83155.8 121061.7 119084.9 83987.5 84052.3
Dragonflydb Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:10 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.4M 2.8M 4.2M 5.6M 7M SE +/- 38431.66, N = 3 SE +/- 90168.82, N = 12 SE +/- 45878.41, N = 3 SE +/- 44122.15, N = 3 SE +/- 43642.69, N = 3 4729191.29 6266297.87 6406922.86 4522560.58 4759359.71 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1600 3200 4800 6400 8000 SE +/- 180.65, N = 12 SE +/- 298.02, N = 15 SE +/- 298.76, N = 15 SE +/- 48.43, N = 3 SE +/- 0.00, N = 3 4256.28 7477.19 7317.33 6199.09 3975.69 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1400 2800 4200 5600 7000 SE +/- 192.19, N = 12 SE +/- 321.53, N = 15 SE +/- 328.86, N = 15 SE +/- 58.27, N = 3 SE +/- 14.20, N = 3 3635.69 6642.94 6613.59 5146.33 3349.85 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
GROMACS This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package using the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.5722 1.1444 1.7166 2.2888 2.861 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 1.546 2.543 2.468 1.434 1.526 1. (CXX) g++ options: -O3 -lm
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 10.81, N = 3 SE +/- 25.03, N = 3 SE +/- 11.41, N = 12 SE +/- 2.01, N = 3 SE +/- 7.37, N = 3 2404.2 2314.7 2308.6 2377.0 2394.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50 100 150 200 250 SE +/- 0.92, N = 3 SE +/- 2.97, N = 3 SE +/- 4.91, N = 12 SE +/- 2.27, N = 3 SE +/- 1.44, N = 3 171.0 220.9 207.5 170.6 168.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
libavif avifenc This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.33, N = 3 SE +/- 0.08, N = 3 SE +/- 0.49, N = 3 SE +/- 0.12, N = 3 108.65 72.11 72.42 133.57 107.79 1. (CXX) g++ options: -O3 -fPIC -lm
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10K 20K 30K 40K 50K SE +/- 14.25, N = 3 SE +/- 110.88, N = 3 SE +/- 553.44, N = 3 SE +/- 96.46, N = 3 SE +/- 21.68, N = 3 22935.27 44844.13 43371.78 28326.36 22551.09 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.021, N = 3 SE +/- 0.058, N = 3 SE +/- 0.061, N = 3 SE +/- 0.018, N = 3 SE +/- 0.024, N = 3 5.693 7.818 7.746 4.975 5.681 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10 20 30 40 50 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 SE +/- 0.69, N = 12 37.00 26.83 27.35 43.40 36.25 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9 18 27 36 45 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.11, N = 3 SE +/- 0.60, N = 12 27.02 37.27 36.56 23.04 27.71 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
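The two ArcFace ResNet-100 charts report the same runs in two units. As a rough cross-check, and assuming each inference is timed one after another so that throughput is simply the reciprocal of latency, the inferences-per-second figures should be close to 1000 divided by the per-inference time in milliseconds. The sketch below uses only the numbers shown above; the dictionary and function names are illustrative.

```python
# Reported inference time cost (ms) per system, from the latency chart.
latency_ms = {
    "EPYC 4364P @ DDR5-5200": 37.00,
    "EPYC 4564P @ DDR5-5200": 26.83,
    "EPYC 4564P @ DDR5-4800": 27.35,
    "Xeon E-2488 @ DDR5-4800": 43.40,
    "EPYC 4364P @ DDR5-4800": 36.25,
}

# Reported inferences per second per system, from the throughput chart.
reported_ips = {
    "EPYC 4364P @ DDR5-5200": 27.02,
    "EPYC 4564P @ DDR5-5200": 37.27,
    "EPYC 4564P @ DDR5-4800": 36.56,
    "Xeon E-2488 @ DDR5-4800": 23.04,
    "EPYC 4364P @ DDR5-4800": 27.71,
}


def implied_throughput(ms_per_inference):
    """Inferences per second implied by a per-inference latency in ms."""
    return 1000.0 / ms_per_inference


relative_gap = {}
for system, ms in latency_ms.items():
    implied = implied_throughput(ms)
    gap = abs(implied - reported_ips[system]) / reported_ips[system]
    relative_gap[system] = gap
    print(f"{system}: implied {implied:.2f}/s, reported {reported_ips[system]}/s")
```

Small gaps are expected because an average of per-run throughputs is not exactly the reciprocal of an average of per-run latencies, which matters most for the configuration with N = 12 and the largest standard error.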
Redis Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 500 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 800K 1600K 2400K 3200K 4000K SE +/- 201571.91, N = 12 SE +/- 164667.68, N = 15 SE +/- 72840.48, N = 12 SE +/- 128040.11, N = 15 SE +/- 181123.85, N = 15 3684418.98 3419833.50 3827317.96 3327348.65 3656582.35 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
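The Redis SET results carry large standard errors relative to the gaps between configurations, so it is worth checking whether a difference exceeds run-to-run noise before reading anything into it. The sketch below is a rule-of-thumb check only: it assumes the two standard errors are independent and expresses the gap between two means in units of their combined standard error. The function name is illustrative.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Gap between two means, in units of their combined standard error."""
    return abs(mean_a - mean_b) / math.hypot(se_a, se_b)


# EPYC 4364P: DDR5-5200 vs DDR5-4800 (values from the Redis SET chart).
gap_4364p = difference_in_se_units(3684418.98, 201571.91, 3656582.35, 181123.85)

# EPYC 4564P: DDR5-5200 vs DDR5-4800 (values from the Redis SET chart).
gap_4564p = difference_in_se_units(3419833.50, 164667.68, 3827317.96, 72840.48)

print(f"EPYC 4364P gap: {gap_4364p:.2f} combined SE")
print(f"EPYC 4564P gap: {gap_4564p:.2f} combined SE")
```

A gap well below one combined standard error, as for the EPYC 4364P pair here, is indistinguishable from noise at these sample sizes.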
PETSc PETSc, the Portable, Extensible Toolkit for Scientific Computation, is for the scalable (parallel) solution of scientific applications modeled by partial differential equations. This test profile runs the PETSc "make streams" benchmark and records the throughput rate when all available cores are utilized for the MPI Streams build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better PETSc 3.19 Test: Streams EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 15K 30K 45K 60K 75K SE +/- 249.54, N = 3 SE +/- 23.59, N = 3 SE +/- 31.25, N = 3 SE +/- 12.95, N = 3 SE +/- 40.69, N = 3 26545.71 41077.63 39471.07 69652.73 39918.97 1. (CC) gcc options: -fPIC -O3 -O2 -lpthread -ludev -lpciaccess -lm
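For a bandwidth test such as Streams, a natural question is how much of the nominal memory-speed increase shows up in the measured result. The following sketch computes the percentage change between the EPYC 4564P results at DDR5-5200 and DDR5-4800 and sets it beside the nominal increase in transfer rate (5200 versus 4800). The function name is illustrative, and the comparison assumes nothing else differed between the two runs.

```python
def percent_change(new, old):
    """Percentage change from old to new."""
    return (new - old) / old * 100.0


# EPYC 4564P, PETSc Streams (MB/s), values from the chart.
measured_uplift = percent_change(41077.63, 39471.07)

# Nominal DDR5 transfer-rate increase from 4800 to 5200 MT/s.
nominal_uplift = percent_change(5200, 4800)

print(f"measured: {measured_uplift:.2f}%  nominal: {nominal_uplift:.2f}%")
```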
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 296.60 183.23 199.88 242.94 311.36 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10 20 30 40 50 38.17 30.77 29.74 35.19 41.68 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Dragonflydb Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.5M 3M 4.5M 6M 7.5M SE +/- 6458.55, N = 3 SE +/- 34452.25, N = 3 SE +/- 57648.71, N = 9 SE +/- 43112.16, N = 3 SE +/- 16892.77, N = 3 4762998.42 6289782.08 7046043.06 4604951.36 4818585.25 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 98.21, N = 3 SE +/- 301.50, N = 3 SE +/- 536.64, N = 3 SE +/- 46.60, N = 3 SE +/- 52.64, N = 3 85990.12 136360.72 144355.05 87132.22 84772.31 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Apache HTTP Server This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.56 Concurrent Requests: 1000 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 62.36, N = 3 SE +/- 797.39, N = 3 SE +/- 802.85, N = 3 SE +/- 873.03, N = 3 SE +/- 59.58, N = 3 87708.39 131437.32 130637.70 113097.85 87686.63 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 191.32, N = 3 SE +/- 288.60, N = 3 SE +/- 126.64, N = 3 SE +/- 173.47, N = 3 SE +/- 239.98, N = 3 94947.75 154835.06 156935.63 92582.29 91505.24 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.56 Concurrent Requests: 500 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 152.51, N = 3 SE +/- 521.55, N = 3 SE +/- 667.30, N = 3 SE +/- 774.43, N = 3 SE +/- 141.36, N = 3 96508.30 133534.11 136714.05 125745.61 96630.23 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 200 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 127.61, N = 3 SE +/- 150.85, N = 3 SE +/- 223.05, N = 3 SE +/- 231.66, N = 3 SE +/- 204.87, N = 3 101214.88 162348.33 162256.48 98692.95 100527.89 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Appleseed Appleseed is an open-source, physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Material Tester EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 142.94 102.15 102.74 173.41 143.38
MariaDB mariadb-slap This is a MariaDB MySQL database server benchmark making use of mysqlslap (mariadb-slap). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB mariadb-slap 11.5 Clients: 64 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 5.66, N = 3 SE +/- 4.85, N = 3 SE +/- 5.34, N = 3 SE +/- 4.64, N = 3 SE +/- 5.02, N = 3 1124 1183 1182 1128 1132 -shared -lrt -lstdc++ -shared -lrt -lstdc++ -shared -lrt -lstdc++ -lnuma -lpcre2-8 -lcrypt -laio -lssl -lcrypto -lnuma -lpcre2-8 -lcrypt -laio -lssl -lcrypto 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O3 -lpthread -lz -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Disney Material EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 156.07 85.32 85.93 174.95 156.37
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: BMW27 - Compute: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.05, N = 3 SE +/- 0.16, N = 3 SE +/- 0.18, N = 3 SE +/- 0.24, N = 3 101.72 53.25 53.56 123.51 101.52
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: defconfig EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.32, N = 3 SE +/- 0.36, N = 3 SE +/- 0.42, N = 3 SE +/- 0.45, N = 3 SE +/- 0.32, N = 3 97.41 57.47 57.93 108.34 97.92
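Charts on this page mix "More Is Better" and "Fewer Is Better" metrics, so a ratio between two systems has to be inverted for time-based results before it can be read as a speedup. The sketch below shows one way to normalize the direction, using the defconfig build times above. The function name and the higher_is_better flag are illustrative.

```python
def relative_performance(candidate, baseline, higher_is_better):
    """How many times faster the candidate is than the baseline.

    For "More Is Better" metrics this is candidate / baseline;
    for "Fewer Is Better" metrics (such as seconds) it is the inverse.
    """
    if higher_is_better:
        return candidate / baseline
    return baseline / candidate


# Kernel defconfig build time in seconds (Fewer Is Better), from the chart.
epyc_4564p_seconds = 57.47   # EPYC 4564P @ DDR5-5200
xeon_e2488_seconds = 108.34  # Xeon E-2488 @ DDR5-4800

speedup = relative_performance(
    epyc_4564p_seconds, xeon_e2488_seconds, higher_is_better=False
)
print(f"EPYC 4564P builds the kernel {speedup:.2f}x as fast as the Xeon E-2488")
```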
Renaissance Renaissance is a suite of benchmarks designed to test the JVM, with workloads ranging from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Genetic Algorithm Using Jenetics + Futures EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 5.89, N = 3 SE +/- 6.12, N = 15 SE +/- 8.75, N = 3 SE +/- 9.79, N = 3 SE +/- 9.08, N = 3 821.7 882.7 887.0 886.6 819.5 MIN: 782.09 / MAX: 843.31 MIN: 756.16 / MAX: 981 MIN: 771.69 / MAX: 950.63 MIN: 844.84 / MAX: 942.25 MIN: 773.51 / MAX: 850.94
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Recurrent Neural Network Training - Engine: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 600 1200 1800 2400 3000 SE +/- 1.64, N = 3 SE +/- 1.43, N = 3 SE +/- 3.18, N = 3 SE +/- 9.27, N = 3 SE +/- 3.90, N = 3 2491.14 1348.37 1354.12 2931.95 2496.65 MIN: 2459.9 MIN: 1314.05 MIN: 1317.27 MIN: 2882.15 MIN: 2449.87 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.21, N = 3 SE +/- 0.45, N = 3 SE +/- 0.09, N = 3 SE +/- 0.27, N = 3 SE +/- 0.09, N = 3 86.82 72.88 75.55 64.58 89.36 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 273.73, N = 3 SE +/- 101.69, N = 3 SE +/- 146.12, N = 3 SE +/- 76.92, N = 3 SE +/- 59.04, N = 3 84209 44247 44561 95447 84611
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Recurrent Neural Network Inference - Engine: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 7.18, N = 3 SE +/- 1.91, N = 3 SE +/- 2.19, N = 3 SE +/- 6.38, N = 3 SE +/- 2.00, N = 3 1337.47 706.65 703.75 1548.71 1329.68 MIN: 1299.81 MIN: 679.48 MIN: 675.7 MIN: 1512.56 MIN: 1297.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
PyTorch This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 15 30 45 60 75 SE +/- 1.84, N = 15 SE +/- 0.90, N = 15 SE +/- 0.29, N = 3 SE +/- 1.37, N = 12 SE +/- 1.71, N = 12 67.48 65.69 66.63 61.49 68.30 MIN: 46.76 / MAX: 74.15 MIN: 53.12 / MAX: 69.76 MIN: 57.04 / MAX: 69.34 MIN: 40.06 / MAX: 72.22 MIN: 48 / MAX: 73.52
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: GoogLeNet EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.17, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 92.37 133.21 129.59 78.17 91.42
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 6.0 Mode: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9K 18K 27K 36K 45K SE +/- 11.00, N = 3 SE +/- 204.38, N = 3 SE +/- 38.48, N = 3 SE +/- 16.84, N = 3 SE +/- 57.13, N = 3 23078 43879 43567 20060 23056
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.7 Preset: Thorough EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.32, N = 12 SE +/- 0.01, N = 3 10.21 19.90 19.68 12.23 10.32 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 600 1200 1800 2400 3000 SE +/- 4.04, N = 3 SE +/- 0.33, N = 3 SE +/- 1.53, N = 3 SE +/- 0.88, N = 3 SE +/- 1.20, N = 3 2526 1265 1275 2884 2539
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2K 4K 6K 8K 10K SE +/- 92.49, N = 15 SE +/- 89.79, N = 6 SE +/- 68.61, N = 9 SE +/- 210.57, N = 15 SE +/- 81.68, N = 15 7898.41 9036.65 8784.83 9750.34 7857.53 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: simple-H2O EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.73, N = 12 SE +/- 0.05, N = 3 SE +/- 0.19, N = 3 SE +/- 1.38, N = 12 SE +/- 0.82, N = 15 23.12 25.25 25.62 24.06 21.45 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 700 1400 2100 2800 3500 SE +/- 6.80, N = 3 SE +/- 14.04, N = 3 SE +/- 6.45, N = 3 SE +/- 6.39, N = 3 SE +/- 4.57, N = 3 1806.71 2387.89 2291.85 3033.56 1791.99 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 3.42, N = 3 SE +/- 1.46, N = 3 SE +/- 1.09, N = 3 SE +/- 4.91, N = 3 SE +/- 2.12, N = 3 1619.82 1977.00 1917.67 2260.82 1550.64 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.17, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 5 SE +/- 0.18, N = 3 SE +/- 0.16, N = 3 19.79 18.97 19.32 24.39 19.33 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 12 24 36 48 60 SE +/- 0.43, N = 3 SE +/- 0.21, N = 3 SE +/- 0.52, N = 5 SE +/- 0.30, N = 3 SE +/- 0.43, N = 3 50.53 52.72 51.78 41.01 51.72 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 12 24 36 48 60 SE +/- 0.48, N = 3 SE +/- 0.20, N = 15 SE +/- 0.37, N = 3 SE +/- 0.06, N = 3 SE +/- 0.38, N = 3 49.58 30.13 30.49 52.89 46.58 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 0.33, N = 3 SE +/- 2.08, N = 3 SE +/- 1.00, N = 3 SE +/- 2.33, N = 3 SE +/- 2.73, N = 3 2156 1080 1090 2474 2171
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 1.76, N = 3 SE +/- 1.00, N = 3 SE +/- 1.15, N = 3 SE +/- 0.88, N = 3 SE +/- 2.19, N = 3 2133 1069 1078 2441 2143
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 1.01, N = 3 SE +/- 1.00, N = 5 SE +/- 0.73, N = 3 SE +/- 0.11, N = 3 72.52 89.98 92.38 133.99 73.87 MIN: 28.47 / MAX: 87.6 MIN: 63.39 / MAX: 124.53 MIN: 31.8 / MAX: 128.67 MIN: 103.21 / MAX: 233.72 MIN: 36.14 / MAX: 99.53 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.98, N = 3 SE +/- 0.92, N = 5 SE +/- 0.16, N = 3 SE +/- 0.08, N = 3 55.11 88.84 86.56 29.83 54.11 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Cache EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 27236.78, N = 3 SE +/- 18366.37, N = 3 SE +/- 29190.39, N = 3 SE +/- 59300.10, N = 15 SE +/- 44904.06, N = 10 5997921.20 2446936.66 2537425.46 4023464.53 5854131.56 1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 152.10, N = 3 SE +/- 61.02, N = 3 SE +/- 126.32, N = 3 SE +/- 34.91, N = 3 SE +/- 128.55, N = 3 72250 38192 38454 82403 72906
Llama.cpp Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-13b.Q4_0.gguf EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 6.57 7.32 6.98 8.38 6.70 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 145.56, N = 3 SE +/- 168.15, N = 3 SE +/- 192.07, N = 3 SE +/- 135.20, N = 3 SE +/- 135.51, N = 3 71530 37923 37984 81236 72067
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.00236, N = 3 SE +/- 0.00353, N = 3 SE +/- 0.00685, N = 3 SE +/- 0.00822, N = 3 SE +/- 0.00132, N = 3 4.66757 8.97813 8.92149 3.79539 4.65305
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 800K 1600K 2400K 3200K 4000K SE +/- 2761.61, N = 3 SE +/- 5303.54, N = 3 SE +/- 5916.60, N = 3 SE +/- 7388.22, N = 3 SE +/- 1626.82, N = 3 2195688.49 3761626.61 3775389.09 3072373.87 2217988.24 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 800K 1600K 2400K 3200K 4000K SE +/- 1471.26, N = 3 SE +/- 5665.65, N = 3 SE +/- 3096.61, N = 3 SE +/- 11523.53, N = 3 SE +/- 1526.11, N = 3 2104490.55 3844031.28 3850014.66 3048853.56 2124192.99 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/ao/real_time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.00682, N = 3 SE +/- 0.00259, N = 3 SE +/- 0.01707, N = 3 SE +/- 0.00479, N = 3 SE +/- 0.01340, N = 3 3.91070 7.46649 7.45782 2.73496 3.87827
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.00260, N = 3 SE +/- 0.02548, N = 3 SE +/- 0.00203, N = 3 SE +/- 0.00622, N = 3 SE +/- 0.01059, N = 3 3.82138 7.36909 7.31921 2.63851 3.82321
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 4.42 7.72 7.70 3.58 4.42 MIN: 4.32 / MAX: 4.49 MIN: 6.87 / MAX: 8.15 MIN: 6.73 / MAX: 8.14 MIN: 3.44 / MAX: 4.42 MIN: 4.35 / MAX: 4.49
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 0.52, N = 3 SE +/- 1.73, N = 3 SE +/- 1.15, N = 3 SE +/- 8.08, N = 3 SE +/- 0.34, N = 3 560.14 590.80 592.59 1506.63 556.14 MIN: 538 / MAX: 581.72 MIN: 555.5 / MAX: 623.22 MIN: 531.26 / MAX: 631.27 MIN: 1180.01 / MAX: 1642.12 MIN: 444.24 / MAX: 578.41 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 7.14 13.52 13.46 2.65 7.19 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 7.19, N = 3 SE +/- 6.24, N = 3 SE +/- 3.89, N = 3 SE +/- 5.21, N = 3 SE +/- 7.92, N = 3 2352.5 2299.7 2278.0 2375.4 2324.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Compression Speed EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 0.95, N = 3 SE +/- 3.49, N = 3 SE +/- 3.67, N = 3 SE +/- 1.12, N = 3 SE +/- 0.67, N = 3 527.9 797.0 778.8 504.1 518.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
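Since each EPYC processor appears at both DDR5-5200 and DDR5-4800, the memory-speed effect can be expressed as a percentage uplift. The following is a minimal sketch using the compression-speed figures above, assuming the results are listed in the same order as the system labels. The helper name `uplift_percent` is made up for illustration.

```python
# Compression speed in MB/s as (DDR5-5200 result, DDR5-4800 result), copied
# from the chart above, assuming results follow the order of the labels.
compression_mb_s = {
    "EPYC 4564P": (797.0, 778.8),
    "EPYC 4364P": (527.9, 518.5),
}


def uplift_percent(faster_memory, slower_memory):
    """Percentage gain of the DDR5-5200 result over the DDR5-4800 result."""
    return (faster_memory / slower_memory - 1.0) * 100.0


uplift = {cpu: uplift_percent(*pair) for cpu, pair in compression_mb_s.items()}

for cpu, pct in uplift.items():
    print(f"{cpu}: {pct:+.2f}% from DDR5-4800 to DDR5-5200")
```

Both gains come out at roughly 2%. The raw difference in each pair is several times the reported standard errors, although N = 3 runs is a small sample.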
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.0463 2.0926 3.1389 4.1852 5.2315 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.41 4.65 4.62 1.69 2.41 MIN: 0.82 / MAX: 2.85 MIN: 2.03 / MAX: 5.27 MIN: 2 / MAX: 5.24 MIN: 0.51 / MAX: 2.04 MIN: 0.83 / MAX: 2.84
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 80 160 240 320 400 SE +/- 0.15, N = 3 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 SE +/- 0.59, N = 3 SE +/- 0.44, N = 3 285.97 301.79 301.83 384.09 283.71 MIN: 150.08 / MAX: 303.22 MIN: 278.87 / MAX: 327.8 MIN: 264.12 / MAX: 322.86 MIN: 285.45 / MAX: 543.49 MIN: 240.12 / MAX: 303.33 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 13.97 26.46 26.44 10.41 14.07 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.61 15.92 15.70 6.62 8.62
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: GPT-2 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.02822, N = 3 SE +/- 0.00809, N = 3 SE +/- 0.01596, N = 3 SE +/- 0.03060, N = 3 SE +/- 0.00562, N = 3 8.31107 8.15155 8.40671 8.10864 8.26788 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: GPT-2 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.41, N = 3 SE +/- 0.12, N = 3 SE +/- 0.22, N = 3 SE +/- 0.46, N = 3 SE +/- 0.08, N = 3 120.28 122.63 118.91 123.28 120.91 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
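The two GPT-2 charts report the same runs in two units. As a rough consistency check, the sketch below converts the inference time cost (ms) into inferences per second using 1000 / latency and compares the result with the reported throughput. It assumes the results are listed in the same order as the system labels. Small differences are expected if the two figures were averaged separately per run, which is an assumption here.

```python
# Inference time cost (ms) and reported inferences per second for GPT-2,
# copied from the two charts above in label order (assumed mapping).
latency_ms = {
    "EPYC 4364P @ DDR5-5200": 8.31107,
    "EPYC 4564P @ DDR5-5200": 8.15155,
    "EPYC 4564P @ DDR5-4800": 8.40671,
    "Xeon E-2488 @ DDR5-4800": 8.10864,
    "EPYC 4364P @ DDR5-4800": 8.26788,
}
reported_ips = {
    "EPYC 4364P @ DDR5-5200": 120.28,
    "EPYC 4564P @ DDR5-5200": 122.63,
    "EPYC 4564P @ DDR5-4800": 118.91,
    "Xeon E-2488 @ DDR5-4800": 123.28,
    "EPYC 4364P @ DDR5-4800": 120.91,
}

# One inference at a time: throughput is the reciprocal of latency.
derived_ips = {name: 1000.0 / ms for name, ms in latency_ms.items()}

for name in latency_ms:
    gap = derived_ips[name] - reported_ips[name]
    print(f"{name}: derived {derived_ips[name]:.2f}, "
          f"reported {reported_ips[name]}, gap {gap:+.2f}")
```

The derived and reported values agree to within about 0.05 inferences per second for every system, so the two charts carry essentially the same information.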
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 5 10 15 20 25 SE +/- 0.18, N = 9 SE +/- 0.12, N = 15 SE +/- 0.21, N = 3 SE +/- 0.22, N = 3 SE +/- 0.17, N = 15 21.87 16.59 17.11 19.75 22.53 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: bertsquad-12 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.76, N = 3 SE +/- 0.22, N = 3 SE +/- 0.63, N = 3 SE +/- 0.78, N = 3 SE +/- 0.35, N = 3 79.70 64.28 63.11 86.69 81.57 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: bertsquad-12 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 SE +/- 0.16, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 12.55 15.56 15.85 11.54 12.26 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.125 2.25 3.375 4.5 5.625 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.75 5.00 4.96 2.13 2.74 MIN: 2.66 / MAX: 2.86 MIN: 4.85 / MAX: 5.35 MIN: 4.84 / MAX: 5.29 MIN: 2.03 / MAX: 2.65 MIN: 2.66 / MAX: 2.85
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Sharpen EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 13 26 39 52 65 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 30 58 57 36 31 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.15, N = 3 48.26 58.43 59.77 118.45 48.55 MIN: 29.16 / MAX: 69.56 MIN: 31.85 / MAX: 94.55 MIN: 40 / MAX: 88.32 MIN: 68.57 / MAX: 244.71 MIN: 29.96 / MAX: 68.41 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.17, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.25, N = 3 82.80 136.73 133.65 33.74 82.32 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: T5 Encoder - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.06732, N = 3 SE +/- 0.02831, N = 3 SE +/- 0.01065, N = 3 SE +/- 0.03755, N = 3 SE +/- 0.09318, N = 3 7.06097 6.37806 6.57601 6.19876 7.00331 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: T5 Encoder - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 1.35, N = 3 SE +/- 0.69, N = 3 SE +/- 0.25, N = 3 SE +/- 0.97, N = 3 SE +/- 1.92, N = 3 141.63 156.77 152.04 161.31 142.82 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Detection FP32 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.13, N = 3 SE +/- 0.48, N = 3 SE +/- 0.63, N = 3 SE +/- 0.77, N = 3 SE +/- 0.12, N = 3 72.70 90.49 91.34 134.54 73.71 MIN: 40.38 / MAX: 84.07 MIN: 55.81 / MAX: 127.95 MIN: 68.51 / MAX: 134.9 MIN: 98.92 / MAX: 233.45 MIN: 31.23 / MAX: 89.7 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Detection FP32 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.47, N = 3 SE +/- 0.61, N = 3 SE +/- 0.17, N = 3 SE +/- 0.09, N = 3 54.98 88.32 87.50 29.71 54.23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 7.95 9.90 10.18 15.21 8.03 MIN: 5 / MAX: 23.85 MIN: 6.53 / MAX: 37.69 MIN: 5.82 / MAX: 38.39 MIN: 10.08 / MAX: 64.69 MIN: 5.57 / MAX: 23.51 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 0.86, N = 3 SE +/- 1.23, N = 3 SE +/- 1.94, N = 3 SE +/- 1.17, N = 3 SE +/- 0.72, N = 3 991.32 1587.74 1544.55 523.02 980.84 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 14.48 16.73 17.22 36.35 14.59 MIN: 8.91 / MAX: 26.44 MIN: 9.66 / MAX: 30.96 MIN: 10.01 / MAX: 28.19 MIN: 24.38 / MAX: 96.4 MIN: 9.42 / MAX: 26.08 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 100 200 300 400 500 SE +/- 0.50, N = 3 SE +/- 1.94, N = 3 SE +/- 1.58, N = 3 SE +/- 0.28, N = 3 SE +/- 1.13, N = 3 275.72 476.63 462.96 219.86 273.50 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Enhanced EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 50 94 93 49 50 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Noise-Gaussian EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 76 119 116 70 76 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.78 5.12 5.13 9.94 4.78 MIN: 3.18 / MAX: 16.31 MIN: 3.61 / MAX: 15.43 MIN: 3.4 / MAX: 14.3 MIN: 6.2 / MAX: 41.65 MIN: 3.25 / MAX: 16.12 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 0.80, N = 3 SE +/- 2.52, N = 3 SE +/- 4.59, N = 3 SE +/- 0.87, N = 3 SE +/- 2.43, N = 3 833.73 1556.07 1551.75 401.42 834.18 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 17.79 25.15 26.38 39.48 18.02 MIN: 9.38 / MAX: 30.36 MIN: 11.63 / MAX: 50.5 MIN: 13.08 / MAX: 46.13 MIN: 22.02 / MAX: 90.7 MIN: 11.1 / MAX: 33.13 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 SE +/- 1.10, N = 3 SE +/- 0.46, N = 3 SE +/- 0.60, N = 3 SE +/- 0.22, N = 3 SE +/- 1.23, N = 3 224.43 317.25 302.45 101.21 221.52 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.11, N = 3 SE +/- 0.00, N = 3 3.87 4.11 4.11 18.22 3.85 MIN: 2.38 / MAX: 13.89 MIN: 2.45 / MAX: 41.27 MIN: 2.52 / MAX: 16.5 MIN: 10.36 / MAX: 177.58 MIN: 2.35 / MAX: 15.38 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 400 800 1200 1600 2000 SE +/- 0.99, N = 3 SE +/- 0.22, N = 3 SE +/- 0.55, N = 3 SE +/- 2.58, N = 3 SE +/- 0.54, N = 3 1029.88 1938.32 1938.68 437.96 1036.33 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10K 20K 30K 40K 50K SE +/- 62.68, N = 3 SE +/- 29.87, N = 3 SE +/- 13.61, N = 3 SE +/- 115.84, N = 3 SE +/- 61.68, N = 3 43830 20283 20461 48929 44013
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.12, N = 3 SE +/- 0.00, N = 3 19.13 20.09 20.11 45.70 18.90 MIN: 12.57 / MAX: 34 MIN: 13.66 / MAX: 38.02 MIN: 14.53 / MAX: 37.6 MIN: 32.92 / MAX: 117.92 MIN: 13.12 / MAX: 33.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 0.59, N = 3 SE +/- 1.22, N = 3 SE +/- 0.92, N = 3 SE +/- 0.45, N = 3 SE +/- 0.13, N = 3 417.64 795.05 794.33 174.92 422.62 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Speedb Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Fill EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200K 400K 600K 800K 1000K SE +/- 1877.58, N = 3 SE +/- 1278.43, N = 3 SE +/- 826.71, N = 3 SE +/- 5918.01, N = 3 SE +/- 738.85, N = 3 1005973 866759 869816 876130 1011788 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenVINO This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 4.53 4.82 4.83 15.95 4.52 MIN: 2.65 / MAX: 17.47 MIN: 2.78 / MAX: 17.83 MIN: 2.79 / MAX: 19.86 MIN: 9.85 / MAX: 70.28 MIN: 2.64 / MAX: 15.69 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 400 800 1200 1600 2000 SE +/- 0.43, N = 3 SE +/- 1.19, N = 3 SE +/- 0.97, N = 3 SE +/- 1.52, N = 3 SE +/- 0.30, N = 3 879.50 1653.69 1648.32 500.86 882.75 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 20.37 21.89 21.99 53.07 20.34 MIN: 14.74 / MAX: 35.54 MIN: 14.64 / MAX: 40.37 MIN: 14.99 / MAX: 41.87 MIN: 35.4 / MAX: 130.32 MIN: 13.65 / MAX: 36.45 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 160 320 480 640 800 SE +/- 1.00, N = 3 SE +/- 3.18, N = 3 SE +/- 1.88, N = 3 SE +/- 0.33, N = 3 SE +/- 1.21, N = 3 392.23 729.84 726.61 150.64 392.93 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.1408 2.2816 3.4224 4.5632 5.704 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.16 3.33 3.32 5.07 3.13 MIN: 1.77 / MAX: 13.91 MIN: 1.88 / MAX: 15.67 MIN: 1.82 / MAX: 16.13 MIN: 2.36 / MAX: 27.99 MIN: 1.79 / MAX: 14.78 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1000 2000 3000 4000 5000 SE +/- 1.63, N = 3 SE +/- 1.99, N = 3 SE +/- 5.85, N = 3 SE +/- 3.53, N = 3 SE +/- 5.39, N = 3 2495.55 4734.27 4743.73 1559.06 2522.56 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Speedb Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Fill Sync EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50K 100K 150K 200K 250K SE +/- 468.76, N = 3 SE +/- 238.71, N = 3 SE +/- 490.37, N = 3 SE +/- 573.89, N = 3 SE +/- 257.69, N = 3 127955 171623 170365 225607 128855 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenVINO This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.09 0.18 0.27 0.36 0.45 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.28 0.29 0.29 0.40 0.28 MIN: 0.15 / MAX: 7.71 MIN: 0.16 / MAX: 8.63 MIN: 0.16 / MAX: 9.68 MIN: 0.18 / MAX: 8.35 MIN: 0.15 / MAX: 7.24 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11K 22K 33K 44K 55K SE +/- 19.90, N = 3 SE +/- 47.80, N = 3 SE +/- 49.45, N = 3 SE +/- 38.88, N = 3 SE +/- 67.36, N = 3 26230.85 49485.35 49580.07 18507.31 26465.33 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.44 11.36 11.72 16.04 8.57 MIN: 4.55 / MAX: 19.23 MIN: 4.12 / MAX: 36.81 MIN: 4.13 / MAX: 31.49 MIN: 9.49 / MAX: 56.98 MIN: 3.77 / MAX: 23.35 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 150 300 450 600 750 SE +/- 1.08, N = 3 SE +/- 2.53, N = 3 SE +/- 3.49, N = 3 SE +/- 0.14, N = 3 SE +/- 0.22, N = 3 472.57 701.84 680.56 249.02 465.15 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 7 14 21 28 35 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 11.05 11.72 11.75 30.05 10.97 MIN: 5.83 / MAX: 23.56 MIN: 6 / MAX: 26.83 MIN: 5.98 / MAX: 26.57 MIN: 17.38 / MAX: 87.99 MIN: 5.79 / MAX: 25.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 0.13, N = 3 SE +/- 1.37, N = 3 SE +/- 1.41, N = 3 SE +/- 0.54, N = 3 SE +/- 0.11, N = 3 723.13 1362.00 1359.29 265.83 728.18 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Speedb Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Update Random EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200K 400K 600K 800K 1000K SE +/- 1312.04, N = 3 SE +/- 1173.38, N = 3 SE +/- 726.71, N = 3 SE +/- 1460.51, N = 3 SE +/- 819.19, N = 3 553791 623366 624743 828184 556040 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenVINO This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.225 0.45 0.675 0.9 1.125 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.40 0.41 0.41 1.00 0.40 MIN: 0.2 / MAX: 7.69 MIN: 0.21 / MAX: 10.02 MIN: 0.21 / MAX: 10.47 MIN: 0.4 / MAX: 10.73 MIN: 0.2 / MAX: 8.52 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 7K 14K 21K 28K 35K SE +/- 8.32, N = 3 SE +/- 16.51, N = 3 SE +/- 30.10, N = 3 SE +/- 31.71, N = 3 SE +/- 3.66, N = 3 18615.12 34815.39 34850.47 7499.47 18754.59 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 5.63 5.96 5.96 7.78 5.59 MIN: 2.98 / MAX: 16.09 MIN: 3.08 / MAX: 18.52 MIN: 3.09 / MAX: 18.13 MIN: 3.64 / MAX: 44.07 MIN: 2.93 / MAX: 16.72 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16-INT8 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 600 1200 1800 2400 3000 SE +/- 0.60, N = 3 SE +/- 4.11, N = 3 SE +/- 3.06, N = 3 SE +/- 2.31, N = 3 SE +/- 0.44, N = 3 1417.33 2672.51 2669.64 1022.10 1427.32 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.4035 0.807 1.2105 1.614 2.0175 SE +/- 0.01555, N = 3 SE +/- 0.01197, N = 3 SE +/- 0.00633, N = 3 SE +/- 0.01234, N = 3 SE +/- 0.00719, N = 3 1.47531 1.20504 1.19855 1.79347 1.46770 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 7.12, N = 3 SE +/- 8.14, N = 3 SE +/- 4.40, N = 3 SE +/- 3.86, N = 3 SE +/- 3.38, N = 3 677.58 829.19 833.52 557.41 680.95 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
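For the ONNX Runtime results, the "Inference Time Cost (ms)" and "Inferences Per Second" charts describe the same runs from two directions. A minimal sketch of the conversion, assuming one inference is processed at a time so that throughput is simply 1000 divided by the per-inference time in milliseconds, is shown here with the CaffeNet 12-int8 Parallel numbers above. The helper name is illustrative; small differences from the reported values are expected, since each chart is presumably averaged over the runs separately.

```python
# Sketch: cross-check ONNX Runtime latency (ms) against reported throughput.
# Values are copied from the CaffeNet 12-int8 / Parallel charts above.

latency_ms = {
    "EPYC 4364P @ DDR5-5200": 1.47531,
    "EPYC 4564P @ DDR5-5200": 1.20504,
    "EPYC 4564P @ DDR5-4800": 1.19855,
    "Xeon E-2488 @ DDR5-4800": 1.79347,
    "EPYC 4364P @ DDR5-4800": 1.46770,
}

reported_ips = {
    "EPYC 4364P @ DDR5-5200": 677.58,
    "EPYC 4564P @ DDR5-5200": 829.19,
    "EPYC 4564P @ DDR5-4800": 833.52,
    "Xeon E-2488 @ DDR5-4800": 557.41,
    "EPYC 4364P @ DDR5-4800": 680.95,
}


def inferences_per_second(ms_per_inference: float) -> float:
    """Convert a per-inference time in milliseconds to inferences per second."""
    return 1000.0 / ms_per_inference


for config, ms in latency_ms.items():
    derived = inferences_per_second(ms)
    reported = reported_ips[config]
    rel_diff = abs(derived - reported) / reported
    print(f"{config}: derived {derived:.2f}, reported {reported:.2f}, "
          f"difference {rel_diff:.2%}")
```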
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.3827 0.7654 1.1481 1.5308 1.9135 SE +/- 0.00807, N = 3 SE +/- 0.00549, N = 3 SE +/- 0.00557, N = 3 SE +/- 0.01304, N = 3 SE +/- 0.00355, N = 3 1.47738 1.11577 1.11712 1.70107 1.48191 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200 400 600 800 1000 SE +/- 3.70, N = 3 SE +/- 4.41, N = 3 SE +/- 4.40, N = 3 SE +/- 4.53, N = 3 SE +/- 1.59, N = 3 676.65 895.77 894.66 587.73 674.51 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenVINO This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 2.21 2.37 2.37 9.76 2.20 MIN: 1.23 / MAX: 13.12 MIN: 1.27 / MAX: 12.75 MIN: 1.26 / MAX: 11.94 MIN: 4.34 / MAX: 46.89 MIN: 1.23 / MAX: 12.5 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16 - Device: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 700 1400 2100 2800 3500 SE +/- 0.66, N = 3 SE +/- 6.25, N = 3 SE +/- 6.99, N = 3 SE +/- 1.96, N = 3 SE +/- 1.03, N = 3 1781.62 3269.12 3283.98 816.17 1791.06 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
John The Ripper This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 900K 1800K 2700K 3600K 4500K SE +/- 7423.69, N = 3 SE +/- 11590.23, N = 3 SE +/- 19341.95, N = 3 SE +/- 2504.83, N = 3 SE +/- 10837.18, N = 3 2234333 4118000 4066667 919791 2230333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Swirl EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 50 100 150 200 250 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 139 250 246 130 138 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Rotate EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 SE +/- 0.33, N = 3 SE +/- 1.53, N = 3 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 168 167 161 317 169 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
Speedb Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Read Random Write Random EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 600K 1200K 1800K 2400K 3000K SE +/- 1406.43, N = 3 SE +/- 2327.92, N = 3 SE +/- 5448.80, N = 3 SE +/- 4914.43, N = 3 SE +/- 1481.36, N = 3 2138776 2770195 2770899 2120915 2144575 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: HWB Color Space EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.88, N = 3 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 255 278 269 315 244 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
Speedb Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Read While Writing EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 14207.26, N = 3 SE +/- 26941.48, N = 3 SE +/- 22040.69, N = 3 SE +/- 20209.24, N = 3 SE +/- 13925.00, N = 3 3194971 5119891 5146187 2548837 3212276 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Resizing EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 SE +/- 0.00, N = 3 SE +/- 1.00, N = 3 SE +/- 0.88, N = 3 SE +/- 1.86, N = 3 SE +/- 0.00, N = 3 255 338 332 244 251 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Random Fill EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 400K 800K 1200K 1600K 2000K SE +/- 4512.41, N = 3 SE +/- 3781.08, N = 3 SE +/- 2055.67, N = 3 SE +/- 10692.91, N = 3 SE +/- 1270.91, N = 3 1159865 1225921 1229256 1647988 1154716 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1.0486 2.0972 3.1458 4.1944 5.243 SE +/- 0.00856, N = 3 SE +/- 0.02511, N = 3 SE +/- 0.00459, N = 3 SE +/- 0.03176, N = 3 SE +/- 0.01507, N = 3 3.09905 2.62524 2.58704 4.66040 3.07441 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 80 160 240 320 400 SE +/- 0.89, N = 3 SE +/- 3.68, N = 3 SE +/- 0.69, N = 3 SE +/- 1.45, N = 3 SE +/- 1.58, N = 3 322.61 380.82 386.41 214.55 325.21 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Random Fill Sync EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40K 80K 120K 160K 200K SE +/- 48.85, N = 3 SE +/- 270.54, N = 3 SE +/- 414.30, N = 3 SE +/- 710.34, N = 3 SE +/- 196.51, N = 3 99746 171199 170735 201594 98054 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Overwrite EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 400K 800K 1200K 1600K 2000K SE +/- 1465.82, N = 3 SE +/- 1555.52, N = 3 SE +/- 1384.95, N = 3 SE +/- 14969.35, N = 3 SE +/- 1933.24, N = 3 1152820 1225361 1229747 1653489 1155121 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Update Random EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200K 400K 600K 800K 1000K SE +/- 748.73, N = 3 SE +/- 1578.78, N = 3 SE +/- 818.39, N = 3 SE +/- 2978.74, N = 3 SE +/- 1914.75, N = 3 618187 700416 703294 908300 609510 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Speedb Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Read EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30M 60M 90M 120M 150M SE +/- 34435.41, N = 3 SE +/- 286737.26, N = 3 SE +/- 185772.57, N = 3 SE +/- 170759.02, N = 3 SE +/- 103627.09, N = 3 71648683 134971626 135603493 57824426 72087947 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Read While Writing EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 900K 1800K 2700K 3600K 4500K SE +/- 22460.23, N = 3 SE +/- 7045.87, N = 3 SE +/- 13356.72, N = 3 SE +/- 3646.67, N = 3 SE +/- 15255.97, N = 3 2519594 4294631 4368397 1942760 2476441 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Read Random Write Random EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 600K 1200K 1800K 2400K 3000K SE +/- 5891.62, N = 3 SE +/- 24549.22, N = 3 SE +/- 24632.94, N = 3 SE +/- 17144.87, N = 3 SE +/- 2242.91, N = 3 2131122 2915758 2912747 1976533 2137892 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inference Time Cost (ms), Fewer Is Better ONNX Runtime 1.17 Model: super-resolution-10 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 3 6 9 12 15 SE +/- 0.02135, N = 3 SE +/- 0.06152, N = 3 SE +/- 0.02431, N = 3 SE +/- 0.09981, N = 3 SE +/- 0.14350, N = 3 10.91820 7.50080 7.55840 13.17920 10.73790 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.org Inferences Per Second, More Is Better ONNX Runtime 1.17 Model: super-resolution-10 - Device: CPU - Executor: Parallel EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 1.10, N = 3 SE +/- 0.42, N = 3 SE +/- 0.57, N = 3 SE +/- 1.26, N = 3 91.59 133.32 132.29 75.88 93.16 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
John The Ripper This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40M 80M 120M 160M 200M SE +/- 411753.30, N = 3 SE +/- 580393.26, N = 3 SE +/- 877445.98, N = 3 SE +/- 518797.01, N = 3 SE +/- 719862.56, N = 3 116770333 181331000 178283333 40721000 117065667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Random Read EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30M 60M 90M 120M 150M SE +/- 144351.01, N = 3 SE +/- 780691.20, N = 3 SE +/- 342858.26, N = 3 SE +/- 236765.68, N = 3 SE +/- 112684.33, N = 3 69408254 130693063 131218639 47512260 69366631 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 8 16 24 32 40 SE +/- 0.15, N = 15 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 SE +/- 0.21, N = 4 17.61 34.65 34.13 19.48 17.73 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 15 30 45 60 75 SE +/- 0.35, N = 3 SE +/- 0.28, N = 3 SE +/- 0.10, N = 3 SE +/- 0.41, N = 3 SE +/- 0.66, N = 3 63.87 39.56 39.92 67.11 57.79 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 512 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4M 8M 12M 16M 20M SE +/- 250051.55, N = 3 SE +/- 410224.76, N = 15 SE +/- 154682.83, N = 3 SE +/- 13650.40, N = 3 SE +/- 17835.67, N = 3 19611667 18964400 19299333 20255000 20106667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Intel Open Image Denoise Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.2 Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.1935 0.387 0.5805 0.774 0.9675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.48 0.86 0.85 0.42 0.48
OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.2 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.1935 0.387 0.5805 0.774 0.9675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.48 0.86 0.85 0.42 0.48
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9K 18K 27K 36K 45K SE +/- 127.68, N = 3 SE +/- 19.75, N = 3 SE +/- 24.89, N = 3 SE +/- 172.31, N = 3 SE +/- 82.83, N = 3 37900 17285 17491 42275 38010
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9K 18K 27K 36K 45K SE +/- 43.80, N = 3 SE +/- 33.74, N = 3 SE +/- 11.15, N = 3 SE +/- 112.38, N = 3 SE +/- 115.78, N = 3 37458 17116 17247 41514 37677
Apache Hadoop This is a benchmark of Apache Hadoop making use of its built-in name-node throughput benchmark (NNThroughputBenchmark). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 100 - Files: 100000 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 670.20, N = 11 SE +/- 376.90, N = 3 SE +/- 292.68, N = 3 SE +/- 1027.34, N = 3 SE +/- 496.20, N = 3 90938 75381 74946 88992 90476
Node.js V8 Web Tooling Benchmark This test runs the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers, such as Babel, TypeScript, and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 22.14 20.81 20.06 23.30 21.89
Z3 Theorem Prover The Z3 Theorem Prover / SMT solver is developed by Microsoft Research under the MIT license. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Z3 Theorem Prover 4.12.1 SMT File: 2.smt2 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 14 28 42 56 70 SE +/- 0.27, N = 3 SE +/- 0.53, N = 3 SE +/- 0.35, N = 3 SE +/- 0.51, N = 3 SE +/- 0.21, N = 3 48.68 48.88 49.00 63.08 48.12 1. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 512 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9M 18M 27M 36M 45M SE +/- 368485.05, N = 12 SE +/- 254700.04, N = 3 SE +/- 184763.21, N = 3 SE +/- 14666.67, N = 3 SE +/- 267690.74, N = 3 39094333 38712667 38605667 40821333 39693000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Cpuminer-Opt Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Myriad-Groestl EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2K 4K 6K 8K 10K SE +/- 6.11, N = 3 SE +/- 13.33, N = 3 SE +/- 195.61, N = 12 SE +/- 30.23, N = 3 SE +/- 0.86, N = 3 5972.09 11337.00 11228.13 4843.72 6023.38 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
libavif avifenc This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 14 28 42 56 70 SE +/- 0.13, N = 3 SE +/- 0.03, N = 3 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 SE +/- 0.14, N = 3 51.94 36.33 36.34 61.84 51.38 1. (CXX) g++ options: -O3 -fPIC -lm
m-queens A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.2 Time To Solve EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 16 32 48 64 80 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.62, N = 3 SE +/- 0.05, N = 3 53.50 27.91 27.82 72.85 53.08 1. (CXX) g++ options: -fopenmp -O2 -march=native
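For readers unfamiliar with the workload, the N-queens problem asks how many ways N queens can be placed on an N x N board so that no two attack each other. The following is a minimal single-threaded sketch of a bitmask backtracking counter in Python. It is an illustration only and is not the m-queens source, which is a C++/OpenMP program; the function name count_n_queens is hypothetical.

```python
def count_n_queens(n: int) -> int:
    """Count placements of n mutually non-attacking queens on an n x n board.

    Illustrative bitmask backtracking; NOT the m-queens implementation.
    """
    full = (1 << n) - 1  # one bit per column; all bits set means every row is filled

    def place(cols: int, diag_l: int, diag_r: int) -> int:
        # cols, diag_l, diag_r mark squares in the current row that are attacked
        # by queens placed in earlier rows (by column and by each diagonal).
        if cols == full:
            return 1
        total = 0
        free = full & ~(cols | diag_l | diag_r)
        while free:
            bit = free & -free  # lowest free square in this row
            free -= bit
            total += place(
                cols | bit,
                ((diag_l | bit) << 1) & full,  # diagonals shift one column per row
                (diag_r | bit) >> 1,
            )
        return total

    return place(0, 0, 0)


if __name__ == "__main__":
    for size in range(4, 9):
        print(size, count_n_queens(size))
```

The work grows very quickly with board size, and the branches below the first row are independent of each other, which is why solvers of this kind tend to scale well with core count.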
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Obj EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.02, N = 3 15.69 27.39 26.88 14.55 15.57 MIN: 15.56 / MAX: 16.03 MIN: 27.11 / MAX: 28.25 MIN: 26.58 / MAX: 27.83 MIN: 14.14 / MAX: 16.68 MIN: 15.43 / MAX: 15.99
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 200K 400K 600K 800K 1000K SE +/- 1076.82, N = 3 SE +/- 9073.63, N = 3 SE +/- 2950.54, N = 3 SE +/- 3256.58, N = 15 SE +/- 1288.71, N = 3 499103.50 951674.51 954988.73 430017.46 505633.42 1. (CC) gcc options: -O2 -lrt" -lrt
RawTherapee RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11 22 33 44 55 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 46.66 42.55 42.79 42.66 46.73 1. RawTherapee, version 5.10, command line.
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 12.28 23.10 22.86 9.16 12.31 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.49 22.02 21.80 9.28 12.48 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Renaissance Renaissance is a suite of benchmarks designed to test the JVM, with workloads ranging from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Savina Reactors.IO EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 800 1600 2400 3200 4000 SE +/- 28.17, N = 3 SE +/- 15.03, N = 3 SE +/- 33.81, N = 3 SE +/- 16.31, N = 3 SE +/- 27.86, N = 3 3042.0 3265.2 3270.6 3695.4 3036.5 MIN: 3011.87 / MAX: 4321.7 MIN: 3236.75 / MAX: 4741.82 MIN: 3216.47 / MAX: 4638.43 MIN: 3664.8 / MAX: 5363.87 MIN: 2985.79 / MAX: 4135.64
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 13.14 17.44 17.32 11.77 13.08 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
JPEG-XL libjxl The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: JPEG - Quality: 90 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11 22 33 44 55 SE +/- 0.30, N = 3 SE +/- 0.37, N = 10 SE +/- 0.27, N = 3 SE +/- 0.09, N = 3 SE +/- 0.20, N = 3 37.18 48.59 46.77 32.51 36.32 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30M 60M 90M 120M 150M SE +/- 612490.36, N = 3 SE +/- 292137.71, N = 3 SE +/- 1579485.68, N = 4 SE +/- 1116779.09, N = 8 SE +/- 368435.13, N = 3 150576667 149903333 147935000 130002500 151906667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
FFmpeg This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Live EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.17, N = 3 SE +/- 0.71, N = 3 SE +/- 0.58, N = 3 SE +/- 1.39, N = 3 SE +/- 0.16, N = 3 179.99 186.42 186.71 178.15 179.98 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Llama.cpp Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-7b.Q4_0.gguf EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 SE +/- 0.02, N = 3 SE +/- 0.18, N = 4 SE +/- 0.12, N = 6 12.54 13.82 13.12 16.07 12.73 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
Java SciMark This test runs the Java version of SciMark 2, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This benchmark is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.2 Computational Test: Composite EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1200 2400 3600 4800 6000 SE +/- 42.37, N = 3 SE +/- 35.75, N = 3 SE +/- 46.58, N = 8 SE +/- 52.94, N = 3 SE +/- 56.78, N = 3 5476.91 5441.20 5321.50 4311.80 5519.55
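As an illustration of one of the kernels named above, here is a minimal sketch of successive over-relaxation on a 2D grid in Python. It is not the SciMark source: the function name sor_sweeps, the default relaxation factor of 1.25, and the list-of-lists grid are assumptions made for the example. Each interior cell is replaced by a weighted blend of the average of its four neighbours and its own previous value, updating in place.

```python
def sor_sweeps(grid, omega=1.25, iterations=1):
    """Run in-place successive over-relaxation sweeps over the interior of a 2D grid.

    Illustrative sketch only; omega=1.25 is an assumed relaxation factor.
    Boundary cells are left untouched and act as fixed boundary conditions.
    """
    rows, cols = len(grid), len(grid[0])
    for _ in range(iterations):
        for i in range(1, rows - 1):
            for j in range(1, cols - 1):
                neighbours = (
                    grid[i - 1][j] + grid[i + 1][j] + grid[i][j - 1] + grid[i][j + 1]
                )
                # Blend the neighbour average with the old value; omega=1.0
                # reduces this to a plain Gauss-Seidel update.
                grid[i][j] = omega * 0.25 * neighbours + (1.0 - omega) * grid[i][j]
    return grid


if __name__ == "__main__":
    # Hot top edge, cold everywhere else.
    demo = [[0.0] * 6 for _ in range(6)]
    demo[0] = [1.0] * 6
    sor_sweeps(demo, iterations=50)
    for row in demo:
        print(" ".join(f"{v:.3f}" for v in row))
```

The inner loop is a handful of floating-point additions and multiplications per cell with regular memory access, so a kernel of this shape mostly exercises single-thread floating-point throughput and cache behaviour.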
Renaissance Renaissance is a suite of benchmarks designed to test the JVM, with workloads ranging from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Finagle HTTP Requests EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 500 1000 1500 2000 2500 SE +/- 22.75, N = 3 SE +/- 15.93, N = 3 SE +/- 14.49, N = 3 SE +/- 25.68, N = 12 SE +/- 7.85, N = 3 1963.8 2300.7 2288.1 1668.7 1925.0 MIN: 1776.88 / MAX: 2143.78 MIN: 2117.57 / MAX: 2419.92 MIN: 2126.73 / MAX: 2373.63 MIN: 1155.42 / MAX: 1849.98 MIN: 1772.86 / MAX: 2146.59
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 16.46 31.37 30.79 13.39 16.33 MIN: 16.24 / MAX: 16.97 MIN: 30.96 / MAX: 32.3 MIN: 30.28 / MAX: 31.99 MIN: 13.27 / MAX: 16.49 MIN: 16.12 / MAX: 16.74
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Multi-Threaded EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 49.82, N = 3 SE +/- 169.22, N = 3 SE +/- 103.02, N = 3 SE +/- 294.52, N = 3 SE +/- 56.56, N = 3 43490.7 83365.1 83133.5 37214.4 44058.2 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 0.6274 1.2548 1.8822 2.5096 3.137 SE +/- 0.00212, N = 3 SE +/- 0.03311, N = 4 SE +/- 0.01282, N = 3 SE +/- 0.00164, N = 3 SE +/- 0.00243, N = 3 1.73764 2.78846 2.74999 0.65977 1.73454
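The raw figures in a result line are easier to compare once normalized against one system. The sketch below, in Python, does that for the NAMD numbers above. It assumes the five means and the five SE values are listed in the same order as the five system labels, and it propagates the standard errors to first order as if the runs were independent. The names RESULTS and relative_to are hypothetical helpers, not part of the Phoronix Test Suite.

```python
from math import sqrt

# (mean ns/day, standard error) copied from the NAMD result line above,
# assuming values appear in the same order as the system labels.
RESULTS = {
    "EPYC 4364P @ DDR5-5200": (1.73764, 0.00212),
    "EPYC 4564P @ DDR5-5200": (2.78846, 0.03311),
    "EPYC 4564P @ DDR5-4800": (2.74999, 0.01282),
    "Xeon E-2488 @ DDR5-4800": (0.65977, 0.00164),
    "EPYC 4364P @ DDR5-4800": (1.73454, 0.00243),
}


def relative_to(results, baseline):
    """Return {system: (ratio, ratio_se)} for a more-is-better metric."""
    base_mean, base_se = results[baseline]
    out = {}
    for name, (mean, se) in results.items():
        if name == baseline:
            out[name] = (1.0, 0.0)  # a system compared with itself
            continue
        ratio = mean / base_mean
        # First-order error propagation for a quotient, assuming the two
        # measurements are independent (an assumption, not stated in the source).
        ratio_se = ratio * sqrt((se / mean) ** 2 + (base_se / base_mean) ** 2)
        out[name] = (ratio, ratio_se)
    return out


if __name__ == "__main__":
    for name, (ratio, se) in relative_to(RESULTS, "Xeon E-2488 @ DDR5-4800").items():
        print(f"{name:26s} {ratio:5.2f}x +/- {se:.2f}")
```

With the Xeon E-2488 as the baseline, the EPYC 4564P at DDR5-5200 comes out at roughly 4.2 times the Xeon's ns/day in this test. For fewer-is-better metrics such as seconds or milliseconds, the ratio would need to be inverted before it is read as a speedup.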
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 32 - Model: GoogLeNet EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 93.71 135.30 131.79 78.79 92.82
miniFE MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 2K 4K 6K 8K 10K SE +/- 3.31, N = 3 SE +/- 8.13, N = 3 SE +/- 16.40, N = 3 SE +/- 8.52, N = 3 SE +/- 7.52, N = 3 7325.10 7673.66 7330.07 9869.74 7278.87 1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 18.45 32.22 31.58 16.58 18.35 MIN: 18.3 / MAX: 18.87 MIN: 31.88 / MAX: 33.23 MIN: 31.27 / MAX: 32.45 MIN: 16.29 / MAX: 19.34 MIN: 18.21 / MAX: 18.8
Apache Hadoop This is a benchmark of Apache Hadoop making use of its built-in name-node throughput benchmark (NNThroughputBenchmark). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 500 - Files: 100000 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 17K 34K 51K 68K 85K SE +/- 465.56, N = 3 SE +/- 682.82, N = 3 SE +/- 488.73, N = 3 SE +/- 717.72, N = 3 SE +/- 761.92, N = 3 77585 70436 71080 76956 76371
Timed Wasmer Compilation This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and Emscripten. This test profile builds Wasmer with the Cranelift and Singlepass compiler features enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 2.3 Time To Compile EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9 18 27 36 45 SE +/- 0.23, N = 3 SE +/- 0.13, N = 3 SE +/- 0.10, N = 3 SE +/- 0.23, N = 3 SE +/- 0.18, N = 3 36.90 29.44 29.89 38.13 36.55 1. (CC) gcc options: -m64 -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc -pie -nodefaultlibs
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: System V Message Passing EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 4M 8M 12M 16M 20M SE +/- 7976.77, N = 3 SE +/- 18408.29, N = 3 SE +/- 20135.83, N = 3 SE +/- 185878.60, N = 5 SE +/- 6257.32, N = 3 12445240.76 16150369.04 16207347.12 17858353.80 12527831.36 1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
Cpuminer-Opt Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Ringcoin EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 800 1600 2400 3200 4000 SE +/- 4.79, N = 3 SE +/- 20.59, N = 3 SE +/- 37.02, N = 5 SE +/- 3.73, N = 3 SE +/- 11.68, N = 3 1851.25 3500.13 3541.50 1437.47 1868.89 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Eclipse EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 1500 3000 4500 6000 7500 SE +/- 37.34, N = 3 SE +/- 26.61, N = 3 SE +/- 28.45, N = 3 SE +/- 61.34, N = 8 SE +/- 12.50, N = 3 6410 6587 6627 6943 6483
miniBUDE MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11 22 33 44 55 SE +/- 0.10, N = 3 SE +/- 0.03, N = 4 SE +/- 0.17, N = 4 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 26.29 49.95 49.81 12.45 26.37 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 300 600 900 1200 1500 SE +/- 2.56, N = 3 SE +/- 0.84, N = 4 SE +/- 4.15, N = 4 SE +/- 1.56, N = 3 SE +/- 2.35, N = 3 657.35 1248.81 1245.30 311.31 659.13 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20 40 60 80 100 SE +/- 0.56, N = 5 SE +/- 1.42, N = 15 SE +/- 1.37, N = 15 SE +/- 0.47, N = 15 SE +/- 0.50, N = 5 61.90 100.00 97.12 59.56 61.45 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: AlexNet EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 70 140 210 280 350 SE +/- 0.21, N = 3 SE +/- 0.34, N = 3 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 SE +/- 0.21, N = 3 206.40 322.53 316.11 173.78 205.32
Cpuminer-Opt Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Magi EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 140 280 420 560 700 SE +/- 0.30, N = 3 SE +/- 0.26, N = 3 SE +/- 7.55, N = 4 SE +/- 0.39, N = 3 SE +/- 0.10, N = 3 330.62 633.44 644.74 278.09 332.40 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
JPEG-XL libjxl The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: PNG - Quality: 90 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 11 22 33 44 55 SE +/- 0.36, N = 3 SE +/- 0.28, N = 3 SE +/- 0.68, N = 3 SE +/- 0.19, N = 3 SE +/- 0.21, N = 3 36.02 48.53 47.23 31.81 36.04 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 18.77 23.19 23.00 15.46 18.90 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240325 Test: PUSCH Processor Benchmark, Throughput Thread EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 40 80 120 160 200 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 SE +/- 7.12, N = 15 SE +/- 0.00, N = 4 183.6 183.5 183.6 153.6 183.5 -mavx512f -mavx512cd -mavx512bw -mavx512dq - MIN: 106 -mavx512f -mavx512cd -mavx512bw -mavx512dq - MIN: 105.9 -mavx512f -mavx512cd -mavx512bw -mavx512dq - MIN: 106 MIN: 105.1 / MAX: 182.1 -mavx512f -mavx512cd -mavx512bw -mavx512dq - MIN: 105.9 1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -fno-trapping-math -fno-math-errno -ldl
Aircrack-ng Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.7 EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 20K 40K 60K 80K 100K SE +/- 45.75, N = 3 SE +/- 153.97, N = 3 SE +/- 194.99, N = 3 SE +/- 17.95, N = 3 SE +/- 65.97, N = 3 48527.82 92259.95 92030.35 35441.71 48803.00 1. (CXX) g++ options: -std=gnu++17 -O3 -fvisibility=hidden -fcommon -rdynamic -lnl-3 -lnl-genl-3 -lpcre -lsqlite3 -lpthread -lz -lssl -lcrypto -lhwloc -ldl -lm -pthread
John The Ripper This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 30K 60K 90K 120K 150K SE +/- 221.78, N = 3 SE +/- 425.10, N = 3 SE +/- 573.94, N = 3 SE +/- 135.70, N = 3 SE +/- 245.32, N = 3 85532 160664 160805 29560 85712 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: bcrypt EPYC 4364P @ DDR5-5200 EPYC 4564P @ DDR5-5200 EPYC 4564P @ DDR5-4800 Xeon E-2488 @ DDR5-4800 EPYC 4364P @ DDR5-4800 9K 18K 27K 36K 45K SE +/- 43.32, N = 3 SE +/- 104.20, N = 3 SE +/- 61.43, N = 3 SE +/- 127.68, N = 3 SE +/- 45.41, N = 3 23235 43695 43512 18787 23271 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org: John The Ripper 2023.03.14, Test: Blowfish (Real C/S, More Is Better)
EPYC 4364P @ DDR5-5200: 23243 (SE +/- 44.47, N = 3)
EPYC 4564P @ DDR5-5200: 43795 (SE +/- 39.87, N = 3)
EPYC 4564P @ DDR5-4800: 42924 (SE +/- 344.23, N = 3)
Xeon E-2488 @ DDR5-4800: 18841 (SE +/- 68.54, N = 3)
EPYC 4364P @ DDR5-4800: 23174 (SE +/- 121.86, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 16 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 211670000 (SE +/- 75055.53, N = 3)
EPYC 4564P @ DDR5-5200: 292390000 (SE +/- 520000.00, N = 3)
EPYC 4564P @ DDR5-4800: 286960000 (SE +/- 2047567.66, N = 3)
Xeon E-2488 @ DDR5-4800: 159270000 (SE +/- 1245404.89, N = 3)
EPYC 4364P @ DDR5-4800: 212960000 (SE +/- 176162.80, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Cpuminer-Opt Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: Garlicoin (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 1674.99 (SE +/- 1.43, N = 3)
EPYC 4564P @ DDR5-5200: 3179.32 (SE +/- 4.61, N = 3)
EPYC 4564P @ DDR5-4800: 3196.42 (SE +/- 1.95, N = 3)
Xeon E-2488 @ DDR5-4800: 1356.31 (SE +/- 4.51, N = 3)
EPYC 4364P @ DDR5-4800: 1691.29 (SE +/- 4.36, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Stress-NG 0.16.04, Test: Vector Floating Point (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 30936.50 (SE +/- 415.80, N = 3)
EPYC 4564P @ DDR5-5200: 59037.14 (SE +/- 261.75, N = 3)
EPYC 4564P @ DDR5-4800: 59053.01 (SE +/- 131.10, N = 3)
Xeon E-2488 @ DDR5-4800: 28846.06 (SE +/- 378.33, N = 3)
EPYC 4364P @ DDR5-4800: 30715.14 (SE +/- 404.29, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Stress-NG 0.16.04, Test: Matrix 3D Math (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 2029.45 (SE +/- 2.55, N = 3)
EPYC 4564P @ DDR5-5200: 2011.17 (SE +/- 0.58, N = 3)
EPYC 4564P @ DDR5-4800: 1924.70 (SE +/- 1.97, N = 3)
Xeon E-2488 @ DDR5-4800: 2778.89 (SE +/- 0.40, N = 3)
EPYC 4364P @ DDR5-4800: 1953.52 (SE +/- 3.38, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 4 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 77508667 (SE +/- 342214.52, N = 3)
EPYC 4564P @ DDR5-5200: 76809000 (SE +/- 190528.21, N = 3)
EPYC 4564P @ DDR5-4800: 76415667 (SE +/- 302227.47, N = 3)
Xeon E-2488 @ DDR5-4800: 79553667 (SE +/- 493083.61, N = 3)
EPYC 4364P @ DDR5-4800: 77811000 (SE +/- 444695.78, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org: Stress-NG 0.16.04, Test: Floating Point (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 4251.39 (SE +/- 6.00, N = 3)
EPYC 4564P @ DDR5-5200: 8199.65 (SE +/- 15.20, N = 3)
EPYC 4564P @ DDR5-4800: 8242.96 (SE +/- 8.60, N = 3)
Xeon E-2488 @ DDR5-4800: 4740.28 (SE +/- 7.41, N = 3)
EPYC 4364P @ DDR5-4800: 4283.48 (SE +/- 2.36, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Stress-NG 0.16.04, Test: Vector Shuffle (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 8556.72 (SE +/- 2.63, N = 3)
EPYC 4564P @ DDR5-5200: 16452.47 (SE +/- 12.57, N = 3)
EPYC 4564P @ DDR5-4800: 16537.83 (SE +/- 12.64, N = 3)
Xeon E-2488 @ DDR5-4800: 9138.29 (SE +/- 104.90, N = 3)
EPYC 4364P @ DDR5-4800: 8606.94 (SE +/- 3.63, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Stress-NG 0.16.04, Test: Fused Multiply-Add (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 29927440.69 (SE +/- 252666.10, N = 3)
EPYC 4564P @ DDR5-5200: 56410032.25 (SE +/- 500794.58, N = 3)
EPYC 4564P @ DDR5-4800: 57429956.15 (SE +/- 59842.82, N = 3)
Xeon E-2488 @ DDR5-4800: 29762395.57 (SE +/- 267691.10, N = 3)
EPYC 4364P @ DDR5-4800: 30389834.82 (SE +/- 28572.99, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 2 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 107146667 (SE +/- 13333.33, N = 3)
EPYC 4564P @ DDR5-5200: 103696667 (SE +/- 282921.27, N = 3)
EPYC 4564P @ DDR5-4800: 103326667 (SE +/- 14529.66, N = 3)
Xeon E-2488 @ DDR5-4800: 127376667 (SE +/- 724829.48, N = 3)
EPYC 4364P @ DDR5-4800: 107140000 (SE +/- 15275.25, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: scrypt (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 162.88 (SE +/- 0.10, N = 3)
EPYC 4564P @ DDR5-5200: 308.21 (SE +/- 1.34, N = 3)
EPYC 4564P @ DDR5-4800: 310.84 (SE +/- 1.89, N = 3)
Xeon E-2488 @ DDR5-4800: 119.86 (SE +/- 0.08, N = 3)
EPYC 4364P @ DDR5-4800: 163.88 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org: Stress-NG 0.16.04, Test: Wide Vector Math (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 413926.82 (SE +/- 205.90, N = 3)
EPYC 4564P @ DDR5-5200: 778238.33 (SE +/- 1811.04, N = 3)
EPYC 4564P @ DDR5-4800: 781906.97 (SE +/- 1012.30, N = 3)
Xeon E-2488 @ DDR5-4800: 576480.15 (SE +/- 373.64, N = 3)
EPYC 4364P @ DDR5-4800: 416118.50 (SE +/- 313.30, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 4 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 209743333 (SE +/- 325695.84, N = 3)
EPYC 4564P @ DDR5-5200: 205890000 (SE +/- 674487.46, N = 3)
EPYC 4564P @ DDR5-4800: 206306667 (SE +/- 452413.28, N = 3)
Xeon E-2488 @ DDR5-4800: 253450000 (SE +/- 17320.51, N = 3)
EPYC 4364P @ DDR5-4800: 212860000 (SE +/- 1006495.57, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 16 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 735566667 (SE +/- 812656.82, N = 3)
EPYC 4564P @ DDR5-5200: 795966667 (SE +/- 67412.49, N = 3)
EPYC 4564P @ DDR5-4800: 787633333 (SE +/- 1556216.92, N = 3)
Xeon E-2488 @ DDR5-4800: 711050000 (SE +/- 5153390.47, N = 3)
EPYC 4364P @ DDR5-4800: 740166667 (SE +/- 199192.82, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 8 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 409386667 (SE +/- 388858.73, N = 3)
EPYC 4564P @ DDR5-5200: 404976667 (SE +/- 2535286.53, N = 3)
EPYC 4564P @ DDR5-4800: 404010000 (SE +/- 1946723.74, N = 3)
Xeon E-2488 @ DDR5-4800: 439150000 (SE +/- 2917967.33, N = 3)
EPYC 4364P @ DDR5-4800: 411290000 (SE +/- 1727319.70, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 1 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
EPYC 4364P @ DDR5-5200: 53659000 (SE +/- 2645.75, N = 3)
EPYC 4564P @ DDR5-5200: 53018667 (SE +/- 318365.79, N = 3)
EPYC 4564P @ DDR5-4800: 52493000 (SE +/- 469067.51, N = 3)
Xeon E-2488 @ DDR5-4800: 63236667 (SE +/- 637003.49, N = 3)
EPYC 4364P @ DDR5-4800: 53661333 (SE +/- 881.92, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: Deepcoin (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 4377.74 (SE +/- 3.02, N = 3)
EPYC 4564P @ DDR5-5200: 8356.74 (SE +/- 5.77, N = 3)
EPYC 4564P @ DDR5-4800: 8421.15 (SE +/- 3.98, N = 3)
Xeon E-2488 @ DDR5-4800: 3294.49 (SE +/- 36.27, N = 3)
EPYC 4364P @ DDR5-4800: 4405.85 (SE +/- 3.81, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org: Stress-NG 0.16.04, Test: AVX-512 VNNI (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 1371242.64 (SE +/- 1225.91, N = 3)
EPYC 4564P @ DDR5-5200: 2571394.82 (SE +/- 2539.18, N = 3)
EPYC 4564P @ DDR5-4800: 2572055.58 (SE +/- 3334.21, N = 3)
Xeon E-2488 @ DDR5-4800: 1125583.01 (SE +/- 5501.16, N = 3)
EPYC 4364P @ DDR5-4800: 1376354.00 (SE +/- 1569.35, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Stress-NG 0.16.04, Test: x86_64 RdRand (Bogo Ops/s, More Is Better)
EPYC 4364P @ DDR5-5200: 4374980.16 (SE +/- 788.85, N = 3)
EPYC 4564P @ DDR5-5200: 8469095.71 (SE +/- 6736.08, N = 3)
EPYC 4564P @ DDR5-4800: 8497821.55 (SE +/- 980.70, N = 3)
Xeon E-2488 @ DDR5-4800: 81538.34 (SE +/- 7.65, N = 3)
EPYC 4364P @ DDR5-4800: 4400145.01 (SE +/- 1009.93, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -lEGL -ljpeg -lmpfr -lpthread -lrt -lsctp -lz
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: LBC, LBRY Credits (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 8389.15 (SE +/- 7.26, N = 3)
EPYC 4564P @ DDR5-5200: 16120.00 (SE +/- 5.77, N = 3)
EPYC 4564P @ DDR5-4800: 16317.00 (SE +/- 47.02, N = 3)
Xeon E-2488 @ DDR5-4800: 6037.26 (SE +/- 15.18, N = 3)
EPYC 4364P @ DDR5-4800: 8437.93 (SE +/- 4.08, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: Skeincoin (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 17983 (SE +/- 39.30, N = 3)
EPYC 4564P @ DDR5-5200: 34640 (SE +/- 11.55, N = 3)
EPYC 4564P @ DDR5-4800: 34753 (SE +/- 17.64, N = 3)
Xeon E-2488 @ DDR5-4800: 13933 (SE +/- 139.08, N = 3)
EPYC 4364P @ DDR5-4800: 18060 (SE +/- 5.77, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: Triple SHA-256, Onecoin (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 51087 (SE +/- 8.82, N = 3)
EPYC 4564P @ DDR5-5200: 97340 (SE +/- 146.40, N = 3)
EPYC 4564P @ DDR5-4800: 97753 (SE +/- 73.33, N = 3)
Xeon E-2488 @ DDR5-4800: 38847 (SE +/- 3.33, N = 3)
EPYC 4364P @ DDR5-4800: 51370 (SE +/- 5.77, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: Quad SHA-256, Pyrite (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 31903 (SE +/- 78.81, N = 3)
EPYC 4564P @ DDR5-5200: 61360 (SE +/- 331.71, N = 3)
EPYC 4564P @ DDR5-4800: 61347 (SE +/- 297.56, N = 3)
Xeon E-2488 @ DDR5-4800: 26527 (SE +/- 3.33, N = 3)
EPYC 4364P @ DDR5-4800: 32010 (SE +/- 5.77, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org: Cpuminer-Opt 23.5, Algorithm: Blake-2 S (kH/s, More Is Better)
EPYC 4364P @ DDR5-5200: 74513 (SE +/- 29.06, N = 3)
EPYC 4564P @ DDR5-5200: 142753 (SE +/- 99.39, N = 3)
EPYC 4564P @ DDR5-4800: 143480 (SE +/- 125.03, N = 3)
Xeon E-2488 @ DDR5-4800: 56160 (SE +/- 460.90, N = 3)
EPYC 4364P @ DDR5-4800: 74883 (SE +/- 74.24, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: srsRAN Project 23.10.1-20240325, Test: PUSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
EPYC 4364P @ DDR5-5200: 1470.5 (SE +/- 0.10, N = 6; MIN: 849.1 / MAX: 1470.7; -mavx512f -mavx512cd -mavx512bw -mavx512dq)
EPYC 4564P @ DDR5-5200: 1611.5 (SE +/- 0.05, N = 4; MIN: 1019.4 / MAX: 1611.6; -mavx512f -mavx512cd -mavx512bw -mavx512dq)
EPYC 4564P @ DDR5-4800: 1611.5 (SE +/- 0.04, N = 4; MIN: 1019.3 / MAX: 1611.6; -mavx512f -mavx512cd -mavx512bw -mavx512dq)
Xeon E-2488 @ DDR5-4800: 1068.9 (SE +/- 9.26, N = 15; MIN: 637.2 / MAX: 1111.5)
EPYC 4364P @ DDR5-4800: 1517.4 (SE +/- 17.52, N = 15; MIN: 849.7 / MAX: 1610.7; -mavx512f -mavx512cd -mavx512bw -mavx512dq)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -fno-trapping-math -fno-math-errno -ldl
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: WebP Image Encode 1.2.4, Encode Settings: Quality 100, Lossless, Highest Compression (MP/s, More Is Better)
EPYC 4364P @ DDR5-5200: 0.84 (SE +/- 0.00, N = 3)
EPYC 4564P @ DDR5-5200: 0.82 (SE +/- 0.00, N = 3)
EPYC 4564P @ DDR5-4800: 0.82 (SE +/- 0.01, N = 3)
Xeon E-2488 @ DDR5-4800: 0.88 (SE +/- 0.01, N = 3)
EPYC 4364P @ DDR5-4800: 0.84 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format, benchmarked here with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: SVT-AV1 2.0, Encoder Mode: Preset 4 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 5.742 (SE +/- 0.016, N = 3)
EPYC 4564P @ DDR5-5200: 7.838 (SE +/- 0.009, N = 3)
EPYC 4564P @ DDR5-4800: 7.831 (SE +/- 0.016, N = 3)
Xeon E-2488 @ DDR5-4800: 4.348 (SE +/- 0.024, N = 3)
EPYC 4364P @ DDR5-4800: 5.711 (SE +/- 0.014, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Llamafile Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Llamafile 0.6, Test: llava-v1.5-7b-q4 - Acceleration: CPU (Tokens Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 10.48 (SE +/- 0.01, N = 3)
EPYC 4564P @ DDR5-5200: 11.37 (SE +/- 0.02, N = 3)
EPYC 4564P @ DDR5-4800: 10.90 (SE +/- 0.01, N = 3)
Xeon E-2488 @ DDR5-4800: 13.32 (SE +/- 0.01, N = 3)
EPYC 4364P @ DDR5-4800: 10.57 (SE +/- 0.01, N = 3)
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
EPYC 4364P @ DDR5-5200: 11152.31 (SE +/- 12.84, N = 3)
EPYC 4564P @ DDR5-5200: 18660.90 (SE +/- 253.63, N = 3)
EPYC 4564P @ DDR5-4800: 18360.82 (SE +/- 211.58, N = 3)
Xeon E-2488 @ DDR5-4800: 12750.97 (SE +/- 44.80, N = 3)
EPYC 4364P @ DDR5-4800: 10661.66 (SE +/- 48.58, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
OpenBenchmarking.org: SVT-AV1 2.0, Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 122.91 (SE +/- 2.01, N = 15)
EPYC 4564P @ DDR5-5200: 165.29 (SE +/- 4.27, N = 15)
EPYC 4564P @ DDR5-4800: 160.65 (SE +/- 4.13, N = 15)
Xeon E-2488 @ DDR5-4800: 104.61 (SE +/- 0.99, N = 15)
EPYC 4364P @ DDR5-4800: 121.44 (SE +/- 1.96, N = 15)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Y-Cruncher Y-Cruncher is a multi-threaded Pi benchmark capable of computing Pi to trillions of digits. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Y-Cruncher 0.8.3, Pi Digits To Calculate: 1B (Seconds, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 25.46 (SE +/- 0.05, N = 3)
EPYC 4564P @ DDR5-5200: 18.21 (SE +/- 0.01, N = 3)
EPYC 4564P @ DDR5-4800: 18.80 (SE +/- 0.04, N = 3)
Xeon E-2488 @ DDR5-4800: 33.89 (SE +/- 0.21, N = 3)
EPYC 4364P @ DDR5-4800: 25.24 (SE +/- 0.05, N = 3)
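The results on this page mix "More Is Better" and "Fewer Is Better" metrics, so the direction of the ratio matters when comparing two systems. The following is a minimal sketch of that arithmetic using the Y-Cruncher 1B timings above; the helper name `relative_gain` is hypothetical and not part of any benchmarking tool.

```python
def relative_gain(candidate, baseline, higher_is_better=True):
    """Return the candidate's advantage over the baseline as a ratio.

    A value above 1.0 means the candidate is ahead. For "Fewer Is Better"
    metrics (such as seconds) the ratio is inverted so that a shorter
    time still yields a value above 1.0.
    """
    if candidate <= 0 or baseline <= 0:
        raise ValueError("results must be positive")
    if higher_is_better:
        return candidate / baseline
    return baseline / candidate


# Y-Cruncher 1B, seconds (Fewer Is Better), values from the result above.
epyc_4564p = 18.21   # EPYC 4564P @ DDR5-5200
epyc_4364p = 25.46   # EPYC 4364P @ DDR5-5200
xeon_e2488 = 33.89   # Xeon E-2488 @ DDR5-4800

vs_4364p = relative_gain(epyc_4564p, epyc_4364p, higher_is_better=False)
vs_xeon = relative_gain(epyc_4564p, xeon_e2488, higher_is_better=False)
print(f"EPYC 4564P vs EPYC 4364P: {vs_4364p:.2f}x")
print(f"EPYC 4564P vs Xeon E-2488: {vs_xeon:.2f}x")
```

The same helper applies unchanged to throughput results by leaving `higher_is_better` at its default.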
JPEG-XL libjxl The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: JPEG-XL libjxl 0.10.1, Input: JPEG - Quality: 100 (MP/s, More Is Better)
EPYC 4364P @ DDR5-5200: 16.52 (SE +/- 0.01, N = 3)
EPYC 4564P @ DDR5-5200: 27.22 (SE +/- 0.01, N = 3)
EPYC 4564P @ DDR5-4800: 26.96 (SE +/- 0.01, N = 3)
Xeon E-2488 @ DDR5-4800: 13.48 (SE +/- 0.04, N = 3)
EPYC 4364P @ DDR5-4800: 16.58 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
OpenBenchmarking.org: JPEG-XL libjxl 0.10.1, Input: PNG - Quality: 100 (MP/s, More Is Better)
EPYC 4364P @ DDR5-5200: 16.66 (SE +/- 0.02, N = 3)
EPYC 4564P @ DDR5-5200: 27.75 (SE +/- 0.04, N = 3)
EPYC 4564P @ DDR5-4800: 27.56 (SE +/- 0.04, N = 3)
Xeon E-2488 @ DDR5-4800: 13.65 (SE +/- 0.07, N = 3)
EPYC 4364P @ DDR5-4800: 16.69 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: DaCapo Benchmark 23.11, Java Test: Tradesoap (msec, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 1979 (SE +/- 20.46, N = 5)
EPYC 4564P @ DDR5-5200: 2474 (SE +/- 17.19, N = 4)
EPYC 4564P @ DDR5-4800: 2439 (SE +/- 16.84, N = 13)
Xeon E-2488 @ DDR5-4800: 1914 (SE +/- 16.67, N = 4)
EPYC 4364P @ DDR5-4800: 1964 (SE +/- 7.41, N = 4)
Timed Mesa Compilation This test profile times how long it takes to compile Mesa with Meson/Ninja. To minimize build dependencies and avoid versioning conflicts, this test is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Timed Mesa Compilation 24.0, Time To Compile (Seconds, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 28.26 (SE +/- 0.11, N = 3)
EPYC 4564P @ DDR5-5200: 17.46 (SE +/- 0.21, N = 3)
EPYC 4564P @ DDR5-4800: 17.73 (SE +/- 0.06, N = 3)
Xeon E-2488 @ DDR5-4800: 32.52 (SE +/- 0.07, N = 3)
EPYC 4364P @ DDR5-4800: 28.20 (SE +/- 0.02, N = 3)
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: uvg266 0.4.1, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 25.25 (SE +/- 0.05, N = 3)
EPYC 4564P @ DDR5-5200: 42.60 (SE +/- 0.08, N = 4)
EPYC 4564P @ DDR5-4800: 42.20 (SE +/- 0.07, N = 4)
Xeon E-2488 @ DDR5-4800: 18.97 (SE +/- 0.06, N = 3)
EPYC 4364P @ DDR5-4800: 25.32 (SE +/- 0.03, N = 3)
OpenBenchmarking.org: DaCapo Benchmark 23.11, Java Test: H2 Database Engine (msec, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 1977 (SE +/- 7.51, N = 3)
EPYC 4564P @ DDR5-5200: 1826 (SE +/- 13.96, N = 3)
EPYC 4564P @ DDR5-4800: 1872 (SE +/- 9.17, N = 3)
Xeon E-2488 @ DDR5-4800: 2128 (SE +/- 23.84, N = 3)
EPYC 4364P @ DDR5-4800: 2000 (SE +/- 12.68, N = 3)
OpenBenchmarking.org: DaCapo Benchmark 23.11, Java Test: Apache Lucene Search Engine (msec, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 2409 (SE +/- 18.64, N = 5)
EPYC 4564P @ DDR5-5200: 2080 (SE +/- 9.53, N = 5)
EPYC 4564P @ DDR5-4800: 2083 (SE +/- 9.20, N = 5)
Xeon E-2488 @ DDR5-4800: 3113 (SE +/- 50.09, N = 15)
EPYC 4364P @ DDR5-4800: 2404 (SE +/- 17.06, N = 5)
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Kvazaar 2.2, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 52.30 (SE +/- 0.04, N = 4)
EPYC 4564P @ DDR5-5200: 81.23 (SE +/- 0.07, N = 6)
EPYC 4564P @ DDR5-4800: 80.50 (SE +/- 0.08, N = 6)
Xeon E-2488 @ DDR5-4800: 39.59 (SE +/- 0.30, N = 10)
EPYC 4364P @ DDR5-4800: 52.25 (SE +/- 0.05, N = 4)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org: NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
EPYC 4364P @ DDR5-5200: 1169.68 (SE +/- 41.08, N = 15)
EPYC 4564P @ DDR5-5200: 2123.37 (SE +/- 13.01, N = 8)
EPYC 4564P @ DDR5-4800: 2117.29 (SE +/- 18.71, N = 8)
Xeon E-2488 @ DDR5-4800: 1470.81 (SE +/- 55.41, N = 15)
EPYC 4364P @ DDR5-4800: 1101.23 (SE +/- 4.34, N = 6)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: ASTC Encoder 4.7, Preset: Fast (MT/s, More Is Better)
EPYC 4364P @ DDR5-5200: 203.10 (SE +/- 0.29, N = 3)
EPYC 4564P @ DDR5-5200: 392.24 (SE +/- 0.54, N = 4)
EPYC 4564P @ DDR5-4800: 388.63 (SE +/- 0.65, N = 4)
Xeon E-2488 @ DDR5-4800: 205.93 (SE +/- 0.04, N = 3)
EPYC 4364P @ DDR5-4800: 204.84 (SE +/- 0.35, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org: TensorFlow 2.16.1, Device: CPU - Batch Size: 32 - Model: AlexNet (images/sec, More Is Better)
EPYC 4364P @ DDR5-5200: 172.89 (SE +/- 0.23, N = 3)
EPYC 4564P @ DDR5-5200: 242.49 (SE +/- 0.14, N = 4)
EPYC 4564P @ DDR5-4800: 236.71 (SE +/- 0.14, N = 4)
Xeon E-2488 @ DDR5-4800: 160.52 (SE +/- 1.37, N = 3)
EPYC 4364P @ DDR5-4800: 171.11 (SE +/- 0.05, N = 3)
OpenBenchmarking.org: uvg266 0.4.1, Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 26.65 (SE +/- 0.05, N = 3)
EPYC 4564P @ DDR5-5200: 46.41 (SE +/- 0.07, N = 4)
EPYC 4564P @ DDR5-4800: 45.87 (SE +/- 0.05, N = 4)
Xeon E-2488 @ DDR5-4800: 20.17 (SE +/- 0.08, N = 3)
EPYC 4364P @ DDR5-4800: 26.73 (SE +/- 0.02, N = 3)
LULESH LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: LULESH 2.0.3 (z/s, More Is Better)
EPYC 4364P @ DDR5-5200: 6744.86 (SE +/- 55.23, N = 6)
EPYC 4564P @ DDR5-5200: 8106.15 (SE +/- 57.24, N = 12)
EPYC 4564P @ DDR5-4800: 8006.35 (SE +/- 15.60, N = 6)
Xeon E-2488 @ DDR5-4800: 8126.32 (SE +/- 210.25, N = 15)
EPYC 4364P @ DDR5-4800: 6454.67 (SE +/- 39.33, N = 5)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
OpenBenchmarking.org: DaCapo Benchmark 23.11, Java Test: jMonkeyEngine (msec, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 6802 (SE +/- 0.33, N = 3)
EPYC 4564P @ DDR5-5200: 6806 (SE +/- 0.67, N = 3)
EPYC 4564P @ DDR5-4800: 6808 (SE +/- 1.20, N = 3)
Xeon E-2488 @ DDR5-4800: 6827 (SE +/- 4.67, N = 3)
EPYC 4364P @ DDR5-4800: 6804 (SE +/- 0.88, N = 3)
oneDNN This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: oneDNN 3.4, Harness: Deconvolution Batch shapes_1d - Engine: CPU (ms, Fewer Is Better)
EPYC 4364P @ DDR5-5200: 3.91089 (SE +/- 0.00421, N = 3; MIN: 3.5)
EPYC 4564P @ DDR5-5200: 3.00636 (SE +/- 0.01295, N = 3; MIN: 2.4)
EPYC 4564P @ DDR5-4800: 2.98654 (SE +/- 0.00488, N = 3; MIN: 2.41)
Xeon E-2488 @ DDR5-4800: 6.85318 (SE +/- 0.02381, N = 3; MIN: 3.93)
EPYC 4364P @ DDR5-4800: 3.87665 (SE +/- 0.01491, N = 3; MIN: 3.48)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: 7-Zip Compression 22.01, Test: Decompression Rating (MIPS, More Is Better)
EPYC 4364P @ DDR5-5200: 88945 (SE +/- 299.70, N = 3)
EPYC 4564P @ DDR5-5200: 170373 (SE +/- 369.09, N = 3)
EPYC 4564P @ DDR5-4800: 168957 (SE +/- 110.28, N = 3)
Xeon E-2488 @ DDR5-4800: 61527 (SE +/- 900.25, N = 3)
EPYC 4364P @ DDR5-4800: 87804 (SE +/- 835.95, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
OpenBenchmarking.org: 7-Zip Compression 22.01, Test: Compression Rating (MIPS, More Is Better)
EPYC 4364P @ DDR5-5200: 103571 (SE +/- 195.02, N = 3)
EPYC 4564P @ DDR5-5200: 170888 (SE +/- 313.05, N = 3)
EPYC 4564P @ DDR5-4800: 167186 (SE +/- 227.16, N = 3)
Xeon E-2488 @ DDR5-4800: 94990 (SE +/- 436.66, N = 3)
EPYC 4364P @ DDR5-4800: 101849 (SE +/- 118.34, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: SVT-VP9 0.3, Tuning: VMAF Optimized - Input: Bosphorus 4K (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 71.48 (SE +/- 0.09, N = 5)
EPYC 4564P @ DDR5-5200: 103.52 (SE +/- 0.15, N = 6)
EPYC 4564P @ DDR5-4800: 99.84 (SE +/- 0.05, N = 6)
Xeon E-2488 @ DDR5-4800: 79.85 (SE +/- 0.46, N = 15)
EPYC 4364P @ DDR5-4800: 70.29 (SE +/- 0.09, N = 5)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org: Kvazaar 2.2, Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
EPYC 4364P @ DDR5-5200: 38.82 (SE +/- 0.05, N = 4)
EPYC 4564P @ DDR5-5200: 62.56 (SE +/- 0.07, N = 5)
EPYC 4564P @ DDR5-4800: 62.23 (SE +/- 0.07, N = 5)
Xeon E-2488 @ DDR5-4800: 28.64 (SE +/- 0.31, N = 4)
EPYC 4364P @ DDR5-4800: 38.85 (SE +/- 0.02, N = 4)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Spring Boot
EPYC 4364P @ DDR5-5200: 1162 (SE +/- 6.93, N = 5)
EPYC 4564P @ DDR5-5200: 1554 (SE +/- 8.13, N = 5)
EPYC 4564P @ DDR5-4800: 1554 (SE +/- 10.62, N = 13)
Xeon E-2488 @ DDR5-4800: 1220 (SE +/- 6.95, N = 5)
EPYC 4364P @ DDR5-4800: 1159 (SE +/- 7.11, N = 5)
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: IP Shapes 1D - Engine: CPU
EPYC 4364P @ DDR5-5200: 1.99986 (SE +/- 0.00536, N = 4; MIN: 1.85)
EPYC 4564P @ DDR5-5200: 1.13000 (SE +/- 0.00269, N = 4; MIN: 1.03)
EPYC 4564P @ DDR5-4800: 1.13203 (SE +/- 0.00498, N = 4; MIN: 1.03)
Xeon E-2488 @ DDR5-4800: 2.91408 (SE +/- 0.01544, N = 4; MIN: 2.51)
EPYC 4364P @ DDR5-4800: 1.98773 (SE +/- 0.00306, N = 4; MIN: 1.84)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Z3 Theorem Prover The Z3 Theorem Prover / SMT solver is developed by Microsoft Research under the MIT license. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Z3 Theorem Prover 4.12.1 SMT File: 1.smt2
EPYC 4364P @ DDR5-5200: 20.43 (SE +/- 0.11, N = 3)
EPYC 4564P @ DDR5-5200: 20.33 (SE +/- 0.15, N = 3)
EPYC 4564P @ DDR5-4800: 20.78 (SE +/- 0.16, N = 3)
Xeon E-2488 @ DDR5-4800: 18.66 (SE +/- 0.08, N = 3)
EPYC 4364P @ DDR5-4800: 20.50 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and this test uses its built-in benchmark, which reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Single-Threaded
EPYC 4364P @ DDR5-5200: 4457.8 (SE +/- 17.50, N = 3)
EPYC 4564P @ DDR5-5200: 4425.2 (SE +/- 46.95, N = 3)
EPYC 4564P @ DDR5-4800: 4425.0 (SE +/- 56.37, N = 3)
Xeon E-2488 @ DDR5-4800: 5019.2 (SE +/- 34.53, N = 4)
EPYC 4364P @ DDR5-4800: 4518.4 (SE +/- 43.08, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast
EPYC 4364P @ DDR5-5200: 31.86 (SE +/- 0.04, N = 3)
EPYC 4564P @ DDR5-5200: 54.54 (SE +/- 0.08, N = 5)
EPYC 4564P @ DDR5-4800: 54.07 (SE +/- 0.03, N = 5)
Xeon E-2488 @ DDR5-4800: 24.69 (SE +/- 0.01, N = 3)
EPYC 4364P @ DDR5-4800: 32.09 (SE +/- 0.06, N = 3)
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Apache Tomcat
EPYC 4364P @ DDR5-5200: 6482 (SE +/- 9.21, N = 3)
EPYC 4564P @ DDR5-5200: 3370 (SE +/- 2.95, N = 4)
EPYC 4564P @ DDR5-4800: 3367 (SE +/- 4.40, N = 4)
Xeon E-2488 @ DDR5-4800: 6687 (SE +/- 14.71, N = 3)
EPYC 4364P @ DDR5-4800: 6467 (SE +/- 2.73, N = 3)
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast
EPYC 4364P @ DDR5-5200: 31.04 (SE +/- 0.01, N = 3)
EPYC 4564P @ DDR5-5200: 48.11 (SE +/- 0.09, N = 4)
EPYC 4564P @ DDR5-4800: 47.67 (SE +/- 0.06, N = 4)
Xeon E-2488 @ DDR5-4800: 23.05 (SE +/- 0.19, N = 3)
EPYC 4364P @ DDR5-4800: 31.05 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2
EPYC 4364P @ DDR5-5200: 352361775 (SE +/- 445215.93, N = 4)
EPYC 4564P @ DDR5-5200: 378697000 (SE +/- 143363.91, N = 3)
EPYC 4564P @ DDR5-4800: 362574433 (SE +/- 201300.38, N = 3)
Xeon E-2488 @ DDR5-4800: 453703880 (SE +/- 269181.94, N = 5)
EPYC 4364P @ DDR5-4800: 350656750 (SE +/- 333950.91, N = 4)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Tradebeans
EPYC 4364P @ DDR5-5200: 2815 (SE +/- 22.67, N = 4)
EPYC 4564P @ DDR5-5200: 3329 (SE +/- 21.46, N = 4)
EPYC 4564P @ DDR5-4800: 3359 (SE +/- 38.01, N = 4)
Xeon E-2488 @ DDR5-4800: 3410 (SE +/- 14.57, N = 4)
EPYC 4364P @ DDR5-4800: 2775 (SE +/- 31.95, N = 4)
Parallel BZIP2 Compression This test measures the time needed to compress a file (FreeBSD-13.0-RELEASE-amd64-memstick.img) using Parallel BZIP2 compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.1.13 FreeBSD-13.0-RELEASE-amd64-memstick.img Compression
EPYC 4364P @ DDR5-5200: 6.154460 (SE +/- 0.045809, N = 15)
EPYC 4564P @ DDR5-5200: 3.299879 (SE +/- 0.025811, N = 10)
EPYC 4564P @ DDR5-4800: 3.325543 (SE +/- 0.023270, N = 9)
Xeon E-2488 @ DDR5-4800: 6.612486 (SE +/- 0.044479, N = 6)
EPYC 4364P @ DDR5-4800: 6.176853 (SE +/- 0.045334, N = 15)
1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: PMD Source Code Analyzer
EPYC 4364P @ DDR5-5200: 1113 (SE +/- 9.55, N = 7)
EPYC 4564P @ DDR5-5200: 1053 (SE +/- 5.68, N = 7)
EPYC 4564P @ DDR5-4800: 1074 (SE +/- 10.09, N = 7)
Xeon E-2488 @ DDR5-4800: 1122 (SE +/- 8.96, N = 15)
EPYC 4364P @ DDR5-4800: 1142 (SE +/- 9.22, N = 15)
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU
EPYC 4364P @ DDR5-5200: 11.93 (SE +/- 0.04, N = 4; MIN: 11.1 / MAX: 12.12)
EPYC 4564P @ DDR5-5200: 16.98 (SE +/- 0.06, N = 5; MIN: 15.46 / MAX: 17.3)
EPYC 4564P @ DDR5-4800: 16.94 (SE +/- 0.11, N = 5; MIN: 15.4 / MAX: 17.31)
Xeon E-2488 @ DDR5-4800: 9.35 (SE +/- 0.03, N = 3; MIN: 8.52 / MAX: 10.72)
EPYC 4364P @ DDR5-4800: 11.89 (SE +/- 0.01, N = 4; MIN: 11.05 / MAX: 12)
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: BioJava Biological Data Framework
EPYC 4364P @ DDR5-5200: 4327 (SE +/- 20.56, N = 4)
EPYC 4564P @ DDR5-5200: 4425 (SE +/- 33.62, N = 4)
EPYC 4564P @ DDR5-4800: 4413 (SE +/- 15.83, N = 4)
Xeon E-2488 @ DDR5-4800: 3708 (SE +/- 25.22, N = 4)
EPYC 4364P @ DDR5-4800: 4347 (SE +/- 19.50, N = 4)
Y-Cruncher Y-Cruncher is a multi-threaded Pi benchmark capable of computing Pi to trillions of digits. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.3 Pi Digits To Calculate: 500M
EPYC 4364P @ DDR5-5200: 11.589 (SE +/- 0.036, N = 4)
EPYC 4564P @ DDR5-5200: 8.652 (SE +/- 0.010, N = 5)
EPYC 4564P @ DDR5-4800: 8.920 (SE +/- 0.013, N = 5)
Xeon E-2488 @ DDR5-4800: 14.926 (SE +/- 0.124, N = 4)
EPYC 4364P @ DDR5-4800: 11.555 (SE +/- 0.040, N = 4)
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C
EPYC 4364P @ DDR5-5200: 20265.52 (SE +/- 213.35, N = 5)
EPYC 4564P @ DDR5-5200: 20961.35 (SE +/- 161.84, N = 5)
EPYC 4564P @ DDR5-4800: 20223.60 (SE +/- 94.12, N = 5)
Xeon E-2488 @ DDR5-4800: 24688.58 (SE +/- 532.70, N = 12)
EPYC 4364P @ DDR5-4800: 19566.67 (SE +/- 158.41, N = 5)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster
EPYC 4364P @ DDR5-5200: 45.14 (SE +/- 0.04, N = 4)
EPYC 4564P @ DDR5-5200: 50.91 (SE +/- 0.11, N = 4)
EPYC 4564P @ DDR5-4800: 50.41 (SE +/- 0.09, N = 4)
Xeon E-2488 @ DDR5-4800: 40.07 (SE +/- 0.25, N = 4)
EPYC 4364P @ DDR5-4800: 44.58 (SE +/- 0.05, N = 4)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Dolfyn Dolfyn is a Computational Fluid Dynamics (CFD) code built on modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the computational fluid dynamics demos bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics
EPYC 4364P @ DDR5-5200: 10.73 (SE +/- 0.02, N = 5)
EPYC 4564P @ DDR5-5200: 10.96 (SE +/- 0.11, N = 5)
EPYC 4564P @ DDR5-4800: 10.93 (SE +/- 0.06, N = 5)
Xeon E-2488 @ DDR5-4800: 10.42 (SE +/- 0.03, N = 5)
EPYC 4364P @ DDR5-4800: 10.63 (SE +/- 0.04, N = 5)
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Jython
EPYC 4364P @ DDR5-5200: 3762 (SE +/- 16.55, N = 4)
EPYC 4564P @ DDR5-5200: 3871 (SE +/- 34.16, N = 4)
EPYC 4564P @ DDR5-4800: 3795 (SE +/- 15.77, N = 4)
Xeon E-2488 @ DDR5-4800: 2433 (SE +/- 14.73, N = 5)
EPYC 4364P @ DDR5-4800: 3790 (SE +/- 14.78, N = 4)
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Medium
EPYC 4364P @ DDR5-5200: 45.63 (SE +/- 0.03, N = 4)
EPYC 4564P @ DDR5-5200: 64.16 (SE +/- 0.07, N = 5)
EPYC 4564P @ DDR5-4800: 63.89 (SE +/- 0.05, N = 5)
Xeon E-2488 @ DDR5-4800: 36.59 (SE +/- 0.14, N = 4)
EPYC 4364P @ DDR5-4800: 45.75 (SE +/- 0.07, N = 4)
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.7 Preset: Medium
EPYC 4364P @ DDR5-5200: 79.60 (SE +/- 0.04, N = 4)
EPYC 4564P @ DDR5-5200: 154.72 (SE +/- 0.18, N = 5)
EPYC 4564P @ DDR5-4800: 153.70 (SE +/- 0.09, N = 5)
Xeon E-2488 @ DDR5-4800: 89.14 (SE +/- 0.00, N = 3)
EPYC 4364P @ DDR5-4800: 80.53 (SE +/- 0.05, N = 4)
1. (CXX) g++ options: -O3 -flto -pthread
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Apache Lucene Search Index
EPYC 4364P @ DDR5-5200: 2390 (SE +/- 18.27, N = 3)
EPYC 4564P @ DDR5-5200: 2471 (SE +/- 12.68, N = 3)
EPYC 4564P @ DDR5-4800: 2517 (SE +/- 11.06, N = 3)
Xeon E-2488 @ DDR5-4800: 2296 (SE +/- 19.22, N = 3)
EPYC 4364P @ DDR5-4800: 2407 (SE +/- 17.23, N = 3)
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless
EPYC 4364P @ DDR5-5200: 2.17 (SE +/- 0.01, N = 5)
EPYC 4564P @ DDR5-5200: 2.09 (SE +/- 0.00, N = 4)
EPYC 4564P @ DDR5-4800: 2.08 (SE +/- 0.01, N = 4)
Xeon E-2488 @ DDR5-4800: 2.17 (SE +/- 0.01, N = 5)
EPYC 4364P @ DDR5-4800: 2.16 (SE +/- 0.00, N = 5)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 89.12 (SE +/- 0.53, N = 6)
EPYC 4564P @ DDR5-5200: 118.06 (SE +/- 0.21, N = 7)
EPYC 4564P @ DDR5-4800: 117.25 (SE +/- 0.13, N = 7)
Xeon E-2488 @ DDR5-4800: 84.33 (SE +/- 0.77, N = 15)
EPYC 4364P @ DDR5-4800: 89.23 (SE +/- 0.38, N = 6)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Apache Kafka
EPYC 4364P @ DDR5-5200: 5027 (SE +/- 29.67, N = 3)
EPYC 4564P @ DDR5-5200: 5055 (SE +/- 32.17, N = 3)
EPYC 4564P @ DDR5-4800: 5030 (SE +/- 27.06, N = 3)
Xeon E-2488 @ DDR5-4800: 4984 (SE +/- 2.00, N = 3)
EPYC 4364P @ DDR5-4800: 5031 (SE +/- 29.48, N = 3)
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K
EPYC 4364P @ DDR5-5200: 47.29 (SE +/- 0.10, N = 4)
EPYC 4564P @ DDR5-5200: 72.74 (SE +/- 0.15, N = 5)
EPYC 4564P @ DDR5-4800: 71.14 (SE +/- 0.25, N = 5)
Xeon E-2488 @ DDR5-4800: 36.14 (SE +/- 0.30, N = 3)
EPYC 4364P @ DDR5-4800: 46.84 (SE +/- 0.02, N = 4)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
easyWave The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240
EPYC 4364P @ DDR5-5200: 5.990 (SE +/- 0.030, N = 6)
EPYC 4564P @ DDR5-5200: 3.037 (SE +/- 0.010, N = 9)
EPYC 4564P @ DDR5-4800: 3.164 (SE +/- 0.008, N = 8)
Xeon E-2488 @ DDR5-4800: 6.278 (SE +/- 0.089, N = 15)
EPYC 4364P @ DDR5-4800: 6.166 (SE +/- 0.009, N = 6)
1. (CXX) g++ options: -O3 -fopenmp
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times
EPYC 4364P @ DDR5-5200: 527 (SE +/- 2.53, N = 4)
EPYC 4564P @ DDR5-5200: 533 (SE +/- 6.41, N = 4)
EPYC 4564P @ DDR5-4800: 536 (SE +/- 2.65, N = 4)
Xeon E-2488 @ DDR5-4800: 415 (SE +/- 2.82, N = 5)
EPYC 4364P @ DDR5-4800: 532 (SE +/- 1.38, N = 4)
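The "Total For Average Test Times" metric sums the average time of each micro-test. The sketch below illustrates that kind of aggregation with Python's standard `timeit` module. It is not PyBench itself: the two test functions only borrow the names mentioned in the description, their bodies are invented for illustration, and `total_of_averages` is a hypothetical helper.

```python
import timeit


def builtin_function_calls() -> None:
    # Stand-in micro-test: a handful of calls to built-in functions.
    len("benchmark")
    abs(-1)
    max(1, 2)


def nested_for_loops() -> None:
    # Stand-in micro-test: two nested loops with an empty body.
    for _ in range(10):
        for _ in range(10):
            pass


def total_of_averages(tests, rounds: int = 20, calls_per_round: int = 1000) -> float:
    """Return the sum over all tests of the average per-round time, in milliseconds."""
    total_ms = 0.0
    for test in tests:
        # timeit.repeat returns one elapsed time (seconds) per round.
        per_round = timeit.repeat(test, number=calls_per_round, repeat=rounds)
        total_ms += sum(per_round) / len(per_round) * 1000.0
    return total_ms


if __name__ == "__main__":
    total = total_of_averages([builtin_function_calls, nested_for_loops])
    print(f"Total for average test times: {total:.3f} ms")
```

The default of 20 rounds mirrors the round count stated in the test description; the number of calls per round is an arbitrary choice here.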
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K
EPYC 4364P @ DDR5-5200: 109.03 (SE +/- 0.12, N = 6)
EPYC 4564P @ DDR5-5200: 158.94 (SE +/- 0.14, N = 8)
EPYC 4564P @ DDR5-4800: 153.78 (SE +/- 0.15, N = 8)
Xeon E-2488 @ DDR5-4800: 101.83 (SE +/- 1.10, N = 15)
EPYC 4364P @ DDR5-4800: 107.65 (SE +/- 0.10, N = 6)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K
EPYC 4364P @ DDR5-5200: 76.06 (SE +/- 0.08, N = 5)
EPYC 4564P @ DDR5-5200: 111.01 (SE +/- 0.13, N = 6)
EPYC 4564P @ DDR5-4800: 107.37 (SE +/- 0.14, N = 6)
Xeon E-2488 @ DDR5-4800: 83.19 (SE +/- 0.64, N = 10)
EPYC 4364P @ DDR5-4800: 74.93 (SE +/- 0.14, N = 5)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K
EPYC 4364P @ DDR5-5200: 57.23 (SE +/- 0.04, N = 5)
EPYC 4564P @ DDR5-5200: 99.61 (SE +/- 0.06, N = 6)
EPYC 4564P @ DDR5-4800: 97.70 (SE +/- 0.08, N = 6)
Xeon E-2488 @ DDR5-4800: 47.98 (SE +/- 0.08, N = 4)
EPYC 4364P @ DDR5-4800: 56.95 (SE +/- 0.07, N = 5)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding
EPYC 4364P @ DDR5-5200: 3426.03 (SE +/- 27.18, N = 8)
EPYC 4564P @ DDR5-5200: 5172.44 (SE +/- 49.85, N = 6)
EPYC 4564P @ DDR5-4800: 5023.70 (SE +/- 0.00, N = 6)
Xeon E-2488 @ DDR5-4800: 4282.18 (SE +/- 41.03, N = 5)
EPYC 4364P @ DDR5-4800: 3429.45 (SE +/- 29.29, N = 6)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding
EPYC 4364P @ DDR5-5200: 2739.34 (SE +/- 24.04, N = 8)
EPYC 4564P @ DDR5-5200: 5073.23 (SE +/- 41.40, N = 6)
EPYC 4564P @ DDR5-4800: 4886.94 (SE +/- 38.41, N = 6)
Xeon E-2488 @ DDR5-4800: 2556.22 (SE +/- 25.17, N = 5)
EPYC 4364P @ DDR5-4800: 2703.36 (SE +/- 11.75, N = 6)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: ResNet-50
EPYC 4364P @ DDR5-5200: 17.47 (SE +/- 0.01, N = 5)
EPYC 4564P @ DDR5-5200: 14.93 (SE +/- 0.03, N = 5)
EPYC 4564P @ DDR5-4800: 14.78 (SE +/- 0.02, N = 5)
Xeon E-2488 @ DDR5-4800: 13.71 (SE +/- 0.01, N = 5)
EPYC 4364P @ DDR5-4800: 16.77 (SE +/- 0.04, N = 5)
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: GraphChi
EPYC 4364P @ DDR5-5200: 2561 (SE +/- 21.22, N = 4)
EPYC 4564P @ DDR5-5200: 2472 (SE +/- 14.73, N = 4)
EPYC 4564P @ DDR5-4800: 2521 (SE +/- 10.30, N = 4)
Xeon E-2488 @ DDR5-4800: 2675 (SE +/- 18.95, N = 4)
EPYC 4364P @ DDR5-4800: 2535 (SE +/- 18.46, N = 4)
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 12.1 Length: 1e12
EPYC 4364P @ DDR5-5200: 12.506 (SE +/- 0.007, N = 4)
EPYC 4564P @ DDR5-5200: 6.691 (SE +/- 0.007, N = 6)
EPYC 4564P @ DDR5-4800: 6.669 (SE +/- 0.008, N = 6)
Xeon E-2488 @ DDR5-4800: 21.095 (SE +/- 0.302, N = 3)
EPYC 4364P @ DDR5-4800: 12.416 (SE +/- 0.018, N = 4)
1. (CXX) g++ options: -O3
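For readers unfamiliar with the algorithm named in the Primesieve description, here is a minimal textbook sieve of Eratosthenes. It is only a sketch of the basic idea: Primesieve's own implementation is heavily optimized and is not reproduced here, and the function name `primes_up_to` is an assumption made for illustration.

```python
def primes_up_to(limit: int) -> list[int]:
    """Return all primes <= limit using a plain sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag byte per integer in [0, limit]; 1 means "still considered prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Strike out multiples of p, starting at p*p because smaller
            # multiples were already removed by smaller primes.
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytes(count)
        p += 1
    return [n for n, flag in enumerate(is_prime) if flag]


if __name__ == "__main__":
    print(primes_up_to(50))
```

A flat array like this touches memory far beyond the CPU caches for large limits, which is why production sieves work on small segments instead.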
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K
EPYC 4364P @ DDR5-5200: 125.28 (SE +/- 0.14, N = 7)
EPYC 4564P @ DDR5-5200: 170.21 (SE +/- 0.33, N = 8)
EPYC 4564P @ DDR5-4800: 164.85 (SE +/- 0.69, N = 8)
Xeon E-2488 @ DDR5-4800: 105.28 (SE +/- 1.00, N = 15)
EPYC 4364P @ DDR5-4800: 123.64 (SE +/- 0.15, N = 7)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Very Fast
EPYC 4364P @ DDR5-5200: 111.78 (SE +/- 0.14, N = 7)
EPYC 4564P @ DDR5-5200: 166.81 (SE +/- 0.11, N = 9)
EPYC 4564P @ DDR5-4800: 165.55 (SE +/- 0.14, N = 9)
Xeon E-2488 @ DDR5-4800: 89.31 (SE +/- 0.95, N = 15)
EPYC 4364P @ DDR5-4800: 112.05 (SE +/- 0.05, N = 7)
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: IP Shapes 3D - Engine: CPU
EPYC 4364P @ DDR5-5200: 4.67840 (SE +/- 0.00833, N = 5; MIN: 4.46)
EPYC 4564P @ DDR5-5200: 3.88685 (SE +/- 0.00611, N = 5; MIN: 3.57)
EPYC 4564P @ DDR5-4800: 4.07056 (SE +/- 0.00590, N = 5; MIN: 3.76)
Xeon E-2488 @ DDR5-4800: 7.59695 (SE +/- 0.01930, N = 5; MIN: 7.29)
EPYC 4364P @ DDR5-4800: 4.61211 (SE +/- 0.02020, N = 5; MIN: 4.4)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Avrora AVR Simulation Framework
EPYC 4364P @ DDR5-5200: 2415 (SE +/- 3.26, N = 6)
EPYC 4564P @ DDR5-5200: 2359 (SE +/- 20.96, N = 6)
EPYC 4564P @ DDR5-4800: 2393 (SE +/- 21.04, N = 8)
Xeon E-2488 @ DDR5-4800: 1451 (SE +/- 2.49, N = 8)
EPYC 4364P @ DDR5-4800: 2430 (SE +/- 3.46, N = 6)
libavif avifenc This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6
EPYC 4364P @ DDR5-5200: 5.338 (SE +/- 0.014, N = 7)
EPYC 4564P @ DDR5-5200: 3.421 (SE +/- 0.010, N = 9)
EPYC 4564P @ DDR5-4800: 3.447 (SE +/- 0.018, N = 9)
Xeon E-2488 @ DDR5-4800: 6.260 (SE +/- 0.088, N = 15)
EPYC 4364P @ DDR5-4800: 5.301 (SE +/- 0.007, N = 7)
1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless
EPYC 4364P @ DDR5-5200: 8.147 (SE +/- 0.021, N = 6)
EPYC 4564P @ DDR5-5200: 5.890 (SE +/- 0.044, N = 7)
EPYC 4564P @ DDR5-4800: 5.932 (SE +/- 0.018, N = 7)
Xeon E-2488 @ DDR5-4800: 8.969 (SE +/- 0.014, N = 5)
EPYC 4364P @ DDR5-4800: 8.140 (SE +/- 0.012, N = 6)
1. (CXX) g++ options: -O3 -fPIC -lm
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Super Fast
EPYC 4364P @ DDR5-5200: 119.83 (SE +/- 0.08, N = 7)
EPYC 4564P @ DDR5-5200: 181.83 (SE +/- 0.16, N = 9)
EPYC 4564P @ DDR5-4800: 180.54 (SE +/- 0.19, N = 9)
Xeon E-2488 @ DDR5-4800: 96.33 (SE +/- 0.97, N = 15)
EPYC 4364P @ DDR5-4800: 120.30 (SE +/- 0.07, N = 7)
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 4 - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 19.89 (SE +/- 0.07, N = 5)
EPYC 4564P @ DDR5-5200: 24.17 (SE +/- 0.13, N = 6)
EPYC 4564P @ DDR5-4800: 23.94 (SE +/- 0.05, N = 6)
Xeon E-2488 @ DDR5-4800: 15.92 (SE +/- 0.15, N = 5)
EPYC 4364P @ DDR5-4800: 19.96 (SE +/- 0.08, N = 5)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Medium
EPYC 4364P @ DDR5-5200: 64.68 (SE +/- 0.04, N = 5)
EPYC 4564P @ DDR5-5200: 88.30 (SE +/- 0.08, N = 6)
EPYC 4564P @ DDR5-4800: 87.51 (SE +/- 0.15, N = 6)
Xeon E-2488 @ DDR5-4800: 50.47 (SE +/- 0.37, N = 4)
EPYC 4364P @ DDR5-4800: 64.80 (SE +/- 0.04, N = 5)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: AlexNet
EPYC 4364P @ DDR5-5200: 15.01 (SE +/- 0.00, N = 5)
EPYC 4564P @ DDR5-5200: 14.96 (SE +/- 0.01, N = 5)
EPYC 4564P @ DDR5-4800: 14.37 (SE +/- 0.00, N = 5)
Xeon E-2488 @ DDR5-4800: 18.50 (SE +/- 0.01, N = 6)
EPYC 4364P @ DDR5-4800: 14.53 (SE +/- 0.01, N = 5)
RNNoise RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 0.2 Input: 26 Minute Long Talking Sample
EPYC 4364P @ DDR5-5200: 7.184 (SE +/- 0.050, N = 6)
EPYC 4564P @ DDR5-5200: 7.414 (SE +/- 0.033, N = 6)
EPYC 4564P @ DDR5-4800: 7.291 (SE +/- 0.063, N = 6)
Xeon E-2488 @ DDR5-4800: 6.400 (SE +/- 0.059, N = 7)
EPYC 4364P @ DDR5-4800: 7.028 (SE +/- 0.016, N = 6)
1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 8 - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 148.34 (SE +/- 0.13, N = 8)
EPYC 4564P @ DDR5-5200: 189.41 (SE +/- 0.29, N = 9)
EPYC 4564P @ DDR5-4800: 187.54 (SE +/- 0.22, N = 9)
Xeon E-2488 @ DDR5-4800: 120.53 (SE +/- 1.11, N = 15)
EPYC 4364P @ DDR5-4800: 147.31 (SE +/- 0.12, N = 8)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240325 Test: PDSCH Processor Benchmark, Throughput Total
EPYC 4364P @ DDR5-5200: 7530.0 (SE +/- 38.56, N = 7)
EPYC 4564P @ DDR5-5200: 12852.6 (SE +/- 51.53, N = 6)
EPYC 4564P @ DDR5-4800: 12647.5 (SE +/- 66.74, N = 6)
Xeon E-2488 @ DDR5-4800: 2908.5 (SE +/- 5.11, N = 5)
EPYC 4364P @ DDR5-4800: 7792.6 (SE +/- 8.66, N = 7)
Per-configuration options (shown for four of the five configurations): -mavx512f -mavx512cd -mavx512bw -mavx512dq
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -fno-trapping-math -fno-math-errno -ldl
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Convolution Batch Shapes Auto - Engine: CPU
EPYC 4364P @ DDR5-5200: 8.16800 (SE +/- 0.00717, N = 7, MIN: 7.7)
EPYC 4564P @ DDR5-5200: 6.40719 (SE +/- 0.01415, N = 7, MIN: 5.93)
EPYC 4564P @ DDR5-4800: 6.72536 (SE +/- 0.01136, N = 7, MIN: 6.2)
Xeon E-2488 @ DDR5-4800: 6.38053 (SE +/- 0.04485, N = 7, MIN: 5.92)
EPYC 4364P @ DDR5-4800: 8.23220 (SE +/- 0.00879, N = 7, MIN: 7.77)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Google Draco Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Church Facade
EPYC 4364P @ DDR5-5200: 5083 (SE +/- 10.00, N = 7)
EPYC 4564P @ DDR5-5200: 5224 (SE +/- 29.27, N = 6)
EPYC 4564P @ DDR5-4800: 5237 (SE +/- 31.00, N = 6)
Xeon E-2488 @ DDR5-4800: 5607 (SE +/- 17.70, N = 6)
EPYC 4364P @ DDR5-4800: 5163 (SE +/- 12.60, N = 6)
1. (CXX) g++ options: -O3
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
EPYC 4364P @ DDR5-5200: 142.97 (SE +/- 0.06, N = 8)
EPYC 4564P @ DDR5-5200: 203.39 (SE +/- 0.20, N = 9)
EPYC 4564P @ DDR5-4800: 201.92 (SE +/- 0.15, N = 9)
Xeon E-2488 @ DDR5-4800: 115.40 (SE +/- 1.21, N = 15)
EPYC 4364P @ DDR5-4800: 143.28 (SE +/- 0.14, N = 8)
GNU Octave Benchmark This test profile measures how long it takes to complete several reference GNU Octave files via octave-benchmark. GNU Octave is used for numerical computations and is an open-source alternative to MATLAB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GNU Octave Benchmark 8.4.0
EPYC 4364P @ DDR5-5200: 5.587 (SE +/- 0.017, N = 7)
EPYC 4564P @ DDR5-5200: 5.780 (SE +/- 0.021, N = 7)
EPYC 4564P @ DDR5-4800: 5.754 (SE +/- 0.043, N = 7)
Xeon E-2488 @ DDR5-4800: 6.462 (SE +/- 0.011, N = 6)
EPYC 4364P @ DDR5-4800: 5.668 (SE +/- 0.016, N = 7)
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: FOP Print Formatter
EPYC 4364P @ DDR5-5200: 414 (SE +/- 5.35, N = 15)
EPYC 4564P @ DDR5-5200: 399 (SE +/- 5.40, N = 15)
EPYC 4564P @ DDR5-4800: 403 (SE +/- 3.38, N = 15)
Xeon E-2488 @ DDR5-4800: 426 (SE +/- 6.07, N = 15)
EPYC 4364P @ DDR5-4800: 423 (SE +/- 3.32, N = 15)
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression
EPYC 4364P @ DDR5-5200: 5.28 (SE +/- 0.03, N = 8)
EPYC 4564P @ DDR5-5200: 5.10 (SE +/- 0.01, N = 8)
EPYC 4564P @ DDR5-4800: 5.15 (SE +/- 0.03, N = 8)
Xeon E-2488 @ DDR5-4800: 4.73 (SE +/- 0.02, N = 7)
EPYC 4364P @ DDR5-4800: 5.30 (SE +/- 0.03, N = 8)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm
libavif avifenc This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless
EPYC 4364P @ DDR5-5200: 4.205 (SE +/- 0.025, N = 8)
EPYC 4564P @ DDR5-5200: 3.841 (SE +/- 0.038, N = 15)
EPYC 4564P @ DDR5-4800: 3.804 (SE +/- 0.020, N = 8)
Xeon E-2488 @ DDR5-4800: 4.162 (SE +/- 0.011, N = 8)
EPYC 4364P @ DDR5-4800: 4.202 (SE +/- 0.009, N = 8)
1. (CXX) g++ options: -O3 -fPIC -lm
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Zxing 1D/2D Barcode Image Processing
EPYC 4364P @ DDR5-5200: 1019 (SE +/- 2.37, N = 8)
EPYC 4564P @ DDR5-5200: 589 (SE +/- 4.09, N = 9)
EPYC 4564P @ DDR5-4800: 597 (SE +/- 4.29, N = 9)
Xeon E-2488 @ DDR5-4800: 1053 (SE +/- 30.56, N = 15)
EPYC 4364P @ DDR5-4800: 1016 (SE +/- 3.37, N = 8)
Google Draco Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Lion
EPYC 4364P @ DDR5-5200: 3891 (SE +/- 12.07, N = 7)
EPYC 4564P @ DDR5-5200: 3878 (SE +/- 17.41, N = 7)
EPYC 4564P @ DDR5-4800: 4049 (SE +/- 18.70, N = 7)
Xeon E-2488 @ DDR5-4800: 3830 (SE +/- 9.84, N = 7)
EPYC 4364P @ DDR5-4800: 3922 (SE +/- 5.99, N = 7)
1. (CXX) g++ options: -O3
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Super Fast
EPYC 4364P @ DDR5-5200: 168.08 (SE +/- 0.09, N = 9)
EPYC 4564P @ DDR5-5200: 248.39 (SE +/- 0.26, N = 10)
EPYC 4564P @ DDR5-4800: 245.56 (SE +/- 0.29, N = 10)
Xeon E-2488 @ DDR5-4800: 139.72 (SE +/- 2.29, N = 15)
EPYC 4364P @ DDR5-4800: 168.47 (SE +/- 0.11, N = 9)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Very Fast
EPYC 4364P @ DDR5-5200: 129.02 (SE +/- 0.10, N = 8)
EPYC 4564P @ DDR5-5200: 186.56 (SE +/- 0.25, N = 9)
EPYC 4564P @ DDR5-4800: 184.98 (SE +/- 0.24, N = 9)
Xeon E-2488 @ DDR5-4800: 103.04 (SE +/- 0.87, N = 7)
EPYC 4364P @ DDR5-4800: 129.55 (SE +/- 0.08, N = 8)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Batik SVG Toolkit
EPYC 4364P @ DDR5-5200: 930 (SE +/- 5.55, N = 8)
EPYC 4564P @ DDR5-5200: 936 (SE +/- 3.65, N = 8)
EPYC 4564P @ DDR5-4800: 969 (SE +/- 5.97, N = 8)
Xeon E-2488 @ DDR5-4800: 897 (SE +/- 5.39, N = 8)
EPYC 4364P @ DDR5-4800: 927 (SE +/- 6.28, N = 8)
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 178.20 (SE +/- 0.21, N = 8)
EPYC 4564P @ DDR5-5200: 311.95 (SE +/- 0.26, N = 10)
EPYC 4564P @ DDR5-4800: 307.89 (SE +/- 0.34, N = 10)
Xeon E-2488 @ DDR5-4800: 168.39 (SE +/- 2.05, N = 15)
EPYC 4364P @ DDR5-4800: 178.68 (SE +/- 0.16, N = 8)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Helsing Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 12 digit
EPYC 4364P @ DDR5-5200: 3.828 (SE +/- 0.004, N = 8)
EPYC 4564P @ DDR5-5200: 2.016 (SE +/- 0.003, N = 11)
EPYC 4564P @ DDR5-4800: 2.011 (SE +/- 0.002, N = 10)
Xeon E-2488 @ DDR5-4800: 4.080 (SE +/- 0.028, N = 15)
EPYC 4364P @ DDR5-4800: 3.753 (SE +/- 0.004, N = 8)
1. (CC) gcc options: -O2 -pthread
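For readers unfamiliar with the workload: a vampire number has an even number of digits and factors into two "fangs" of half that length whose combined digits are a rearrangement of the number's own digits, with the two fangs not both ending in zero (for example 1260 = 21 x 60). The sketch below is a naive brute-force check meant only to illustrate the definition. It is not Helsing's algorithm, and the function name `fangs` is made up for this example.

```python
def fangs(n):
    """Return all fang pairs (a, b) with a <= b, or [] if n is not a vampire number."""
    digits = str(n)
    if len(digits) % 2:
        return []  # vampire numbers need an even digit count
    half = len(digits) // 2
    lo, hi = 10 ** (half - 1), 10 ** half  # fangs must have exactly `half` digits
    target = sorted(digits)
    pairs = []
    a = lo
    while a < hi and a * a <= n:
        if n % a == 0:
            b = n // a
            both_end_in_zero = a % 10 == 0 and b % 10 == 0
            if lo <= b < hi and not both_end_in_zero:
                if sorted(str(a) + str(b)) == target:
                    pairs.append((a, b))
        a += 1
    return pairs


# All four-digit vampire numbers, found by exhaustive search.
four_digit = [n for n in range(1000, 10000) if fangs(n)]
print(four_digit)
```

Trial division like this becomes impractical long before the 12-digit range benchmarked here, which is why a dedicated multi-threaded generator is a meaningful CPU test.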
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: GoogLeNet
EPYC 4364P @ DDR5-5200: 62.99 (SE +/- 0.10, N = 9)
EPYC 4564P @ DDR5-5200: 57.34 (SE +/- 0.18, N = 8)
EPYC 4564P @ DDR5-4800: 57.21 (SE +/- 0.05, N = 8)
Xeon E-2488 @ DDR5-4800: 51.33 (SE +/- 0.05, N = 8)
EPYC 4364P @ DDR5-4800: 62.02 (SE +/- 0.11, N = 9)
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 203.16 (SE +/- 0.76, N = 9)
EPYC 4564P @ DDR5-5200: 364.90 (SE +/- 1.71, N = 15)
EPYC 4564P @ DDR5-4800: 355.84 (SE +/- 1.84, N = 15)
Xeon E-2488 @ DDR5-4800: 221.33 (SE +/- 3.49, N = 15)
EPYC 4364P @ DDR5-4800: 201.53 (SE +/- 0.90, N = 9)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Deconvolution Batch shapes_3d - Engine: CPU
EPYC 4364P @ DDR5-5200: 3.83412 (SE +/- 0.00661, N = 9, MIN: 3.58)
EPYC 4564P @ DDR5-5200: 2.46159 (SE +/- 0.00405, N = 9, MIN: 2.35)
EPYC 4564P @ DDR5-4800: 2.46164 (SE +/- 0.00342, N = 9, MIN: 2.35)
Xeon E-2488 @ DDR5-4800: 4.94792 (SE +/- 0.10597, N = 15, MIN: 4.45)
EPYC 4364P @ DDR5-4800: 3.79890 (SE +/- 0.00353, N = 9, MIN: 3.63)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
LAMMPS Molecular Dynamics Simulator LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: Rhodopsin Protein
EPYC 4364P @ DDR5-5200: 6.826 (SE +/- 0.055, N = 9)
EPYC 4564P @ DDR5-5200: 12.481 (SE +/- 0.059, N = 11)
EPYC 4564P @ DDR5-4800: 12.449 (SE +/- 0.084, N = 15)
Xeon E-2488 @ DDR5-4800: 7.236 (SE +/- 0.250, N = 15)
EPYC 4364P @ DDR5-4800: 6.848 (SE +/- 0.032, N = 9)
1. (CXX) g++ options: -O3 -lm -ldl
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240325 Test: PDSCH Processor Benchmark, Throughput Thread
EPYC 4364P @ DDR5-5200: 1061.5 (SE +/- 3.24, N = 10)
EPYC 4564P @ DDR5-5200: 1049.1 (SE +/- 5.92, N = 10)
EPYC 4564P @ DDR5-4800: 1027.8 (SE +/- 2.53, N = 9)
Xeon E-2488 @ DDR5-4800: 809.6 (SE +/- 5.58, N = 9)
EPYC 4364P @ DDR5-4800: 1082.7 (SE +/- 5.66, N = 10)
Per-configuration options (shown for four of the five configurations): -mavx512f -mavx512cd -mavx512bw -mavx512dq
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -fno-trapping-math -fno-math-errno -ldl
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
EPYC 4364P @ DDR5-5200: 226.13 (SE +/- 0.14, N = 10)
EPYC 4564P @ DDR5-5200: 301.57 (SE +/- 0.21, N = 11)
EPYC 4564P @ DDR5-4800: 299.12 (SE +/- 0.21, N = 11)
Xeon E-2488 @ DDR5-4800: 197.51 (SE +/- 4.30, N = 15)
EPYC 4364P @ DDR5-4800: 226.69 (SE +/- 0.17, N = 10)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 242.58 (SE +/- 0.20, N = 10)
EPYC 4564P @ DDR5-5200: 408.42 (SE +/- 0.58, N = 11)
EPYC 4564P @ DDR5-4800: 398.58 (SE +/- 0.22, N = 11)
Xeon E-2488 @ DDR5-4800: 289.26 (SE +/- 0.18, N = 10)
EPYC 4364P @ DDR5-4800: 240.66 (SE +/- 0.20, N = 10)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 251.44 (SE +/- 0.28, N = 10)
EPYC 4564P @ DDR5-5200: 419.26 (SE +/- 0.42, N = 11)
EPYC 4564P @ DDR5-4800: 408.36 (SE +/- 0.27, N = 11)
Xeon E-2488 @ DDR5-4800: 295.69 (SE +/- 0.53, N = 10)
EPYC 4364P @ DDR5-4800: 249.86 (SE +/- 0.36, N = 10)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 23.11 Java Test: Apache Xalan XSLT
EPYC 4364P @ DDR5-5200: 450 (SE +/- 2.71, N = 10)
EPYC 4564P @ DDR5-5200: 506 (SE +/- 3.47, N = 10)
EPYC 4564P @ DDR5-4800: 508 (SE +/- 2.09, N = 10)
Xeon E-2488 @ DDR5-4800: 408 (SE +/- 2.76, N = 11)
EPYC 4364P @ DDR5-4800: 448 (SE +/- 2.21, N = 10)
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 361.05 (SE +/- 0.24, N = 11)
EPYC 4564P @ DDR5-5200: 560.81 (SE +/- 1.04, N = 12)
EPYC 4564P @ DDR5-4800: 545.31 (SE +/- 1.05, N = 12)
Xeon E-2488 @ DDR5-4800: 383.64 (SE +/- 4.53, N = 15)
EPYC 4364P @ DDR5-4800: 358.54 (SE +/- 0.26, N = 11)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100
EPYC 4364P @ DDR5-5200: 16.54 (SE +/- 0.06, N = 12)
EPYC 4564P @ DDR5-5200: 16.11 (SE +/- 0.10, N = 11)
EPYC 4564P @ DDR5-4800: 15.84 (SE +/- 0.02, N = 11)
Xeon E-2488 @ DDR5-4800: 15.37 (SE +/- 0.05, N = 11)
EPYC 4364P @ DDR5-4800: 16.59 (SE +/- 0.04, N = 12)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 13 - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 633.29 (SE +/- 15.50, N = 15)
EPYC 4564P @ DDR5-5200: 728.58 (SE +/- 20.65, N = 15)
EPYC 4564P @ DDR5-4800: 711.53 (SE +/- 20.45, N = 15)
Xeon E-2488 @ DDR5-4800: 649.43 (SE +/- 13.14, N = 15)
EPYC 4364P @ DDR5-4800: 621.63 (SE +/- 14.78, N = 15)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 12 - Input: Bosphorus 1080p
EPYC 4364P @ DDR5-5200: 546.71 (SE +/- 0.74, N = 12)
EPYC 4564P @ DDR5-5200: 660.34 (SE +/- 1.66, N = 15)
EPYC 4564P @ DDR5-4800: 649.20 (SE +/- 0.81, N = 12)
Xeon E-2488 @ DDR5-4800: 548.04 (SE +/- 0.76, N = 12)
EPYC 4364P @ DDR5-4800: 539.87 (SE +/- 0.68, N = 12)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Default
EPYC 4364P @ DDR5-5200: 26.81 (SE +/- 0.07, N = 13)
EPYC 4564P @ DDR5-5200: 26.25 (SE +/- 0.13, N = 13)
EPYC 4564P @ DDR5-4800: 25.78 (SE +/- 0.08, N = 13)
Xeon E-2488 @ DDR5-4800: 23.93 (SE +/- 0.07, N = 13)
EPYC 4364P @ DDR5-4800: 26.78 (SE +/- 0.10, N = 13)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128
EPYC 4364P @ DDR5-5200: 23.72 (SE +/- 0.15, N = 15)
EPYC 4564P @ DDR5-5200: 37.24 (SE +/- 0.58, N = 15)
EPYC 4564P @ DDR5-4800: 35.07 (SE +/- 0.96, N = 15)
Xeon E-2488 @ DDR5-4800: 26.48 (SE +/- 0.39, N = 15)
EPYC 4364P @ DDR5-4800: 22.47 (SE +/- 0.17, N = 15)
1. (CXX) g++ options: -O3
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128
EPYC 4364P @ DDR5-5200: 77.52 (SE +/- 1.52, N = 15)
EPYC 4564P @ DDR5-5200: 127.93 (SE +/- 1.14, N = 15)
EPYC 4564P @ DDR5-4800: 125.54 (SE +/- 1.50, N = 15)
Xeon E-2488 @ DDR5-4800: 85.01 (SE +/- 3.73, N = 15)
EPYC 4364P @ DDR5-4800: 74.13 (SE +/- 0.33, N = 15)
1. (CXX) g++ options: -O3
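Because each EPYC processor appears at both DDR5-4800 and DDR5-5200, the memory-speed effect can be read off as a simple ratio. The following sketch works that arithmetic for the two HeFFTe results above. The numbers are copied from this page, while the helper `percent_gain` and its `higher_is_better` switch are illustrative names, not part of the Phoronix Test Suite.

```python
def percent_gain(new, old, higher_is_better=True):
    """Percentage improvement of `new` over `old` for either result direction."""
    ratio = new / old if higher_is_better else old / new
    return (ratio - 1.0) * 100.0


# GFLOP/s (more is better): (DDR5-5200 result, DDR5-4800 result), from the charts above.
heffte = {
    "EPYC 4364P, double": (23.72, 22.47),
    "EPYC 4564P, double": (37.24, 35.07),
    "EPYC 4364P, float": (77.52, 74.13),
    "EPYC 4564P, float": (127.93, 125.54),
}

for label, (ddr5_5200, ddr5_4800) in heffte.items():
    print(f"{label}: {percent_gain(ddr5_5200, ddr5_4800):+.2f}% at DDR5-5200")
```

For "Fewer Is Better" charts (seconds, ms), pass `higher_is_better=False` so that a shorter time counts as a positive gain. Differences that are small relative to the reported SE should be treated as noise.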
Java SciMark This test runs the Java version of SciMark 2, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This benchmark is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.2 Computational Test: Dense LU Matrix Factorization
EPYC 4364P @ DDR5-5200: 17934.59 (SE +/- 146.88, N = 3)
EPYC 4564P @ DDR5-5200: 17774.36 (SE +/- 115.98, N = 3)
EPYC 4564P @ DDR5-4800: 17230.85 (SE +/- 171.55, N = 3)
Xeon E-2488 @ DDR5-4800: 11976.06 (SE +/- 8.09, N = 3)
EPYC 4364P @ DDR5-4800: 18030.92 (SE +/- 217.22, N = 3)
OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.2 Computational Test: Monte Carlo
EPYC 4364P @ DDR5-5200: 2541.20 (SE +/- 19.82, N = 3)
EPYC 4564P @ DDR5-5200: 2529.18 (SE +/- 17.64, N = 3)
EPYC 4564P @ DDR5-4800: 2456.47 (SE +/- 26.32, N = 3)
Xeon E-2488 @ DDR5-4800: 1426.11 (SE +/- 0.88, N = 3)
EPYC 4364P @ DDR5-4800: 2570.60 (SE +/- 32.12, N = 3)
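As background on what the Monte Carlo kernel exercises: SciMark 2's Monte Carlo test approximates pi by sampling random points in the unit square and counting how many fall inside the quarter circle. The sketch below shows that idea in Python. It is an illustration only, not the benchmark's Java code, and the function name, sample count and seed are arbitrary choices.

```python
import random


def estimate_pi(samples, seed=0):
    """Estimate pi from the fraction of random points inside the unit quarter circle."""
    rng = random.Random(seed)  # fixed seed keeps the run repeatable
    inside = 0
    for _ in range(samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            inside += 1
    # Quarter-circle area is pi/4, so scale the hit ratio by 4.
    return 4.0 * inside / samples


print(estimate_pi(200_000))
```

The loop is dominated by random number generation and a few floating-point operations per sample, so it stresses scalar throughput rather than memory bandwidth.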
OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.2 Computational Test: Jacobi Successive Over-Relaxation
EPYC 4364P @ DDR5-5200: 2489.61 (SE +/- 22.65, N = 3)
EPYC 4564P @ DDR5-5200: 2518.58 (SE +/- 10.01, N = 3)
EPYC 4564P @ DDR5-4800: 2464.14 (SE +/- 18.83, N = 3)
Xeon E-2488 @ DDR5-4800: 2667.90 (SE +/- 2.05, N = 3)
EPYC 4364P @ DDR5-4800: 2487.93 (SE +/- 22.92, N = 3)
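Successive over-relaxation repeatedly replaces each interior grid point with a weighted blend of its four neighbours' average and its own previous value. The following is a small sketch of that stencil update on a Laplace problem, assuming an in-place sweep and a relaxation factor of 1.25. The grid size, boundary values and function name are choices made for this example and are not taken from the benchmark.

```python
def sor_sweeps(grid, omega, sweeps):
    """Apply in-place SOR sweeps to the interior points of a 2D grid."""
    rows, cols = len(grid), len(grid[0])
    for _ in range(sweeps):
        for i in range(1, rows - 1):
            for j in range(1, cols - 1):
                neighbours = (grid[i - 1][j] + grid[i + 1][j]
                              + grid[i][j - 1] + grid[i][j + 1])
                # Blend the neighbour average with the old value.
                grid[i][j] = omega * 0.25 * neighbours + (1.0 - omega) * grid[i][j]
    return grid


# 8x8 grid: top edge held at 1.0, every other boundary point at 0.0.
n = 8
grid = [[0.0] * n for _ in range(n)]
grid[0] = [1.0] * n

sor_sweeps(grid, omega=1.25, sweeps=200)
print(round(grid[1][n // 2], 4))
```

Each update touches only adjacent memory, so on small grids the kernel tends to be bound by floating-point latency and cache behaviour rather than core count.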
OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.2 Computational Test: Fast Fourier Transform
EPYC 4364P @ DDR5-5200: 450.10 (SE +/- 0.41, N = 3)
EPYC 4564P @ DDR5-5200: 443.49 (SE +/- 0.73, N = 3)
EPYC 4564P @ DDR5-4800: 453.05 (SE +/- 3.44, N = 3)
Xeon E-2488 @ DDR5-4800: 1023.23 (SE +/- 8.67, N = 3)
EPYC 4364P @ DDR5-4800: 470.91 (SE +/- 0.60, N = 3)
EPYC 4364P @ DDR5-4800 Processor: AMD EPYC 4364P 8-Core @ 5.57GHz (8 Cores / 16 Threads), Motherboard: Supermicro AS-3015A-I H13SAE-MF v1.00 (1.1 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DRAM-4800MT/s Micron MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 960GB Micron_7450_MTFDKBA960TFR, Graphics: ASPEED, Audio: AMD Rembrandt Radeon HD Audio, Network: 2 x Intel I210
OS: Ubuntu 24.04, Kernel: 6.8.0-11-generic (x86_64), Desktop: GNOME Shell 45.3, Display Server: X Server 1.21.1.11, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: NONE / errors=remount-ro,relatime,rw,stripe=64 / Block Size: 4096
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203
Java Notes: OpenJDK Runtime Environment (build 21.0.2+13-Ubuntu-2)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
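The "Transparent Huge Pages: madvise" entry in the kernel notes reflects a setting that Linux exposes through sysfs, where the active mode is the bracketed word in a line such as `always [madvise] never`. A minimal sketch for checking the same setting on another machine follows. The helper name `active_thp_mode` is made up for this example, and the script prints nothing on systems without that sysfs file.

```python
import re
from pathlib import Path

# Standard Linux sysfs location for the THP policy.
THP_PATH = Path("/sys/kernel/mm/transparent_hugepage/enabled")


def active_thp_mode(text):
    """Return the bracketed (active) mode from the sysfs line, or None if absent."""
    match = re.search(r"\[(\w+)\]", text)
    return match.group(1) if match else None


if THP_PATH.exists():
    print("Transparent Huge Pages:", active_thp_mode(THP_PATH.read_text()))
```

Matching this setting across systems matters when comparing memory-sensitive results, since a different THP policy can shift numbers independently of the hardware.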
Testing initiated at 26 March 2024 23:58 by user phoronix.
Xeon E-2488 @ DDR5-4800 Processor: Intel Xeon E E-2488 @ 3.20GHz (8 Cores / 16 Threads), Motherboard: Supermicro Super Server X13SCL-F v0123456789 (1.1 BIOS), Chipset: Intel Raptor Lake-S PCH, Memory: 2 x 32GB DRAM-4800MT/s Micron MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 960GB Micron_7450_MTFDKBA960TFR, Graphics: ASPEED, Network: 2 x Intel I210
OS: Ubuntu 24.04, Kernel: 6.8.0-11-generic (x86_64), Desktop: GNOME Shell 45.3, Display Server: X Server 1.21.1.11, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: NONE / errors=remount-ro,relatime,rw,stripe=64 / Block Size: 4096
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x122 - Thermald 2.5.6
Java Notes: OpenJDK Runtime Environment (build 21.0.2+13-Ubuntu-2)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 1 April 2024 17:44 by user phoronix.
EPYC 4564P @ DDR5-4800 Processor: AMD EPYC 4564P 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: Supermicro AS-3015A-I H13SAE-MF v1.00 (1.1 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DRAM-4800MT/s Micron MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 960GB Micron_7450_MTFDKBA960TFR, Graphics: ASPEED, Audio: AMD Rembrandt Radeon HD Audio, Network: 2 x Intel I210
OS: Ubuntu 24.04, Kernel: 6.8.0-11-generic (x86_64), Desktop: GNOME Shell 45.3, Display Server: X Server 1.21.1.11, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203
Java Notes: OpenJDK Runtime Environment (build 21.0.2+13-Ubuntu-2)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 April 2024 01:56 by user phoronix.
EPYC 4564P @ DDR5-5200 Processor: AMD EPYC 4564P 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: Supermicro AS-3015A-I H13SAE-MF v1.00 (1.1 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DRAM-5200MT/s, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 960GB Micron_7450_MTFDKBA960TFR, Graphics: ASPEED, Audio: AMD Rembrandt Radeon HD Audio, Network: 2 x Intel I210
OS: Ubuntu 24.04, Kernel: 6.8.0-11-generic (x86_64), Desktop: GNOME Shell 45.3, Display Server: X Server 1.21.1.11, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203
Java Notes: OpenJDK Runtime Environment (build 21.0.2+13-Ubuntu-2)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 27 April 2024 00:18 by user phoronix.
EPYC 4364P @ DDR5-5200
Processor: AMD EPYC 4364P 8-Core @ 5.57GHz (8 Cores / 16 Threads), Motherboard: Supermicro AS-3015A-I H13SAE-MF v1.00 (1.1 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DRAM-5200MT/s, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 960GB Micron_7450_MTFDKBA960TFR, Graphics: ASPEED, Audio: AMD Rembrandt Radeon HD Audio, Network: 2 x Intel I210
OS: Ubuntu 24.04, Kernel: 6.8.0-11-generic (x86_64), Desktop: GNOME Shell 45.3, Display Server: X Server 1.21.1.11, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203
Java Notes: OpenJDK Runtime Environment (build 21.0.2+13-Ubuntu-2)
Python Notes: Python 3.12.2
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 28 April 2024 17:44 by user phoronix.