KVM testing on Ubuntu 22.04 via the Phoronix Test Suite.
c3d-standard-60 AMD Genoa: Processor: AMD EPYC 9B14 (30 Cores / 60 Threads), Motherboard: Google Compute Engine c3d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 22.04, Kernel: 6.2.0-1014-gcp (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Java Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
t2d-standard-60 AMD Milan: Processor: AMD EPYC 7B13 (60 Cores), Motherboard: Google Compute Engine t2d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB PersistentDisk, Network: Red Hat Virtio device
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Java Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
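The two Security Notes strings are long and nearly identical, so a small script can make the difference visible. The following is a minimal sketch, assuming the notes keep the " + " separator and "name: status" layout shown above; the helper name `parse_security_notes` is hypothetical, and the inputs are shortened excerpts of the two listings rather than the full strings.

```python
def parse_security_notes(notes):
    """Split a 'Security Notes' string into a {name: status} dict.

    Assumes entries are joined by ' + ' and that the first ': ' in each
    entry separates the vulnerability name from its status text.
    """
    entries = {}
    for item in notes.split(" + "):
        name, _, status = item.partition(": ")
        entries[name.strip()] = status.strip()
    return entries


# Excerpts copied from the c3d (Genoa) and t2d (Milan) listings above.
genoa_notes = (
    "spec_store_bypass: Mitigation of SSB disabled via prctl + "
    "spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW "
    "STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + "
    "srbds: Not affected"
)
milan_notes = (
    "spec_store_bypass: Mitigation of SSB disabled via prctl + "
    "spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW "
    "STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + "
    "srbds: Not affected"
)

genoa = parse_security_notes(genoa_notes)
milan = parse_security_notes(milan_notes)

# Names whose status text differs between the two systems.
differing = sorted(k for k in genoa.keys() | milan.keys()
                   if genoa.get(k) != milan.get(k))
print(differing)
```

On these excerpts the only differing entry is spectre_v2, where STIBP is reported as always-on for the c3d instance and disabled for the t2d instance.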
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
BRL-CAD 7.36, VGR Performance Metric (more is better)
  t2d-standard-60 AMD Milan: 629363
  c3d-standard-60 AMD Genoa: 510819
  1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
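To compare the two instances on a single result, the raw scores can be turned into a ratio. The sketch below is illustrative only: `relative_performance` is a hypothetical helper, and the `higher_is_better` flag is an assumption about how one might handle the "More Is Better" and "Fewer Is Better" metrics on this page with one function. The inputs are the BRL-CAD scores reported above.

```python
def relative_performance(candidate, baseline, higher_is_better=True):
    """Return how many times better `candidate` is than `baseline`.

    For 'more is better' metrics this is candidate / baseline; for
    'fewer is better' metrics (e.g. seconds) the ratio is inverted so
    that a value above 1.0 always means the candidate did better.
    """
    if candidate <= 0 or baseline <= 0:
        raise ValueError("scores must be positive")
    if higher_is_better:
        return candidate / baseline
    return baseline / candidate


# BRL-CAD 7.36 VGR Performance Metric, more is better (values from above).
milan_vgr = 629363
genoa_vgr = 510819

ratio = relative_performance(milan_vgr, genoa_vgr)
print(f"t2d (Milan) vs c3d (Genoa) on BRL-CAD: {ratio:.3f}x")
```

With these inputs the ratio comes out at roughly 1.23, meaning the t2d instance scored about 23% higher on this particular test.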
nekRS nekRS is an open-source Navier-Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
nekRS 23.0, Input: Kershaw (flops/rank, more is better)
  c3d-standard-60 AMD Genoa: 4289858333 (SE +/- 57202190.49, N = 12)
  t2d-standard-60 AMD Milan: 3681935833 (SE +/- 84802173.02, N = 12)
nekRS 23.0, Input: TurboPipe Periodic (flops/rank, more is better)
  c3d-standard-60 AMD Genoa: 4723940000 (SE +/- 201939132.13, N = 12)
  t2d-standard-60 AMD Milan: 2730620000 (SE +/- 481352.26, N = 3)
  1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
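The standard errors reported with each result are easier to judge when expressed relative to the mean. The sketch below does this for the nekRS TurboPipe Periodic result; the helper name `relative_se_percent` is hypothetical, and reading a large relative SE together with a high run count N as a sign of run-to-run noise is an interpretation, not something the result page states.

```python
def relative_se_percent(mean, se):
    """Express a standard error as a percentage of the reported mean."""
    if mean <= 0:
        raise ValueError("mean must be positive")
    return 100.0 * se / mean


# nekRS 23.0, Input: TurboPipe Periodic: (mean flops/rank, SE, N) from above.
runs = {
    "c3d-standard-60 AMD Genoa": (4723940000, 201939132.13, 12),
    "t2d-standard-60 AMD Milan": (2730620000, 481352.26, 3),
}

for system, (mean, se, n) in runs.items():
    pct = relative_se_percent(mean, se)
    print(f"{system}: SE is {pct:.2f}% of the mean (N = {n})")
```

Here the c3d result carries a relative SE of a little over 4% across 12 runs, while the t2d result is well under 0.1% across 3 runs, so the c3d figure should be read with correspondingly more caution.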
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenRadioss 2023.09.15, Model: Bumper Beam (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 75.68 (SE +/- 0.10, N = 3)
  c3d-standard-60 AMD Genoa: 92.87 (SE +/- 1.56, N = 15)
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, more is better)
  t2d-standard-60 AMD Milan: 196.95 (SE +/- 0.82, N = 3)
  c3d-standard-60 AMD Genoa: 148.58 (SE +/- 2.19, N = 12)
  1. (CXX) g++ options: -O3
OpenRadioss 2023.09.15, Model: Bird Strike on Windshield (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 123.61 (SE +/- 0.13, N = 3)
  c3d-standard-60 AMD Genoa: 147.31 (SE +/- 3.26, N = 9)
OpenRadioss 2023.09.15, Model: Rubber O-Ring Seal Installation (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 72.06 (SE +/- 0.22, N = 3)
  c3d-standard-60 AMD Genoa: 89.65 (SE +/- 4.01, N = 12)
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
Laghos 3.1, Test: Sedov Blast Wave, ube_922_hex.mesh (Major Kernels Total Rate, more is better)
  t2d-standard-60 AMD Milan: 364.64 (SE +/- 0.62, N = 3)
  c3d-standard-60 AMD Genoa: 259.55 (SE +/- 0.31, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, more is better)
  t2d-standard-60 AMD Milan: 109.68 (SE +/- 0.78, N = 3)
  c3d-standard-60 AMD Genoa: 88.63 (SE +/- 0.52, N = 3)
  1. (CXX) g++ options: -O3
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
libxsmm 2-1.17-3645, M N K: 64 (GFLOPS/s, more is better)
  t2d-standard-60 AMD Milan: 554.2 (SE +/- 0.25, N = 3)
  c3d-standard-60 AMD Genoa: 489.7 (SE +/- 0.12, N = 3)
  1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Laghos 3.1, Test: Triple Point Problem (Major Kernels Total Rate, more is better)
  t2d-standard-60 AMD Milan: 222.30 (SE +/- 1.77, N = 3)
  c3d-standard-60 AMD Genoa: 209.00 (SE +/- 0.22, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, more is better)
  t2d-standard-60 AMD Milan: 60.03 (SE +/- 0.66, N = 3)
  c3d-standard-60 AMD Genoa: 57.31 (SE +/- 1.29, N = 15)
  1. (CXX) g++ options: -O3
libxsmm 2-1.17-3645, M N K: 32 (GFLOPS/s, more is better)
  t2d-standard-60 AMD Milan: 289.2 (SE +/- 3.60, N = 4)
  c3d-standard-60 AMD Genoa: 255.4 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenRadioss, Model: INIVOL and Fluid Structure Interaction Drop Container
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND
t2d-standard-60 AMD Milan: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
TensorFlow 2.12, Device: CPU - Batch Size: 16 - Model: ResNet-50 (images/sec, more is better)
  c3d-standard-60 AMD Genoa: 50.99 (SE +/- 0.04, N = 3)
  t2d-standard-60 AMD Milan: 18.29 (SE +/- 0.05, N = 3)
TensorFlow 2.12, Device: CPU - Batch Size: 64 - Model: ResNet-50 (images/sec, more is better)
  c3d-standard-60 AMD Genoa: 69.68 (SE +/- 0.07, N = 3)
  t2d-standard-60 AMD Milan: 20.90 (SE +/- 0.03, N = 3)
GROMACS This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, more is better)
  t2d-standard-60 AMD Milan: 5.289 (SE +/- 0.005, N = 3)
  c3d-standard-60 AMD Genoa: 4.391 (SE +/- 0.011, N = 3)
  1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, more is better)
  t2d-standard-60 AMD Milan: 106.03 (SE +/- 0.91, N = 3)
  c3d-standard-60 AMD Genoa: 93.60 (SE +/- 1.78, N = 12)
  1. (CXX) g++ options: -O3
TensorFlow 2.12, Device: CPU - Batch Size: 32 - Model: ResNet-50 (images/sec, more is better)
  c3d-standard-60 AMD Genoa: 62.74 (SE +/- 0.10, N = 3)
  t2d-standard-60 AMD Milan: 20.36 (SE +/- 0.06, N = 3)
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, more is better)
  c3d-standard-60 AMD Genoa: 19597.86 (SE +/- 77.92, N = 3)
  t2d-standard-60 AMD Milan: 16649.37 (SE +/- 1215.82, N = 15)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.2
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
Remhos 1.0, Test: Sample Remap Example (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 16.33 (SE +/- 0.05, N = 3)
  c3d-standard-60 AMD Genoa: 33.36 (SE +/- 0.17, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, more is better)
  t2d-standard-60 AMD Milan: 122720.61 (SE +/- 42.77, N = 3)
  c3d-standard-60 AMD Genoa: 96257.48 (SE +/- 122.23, N = 3)
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, more is better)
  t2d-standard-60 AMD Milan: 4935.68 (SE +/- 51.21, N = 5)
  c3d-standard-60 AMD Genoa: 3783.60 (SE +/- 30.67, N = 3)
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, more is better)
  t2d-standard-60 AMD Milan: 54846.18 (SE +/- 137.77, N = 3)
  c3d-standard-60 AMD Genoa: 39647.47 (SE +/- 600.19, N = 15)
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, more is better)
  c3d-standard-60 AMD Genoa: 2422.40 (SE +/- 36.45, N = 15)
  t2d-standard-60 AMD Milan: 1752.62 (SE +/- 142.62, N = 12)
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, more is better)
  t2d-standard-60 AMD Milan: 94247.77 (SE +/- 1463.52, N = 15)
  c3d-standard-60 AMD Genoa: 73563.13 (SE +/- 293.57, N = 3)
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, more is better)
  t2d-standard-60 AMD Milan: 47291.96 (SE +/- 145.06, N = 3)
  c3d-standard-60 AMD Genoa: 42701.83 (SE +/- 29.69, N = 3)
NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, more is better)
  t2d-standard-60 AMD Milan: 43228.11 (SE +/- 555.92, N = 3)
  c3d-standard-60 AMD Genoa: 39919.71 (SE +/- 45.01, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.2
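With eight NAS Parallel Benchmarks results that differ widely in magnitude, a geometric mean of the per-test ratios is one common way to summarize them in a single number. The sketch below is an illustration under that assumption, not a figure published on the result page; the variable names are hypothetical and the scores are the Total Mop/s values listed above (c3d first, t2d second).

```python
import math

# Total Mop/s per test as (c3d Genoa, t2d Milan), copied from the results above.
npb_scores = {
    "BT.C": (96257.48, 122720.61),
    "CG.C": (19597.86, 16649.37),
    "EP.D": (3783.60, 4935.68),
    "FT.C": (39647.47, 54846.18),
    "IS.D": (2422.40, 1752.62),
    "LU.C": (73563.13, 94247.77),
    "MG.C": (42701.83, 47291.96),
    "SP.C": (39919.71, 43228.11),
}


def geometric_mean(values):
    """Geometric mean via the mean of logarithms (all values must be > 0)."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Per-test ratio of c3d to t2d; above 1.0 means c3d scored higher.
ratios = {test: genoa / milan for test, (genoa, milan) in npb_scores.items()}
summary = geometric_mean(ratios.values())

for test, r in ratios.items():
    print(f"{test}: c3d/t2d = {r:.3f}")
print(f"geometric mean of c3d/t2d ratios: {summary:.3f}")
```

On these numbers the c3d instance leads only on CG.C and IS.D, and the geometric mean of the ratios lands at roughly 0.91, i.e. the t2d instance is ahead overall on this suite.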
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 3.1, Test: OpenMP LavaMD (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 50.97 (SE +/- 0.13, N = 3)
  c3d-standard-60 AMD Genoa: 64.86 (SE +/- 0.18, N = 3)
Rodinia 3.1, Test: OpenMP HotSpot3D (Seconds, fewer is better)
  c3d-standard-60 AMD Genoa: 84.17 (SE +/- 0.86, N = 15)
  t2d-standard-60 AMD Milan: 88.54 (SE +/- 1.83, N = 12)
Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 42.01 (SE +/- 0.02, N = 3)
  c3d-standard-60 AMD Genoa: 45.50 (SE +/- 0.17, N = 3)
Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 7.368 (SE +/- 0.034, N = 3)
  c3d-standard-60 AMD Genoa: 10.025 (SE +/- 0.013, N = 3)
Rodinia 3.1, Test: OpenMP Streamcluster (Seconds, fewer is better)
  t2d-standard-60 AMD Milan: 6.423 (SE +/- 0.009, N = 3)
  c3d-standard-60 AMD Genoa: 6.448 (SE +/- 0.104, N = 15)
  1. (CXX) g++ options: -O2 -lOpenCL
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 18.39 10.73 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 300 600 900 1200 1500 SE +/- 0.17, N = 3 SE +/- 1.32, N = 3 648.81 1393.56 MIN: 632.52 / MAX: 677.15 MIN: 1290.45 / MAX: 1469.82 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 3.12, N = 15 142.75 73.74 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 50 100 150 200 250 SE +/- 0.13, N = 3 SE +/- 9.17, N = 15 83.99 208.47 MIN: 64.11 / MAX: 117.81 MIN: 119.65 / MAX: 316.01 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.48, N = 3 SE +/- 2.74, N = 15 142.90 78.96 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 0.29, N = 3 SE +/- 7.79, N = 15 83.91 193.45 MIN: 61.61 / MAX: 119.94 MIN: 113.44 / MAX: 315.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 300 600 900 1200 1500 SE +/- 6.33, N = 3 SE +/- 1.55, N = 3 1389.69 368.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.17, N = 3 8.62 40.67 MIN: 5.74 / MAX: 36.03 MIN: 12.07 / MAX: 59.44 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 35.19 26.28 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 120 240 360 480 600 SE +/- 0.21, N = 3 SE +/- 0.35, N = 3 340.29 568.77 MIN: 328.52 / MAX: 353.66 MIN: 481.12 / MAX: 606.31 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 900 1800 2700 3600 4500 SE +/- 8.74, N = 3 SE +/- 15.96, N = 3 4166.39 1285.47 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 2.87 11.65 MIN: 1.77 / MAX: 11.42 MIN: 3.76 / MAX: 29.1 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 120 240 360 480 600 SE +/- 1.04, N = 3 SE +/- 0.93, N = 3 576.94 225.48 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 15 30 45 60 75 SE +/- 0.04, N = 3 SE +/- 0.27, N = 3 20.77 66.46 MIN: 11.01 / MAX: 36.69 MIN: 25.85 / MAX: 122.2 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 400 800 1200 1600 2000 SE +/- 3.16, N = 3 SE +/- 1.12, N = 3 2043.05 1512.57 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.86 9.90 MIN: 3.22 / MAX: 20.19 MIN: 8.13 / MAX: 26.87 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 400 800 1200 1600 2000 SE +/- 0.47, N = 3 SE +/- 0.56, N = 3 1875.28 1014.70 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 14.76 15.98 MIN: 12.71 / MAX: 35.86 MIN: 9.37 / MAX: 24.29 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 1400 2800 4200 5600 7000 SE +/- 6.52, N = 3 SE +/- 2.08, N = 3 6605.12 4239.52 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 1.0193 2.0386 3.0579 4.0772 5.0965 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.52 4.53 MIN: 2.77 / MAX: 18.76 MIN: 2.72 / MAX: 12.38 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 140 280 420 560 700 SE +/- 0.79, N = 3 SE +/- 0.37, N = 3 645.74 565.34 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.56 26.50 MIN: 15.5 / MAX: 36.36 MIN: 19.6 / MAX: 56.62 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.37, N = 3 185.29 96.58 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.09, N = 3 SE +/- 0.58, N = 3 64.70 155.14 MIN: 50.24 / MAX: 93.15 MIN: 114.75 / MAX: 224.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO 2023.1 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 3650.57 (SE +/- 3.28, N = 3)
t2d-standard-60 AMD Milan: 2646.58 (SE +/- 3.26, N = 3)

OpenVINO 2023.1 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 8.20 (SE +/- 0.01, N = 3; MIN: 4.92 / MAX: 14.5)
t2d-standard-60 AMD Milan: 11.32 (SE +/- 0.01, N = 3; MIN: 10.13 / MAX: 30.81)

OpenVINO 2023.1 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 1764.21 (SE +/- 4.38, N = 3)
t2d-standard-60 AMD Milan: 633.76 (SE +/- 3.66, N = 3)

OpenVINO 2023.1 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 6.79 (SE +/- 0.02, N = 3; MIN: 4.45 / MAX: 24.38)
t2d-standard-60 AMD Milan: 23.64 (SE +/- 0.14, N = 3; MIN: 9.57 / MAX: 42.22)

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 964.46 (SE +/- 0.60, N = 3)
t2d-standard-60 AMD Milan: 370.03 (SE +/- 0.96, N = 3)

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 31.08 (SE +/- 0.02, N = 3; MIN: 19.98 / MAX: 50.81)
t2d-standard-60 AMD Milan: 81.00 (SE +/- 0.21, N = 3; MIN: 64.65 / MAX: 134.64)

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 43607.04 (SE +/- 18.42, N = 3)
t2d-standard-60 AMD Milan: 29668.12 (SE +/- 14.46, N = 3)

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 0.52 (SE +/- 0.00, N = 3; MIN: 0.31 / MAX: 8.86)
t2d-standard-60 AMD Milan: 0.99 (SE +/- 0.00, N = 3; MIN: 0.8 / MAX: 13.76)

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 761.14 (SE +/- 1.75, N = 3)
t2d-standard-60 AMD Milan: 390.72 (SE +/- 1.37, N = 3)

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 39.38 (SE +/- 0.09, N = 3; MIN: 34.32 / MAX: 53.98)
t2d-standard-60 AMD Milan: 76.72 (SE +/- 0.27, N = 3; MIN: 58.8 / MAX: 121.69)

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 54971.26 (SE +/- 45.46, N = 3)
t2d-standard-60 AMD Milan: 44049.15 (SE +/- 332.93, N = 3)

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 0.40 (SE +/- 0.00, N = 3; MIN: 0.23 / MAX: 7.49)
t2d-standard-60 AMD Milan: 0.61 (SE +/- 0.00, N = 3; MIN: 0.41 / MAX: 264.31)

1. (CXX) g++ options for the OpenVINO 2023.1 results: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
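The OpenVINO throughput figures reported above can be compared more easily as ratios. The following is a minimal sketch, not part of the original result file: the dictionary name `openvino_fps` and the helper `genoa_speedup` are illustrative, and the numbers are the FPS values copied from the six charts above.

```python
# FPS values copied from the OpenVINO 2023.1 charts above, as
# (c3d-standard-60 AMD Genoa, t2d-standard-60 AMD Milan) pairs.
# The variable and function names are illustrative assumptions.
openvino_fps = {
    "Weld Porosity Detection FP16-INT8": (3650.57, 2646.58),
    "Person Vehicle Bike Detection FP16": (1764.21, 633.76),
    "Handwritten English Recognition FP16": (964.46, 370.03),
    "Age Gender Recognition Retail 0013 FP16": (43607.04, 29668.12),
    "Handwritten English Recognition FP16-INT8": (761.14, 390.72),
    "Age Gender Recognition Retail 0013 FP16-INT8": (54971.26, 44049.15),
}


def genoa_speedup(genoa_fps, milan_fps):
    """Return how many times faster the Genoa result is than the Milan result."""
    return genoa_fps / milan_fps


for model, (genoa, milan) in openvino_fps.items():
    print(f"{model}: {genoa_speedup(genoa, milan):.2f}x")
```

Because FPS is a "more is better" metric, a ratio above 1.0 means the c3d-standard-60 instance was faster for that model.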
Algebraic Multi-Grid Benchmark
AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
c3d-standard-60 AMD Genoa: 962889833 (SE +/- 2060519.90, N = 3)
t2d-standard-60 AMD Milan: 920427767 (SE +/- 1088162.98, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation together with as many scalar transport equations as needed.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 5.63057327 (SE +/- 0.02210564, N = 3)
c3d-standard-60 AMD Genoa: 5.87157885 (SE +/- 0.04970425, N = 3)

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 24.57 (SE +/- 0.31, N = 12)
c3d-standard-60 AMD Genoa: 28.02 (SE +/- 0.25, N = 3)

1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Stockfish
This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads.

Stockfish 15 - Total Time (Nodes Per Second, More Is Better)
t2d-standard-60 AMD Milan: 112958788 (SE +/- 1618403.29, N = 14)
c3d-standard-60 AMD Genoa: 105894457 (SE +/- 1450871.09, N = 3)
Additional flags listed on the chart: -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
7-Zip Compression 22.01 - Test: Decompression Rating (MIPS, More Is Better)
t2d-standard-60 AMD Milan: 247255 (SE +/- 347.74, N = 3)
c3d-standard-60 AMD Genoa: 226211 (SE +/- 519.21, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Timed Linux Kernel Compilation
This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested, or alternatively an allmodconfig build that compiles all possible kernel modules.

Timed Linux Kernel Compilation 6.1 - Build: defconfig (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 33.40 (SE +/- 0.37, N = 5)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status. E: linux-6.1/tools/objtool/include/objtool/elf.h:10:10: fatal error: gelf.h: No such file or directory

Timed Linux Kernel Compilation 6.1 - Build: allmodconfig (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 333.35 (SE +/- 1.20, N = 3)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status. E: linux-6.1/tools/objtool/include/objtool/elf.h:10:10: fatal error: gelf.h: No such file or directory
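The error above says the compiler could not find `gelf.h` while building the kernel's objtool. On Ubuntu that header is shipped by the `libelf-dev` package, so a missing package on the c3d-standard-60 image is a plausible (but unconfirmed) cause. The sketch below is only an illustration of how one might check for the header before starting a build; `find_header` and `DEFAULT_INCLUDE_DIRS` are hypothetical names, and the directory list is an assumption about a typical Ubuntu x86_64 system rather than the compiler's actual search path.

```python
from pathlib import Path

# Assumed include directories on a typical Ubuntu x86_64 install.
# The compiler's real search path may contain additional entries.
DEFAULT_INCLUDE_DIRS = (
    "/usr/include",
    "/usr/local/include",
    "/usr/include/x86_64-linux-gnu",
)


def find_header(name, include_dirs=DEFAULT_INCLUDE_DIRS):
    """Return the first path at which the header exists, or None if it is missing."""
    for directory in include_dirs:
        candidate = Path(directory) / name
        if candidate.is_file():
            return candidate
    return None


location = find_header("gelf.h")
if location is None:
    print("gelf.h not found; the objtool step of the kernel build would likely fail")
else:
    print(f"gelf.h found at {location}")
```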
Blender

Blender 3.6 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 34.27 (SE +/- 0.06, N = 3)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

Blender 3.6 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 89.35 (SE +/- 0.07, N = 3)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

Blender 3.6 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 45.22 (SE +/- 0.13, N = 3)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

Blender 3.6 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 351.58 (SE +/- 0.60, N = 3)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

Blender 3.6 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 112.64 (SE +/- 0.03, N = 3)
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.
libavif avifenc 1.0 - Encoder Speed: 2 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 41.54 (SE +/- 0.11, N = 3)
t2d-standard-60 AMD Milan: 41.99 (SE +/- 0.03, N = 3)

libavif avifenc 1.0 - Encoder Speed: 6 (Seconds, Fewer Is Better)
t2d-standard-60 AMD Milan: 3.205 (SE +/- 0.013, N = 3)
c3d-standard-60 AMD Genoa: 3.250 (SE +/- 0.007, N = 3)

libavif avifenc 1.0 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 6.889 (SE +/- 0.099, N = 3)
t2d-standard-60 AMD Milan: 7.639 (SE +/- 0.031, N = 3)

1. (CXX) g++ options: -O3 -fPIC -lm
nginx
This is a benchmark of the lightweight Nginx HTTP(S) web server. The test profile uses the wrk program to issue HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking.

nginx 1.23.2 - Connections: 500 (Requests Per Second, More Is Better)
c3d-standard-60 AMD Genoa: 187350.44 (SE +/- 126.15, N = 3)
t2d-standard-60 AMD Milan: 162957.75 (SE +/- 394.72, N = 3)

nginx 1.23.2 - Connections: 1000 (Requests Per Second, More Is Better)
c3d-standard-60 AMD Genoa: 180537.84 (SE +/- 688.79, N = 3)
t2d-standard-60 AMD Milan: 155609.04 (SE +/- 156.32, N = 3)

1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenSSL
OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities.

OpenSSL 3.1 - Algorithm: SHA256 (byte/s, More Is Better)
t2d-standard-60 AMD Milan: 50884997103 (SE +/- 20491615.60, N = 3)
c3d-standard-60 AMD Genoa: 46211821313 (SE +/- 9562123.40, N = 3)

OpenSSL 3.1 - Algorithm: SHA512 (byte/s, More Is Better)
t2d-standard-60 AMD Milan: 22244804183 (SE +/- 108274834.92, N = 3)
c3d-standard-60 AMD Genoa: 14702270573 (SE +/- 4399663.13, N = 3)

OpenSSL 3.1 - Algorithm: RSA4096 (sign/s, More Is Better)
c3d-standard-60 AMD Genoa: 20079.5 (SE +/- 14.62, N = 3)
t2d-standard-60 AMD Milan: 12973.0 (SE +/- 11.58, N = 3)

OpenSSL 3.1 - Algorithm: RSA4096 (verify/s, More Is Better)
t2d-standard-60 AMD Milan: 860844.6 (SE +/- 644.45, N = 3)
c3d-standard-60 AMD Genoa: 493077.6 (SE +/- 50.91, N = 3)

OpenSSL 3.1 - Algorithm: ChaCha20 (byte/s, More Is Better)
t2d-standard-60 AMD Milan: 180249145770 (SE +/- 47698640.87, N = 3)
c3d-standard-60 AMD Genoa: 173980949893 (SE +/- 12326205.97, N = 3)

OpenSSL 3.1 - Algorithm: AES-128-GCM (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 343095284440 (SE +/- 342949201.09, N = 3)
t2d-standard-60 AMD Milan: 234604082610 (SE +/- 376720190.05, N = 3)

OpenSSL 3.1 - Algorithm: AES-256-GCM (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 293328048497 (SE +/- 71241287.97, N = 3)
t2d-standard-60 AMD Milan: 216025967640 (SE +/- 178290221.71, N = 3)

OpenSSL 3.1 - Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 123909304773 (SE +/- 3664727.47, N = 3)
t2d-standard-60 AMD Milan: 119647720337 (SE +/- 198663058.81, N = 3)

1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
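The OpenSSL results are mixed, with each instance ahead on some algorithms. One common way to summarize such a set is the geometric mean of the per-test ratios. The sketch below is an illustration only and is not part of the original result file: the names `openssl_results` and `geometric_mean` are assumptions, and treating all eight tests with equal weight is a choice made here, not one stated by the source. The values are copied from the charts above.

```python
import math

# Values copied from the OpenSSL 3.1 charts above, as
# (c3d-standard-60 AMD Genoa, t2d-standard-60 AMD Milan) pairs.
# All metrics are "more is better", so Genoa / Milan > 1 favors Genoa.
openssl_results = {
    "SHA256": (46211821313, 50884997103),
    "SHA512": (14702270573, 22244804183),
    "RSA4096 sign": (20079.5, 12973.0),
    "RSA4096 verify": (493077.6, 860844.6),
    "ChaCha20": (173980949893, 180249145770),
    "AES-128-GCM": (343095284440, 234604082610),
    "AES-256-GCM": (293328048497, 216025967640),
    "ChaCha20-Poly1305": (123909304773, 119647720337),
}


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


ratios = {name: genoa / milan for name, (genoa, milan) in openssl_results.items()}
overall = geometric_mean(ratios.values())

for name, ratio in ratios.items():
    print(f"{name}: Genoa/Milan = {ratio:.3f}")
print(f"Geometric mean of ratios: {overall:.3f}")
```

The ratios are dimensionless, so byte/s, sign/s, and verify/s results can be combined, although an equal-weight summary hides the large per-algorithm swings visible in the individual results.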
Apache IoTDB
Apache IoTDB is a time series database and this benchmark is facilitated using the IoT Benchmark [https://github.com/thulab/iot-benchmark/].

Apache IoTDB 1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 (point/sec, More Is Better)
t2d-standard-60 AMD Milan: 33466804 (SE +/- 303573.79, N = 3)
c3d-standard-60 AMD Genoa: 33268158 (SE +/- 154587.14, N = 3)

Apache IoTDB 1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 (Average Latency, Fewer Is Better)
t2d-standard-60 AMD Milan: 415.08 (SE +/- 4.08, N = 3; MAX: 28810.4)
c3d-standard-60 AMD Genoa: 418.59 (SE +/- 2.14, N = 3; MAX: 31920.67)

Apache IoTDB 1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 (point/sec, More Is Better)
t2d-standard-60 AMD Milan: 34925899 (SE +/- 221111.93, N = 3)
c3d-standard-60 AMD Genoa: 34762565 (SE +/- 188621.29, N = 3)

Apache IoTDB 1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 (Average Latency, Fewer Is Better)
c3d-standard-60 AMD Genoa: 623.89 (SE +/- 3.18, N = 3; MAX: 41831.2)
t2d-standard-60 AMD Milan: 633.58 (SE +/- 12.58, N = 3; MAX: 54749.7)

Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 (point/sec, More Is Better)
c3d-standard-60 AMD Genoa: 34332237 (SE +/- 66565.66, N = 3)
t2d-standard-60 AMD Milan: 34123810 (SE +/- 130588.20, N = 3)

Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 (Average Latency, Fewer Is Better)
t2d-standard-60 AMD Milan: 433.96 (SE +/- 35.57, N = 3; MAX: 103381.73)
c3d-standard-60 AMD Genoa: 447.68 (SE +/- 27.08, N = 3; MAX: 95136.87)

Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 (point/sec, More Is Better)
c3d-standard-60 AMD Genoa: 35359884 (SE +/- 267152.12, N = 3)
t2d-standard-60 AMD Milan: 35068557 (SE +/- 354923.23, N = 3)

Apache IoTDB 1.2 - Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 (Average Latency, Fewer Is Better)
c3d-standard-60 AMD Genoa: 682.12 (SE +/- 7.53, N = 3; MAX: 98294.84)
t2d-standard-60 AMD Milan: 709.46 (SE +/- 25.87, N = 3; MAX: 113264.06)
PostgreSQL
This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks.

PostgreSQL 16 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only (TPS, More Is Better)
t2d-standard-60 AMD Milan: 2003784 (SE +/- 20558.90, N = 3)
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
t2d-standard-60 AMD Milan: 2008186 (SE +/- 22497.42, N = 3)
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

PostgreSQL 16 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write (TPS, More Is Better)
t2d-standard-60 AMD Milan: 5682 (SE +/- 49.28, N = 12)
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better)
t2d-standard-60 AMD Milan: 5793 (SE +/- 49.93, N = 8)
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

PostgreSQL 16 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
t2d-standard-60 AMD Milan: 0.399 (SE +/- 0.004, N = 3)

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
t2d-standard-60 AMD Milan: 0.498 (SE +/- 0.006, N = 3)

PostgreSQL 16 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
t2d-standard-60 AMD Milan: 140.92 (SE +/- 1.23, N = 12)

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
t2d-standard-60 AMD Milan: 172.72 (SE +/- 1.44, N = 8)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
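The PostgreSQL TPS and average latency figures reported for the t2d-standard-60 instance appear to be two views of the same measurement: with a fixed number of clients, average latency is approximately clients divided by TPS. The sketch below checks that relation against the reported numbers. It is an illustration under that assumption, not a description of how the test profile computes its results, and the helper name `expected_latency_ms` is hypothetical.

```python
def expected_latency_ms(clients, tps):
    """Estimate average latency in milliseconds as clients / TPS."""
    return clients / tps * 1000.0


# (mode, clients, reported TPS, reported average latency in ms),
# copied from the t2d-standard-60 AMD Milan results above.
pgbench_results = [
    ("Read Only", 800, 2003784, 0.399),
    ("Read Only", 1000, 2008186, 0.498),
    ("Read Write", 800, 5682, 140.92),
    ("Read Write", 1000, 5793, 172.72),
]

for mode, clients, tps, reported_ms in pgbench_results:
    estimate = expected_latency_ms(clients, tps)
    relative_gap = abs(estimate - reported_ms) / reported_ms
    print(f"{mode}, {clients} clients: estimated {estimate:.3f} ms, "
          f"reported {reported_ms} ms, gap {relative_gap:.2%}")
```

Small gaps between the estimate and the reported value are expected, since the reported TPS and latency are each averaged over several runs.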
c3d-standard-60 AMD Genoa
Processor: AMD EPYC 9B14 (30 Cores / 60 Threads), Motherboard: Google Compute Engine c3d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 22.04, Kernel: 6.2.0-1014-gcp (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Java Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2023 00:31 by user michael_larabel.
t2d-standard-60 AMD Milan
Processor: AMD EPYC 7B13 (60 Cores), Motherboard: Google Compute Engine t2d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB PersistentDisk, Network: Red Hat Virtio device
OS: Ubuntu 22.04, Kernel: 6.2.0-1014-gcp (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Java Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2023 12:59 by user michael_larabel.