amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.
c3d-standard-60 AMD Genoa Processor: AMD EPYC 9B14 (30 Cores / 60 Threads), Motherboard: Google Compute Engine c3d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 22.04, Kernel: 6.2.0-1014-gcp (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0xffffffffJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
t2d-standard-60 AMD Milan Processor: AMD EPYC 7B13 (60 Cores) , Motherboard: Google Compute Engine t2d-standard-60 , Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB PersistentDisk , Network: Red Hat Virtio device
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0xffffffffJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
c6g.16xlarge Processor: ARMv8 Neoverse-N1 (64 Cores) , Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS) , Chipset: Amazon Device 0200 , Memory: 128GB , Disk: 215GB Amazon Elastic Block Store , Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -vJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
m7a.16xlarge Processor: AMD EPYC 9R14 (64 Cores) , Motherboard: Amazon EC2 m7a.16xlarge (1.0 BIOS) , Chipset: Intel 440FX 82441FX PMC , Memory: 256GB , Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0xa10113eJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
GCE c3d-standard-60 Processor Motherboard Chipset Memory Disk Network OS Kernel Vulkan Compiler File-System System Layer c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge m7a.16xlarge AMD EPYC 9B14 (30 Cores / 60 Threads) Google Compute Engine c3d-standard-60 Intel 440FX 82441FX PMC 240GB 215GB nvme_card-pd Google Compute Engine Virtual Ubuntu 22.04 6.2.0-1014-gcp (x86_64) 1.3.238 GCC 11.4.0 ext4 KVM AMD EPYC 7B13 (60 Cores) Google Compute Engine t2d-standard-60 215GB PersistentDisk Red Hat Virtio device ARMv8 Neoverse-N1 (64 Cores) Amazon EC2 c6g.16xlarge (1.0 BIOS) Amazon Device 0200 128GB 215GB Amazon Elastic Block Store Amazon Elastic 5.19.0-1025-aws (aarch64) amazon AMD EPYC 9R14 (64 Cores) Amazon EC2 m7a.16xlarge (1.0 BIOS) Intel 440FX 82441FX PMC 256GB 5.19.0-1025-aws (x86_64) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - c3d-standard-60 AMD Genoa: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - t2d-standard-60 AMD Milan: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c6g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - m7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - c3d-standard-60 AMD Genoa: CPU Microcode: 0xffffffff - t2d-standard-60 AMD Milan: CPU Microcode: 0xffffffff - m7a.16xlarge: CPU Microcode: 0xa10113e Java Details - OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04) Python Details - Python 3.10.12 Security Details - c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c6g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - m7a.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge m7a.16xlarge Logarithmic Result Overview Phoronix Test Suite NAS Parallel Benchmarks OpenSSL nekRS GROMACS libavif avifenc libxsmm Remhos Xcompact3d Incompact3d Algebraic Multi-Grid Benchmark OpenVINO Timed Node.js Compilation LAMMPS Molecular Dynamics Simulator Coremark Stockfish Rodinia Apache Cassandra Timed Gem5 Compilation nginx HeFFTe - Highly Efficient FFT for Exascale 7-Zip Compression Laghos
GCE c3d-standard-60 brl-cad: VGR Performance Metric nekrs: Kershaw nekrs: TurboPipe Periodic openradioss: Bumper Beam openradioss: Chrysler Neon 1M openradioss: Cell Phone Drop Test heffte: r2c - FFTW - float - 128 openradioss: Bird Strike on Windshield openradioss: Rubber O-Ring Seal Installation laghos: Sedov Blast Wave, ube_922_hex.mesh heffte: c2c - FFTW - float - 128 libxsmm: 64 laghos: Triple Point Problem heffte: c2c - FFTW - double - 128 libxsmm: 32 tensorflow: CPU - 16 - ResNet-50 tensorflow: CPU - 64 - ResNet-50 gromacs: MPI CPU - water_GMX50_bare lammps: 20k Atoms heffte: r2c - FFTW - double - 128 tensorflow: CPU - 32 - ResNet-50 lammps: Rhodopsin Protein npb: CG.C remhos: Sample Remap Example npb: BT.C npb: EP.D npb: FT.C npb: IS.D npb: LU.C npb: MG.C npb: SP.C rodinia: OpenMP LavaMD rodinia: OpenMP HotSpot3D rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU amg: incompact3d: input.i3d 129 Cells Per Direction incompact3d: input.i3d 193 Cells Per Direction coremark: CoreMark Size 666 - Iterations Per Second stockfish: Total Time compress-7zip: Compression Rating compress-7zip: Decompression Rating build-linux-kernel: defconfig build-linux-kernel: allmodconfig blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only avifenc: 0 avifenc: 2 avifenc: 6 avifenc: 6, Lossless build-gem5: Time To Compile build-nodejs: Time To Compile nginx: 500 nginx: 1000 openssl: SHA256 openssl: SHA512 openssl: RSA4096 openssl: RSA4096 openssl: ChaCha20 openssl: AES-128-GCM openssl: AES-256-GCM openssl: ChaCha20-Poly1305 apache-iotdb: 500 - 100 - 500 - 400 apache-iotdb: 500 - 100 - 500 - 400 apache-iotdb: 500 - 100 - 800 - 400 apache-iotdb: 500 - 100 - 800 - 400 apache-iotdb: 800 - 100 - 500 - 400 apache-iotdb: 800 - 100 - 500 - 400 apache-iotdb: 800 - 100 - 800 - 400 apache-iotdb: 800 - 100 - 800 - 400 cassandra: Writes pgbench: 100 - 800 - Read Only pgbench: 100 - 1000 - Read Only pgbench: 100 - 800 - Read Write pgbench: 100 - 1000 - Read Write pgbench: 100 - 800 - Read Only - Average Latency pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 800 - Read Write - Average Latency pgbench: 100 - 1000 - Read Write - Average Latency c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge m7a.16xlarge 510819 4289858333 4723940000 92.87 337.70 38.82 148.575 147.31 89.65 259.55 88.6301 489.7 209.00 57.3116 255.4 50.99 69.68 4.391 19.776 93.6005 62.74 17.423 19597.86 33.362 96257.48 3783.60 39647.47 2422.40 73563.13 42701.83 39919.71 64.862 84.166 45.498 10.025 6.448 18.39 648.81 142.75 83.99 142.90 83.91 1389.69 8.62 35.19 340.29 4166.39 2.87 576.94 20.77 2043.05 5.86 1875.28 15.98 6605.12 4.53 645.74 18.56 185.29 64.70 3650.57 8.20 1764.21 6.79 964.46 31.08 43607.04 0.52 761.14 39.38 54971.26 0.4 962889833 5.87157885 28.0196877 1445843.521552 105894457 271795 226211 78.068 41.538 3.250 6.889 176.767 198.394 187350.44 180537.84 46211821313 14702270573 20079.5 493077.6 173980949893 343095284440 293328048497 123909304773 33268158 418.59 34762565 623.89 34332237 447.68 35359884 682.12 228640 629363 3681935833 2730620000 75.68 327.88 30.12 196.948 123.61 72.06 364.64 109.676 554.2 222.30 60.0343 289.2 18.29 20.90 5.289 26.734 106.029 20.36 27.828 16649.37 16.326 122720.61 4935.68 54846.18 1752.62 94247.77 47291.96 43228.11 50.974 88.535 42.010 7.368 6.423 10.73 1393.56 73.74 208.47 78.96 193.45 368.39 40.67 26.28 568.77 1285.47 11.65 225.48 66.46 1512.57 9.90 1014.70 14.76 4239.52 3.52 565.34 26.50 96.58 155.14 2646.58 11.32 633.76 23.64 370.03 81.00 29668.12 0.99 390.72 76.72 44049.15 0.61 920427767 5.63057327 24.5721181 1730658.449440 112958788 278973 247255 33.399 333.351 34.27 89.35 45.22 351.58 112.64 78.350 41.989 3.205 7.639 170.930 191.706 162957.75 155609.04 50884997103 22244804183 12973.0 860844.6 180249145770 234604082610 216025967640 119647720337 33466804 415.08 34925899 633.58 34123810 433.96 35068557 709.46 187169 2003784 2008186 5682 5793 0.399 0.498 140.916 172.717 1758860000 2221710000 202.445 321.29 129.172 589.5 179.52 32.3575 312.7 2.766 25.059 79.0156 26.041 13343.35 20.816 24229.14 2213.76 21386.37 915.80 18807.75 25661.04 9716.99 62.301 5.983 14.212 0.1 9996.56 1.06 947.59 1.06 947.86 6.53 153.12 0.04 22391.86 20.79 48.08 2.61 382.47 0.14 6990.10 8.39 119.17 0.46 2186.81 0.15 6773.31 1.36 735.58 5.50 181.94 7.36 135.87 2.53 394.94 178.82 5.58 2.36 423.95 136.16 7.33 1032893667 5.61811686 25.8748328 1259870.716902 81807706 239735 234046 102.216 409.097 270.068 167.946 4.467 8.879 224.414 286.201 162553.85 158700.36 42288513973 14384917863 2640.0 215683.2 67324778360 158788510970 129198197600 46715126487 217355 1043267 975031 4784 4776 0.767 1.026 168.191 210.608 788704 7667846667 4774796667 66.27 190.79 26.14 190.602 115.96 59.18 409.73 121.505 1201.8 218.86 71.1095 643.4 69.55 100.15 7.655 31.471 124.363 87.19 32.785 42007.57 13.867 193219.12 7501.76 103413.02 4085.20 210544.87 121293.80 102392.40 43.286 74.778 34.661 6.480 5.930 31.72 503.38 284.42 56.21 283.40 56.41 2417.34 6.60 61.19 261.11 7222.10 2.21 1049.66 15.22 3666.89 4.35 3132.48 10.19 10382.22 3.07 1222.99 13.07 315.14 50.72 6177.94 5.16 3146.04 5.07 1419.11 22.53 81996.81 0.38 1158.43 27.60 92100.80 0.27 1843444333 2.89602661 11.5913086 2158639.274883 135419169 330633 282593 27.709 267.965 27.74 71.51 37.12 276.23 91.88 65.447 35.805 2.649 5.678 153.800 154.441 233014.72 224859.09 62253861197 26481506820 31583.8 996017.5 308308045083 592545362740 522113080527 216773475533 40899210 340.11 44502333 521.98 42643903 355.13 44699315 598.49 278585 2923009 2880940 5312 5300 0.274 0.347 150.601 188.676 OpenBenchmarking.org
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.36 VGR Performance Metric m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 200K 400K 600K 800K 1000K 788704 629363 510819 1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
VGR Performance Metric
c6g.16xlarge: The test quit with a non-zero exit status. E: ERROR: Could not find the BRL-CAD raytracer
nekRS nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: Kershaw m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1600M 3200M 4800M 6400M 8000M SE +/- 49077561.86, N = 3 SE +/- 57202190.49, N = 12 SE +/- 84802173.02, N = 12 SE +/- 2970005.61, N = 3 7667846667 4289858333 3681935833 1758860000 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: TurboPipe Periodic m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1000M 2000M 3000M 4000M 5000M SE +/- 6657808.28, N = 3 SE +/- 201939132.13, N = 12 SE +/- 481352.26, N = 3 SE +/- 1790009.31, N = 3 4774796667 4723940000 2730620000 2221710000 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 SE +/- 1.56, N = 15 66.27 75.68 92.87
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 70 140 210 280 350 SE +/- 0.61, N = 3 SE +/- 1.48, N = 3 SE +/- 2.03, N = 3 190.79 327.88 337.70
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.39, N = 15 SE +/- 1.27, N = 15 26.14 30.12 38.82
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 c6g.16xlarge t2d-standard-60 AMD Milan m7a.16xlarge c3d-standard-60 AMD Genoa 40 80 120 160 200 SE +/- 0.57, N = 3 SE +/- 0.82, N = 3 SE +/- 2.16, N = 15 SE +/- 2.19, N = 12 202.45 196.95 190.60 148.58 1. (CXX) g++ options: -O3
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.13, N = 3 SE +/- 3.26, N = 9 115.96 123.61 147.31
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 20 40 60 80 100 SE +/- 0.73, N = 3 SE +/- 0.22, N = 3 SE +/- 4.01, N = 12 59.18 72.06 89.65
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge c3d-standard-60 AMD Genoa 90 180 270 360 450 SE +/- 1.29, N = 3 SE +/- 0.62, N = 3 SE +/- 0.79, N = 3 SE +/- 0.31, N = 3 409.73 364.64 321.29 259.55 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 c6g.16xlarge m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 1.44, N = 15 SE +/- 0.78, N = 3 SE +/- 0.52, N = 3 129.17 121.51 109.68 88.63 1. (CXX) g++ options: -O3
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 m7a.16xlarge c6g.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 300 600 900 1200 1500 SE +/- 0.52, N = 3 SE +/- 0.96, N = 3 SE +/- 0.25, N = 3 SE +/- 0.12, N = 3 1201.8 589.5 554.2 489.7 -lquadmath -msse4.2 -march=armv8.1-a -lquadmath -msse4.2 -lquadmath -msse4.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem t2d-standard-60 AMD Milan m7a.16xlarge c3d-standard-60 AMD Genoa c6g.16xlarge 50 100 150 200 250 SE +/- 1.77, N = 3 SE +/- 1.67, N = 3 SE +/- 0.22, N = 3 SE +/- 0.50, N = 3 222.30 218.86 209.00 179.52 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 16 32 48 64 80 SE +/- 0.63, N = 15 SE +/- 0.66, N = 3 SE +/- 1.29, N = 15 SE +/- 0.10, N = 3 71.11 60.03 57.31 32.36 1. (CXX) g++ options: -O3
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 m7a.16xlarge c6g.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 140 280 420 560 700 SE +/- 0.40, N = 3 SE +/- 0.47, N = 3 SE +/- 3.60, N = 4 SE +/- 0.19, N = 3 643.4 312.7 289.2 255.4 -lquadmath -msse4.2 -march=armv8.1-a -lquadmath -msse4.2 -lquadmath -msse4.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
Model: INIVOL and Fluid Structure Interaction Drop Container
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND
t2d-standard-60 AMD Milan: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND
m7a.16xlarge: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container_0001.rad NOT FOUND
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 16 - Model: ResNet-50 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 15 30 45 60 75 SE +/- 0.20, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 69.55 50.99 18.29
Device: CPU - Batch Size: 16 - Model: ResNet-50
c6g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 100.15 69.68 20.90
Device: CPU - Batch Size: 64 - Model: ResNet-50
c6g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 2 4 6 8 10 SE +/- 0.035, N = 3 SE +/- 0.005, N = 3 SE +/- 0.011, N = 3 SE +/- 0.001, N = 3 7.655 5.289 4.391 2.766 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 30 60 90 120 150 SE +/- 1.18, N = 15 SE +/- 0.91, N = 3 SE +/- 1.78, N = 12 SE +/- 0.71, N = 3 124.36 106.03 93.60 79.02 1. (CXX) g++ options: -O3
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 32 - Model: ResNet-50 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 87.19 62.74 20.36
Device: CPU - Batch Size: 32 - Model: ResNet-50
c6g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 9K 18K 27K 36K 45K SE +/- 178.46, N = 3 SE +/- 77.92, N = 3 SE +/- 1215.82, N = 15 SE +/- 23.52, N = 3 42007.57 19597.86 16649.37 13343.35 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge c3d-standard-60 AMD Genoa 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.17, N = 3 13.87 16.33 20.82 33.36 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 40K 80K 120K 160K 200K SE +/- 560.75, N = 3 SE +/- 42.77, N = 3 SE +/- 122.23, N = 3 SE +/- 7.69, N = 3 193219.12 122720.61 96257.48 24229.14 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 1600 3200 4800 6400 8000 SE +/- 8.54, N = 3 SE +/- 51.21, N = 5 SE +/- 30.67, N = 3 SE +/- 7.28, N = 3 7501.76 4935.68 3783.60 2213.76 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 20K 40K 60K 80K 100K SE +/- 446.85, N = 3 SE +/- 137.77, N = 3 SE +/- 600.19, N = 15 SE +/- 2.85, N = 3 103413.02 54846.18 39647.47 21386.37 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 900 1800 2700 3600 4500 SE +/- 3.14, N = 3 SE +/- 36.45, N = 15 SE +/- 142.62, N = 12 SE +/- 0.58, N = 3 4085.20 2422.40 1752.62 915.80 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 50K 100K 150K 200K 250K SE +/- 661.61, N = 3 SE +/- 1463.52, N = 15 SE +/- 293.57, N = 3 SE +/- 7.52, N = 3 210544.87 94247.77 73563.13 18807.75 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 30K 60K 90K 120K 150K SE +/- 526.14, N = 3 SE +/- 145.06, N = 3 SE +/- 29.69, N = 3 SE +/- 10.99, N = 3 121293.80 47291.96 42701.83 25661.04 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 20K 40K 60K 80K 100K SE +/- 91.17, N = 3 SE +/- 555.92, N = 3 SE +/- 45.01, N = 3 SE +/- 0.76, N = 3 102392.40 43228.11 39919.71 9716.99 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge c3d-standard-60 AMD Genoa 14 28 42 56 70 SE +/- 0.24, N = 3 SE +/- 0.13, N = 3 SE +/- 0.03, N = 3 SE +/- 0.18, N = 3 43.29 50.97 62.30 64.86 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.74, N = 15 SE +/- 0.86, N = 15 SE +/- 1.83, N = 12 74.78 84.17 88.54 1. (CXX) g++ options: -O2 -lOpenCL
Test: OpenMP HotSpot3D
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 10 20 30 40 50 SE +/- 0.32, N = 3 SE +/- 0.02, N = 3 SE +/- 0.17, N = 3 34.66 42.01 45.50 1. (CXX) g++ options: -O2 -lOpenCL
Test: OpenMP Leukocyte
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver c6g.16xlarge m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 3 6 9 12 15 SE +/- 0.001, N = 3 SE +/- 0.007, N = 3 SE +/- 0.034, N = 3 SE +/- 0.013, N = 3 5.983 6.480 7.368 10.025 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 4 8 12 16 20 SE +/- 0.049, N = 3 SE +/- 0.009, N = 3 SE +/- 0.104, N = 15 SE +/- 0.017, N = 3 5.930 6.423 6.448 14.212 1. (CXX) g++ options: -O2 -lOpenCL
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 31.72 18.39 10.73 0.10 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 2K 4K 6K 8K 10K SE +/- 0.12, N = 3 SE +/- 0.17, N = 3 SE +/- 1.32, N = 3 SE +/- 1.02, N = 3 503.38 648.81 1393.56 9996.56 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 9993.58 / MAX: 10001.76 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 60 120 180 240 300 SE +/- 0.53, N = 3 SE +/- 0.23, N = 3 SE +/- 3.12, N = 15 SE +/- 0.00, N = 3 284.42 142.75 73.74 1.06 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 200 400 600 800 1000 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 9.17, N = 15 SE +/- 0.37, N = 3 56.21 83.99 208.47 947.59 -pie - MIN: 119.65 / MAX: 316.01 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 943.57 / MAX: 951.68 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 60 120 180 240 300 SE +/- 0.29, N = 3 SE +/- 0.48, N = 3 SE +/- 2.74, N = 15 SE +/- 0.00, N = 3 283.40 142.90 78.96 1.06 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 200 400 600 800 1000 SE +/- 0.06, N = 3 SE +/- 0.29, N = 3 SE +/- 7.79, N = 15 SE +/- 0.55, N = 3 56.41 83.91 193.45 947.86 -pie - MIN: 113.44 / MAX: 315.54 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 944.57 / MAX: 958.26 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 500 1000 1500 2000 2500 SE +/- 1.65, N = 3 SE +/- 6.33, N = 3 SE +/- 1.55, N = 3 SE +/- 0.01, N = 3 2417.34 1389.69 368.39 6.53 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.17, N = 3 SE +/- 0.09, N = 3 6.60 8.62 40.67 153.12 -pie - MIN: 12.07 / MAX: 59.44 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 152.69 / MAX: 153.75 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 14 28 42 56 70 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 61.19 35.19 26.28 0.04 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 5K 10K 15K 20K 25K SE +/- 0.03, N = 3 SE +/- 0.21, N = 3 SE +/- 0.35, N = 3 SE +/- 15.60, N = 3 261.11 340.29 568.77 22391.86 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 22364.17 / MAX: 22423.42 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1500 3000 4500 6000 7500 SE +/- 4.07, N = 3 SE +/- 8.74, N = 3 SE +/- 15.96, N = 3 SE +/- 0.01, N = 3 7222.10 4166.39 1285.47 20.79 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 11 22 33 44 55 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 SE +/- 0.02, N = 3 2.21 2.87 11.65 48.08 -pie - MIN: 3.76 / MAX: 29.1 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 47.55 / MAX: 50.49 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 200 400 600 800 1000 SE +/- 0.11, N = 3 SE +/- 1.04, N = 3 SE +/- 0.93, N = 3 SE +/- 0.00, N = 3 1049.66 576.94 225.48 2.61 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 80 160 240 320 400 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.27, N = 3 SE +/- 0.20, N = 3 15.22 20.77 66.46 382.47 -pie - MIN: 25.85 / MAX: 122.2 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 381.67 / MAX: 383.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 800 1600 2400 3200 4000 SE +/- 1.69, N = 3 SE +/- 3.16, N = 3 SE +/- 1.12, N = 3 SE +/- 0.00, N = 15 3666.89 2043.05 1512.57 0.14 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1500 3000 4500 6000 7500 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 27.09, N = 15 4.35 5.86 9.90 6990.10 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6842.82 / MAX: 7088.53 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 700 1400 2100 2800 3500 SE +/- 0.31, N = 3 SE +/- 0.47, N = 3 SE +/- 0.56, N = 3 SE +/- 0.01, N = 3 3132.48 1875.28 1014.70 8.39 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 10.19 14.76 15.98 119.17 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 118.73 / MAX: 120.21 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 2K 4K 6K 8K 10K SE +/- 3.60, N = 3 SE +/- 6.52, N = 3 SE +/- 2.08, N = 3 SE +/- 0.01, N = 3 10382.22 6605.12 4239.52 0.46 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 500 1000 1500 2000 2500 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 27.29, N = 3 3.07 3.52 4.53 2186.81 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 2135.06 / MAX: 2233.34 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 300 600 900 1200 1500 SE +/- 0.51, N = 3 SE +/- 0.79, N = 3 SE +/- 0.37, N = 3 SE +/- 0.00, N = 3 1222.99 645.74 565.34 0.15 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1500 3000 4500 6000 7500 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 76.95, N = 3 13.07 18.56 26.50 6773.31 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6616.83 / MAX: 6859.99 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 70 140 210 280 350 SE +/- 0.11, N = 3 SE +/- 0.25, N = 3 SE +/- 0.37, N = 3 SE +/- 0.00, N = 3 315.14 185.29 96.58 1.36 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 160 320 480 640 800 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 SE +/- 0.58, N = 3 SE +/- 0.26, N = 3 50.72 64.70 155.14 735.58 -pie - MIN: 114.75 / MAX: 224.64 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 734.34 / MAX: 738.23 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1300 2600 3900 5200 6500 SE +/- 4.43, N = 3 SE +/- 3.28, N = 3 SE +/- 3.26, N = 3 SE +/- 0.01, N = 3 6177.94 3650.57 2646.58 5.50 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.24, N = 3 5.16 8.20 11.32 181.94 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 180.71 / MAX: 184.11 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 700 1400 2100 2800 3500 SE +/- 1.10, N = 3 SE +/- 4.38, N = 3 SE +/- 3.66, N = 3 SE +/- 0.01, N = 3 3146.04 1764.21 633.76 7.36 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 5.07 6.79 23.64 135.87 -pie - MIN: 9.57 / MAX: 42.22 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 132.35 / MAX: 148.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 300 600 900 1200 1500 SE +/- 0.70, N = 3 SE +/- 0.60, N = 3 SE +/- 0.96, N = 3 SE +/- 0.02, N = 3 1419.11 964.46 370.03 2.53 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 90 180 270 360 450 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.21, N = 3 SE +/- 2.18, N = 3 22.53 31.08 81.00 394.94 -pie - MIN: 64.65 / MAX: 134.64 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 380.82 / MAX: 408.15 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 20K 40K 60K 80K 100K SE +/- 31.79, N = 3 SE +/- 18.42, N = 3 SE +/- 14.46, N = 3 SE +/- 0.39, N = 3 81996.81 43607.04 29668.12 178.82 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 1.2555 2.511 3.7665 5.022 6.2775 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.38 0.52 0.99 5.58 -pie - MIN: 0.8 / MAX: 13.76 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 5.45 / MAX: 6.48 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 200 400 600 800 1000 SE +/- 0.18, N = 3 SE +/- 1.75, N = 3 SE +/- 1.37, N = 3 SE +/- 0.00, N = 3 1158.43 761.14 390.72 2.36 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 90 180 270 360 450 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.27, N = 3 SE +/- 0.26, N = 3 27.60 39.38 76.72 423.95 -pie - MIN: 58.8 / MAX: 121.69 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 411.17 / MAX: 440.57 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 20K 40K 60K 80K 100K SE +/- 453.62, N = 3 SE +/- 45.46, N = 3 SE +/- 332.93, N = 3 SE +/- 0.48, N = 3 92100.80 54971.26 44049.15 136.16 -pie -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 0.27 0.40 0.61 7.33 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6.54 / MAX: 9.95 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 m7a.16xlarge c6g.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 400M 800M 1200M 1600M 2000M SE +/- 1428129.35, N = 3 SE +/- 176147.98, N = 3 SE +/- 2060519.90, N = 3 SE +/- 1088162.98, N = 3 1843444333 1032893667 962889833 920427767 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction m7a.16xlarge c6g.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 1.3211 2.6422 3.9633 5.2844 6.6055 SE +/- 0.03993251, N = 3 SE +/- 0.01888616, N = 3 SE +/- 0.02210564, N = 3 SE +/- 0.04970425, N = 3 2.89602661 5.61811686 5.63057327 5.87157885 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge c3d-standard-60 AMD Genoa 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.31, N = 12 SE +/- 0.01, N = 3 SE +/- 0.25, N = 3 11.59 24.57 25.87 28.02 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 500K 1000K 1500K 2000K 2500K SE +/- 1437.63, N = 3 SE +/- 9191.61, N = 3 SE +/- 1295.68, N = 3 SE +/- 635.29, N = 3 2158639.27 1730658.45 1445843.52 1259870.72 1. (CC) gcc options: -O2 -lrt" -lrt
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 30M 60M 90M 120M 150M SE +/- 1001447.21, N = 3 SE +/- 1618403.29, N = 14 SE +/- 1450871.09, N = 3 SE +/- 1645401.81, N = 15 135419169 112958788 105894457 81807706 -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 70K 140K 210K 280K 350K SE +/- 400.99, N = 3 SE +/- 388.74, N = 3 SE +/- 346.51, N = 3 SE +/- 359.95, N = 3 330633 278973 271795 239735 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge c3d-standard-60 AMD Genoa 60K 120K 180K 240K 300K SE +/- 342.37, N = 3 SE +/- 347.74, N = 3 SE +/- 57.33, N = 3 SE +/- 519.21, N = 3 282593 247255 234046 226211 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge 20 40 60 80 100 SE +/- 0.29, N = 5 SE +/- 0.37, N = 5 SE +/- 0.82, N = 3 27.71 33.40 102.22
Build: defconfig
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status. E: linux-6.1/tools/objtool/include/objtool/elf.h:10:10: fatal error: gelf.h: No such file or directory
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge 90 180 270 360 450 SE +/- 0.37, N = 3 SE +/- 1.20, N = 3 SE +/- 2.41, N = 3 267.97 333.35 409.10
Build: allmodconfig
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status. E: linux-6.1/tools/objtool/include/objtool/elf.h:10:10: fatal error: gelf.h: No such file or directory
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only m7a.16xlarge t2d-standard-60 AMD Milan 8 16 24 32 40 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 27.74 34.27
Blend File: BMW27 - Compute: CPU-Only
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only m7a.16xlarge t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 71.51 89.35
Blend File: Classroom - Compute: CPU-Only
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only m7a.16xlarge t2d-standard-60 AMD Milan 10 20 30 40 50 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 37.12 45.22
Blend File: Fishy Cat - Compute: CPU-Only
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only m7a.16xlarge t2d-standard-60 AMD Milan 80 160 240 320 400 SE +/- 0.49, N = 3 SE +/- 0.60, N = 3 276.23 351.58
Blend File: Barbershop - Compute: CPU-Only
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only m7a.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 91.88 112.64
Blend File: Pabellon Barcelona - Compute: CPU-Only
c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.
libavif avifenc This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 60 120 180 240 300 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.34, N = 3 65.45 78.07 78.35 270.07 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 40 80 120 160 200 SE +/- 0.22, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.22, N = 3 35.81 41.54 41.99 167.95 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 1.0051 2.0102 3.0153 4.0204 5.0255 SE +/- 0.015, N = 3 SE +/- 0.013, N = 3 SE +/- 0.007, N = 3 SE +/- 0.014, N = 3 2.649 3.205 3.250 4.467 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 2 4 6 8 10 SE +/- 0.008, N = 3 SE +/- 0.099, N = 3 SE +/- 0.031, N = 3 SE +/- 0.032, N = 3 5.678 6.889 7.639 8.879 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 50 100 150 200 250 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 153.80 170.93 176.77 224.41
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 60 120 180 240 300 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 154.44 191.71 198.39 286.20
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 50K 100K 150K 200K 250K SE +/- 451.60, N = 3 SE +/- 126.15, N = 3 SE +/- 394.72, N = 3 SE +/- 249.05, N = 3 233014.72 187350.44 162957.75 162553.85 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 m7a.16xlarge c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 50K 100K 150K 200K 250K SE +/- 260.28, N = 3 SE +/- 688.79, N = 3 SE +/- 132.24, N = 3 SE +/- 156.32, N = 3 224859.09 180537.84 158700.36 155609.04 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 13000M 26000M 39000M 52000M 65000M SE +/- 161718701.08, N = 3 SE +/- 20491615.60, N = 3 SE +/- 9562123.40, N = 3 SE +/- 192235444.29, N = 3 62253861197 50884997103 46211821313 42288513973 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 6000M 12000M 18000M 24000M 30000M SE +/- 29763937.73, N = 3 SE +/- 108274834.92, N = 3 SE +/- 4399663.13, N = 3 SE +/- 6214593.12, N = 3 26481506820 22244804183 14702270573 14384917863 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 7K 14K 21K 28K 35K SE +/- 24.57, N = 3 SE +/- 14.62, N = 3 SE +/- 11.58, N = 3 SE +/- 0.09, N = 3 31583.8 20079.5 12973.0 2640.0 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 200K 400K 600K 800K 1000K SE +/- 367.82, N = 3 SE +/- 644.45, N = 3 SE +/- 50.91, N = 3 SE +/- 6.55, N = 3 996017.5 860844.6 493077.6 215683.2 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa c6g.16xlarge 70000M 140000M 210000M 280000M 350000M SE +/- 280681661.27, N = 3 SE +/- 47698640.87, N = 3 SE +/- 12326205.97, N = 3 SE +/- 372419.81, N = 3 308308045083 180249145770 173980949893 67324778360 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 130000M 260000M 390000M 520000M 650000M SE +/- 1818538727.73, N = 3 SE +/- 342949201.09, N = 3 SE +/- 376720190.05, N = 3 SE +/- 5537993.15, N = 3 592545362740 343095284440 234604082610 158788510970 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 110000M 220000M 330000M 440000M 550000M SE +/- 1691817104.57, N = 3 SE +/- 71241287.97, N = 3 SE +/- 178290221.71, N = 3 SE +/- 2100313.05, N = 3 522113080527 293328048497 216025967640 129198197600 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 50000M 100000M 150000M 200000M 250000M SE +/- 85099636.22, N = 3 SE +/- 3664727.47, N = 3 SE +/- 198663058.81, N = 3 SE +/- 2259404.37, N = 3 216773475533 123909304773 119647720337 46715126487 -m64 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Apache IoTDB Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 9M 18M 27M 36M 45M SE +/- 457186.43, N = 3 SE +/- 303573.79, N = 3 SE +/- 154587.14, N = 3 40899210 33466804 33268158
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 90 180 270 360 450 SE +/- 4.92, N = 3 SE +/- 4.08, N = 3 SE +/- 2.14, N = 3 340.11 415.08 418.59 MAX: 37899.66 MAX: 28810.4 MAX: 31920.67
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 10M 20M 30M 40M 50M SE +/- 224779.59, N = 3 SE +/- 221111.93, N = 3 SE +/- 188621.29, N = 3 44502333 34925899 34762565
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 140 280 420 560 700 SE +/- 0.65, N = 3 SE +/- 3.18, N = 3 SE +/- 12.58, N = 3 521.98 623.89 633.58 MAX: 34235.44 MAX: 41831.2 MAX: 54749.7
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 9M 18M 27M 36M 45M SE +/- 111073.27, N = 3 SE +/- 66565.66, N = 3 SE +/- 130588.20, N = 3 42643903 34332237 34123810
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 m7a.16xlarge t2d-standard-60 AMD Milan c3d-standard-60 AMD Genoa 100 200 300 400 500 SE +/- 7.15, N = 3 SE +/- 35.57, N = 3 SE +/- 27.08, N = 3 355.13 433.96 447.68 MAX: 67149.77 MAX: 103381.73 MAX: 95136.87
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 10M 20M 30M 40M 50M SE +/- 99649.31, N = 3 SE +/- 267152.12, N = 3 SE +/- 354923.23, N = 3 44699315 35359884 35068557
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 m7a.16xlarge c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 150 300 450 600 750 SE +/- 4.86, N = 3 SE +/- 7.53, N = 3 SE +/- 25.87, N = 3 598.49 682.12 709.46 MAX: 62432.53 MAX: 98294.84 MAX: 113264.06
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400
c6g.16xlarge: The test quit with a non-zero exit status.
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge 600K 1200K 1800K 2400K 3000K SE +/- 13309.30, N = 3 SE +/- 20558.90, N = 3 SE +/- 12058.27, N = 3 2923009 2003784 1043267 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Scaling Factor: 100 - Clients: 800 - Mode: Read Only
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge 600K 1200K 1800K 2400K 3000K SE +/- 2874.04, N = 3 SE +/- 22497.42, N = 3 SE +/- 12606.16, N = 3 2880940 2008186 975031 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Scaling Factor: 100 - Clients: 1000 - Mode: Read Only
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write t2d-standard-60 AMD Milan m7a.16xlarge c6g.16xlarge 1200 2400 3600 4800 6000 SE +/- 49.28, N = 12 SE +/- 13.63, N = 3 SE +/- 109.40, N = 12 5682 5312 4784 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Scaling Factor: 100 - Clients: 800 - Mode: Read Write
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write t2d-standard-60 AMD Milan m7a.16xlarge c6g.16xlarge 1200 2400 3600 4800 6000 SE +/- 49.93, N = 8 SE +/- 16.57, N = 3 SE +/- 124.50, N = 9 5793 5300 4776 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write
c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge 0.1726 0.3452 0.5178 0.6904 0.863 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.009, N = 3 0.274 0.399 0.767 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency m7a.16xlarge t2d-standard-60 AMD Milan c6g.16xlarge 0.2309 0.4618 0.6927 0.9236 1.1545 SE +/- 0.000, N = 3 SE +/- 0.006, N = 3 SE +/- 0.013, N = 3 0.347 0.498 1.026 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency t2d-standard-60 AMD Milan m7a.16xlarge c6g.16xlarge 40 80 120 160 200 SE +/- 1.23, N = 12 SE +/- 0.39, N = 3 SE +/- 3.85, N = 12 140.92 150.60 168.19 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency t2d-standard-60 AMD Milan m7a.16xlarge c6g.16xlarge 50 100 150 200 250 SE +/- 1.44, N = 8 SE +/- 0.59, N = 3 SE +/- 6.00, N = 9 172.72 188.68 210.61 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
c3d-standard-60 AMD Genoa Processor: AMD EPYC 9B14 (30 Cores / 60 Threads), Motherboard: Google Compute Engine c3d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 22.04, Kernel: 6.2.0-1014-gcp (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0xffffffffJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2023 00:31 by user michael_larabel.
t2d-standard-60 AMD Milan Processor: AMD EPYC 7B13 (60 Cores), Motherboard: Google Compute Engine t2d-standard-60, Chipset: Intel 440FX 82441FX PMC, Memory: 240GB, Disk: 215GB PersistentDisk, Network: Red Hat Virtio device
OS: Ubuntu 22.04, Kernel: 6.2.0-1014-gcp (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0xffffffffJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2023 12:59 by user michael_larabel.
c6g.16xlarge Processor: ARMv8 Neoverse-N1 (64 Cores), Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -vJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 5 October 2023 17:34 by user ubuntu.
m7a.16xlarge Processor: AMD EPYC 9R14 (64 Cores), Motherboard: Amazon EC2 m7a.16xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0xa10113eJava Notes: OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Notes: Python 3.10.12Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 6 October 2023 16:34 by user ubuntu.