GCE c3d-standard-60

Amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310065-NE-2310055NE35&sro&grr.

c3d-standard-60 AMD Genoa
  Processor: AMD EPYC 9B14 (30 Cores / 60 Threads)
  Motherboard: Google Compute Engine c3d-standard-60
  Chipset: Intel 440FX 82441FX PMC
  Memory: 240GB
  Disk: 215GB nvme_card-pd
  Network: Google Compute Engine Virtual
  OS: Ubuntu 22.04
  Kernel: 6.2.0-1014-gcp (x86_64)
  Vulkan: 1.3.238
  Compiler: GCC 11.4.0
  File-System: ext4
  System Layer: KVM

t2d-standard-60 AMD Milan (fields not listed are unchanged from the previous system)
  Processor: AMD EPYC 7B13 (60 Cores)
  Motherboard: Google Compute Engine t2d-standard-60
  Disk: 215GB PersistentDisk
  Network: Red Hat Virtio device

c6g.16xlarge (fields not listed are unchanged from the previous system)
  Processor: ARMv8 Neoverse-N1 (64 Cores)
  Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS)
  Chipset: Amazon Device 0200
  Memory: 128GB
  Disk: 215GB Amazon Elastic Block Store
  Network: Amazon Elastic
  Kernel: 5.19.0-1025-aws (aarch64)
  System Layer: amazon

m7a.16xlarge (fields not listed are unchanged from the previous system)
  Processor: AMD EPYC 9R14 (64 Cores)
  Motherboard: Amazon EC2 m7a.16xlarge (1.0 BIOS)
  Chipset: Intel 440FX 82441FX PMC
  Memory: 256GB
  Kernel: 5.19.0-1025-aws (x86_64)

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- c3d-standard-60 AMD Genoa: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- t2d-standard-60 AMD Milan: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- c6g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
- m7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- c3d-standard-60 AMD Genoa: CPU Microcode: 0xffffffff
- t2d-standard-60 AMD Milan: CPU Microcode: 0xffffffff
- m7a.16xlarge: CPU Microcode: 0xa10113e

Java Details
- OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)

Python Details
- Python 3.10.12

Security Details
- c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- c6g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- m7a.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result overview (tests and configurations, in table order; systems compared: c3d-standard-60 AMD Genoa, t2d-standard-60 AMD Milan, c6g.16xlarge, m7a.16xlarge):

pgbench: 100 - 800 - Read Write - Average Latency
apache-iotdb: 800 - 100 - 800 - 400
apache-iotdb: 800 - 100 - 800 - 400
lammps: 20k Atoms
pgbench: 100 - 800 - Read Write
pgbench: 100 - 1000 - Read Write - Average Latency
openradioss: Chrysler Neon 1M
blender: Barbershop - CPU-Only
nekrs: Kershaw
rodinia: OpenMP HotSpot3D
nekrs: TurboPipe Periodic
build-linux-kernel: allmodconfig
apache-iotdb: 800 - 100 - 500 - 400
apache-iotdb: 800 - 100 - 500 - 400
pgbench: 100 - 1000 - Read Write
apache-iotdb: 500 - 100 - 800 - 400
apache-iotdb: 500 - 100 - 800 - 400
openradioss: Bird Strike on Windshield
cassandra: Writes
brl-cad: VGR Performance Metric
build-nodejs: Time To Compile
openradioss: Bumper Beam
stockfish: Total Time
build-gem5: Time To Compile
openssl: AES-256-GCM
openssl: ChaCha20
openssl: AES-128-GCM
openssl: ChaCha20-Poly1305
openssl: SHA512
openssl: SHA256
apache-iotdb: 500 - 100 - 500 - 400
apache-iotdb: 500 - 100 - 500 - 400
openradioss: Rubber O-Ring Seal Installation
openradioss: Cell Phone Drop Test
pgbench: 100 - 800 - Read Only - Average Latency
pgbench: 100 - 1000 - Read Only - Average Latency
openvino: Vehicle Detection FP16-INT8 - CPU
openvino: Vehicle Detection FP16-INT8 - CPU
tensorflow: CPU - 64 - ResNet-50
openvino: Person Detection FP16 - CPU
openvino: Person Detection FP16 - CPU
openvino: Person Detection FP32 - CPU
openvino: Person Detection FP32 - CPU
avifenc: 0
pgbench: 100 - 800 - Read Only
pgbench: 100 - 1000 - Read Only
blender: Pabellon Barcelona - CPU-Only
laghos: Sedov Blast Wave, ube_922_hex.mesh
nginx: 1000
nginx: 500
blender: Classroom - CPU-Only
openvino: Face Detection FP16-INT8 - CPU
openvino: Face Detection FP16-INT8 - CPU
avifenc: 2
tensorflow: CPU - 32 - ResNet-50
openvino: Face Detection FP16 - CPU
openvino: Face Detection FP16 - CPU
npb: IS.D
npb: LU.C
openvino: Road Segmentation ADAS FP16-INT8 - CPU
openvino: Road Segmentation ADAS FP16-INT8 - CPU
openvino: Face Detection Retail FP16-INT8 - CPU
openvino: Face Detection Retail FP16-INT8 - CPU
openvino: Machine Translation EN To DE FP16 - CPU
openvino: Machine Translation EN To DE FP16 - CPU
openvino: Person Vehicle Bike Detection FP16 - CPU
openvino: Person Vehicle Bike Detection FP16 - CPU
openvino: Road Segmentation ADAS FP16 - CPU
openvino: Road Segmentation ADAS FP16 - CPU
openvino: Handwritten English Recognition FP16-INT8 - CPU
openvino: Handwritten English Recognition FP16-INT8 - CPU
openvino: Handwritten English Recognition FP16 - CPU
openvino: Handwritten English Recognition FP16 - CPU
openvino: Vehicle Detection FP16 - CPU
openvino: Vehicle Detection FP16 - CPU
openvino: Weld Porosity Detection FP16-INT8 - CPU
openvino: Weld Porosity Detection FP16-INT8 - CPU
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU
openvino: Weld Porosity Detection FP16 - CPU
openvino: Weld Porosity Detection FP16 - CPU
openvino: Face Detection Retail FP16 - CPU
openvino: Face Detection Retail FP16 - CPU
openvino: Age Gender Recognition Retail 0013 FP16 - CPU
openvino: Age Gender Recognition Retail 0013 FP16 - CPU
openssl: RSA4096
openssl: RSA4096
npb: SP.C
rodinia: OpenMP LavaMD
laghos: Triple Point Problem
build-linux-kernel: defconfig
gromacs: MPI CPU - water_GMX50_bare
npb: BT.C
incompact3d: input.i3d 193 Cells Per Direction
tensorflow: CPU - 16 - ResNet-50
blender: Fishy Cat - CPU-Only
npb: EP.D
blender: BMW27 - CPU-Only
rodinia: OpenMP Leukocyte
amg:
compress-7zip: Decompression Rating
compress-7zip: Compression Rating
npb: FT.C
remhos: Sample Remap Example
coremark: CoreMark Size 666 - Iterations Per Second
libxsmm: 32
npb: CG.C
libxsmm: 64
rodinia: OpenMP Streamcluster
rodinia: OpenMP CFD Solver
avifenc: 6, Lossless
incompact3d: input.i3d 129 Cells Per Direction
npb: MG.C
avifenc: 6
heffte: c2c - FFTW - double - 128
lammps: Rhodopsin Protein
heffte: r2c - FFTW - double - 128
heffte: r2c - FFTW - float - 128
heffte: c2c - FFTW - float - 128
openradioss: INIVOL and Fluid Structure Interaction Drop Container

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
c6g.16xlarge: 168.19 (SE +/- 3.85, N = 12)
m7a.16xlarge: 150.60 (SE +/- 0.39, N = 3)
t2d-standard-60 AMD Milan: 140.92 (SE +/- 1.23, N = 12)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

Apache IoTDB 1.2 (Average Latency, Fewer Is Better)
c3d-standard-60 AMD Genoa: 682.12 (SE +/- 7.53, N = 3; MAX: 98294.84)
m7a.16xlarge: 598.49 (SE +/- 4.86, N = 3; MAX: 62432.53)
t2d-standard-60 AMD Milan: 709.46 (SE +/- 25.87, N = 3; MAX: 113264.06)

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

Apache IoTDB 1.2 (point/sec, More Is Better)
c3d-standard-60 AMD Genoa: 35359884 (SE +/- 267152.12, N = 3)
m7a.16xlarge: 44699315 (SE +/- 99649.31, N = 3)
t2d-standard-60 AMD Milan: 35068557 (SE +/- 354923.23, N = 3)

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better)
c3d-standard-60 AMD Genoa: 19.78 (SE +/- 0.04, N = 3)
c6g.16xlarge: 25.06 (SE +/- 0.08, N = 3)
m7a.16xlarge: 31.47 (SE +/- 0.03, N = 3)
t2d-standard-60 AMD Milan: 26.73 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -ldl (three of the four systems additionally list -lm)
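When comparing systems on a More Is Better metric such as this one, it can help to express each score as a multiple of a chosen baseline. The following is a minimal Python sketch using the ns/day values reported above. The helper name `relative_to` and the choice of c3d-standard-60 as the baseline are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# ns/day values copied from the LAMMPS 20k Atoms result (More Is Better).
ns_per_day = {
    "c3d-standard-60 AMD Genoa": 19.78,
    "c6g.16xlarge": 25.06,
    "m7a.16xlarge": 31.47,
    "t2d-standard-60 AMD Milan": 26.73,
}


def relative_to(baseline: str, scores: dict[str, float]) -> dict[str, float]:
    """Return each score as a multiple of the baseline system's score.

    Hypothetical helper for illustration; not a Phoronix Test Suite API.
    """
    base = scores[baseline]
    return {name: value / base for name, value in scores.items()}


ratios = relative_to("c3d-standard-60 AMD Genoa", ns_per_day)

# Print systems from fastest to slowest relative to the baseline.
for name, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {ratio:.2f}x")
```

For Fewer Is Better metrics (seconds, latency) the ratio would need to be inverted before reading it as a speedup.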

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

PostgreSQL 16 (TPS, More Is Better)
c6g.16xlarge: 4784 (SE +/- 109.40, N = 12)
m7a.16xlarge: 5312 (SE +/- 13.63, N = 3)
t2d-standard-60 AMD Milan: 5682 (SE +/- 49.28, N = 12)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
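As a rough sanity check, the 800-client Read Write latency and TPS figures are consistent with each other under the usual closed-loop approximation (throughput is roughly the number of clients divided by the average latency). The Python sketch below assumes that every client always has exactly one transaction in flight. The function name `estimated_tps` is hypothetical, and the numbers are copied from the two 800-client Read Write results.

```python
# Cross-check: with N closed-loop clients, throughput ~= N / average latency.
# (average latency in ms, reported TPS) per system, 800 clients, Read Write.
CLIENTS = 800

results = {
    "c6g.16xlarge": (168.19, 4784),
    "m7a.16xlarge": (150.60, 5312),
    "t2d-standard-60 AMD Milan": (140.92, 5682),
}


def estimated_tps(clients: int, avg_latency_ms: float) -> float:
    """Estimate transactions per second from client count and mean latency.

    Assumes each client keeps exactly one transaction in flight at all times.
    """
    return clients / (avg_latency_ms / 1000.0)


for system, (latency_ms, reported_tps) in results.items():
    est = estimated_tps(CLIENTS, latency_ms)
    diff_pct = 100.0 * (est - reported_tps) / reported_tps
    print(f"{system}: estimated {est:.0f} TPS, reported {reported_tps} ({diff_pct:+.2f}%)")
```

The estimates land within about one percent of the reported TPS for all three systems, which suggests the two charts describe the same runs from different angles.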

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
c6g.16xlarge: 210.61 (SE +/- 6.00, N = 9)
m7a.16xlarge: 188.68 (SE +/- 0.59, N = 3)
t2d-standard-60 AMD Milan: 172.72 (SE +/- 1.44, N = 8)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 337.70 (SE +/- 2.03, N = 3)
m7a.16xlarge: 190.79 (SE +/- 0.61, N = 3)
t2d-standard-60 AMD Milan: 327.88 (SE +/- 1.48, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 3.6 (Seconds, Fewer Is Better)
m7a.16xlarge: 276.23 (SE +/- 0.49, N = 3)
t2d-standard-60 AMD Milan: 351.58 (SE +/- 0.60, N = 3)

nekRS

Input: Kershaw

nekRS 23.0 (flops/rank, More Is Better)
c3d-standard-60 AMD Genoa: 4289858333 (SE +/- 57202190.49, N = 12)
c6g.16xlarge: 1758860000 (SE +/- 2970005.61, N = 3)
m7a.16xlarge: 7667846667 (SE +/- 49077561.86, N = 3)
t2d-standard-60 AMD Milan: 3681935833 (SE +/- 84802173.02, N = 12)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Rodinia

Test: OpenMP HotSpot3D

Rodinia 3.1 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 84.17 (SE +/- 0.86, N = 15)
m7a.16xlarge: 74.78 (SE +/- 0.74, N = 15)
t2d-standard-60 AMD Milan: 88.54 (SE +/- 1.83, N = 12)
1. (CXX) g++ options: -O2 -lOpenCL

nekRS

Input: TurboPipe Periodic

nekRS 23.0 (flops/rank, More Is Better)
c3d-standard-60 AMD Genoa: 4723940000 (SE +/- 201939132.13, N = 12)
c6g.16xlarge: 2221710000 (SE +/- 1790009.31, N = 3)
m7a.16xlarge: 4774796667 (SE +/- 6657808.28, N = 3)
t2d-standard-60 AMD Milan: 2730620000 (SE +/- 481352.26, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
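The SE figures are easier to judge when expressed as a fraction of the mean, which makes run-to-run noise comparable across systems. The Python sketch below does this for the TurboPipe Periodic numbers. The 2% threshold and the helper name `relative_se_pct` are illustrative assumptions, not values or functions taken from the Phoronix Test Suite.

```python
# (mean flops/rank, standard error) copied from the TurboPipe Periodic result.
results = {
    "c3d-standard-60 AMD Genoa": (4723940000, 201939132.13),
    "c6g.16xlarge": (2221710000, 1790009.31),
    "m7a.16xlarge": (4774796667, 6657808.28),
    "t2d-standard-60 AMD Milan": (2730620000, 481352.26),
}

NOISE_THRESHOLD_PCT = 2.0  # hypothetical cut-off, chosen only for illustration


def relative_se_pct(mean: float, se: float) -> float:
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


noisy = []
for system, (mean, se) in results.items():
    pct = relative_se_pct(mean, se)
    if pct > NOISE_THRESHOLD_PCT:
        noisy.append(system)
    print(f"{system}: SE is {pct:.2f}% of the mean")

print("Above threshold:", noisy)
```

With these inputs only the c3d-standard-60 result exceeds the threshold, and it is also the only system here reported with N = 12 rather than N = 3.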

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.1 (Seconds, Fewer Is Better)
c6g.16xlarge: 409.10 (SE +/- 2.41, N = 3)
m7a.16xlarge: 267.97 (SE +/- 0.37, N = 3)
t2d-standard-60 AMD Milan: 333.35 (SE +/- 1.20, N = 3)

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

Apache IoTDB 1.2 (Average Latency, Fewer Is Better)
c3d-standard-60 AMD Genoa: 447.68 (SE +/- 27.08, N = 3; MAX: 95136.87)
m7a.16xlarge: 355.13 (SE +/- 7.15, N = 3; MAX: 67149.77)
t2d-standard-60 AMD Milan: 433.96 (SE +/- 35.57, N = 3; MAX: 103381.73)

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

Apache IoTDB 1.2 (point/sec, More Is Better)
c3d-standard-60 AMD Genoa: 34332237 (SE +/- 66565.66, N = 3)
m7a.16xlarge: 42643903 (SE +/- 111073.27, N = 3)
t2d-standard-60 AMD Milan: 34123810 (SE +/- 130588.20, N = 3)

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16 (TPS, More Is Better)
c6g.16xlarge: 4776 (SE +/- 124.50, N = 9)
m7a.16xlarge: 5300 (SE +/- 16.57, N = 3)
t2d-standard-60 AMD Milan: 5793 (SE +/- 49.93, N = 8)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

Apache IoTDB 1.2 (point/sec, More Is Better)
c3d-standard-60 AMD Genoa: 34762565 (SE +/- 188621.29, N = 3)
m7a.16xlarge: 44502333 (SE +/- 224779.59, N = 3)
t2d-standard-60 AMD Milan: 34925899 (SE +/- 221111.93, N = 3)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

Apache IoTDB 1.2 (Average Latency, Fewer Is Better)
c3d-standard-60 AMD Genoa: 623.89 (SE +/- 3.18, N = 3; MAX: 41831.2)
m7a.16xlarge: 521.98 (SE +/- 0.65, N = 3; MAX: 34235.44)
t2d-standard-60 AMD Milan: 633.58 (SE +/- 12.58, N = 3; MAX: 54749.7)

OpenRadioss

Model: Bird Strike on Windshield

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 147.31 (SE +/- 3.26, N = 9)
m7a.16xlarge: 115.96 (SE +/- 0.18, N = 3)
t2d-standard-60 AMD Milan: 123.61 (SE +/- 0.13, N = 3)

Apache Cassandra

Test: Writes

Apache Cassandra 4.1.3 (Op/s, More Is Better)
c3d-standard-60 AMD Genoa: 228640 (SE +/- 1129.40, N = 3)
c6g.16xlarge: 217355 (SE +/- 3249.94, N = 12)
m7a.16xlarge: 278585 (SE +/- 352.65, N = 3)
t2d-standard-60 AMD Milan: 187169 (SE +/- 681.76, N = 3)

BRL-CAD

VGR Performance Metric

BRL-CAD 7.36 (VGR Performance Metric, More Is Better)
c3d-standard-60 AMD Genoa: 510819
m7a.16xlarge: 788704
t2d-standard-60 AMD Milan: 629363
1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 19.8.1 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 198.39 (SE +/- 0.29, N = 3)
c6g.16xlarge: 286.20 (SE +/- 0.11, N = 3)
m7a.16xlarge: 154.44 (SE +/- 0.11, N = 3)
t2d-standard-60 AMD Milan: 191.71 (SE +/- 0.05, N = 3)

OpenRadioss

Model: Bumper Beam

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 92.87 (SE +/- 1.56, N = 15)
m7a.16xlarge: 66.27 (SE +/- 0.05, N = 3)
t2d-standard-60 AMD Milan: 75.68 (SE +/- 0.10, N = 3)

Stockfish

Total Time

Stockfish 15 (Nodes Per Second, More Is Better)
c3d-standard-60 AMD Genoa: 105894457 (SE +/- 1450871.09, N = 3)
c6g.16xlarge: 81807706 (SE +/- 1645401.81, N = 15)
m7a.16xlarge: 135419169 (SE +/- 1001447.21, N = 3)
t2d-standard-60 AMD Milan: 112958788 (SE +/- 1618403.29, N = 14)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
Additional per-system options, as listed for three of the four systems:
-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2
-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2
-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 21.2 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 176.77 (SE +/- 0.07, N = 3)
c6g.16xlarge: 224.41 (SE +/- 0.03, N = 3)
m7a.16xlarge: 153.80 (SE +/- 0.39, N = 3)
t2d-standard-60 AMD Milan: 170.93 (SE +/- 0.27, N = 3)

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.1 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 293328048497 (SE +/- 71241287.97, N = 3)
c6g.16xlarge: 129198197600 (SE +/- 2100313.05, N = 3)
m7a.16xlarge: 522113080527 (SE +/- 1691817104.57, N = 3)
t2d-standard-60 AMD Milan: 216025967640 (SE +/- 178290221.71, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl (three of the four systems additionally list -m64)

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.1 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 173980949893 (SE +/- 12326205.97, N = 3)
c6g.16xlarge: 67324778360 (SE +/- 372419.81, N = 3)
m7a.16xlarge: 308308045083 (SE +/- 280681661.27, N = 3)
t2d-standard-60 AMD Milan: 180249145770 (SE +/- 47698640.87, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl (three of the four systems additionally list -m64)

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.1 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 343095284440 (SE +/- 342949201.09, N = 3)
c6g.16xlarge: 158788510970 (SE +/- 5537993.15, N = 3)
m7a.16xlarge: 592545362740 (SE +/- 1818538727.73, N = 3)
t2d-standard-60 AMD Milan: 234604082610 (SE +/- 376720190.05, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl (three of the four systems additionally list -m64)

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.1 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 123909304773 (SE +/- 3664727.47, N = 3)
c6g.16xlarge: 46715126487 (SE +/- 2259404.37, N = 3)
m7a.16xlarge: 216773475533 (SE +/- 85099636.22, N = 3)
t2d-standard-60 AMD Milan: 119647720337 (SE +/- 198663058.81, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl (three of the four systems additionally list -m64)

OpenSSL

Algorithm: SHA512

OpenSSL 3.1 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 14702270573 (SE +/- 4399663.13, N = 3)
c6g.16xlarge: 14384917863 (SE +/- 6214593.12, N = 3)
m7a.16xlarge: 26481506820 (SE +/- 29763937.73, N = 3)
t2d-standard-60 AMD Milan: 22244804183 (SE +/- 108274834.92, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl (three of the four systems additionally list -m64)

OpenSSL

Algorithm: SHA256

OpenSSL 3.1 (byte/s, More Is Better)
c3d-standard-60 AMD Genoa: 46211821313 (SE +/- 9562123.40, N = 3)
c6g.16xlarge: 42288513973 (SE +/- 192235444.29, N = 3)
m7a.16xlarge: 62253861197 (SE +/- 161718701.08, N = 3)
t2d-standard-60 AMD Milan: 50884997103 (SE +/- 20491615.60, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl (three of the four systems additionally list -m64)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

Apache IoTDB 1.2 (Average Latency, Fewer Is Better)
c3d-standard-60 AMD Genoa: 418.59 (SE +/- 2.14, N = 3; MAX: 31920.67)
m7a.16xlarge: 340.11 (SE +/- 4.92, N = 3; MAX: 37899.66)
t2d-standard-60 AMD Milan: 415.08 (SE +/- 4.08, N = 3; MAX: 28810.4)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

Apache IoTDB 1.2 (point/sec, More Is Better)
c3d-standard-60 AMD Genoa: 33268158 (SE +/- 154587.14, N = 3)
m7a.16xlarge: 40899210 (SE +/- 457186.43, N = 3)
t2d-standard-60 AMD Milan: 33466804 (SE +/- 303573.79, N = 3)

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 89.65 (SE +/- 4.01, N = 12)
m7a.16xlarge: 59.18 (SE +/- 0.73, N = 3)
t2d-standard-60 AMD Milan: 72.06 (SE +/- 0.22, N = 3)

OpenRadioss

Model: Cell Phone Drop Test

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 38.82 (SE +/- 1.27, N = 15)
m7a.16xlarge: 26.14 (SE +/- 0.08, N = 3)
t2d-standard-60 AMD Milan: 30.12 (SE +/- 0.39, N = 15)

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
c6g.16xlarge: 0.767 (SE +/- 0.009, N = 3)
m7a.16xlarge: 0.274 (SE +/- 0.001, N = 3)
t2d-standard-60 AMD Milan: 0.399 (SE +/- 0.004, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
c6g.16xlarge: 1.026 (SE +/- 0.013, N = 3)
m7a.16xlarge: 0.347 (SE +/- 0.000, N = 3)
t2d-standard-60 AMD Milan: 0.498 (SE +/- 0.006, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.1 (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 5.86 (SE +/- 0.01, N = 3)
c6g.16xlarge: 6990.10 (SE +/- 27.09, N = 15; MIN: 6842.82 / MAX: 7088.53; additional options: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared)
m7a.16xlarge: 4.35 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 9.90 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.1 (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 2043.05 (SE +/- 3.16, N = 3)
c6g.16xlarge: 0.14 (SE +/- 0.00, N = 15)
m7a.16xlarge: 3666.89 (SE +/- 1.69, N = 3)
t2d-standard-60 AMD Milan: 1512.57 (SE +/- 1.12, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv (three of the four systems additionally list -pie)

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.12 (images/sec, More Is Better)
c3d-standard-60 AMD Genoa: 69.68 (SE +/- 0.07, N = 3)
m7a.16xlarge: 100.15 (SE +/- 0.05, N = 3)
t2d-standard-60 AMD Milan: 20.90 (SE +/- 0.03, N = 3)

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.1 (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 83.99 (SE +/- 0.13, N = 3)
c6g.16xlarge: 947.59 (SE +/- 0.37, N = 3; MIN: 943.57 / MAX: 951.68; additional options: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared)
m7a.16xlarge: 56.21 (SE +/- 0.10, N = 3)
t2d-standard-60 AMD Milan: 208.47 (SE +/- 9.17, N = 15; MIN: 119.65 / MAX: 316.01; additional options: -pie)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.1 (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 142.75 (SE +/- 0.23, N = 3)
c6g.16xlarge: 1.06 (SE +/- 0.00, N = 3)
m7a.16xlarge: 284.42 (SE +/- 0.53, N = 3)
t2d-standard-60 AMD Milan: 73.74 (SE +/- 3.12, N = 15)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv (three of the four systems additionally list -pie)

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2023.1 (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 83.91 (SE +/- 0.29, N = 3)
c6g.16xlarge: 947.86 (SE +/- 0.55, N = 3; MIN: 944.57 / MAX: 958.26; additional options: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared)
m7a.16xlarge: 56.41 (SE +/- 0.06, N = 3)
t2d-standard-60 AMD Milan: 193.45 (SE +/- 7.79, N = 15; MIN: 113.44 / MAX: 315.54; additional options: -pie)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2023.1 (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 142.90 (SE +/- 0.48, N = 3)
c6g.16xlarge: 1.06 (SE +/- 0.00, N = 3)
m7a.16xlarge: 283.40 (SE +/- 0.29, N = 3)
t2d-standard-60 AMD Milan: 78.96 (SE +/- 2.74, N = 15)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv (three of the four systems additionally list -pie)

libavif avifenc

Encoder Speed: 0

libavif avifenc 1.0 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 78.07 (SE +/- 0.07, N = 3)
c6g.16xlarge: 270.07 (SE +/- 0.34, N = 3)
m7a.16xlarge: 65.45 (SE +/- 0.10, N = 3)
t2d-standard-60 AMD Milan: 78.35 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

PostgreSQL 16 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only (TPS, More Is Better)
c6g.16xlarge: 1043267 (SE +/- 12058.27, N = 3)
m7a.16xlarge: 2923009 (SE +/- 13309.30, N = 3)
t2d-standard-60 AMD Milan: 2003784 (SE +/- 20558.90, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
c6g.16xlarge: 975031 (SE +/- 12606.16, N = 3)
m7a.16xlarge: 2880940 (SE +/- 2874.04, N = 3)
t2d-standard-60 AMD Milan: 2008186 (SE +/- 22497.42, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 3.6 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
m7a.16xlarge: 91.88 (SE +/- 0.08, N = 3)
t2d-standard-60 AMD Milan: 112.64 (SE +/- 0.03, N = 3)

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

Laghos 3.1 - Test: Sedov Blast Wave, ube_922_hex.mesh (Major Kernels Total Rate, More Is Better)
c3d-standard-60 AMD Genoa: 259.55 (SE +/- 0.31, N = 3)
c6g.16xlarge: 321.29 (SE +/- 0.79, N = 3)
m7a.16xlarge: 409.73 (SE +/- 1.29, N = 3)
t2d-standard-60 AMD Milan: 364.64 (SE +/- 0.62, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

nginx

Connections: 1000

nginx 1.23.2 - Connections: 1000 (Requests Per Second, More Is Better)
c3d-standard-60 AMD Genoa: 180537.84 (SE +/- 688.79, N = 3)
c6g.16xlarge: 158700.36 (SE +/- 132.24, N = 3)
m7a.16xlarge: 224859.09 (SE +/- 260.28, N = 3)
t2d-standard-60 AMD Milan: 155609.04 (SE +/- 156.32, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 500

nginx 1.23.2 - Connections: 500 (Requests Per Second, More Is Better)
c3d-standard-60 AMD Genoa: 187350.44 (SE +/- 126.15, N = 3)
c6g.16xlarge: 162553.85 (SE +/- 249.05, N = 3)
m7a.16xlarge: 233014.72 (SE +/- 451.60, N = 3)
t2d-standard-60 AMD Milan: 162957.75 (SE +/- 394.72, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 3.6 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
m7a.16xlarge: 71.51 (SE +/- 0.01, N = 3)
t2d-standard-60 AMD Milan: 89.35 (SE +/- 0.07, N = 3)

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 340.29 (SE +/- 0.21, N = 3)
c6g.16xlarge: 22391.86 (SE +/- 15.60, N = 3)
m7a.16xlarge: 261.11 (SE +/- 0.03, N = 3)
t2d-standard-60 AMD Milan: 568.77 (SE +/- 0.35, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 22364.17 / MAX: 22423.42
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 35.19 (SE +/- 0.02, N = 3)
c6g.16xlarge: 0.04 (SE +/- 0.00, N = 3)
m7a.16xlarge: 61.19 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 26.28 (SE +/- 0.02, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

libavif avifenc

Encoder Speed: 2

libavif avifenc 1.0 - Encoder Speed: 2 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 41.54 (SE +/- 0.11, N = 3)
c6g.16xlarge: 167.95 (SE +/- 0.22, N = 3)
m7a.16xlarge: 35.81 (SE +/- 0.22, N = 3)
t2d-standard-60 AMD Milan: 41.99 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

TensorFlow 2.12 - Device: CPU - Batch Size: 32 - Model: ResNet-50 (images/sec, More Is Better)
c3d-standard-60 AMD Genoa: 62.74 (SE +/- 0.10, N = 3)
m7a.16xlarge: 87.19 (SE +/- 0.09, N = 3)
t2d-standard-60 AMD Milan: 20.36 (SE +/- 0.06, N = 3)

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 648.81 (SE +/- 0.17, N = 3)
c6g.16xlarge: 9996.56 (SE +/- 1.02, N = 3)
m7a.16xlarge: 503.38 (SE +/- 0.12, N = 3)
t2d-standard-60 AMD Milan: 1393.56 (SE +/- 1.32, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 9993.58 / MAX: 10001.76
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 18.39 (SE +/- 0.01, N = 3)
c6g.16xlarge: 0.10 (SE +/- 0.00, N = 3)
m7a.16xlarge: 31.72 (SE +/- 0.01, N = 3)
t2d-standard-60 AMD Milan: 10.73 (SE +/- 0.01, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 - Test / Class: IS.D (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 2422.40 (SE +/- 36.45, N = 15)
c6g.16xlarge: 915.80 (SE +/- 0.58, N = 3)
m7a.16xlarge: 4085.20 (SE +/- 3.14, N = 3)
t2d-standard-60 AMD Milan: 1752.62 (SE +/- 142.62, N = 12)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 73563.13 (SE +/- 293.57, N = 3)
c6g.16xlarge: 18807.75 (SE +/- 7.52, N = 3)
m7a.16xlarge: 210544.87 (SE +/- 661.61, N = 3)
t2d-standard-60 AMD Milan: 94247.77 (SE +/- 1463.52, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 18.56 (SE +/- 0.02, N = 3)
c6g.16xlarge: 6773.31 (SE +/- 76.95, N = 3)
m7a.16xlarge: 13.07 (SE +/- 0.01, N = 3)
t2d-standard-60 AMD Milan: 26.50 (SE +/- 0.02, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6616.83 / MAX: 6859.99
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 645.74 (SE +/- 0.79, N = 3)
c6g.16xlarge: 0.15 (SE +/- 0.00, N = 3)
m7a.16xlarge: 1222.99 (SE +/- 0.51, N = 3)
t2d-standard-60 AMD Milan: 565.34 (SE +/- 0.37, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 4.53 (SE +/- 0.01, N = 3)
c6g.16xlarge: 2186.81 (SE +/- 27.29, N = 3)
m7a.16xlarge: 3.07 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 3.52 (SE +/- 0.00, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 2135.06 / MAX: 2233.34
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection Retail FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 6605.12 (SE +/- 6.52, N = 3)
c6g.16xlarge: 0.46 (SE +/- 0.01, N = 3)
m7a.16xlarge: 10382.22 (SE +/- 3.60, N = 3)
t2d-standard-60 AMD Milan: 4239.52 (SE +/- 2.08, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.1 - Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 64.70 (SE +/- 0.09, N = 3)
c6g.16xlarge: 735.58 (SE +/- 0.26, N = 3)
m7a.16xlarge: 50.72 (SE +/- 0.02, N = 3)
t2d-standard-60 AMD Milan: 155.14 (SE +/- 0.58, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 734.34 / MAX: 738.23; -pie - MIN: 114.75 / MAX: 224.64
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.1 - Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 185.29 (SE +/- 0.25, N = 3)
c6g.16xlarge: 1.36 (SE +/- 0.00, N = 3)
m7a.16xlarge: 315.14 (SE +/- 0.11, N = 3)
t2d-standard-60 AMD Milan: 96.58 (SE +/- 0.37, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 6.79 (SE +/- 0.02, N = 3)
c6g.16xlarge: 135.87 (SE +/- 0.21, N = 3)
m7a.16xlarge: 5.07 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 23.64 (SE +/- 0.14, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 132.35 / MAX: 148.64; -pie - MIN: 9.57 / MAX: 42.22
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 1764.21 (SE +/- 4.38, N = 3)
c6g.16xlarge: 7.36 (SE +/- 0.01, N = 3)
m7a.16xlarge: 3146.04 (SE +/- 1.10, N = 3)
t2d-standard-60 AMD Milan: 633.76 (SE +/- 3.66, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2023.1 - Model: Road Segmentation ADAS FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 20.77 (SE +/- 0.04, N = 3)
c6g.16xlarge: 382.47 (SE +/- 0.20, N = 3)
m7a.16xlarge: 15.22 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 66.46 (SE +/- 0.27, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 381.67 / MAX: 383.43; -pie - MIN: 25.85 / MAX: 122.2
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2023.1 - Model: Road Segmentation ADAS FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 576.94 (SE +/- 1.04, N = 3)
c6g.16xlarge: 2.61 (SE +/- 0.00, N = 3)
m7a.16xlarge: 1049.66 (SE +/- 0.11, N = 3)
t2d-standard-60 AMD Milan: 225.48 (SE +/- 0.93, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 39.38 (SE +/- 0.09, N = 3)
c6g.16xlarge: 423.95 (SE +/- 0.26, N = 3)
m7a.16xlarge: 27.60 (SE +/- 0.01, N = 3)
t2d-standard-60 AMD Milan: 76.72 (SE +/- 0.27, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 411.17 / MAX: 440.57; -pie - MIN: 58.8 / MAX: 121.69
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 761.14 (SE +/- 1.75, N = 3)
c6g.16xlarge: 2.36 (SE +/- 0.00, N = 3)
m7a.16xlarge: 1158.43 (SE +/- 0.18, N = 3)
t2d-standard-60 AMD Milan: 390.72 (SE +/- 1.37, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 31.08 (SE +/- 0.02, N = 3)
c6g.16xlarge: 394.94 (SE +/- 2.18, N = 3)
m7a.16xlarge: 22.53 (SE +/- 0.01, N = 3)
t2d-standard-60 AMD Milan: 81.00 (SE +/- 0.21, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 380.82 / MAX: 408.15; -pie - MIN: 64.65 / MAX: 134.64
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO 2023.1 - Model: Handwritten English Recognition FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 964.46 (SE +/- 0.60, N = 3)
c6g.16xlarge: 2.53 (SE +/- 0.02, N = 3)
m7a.16xlarge: 1419.11 (SE +/- 0.70, N = 3)
t2d-standard-60 AMD Milan: 370.03 (SE +/- 0.96, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Vehicle Detection FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 8.62 (SE +/- 0.04, N = 3)
c6g.16xlarge: 153.12 (SE +/- 0.09, N = 3)
m7a.16xlarge: 6.60 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 40.67 (SE +/- 0.17, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 152.69 / MAX: 153.75; -pie - MIN: 12.07 / MAX: 59.44
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Vehicle Detection FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 1389.69 (SE +/- 6.33, N = 3)
c6g.16xlarge: 6.53 (SE +/- 0.01, N = 3)
m7a.16xlarge: 2417.34 (SE +/- 1.65, N = 3)
t2d-standard-60 AMD Milan: 368.39 (SE +/- 1.55, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 8.20 (SE +/- 0.01, N = 3)
c6g.16xlarge: 181.94 (SE +/- 0.24, N = 3)
m7a.16xlarge: 5.16 (SE +/- 0.01, N = 3)
t2d-standard-60 AMD Milan: 11.32 (SE +/- 0.01, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 180.71 / MAX: 184.11
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 3650.57 (SE +/- 3.28, N = 3)
c6g.16xlarge: 5.50 (SE +/- 0.01, N = 3)
m7a.16xlarge: 6177.94 (SE +/- 4.43, N = 3)
t2d-standard-60 AMD Milan: 2646.58 (SE +/- 3.26, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 0.40 (SE +/- 0.00, N = 3)
c6g.16xlarge: 7.33 (SE +/- 0.03, N = 3)
m7a.16xlarge: 0.27 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 0.61 (SE +/- 0.00, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6.54 / MAX: 9.95
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 54971.26 (SE +/- 45.46, N = 3)
c6g.16xlarge: 136.16 (SE +/- 0.48, N = 3)
m7a.16xlarge: 92100.80 (SE +/- 453.62, N = 3)
t2d-standard-60 AMD Milan: 44049.15 (SE +/- 332.93, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 15.98 (SE +/- 0.00, N = 3)
c6g.16xlarge: 119.17 (SE +/- 0.07, N = 3)
m7a.16xlarge: 10.19 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 14.76 (SE +/- 0.01, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 118.73 / MAX: 120.21
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2023.1 - Model: Weld Porosity Detection FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 1875.28 (SE +/- 0.47, N = 3)
c6g.16xlarge: 8.39 (SE +/- 0.01, N = 3)
m7a.16xlarge: 3132.48 (SE +/- 0.31, N = 3)
t2d-standard-60 AMD Milan: 1014.70 (SE +/- 0.56, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection Retail FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 2.87 (SE +/- 0.01, N = 3)
c6g.16xlarge: 48.08 (SE +/- 0.02, N = 3)
m7a.16xlarge: 2.21 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 11.65 (SE +/- 0.14, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 47.55 / MAX: 50.49; -pie - MIN: 3.76 / MAX: 29.1
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO 2023.1 - Model: Face Detection Retail FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 4166.39 (SE +/- 8.74, N = 3)
c6g.16xlarge: 20.79 (SE +/- 0.01, N = 3)
m7a.16xlarge: 7222.10 (SE +/- 4.07, N = 3)
t2d-standard-60 AMD Milan: 1285.47 (SE +/- 15.96, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
c3d-standard-60 AMD Genoa: 0.52 (SE +/- 0.00, N = 3)
c6g.16xlarge: 5.58 (SE +/- 0.01, N = 3)
m7a.16xlarge: 0.38 (SE +/- 0.00, N = 3)
t2d-standard-60 AMD Milan: 0.99 (SE +/- 0.00, N = 3)
Per-system notes: -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 5.45 / MAX: 6.48; -pie - MIN: 0.8 / MAX: 13.76
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2023.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better)
c3d-standard-60 AMD Genoa: 43607.04 (SE +/- 18.42, N = 3)
c6g.16xlarge: 178.82 (SE +/- 0.39, N = 3)
m7a.16xlarge: 81996.81 (SE +/- 31.79, N = 3)
t2d-standard-60 AMD Milan: 29668.12 (SE +/- 14.46, N = 3)
Per-system notes: -pie; -pie; -pie
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 - Algorithm: RSA4096 (verify/s, More Is Better)
c3d-standard-60 AMD Genoa: 493077.6 (SE +/- 50.91, N = 3)
c6g.16xlarge: 215683.2 (SE +/- 6.55, N = 3)
m7a.16xlarge: 996017.5 (SE +/- 367.82, N = 3)
t2d-standard-60 AMD Milan: 860844.6 (SE +/- 644.45, N = 3)
Per-system notes: -m64; -m64; -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 - Algorithm: RSA4096 (sign/s, More Is Better)
c3d-standard-60 AMD Genoa: 20079.5 (SE +/- 14.62, N = 3)
c6g.16xlarge: 2640.0 (SE +/- 0.09, N = 3)
m7a.16xlarge: 31583.8 (SE +/- 24.57, N = 3)
t2d-standard-60 AMD Milan: 12973.0 (SE +/- 11.58, N = 3)
Per-system notes: -m64; -m64; -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 - Test / Class: SP.C (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 39919.71 (SE +/- 45.01, N = 3)
c6g.16xlarge: 9716.99 (SE +/- 0.76, N = 3)
m7a.16xlarge: 102392.40 (SE +/- 91.17, N = 3)
t2d-standard-60 AMD Milan: 43228.11 (SE +/- 555.92, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1 - Test: OpenMP LavaMD (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 64.86 (SE +/- 0.18, N = 3)
c6g.16xlarge: 62.30 (SE +/- 0.03, N = 3)
m7a.16xlarge: 43.29 (SE +/- 0.24, N = 3)
t2d-standard-60 AMD Milan: 50.97 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Laghos

Test: Triple Point Problem

Laghos 3.1 - Test: Triple Point Problem (Major Kernels Total Rate, More Is Better)
c3d-standard-60 AMD Genoa: 209.00 (SE +/- 0.22, N = 3)
c6g.16xlarge: 179.52 (SE +/- 0.50, N = 3)
m7a.16xlarge: 218.86 (SE +/- 1.67, N = 3)
t2d-standard-60 AMD Milan: 222.30 (SE +/- 1.77, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.1 - Build: defconfig (Seconds, Fewer Is Better)
c6g.16xlarge: 102.22 (SE +/- 0.82, N = 3)
m7a.16xlarge: 27.71 (SE +/- 0.29, N = 5)
t2d-standard-60 AMD Milan: 33.40 (SE +/- 0.37, N = 5)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
c3d-standard-60 AMD Genoa: 4.391 (SE +/- 0.011, N = 3)
c6g.16xlarge: 2.766 (SE +/- 0.001, N = 3)
m7a.16xlarge: 7.655 (SE +/- 0.035, N = 3)
t2d-standard-60 AMD Milan: 5.289 (SE +/- 0.005, N = 3)
1. (CXX) g++ options: -O3

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 96257.48 (SE +/- 122.23, N = 3)
c6g.16xlarge: 24229.14 (SE +/- 7.69, N = 3)
m7a.16xlarge: 193219.12 (SE +/- 560.75, N = 3)
t2d-standard-60 AMD Milan: 122720.61 (SE +/- 42.77, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 28.02 (SE +/- 0.25, N = 3)
c6g.16xlarge: 25.87 (SE +/- 0.01, N = 3)
m7a.16xlarge: 11.59 (SE +/- 0.04, N = 3)
t2d-standard-60 AMD Milan: 24.57 (SE +/- 0.31, N = 12)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

TensorFlow 2.12 - Device: CPU - Batch Size: 16 - Model: ResNet-50 (images/sec, More Is Better)
c3d-standard-60 AMD Genoa: 50.99 (SE +/- 0.04, N = 3)
m7a.16xlarge: 69.55 (SE +/- 0.20, N = 3)
t2d-standard-60 AMD Milan: 18.29 (SE +/- 0.05, N = 3)

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 3.6 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
m7a.16xlarge: 37.12 (SE +/- 0.10, N = 3)
t2d-standard-60 AMD Milan: 45.22 (SE +/- 0.13, N = 3)

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 3783.60 (SE +/- 30.67, N = 3)
c6g.16xlarge: 2213.76 (SE +/- 7.28, N = 3)
m7a.16xlarge: 7501.76 (SE +/- 8.54, N = 3)
t2d-standard-60 AMD Milan: 4935.68 (SE +/- 51.21, N = 5)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 3.6 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
m7a.16xlarge: 27.74 (SE +/- 0.04, N = 3)
t2d-standard-60 AMD Milan: 34.27 (SE +/- 0.06, N = 3)

Rodinia

Test: OpenMP Leukocyte

Rodinia 3.1 - Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 45.50 (SE +/- 0.17, N = 3)
m7a.16xlarge: 34.66 (SE +/- 0.32, N = 3)
t2d-standard-60 AMD Milan: 42.01 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
c3d-standard-60 AMD Genoa: 962889833 (SE +/- 2060519.90, N = 3)
c6g.16xlarge: 1032893667 (SE +/- 176147.98, N = 3)
m7a.16xlarge: 1843444333 (SE +/- 1428129.35, N = 3)
t2d-standard-60 AMD Milan: 920427767 (SE +/- 1088162.98, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01 - Test: Decompression Rating (MIPS, More Is Better)
c3d-standard-60 AMD Genoa: 226211 (SE +/- 519.21, N = 3)
c6g.16xlarge: 234046 (SE +/- 57.33, N = 3)
m7a.16xlarge: 282593 (SE +/- 342.37, N = 3)
t2d-standard-60 AMD Milan: 247255 (SE +/- 347.74, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01 - Test: Compression Rating (MIPS, More Is Better)
c3d-standard-60 AMD Genoa: 271795 (SE +/- 346.51, N = 3)
c6g.16xlarge: 239735 (SE +/- 359.95, N = 3)
m7a.16xlarge: 330633 (SE +/- 400.99, N = 3)
t2d-standard-60 AMD Milan: 278973 (SE +/- 388.74, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 39647.47 (SE +/- 600.19, N = 15)
c6g.16xlarge: 21386.37 (SE +/- 2.85, N = 3)
m7a.16xlarge: 103413.02 (SE +/- 446.85, N = 3)
t2d-standard-60 AMD Milan: 54846.18 (SE +/- 137.77, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Remhos

Test: Sample Remap Example

Remhos 1.0 - Test: Sample Remap Example (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 33.36 (SE +/- 0.17, N = 3)
c6g.16xlarge: 20.82 (SE +/- 0.04, N = 3)
m7a.16xlarge: 13.87 (SE +/- 0.02, N = 3)
t2d-standard-60 AMD Milan: 16.33 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
c3d-standard-60 AMD Genoa: 1445843.52 (SE +/- 1295.68, N = 3)
c6g.16xlarge: 1259870.72 (SE +/- 635.29, N = 3)
m7a.16xlarge: 2158639.27 (SE +/- 1437.63, N = 3)
t2d-standard-60 AMD Milan: 1730658.45 (SE +/- 9191.61, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

libxsmm

M N K: 32

libxsmm 2-1.17-3645 - M N K: 32 (GFLOPS/s, More Is Better)
c3d-standard-60 AMD Genoa: 255.4 (SE +/- 0.19, N = 3)
c6g.16xlarge: 312.7 (SE +/- 0.47, N = 3)
m7a.16xlarge: 643.4 (SE +/- 0.40, N = 3)
t2d-standard-60 AMD Milan: 289.2 (SE +/- 3.60, N = 4)
Per-system notes: -lquadmath -msse4.2; -march=armv8.1-a; -lquadmath -msse4.2; -lquadmath -msse4.2
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 19597.86 (SE +/- 77.92, N = 3)
c6g.16xlarge: 13343.35 (SE +/- 23.52, N = 3)
m7a.16xlarge: 42007.57 (SE +/- 178.46, N = 3)
t2d-standard-60 AMD Milan: 16649.37 (SE +/- 1215.82, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

libxsmm

M N K: 64

libxsmm 2-1.17-3645 - M N K: 64 (GFLOPS/s, More Is Better)
c3d-standard-60 AMD Genoa: 489.7 (SE +/- 0.12, N = 3)
c6g.16xlarge: 589.5 (SE +/- 0.96, N = 3)
m7a.16xlarge: 1201.8 (SE +/- 0.52, N = 3)
t2d-standard-60 AMD Milan: 554.2 (SE +/- 0.25, N = 3)
Per-system notes: -lquadmath -msse4.2; -march=armv8.1-a; -lquadmath -msse4.2; -lquadmath -msse4.2
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 6.448 (SE +/- 0.104, N = 15)
c6g.16xlarge: 14.212 (SE +/- 0.017, N = 3)
m7a.16xlarge: 5.930 (SE +/- 0.049, N = 3)
t2d-standard-60 AMD Milan: 6.423 (SE +/- 0.009, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 10.025 (SE +/- 0.013, N = 3)
c6g.16xlarge: 5.983 (SE +/- 0.001, N = 3)
m7a.16xlarge: 6.480 (SE +/- 0.007, N = 3)
t2d-standard-60 AMD Milan: 7.368 (SE +/- 0.034, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 1.0 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 6.889 (SE +/- 0.099, N = 3)
c6g.16xlarge: 8.879 (SE +/- 0.032, N = 3)
m7a.16xlarge: 5.678 (SE +/- 0.008, N = 3)
t2d-standard-60 AMD Milan: 7.639 (SE +/- 0.031, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 5.87157885 (SE +/- 0.04970425, N = 3)
c6g.16xlarge: 5.61811686 (SE +/- 0.01888616, N = 3)
m7a.16xlarge: 2.89602661 (SE +/- 0.03993251, N = 3)
t2d-standard-60 AMD Milan: 5.63057327 (SE +/- 0.02210564, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
c3d-standard-60 AMD Genoa: 42701.83 (SE +/- 29.69, N = 3)
c6g.16xlarge: 25661.04 (SE +/- 10.99, N = 3)
m7a.16xlarge: 121293.80 (SE +/- 526.14, N = 3)
t2d-standard-60 AMD Milan: 47291.96 (SE +/- 145.06, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

libavif avifenc

Encoder Speed: 6

libavif avifenc 1.0 - Encoder Speed: 6 (Seconds, Fewer Is Better)
c3d-standard-60 AMD Genoa: 3.250 (SE +/- 0.007, N = 3)
c6g.16xlarge: 4.467 (SE +/- 0.014, N = 3)
m7a.16xlarge: 2.649 (SE +/- 0.015, N = 3)
t2d-standard-60 AMD Milan: 3.205 (SE +/- 0.013, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better)
c3d-standard-60 AMD Genoa: 57.31 (SE +/- 1.29, N = 15)
c6g.16xlarge: 32.36 (SE +/- 0.10, N = 3)
m7a.16xlarge: 71.11 (SE +/- 0.63, N = 15)
t2d-standard-60 AMD Milan: 60.03 (SE +/- 0.66, N = 3)
1. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 23Jun2022 - Model: Rhodopsin Protein (ns/day, More Is Better)
c3d-standard-60 AMD Genoa: 17.42 (SE +/- 0.54, N = 12)
c6g.16xlarge: 26.04 (SE +/- 0.04, N = 3)
m7a.16xlarge: 32.79 (SE +/- 0.10, N = 3)
t2d-standard-60 AMD Milan: 27.83 (SE +/- 0.17, N = 3)
Per-system notes: -lm; -lm; -lm
1. (CXX) g++ options: -O3 -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better)
c3d-standard-60 AMD Genoa: 93.60 (SE +/- 1.78, N = 12)
c6g.16xlarge: 79.02 (SE +/- 0.71, N = 3)
m7a.16xlarge: 124.36 (SE +/- 1.18, N = 15)
t2d-standard-60 AMD Milan: 106.03 (SE +/- 0.91, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
c3d-standard-60 AMD Genoa: 148.58 (SE +/- 2.19, N = 12)
c6g.16xlarge: 202.45 (SE +/- 0.57, N = 3)
m7a.16xlarge: 190.60 (SE +/- 2.16, N = 15)
t2d-standard-60 AMD Milan: 196.95 (SE +/- 0.82, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
c3d-standard-60 AMD Genoa: 88.63 (SE +/- 0.52, N = 3)
c6g.16xlarge: 129.17 (SE +/- 0.11, N = 3)
m7a.16xlarge: 121.51 (SE +/- 1.44, N = 15)
t2d-standard-60 AMD Milan: 109.68 (SE +/- 0.78, N = 3)
1. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5