Graviton4 r8g.16xlarge vs. AMD EPYC 4th Gen

Initial benchmarks by Michael Larabel

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2407108-NE-2407106NE99
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Graviton4 r8g.16xlarge
July 10
  10 Hours, 5 Minutes
EPYC 9R14 r7a.16xlarge
July 10
  5 Hours, 16 Minutes
Invert Behavior (Only Show Selected Data)
  7 Hours, 40 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Graviton4 r8g.16xlarge vs. AMD EPYC 4th GenProcessorMotherboardChipsetMemoryDiskNetworkGraphicsOSKernelCompilerFile-SystemSystem LayerScreen ResolutionGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlargeARMv8 Neoverse-V2 (64 Cores)Amazon EC2 r8g.16xlarge (1.0 BIOS)Amazon Device 0200496GB429GB Amazon Elastic Block StoreAmazon ElasticUbuntu 24.046.8.0-1009-aws (aarch64)GCC 13.2.0ext4amazonAMD EPYC 9R14 (64 Cores)Amazon EC2 r7a.16xlarge (1.0 BIOS)Intel 440FX 82441FX PMC1 x 512GB DDR5-4800MT/ssimpledrmdrmfb6.8.0-1009-aws (x86_64)800x600OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- Graviton4 r8g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v - EPYC 9R14 r7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Python Details- Python 3.12.3Security Details- Graviton4 r8g.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9R14 r7a.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected Processor Details- EPYC 9R14 r7a.16xlarge: CPU Microcode: 0xa101148

Graviton4 r8g.16xlarge vs. EPYC 9R14 r7a.16xlarge ComparisonPhoronix Test SuiteBaseline+136.8%+136.8%+273.6%+273.6%+410.4%+410.4%547.1%447.4%310.6%202.9%189.3%142.6%139.3%137.2%96.2%84.5%80.1%76.3%75.5%67.7%60.3%59.9%45.8%31.1%16%15.8%8.8%7.5%7.5%6%4.6%3.9%WPA PSKMD564 - 256 - 512ChaCha20ChaCha20-Poly1305Fishy Cat - CPU-OnlyPabellon Barcelona - CPU-OnlyHMAC-SHA512P.P.B.T.TClassroom - CPU-OnlyP.P.B.T.T5K - 1676.9%MPI CPU - water_GMX50_bareBarbershop - CPU-Only4K - 1675.4%Update Rand71.6%BMW27 - CPU-OnlybcryptBlowfishSmall58.5%R.R.W.R48.6%Time To Compile64 - 256 - 3244.1%i.i.1.C.P.D38.6%Read While Writing35.2%d.S.M.S - Mesh Time34.8%d.M.M.S - Mesh Time34.8%64 - 256 - 57Compression Rating19.5%AES-128-GCM17.1%Time To CompileGhostRider - 1MD.R14.9%Time To Compile14.2%AES-256-GCM12.5%Ninja9%d.M.M.S - Execution Time8.9%1.R.H.D.F.R.C.C100 - 1000 - Read Only - Average Latency100 - 1000 - Read OnlyRand Read6.7%1.R.H.D.S.RChess Benchmarkd.S.M.S - Execution Time4.2%1.R.H.D.T.RJohn The RipperJohn The RipperLiquid-DSPOpenSSLOpenSSLBlenderBlenderJohn The RippersrsRAN ProjectBlendersrsRAN ProjectC-RayGROMACSBlenderC-RayRocksDBBlenderJohn The RipperJohn The RipperminiFERocksDBTimed Node.js CompilationLiquid-DSPXcompact3d Incompact3dRocksDBOpenFOAMOpenFOAMLiquid-DSP7-Zip CompressionOpenSSLTimed Godot Game Engine CompilationXmrig7-Zip CompressionTimed Gem5 CompilationOpenSSLTimed LLVM CompilationOpenFOAMClickHousePostgreSQLPostgreSQLRocksDBClickHouseStockfishOpenFOAMClickHouseGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge

Graviton4 r8g.16xlarge vs. AMD EPYC 4th Genminife: Smallincompact3d: input.i3d 193 Cells Per Directionopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timexmrig: KawPow - 1Mxmrig: Monero - 1Mxmrig: Wownero - 1Mxmrig: GhostRider - 1Mxmrig: CryptoNight-Heavy - 1Mxmrig: CryptoNight-Femto UPX2 - 1Msrsran: PDSCH Processor Benchmark, Throughput Totalsrsran: PUSCH Processor Benchmark, Throughput Totaljohn-the-ripper: bcryptjohn-the-ripper: WPA PSKjohn-the-ripper: Blowfishjohn-the-ripper: HMAC-SHA512john-the-ripper: MD5compress-7zip: Compression Ratingcompress-7zip: Decompression Ratingstockfish: Chess Benchmarkbuild-gem5: Time To Compilebuild-godot: Time To Compilebuild-llvm: Ninjabuild-nodejs: Time To Compilec-ray: 4K - 16c-ray: 5K - 16liquid-dsp: 64 - 256 - 32liquid-dsp: 64 - 256 - 57liquid-dsp: 64 - 256 - 512openssl: SHA256openssl: SHA512openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305clickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Rungromacs: MPI CPU - water_GMX50_barepgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencyrocksdb: Rand Readrocksdb: Update Randrocksdb: Read While Writingrocksdb: Read Rand Write Randblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge65410.47.8750206618.38476841.41307591.974527344.167621906.021872.128304.05958.721867.221851.914402.51332.5570385744457032108066333157966738346033150581440801186.768147.870182.063365.02428.32350.24732587333331929666667185000000569711745373541280936310189872032329979710213726612515052775190548707449.44479.94495.034.83119475250.5144420226.24434653258612746168522597558401450.64105.3995.01499.63202.2641274.210.915766124.78823643.169709123.99717374.696646899.828256.82400.29140937174791221256384000864700032093328840885180777213.288127.448198.500250.42949.69188.89122615333332529533333759700000308611390393256101795407236482948213217491096737488.88508.87514.298.51920927100.4784458224.3233247036697428986304192375658530.1957.1139.16284.6384.52OpenBenchmarking.org

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge14K28K42K56K70KSE +/- 28.12, N = 3SE +/- 136.71, N = 365410.441274.21. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge3691215SE +/- 0.01884341, N = 3SE +/- 0.01638008, N = 37.8750206610.915766101. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge61218243018.3824.79-mcpu=native-m641. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge102030405041.4143.17-mcpu=native-m641. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh TimeGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge30609012015091.97124.00-mcpu=native-m641. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution TimeGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge80160240320400344.17374.70-mcpu=native-m641. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: KawPow - Hash Count: 1MGraviton4 r8g.16xlarge5K10K15K20K25KSE +/- 59.25, N = 321906.01. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Variant: KawPow - Hash Count: 1M

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Monero - Hash Count: 1MGraviton4 r8g.16xlarge5K10K15K20K25KSE +/- 40.15, N = 321872.11. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Variant: Monero - Hash Count: 1M

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Wownero - Hash Count: 1MGraviton4 r8g.16xlarge6K12K18K24K30KSE +/- 16.31, N = 328304.01. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Variant: Wownero - Hash Count: 1M

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1MGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge15003000450060007500SE +/- 47.30, N = 12SE +/- 2.15, N = 35958.76899.8-maes1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: CryptoNight-Heavy - Hash Count: 1MGraviton4 r8g.16xlarge5K10K15K20K25KSE +/- 16.81, N = 321867.21. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Variant: CryptoNight-Heavy - Hash Count: 1M

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: CryptoNight-Femto UPX2 - Hash Count: 1MGraviton4 r8g.16xlarge5K10K15K20K25KSE +/- 9.16, N = 321851.91. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Variant: CryptoNight-Femto UPX2 - Hash Count: 1M

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

srsRAN Project

srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput TotalGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge6K12K18K24K30KSE +/- 34.15, N = 3SE +/- 148.22, N = 314402.528256.8-march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput TotalGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge5001000150020002500SE +/- 0.03, N = 3SE +/- 0.07, N = 31332.52400.2MIN: 784 / MAX: 1332.6-march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq - MIN: 1696.6 / MAX: 2400.31. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge20K40K60K80K100KSE +/- 2.33, N = 3SE +/- 75.52, N = 35703891409-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge80K160K240K320K400KSE +/- 10.97, N = 3SE +/- 164.01, N = 357444371747-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge20K40K60K80K100KSE +/- 9.29, N = 3SE +/- 117.95, N = 35703291221-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge50M100M150M200M250MSE +/- 640028.21, N = 3SE +/- 730794.32, N = 3108066333256384000-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge2M4M6M8M10MSE +/- 1201.85, N = 3SE +/- 7571.88, N = 315796678647000-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge80K160K240K320K400KSE +/- 397.84, N = 3SE +/- 1508.30, N = 33834603209331. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge70K140K210K280K350KSE +/- 107.45, N = 3SE +/- 148.35, N = 33315052884081. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 16.1Chess BenchmarkGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge20M40M60M80M100MSE +/- 2235406.92, N = 15SE +/- 927253.56, N = 128144080185180777-m64 -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To CompileGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge50100150200250SE +/- 1.43, N = 12SE +/- 2.25, N = 3186.77213.29

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To CompileGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge306090120150SE +/- 0.35, N = 3SE +/- 0.19, N = 3147.87127.45

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge4080120160200SE +/- 0.20, N = 3SE +/- 0.91, N = 3182.06198.50

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge80160240320400SE +/- 0.19, N = 3SE +/- 0.19, N = 3365.02250.43

C-Ray

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge1122334455SE +/- 0.03, N = 3SE +/- 0.03, N = 328.3249.691. (CC) gcc options: -lpthread -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge20406080100SE +/- 0.07, N = 3SE +/- 0.10, N = 350.2588.891. (CC) gcc options: -lpthread -lm

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge700M1400M2100M2800M3500MSE +/- 533333.33, N = 3SE +/- 3199131.83, N = 3325873333322615333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge500M1000M1500M2000M2500MSE +/- 33333.33, N = 3SE +/- 6691619.97, N = 3192966666725295333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge160M320M480M640M800MSE +/- 0.00, N = 3SE +/- 1140935.29, N = 31850000007597000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenSSL

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: SHA256Graviton4 r8g.16xlarge12000M24000M36000M48000M60000MSE +/- 10002237.60, N = 3569711745371. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Algorithm: SHA256

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: SHA512Graviton4 r8g.16xlarge8000M16000M24000M32000M40000MSE +/- 8695334.33, N = 3354128093631. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Algorithm: SHA512

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

Algorithm: RSA4096

Graviton4 r8g.16xlarge: The test run did not produce a result.

EPYC 9R14 r7a.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge70000M140000M210000M280000M350000MSE +/- 66408.09, N = 3SE +/- 350599170.76, N = 31018987203233086113903931. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-128-GCMGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge60000M120000M180000M240000M300000MSE +/- 7802917.99, N = 3SE +/- 358257182.84, N = 32997971021372561017954071. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-256-GCMGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge60000M120000M180000M240000M300000MSE +/- 6417811.59, N = 3SE +/- 338102973.44, N = 32661251505272364829482131. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20-Poly1305Graviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge50000M100000M150000M200000M250000MSE +/- 558939.35, N = 3SE +/- 197460025.45, N = 3751905487072174910967371. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold CacheGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge110220330440550SE +/- 4.96, N = 9SE +/- 6.79, N = 3449.44488.88MIN: 42.7 / MAX: 6666.67MIN: 40.21 / MAX: 5000

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second RunGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge110220330440550SE +/- 5.05, N = 9SE +/- 2.24, N = 3479.94508.87MIN: 43.1 / MAX: 6000MIN: 40.57 / MAX: 6000

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third RunGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge110220330440550SE +/- 6.71, N = 9SE +/- 9.86, N = 3495.03514.29MIN: 43.07 / MAX: 6666.67MIN: 40.51 / MAX: 6000

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge246810SE +/- 0.004, N = 3SE +/- 0.041, N = 34.8318.5191. (CXX) g++ options: -O3 -lm

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge400K800K1200K1600K2000KSE +/- 6233.51, N = 3SE +/- 7113.73, N = 3194752520927101. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge0.11570.23140.34710.46280.5785SE +/- 0.002, N = 3SE +/- 0.002, N = 30.5140.4781. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read WriteGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge10002000300040005000SE +/- 14.53, N = 3SE +/- 34.82, N = 3442044581. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average LatencyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge50100150200250SE +/- 0.74, N = 3SE +/- 1.74, N = 3226.24224.321. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge70M140M210M280M350MSE +/- 1371022.17, N = 3SE +/- 827628.35, N = 33465325863247036691. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update RandomGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge300K600K900K1200K1500KSE +/- 11457.12, N = 3SE +/- 4861.01, N = 312746167428981. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge2M4M6M8M10MSE +/- 26974.32, N = 3SE +/- 43017.34, N = 13852259763041921. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write RandomGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge1.2M2.4M3.6M4.8M6MSE +/- 11170.68, N = 3SE +/- 17358.30, N = 3558401437565851. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing is supported. This system/blender test profile makes use of the system-supplied Blender. Use pts/blender if wishing to stick to a fixed version of Blender. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: BMW27 - Compute: CPU-OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge1122334455SE +/- 0.10, N = 3SE +/- 0.08, N = 350.6430.19

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Classroom - Compute: CPU-OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge20406080100SE +/- 0.11, N = 3SE +/- 0.14, N = 3105.3957.11

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Fishy Cat - Compute: CPU-OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge20406080100SE +/- 0.26, N = 3SE +/- 0.06, N = 395.0139.16

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Barbershop - Compute: CPU-OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge110220330440550SE +/- 0.76, N = 3SE +/- 0.03, N = 3499.63284.63

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Pabellon Barcelona - Compute: CPU-OnlyGraviton4 r8g.16xlargeEPYC 9R14 r7a.16xlarge4080120160200SE +/- 0.34, N = 3SE +/- 0.11, N = 3202.2684.52