AWS Graviton Benchmarks

Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2407235-NE-2407230NE75&rdt&grs.

AWS Graviton BenchmarksProcessorMotherboardMemoryDiskNetworkOSKernelCompilerFile-SystemGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xlARMv8 Cortex-A72 (16 Cores)Amazon EC2 a1.metal (1.0 BIOS)32GB429GB Amazon Elastic Block StoreAmazon ElasticUbuntu 24.046.8.0-1009-aws (aarch64)GCC 13.2.0ext4ARMv8 Neoverse-V2 (96 Cores)Amazon EC2 r8g.metal-24xl (1.0 BIOS)12 x 64GB DDR5-5600MT/sOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v Java Details- OpenJDK Runtime Environment (build 11.0.23+9-post-Ubuntu-1ubuntu1)Python Details- Python 3.12.3Security Details- Graviton1 16 Cores a1.metal: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Not affected + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Branch predictor hardening BHB + srbds: Not affected + tsx_async_abort: Not affected - Graviton4 96 Cores r8g.metal-24xl: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AWS Graviton Benchmarksspeedb: Read Rand Write Randrocksdb: Read While Writingcassandra: Writesstress-ng: Matrix Mathrocksdb: Update Randgraphics-magick: Noise-Gaussianavifenc: 6stress-ng: CPU Cachemt-dgemm: Sustained Floating-Point Ratepyperformance: pathlibspeedb: Update Randavifenc: 6, Losslessbuild-php: Time To Compileavifenc: 0avifenc: 2numpy: speedb: Read While Writinggraphics-magick: Resizingpyperformance: regex_compilestress-ng: Vector Floating Pointcompress-lz4: 2 - Compression Speedpyperformance: xml_etreegraphics-magick: HWB Color Spacesrsran: PUSCH Processor Benchmark, Throughput Threadphpbench: PHP Benchmark Suitepyperformance: json_loadspyperformance: crypto_pyaespyperformance: python_startupwebp: Quality 100, Losslesscompress-lz4: 12 - Decompression Speedcompress-lz4: 9 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedc-ray: 4K - 16c-ray: 5K - 16c-ray: 1080p - 16pgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencyspeedb: Rand Readstress-ng: Matrix 3D Mathrocksdb: Rand Readgromacs: MPI CPU - water_GMX50_bareopenssl: SHA512srsran: PDSCH Processor Benchmark, Throughput Threadcompress-lz4: 1 - Compression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 2 - Decompression Speedcompress-lz4: 1 - Decompression Speedwebp: Defaultcompress-lz4: 12 - Compression Speedwebp: Quality 100webp: Quality 100, Highest Compressionblender: Classroom - CPU-Onlyopenssl: ChaCha20blender: BMW27 - CPU-Onlyminife: Smallopenssl: ChaCha20-Poly1305srsran: PDSCH Processor Benchmark, Throughput Totalhpcg: 104 104 104 - 60stress-ng: Memory Copyingbuild-nodejs: Time To Compileopenssl: AES-256-GCMstress-ng: Vector Mathjohn-the-ripper: HMAC-SHA512stress-ng: Trigonometric Mathstress-ng: Logarithmic Mathblender: Fishy Cat - CPU-Onlygraphics-magick: Enhancedcompress-7zip: Compression Ratingopenssl: AES-128-GCMstress-ng: Floating Pointstress-ng: Power Mathcoremark: CoreMark Size 666 - Iterations Per Secondopenssl: SHA256stress-ng: Fused Multiply-Addbuild-godot: Time To Compilegraphics-magick: Sharpenjohn-the-ripper: MD5rocksdb: Read Rand Write Randcompress-7zip: Decompression Ratingsrsran: PUSCH Processor Benchmark, Throughput Totaljohn-the-ripper: WPA PSKjohn-the-ripper: bcryptjohn-the-ripper: Blowfishpgbench: 100 - 1000 - Read Write - Average Latencypgbench: 100 - 1000 - Read Writestockfish: Chess BenchmarkGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl5526211268650263487343.991646162621.288631797.130.87925011012816728.229479.642668.499381.112104.592433641614553466.2582.512308118.723421172.122725.90.401090.31102.037.301137.3566.9461008.693140.8221018709.81623286630838.94232528210.313250589187794.0237.6813.721153.11514.74.634.833.251.431289.618285450333599.394159.1265541467701307.33.768671997.894663.2762577349645728508.49103216672998.753302.68952.43203269730455741337852.81609.16192273.03371561517499334928995.121674.4692619042657445841244169.3789478428003215.89046322493057534291111560819239658631796.6513274232082.7774735225.1760.63583917.67293985.23789.999130.20380.243495.7811490911278101119727.57329.4158.230169.482922120.665.07.551.333595.53573.1115.143489.518.92733.6784.89128932640.34664093847122886.625274585476.92653203418283278.7697.8939.803326.13927.711.6111.957.823.2269.4014879168876033.7270677.811069945000322062.761.619532525.12289.267400130558450438910.4015823833345810.3750252.6664.0729447985144487114879712455.0223418.402732642.2688918560484677065585333.04126.346325236733370471014946601961.7862818564785667232.4334303118337918OpenBenchmarking.org

Speedb

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read Random Write RandomGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1.1M2.2M3.3M4.4M5.5MSE +/- 214.78, N = 3SE +/- 4241.50, N = 355262153429111. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl2M4M6M8M10MSE +/- 16876.08, N = 3SE +/- 18723.10, N = 31268650115608191. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: WritesGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl50K100K150K200K250KSE +/- 148.76, N = 3SE +/- 835.72, N = 326348239658

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Matrix MathGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl140K280K420K560K700KSE +/- 74.42, N = 3SE +/- 9.55, N = 37343.99631796.651. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

RocksDB

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update RandomGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl300K600K900K1200K1500KSE +/- 443.99, N = 3SE +/- 5313.65, N = 316461613274231. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl50100150200250SE +/- 0.00, N = 3SE +/- 0.67, N = 3262081. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl510152025SE +/- 0.032, N = 3SE +/- 0.014, N = 321.2882.7771. (CXX) g++ options: -O3 -fPIC -lm

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: CPU CacheGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1000K2000K3000K4000K5000KSE +/- 3895.03, N = 3SE +/- 57877.80, N = 15631797.134735225.171. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1428425670SE +/- 0.003641, N = 3SE +/- 0.164173, N = 30.87925060.6358391. (CC) gcc options: -O3 -march=native -fopenmp

PyPerformance

Benchmark: pathlib

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: pathlibGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3110.017.6

Speedb

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Update RandomGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl160K320K480K640K800KSE +/- 302.33, N = 3SE +/- 3844.86, N = 31281677293981. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, LosslessGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl714212835SE +/- 0.098, N = 3SE +/- 0.046, N = 328.2295.2371. (CXX) g++ options: -O3 -fPIC -lm

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 8.3.4Time To CompileGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl100200300400500SE +/- 0.12, N = 3SE +/- 0.10, N = 3479.6490.00

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl140280420560700SE +/- 3.02, N = 3SE +/- 0.01, N = 3668.50130.201. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl80160240320400SE +/- 0.85, N = 3SE +/- 0.02, N = 3381.1180.241. (CXX) g++ options: -O3 -fPIC -lm

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl110220330440550SE +/- 0.23, N = 3SE +/- 0.76, N = 3104.59495.78

Speedb

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read While WritingGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl2M4M6M8M10MSE +/- 20302.77, N = 8SE +/- 87574.70, N = 152433641114909111. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: ResizingGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl60120180240300SE +/- 0.00, N = 3SE +/- 0.58, N = 3612781. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

PyPerformance

Benchmark: regex_compile

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: regex_compileGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl100200300400500SE +/- 0.33, N = 3SE +/- 0.00, N = 3455101

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Vector Floating PointGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl30K60K90K120K150KSE +/- 0.38, N = 3SE +/- 17.26, N = 33466.25119727.571. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

LZ4 Compression

Compression Level: 2 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 2 - Compression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl70140210280350SE +/- 0.02, N = 3SE +/- 0.13, N = 382.51329.411. (CC) gcc options: -O3 -pthread

PyPerformance

Benchmark: xml_etree

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: xml_etreeGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl50100150200250SE +/- 0.58, N = 3SE +/- 0.00, N = 3230.058.2

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl70140210280350SE +/- 0.00, N = 3SE +/- 0.33, N = 3813011. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput ThreadGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1530456075SE +/- 0.23, N = 4SE +/- 0.00, N = 318.769.4MIN: 11.3 / MAX: 18.9MIN: 45.41. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl200K400K600K800K1000KSE +/- 481.21, N = 3SE +/- 273.39, N = 3234211829221

PyPerformance

Benchmark: json_loads

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: json_loadsGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1632486480SE +/- 0.10, N = 3SE +/- 0.00, N = 372.120.6

PyPerformance

Benchmark: crypto_pyaes

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: crypto_pyaesGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl50100150200250SE +/- 0.33, N = 3SE +/- 0.03, N = 3227.065.0

PyPerformance

Benchmark: python_startup

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: python_startupGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 325.907.55

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, LosslessGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl0.29930.59860.89791.19721.4965SE +/- 0.00, N = 3SE +/- 0.00, N = 30.401.331. (CC) gcc options: -fvisibility=hidden -O2 -lm

LZ4 Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 12 - Decompression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl8001600240032004000SE +/- 2.61, N = 3SE +/- 1.89, N = 31090.33595.51. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 9 - Decompression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl8001600240032004000SE +/- 0.06, N = 3SE +/- 0.40, N = 31102.03573.11. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 3 - Compression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl306090120150SE +/- 0.01, N = 3SE +/- 0.08, N = 337.30115.141. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 3 - Decompression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl7001400210028003500SE +/- 0.06, N = 3SE +/- 0.15, N = 31137.33489.51. (CC) gcc options: -O3 -pthread

C-Ray

Resolution: 4K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl120240360480600SE +/- 1.65, N = 3SE +/- 0.02, N = 3566.9518.931. (CC) gcc options: -lpthread -lm

C-Ray

Resolution: 5K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl2004006008001000SE +/- 2.98, N = 3SE +/- 0.03, N = 31008.6933.681. (CC) gcc options: -lpthread -lm

C-Ray

Resolution: 1080p - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 1080p - Rays Per Pixel: 16Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl306090120150SE +/- 0.248, N = 3SE +/- 0.007, N = 3140.8224.8911. (CC) gcc options: -lpthread -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlyGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl600K1200K1800K2400K3000KSE +/- 107.42, N = 3SE +/- 15759.90, N = 310187028932641. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencyGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl3691215SE +/- 0.010, N = 3SE +/- 0.002, N = 39.8160.3461. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Speedb

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random ReadGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl140M280M420M560M700MSE +/- 174965.25, N = 15SE +/- 229391.69, N = 3232866306409384711. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Matrix 3D MathGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl5K10K15K20K25KSE +/- 3.67, N = 3SE +/- 50.98, N = 3838.9422886.621. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl110M220M330M440M550MSE +/- 12274.71, N = 3SE +/- 53499.62, N = 3232528215274585471. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl246810SE +/- 0.000, N = 3SE +/- 0.001, N = 30.3136.9261. (CXX) g++ options: -O3 -lm

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: SHA512Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl11000M22000M33000M44000M55000MSE +/- 4619831.10, N = 3SE +/- 4634061.90, N = 32505891877532034182831. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput ThreadGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl60120180240300SE +/- 0.21, N = 3SE +/- 3.00, N = 394.0278.71. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 1 - Compression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl150300450600750SE +/- 3.13, N = 3SE +/- 0.04, N = 3237.68697.891. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 9 - Compression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 313.7239.801. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 2 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 2 - Decompression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl7001400210028003500SE +/- 0.27, N = 3SE +/- 0.13, N = 31153.13326.11. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 1 - Decompression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl8001600240032004000SE +/- 0.30, N = 3SE +/- 0.19, N = 31514.73927.71. (CC) gcc options: -O3 -pthread

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: DefaultGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl3691215SE +/- 0.01, N = 3SE +/- 0.02, N = 34.6311.611. (CC) gcc options: -fvisibility=hidden -O2 -lm

LZ4 Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 12 - Compression SpeedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 34.8311.951. (CC) gcc options: -O3 -pthread

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl246810SE +/- 0.01, N = 3SE +/- 0.01, N = 33.257.821. (CC) gcc options: -fvisibility=hidden -O2 -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, Highest CompressionGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl0.72451.4492.17352.8983.6225SE +/- 0.01, N = 3SE +/- 0.00, N = 31.433.221. (CC) gcc options: -fvisibility=hidden -O2 -lm

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Classroom - Compute: CPU-OnlyGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl30060090012001500SE +/- 0.28, N = 3SE +/- 0.03, N = 31289.6169.40

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl30000M60000M90000M120000M150000MSE +/- 4780384.74, N = 3SE +/- 430328.22, N = 382854503331487916887601. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: BMW27 - Compute: CPU-OnlyGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl130260390520650SE +/- 0.63, N = 3SE +/- 0.03, N = 3599.3933.72

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl15K30K45K60K75KSE +/- 1.93, N = 3SE +/- 233.77, N = 34159.1270677.801. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20-Poly1305Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl20000M40000M60000M80000M100000MSE +/- 4948671.03, N = 3SE +/- 512363.59, N = 365541467701106994500031. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput TotalGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl5K10K15K20K25KSE +/- 10.32, N = 3SE +/- 15.31, N = 31307.322062.71. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1428425670SE +/- 0.00496, N = 3SE +/- 0.01489, N = 33.7686761.619501. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory CopyingGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl7K14K21K28K35KSE +/- 0.09, N = 3SE +/- 13.60, N = 31997.8932525.121. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl10002000300040005000SE +/- 1.44, N = 3SE +/- 0.69, N = 34663.28289.27

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-256-GCMGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl90000M180000M270000M360000M450000MSE +/- 17186337.18, N = 3SE +/- 15611507.33, N = 3257734964574001305584501. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Vector MathGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl90K180K270K360K450KSE +/- 0.54, N = 3SE +/- 85.60, N = 328508.49438910.401. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl30M60M90M120M150MSE +/- 9769.57, N = 3SE +/- 162829.50, N = 3103216671582383331. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Stress-NG

Test: Trigonometric Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Trigonometric MathGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl10K20K30K40K50KSE +/- 0.04, N = 3SE +/- 11.48, N = 32998.7545810.371. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Logarithmic Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Logarithmic MathGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl11K22K33K44K55KSE +/- 0.28, N = 3SE +/- 1.22, N = 33302.6850252.661. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Fishy Cat - Compute: CPU-OnlyGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl2004006008001000SE +/- 0.38, N = 3SE +/- 0.26, N = 3952.4364.07

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl60120180240300SE +/- 0.00, N = 3SE +/- 0.67, N = 3202941. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl100K200K300K400K500KSE +/- 28.98, N = 3SE +/- 2385.17, N = 3326974798511. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-128-GCMGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl100000M200000M300000M400000M500000MSE +/- 35254075.37, N = 3SE +/- 2173107.96, N = 3304557413374448711487971. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Stress-NG

Test: Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Floating PointGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl3K6K9K12K15KSE +/- 1.60, N = 3SE +/- 2.60, N = 3852.8012455.021. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Power Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Power MathGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl5K10K15K20K25KSE +/- 0.08, N = 3SE +/- 0.98, N = 31609.1623418.401. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl600K1200K1800K2400K3000KSE +/- 24.05, N = 3SE +/- 64.82, N = 3192273.032732642.271. (CC) gcc options: -O2 -lrt" -lrt

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: SHA256Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl20000M40000M60000M80000M100000MSE +/- 62991235.00, N = 3SE +/- 7443314.27, N = 36151749933856048467701. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Fused Multiply-AddGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl14M28M42M56M70MSE +/- 2472.21, N = 3SE +/- 1063.93, N = 34928995.1265585333.041. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To CompileGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl400800120016002000SE +/- 0.31, N = 3SE +/- 1.14, N = 31674.47126.35

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl70140210280350SE +/- 0.00, N = 3SE +/- 0.33, N = 3263251. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5Graviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl500K1000K1500K2000K2500KSE +/- 287.69, N = 3SE +/- 1201.85, N = 319042623673331. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

RocksDB

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write RandomGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl1.5M3M4.5M6M7.5MSE +/- 4162.42, N = 3SE +/- 15765.85, N = 357445870471011. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl110K220K330K440K550KSE +/- 50.81, N = 3SE +/- 158.70, N = 3412444946601. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput TotalGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl400800120016002000SE +/- 0.03, N = 3SE +/- 0.07, N = 3169.31961.7MIN: 1132.6 / MAX: 1961.81. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl20K40K60K80K100KSE +/- 1.33, N = 3SE +/- 0.00, N = 37894862811. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl20K40K60K80K100KSE +/- 94.53, N = 15SE +/- 22.45, N = 37842856471. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl20K40K60K80K100KSE +/- 106.90, N = 15SE +/- 3.00, N = 38003856671. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average LatencyGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl50100150200250SE +/- 1.47, N = 3SE +/- 2.16, N = 3215.89232.431. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read WriteGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl10002000300040005000SE +/- 31.31, N = 3SE +/- 40.13, N = 3463243031. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 16.1Chess BenchmarkGraviton1 16 Cores a1.metalGraviton4 96 Cores r8g.metal-24xl30M60M90M120M150MSE +/- 39945.87, N = 9SE +/- 2918876.82, N = 1524930571183379181. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver


Phoronix Test Suite v10.8.5