AmpereOne CPU Benchmarks

AmpereOne benchmarks by Michael Larabel for a future article review on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/2408294-NE-AMPEREONE67&grw&sor.

AmpereOne CPU BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 32 CoresAmpereOne @ 3.20GHz (128 Cores)Supermicro ARS-211M-NR R13SPD v1.02 (T20240726102529 BIOS)Ampere Computing LLC Device e2088 x 64GB DDR5-5200MT/s3841GB SAMSUNG MZQL23T8HCLS-00A07 + 960GB SAMSUNG MZ1L2960HCJR-00A07ASPEEDVGA HDMI2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb + 2 x Mellanox MT2892Ubuntu 24.046.8.0-39-generic-64k (aarch64)GCC 13.2.0ext41920x1080AmpereOne @ 3.20GHz (64 Cores)AmpereOne @ 3.20GHz (96 Cores)AmpereOne @ 3.20GHz (160 Cores)AmpereOne @ 3.20GHz (32 Cores)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v Processor Details- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)Python Details- Python 3.12.3Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Not affected + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AmpereOne CPU Benchmarksastcenc: Thoroughastcenc: Very Thoroughastcenc: Exhaustivexmrig: GhostRider - 1Mquantlib: Multi-Threadedwrf: conus 2.5kmcloverleaf: clover_bm64_shortcloverleaf: clover_bm16pytorch: CPU - 512 - ResNet-50gromacs: MPI CPU - water_GMX50_barelammps: Rhodopsin Proteinlammps: 20k Atomshpcg: 144 144 144 - 60npb: EP.Dnpb: SP.Cnpb: IS.Donednn: Deconvolution Batch shapes_3d - CPUaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingpennant: leblancbigpennant: sedovbigamg: lulesh: nwchem: C240 Buckyballopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeqmcpack: Li2_STO_aeminife: Smallincompact3d: input.i3d 193 Cells Per Directionincompact3d: X3D-benchmarking input.i3dgpaw: Carbon Nanotubecoremark: CoreMark Size 666 - Iterations Per Secondprimesieve: 1e13stockfish: Chess Benchmarkcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingjohn-the-ripper: Blowfishjohn-the-ripper: bcryptbuild-llvm: Ninjacompress-pbzip2: FreeBSD-13.0-RELEASE-amd64-memstick.img Compressionm-queens: Time To Solvegraphics-magick: Noise-Gaussiangraphics-magick: Enhancedgraphics-magick: Sharpengraphics-magick: Swirlbuild-gem5: Time To Compilebuild-mesa: Time To Compilebuild-nodejs: Time To Compilehelsing: 14 digitliquid-dsp: 128 - 256 - 32srsran: PUSCH Processor Benchmark, Throughput Totalsrsran: PDSCH Processor Benchmark, Throughput Totalspeedb: Rand Readrocksdb: Rand Readrocksdb: Read While Writingpgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average LatencyAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 32 Cores33.78714.92953.071312738.0200797.89608.64137.85329.1524.065.20645.74948.08731.96715087.1425255.322420.962.4171618018.818386.94.7802156.060620179133300041666.5932239.127.25956737.727827143.92786376.84134109.8839605.810.20729636307.12253844.9812717686.04904819.49089911572604723601553121100120886192.2002.0548557.913203253308662210.56717.495233.61543.40129821333331883.418948.2507348157498075884846009527135880.36917.07172.46981.53786065.1100367.912930.95240.18354.2825.572.85130.78831.04520.72482542.4813715.041500.884.1637211286.013258.79.33420613.16158135410566733097.409237429.47200657.366708160.16405563.9277127.0029130.113.7540960367.40520270.9101520096.92899637.996437519053446263056446088260787239.7653.12852915.562142131176348235.76919.045312.61887.0111495033333994.311144.9249704806248932877665573025750900.38825.47843.70032.30449542.9150449.510563.09138.49330.7526.224.07939.03239.88327.78093791.7917166.691492.182.9416614696.016514.55.8855118.135246169275000033783.3792214.228.29350142.96047147.84581442.26537114.9636739.611.1763146318.37908953.1032243103.32055525.559663790264818274586139107191068209.9302.53972010.437178194249507222.55817.907254.37858.07822380666671470.315292.2380234305372309429806806927422420.36541.96866.15473.835315534.2250463.69224.58538.23339.3422.176.25650.85151.56133.63026325.0428844.112420.502.7322420081.718121.73.7946674.984562183205366741723.9222239.727.80580432.815077146.85839335.45136106.3240530.19.41105728293.59717841.8823069687.78342415.972119077506698343748883150613149891182.7391.8378186.371226302387807202.67317.127217.67435.17529839333332354.021927.6633736504621920651915633427787840.3618.57661.23610.76963072.950295.121231.05554.69499.1622.321.46817.78118.52211.25871286.865519.451102.128.043027012.878920.1119.5464625.9085077429850018959.6644230.133.484845104.62499199.90819995.61852164.2316927.622.7074776549.043030117.878771783.50179375.355217210771892721547543047230487366.0105.30323830.963856691179311.65023.212494.123174.210754193333551.85962.0126847842124567637343604911913400.839OpenBenchmarking.org

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: ThoroughAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores1020304050SE +/- 0.0029, N = 4SE +/- 0.0055, N = 3SE +/- 0.0007, N = 3SE +/- 0.0020, N = 3SE +/- 0.0000, N = 341.968633.787125.478417.07178.57661. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Very Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: Very ThoroughAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores246810SE +/- 0.0006, N = 3SE +/- 0.0008, N = 3SE +/- 0.0003, N = 3SE +/- 0.0002, N = 3SE +/- 0.0002, N = 36.15474.92953.70032.46981.23611. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: ExhaustiveAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores0.86291.72582.58873.45164.3145SE +/- 0.0003, N = 3SE +/- 0.0004, N = 3SE +/- 0.0002, N = 3SE +/- 0.0003, N = 3SE +/- 0.0002, N = 33.83533.07132.30441.53780.76961. (CXX) g++ options: -O3 -flto -pthread

Xmrig

Variant: GhostRider - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1MAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores3K6K9K12K15KSE +/- 93.26, N = 3SE +/- 96.21, N = 15SE +/- 63.96, N = 3SE +/- 83.23, N = 3SE +/- 9.24, N = 315534.212738.09542.96065.13072.91. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

QuantLib

Configuration: Multi-Threaded

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Multi-ThreadedAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores50K100K150K200K250KSE +/- 30.07, N = 3SE +/- 57.74, N = 3SE +/- 43.40, N = 3SE +/- 1.35, N = 3SE +/- 17.09, N = 3250463.6200797.8150449.5100367.950295.11. (CXX) g++ options: -O3 -march=native -fPIE -pie

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5kmAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores5K10K15K20K25K9224.599608.6410563.0912930.9521231.061. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores1224364860SE +/- 0.14, N = 3SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 337.8538.2338.4940.1854.691. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

CloverLeaf

Input: clover_bm16

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16AmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores110220330440550SE +/- 0.28, N = 3SE +/- 0.22, N = 3SE +/- 0.31, N = 3SE +/- 1.09, N = 3SE +/- 0.21, N = 3329.15330.75339.34354.28499.161. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

PyTorch

Device: CPU - Batch Size: 512 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 512 - Model: ResNet-50AmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 32 CoresAmpereOne A192-32X @ 160 Cores612182430SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.22, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 326.2225.5724.0622.3222.17MIN: 24.56 / MAX: 26.93MIN: 19.76 / MAX: 26.07MIN: 20.95 / MAX: 24.81MIN: 21.91 / MAX: 22.77MIN: 20.09 / MAX: 22.55

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores246810SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.016, N = 56.2565.2064.0792.8511.4681. (CXX) g++ options: -O3 -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores1122334455SE +/- 0.05, N = 9SE +/- 1.42, N = 15SE +/- 1.11, N = 15SE +/- 0.02, N = 11SE +/- 0.01, N = 1250.8545.7539.0330.7917.781. (CXX) g++ options: -O3 -lm -ldl

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores1224364860SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 351.5648.0939.8831.0518.521. (CXX) g++ options: -O3 -lm -ldl

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60AmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores816243240SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 333.6331.9727.7820.7211.261. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores14002800420056007000SE +/- 2.29, N = 3SE +/- 24.16, N = 3SE +/- 4.87, N = 3SE +/- 12.37, N = 3SE +/- 3.57, N = 36325.045087.143791.792542.481286.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores6K12K18K24K30KSE +/- 18.61, N = 3SE +/- 14.45, N = 3SE +/- 12.08, N = 3SE +/- 8.73, N = 3SE +/- 7.19, N = 328844.1125255.3217166.6913715.045519.451. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 32 Cores5001000150020002500SE +/- 4.94, N = 3SE +/- 13.09, N = 3SE +/- 18.24, N = 4SE +/- 18.69, N = 3SE +/- 8.53, N = 32420.962420.501500.881492.181102.121. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.4Harness: Deconvolution Batch shapes_3d - Engine: CPUAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores246810SE +/- 0.00575, N = 9SE +/- 0.01136, N = 9SE +/- 0.00486, N = 9SE +/- 0.01038, N = 9SE +/- 0.00301, N = 92.417162.732242.941664.163728.04302MIN: 2.3MIN: 2.59MIN: 2.85MIN: 4.05MIN: 7.971. (CXX) g++ options: -O3 -march=native -fopenmp -mcpu=generic -fPIC -pie -ldl -lpthread

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores4K8K12K16K20KSE +/- 67.66, N = 3SE +/- 44.66, N = 3SE +/- 192.84, N = 3SE +/- 35.02, N = 3SE +/- 15.65, N = 320081.7018018.8014696.0011286.007012.871. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores4K8K12K16K20KSE +/- 26.80, N = 3SE +/- 203.80, N = 3SE +/- 28.93, N = 3SE +/- 73.63, N = 3SE +/- 25.34, N = 318386.9018121.7016514.5013258.708920.111. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores510152025SE +/- 0.062223, N = 15SE +/- 0.149446, N = 12SE +/- 0.038839, N = 6SE +/- 0.018943, N = 5SE +/- 0.002751, N = 33.7946674.7802155.8855119.33420619.5464601. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores612182430SE +/- 0.095285, N = 15SE +/- 0.034211, N = 6SE +/- 0.029025, N = 5SE +/- 0.059346, N = 4SE +/- 0.031676, N = 34.9845626.0606208.13524613.16158025.9085001. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2AmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores400M800M1200M1600M2000MSE +/- 626107.91, N = 3SE +/- 541472.38, N = 3SE +/- 739129.89, N = 3SE +/- 1045977.11, N = 3SE +/- 568382.82, N = 318320536671791333000169275000013541056677742985001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3AmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores9K18K27K36K45KSE +/- 83.25, N = 3SE +/- 158.62, N = 3SE +/- 252.99, N = 4SE +/- 164.49, N = 4SE +/- 202.33, N = 541723.9241666.5933783.3833097.4118959.661. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores90018002700360045002214.22239.12239.72374.04230.11. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores81624324027.2627.8128.2929.4733.481. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores2040608010032.8237.7342.9657.37104.621. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh TimeAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores4080120160200143.93146.86147.85160.16199.911. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution TimeAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores2004006008001000335.45376.84442.27563.93995.621. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

QMCPACK

Input: Li2_STO_ae

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: Li2_STO_aeAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores4080120160200SE +/- 0.37, N = 3SE +/- 0.64, N = 3SE +/- 0.17, N = 3SE +/- 0.23, N = 3SE +/- 0.15, N = 3106.32109.88114.96127.00164.231. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores9K18K27K36K45KSE +/- 162.64, N = 4SE +/- 60.85, N = 4SE +/- 106.36, N = 4SE +/- 89.27, N = 4SE +/- 6.97, N = 340530.139605.836739.629130.116927.61. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores510152025SE +/- 0.25501410, N = 15SE +/- 0.22363750, N = 12SE +/- 0.00898947, N = 4SE +/- 0.00881027, N = 4SE +/- 0.22492051, N = 39.4110572810.2072963611.1763146013.7540960022.707477601. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores120240360480600SE +/- 0.46, N = 3SE +/- 1.73, N = 3SE +/- 0.37, N = 3SE +/- 0.81, N = 3SE +/- 0.12, N = 3293.60307.12318.38367.41549.041. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon NanotubeAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores306090120150SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 341.8844.9853.1070.91117.881. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores700K1400K2100K2800K3500KSE +/- 29681.14, N = 3SE +/- 12021.71, N = 3SE +/- 24157.99, N = 3SE +/- 17648.10, N = 4SE +/- 135.81, N = 33069687.782717686.052243103.321520096.93771783.501. (CC) gcc options: -O2 -lrt" -lrt

Primesieve

Length: 1e13

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 12.1Length: 1e13AmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores20406080100SE +/- 0.05, N = 4SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 315.9719.4925.5638.0075.361. (CXX) g++ options: -O3

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 16.1Chess BenchmarkAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores30M60M90M120M150MSE +/- 2347181.46, N = 10SE +/- 2128472.89, N = 12SE +/- 1643187.92, N = 12SE +/- 193507.24, N = 3SE +/- 111315.22, N = 3119077506899115726637902643751905217210771. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores150K300K450K600K750KSE +/- 3030.31, N = 3SE +/- 2815.31, N = 3SE +/- 4489.83, N = 3SE +/- 3457.21, N = 3SE +/- 521.69, N = 36983436047234818273446261892721. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores160K320K480K640K800KSE +/- 3252.04, N = 3SE +/- 5325.21, N = 3SE +/- 191.34, N = 3SE +/- 1384.56, N = 3SE +/- 699.19, N = 37488836015534586133056441547541. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores30K60K90K120K150KSE +/- 347.77, N = 3SE +/- 257.46, N = 3SE +/- 92.18, N = 3SE +/- 15.84, N = 3SE +/- 14.11, N = 31506131211009107160882304721. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores30K60K90K120K150KSE +/- 778.55, N = 3SE +/- 376.38, N = 3SE +/- 87.11, N = 3SE +/- 128.00, N = 3SE +/- 2.00, N = 31498911208869106860787304871. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores80160240320400SE +/- 0.28, N = 3SE +/- 0.79, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 3SE +/- 0.30, N = 3182.74192.20209.93239.77366.01

Parallel BZIP2 Compression

FreeBSD-13.0-RELEASE-amd64-memstick.img Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.13FreeBSD-13.0-RELEASE-amd64-memstick.img CompressionAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores1.19322.38643.57964.77285.966SE +/- 0.005048, N = 11SE +/- 0.008019, N = 11SE +/- 0.014149, N = 10SE +/- 0.008052, N = 9SE +/- 0.003938, N = 71.8378182.0548552.5397203.1285295.3032381. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores714212835SE +/- 0.018, N = 6SE +/- 0.019, N = 6SE +/- 0.009, N = 5SE +/- 0.009, N = 4SE +/- 0.013, N = 36.3717.91310.43715.56230.9631. (CXX) g++ options: -fopenmp -O2 -march=native

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores50100150200250SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3226203178142851. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores70140210280350SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3302253194131661. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores80160240320400SE +/- 2.60, N = 3SE +/- 2.60, N = 3SE +/- 3.18, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3387308249176911. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SwirlAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores2004006008001000SE +/- 2.40, N = 3SE +/- 1.76, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 38076625073481791. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To CompileAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores70140210280350SE +/- 0.45, N = 3SE +/- 0.87, N = 3SE +/- 0.49, N = 3SE +/- 0.34, N = 3SE +/- 0.29, N = 3202.67210.57222.56235.77311.65

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 24.0Time To CompileAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores612182430SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 317.1317.5017.9119.0523.21

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores110220330440550SE +/- 0.26, N = 3SE +/- 0.12, N = 3SE +/- 0.31, N = 3SE +/- 0.38, N = 3SE +/- 0.19, N = 3217.67233.62254.38312.62494.12

Helsing

Digit Range: 14 digit

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores4080120160200SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 335.1843.4058.0887.01174.211. (CC) gcc options: -O2 -pthread

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32AmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores600M1200M1800M2400M3000MSE +/- 176383.42, N = 3SE +/- 688799.28, N = 3SE +/- 1156623.44, N = 3SE +/- 317979.73, N = 3SE +/- 218581.28, N = 329839333332982133333223806666714950333337541933331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput TotalAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores5001000150020002500SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 32354.01883.41470.3994.3551.8MIN: 1414.7 / MAX: 2354.1MIN: 1131.8 / MAX: 1883.5MIN: 848.9MIN: 582.2 / MAX: 994.4MIN: 328.71. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput TotalAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores5K10K15K20K25KSE +/- 93.65, N = 3SE +/- 45.75, N = 3SE +/- 27.16, N = 3SE +/- 47.88, N = 3SE +/- 28.69, N = 421927.618948.215292.211144.95962.01. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

Speedb

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random ReadAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores140M280M420M560M700MSE +/- 430545.53, N = 3SE +/- 114538.19, N = 3SE +/- 119433.65, N = 3SE +/- 1848285.25, N = 3SE +/- 17250.81, N = 36337365045073481573802343052497048061268478421. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores130M260M390M520M650MSE +/- 69890.71, N = 3SE +/- 156369.77, N = 3SE +/- 915048.87, N = 3SE +/- 84410.44, N = 3SE +/- 13970.84, N = 36219206514980758843723094292489328771245676371. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores2M4M6M8M10MSE +/- 75896.27, N = 3SE +/- 119192.66, N = 3SE +/- 38617.89, N = 3SE +/- 91773.39, N = 3SE +/- 19036.70, N = 3915633484600958068069665573034360491. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlyAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores600K1200K1800K2400K3000KSE +/- 51808.62, N = 12SE +/- 23815.94, N = 3SE +/- 38685.91, N = 3SE +/- 22685.45, N = 3SE +/- 3213.57, N = 3277878427422422713588257509011913401. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencyAmpereOne A192-32X @ 160 CoresAmpereOne A192-32X @ 96 CoresAmpereOne A192-32X @ 128 CoresAmpereOne A192-32X @ 64 CoresAmpereOne A192-32X @ 32 Cores0.18880.37760.56640.75520.944SE +/- 0.007, N = 12SE +/- 0.003, N = 3SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 30.3610.3650.3690.3880.8391. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm


Phoronix Test Suite v10.8.5