Linux Distros Emerald Rapids

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2403039-NE-2403025NE90
Run Management

Result Identifier    Date         Test Duration
Ubuntu Linux 23.10   February 25  20 Hours, 3 Minutes
CentOS Stream 9      February 26  13 Hours, 36 Minutes
Fedora Server 39     March 02     17 Hours, 42 Minutes
Arch Linux           March 03     17 Hours, 43 Minutes
Average                           17 Hours, 16 Minutes
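
The final 17 hours, 16 minutes entry in the run list is consistent with the arithmetic mean of the four per-distribution test durations. A minimal Python check of that reading follows; treating the entry as an average is an inference from the numbers, not something the page states.

```python
# Test durations from the run list, as (hours, minutes).
durations = {
    "Ubuntu Linux 23.10": (20, 3),
    "CentOS Stream 9": (13, 36),
    "Fedora Server 39": (17, 42),
    "Arch Linux": (17, 43),
}

# Convert everything to minutes, average, then split back into hours/minutes.
total_minutes = sum(h * 60 + m for h, m in durations.values())
mean_hours, mean_minutes = divmod(total_minutes // len(durations), 60)
print(f"mean duration: {mean_hours} h {mean_minutes} min")
```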


Linux Distros Emerald Rapids: System Details

Reported for Ubuntu Linux 23.10:
  Processor: 2 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads)
  Motherboard: Quanta Cloud QuantaGrid D54Q-2U S6Q-MB-MPS (3B05.TEL4P1 BIOS)
  Chipset: Intel Device 1bce
  Memory: 1008GB
  Disk: 3201GB Micron_7450_MTFDKCB3T2TFS
  Graphics: ASPEED
  Network: 2 x Intel X710 for 10GBASE-T
  OS: Ubuntu 23.10
  Kernel: 6.5.0-17-generic (x86_64)
  Compiler: GCC 13.2.0
  File-System: ext4
  Screen Resolution: 1024x768

Values listed for the other runs (the source table does not repeat fields it treats as unchanged):
  CentOS Stream 9: Disk: 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive; OS: CentOS Stream 9; Kernel: 5.14.0-419.el9.x86_64 (x86_64); Desktop: GNOME Shell 40.10; Display Server: X Server; Compiler: GCC 11.4.1 20231218; File-System: xfs; Screen Resolution: 1920x1200
  Fedora Server 39: OS: Fedora Linux 39; Kernel: 6.7.6-200.fc39.x86_64 (x86_64); Compiler: GCC 13.2.1 20231205; Screen Resolution: 1024x768
  Arch Linux: OS: Arch Linux; Kernel: 6.7.6-arch1-2 (x86_64); Compiler: GCC 13.2.1 20230801; File-System: btrfs; Screen Resolution: 1920x1200

Kernel Details
- Ubuntu Linux 23.10: Transparent Huge Pages: madvise
- CentOS Stream 9: Transparent Huge Pages: always
- Fedora Server 39: Transparent Huge Pages: madvise
- Arch Linux: Transparent Huge Pages: always

Compiler Details
- Ubuntu Linux 23.10: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- CentOS Stream 9: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl
- Fedora Server 39: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
- Arch Linux: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu

Processor Details
- Scaling Governor: intel_pstate powersave (EPP: balance_performance)
- CPU Microcode: 0x21000161

Python Details
- Ubuntu Linux 23.10: Python 3.11.6
- CentOS Stream 9: Python 3.9.18
- Fedora Server 39: Python 3.12.2
- Arch Linux: Python 3.11.7

Security Details
- All four runs report the same mitigation status: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
- CentOS Stream 9 and Fedora Server 39 additionally list SELinux.
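
The kernel details show the four runs split between two Transparent Huge Pages modes (madvise and always). On Linux this setting is exposed in sysfs, with the active mode shown in square brackets. The sketch below is one way to read it when reproducing these runs; the helper name `active_thp_mode` is made up for illustration and is not part of the Phoronix Test Suite.

```python
from pathlib import Path

# Standard sysfs location of the THP setting on Linux kernels built with THP.
THP_PATH = Path("/sys/kernel/mm/transparent_hugepage/enabled")


def active_thp_mode(text: str) -> str:
    """Return the bracketed (active) option from e.g. 'always [madvise] never'."""
    for token in text.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    raise ValueError(f"no active mode found in {text!r}")


if THP_PATH.exists():
    # Only runs on systems that expose the file; skipped elsewhere.
    print("Transparent Huge Pages:", active_thp_mode(THP_PATH.read_text()))
```

Comparing this value across machines before benchmarking helps separate distribution defaults from hardware effects.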

Result Overview (Phoronix Test Suite): chart of relative performance for Ubuntu Linux 23.10, CentOS Stream 9, Fedora Server 39, and Arch Linux on a 100% to 255% axis, covering NAMD, Memcached, GROMACS, ACES DGEMM, easyWave, Embree, 7-Zip Compression, CloverLeaf, Y-Cruncher, LAMMPS Molecular Dynamics Simulator, ClickHouse, Intel Open Image Denoise, Xcompact3d Incompact3d, Redis, OpenVINO, Graph500, SVT-AV1, OSPRay, Quicksilver, VVenC, OpenVKL, and High Performance Conjugate Gradient.

Linux Distros Emerald Rapids: condensed results table (one row per test, one column each for Ubuntu Linux 23.10, CentOS Stream 9, Fedora Server 39, and Arch Linux). Test profiles in the table: graph500, quicksilver, openvino, embree, svt-av1, vvenc, hpcg, mt-dgemm, oidn, tensorflow, openvkl, ospray, deepsparse, compress-7zip, lczero, gromacs, namd, lammps, rocksdb, speedb, memcached, clickhouse, redis, pgbench, ospray-studio, cloverleaf, nwchem, incompact3d, easywave, y-cruncher, rawtherapee, gpaw. The numeric cells were fused together in extraction; the per-test results below carry the individual values.

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

Graph500 3.0 - Scale: 26 (bfs max_TEPS, more is better)
  Arch Linux: 1323480000
  CentOS Stream 9: 1499600000
  Fedora Server 39: 1253250000
  Ubuntu Linux 23.10: 1289580000

Graph500 3.0 - Scale: 26 (bfs median_TEPS, more is better)
  Arch Linux: 1275690000
  CentOS Stream 9: 1464120000
  Fedora Server 39: 1195510000
  Ubuntu Linux 23.10: 1226550000

(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
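
To make the Graph500 spread easier to read, the bfs max_TEPS values (Arch Linux 1323480000, CentOS Stream 9 1499600000, Fedora Server 39 1253250000, Ubuntu Linux 23.10 1289580000) can be normalized against one run. The sketch below picks Ubuntu Linux 23.10 as the 100% reference; that choice is arbitrary and is not taken from the result page.

```python
# bfs max_TEPS from the Graph500 3.0 (Scale: 26) result; more is better.
max_teps = {
    "Arch Linux": 1323480000,
    "CentOS Stream 9": 1499600000,
    "Fedora Server 39": 1253250000,
    "Ubuntu Linux 23.10": 1289580000,
}

baseline = "Ubuntu Linux 23.10"  # arbitrary reference run (assumption)
relative = {name: 100.0 * v / max_teps[baseline] for name, v in max_teps.items()}
fastest = max(max_teps, key=max_teps.get)

# Print runs from fastest to slowest as a percentage of the baseline.
for name, pct in sorted(relative.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:20s} {pct:6.1f}%")
```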

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

Quicksilver 20230818 - Input: CTS2 (Figure Of Merit, more is better)
  Arch Linux: 7723667 (SE +/- 67222.68, N = 9)
  CentOS Stream 9: 7398333 (SE +/- 121572.66, N = 6)
  Fedora Server 39: 7367500 (SE +/- 90618.71, N = 4)
  Ubuntu Linux 23.10: 7680222 (SE +/- 80526.35, N = 9)

Quicksilver 20230818 - Input: CORAL2 P1 (Figure Of Merit, more is better)
  Arch Linux: 5865500 (SE +/- 55910.49, N = 6)
  CentOS Stream 9: 5800833 (SE +/- 105910.76, N = 12)
  Fedora Server 39: 5835818 (SE +/- 42699.12, N = 11)
  Ubuntu Linux 23.10: 5931500 (SE +/- 65941.38, N = 12)

Quicksilver 20230818 - Input: CORAL2 P2 (Figure Of Merit, more is better)
  Arch Linux: 7013889 (SE +/- 67285.55, N = 9)
  CentOS Stream 9: 6381000 (SE +/- 51868.42, N = 3)
  Fedora Server 39: 6759600 (SE +/- 74346.22, N = 5)
  Ubuntu Linux 23.10: 6744778 (SE +/- 81512.06, N = 9)

(CXX) g++ options: -fopenmp -O3 -march=native
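
Several Quicksilver results sit within a few standard errors of each other, so it helps to compare each gap with the reported SE before reading much into the ranking. The sketch below does this for the CTS2 input. It assumes the runs are independent and the means roughly normal, which is a rough heuristic given the small N, and the helper name is illustrative only.

```python
import math

# Quicksilver CTS2 figure of merit: (mean, standard error) per run.
cts2 = {
    "Arch Linux": (7723667, 67222.68),
    "CentOS Stream 9": (7398333, 121572.66),
    "Fedora Server 39": (7367500, 90618.71),
    "Ubuntu Linux 23.10": (7680222, 80526.35),
}


def gap_in_standard_errors(a, b):
    """Difference of two means divided by their combined standard error."""
    (mean_a, se_a), (mean_b, se_b) = a, b
    return (mean_a - mean_b) / math.hypot(se_a, se_b)


z = gap_in_standard_errors(cts2["Arch Linux"], cts2["CentOS Stream 9"])
z_close = gap_in_standard_errors(cts2["Arch Linux"], cts2["Ubuntu Linux 23.10"])
print(f"Arch Linux vs CentOS Stream 9:    {z:.2f} combined SEs apart")
print(f"Arch Linux vs Ubuntu Linux 23.10: {z_close:.2f} combined SEs apart")
```

A gap well under one combined standard error, as between Arch Linux and Ubuntu Linux 23.10 here, is hard to distinguish from run-to-run noise.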

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Device: CPU (FPS, more is better)

Model: Person Detection FP16
  Arch Linux: 392.84 (SE +/- 0.49, N = 3)
  CentOS Stream 9: 328.37 (SE +/- 2.69, N = 3)
  Fedora Server 39: 340.97 (SE +/- 0.63, N = 3)
  Ubuntu Linux 23.10: 375.75 (SE +/- 0.48, N = 3)

Model: Face Detection FP16-INT8
  Arch Linux: 531.53 (SE +/- 6.17, N = 4)
  CentOS Stream 9: 528.79 (SE +/- 1.20, N = 3)
  Fedora Server 39: 529.14 (SE +/- 6.06, N = 4)
  Ubuntu Linux 23.10: 519.83 (SE +/- 5.90, N = 4)

Model: Vehicle Detection FP16-INT8
  Arch Linux: 8956.37 (SE +/- 34.95, N = 3)
  CentOS Stream 9: 8908.74 (SE +/- 7.04, N = 3)
  Fedora Server 39: 8862.07 (SE +/- 9.74, N = 3)
  Ubuntu Linux 23.10: 8901.61 (SE +/- 20.48, N = 3)

Model: Face Detection Retail FP16-INT8
  Arch Linux: 24355.33 (SE +/- 269.84, N = 3)
  CentOS Stream 9: 24295.71 (SE +/- 36.89, N = 3)
  Fedora Server 39: 24296.54 (SE +/- 130.19, N = 3)
  Ubuntu Linux 23.10: 24570.10 (SE +/- 208.25, N = 3)

Model: Road Segmentation ADAS FP16-INT8
  Arch Linux: 2409.76 (SE +/- 6.39, N = 3)
  CentOS Stream 9: 2412.51 (SE +/- 10.50, N = 3)
  Fedora Server 39: 2383.22 (SE +/- 4.46, N = 3)
  Ubuntu Linux 23.10: 2377.90 (SE +/- 15.67, N = 3)

Model: Machine Translation EN To DE FP16
  Arch Linux: 1092.75 (SE +/- 11.59, N = 3)
  CentOS Stream 9: 617.45 (SE +/- 13.25, N = 15)
  Fedora Server 39: 978.70 (SE +/- 8.04, N = 9)
  Ubuntu Linux 23.10: 1143.25 (SE +/- 4.60, N = 3)

Model: Weld Porosity Detection FP16-INT8
  Arch Linux: 41472.25 (SE +/- 312.18, N = 3)
  CentOS Stream 9: 40187.34 (SE +/- 347.22, N = 3)
  Fedora Server 39: 42419.89 (SE +/- 194.22, N = 3)
  Ubuntu Linux 23.10: 43516.98 (SE +/- 43.54, N = 3)

Model: Person Vehicle Bike Detection FP16
  Arch Linux: 10197.22 (SE +/- 125.31, N = 3)
  CentOS Stream 9: 10113.47 (SE +/- 104.96, N = 5)
  Fedora Server 39: 10082.93 (SE +/- 124.93, N = 4)
  Ubuntu Linux 23.10: 10033.95 (SE +/- 140.94, N = 3)

Model: Handwritten English Recognition FP16-INT8
  Arch Linux: 3189.70 (SE +/- 3.83, N = 3)
  CentOS Stream 9: 3196.47 (SE +/- 1.03, N = 3)
  Fedora Server 39: 3255.16 (SE +/- 4.78, N = 3)
  Ubuntu Linux 23.10: 3217.84 (SE +/- 1.65, N = 3)

Model: Age Gender Recognition Retail 0013 FP16-INT8
  Arch Linux: 64056.00 (SE +/- 941.42, N = 15)
  CentOS Stream 9: 60429.54 (SE +/- 478.67, N = 3)
  Fedora Server 39: 56624.98 (SE +/- 271.02, N = 3)
  Ubuntu Linux 23.10: 61020.16 (SE +/- 827.51, N = 15)

(CXX) g++ options, all OpenVINO results: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Per-run extra flags, in the run order above: -pie; -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF; -pie; -pie

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.3 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, more is better)
  Arch Linux: 118.74 (SE +/- 0.13, N = 3; MIN: 113.06 / MAX: 135.85)
  CentOS Stream 9: 105.62 (SE +/- 1.24, N = 15; MIN: 95.01 / MAX: 132.75)
  Fedora Server 39: 142.17 (SE +/- 1.32, N = 15; MIN: 115.81 / MAX: 164.58)
  Ubuntu Linux 23.10: 100.34 (SE +/- 0.25, N = 3; MIN: 95.92 / MAX: 109.46)

Embree 4.3 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, more is better)
  Arch Linux: 148.60 (SE +/- 0.19, N = 3; MIN: 140.37 / MAX: 169.21)
  CentOS Stream 9: 177.80 (SE +/- 3.90, N = 15; MIN: 142.88 / MAX: 210.57)
  Fedora Server 39: 179.54 (SE +/- 0.73, N = 3; MIN: 145.33 / MAX: 207.47)
  Ubuntu Linux 23.10: 144.92 (SE +/- 0.50, N = 3; MIN: 137.85 / MAX: 155.04)

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format, tested here with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 1.8 - Input: Bosphorus 4K (Frames Per Second, more is better)

Encoder Mode: Preset 4
  Arch Linux: 2.395 (SE +/- 0.009, N = 3)
  CentOS Stream 9: 2.358 (SE +/- 0.009, N = 3)
  Fedora Server 39: 2.346 (SE +/- 0.008, N = 3)
  Ubuntu Linux 23.10: 2.307 (SE +/- 0.007, N = 3)

Encoder Mode: Preset 8
  Arch Linux: 22.05 (SE +/- 0.08, N = 3)
  CentOS Stream 9: 21.56 (SE +/- 0.21, N = 3)
  Fedora Server 39: 21.16 (SE +/- 0.10, N = 3)
  Ubuntu Linux 23.10: 21.32 (SE +/- 0.19, N = 3)

Encoder Mode: Preset 12
  Arch Linux: 73.72 (SE +/- 0.15, N = 3)
  CentOS Stream 9: 69.87 (SE +/- 0.39, N = 3)
  Fedora Server 39: 71.52 (SE +/- 0.99, N = 3)
  Ubuntu Linux 23.10: 66.86 (SE +/- 0.48, N = 15)

Encoder Mode: Preset 13
  Arch Linux: 72.91 (SE +/- 0.36, N = 3)
  CentOS Stream 9: 69.72 (SE +/- 0.99, N = 3)
  Fedora Server 39: 70.87 (SE +/- 0.53, N = 3)
  Ubuntu Linux 23.10: 66.43 (SE +/- 0.74, N = 5)

(CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

VVenC

VVenC is the Fraunhofer Versatile Video Encoder, a fast and efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

VVenC 1.11 - Video Input: Bosphorus 4K (Frames Per Second, more is better)

Video Preset: Fast
  Arch Linux: 2.900 (SE +/- 0.007, N = 3)
  CentOS Stream 9: 2.823 (SE +/- 0.026, N = 3)
  Fedora Server 39: 2.834 (SE +/- 0.017, N = 3)
  Ubuntu Linux 23.10: 2.785 (SE +/- 0.035, N = 3)

Video Preset: Faster
  Arch Linux: 5.942 (SE +/- 0.050, N = 3)
  CentOS Stream 9: 5.808 (SE +/- 0.013, N = 3)
  Fedora Server 39: 5.804 (SE +/- 0.035, N = 3)
  Ubuntu Linux 23.10: 5.611 (SE +/- 0.061, N = 4)

(CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a newer scientific benchmark from Sandia National Labs aimed at supercomputer testing with modern real-world workloads, in contrast to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 - RT: 60 (GFLOP/s, more is better)

X Y Z: 104 104 104
  Arch Linux: 71.13 (SE +/- 0.32, N = 3)
  CentOS Stream 9: 71.13 (SE +/- 0.12, N = 3)
  Fedora Server 39: 72.46 (SE +/- 0.14, N = 3)
  Ubuntu Linux 23.10: 71.14 (SE +/- 0.19, N = 3)

X Y Z: 144 144 144
  Arch Linux: 70.12 (SE +/- 0.59, N = 3)
  CentOS Stream 9: 69.17 (SE +/- 0.83, N = 4)
  Fedora Server 39: 70.38 (SE +/- 0.27, N = 3)
  Ubuntu Linux 23.10: 69.34 (SE +/- 0.52, N = 3)

(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi
The extra flag -lmpi_cxx is listed for three of the four runs; the flattened source does not show which three.

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0 - Sustained Floating-Point Rate (GFLOP/s, more is better)
  Arch Linux: 43.67 (SE +/- 0.91, N = 13)
  CentOS Stream 9: 63.64 (SE +/- 0.51, N = 3)
  Fedora Server 39: 37.72 (SE +/- 0.04, N = 3)
  Ubuntu Linux 23.10: 37.97 (SE +/- 0.09, N = 3)

(CC) gcc options: -O3 -march=native -fopenmp

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray tracing and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 2.2 - Device: CPU-Only (Images / Sec, more is better)

Run: RT.hdr_alb_nrm.3840x2160
  Arch Linux: 4.75 (SE +/- 0.01, N = 3)
  CentOS Stream 9: 5.04 (SE +/- 0.07, N = 3)
  Fedora Server 39: 4.68 (SE +/- 0.06, N = 3)
  Ubuntu Linux 23.10: 4.53 (SE +/- 0.02, N = 3)

Run: RT.ldr_alb_nrm.3840x2160
  Arch Linux: 4.75 (SE +/- 0.02, N = 3)
  CentOS Stream 9: 4.90 (SE +/- 0.03, N = 3)
  Fedora Server 39: 4.70 (SE +/- 0.03, N = 15)
  Ubuntu Linux 23.10: 4.54 (SE +/- 0.06, N = 3)

Run: RTLightmap.hdr.4096x4096
  Arch Linux: 2.36 (SE +/- 0.03, N = 3)
  CentOS Stream 9: 2.51 (SE +/- 0.03, N = 4)
  Fedora Server 39: 2.25 (SE +/- 0.01, N = 15)
  Ubuntu Linux 23.10: 2.18 (SE +/- 0.01, N = 3)

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 512 - Model: ResNet-50 (images/sec, more is better)
  Arch Linux: 77.06 (SE +/- 0.61, N = 3)
  CentOS Stream 9: 76.68 (SE +/- 0.83, N = 4)
  Ubuntu Linux 23.10: 73.12 (SE +/- 0.84, N = 3)

Device: CPU - Batch Size: 512 - Model: ResNet-50

Fedora Server 39: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'
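
The Fedora Server 39 failure is an ordinary Python import error: the interpreter used for the test could not find the `absl` module, which is normally provided by the absl-py package. When reproducing a run, a quick pre-flight check such as the sketch below can catch this before hours of benchmarking. The helper name `missing_modules` is illustrative, and the list of modules to check is an assumption based only on the error message above.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that the current interpreter cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# 'absl' is the module named in the Fedora Server 39 error message.
required = ["absl"]
missing = missing_modules(required)
if missing:
    print("missing Python modules:", ", ".join(missing))
else:
    print("all required Python modules found")
```

This has to be run with the same Python interpreter the benchmark uses, since each of the four distributions ships a different Python version.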

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 2.0.0 - Benchmark: vklBenchmarkCPU ISPC (Items / Sec, more is better)
  Arch Linux: 2951 (SE +/- 20.04, N = 3; MIN: 193 / MAX: 35719)
  CentOS Stream 9: 2815 (SE +/- 33.05, N = 3; MIN: 186 / MAX: 35204)
  Fedora Server 39: 2821 (SE +/- 22.66, N = 3; MIN: 178 / MAX: 36193)
  Ubuntu Linux 23.10: 2821 (SE +/- 20.99, N = 3; MIN: 175 / MAX: 34505)

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay 3.1, Benchmark: particle_volume/ao/real_time (Items Per Second, More Is Better)
Arch Linux: 20.67 (SE +/- 0.05, N = 3)
CentOS Stream 9: 20.50 (SE +/- 0.01, N = 3)
Fedora Server 39: 20.44 (SE +/- 0.06, N = 3)
Ubuntu Linux 23.10: 20.65 (SE +/- 0.04, N = 3)

OSPRay 3.1, Benchmark: particle_volume/scivis/real_time (Items Per Second, More Is Better)
Arch Linux: 20.78 (SE +/- 0.04, N = 3)
CentOS Stream 9: 20.44 (SE +/- 0.02, N = 3)
Fedora Server 39: 20.54 (SE +/- 0.02, N = 3)
Ubuntu Linux 23.10: 20.77 (SE +/- 0.08, N = 3)

OSPRay 3.1, Benchmark: particle_volume/pathtracer/real_time (Items Per Second, More Is Better)
Arch Linux: 82.51 (SE +/- 0.87, N = 4)
CentOS Stream 9: 83.57 (SE +/- 0.23, N = 3)
Fedora Server 39: 82.14 (SE +/- 0.58, N = 3)
Ubuntu Linux 23.10: 82.97 (SE +/- 0.35, N = 3)

OSPRay 3.1, Benchmark: gravity_spheres_volume/dim_512/ao/real_time (Items Per Second, More Is Better)
Arch Linux: 34.05 (SE +/- 0.29, N = 3)
CentOS Stream 9: 29.76 (SE +/- 0.26, N = 15)
Fedora Server 39: 28.66 (SE +/- 0.10, N = 3)
Ubuntu Linux 23.10: 28.61 (SE +/- 0.05, N = 3)

OSPRay 3.1, Benchmark: gravity_spheres_volume/dim_512/scivis/real_time (Items Per Second, More Is Better)
Arch Linux: 31.58 (SE +/- 0.48, N = 15)
CentOS Stream 9: 29.51 (SE +/- 0.17, N = 3)
Fedora Server 39: 28.43 (SE +/- 0.17, N = 3)
Ubuntu Linux 23.10: 28.92 (SE +/- 0.11, N = 3)

OSPRay 3.1, Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time (Items Per Second, More Is Better)
Arch Linux: 33.37 (SE +/- 0.10, N = 3)
CentOS Stream 9: 33.29 (SE +/- 0.48, N = 15)
Fedora Server 39: 31.70 (SE +/- 0.12, N = 3)
Ubuntu Linux 23.10: 32.32 (SE +/- 0.05, N = 3)

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 134.21 (SE +/- 0.47, N = 3)
CentOS Stream 9: 132.91 (SE +/- 0.30, N = 3)
Ubuntu Linux 23.10: 132.68 (SE +/- 0.29, N = 3)

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 4534.89 (SE +/- 55.62, N = 3)
CentOS Stream 9: 4462.53 (SE +/- 40.12, N = 7)
Ubuntu Linux 23.10: 4547.54 (SE +/- 40.60, N = 7)

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 1868.77 (SE +/- 9.98, N = 3)
CentOS Stream 9: 1863.34 (SE +/- 4.86, N = 3)
Ubuntu Linux 23.10: 1837.11 (SE +/- 7.36, N = 3)

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 11213.16 (SE +/- 102.84, N = 7)
CentOS Stream 9: 11022.30 (SE +/- 43.63, N = 3)
Ubuntu Linux 23.10: 11099.64 (SE +/- 102.73, N = 7)

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 834.19 (SE +/- 2.18, N = 3)
CentOS Stream 9: 827.99 (SE +/- 4.43, N = 3)
Ubuntu Linux 23.10: 829.51 (SE +/- 5.31, N = 3)

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 154.32 (SE +/- 1.36, N = 3)
CentOS Stream 9: 152.57 (SE +/- 0.32, N = 3)
Ubuntu Linux 23.10: 155.82 (SE +/- 0.87, N = 3)

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 1839.98 (SE +/- 7.12, N = 3)
CentOS Stream 9: 1866.52 (SE +/- 3.56, N = 3)
Ubuntu Linux 23.10: 1826.04 (SE +/- 16.26, N = 3)

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 857.85 (SE +/- 8.02, N = 3)
CentOS Stream 9: 853.34 (SE +/- 8.06, N = 3)
Ubuntu Linux 23.10: 853.21 (SE +/- 6.05, N = 3)

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 1227.37 (SE +/- 12.68, N = 3)
CentOS Stream 9: 1223.65 (SE +/- 15.64, N = 3)
Ubuntu Linux 23.10: 1223.16 (SE +/- 14.85, N = 3)

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 181.33 (SE +/- 2.15, N = 4)
CentOS Stream 9: 173.95 (SE +/- 0.21, N = 3)
Ubuntu Linux 23.10: 176.57 (SE +/- 1.16, N = 3)

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 1878.45 (SE +/- 22.85, N = 3)
CentOS Stream 9: 1810.95 (SE +/- 16.06, N = 3)
Ubuntu Linux 23.10: 1866.23 (SE +/- 20.52, N = 3)

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6, Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Arch Linux: 134.93 (SE +/- 0.39, N = 3)
CentOS Stream 9: 133.28 (SE +/- 0.09, N = 3)
Ubuntu Linux 23.10: 133.21 (SE +/- 0.24, N = 3)

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01, Test: Compression Rating (MIPS, More Is Better)
Arch Linux: 759279 (SE +/- 9120.21, N = 15)
CentOS Stream 9: 751070 (SE +/- 5583.39, N = 15)
Fedora Server 39: 496138 (SE +/- 4508.27, N = 7)
Ubuntu Linux 23.10: 464813 (SE +/- 4291.24, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 22.01, Test: Decompression Rating (MIPS, More Is Better)
Arch Linux: 482076 (SE +/- 19128.23, N = 15)
CentOS Stream 9: 434200 (SE +/- 2615.74, N = 15)
Fedora Server 39: 428601 (SE +/- 14994.49, N = 7)
Ubuntu Linux 23.10: 455056 (SE +/- 39319.74, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.30, Backend: BLAS (Nodes Per Second, More Is Better)
Ubuntu Linux 23.10: 15 (SE +/- 1.18, N = 9)
(CXX) g++ options: -flto -pthread

Backend: BLAS

CentOS Stream 9: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

Fedora Server 39: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

Arch Linux: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

LeelaChessZero 0.30, Backend: Eigen (Nodes Per Second, More Is Better)
Ubuntu Linux 23.10: 380 (SE +/- 4.52, N = 4)
(CXX) g++ options: -flto -pthread

Backend: Eigen

CentOS Stream 9: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

Fedora Server 39: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

Arch Linux: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package using the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2024, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
Arch Linux: 13.638 (SE +/- 1.257, N = 15)
CentOS Stream 9: 5.939 (SE +/- 0.323, N = 12)
Fedora Server 39: 7.180 (SE +/- 0.850, N = 12)
Ubuntu Linux 23.10: 9.630 (SE +/- 1.403, N = 7)
(CXX) g++ options: -O3 -lm

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 3.0b6, Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
Arch Linux: 5.59956 (SE +/- 0.21129, N = 12)
CentOS Stream 9: 3.20328 (SE +/- 0.30866, N = 12)
Fedora Server 39: 5.40706 (SE +/- 0.33952, N = 15)
Ubuntu Linux 23.10: 4.70711 (SE +/- 0.33618, N = 15)

NAMD 3.0b6, Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
Arch Linux: 1.74784 (SE +/- 0.07631, N = 15)
CentOS Stream 9: 0.50251 (SE +/- 0.04889, N = 12)
Fedora Server 39: 1.92977 (SE +/- 0.05361, N = 15)
Ubuntu Linux 23.10: 1.49004 (SE +/- 0.14829, N = 15)

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: 20k Atoms (ns/day, More Is Better)
Arch Linux: 62.99 (SE +/- 0.44, N = 3)
CentOS Stream 9: 57.84 (SE +/- 0.46, N = 3)
Fedora Server 39: 57.68 (SE +/- 0.40, N = 3)
Ubuntu Linux 23.10: 56.68 (SE +/- 0.42, N = 3)
(CXX) g++ options: -O3 -lm -ldl

LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: Rhodopsin Protein (ns/day, More Is Better)
Arch Linux: 42.75 (SE +/- 0.85, N = 12)
CentOS Stream 9: 39.80 (SE +/- 0.76, N = 15)
Fedora Server 39: 39.01 (SE +/- 0.95, N = 15)
Ubuntu Linux 23.10: 37.52 (SE +/- 0.93, N = 15)
(CXX) g++ options: -O3 -lm -ldl

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 8.0, Test: Random Read (Op/s, More Is Better)
Arch Linux: 608345894 (SE +/- 7708164.70, N = 3)
Fedora Server 39: 652854125 (SE +/- 3865049.53, N = 3)
Ubuntu Linux 23.10: 599822046 (SE +/- 7527805.83, N = 15)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti (one result additionally lists -lpthread)

Test: Random Read

CentOS Stream 9: The test quit with a non-zero exit status. E: rocksdb: line 4: ./db_bench: No such file or directory

RocksDB 8.0, Test: Update Random (Op/s, More Is Better)
Arch Linux: 127451 (SE +/- 1143.37, N = 3)
Fedora Server 39: 123092 (SE +/- 1187.74, N = 3)
Ubuntu Linux 23.10: 125560 (SE +/- 440.29, N = 3)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti (one result additionally lists -lpthread)

Test: Update Random

CentOS Stream 9: The test quit with a non-zero exit status. E: rocksdb: line 4: ./db_bench: No such file or directory

RocksDB 8.0, Test: Read While Writing (Op/s, More Is Better)
Arch Linux: 9385652 (SE +/- 87416.61, N = 3)
Fedora Server 39: 8567006 (SE +/- 133209.05, N = 14)
Ubuntu Linux 23.10: 9775989 (SE +/- 81611.31, N = 8)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti (one result additionally lists -lpthread)

Test: Read While Writing

CentOS Stream 9: The test quit with a non-zero exit status. E: rocksdb: line 4: ./db_bench: No such file or directory

RocksDB 8.0, Test: Read Random Write Random (Op/s, More Is Better)
Arch Linux: 932034 (SE +/- 9325.25, N = 15)
Fedora Server 39: 1184837 (SE +/- 13512.70, N = 15)
Ubuntu Linux 23.10: 955461 (SE +/- 5328.60, N = 3)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti (one result additionally lists -lpthread)

Test: Read Random Write Random

CentOS Stream 9: The test quit with a non-zero exit status. E: rocksdb: line 4: ./db_bench: No such file or directory

Speedb

Speedb is a next-generation key-value storage engine that is RocksDB compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.

Speedb 2.7, Test: Random Read (Op/s, More Is Better)
Arch Linux: 616573385 (SE +/- 6923132.62, N = 4)
Fedora Server 39: 626261826 (SE +/- 8524314.28, N = 3)
Ubuntu Linux 23.10: 600703384 (SE +/- 5282775.06, N = 8)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti (one result additionally lists -lpthread)

Test: Random Read

CentOS Stream 9: The test quit with a non-zero exit status. E: speedb: line 4: ./db_bench: No such file or directory

Memcached

Memcached is a high-performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

Memcached 1.6.19, Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
Arch Linux: 5462773.67 (SE +/- 45099.83, N = 8)
CentOS Stream 9: 3337339.10 (SE +/- 43085.73, N = 13)
Fedora Server 39: 5054518.76 (SE +/- 65254.44, N = 3)
Ubuntu Linux 23.10: 3370014.38 (SE +/- 36897.17, N = 5)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached 1.6.19, Set To Get Ratio: 1:100 (Ops/sec, More Is Better)
Arch Linux: 9633352.44 (SE +/- 121500.81, N = 15)
CentOS Stream 9: 2615816.73 (SE +/- 20448.57, N = 10)
Fedora Server 39: 8983240.52 (SE +/- 380886.14, N = 12)
Ubuntu Linux 23.10: 3477137.28 (SE +/- 20632.12, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, First Run / Cold Cache (Queries Per Minute, Geo Mean, More Is Better)
Arch Linux: 209.64 (SE +/- 2.35, N = 3; MIN: 32.84 / MAX: 1764.71)
CentOS Stream 9: 183.62 (SE +/- 0.84, N = 3; MIN: 34.56 / MAX: 1714.29)
Fedora Server 39: 188.71 (SE +/- 2.54, N = 3; MIN: 30.08 / MAX: 1428.57)
Ubuntu Linux 23.10: 205.20 (SE +/- 2.70, N = 3; MIN: 35.4 / MAX: 1818.18)

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Second Run (Queries Per Minute, Geo Mean, More Is Better)
Arch Linux: 221.80 (SE +/- 1.76, N = 3; MIN: 36.97 / MAX: 1875)
CentOS Stream 9: 200.18 (SE +/- 3.30, N = 3; MIN: 37.41 / MAX: 2000)
Fedora Server 39: 202.14 (SE +/- 1.90, N = 3; MIN: 36.41 / MAX: 1463.41)
Ubuntu Linux 23.10: 214.31 (SE +/- 1.67, N = 3; MIN: 36.39 / MAX: 1764.71)

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Third Run (Queries Per Minute, Geo Mean, More Is Better)
Arch Linux: 221.71 (SE +/- 0.45, N = 3; MIN: 35.63 / MAX: 2068.97)
CentOS Stream 9: 201.05 (SE +/- 2.16, N = 3; MIN: 36.08 / MAX: 1500)
Fedora Server 39: 206.83 (SE +/- 3.01, N = 3; MIN: 35.91 / MAX: 1935.48)
Ubuntu Linux 23.10: 215.62 (SE +/- 3.81, N = 3; MIN: 37.43 / MAX: 1714.29)
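
As a rough illustration of the ClickHouse metric, the sketch below converts per-query times into queries per minute and takes their geometric mean. The function name and the sample timings are hypothetical and not taken from the test profile. The 60 / seconds conversion is an assumption, though it is consistent with the MIN/MAX values reported above (for example, 1764.71 corresponds to a 0.034 second query).

```python
import math


def queries_per_minute_geomean(times_s):
    """Geometric mean of per-query rates, in queries per minute.

    times_s: iterable of per-query processing times in seconds (all > 0).
    """
    rates = [60.0 / t for t in times_s]  # assumed conversion: one query taking t seconds
    # Average the logs, then exponentiate, to avoid overflow on long query lists.
    return math.exp(sum(math.log(r) for r in rates) / len(rates))


if __name__ == "__main__":
    # Hypothetical timings for four queries, in seconds.
    timings = [0.034, 0.25, 1.2, 1.8]
    print(f"{queries_per_minute_geomean(timings):.2f} queries per minute (geo mean)")
```

A geometric mean keeps a handful of very fast or very slow queries from dominating the aggregate, which is presumably why it is used here instead of an arithmetic mean.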

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.4, Test: GET - Parallel Connections: 500 (Requests Per Second, More Is Better)
Arch Linux: 1777534.65 (SE +/- 19370.21, N = 15)
CentOS Stream 9: 1821521.39 (SE +/- 19161.91, N = 15)
Fedora Server 39: 1838671.43 (SE +/- 30474.05, N = 15)
Ubuntu Linux 23.10: 1541713.80 (SE +/- 15103.72, N = 15)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 7.0.4, Test: SET - Parallel Connections: 500 (Requests Per Second, More Is Better)
Arch Linux: 2231227.61 (SE +/- 46614.18, N = 15)
CentOS Stream 9: 1978937.18 (SE +/- 47416.25, N = 15)
Fedora Server 39: 2125843.97 (SE +/- 73166.31, N = 15)
Ubuntu Linux 23.10: 2134872.89 (SE +/- 48178.71, N = 15)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

Graph500 3.0, Scale: 26 (sssp max_TEPS, More Is Better)
Arch Linux: 692893000
CentOS Stream 9: 640794000
Fedora Server 39: 663412000
Ubuntu Linux 23.10: 664991000
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500 3.0, Scale: 26 (sssp median_TEPS, More Is Better)
Arch Linux: 509964000
CentOS Stream 9: 466398000
Fedora Server 39: 493031000
Ubuntu Linux 23.10: 483534000
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
Arch Linux: 698734 (SE +/- 16477.93, N = 12)
Ubuntu Linux 23.10: 679317 (SE +/- 22056.84, N = 12)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

CentOS Stream 9: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

Fedora Server 39: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better)
Arch Linux: 18536 (SE +/- 280.04, N = 12)
Ubuntu Linux 23.10: 39542 (SE +/- 1662.84, N = 9)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

CentOS Stream 9: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

Fedora Server 39: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
Arch Linux: 1.440 (SE +/- 0.035, N = 12)
Ubuntu Linux 23.10: 1.489 (SE +/- 0.047, N = 12)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

CentOS Stream 9: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

Fedora Server 39: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
Arch Linux: 54.09 (SE +/- 0.82, N = 12)
Ubuntu Linux 23.10: 25.62 (SE +/- 0.97, N = 9)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

CentOS Stream 9: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory

Fedora Server 39: The test run did not produce a result. E: pgbench: line 21: pg_/bin/pgbench: No such file or directory
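
The PostgreSQL throughput and average latency figures above are closely related: with a fixed number of clients each waiting on one transaction at a time, average latency is roughly the client count divided by TPS. The following is a minimal sketch assuming that relationship holds for this pgbench configuration; the helper name is hypothetical, and the TPS and latency numbers are the ones reported above.

```python
def expected_latency_ms(clients, tps):
    """Estimate average latency in milliseconds from client count and TPS."""
    return clients / tps * 1000.0


# (label, reported TPS, reported average latency in ms), all with 1000 clients
reported = [
    ("Arch Linux, Read Only", 698734, 1.440),
    ("Ubuntu Linux 23.10, Read Only", 679317, 1.489),
    ("Arch Linux, Read Write", 18536, 54.09),
    ("Ubuntu Linux 23.10, Read Write", 39542, 25.62),
]

if __name__ == "__main__":
    for label, tps, latency in reported:
        estimate = expected_latency_ms(1000, tps)
        print(f"{label}: estimated {estimate:.3f} ms, reported {latency} ms")
```

The estimates land within about 2 percent of the reported latencies. The small gap is plausibly because both figures are averages over several runs, and the mean of a reciprocal is not the reciprocal of the mean.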

OpenVINO

This is a test of Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev, Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 81.35 (SE +/- 0.10, N = 3; MIN: 38.9 / MAX: 148.69; extra options: -pie)
CentOS Stream 9: 97.34 (SE +/- 0.81, N = 3; MIN: 33.82 / MAX: 325.84; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 93.74 (SE +/- 0.18, N = 3; MIN: 34.75 / MAX: 283.24; extra options: -pie)
Ubuntu Linux 23.10: 85.06 (SE +/- 0.11, N = 3; MIN: 39.38 / MAX: 260.96; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 240.65 (SE +/- 2.90, N = 4; MIN: 175.12 / MAX: 455.8; extra options: -pie)
CentOS Stream 9: 241.67 (SE +/- 0.55, N = 3; MIN: 160.04 / MAX: 556.9; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 241.70 (SE +/- 2.84, N = 4; MIN: 163.92 / MAX: 559.83; extra options: -pie)
Ubuntu Linux 23.10: 245.98 (SE +/- 2.84, N = 4; MIN: 176.44 / MAX: 390.61; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 14.28 (SE +/- 0.06, N = 3; MIN: 12.05 / MAX: 46.92; extra options: -pie)
CentOS Stream 9: 14.36 (SE +/- 0.01, N = 3; MIN: 12.55 / MAX: 76.83; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 14.43 (SE +/- 0.02, N = 3; MIN: 12.18 / MAX: 71.66; extra options: -pie)
Ubuntu Linux 23.10: 14.37 (SE +/- 0.03, N = 3; MIN: 12.05 / MAX: 72.49; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 5.24 (SE +/- 0.06, N = 3; MIN: 4.62 / MAX: 25.2; extra options: -pie)
CentOS Stream 9: 5.25 (SE +/- 0.01, N = 3; MIN: 4.69 / MAX: 58.72; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 5.25 (SE +/- 0.03, N = 3; MIN: 4.62 / MAX: 31.26; extra options: -pie)
Ubuntu Linux 23.10: 5.20 (SE +/- 0.05, N = 3; MIN: 4.59 / MAX: 31.78; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 53.08 (SE +/- 0.14, N = 3; MIN: 44.51 / MAX: 131.9; extra options: -pie)
CentOS Stream 9: 53.02 (SE +/- 0.23, N = 3; MIN: 42.98 / MAX: 178.63; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 53.67 (SE +/- 0.10, N = 3; MIN: 43.29 / MAX: 161.25; extra options: -pie)
Ubuntu Linux 23.10: 53.79 (SE +/- 0.36, N = 3; MIN: 44.62 / MAX: 198.12; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 29.23 (SE +/- 0.30, N = 3; MIN: 21.95 / MAX: 117.93; extra options: -pie)
CentOS Stream 9: 52.09 (SE +/- 1.14, N = 15; MIN: 17.25 / MAX: 578.83; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 32.67 (SE +/- 0.27, N = 9; MIN: 19.33 / MAX: 484.58; extra options: -pie)
Ubuntu Linux 23.10: 27.93 (SE +/- 0.11, N = 3; MIN: 19.99 / MAX: 261.46; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 2.94 (SE +/- 0.02, N = 3; MIN: 2.09 / MAX: 30.51; extra options: -pie)
CentOS Stream 9: 3.02 (SE +/- 0.02, N = 3; MIN: 2.07 / MAX: 29.73; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 2.86 (SE +/- 0.01, N = 3; MIN: 1.94 / MAX: 41.9; extra options: -pie)
Ubuntu Linux 23.10: 2.85 (SE +/- 0.00, N = 3; MIN: 2.08 / MAX: 32.04; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 12.54 (SE +/- 0.16, N = 3; MIN: 10.74 / MAX: 50.52; extra options: -pie)
CentOS Stream 9: 12.65 (SE +/- 0.14, N = 5; MIN: 10.9 / MAX: 74.36; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 12.69 (SE +/- 0.16, N = 4; MIN: 10.9 / MAX: 75.69; extra options: -pie)
Ubuntu Linux 23.10: 12.74 (SE +/- 0.18, N = 3; MIN: 10.88 / MAX: 73.62; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 40.11 (SE +/- 0.05, N = 3; MIN: 38.25 / MAX: 97.25; extra options: -pie)
CentOS Stream 9: 40.02 (SE +/- 0.01, N = 3; MIN: 33.6 / MAX: 102.02; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 39.30 (SE +/- 0.06, N = 3; MIN: 33.26 / MAX: 101.52; extra options: -pie)
Ubuntu Linux 23.10: 39.76 (SE +/- 0.02, N = 3; MIN: 37.84 / MAX: 104.98; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO 2023.2.dev, Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Arch Linux: 0.77 (SE +/- 0.01, N = 15; MIN: 0.19 / MAX: 28.17; extra options: -pie)
CentOS Stream 9: 0.83 (SE +/- 0.00, N = 3; MIN: 0.21 / MAX: 20.36; extra options: -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF)
Fedora Server 39: 0.83 (SE +/- 0.00, N = 3; MIN: 0.2 / MAX: 31.65; extra options: -pie)
Ubuntu Linux 23.10: 0.80 (SE +/- 0.00, N = 15; MIN: 0.2 / MAX: 28.24; extra options: -pie)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay Studio 1.0, Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 874 (SE +/- 3.93, N = 3)
Fedora Server 39: 902 (SE +/- 9.82, N = 4)
Ubuntu Linux 23.10: 914 (SE +/- 3.48, N = 3)

Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0, Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 867 (SE +/- 3.18, N = 3)
Fedora Server 39: 897 (SE +/- 2.03, N = 3)
Ubuntu Linux 23.10: 911 (SE +/- 3.53, N = 3)

Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0, Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 1026 (SE +/- 2.33, N = 3)
Fedora Server 39: 1058 (SE +/- 2.96, N = 3)
Ubuntu Linux 23.10: 1079 (SE +/- 2.40, N = 3)

Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0, Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 11575 (SE +/- 32.00, N = 3)
Fedora Server 39: 12187 (SE +/- 22.15, N = 3)
Ubuntu Linux 23.10: 12086 (SE +/- 3.51, N = 3)

Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0, Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 23884 (SE +/- 740.65, N = 15)
Fedora Server 39: 28035 (SE +/- 1257.60, N = 15)
Ubuntu Linux 23.10: 32279 (SE +/- 481.26, N = 15)

Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0, Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 11658 (SE +/- 13.00, N = 3)
Fedora Server 39: 12262 (SE +/- 36.03, N = 3)
Ubuntu Linux 23.10: 12243 (SE +/- 25.12, N = 3)

Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0, Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
Arch Linux: 23687 (SE +/- 467.39, N = 15)
Fedora Server 39: 27524 (SE +/- 1255.46, N = 15)
Ubuntu Linux 23.10: 33174 (SE +/- 600.94, N = 15)

Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 13639 (SE +/- 37.30, N = 3)
  Fedora Server 39: 14422 (SE +/- 37.95, N = 3)
  Ubuntu Linux 23.10: 14508 (SE +/- 177.62, N = 4)

Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 33976 (SE +/- 138.45, N = 3)
  Fedora Server 39: 34698 (SE +/- 119.27, N = 3)
  Ubuntu Linux 23.10: 37285 (SE +/- 564.37, N = 15)

Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 226 (SE +/- 0.33, N = 3)
  Fedora Server 39: 235 (SE +/- 0.88, N = 3)
  Ubuntu Linux 23.10: 234 (SE +/- 0.33, N = 3)

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 223 (SE +/- 0.00, N = 3)
  Fedora Server 39: 233 (SE +/- 0.33, N = 3)
  Ubuntu Linux 23.10: 233 (SE +/- 0.33, N = 3)

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 265 (SE +/- 0.67, N = 3)
  Fedora Server 39: 276 (SE +/- 1.45, N = 3)
  Ubuntu Linux 23.10: 276 (SE +/- 0.00, N = 3)

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 3591 (SE +/- 21.20, N = 3)
  Fedora Server 39: 3683 (SE +/- 16.22, N = 3)
  Ubuntu Linux 23.10: 3718 (SE +/- 3.21, N = 3)

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 6380 (SE +/- 40.01, N = 3)
  Fedora Server 39: 6314 (SE +/- 13.35, N = 3)
  Ubuntu Linux 23.10: 6448 (SE +/- 77.79, N = 4)

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 3544 (SE +/- 5.13, N = 3)
  Fedora Server 39: 3655 (SE +/- 43.59, N = 3)
  Ubuntu Linux 23.10: 3694 (SE +/- 7.45, N = 3)

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 6399 (SE +/- 44.43, N = 3)
  Fedora Server 39: 6303 (SE +/- 28.01, N = 3)
  Ubuntu Linux 23.10: 6324 (SE +/- 60.98, N = 6)

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 4173 (SE +/- 13.25, N = 3)
  Fedora Server 39: 4377 (SE +/- 21.55, N = 3)
  Ubuntu Linux 23.10: 4353 (SE +/- 47.59, N = 4)

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)

OSPRay Studio 1.0 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, fewer is better)
  Arch Linux: 7043 (SE +/- 29.16, N = 3)
  Fedora Server 39: 7424 (SE +/- 91.12, N = 4)
  Ubuntu Linux 23.10: 7321 (SE +/- 31.90, N = 3)

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

CentOS Stream 9: The test quit with a non-zero exit status. E: ospStudio: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by ospStudio)
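
The CentOS Stream 9 failures above all report that /lib64/libstdc++.so.6 lacks the GLIBCXX_3.4.30 symbol version that the ospStudio binary requires. A minimal sketch of how one might confirm which GLIBCXX versions a given libstdc++ advertises is shown below. It simply scans the library's bytes for version tags rather than parsing the ELF structure properly, and the function names (glibcxx_versions, provides, check_library) are hypothetical helpers, not part of any tool used in this result file.

```python
import re

# Version tags such as "GLIBCXX_3.4.30" appear as plain strings inside the
# shared library, so a byte-level regex scan is enough for a quick check.
GLIBCXX_TAG = re.compile(rb"GLIBCXX_(\d+(?:\.\d+)+)")


def glibcxx_versions(blob: bytes) -> list:
    """Return the de-duplicated GLIBCXX version numbers found in blob,
    sorted numerically (so 3.4.9 comes before 3.4.29)."""
    found = {m.group(1).decode("ascii") for m in GLIBCXX_TAG.finditer(blob)}
    return sorted(found, key=lambda v: tuple(int(p) for p in v.split(".")))


def provides(blob: bytes, wanted: str) -> bool:
    """True if the version number `wanted` (e.g. "3.4.30") is present."""
    return wanted in glibcxx_versions(blob)


def check_library(path: str, wanted: str) -> bool:
    """Read a library file from disk and test for one GLIBCXX version."""
    with open(path, "rb") as fh:
        return provides(fh.read(), wanted)
```

On an affected system, calling check_library("/lib64/libstdc++.so.6", "3.4.30") would be expected to return False, matching the loader error reported for CentOS Stream 9.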

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 474.24 (SE +/- 1.54, N = 3)
  CentOS Stream 9: 478.03 (SE +/- 0.62, N = 3)
  Ubuntu Linux 23.10: 478.76 (SE +/- 0.91, N = 3)

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 14.10 (SE +/- 0.18, N = 3)
  CentOS Stream 9: 14.32 (SE +/- 0.13, N = 7)
  Ubuntu Linux 23.10: 14.06 (SE +/- 0.13, N = 7)

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 34.21 (SE +/- 0.18, N = 3)
  CentOS Stream 9: 34.29 (SE +/- 0.09, N = 3)
  Ubuntu Linux 23.10: 34.76 (SE +/- 0.12, N = 3)

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 5.6953 (SE +/- 0.0549, N = 7)
  CentOS Stream 9: 5.7833 (SE +/- 0.0226, N = 3)
  Ubuntu Linux 23.10: 5.7515 (SE +/- 0.0557, N = 7)

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 76.66 (SE +/- 0.21, N = 3)
  CentOS Stream 9: 77.17 (SE +/- 0.41, N = 3)
  Ubuntu Linux 23.10: 77.02 (SE +/- 0.46, N = 3)

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 413.38 (SE +/- 3.56, N = 3)
  CentOS Stream 9: 416.68 (SE +/- 0.86, N = 3)
  Ubuntu Linux 23.10: 409.01 (SE +/- 2.51, N = 3)

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 34.74 (SE +/- 0.14, N = 3)
  CentOS Stream 9: 34.23 (SE +/- 0.07, N = 3)
  Ubuntu Linux 23.10: 34.99 (SE +/- 0.31, N = 3)

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 74.53 (SE +/- 0.70, N = 3)
  CentOS Stream 9: 74.80 (SE +/- 0.70, N = 3)
  Ubuntu Linux 23.10: 74.88 (SE +/- 0.50, N = 3)

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 52.12 (SE +/- 0.55, N = 3)
  CentOS Stream 9: 52.24 (SE +/- 0.70, N = 3)
  Ubuntu Linux 23.10: 52.28 (SE +/- 0.63, N = 3)

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 351.75 (SE +/- 4.23, N = 4)
  CentOS Stream 9: 366.38 (SE +/- 0.14, N = 3)
  Ubuntu Linux 23.10: 361.29 (SE +/- 2.37, N = 3)

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 34.04 (SE +/- 0.42, N = 3)
  CentOS Stream 9: 35.28 (SE +/- 0.32, N = 3)
  Ubuntu Linux 23.10: 34.25 (SE +/- 0.38, N = 3)

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory

Neural Magic DeepSparse 1.6 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream (ms/batch, fewer is better)
  Arch Linux: 471.82 (SE +/- 1.04, N = 3)
  CentOS Stream 9: 476.58 (SE +/- 0.34, N = 3)
  Ubuntu Linux 23.10: 477.14 (SE +/- 0.78, N = 3)

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Fedora Server 39: The test quit with a non-zero exit status. E: deepsparse: line 2: /.local/bin/deepsparse.benchmark: No such file or directory
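
Many of the DeepSparse results above differ by less than a percent, so it can help to relate each gap to the reported SE values. The sketch below assumes that "SE" is the standard error of the mean and that the runs on different systems are independent; under those assumptions it expresses the gap between two means in units of their combined standard error. The helper name se_separation is hypothetical, and this is a rough screening aid rather than a formal significance test (N is only 3 for most results).

```python
import math


def se_separation(mean_a, se_a, mean_b, se_b):
    """Gap between two means, in units of their combined standard error.

    Assumes independent measurements, so the standard errors add in
    quadrature: combined = sqrt(se_a^2 + se_b^2).
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0.0:
        # Both SEs are reported as zero: any nonzero gap is unbounded.
        return 0.0 if mean_a == mean_b else math.inf
    return abs(mean_a - mean_b) / combined


# Values copied from the results above (ms/batch), Arch Linux vs Ubuntu 23.10.
obert = se_separation(474.24, 1.54, 478.76, 0.91)       # oBERT on IMDB
distilbert = se_separation(52.12, 0.55, 52.28, 0.63)    # DistilBERT mnli

print(f"oBERT gap: {obert:.2f} combined SEs")
print(f"DistilBERT gap: {distilbert:.2f} combined SEs")
```

A gap well below one combined SE, as in the DistilBERT case, suggests the two systems are indistinguishable on that test at this sample size.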

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version. Learn more via the OpenBenchmarking.org test page.

CloverLeaf 1.3 - Input: clover_bm16 (Seconds, fewer is better)
  Arch Linux: 356.23 (SE +/- 4.10, N = 3)
  CentOS Stream 9: 314.49 (SE +/- 1.51, N = 3)
  Fedora Server 39: 255.39 (SE +/- 1.01, N = 3)
  Ubuntu Linux 23.10: 279.21 (SE +/- 3.60, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

CloverLeaf 1.3 - Input: clover_bm64_short (Seconds, fewer is better)
  Arch Linux: 30.30 (SE +/- 0.27, N = 3)
  CentOS Stream 9: 25.01 (SE +/- 0.12, N = 3)
  Fedora Server 39: 31.63 (SE +/- 0.40, N = 3)
  Ubuntu Linux 23.10: 32.10 (SE +/- 0.23, N = 11)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
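
For "fewer is better" timings such as CloverLeaf, the spread between distributions is easier to read as a percentage over the fastest system. The following is a small illustrative sketch using the clover_bm16 times reported above; the function name slowdown_vs_best is a hypothetical helper and not something provided by the Phoronix Test Suite.

```python
def slowdown_vs_best(times):
    """Map each system to its percentage slowdown relative to the fastest.

    `times` maps a system name to a run time in seconds (lower is better).
    The fastest system gets 0.0.
    """
    best = min(times.values())
    return {name: (t / best - 1.0) * 100.0 for name, t in times.items()}


# CloverLeaf 1.3, Input: clover_bm16, seconds (copied from the result above).
clover_bm16 = {
    "Arch Linux": 356.23,
    "CentOS Stream 9": 314.49,
    "Fedora Server 39": 255.39,
    "Ubuntu Linux 23.10": 279.21,
}

for name, pct in sorted(slowdown_vs_best(clover_bm16).items(), key=lambda kv: kv[1]):
    print(f"{name}: +{pct:.1f}% vs fastest")
```

The same function can be applied to any of the other "Seconds, fewer is better" results in this file by swapping in their values.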

NWChem

NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.

NWChem 7.0.2 - Input: C240 Buckyball (Seconds, fewer is better)
  Ubuntu Linux 23.10: 1750.3
  1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Input: C240 Buckyball

CentOS Stream 9: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Fedora Server 39: The test run did not produce a result. E: /nwchem-7.0.2/bin/LINUX64/nwchem: error while loading shared libraries: libmpi_usempif08.so.40: cannot open shared object file: No such file or directory

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference, high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d (Seconds, fewer is better)
  Arch Linux: 187.27 (SE +/- 0.25, N = 3)
  CentOS Stream 9: 184.19 (SE +/- 0.56, N = 3)
  Fedora Server 39: 168.67 (SE +/- 0.67, N = 3)
  Ubuntu Linux 23.10: 169.18 (SE +/- 0.44, N = 3)
  Additional per-system options shown in the graph: -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

easyWave

The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.

easyWave r34 - Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 (Seconds, fewer is better)
  Arch Linux: 145.71 (SE +/- 7.11, N = 12)
  CentOS Stream 9: 205.01 (SE +/- 10.40, N = 12)
  Fedora Server 39: 156.24 (SE +/- 15.88, N = 12)
  Ubuntu Linux 23.10: 177.75 (SE +/- 8.98, N = 12)
  1. (CXX) g++ options: -O3 -fopenmp

Y-Cruncher

Y-Cruncher is a multi-threaded Pi benchmark capable of computing Pi to trillions of digits. Learn more via the OpenBenchmarking.org test page.

Y-Cruncher 0.8.3 - Pi Digits To Calculate: 1B (Seconds, fewer is better)
  Arch Linux: 7.561 (SE +/- 0.109, N = 3)
  CentOS Stream 9: 7.913 (SE +/- 0.045, N = 3)
  Fedora Server 39: 8.270 (SE +/- 0.088, N = 5)
  Ubuntu Linux 23.10: 7.207 (SE +/- 0.037, N = 3)

Y-Cruncher 0.8.3 - Pi Digits To Calculate: 5B (Seconds, fewer is better)
  Arch Linux: 33.65 (SE +/- 0.37, N = 5)
  CentOS Stream 9: 32.85 (SE +/- 0.17, N = 3)
  Fedora Server 39: 33.73 (SE +/- 0.43, N = 3)
  Ubuntu Linux 23.10: 29.82 (SE +/- 0.07, N = 3)

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

RawTherapee - Total Benchmark Time (Seconds, fewer is better)
  Arch Linux: 107.05 (SE +/- 0.34, N = 3)
  Fedora Server 39: 118.23 (SE +/- 0.45, N = 3)
  Ubuntu Linux 23.10: 129.08 (SE +/- 0.65, N = 3)
  1. Arch Linux: RawTherapee, version 5.10, command line.
  2. Fedora Server 39: RawTherapee, version 5.10, command line.
  3. Ubuntu Linux 23.10: RawTherapee, version 5.9, command line.

Total Benchmark Time

CentOS Stream 9: The test run did not produce a result.

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

GPAW 23.6 - Input: Carbon Nanotube (Seconds, fewer is better)
  Fedora Server 39: 194.51 (SE +/- 0.94, N = 3)
  Ubuntu Linux 23.10: 38.64 (SE +/- 0.35, N = 15)
  Additional per-system options shown in the graph: -fno-strict-overflow -fcf-protection -fexceptions -fPIC -UNDEBUG -std=c99 -fwrapv -O2
  1. (CC) gcc options: -shared -lxc -lblas -lmpi

Input: Carbon Nanotube

CentOS Stream 9: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'gpaw'

Arch Linux: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'gpaw'

125 Results Shown

Graph500:
  26:
    bfs max_TEPS
    bfs median_TEPS
Quicksilver:
  CTS2
  CORAL2 P1
  CORAL2 P2
OpenVINO:
  Person Detection FP16 - CPU
  Face Detection FP16-INT8 - CPU
  Vehicle Detection FP16-INT8 - CPU
  Face Detection Retail FP16-INT8 - CPU
  Road Segmentation ADAS FP16-INT8 - CPU
  Machine Translation EN To DE FP16 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Handwritten English Recognition FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
SVT-AV1:
  Preset 4 - Bosphorus 4K
  Preset 8 - Bosphorus 4K
  Preset 12 - Bosphorus 4K
  Preset 13 - Bosphorus 4K
VVenC:
  Bosphorus 4K - Fast
  Bosphorus 4K - Faster
High Performance Conjugate Gradient:
  104 104 104 - 60
  144 144 144 - 60
ACES DGEMM
Intel Open Image Denoise:
  RT.hdr_alb_nrm.3840x2160 - CPU-Only
  RT.ldr_alb_nrm.3840x2160 - CPU-Only
  RTLightmap.hdr.4096x4096 - CPU-Only
TensorFlow
OpenVKL
OSPRay:
  particle_volume/ao/real_time
  particle_volume/scivis/real_time
  particle_volume/pathtracer/real_time
  gravity_spheres_volume/dim_512/ao/real_time
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/pathtracer/real_time
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream
  NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream
  ResNet-50, Baseline - Asynchronous Multi-Stream
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream
7-Zip Compression:
  Compression Rating
  Decompression Rating
LeelaChessZero:
  BLAS
  Eigen
GROMACS
NAMD:
  ATPase with 327,506 Atoms
  STMV with 1,066,628 Atoms
LAMMPS Molecular Dynamics Simulator:
  20k Atoms
  Rhodopsin Protein
RocksDB:
  Rand Read
  Update Rand
  Read While Writing
  Read Rand Write Rand
Speedb
Memcached:
  1:10
  1:100
ClickHouse:
  100M Rows Hits Dataset, First Run / Cold Cache
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, Third Run
Redis:
  GET - 500
  SET - 500
Graph500:
    sssp max_TEPS
    sssp median_TEPS
PostgreSQL:
  100 - 1000 - Read Only
  100 - 1000 - Read Write
  100 - 1000 - Read Only - Average Latency
  100 - 1000 - Read Write - Average Latency
OpenVINO:
  Person Detection FP16 - CPU
  Face Detection FP16-INT8 - CPU
  Vehicle Detection FP16-INT8 - CPU
  Face Detection Retail FP16-INT8 - CPU
  Road Segmentation ADAS FP16-INT8 - CPU
  Machine Translation EN To DE FP16 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Handwritten English Recognition FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
OSPRay Studio:
  1 - 4K - 1 - Path Tracer - CPU
  2 - 4K - 1 - Path Tracer - CPU
  3 - 4K - 1 - Path Tracer - CPU
  1 - 4K - 16 - Path Tracer - CPU
  1 - 4K - 32 - Path Tracer - CPU
  2 - 4K - 16 - Path Tracer - CPU
  2 - 4K - 32 - Path Tracer - CPU
  3 - 4K - 16 - Path Tracer - CPU
  3 - 4K - 32 - Path Tracer - CPU
  1 - 1080p - 1 - Path Tracer - CPU
  2 - 1080p - 1 - Path Tracer - CPU
  3 - 1080p - 1 - Path Tracer - CPU
  1 - 1080p - 16 - Path Tracer - CPU
  1 - 1080p - 32 - Path Tracer - CPU
  2 - 1080p - 16 - Path Tracer - CPU
  2 - 1080p - 32 - Path Tracer - CPU
  3 - 1080p - 16 - Path Tracer - CPU
  3 - 1080p - 32 - Path Tracer - CPU
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream
  NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream
  ResNet-50, Baseline - Asynchronous Multi-Stream
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream
CloverLeaf:
  clover_bm16
  clover_bm64_short
NWChem
Xcompact3d Incompact3d
easyWave
Y-Cruncher:
  1B
  5B
RawTherapee
GPAW