AMD Ryzen Zen 4 ECC memory benchmarks by Michael Larabel for a future article.
ECC Disabled Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
ECC Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32 GB DDR5-4800MT/s MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED 512MB, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 100 - Inserts: 3000 - Rounds: 30 ECC ECC Disabled 50 100 150 200 250 SE +/- 1.11, N = 3 SE +/- 0.80, N = 3 229.28 230.48 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 300 - Inserts: 1000 - Rounds: 30 ECC ECC Disabled 20 40 60 80 100 SE +/- 0.43, N = 3 SE +/- 0.34, N = 3 109.62 109.15 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 300 - Inserts: 3000 - Rounds: 30 ECC ECC Disabled 80 160 240 320 400 SE +/- 0.37, N = 3 SE +/- 2.72, N = 3 358.40 359.99 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 500 - Inserts: 1000 - Rounds: 30 ECC ECC Disabled 40 80 120 160 200 SE +/- 1.92, N = 12 SE +/- 2.32, N = 12 184.75 184.11 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 500 - Inserts: 3000 - Rounds: 30 ECC ECC Disabled 140 280 420 560 700 SE +/- 7.62, N = 9 SE +/- 5.67, N = 8 638.92 646.61 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 50 - Files: 100000 ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 15897.44, N = 15 SE +/- 10350.72, N = 12 830359 839379
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 100 - Files: 100000 ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 13410.36, N = 15 SE +/- 13606.70, N = 15 799264 805165
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 20 - Files: 1000000 ECC ECC Disabled 400K 800K 1200K 1600K 2000K SE +/- 4344.35, N = 3 SE +/- 1042.35, N = 3 1807250 1767827
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 50 - Files: 1000000 ECC ECC Disabled 400K 800K 1200K 1600K 2000K SE +/- 23302.59, N = 3 SE +/- 19536.69, N = 3 1988614 1961170
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 20 - Files: 100000 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 660.95, N = 3 SE +/- 673.12, N = 3 129429 130612
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 50 - Files: 100000 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 954.30, N = 15 SE +/- 1147.56, N = 7 125519 127197
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 100 - Files: 1000000 ECC ECC Disabled 400K 800K 1200K 1600K 2000K SE +/- 24661.11, N = 15 SE +/- 27286.10, N = 3 1989761 2065407
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 20 - Files: 100000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 1330.48, N = 3 SE +/- 1104.00, N = 15 98948 101358
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 50 - Files: 100000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 470.34, N = 3 SE +/- 323.23, N = 3 99905 100202
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 20 - Files: 1000000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 267.90, N = 3 SE +/- 362.23, N = 3 90059 91144
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 50 - Files: 1000000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 648.45, N = 3 SE +/- 267.21, N = 3 90748 89838
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 100 - Files: 100000 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 1255.07, N = 15 SE +/- 1147.47, N = 15 127903 127189
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 20 - Files: 1000000 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 382.11, N = 3 SE +/- 736.46, N = 3 143625 143370
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 50 - Files: 1000000 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 283.49, N = 3 SE +/- 1096.64, N = 3 140418 141653
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 100 - Files: 100000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 915.43, N = 15 SE +/- 959.88, N = 15 97496 100433
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 20 - Files: 1000000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 646.22, N = 3 SE +/- 378.60, N = 3 108155 108282
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 50 - Files: 1000000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 354.41, N = 3 SE +/- 1059.57, N = 3 105277 106306
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 100 - Files: 1000000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 455.68, N = 3 SE +/- 358.07, N = 3 90122 89839
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 100 - Files: 1000000 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 1323.41, N = 3 SE +/- 1332.20, N = 3 140586 141149
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 100 - Files: 1000000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 1138.94, N = 4 SE +/- 223.84, N = 3 105242 103357
Apache IoTDB Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 ECC ECC Disabled 14M 28M 42M 56M 70M SE +/- 351004.04, N = 3 SE +/- 427325.61, N = 3 65805942 66014976
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 ECC ECC Disabled 15 30 45 60 75 SE +/- 0.44, N = 3 SE +/- 0.56, N = 3 69.44 68.57 MAX: 11348.56 MAX: 13857.17
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 ECC ECC Disabled 14M 28M 42M 56M 70M SE +/- 218304.75, N = 3 SE +/- 274619.09, N = 3 65758975 65712193
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 ECC ECC Disabled 50 100 150 200 250 SE +/- 1.77, N = 3 SE +/- 3.71, N = 3 241.75 242.49 MAX: 27025.88 MAX: 26726.56
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 ECC ECC Disabled 16M 32M 48M 64M 80M SE +/- 321408.08, N = 3 SE +/- 375866.34, N = 3 72019505 72601202
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 ECC ECC Disabled 20 40 60 80 100 SE +/- 0.76, N = 3 SE +/- 0.96, N = 3 101.64 100.88 MAX: 11366.39 MAX: 12636.24
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 ECC ECC Disabled 16M 32M 48M 64M 80M SE +/- 157085.43, N = 3 SE +/- 402756.47, N = 3 73119059 73026782
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 ECC ECC Disabled 80 160 240 320 400 SE +/- 0.99, N = 3 SE +/- 2.67, N = 3 345.18 343.57 MAX: 26517.23 MAX: 27057.97
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 ECC ECC Disabled 15M 30M 45M 60M 75M SE +/- 327817.64, N = 3 SE +/- 458439.80, N = 3 70732193 70558904
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 ECC ECC Disabled 15 30 45 60 75 SE +/- 0.25, N = 3 SE +/- 0.58, N = 3 66.73 66.65 MAX: 23906.66 MAX: 23921.42
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 ECC ECC Disabled 16M 32M 48M 64M 80M SE +/- 328990.74, N = 3 SE +/- 483767.39, N = 3 73102972 71997380
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 ECC ECC Disabled 60 120 180 240 300 SE +/- 2.06, N = 3 SE +/- 3.62, N = 3 246.82 252.53 MAX: 26952.07 MAX: 27096.99
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 ECC ECC Disabled 14M 28M 42M 56M 70M SE +/- 302463.46, N = 3 SE +/- 93381.16, N = 3 66848344 66754764
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 ECC ECC Disabled 30 60 90 120 150 SE +/- 0.32, N = 3 SE +/- 0.16, N = 3 114.38 114.67 MAX: 23891.9 MAX: 23908.06
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 ECC ECC Disabled 15M 30M 45M 60M 75M SE +/- 261100.74, N = 3 SE +/- 147482.62, N = 3 67564513 68475092
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 ECC ECC Disabled 90 180 270 360 450 SE +/- 6.10, N = 3 SE +/- 1.46, N = 3 431.60 431.96 MAX: 29656.61 MAX: 27343.45
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only ECC ECC Disabled 15 30 45 60 75 SE +/- 0.10, N = 3 SE +/- 0.12, N = 3 67.06 67.05
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only ECC ECC Disabled 40 80 120 160 200 SE +/- 0.08, N = 3 SE +/- 0.19, N = 3 171.67 172.09
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only ECC ECC Disabled 20 40 60 80 100 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 84.30 84.19
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only ECC ECC Disabled 140 280 420 560 700 SE +/- 0.13, N = 3 SE +/- 0.93, N = 3 626.02 626.30
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only ECC ECC Disabled 50 100 150 200 250 SE +/- 0.40, N = 3 SE +/- 0.20, N = 3 207.94 207.14
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Coackroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: MoVR - Concurrency: 128 ECC ECC Disabled 400 800 1200 1600 2000 SE +/- 2.12, N = 3 SE +/- 0.46, N = 3 1709.8 1716.5
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 10% Reads - Concurrency: 128 ECC ECC Disabled 14K 28K 42K 56K 70K SE +/- 757.41, N = 4 SE +/- 203.95, N = 3 65696.6 65769.8
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 10% Reads - Concurrency: 256 ECC ECC Disabled 14K 28K 42K 56K 70K SE +/- 60.26, N = 3 SE +/- 21.82, N = 3 63955.2 64158.1
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 128 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 206.20, N = 3 SE +/- 990.71, N = 3 77301.1 78240.7
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 256 ECC ECC Disabled 16K 32K 48K 64K 80K SE +/- 18.97, N = 3 SE +/- 241.90, N = 3 75288.6 76074.8
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 60% Reads - Concurrency: 128 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 395.32, N = 3 SE +/- 727.86, N = 3 81837.5 83011.6
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 60% Reads - Concurrency: 256 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 113.54, N = 3 SE +/- 156.76, N = 3 79189.8 79793.7
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 128 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 71.75, N = 3 SE +/- 76.68, N = 3 99577.6 100834.4
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 256 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 1100.25, N = 3 SE +/- 571.60, N = 3 97540.6 98267.7
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown ECC ECC Disabled 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 24.56 24.38 MIN: 24.27 / MAX: 25.25 MIN: 24.07 / MAX: 25.34
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon ECC ECC Disabled 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 26.20 26.14 MIN: 25.89 / MAX: 26.82 MIN: 25.95 / MAX: 26.71
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon Obj ECC ECC Disabled 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 22.33 22.34 MIN: 22.12 / MAX: 22.85 MIN: 22.15 / MAX: 22.84
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GPAW 23.6 Input: Carbon Nanotube ECC ECC Disabled 40 80 120 160 200 SE +/- 0.24, N = 3 SE +/- 0.11, N = 3 171.77 172.01 1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org bfs median_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC ECC Disabled 70M 140M 210M 280M 350M 282503000 305288000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenBenchmarking.org bfs max_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC ECC Disabled 70M 140M 210M 280M 350M 283689000 306781000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenBenchmarking.org sssp median_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC ECC Disabled 20M 40M 60M 80M 100M 91728900 93647900 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenBenchmarking.org sssp max_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC ECC Disabled 30M 60M 90M 120M 150M 120930000 124887000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare ECC ECC Disabled 0.4766 0.9532 1.4298 1.9064 2.383 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 2.106 2.118 1. (CXX) g++ options: -O3
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 308.39, N = 3 SE +/- 15.84, N = 3 131313 131245 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish ECC ECC Disabled 7K 14K 21K 28K 35K SE +/- 20.78, N = 3 SE +/- 11.85, N = 3 34416 34438 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 ECC ECC Disabled 30M 60M 90M 120M 150M SE +/- 182924.70, N = 3 SE +/- 1072155.00, N = 3 162879333 161177000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 ECC ECC Disabled 800K 1600K 2400K 3200K 4000K SE +/- 6691.62, N = 3 SE +/- 8685.88, N = 3 3666333 3685667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Slow ECC ECC Disabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.55 16.51 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium ECC ECC Disabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.95 16.94 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast ECC ECC Disabled 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 40.36 40.34 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast ECC ECC Disabled 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 51.26 51.21 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast ECC ECC Disabled 15 30 45 60 75 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 65.88 66.03 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem ECC ECC Disabled 40 80 120 160 200 SE +/- 0.85, N = 3 SE +/- 0.48, N = 3 163.36 163.07 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh ECC ECC Disabled 40 80 120 160 200 SE +/- 0.26, N = 3 SE +/- 1.21, N = 3 176.27 177.52 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 ECC ECC Disabled 0.8861 1.7722 2.6583 3.5444 4.4305 SE +/- 0.029, N = 3 SE +/- 0.018, N = 3 3.837 3.938 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless ECC ECC Disabled 0.8588 1.7176 2.5764 3.4352 4.294 SE +/- 0.007, N = 3 SE +/- 0.041, N = 5 3.723 3.817 1. (CXX) g++ options: -O3 -fPIC -lm
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 ECC ECC Disabled 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 107.4 107.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 ECC ECC Disabled 50 100 150 200 250 SE +/- 0.23, N = 3 SE +/- 0.15, N = 3 214.9 215.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 ECC ECC Disabled 50M 100M 150M 200M 250M SE +/- 381575.68, N = 3 SE +/- 138604.15, N = 3 244130000 241173333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 24 - Buffer Length: 256 - Filter Length: 512 ECC ECC Disabled 70M 140M 210M 280M 350M SE +/- 453370.10, N = 3 SE +/- 105039.68, N = 3 306083333 304490000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU ECC ECC Disabled 0.8955 1.791 2.6865 3.582 4.4775 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 3.97 3.98 MIN: 3.87 / MAX: 4.22 MIN: 3.86 / MAX: 4.21
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU ECC ECC Disabled 0.7245 1.449 2.1735 2.898 3.6225 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 3.22 3.22 MIN: 1.2 / MAX: 3.72 MIN: 1.23 / MAX: 3.72
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU ECC ECC Disabled 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 6.28 6.25 MIN: 5.72 / MAX: 6.55 MIN: 5.72 / MAX: 6.47
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU ECC ECC Disabled 0.8235 1.647 2.4705 3.294 4.1175 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.66 3.66 MIN: 1.52 / MAX: 4.17 MIN: 1.51 / MAX: 4.18
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU ECC ECC Disabled 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.13, N = 7 14.97 14.87 MIN: 13.47 / MAX: 15.18 MIN: 13.14 / MAX: 15.38
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB ECC ECC Disabled 4K 8K 12K 16K 20K SE +/- 158.15, N = 7 SE +/- 243.33, N = 3 17667.19 18187.18 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB ECC ECC Disabled 3K 6K 9K 12K 15K SE +/- 85.40, N = 3 SE +/- 161.44, N = 15 15125.32 15938.54 1. (CC) gcc options: -O3 -march=native
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms ECC ECC Disabled 0.2332 0.4664 0.6996 0.9328 1.166 SE +/- 0.00211, N = 3 SE +/- 0.00078, N = 3 1.03599 1.03624
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C ECC ECC Disabled 8K 16K 24K 32K 40K SE +/- 14.62, N = 3 SE +/- 104.76, N = 3 39530.58 39142.85 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C ECC ECC Disabled 2K 4K 6K 8K 10K SE +/- 33.18, N = 3 SE +/- 28.33, N = 3 9818.08 9766.48 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C ECC ECC Disabled 500 1000 1500 2000 2500 SE +/- 28.31, N = 4 SE +/- 25.44, N = 4 2353.15 2342.69 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C ECC ECC Disabled 5K 10K 15K 20K 25K SE +/- 28.63, N = 3 SE +/- 51.21, N = 3 22585.66 22407.05 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D ECC ECC Disabled 300 600 900 1200 1500 SE +/- 0.84, N = 3 SE +/- 8.10, N = 3 1276.30 1300.77 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C ECC ECC Disabled 9K 18K 27K 36K 45K SE +/- 132.77, N = 3 SE +/- 184.92, N = 3 42129.34 41776.27 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C ECC ECC Disabled 5K 10K 15K 20K 25K SE +/- 39.09, N = 3 SE +/- 21.67, N = 3 22482.36 22628.24 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B ECC ECC Disabled 4K 8K 12K 16K 20K SE +/- 16.53, N = 3 SE +/- 24.91, N = 3 18505.14 18369.26 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 155.77, N = 3 SE +/- 118.52, N = 3 118799.08 118602.17 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 ECC ECC Disabled 20K 40K 60K 80K 100K SE +/- 266.94, N = 3 SE +/- 24.09, N = 3 108646.65 111520.62 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball ECC ECC Disabled 1100 2200 3300 4400 5500 5115.7 5290.5 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
Input: drivaerFastback, Small Mesh Size
ECC Disabled: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
ECC: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
Input: drivaerFastback, Medium Mesh Size
ECC Disabled: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
ECC: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam ECC ECC Disabled 20 40 60 80 100 SE +/- 0.17, N = 3 SE +/- 0.53, N = 3 97.51 98.55
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 ECC ECC Disabled 6000M 12000M 18000M 24000M 30000M SE +/- 20175809.49, N = 3 SE +/- 283646194.48, N = 3 27345804760 26965794363 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 ECC ECC Disabled 2000M 4000M 6000M 8000M 10000M SE +/- 2949369.22, N = 3 SE +/- 829361.75, N = 3 8675080823 8644944773 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 ECC ECC Disabled 2K 4K 6K 8K 10K SE +/- 35.75, N = 3 SE +/- 10.04, N = 3 11651.1 11581.9 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 ECC ECC Disabled 60K 120K 180K 240K 300K SE +/- 76.05, N = 3 SE +/- 52.39, N = 3 293872.1 292853.7 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 ECC ECC Disabled 20000M 40000M 60000M 80000M 100000M SE +/- 8950166.36, N = 3 SE +/- 19425517.36, N = 3 103120195583 102741841967 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM ECC ECC Disabled 40000M 80000M 120000M 160000M 200000M SE +/- 59771765.00, N = 3 SE +/- 62857109.96, N = 3 201399715157 200652491073 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM ECC ECC Disabled 40000M 80000M 120000M 160000M 200000M SE +/- 52395564.84, N = 3 SE +/- 81118637.19, N = 3 172523134883 171468566437 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 ECC ECC Disabled 16000M 32000M 48000M 64000M 80000M SE +/- 5719713.01, N = 3 SE +/- 10402041.03, N = 3 73295799273 73055647790 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU ECC ECC Disabled 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 76.73 76.87 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU ECC ECC Disabled 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.11, N = 3 78.12 77.99 MIN: 56.83 / MAX: 103.58 MIN: 54.1 / MAX: 104.16 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU ECC ECC Disabled 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 20.73 20.76 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU ECC ECC Disabled 60 120 180 240 300 SE +/- 0.33, N = 3 SE +/- 0.11, N = 3 288.95 288.56 MIN: 267.2 / MAX: 297.65 MIN: 267.28 / MAX: 296.44 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU ECC ECC Disabled 300 600 900 1200 1500 SE +/- 1.03, N = 3 SE +/- 0.79, N = 3 1307.43 1306.54 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU ECC ECC Disabled 1.0283 2.0566 3.0849 4.1132 5.1415 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.56 4.57 MIN: 2.54 / MAX: 13.32 MIN: 2.52 / MAX: 13.14 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU ECC ECC Disabled 800 1600 2400 3200 4000 SE +/- 6.12, N = 3 SE +/- 9.13, N = 3 3630.27 3623.19 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU ECC ECC Disabled 0.7178 1.4356 2.1534 2.8712 3.589 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.19 3.19 MIN: 1.74 / MAX: 11.21 MIN: 1.75 / MAX: 9.52 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU ECC ECC Disabled 90 180 270 360 450 SE +/- 1.67, N = 3 SE +/- 0.37, N = 3 402.05 401.89 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU ECC ECC Disabled 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 14.90 14.91 MIN: 11.1 / MAX: 21.3 MIN: 11.06 / MAX: 24.21 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU ECC ECC Disabled 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 105.39 105.91 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU ECC ECC Disabled 13 26 39 52 65 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 56.88 56.59 MIN: 44.67 / MAX: 72.07 MIN: 45.24 / MAX: 67.38 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU ECC ECC Disabled 500 1000 1500 2000 2500 SE +/- 0.76, N = 3 SE +/- 3.30, N = 3 2128.62 2130.59 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU ECC ECC Disabled 1.26 2.52 3.78 5.04 6.3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 5.60 5.59 MIN: 2.88 / MAX: 13.28 MIN: 2.76 / MAX: 13.23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU ECC ECC Disabled 300 600 900 1200 1500 SE +/- 4.37, N = 3 SE +/- 5.33, N = 3 1229.15 1223.55 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU ECC ECC Disabled 1.1003 2.2006 3.3009 4.4012 5.5015 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.87 4.89 MIN: 3.3 / MAX: 8.95 MIN: 3.29 / MAX: 11.23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU ECC ECC Disabled 100 200 300 400 500 SE +/- 1.10, N = 3 SE +/- 0.60, N = 3 468.28 468.29 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU ECC ECC Disabled 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 25.58 25.57 MIN: 16.44 / MAX: 40.48 MIN: 16.54 / MAX: 37.23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU ECC ECC Disabled 8K 16K 24K 32K 40K SE +/- 41.68, N = 3 SE +/- 91.39, N = 3 39510.35 39479.81 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU ECC ECC Disabled 0.0608 0.1216 0.1824 0.2432 0.304 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.27 0.27 MIN: 0.15 / MAX: 7.01 MIN: 0.15 / MAX: 7.04 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time ECC ECC Disabled 1.3415 2.683 4.0245 5.366 6.7075 SE +/- 0.00168, N = 3 SE +/- 0.00390, N = 3 5.96233 5.93739
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time ECC ECC Disabled 2 4 6 8 10 SE +/- 0.00349, N = 3 SE +/- 0.00404, N = 3 6.02105 5.93370
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time ECC ECC Disabled 40 80 120 160 200 SE +/- 0.07, N = 3 SE +/- 0.46, N = 3 193.50 194.14
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time ECC ECC Disabled 1.3417 2.6834 4.0251 5.3668 6.7085 SE +/- 0.01403, N = 3 SE +/- 0.02068, N = 3 5.81175 5.96291
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time ECC ECC Disabled 1.3036 2.6072 3.9108 5.2144 6.518 SE +/- 0.00078, N = 3 SE +/- 0.01901, N = 3 5.63794 5.79397
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time ECC ECC Disabled 2 4 6 8 10 SE +/- 0.00444, N = 3 SE +/- 0.00292, N = 3 6.95107 6.94833
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 800 - Mode: Read Only ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 1968.16, N = 3 SE +/- 6825.25, N = 3 1050229 1057581 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 800 - Mode: Read Only - Average Latency ECC ECC Disabled 0.1715 0.343 0.5145 0.686 0.8575 SE +/- 0.001, N = 3 SE +/- 0.005, N = 3 0.762 0.756 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 1000 - Mode: Read Only ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 3860.24, N = 3 SE +/- 5787.70, N = 3 995905 1006054 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 1000 - Mode: Read Only - Average Latency ECC ECC Disabled 0.2259 0.4518 0.6777 0.9036 1.1295 SE +/- 0.004, N = 3 SE +/- 0.006, N = 3 1.004 0.994 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 800 - Mode: Read Write ECC ECC Disabled 300 600 900 1200 1500 SE +/- 42.18, N = 9 SE +/- 40.18, N = 12 1531 1517 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 800 - Mode: Read Write - Average Latency ECC ECC Disabled 110 220 330 440 550 SE +/- 14.93, N = 9 SE +/- 14.11, N = 12 525.98 531.29 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 1000 - Mode: Read Write ECC ECC Disabled 300 600 900 1200 1500 SE +/- 47.44, N = 12 SE +/- 40.02, N = 12 1388 1445 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 1 - Clients: 1000 - Mode: Read Write - Average Latency ECC ECC Disabled 160 320 480 640 800 SE +/- 24.99, N = 12 SE +/- 20.48, N = 12 729.62 698.43 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 4872.30, N = 3 SE +/- 9395.63, N = 3 915305 923799 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency ECC ECC Disabled 0.1967 0.3934 0.5901 0.7868 0.9835 SE +/- 0.005, N = 3 SE +/- 0.009, N = 3 0.874 0.866 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 6917.40, N = 3 SE +/- 2229.74, N = 3 856806 881612 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency ECC ECC Disabled 0.2626 0.5252 0.7878 1.0504 1.313 SE +/- 0.010, N = 3 SE +/- 0.003, N = 3 1.167 1.134 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write ECC ECC Disabled 8K 16K 24K 32K 40K SE +/- 374.53, N = 3 SE +/- 246.74, N = 3 35767 36881 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency ECC ECC Disabled 5 10 15 20 25 SE +/- 0.24, N = 3 SE +/- 0.15, N = 3 22.37 21.69 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write ECC ECC Disabled 7K 14K 21K 28K 35K SE +/- 418.19, N = 3 SE +/- 312.09, N = 3 32176 32685 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency ECC ECC Disabled 7 14 21 28 35 SE +/- 0.40, N = 3 SE +/- 0.29, N = 3 31.09 30.60 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer ECC ECC Disabled 14K 28K 42K 56K 70K SE +/- 247.45, N = 3 SE +/- 333.42, N = 3 62763.29 63428.87 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer ECC ECC Disabled 13K 26K 39K 52K 65K SE +/- 259.25, N = 3 SE +/- 309.22, N = 3 62587.40 63014.91 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer ECC ECC Disabled 12K 24K 36K 48K 60K SE +/- 523.00, N = 3 SE +/- 634.29, N = 3 55609.57 56261.71 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer ECC ECC Disabled 13K 26K 39K 52K 65K SE +/- 168.42, N = 3 SE +/- 410.79, N = 3 58968.86 59817.80 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Floating Point ECC ECC Disabled 12K 24K 36K 48K 60K SE +/- 412.39, N = 3 SE +/- 563.17, N = 3 56004.72 55931.83 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Floating Point ECC ECC Disabled 14K 28K 42K 56K 70K SE +/- 316.77, N = 3 SE +/- 349.92, N = 3 62547.20 63329.35 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Floating Point ECC ECC Disabled 14K 28K 42K 56K 70K SE +/- 76.35, N = 3 SE +/- 215.92, N = 3 63205.02 63394.10 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Floating Point ECC ECC Disabled 12K 24K 36K 48K 60K SE +/- 220.23, N = 3 SE +/- 321.29, N = 3 54750.05 55640.51 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Floating Point ECC ECC Disabled 13K 26K 39K 52K 65K SE +/- 286.79, N = 3 SE +/- 148.97, N = 3 59313.91 59227.55 1. (CC) gcc options: -O3 -march=native
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example ECC ECC Disabled 7 14 21 28 35 SE +/- 0.16, N = 3 SE +/- 0.29, N = 3 31.71 32.13 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Fill ECC ECC Disabled 300K 600K 900K 1200K 1500K SE +/- 1532.71, N = 3 SE +/- 1908.55, N = 3 1211432 1215131 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Read ECC ECC Disabled 20M 40M 60M 80M 100M SE +/- 55540.17, N = 3 SE +/- 218187.96, N = 3 106312822 106335519 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random ECC ECC Disabled 200K 400K 600K 800K 1000K SE +/- 1160.52, N = 3 SE +/- 582.27, N = 3 776559 780282 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Sequential Fill ECC ECC Disabled 300K 600K 900K 1200K 1500K SE +/- 3425.42, N = 3 SE +/- 3332.23, N = 3 1326951 1324715 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Fill Sync ECC ECC Disabled 30K 60K 90K 120K 150K SE +/- 2406.06, N = 15 SE +/- 394.20, N = 3 131422 127674 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read While Writing ECC ECC Disabled 700K 1400K 2100K 2800K 3500K SE +/- 26280.53, N = 3 SE +/- 2700.07, N = 3 3293184 3277737 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random ECC ECC Disabled 600K 1200K 1800K 2400K 3000K SE +/- 9449.73, N = 3 SE +/- 17250.82, N = 3 2715841 2727147 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: Downlink Processor Benchmark ECC ECC Disabled 200 400 600 800 1000 SE +/- 3.71, N = 3 SE +/- 6.06, N = 3 1064.7 1062.9 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: PUSCH Processor Benchmark, Throughput Total ECC ECC Disabled 700 1400 2100 2800 3500 SE +/- 7.41, N = 3 SE +/- 1.53, N = 3 3457.7 3444.6 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: PUSCH Processor Benchmark, Throughput Thread ECC ECC Disabled 50 100 150 200 250 SE +/- 0.35, N = 3 SE +/- 0.92, N = 3 212.0 214.4 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale ECC ECC Disabled 8K 16K 24K 32K 40K SE +/- 22.50, N = 5 SE +/- 13.93, N = 5 35849.8 35867.3 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad ECC ECC Disabled 9K 18K 27K 36K 45K SE +/- 23.27, N = 5 SE +/- 5.66, N = 5 39769.0 39833.5 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add ECC ECC Disabled 9K 18K 27K 36K 45K SE +/- 24.36, N = 5 SE +/- 25.08, N = 5 39666.0 39735.4 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K ECC ECC Disabled 1.1527 2.3054 3.4581 4.6108 5.7635 SE +/- 0.012, N = 3 SE +/- 0.006, N = 3 5.123 5.099 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K ECC ECC Disabled 16 32 48 64 80 SE +/- 0.40, N = 3 SE +/- 0.52, N = 3 72.36 72.09 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K ECC ECC Disabled 40 80 120 160 200 SE +/- 0.32, N = 3 SE +/- 0.18, N = 3 161.59 160.06 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K ECC ECC Disabled 40 80 120 160 200 SE +/- 0.44, N = 3 SE +/- 0.32, N = 3 160.91 159.53 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K ECC ECC Disabled 20 40 60 80 100 SE +/- 1.61, N = 12 SE +/- 1.68, N = 12 92.71 92.55 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K ECC ECC Disabled 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.20, N = 3 99.57 100.16 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K ECC ECC Disabled 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.17, N = 3 87.32 86.92 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 16 - Model: ResNet-50 ECC ECC Disabled 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 35.68 35.75
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 32 - Model: ResNet-50 ECC ECC Disabled 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 36.61 36.66
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 ECC ECC Disabled 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 36.74 36.78
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 256 - Model: ResNet-50 ECC ECC Disabled 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 36.83 36.91
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium ECC ECC Disabled 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.09 12.07
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast ECC ECC Disabled 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 35.02 34.95
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast ECC ECC Disabled 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 37.09 36.97
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast ECC ECC Disabled 10 20 30 40 50 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 43.40 43.29
WRF WRF, the Weather Research and Forecasting Model, is a "next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features two dynamical cores, a data assimilation system, and a software architecture supporting parallel computation and system extensibility." Learn more via the OpenBenchmarking.org test page.
Input: conus 2.5km
ECC Disabled: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
ECC: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction ECC ECC Disabled 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 16.65 16.63 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction ECC ECC Disabled 20 40 60 80 100 SE +/- 0.60, N = 3 SE +/- 0.70, N = 3 77.11 78.22 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
ECC Disabled Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 1 October 2023 19:04 by user root.
ECC Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32 GB DDR5-4800MT/s MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED 512MB, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2023 05:22 by user root.