2 x Intel Xeon Max 9480 benchmarks for a future article.
Linux 6.8 Processor: 2 x Intel Xeon Max 9480 @ 3.50GHz (112 Cores / 224 Threads), Motherboard: Supermicro SYS-221H-TNR X13DEM v1.10 (1.3 BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 7682GB INTEL SSDPF2KX076TZ, Graphics: ASPEED, Network: 2 x Broadcom BCM57508 NetXtreme-E 10Gb/25Gb/40Gb/50Gb/100Gb/200Gb
OS: Ubuntu 23.10, Kernel: 6.8.0-060800-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x2c000290Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Linux 6.9-rc2 OS: Ubuntu 23.10, Kernel: 6.9.0-060900rc2-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x2c000290Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 Linux 6.8 Linux 6.9-rc2 1.5M 3M 4.5M 6M 7.5M SE +/- 67589.78, N = 6 SE +/- 46491.34, N = 3 6814667 6916667 1. (CXX) g++ options: -fopenmp -O3 -march=native
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P2 Linux 6.8 Linux 6.9-rc2 1.4M 2.8M 4.2M 5.6M 7M SE +/- 74194.56, N = 9 SE +/- 91964.82, N = 9 6677333 6689667 1. (CXX) g++ options: -fopenmp -O3 -march=native
FFmpeg This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx264 - Scenario: Live Linux 6.8 Linux 6.9-rc2 40 80 120 160 200 SE +/- 1.29, N = 3 SE +/- 0.70, N = 3 181.51 182.12 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Live Linux 6.8 Linux 6.9-rc2 30 60 90 120 150 SE +/- 1.36, N = 15 SE +/- 1.80, N = 15 112.43 113.99 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx264 - Scenario: Upload Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 11.50 11.51 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Upload Linux 6.8 Linux 6.9-rc2 5 10 15 20 25 SE +/- 0.31, N = 3 SE +/- 0.26, N = 4 22.31 22.24 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx264 - Scenario: Platform Linux 6.8 Linux 6.9-rc2 10 20 30 40 50 SE +/- 0.17, N = 3 SE +/- 0.10, N = 3 42.76 42.77 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Platform Linux 6.8 Linux 6.9-rc2 10 20 30 40 50 SE +/- 0.71, N = 12 SE +/- 0.46, N = 12 42.74 40.20 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx264 - Scenario: Video On Demand Linux 6.8 Linux 6.9-rc2 10 20 30 40 50 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 42.94 42.88 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org FPS, More Is Better FFmpeg 6.1 Encoder: libx265 - Scenario: Video On Demand Linux 6.8 Linux 6.9-rc2 10 20 30 40 50 SE +/- 0.50, N = 12 SE +/- 0.76, N = 9 44.80 44.24 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Xmrig Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: KawPow - Hash Count: 1M Linux 6.8 Linux 6.9-rc2 7K 14K 21K 28K 35K SE +/- 68.36, N = 3 SE +/- 137.96, N = 3 32860.8 33072.5 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: Monero - Hash Count: 1M Linux 6.8 Linux 6.9-rc2 7K 14K 21K 28K 35K SE +/- 320.50, N = 13 SE +/- 181.84, N = 3 32361.2 33015.5 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: Wownero - Hash Count: 1M Linux 6.8 Linux 6.9-rc2 8K 16K 24K 32K 40K SE +/- 245.88, N = 3 SE +/- 490.33, N = 3 36096.6 36180.7 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: GhostRider - Hash Count: 1M Linux 6.8 Linux 6.9-rc2 2K 4K 6K 8K 10K SE +/- 31.05, N = 3 SE +/- 42.79, N = 3 9592.3 9602.7 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: CryptoNight-Heavy - Hash Count: 1M Linux 6.8 Linux 6.9-rc2 7K 14K 21K 28K 35K SE +/- 49.86, N = 3 SE +/- 160.32, N = 3 32856.3 32826.1 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: CryptoNight-Femto UPX2 - Hash Count: 1M Linux 6.8 Linux 6.9-rc2 7K 14K 21K 28K 35K SE +/- 110.64, N = 3 SE +/- 170.37, N = 3 33002.2 32993.8 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read Linux 6.8 Linux 6.9-rc2 3K 6K 9K 12K 15K SE +/- 0.52, N = 3 SE +/- 0.88, N = 3 13344.70 13341.47 MIN: 13338.06 / MAX: 13346.23 MIN: 13335.31 / MAX: 13343.85 1. (CC) gcc options: -O3 -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write Linux 6.8 Linux 6.9-rc2 20K 40K 60K 80K 100K SE +/- 36.95, N = 3 SE +/- 65.10, N = 3 93036.30 93862.76 MIN: 54037.87 / MAX: 103869.49 MIN: 54012.69 / MAX: 104121.12 1. (CC) gcc options: -O3 -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write Linux 6.8 Linux 6.9-rc2 20K 40K 60K 80K 100K SE +/- 2.71, N = 3 SE +/- 71.07, N = 3 99112.70 98711.08 MIN: 88982.6 / MAX: 105154.55 MIN: 85707.72 / MAX: 105115.44 1. (CC) gcc options: -O3 -lrt
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 1 - Decompression Speed Linux 6.8 Linux 6.9-rc2 800 1600 2400 3200 4000 SE +/- 44.31, N = 3 SE +/- 18.57, N = 3 3601.0 3628.0 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 3 - Decompression Speed Linux 6.8 Linux 6.9-rc2 700 1400 2100 2800 3500 SE +/- 33.92, N = 3 SE +/- 34.17, N = 3 3364.8 3349.6 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 9 - Decompression Speed Linux 6.8 Linux 6.9-rc2 800 1600 2400 3200 4000 SE +/- 16.01, N = 3 SE +/- 57.81, N = 3 3537.0 3492.0 1. (CC) gcc options: -O3
JPEG-XL libjxl The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: PNG - Quality: 80 Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.38, N = 3 SE +/- 0.36, N = 15 32.48 33.05 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: PNG - Quality: 90 Linux 6.8 Linux 6.9-rc2 7 14 21 28 35 SE +/- 0.28, N = 3 SE +/- 0.18, N = 3 30.97 30.13 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: JPEG - Quality: 80 Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.51, N = 12 SE +/- 0.53, N = 15 34.60 34.19 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: JPEG - Quality: 90 Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.37, N = 3 SE +/- 0.38, N = 15 32.56 33.89 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: PNG - Quality: 100 Linux 6.8 Linux 6.9-rc2 6 12 18 24 30 SE +/- 0.15, N = 3 SE +/- 0.21, N = 3 25.67 25.61 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: JPEG - Quality: 100 Linux 6.8 Linux 6.9-rc2 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 26.00 25.96 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
JPEG-XL Decoding libjxl The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. The JPEG XL encoding/decoding is done using the libjxl codebase. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG-XL Decoding libjxl 0.10.1 CPU Threads: All Linux 6.8 Linux 6.9-rc2 80 160 240 320 400 SE +/- 4.23, N = 3 SE +/- 2.26, N = 3 386.23 381.66
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP2 Image Encode 20220823 Encode Settings: Default Linux 6.8 Linux 6.9-rc2 1.3275 2.655 3.9825 5.31 6.6375 SE +/- 0.11, N = 15 SE +/- 0.09, N = 15 5.90 5.65 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl
OpenBenchmarking.org MP/s, More Is Better WebP2 Image Encode 20220823 Encode Settings: Quality 75, Compression Effort 7 Linux 6.8 Linux 6.9-rc2 0.144 0.288 0.432 0.576 0.72 SE +/- 0.01, N = 3 SE +/- 0.01, N = 4 0.64 0.61 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl
OpenBenchmarking.org MP/s, More Is Better WebP2 Image Encode 20220823 Encode Settings: Quality 95, Compression Effort 7 Linux 6.8 Linux 6.9-rc2 0.0765 0.153 0.2295 0.306 0.3825 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.34 0.33 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl
OpenBenchmarking.org MP/s, More Is Better WebP2 Image Encode 20220823 Encode Settings: Quality 100, Compression Effort 5 Linux 6.8 Linux 6.9-rc2 2 4 6 8 10 SE +/- 0.11, N = 15 SE +/- 0.07, N = 3 8.35 8.45 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl
OpenBenchmarking.org MP/s, More Is Better WebP2 Image Encode 20220823 Encode Settings: Quality 100, Lossless Compression Linux 6.8 Linux 6.9-rc2 0.0158 0.0316 0.0474 0.0632 0.079 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.07 0.07 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl
srsRAN Project OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240219 Test: PDSCH Processor Benchmark, Throughput Total Linux 6.8 Linux 6.9-rc2 4K 8K 12K 16K 20K SE +/- 263.34, N = 15 SE +/- 377.19, N = 12 17006.8 16877.6 1. (CXX) g++ options: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -O3 -fno-trapping-math -fno-math-errno -ldl
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Crown Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.59, N = 15 SE +/- 1.07, N = 15 36.60 35.08 MIN: 25.71 / MAX: 54.49 MIN: 22.8 / MAX: 59.06
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown Linux 6.8 Linux 6.9-rc2 9 18 27 36 45 SE +/- 0.47, N = 4 SE +/- 1.44, N = 12 38.84 35.73 MIN: 30.82 / MAX: 52.25 MIN: 20.32 / MAX: 57.51
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.86, N = 15 SE +/- 0.54, N = 15 35.40 35.65 MIN: 23.5 / MAX: 52.61 MIN: 24.54 / MAX: 51.88
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon Obj Linux 6.8 Linux 6.9-rc2 9 18 27 36 45 SE +/- 0.92, N = 15 SE +/- 0.60, N = 15 39.82 36.50 MIN: 26.64 / MAX: 50.91 MIN: 22.66 / MAX: 46.51
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.77, N = 13 SE +/- 0.82, N = 15 31.60 32.83 MIN: 20.8 / MAX: 52.83 MIN: 22.56 / MAX: 48.16
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Obj Linux 6.8 Linux 6.9-rc2 9 18 27 36 45 SE +/- 0.94, N = 15 SE +/- 0.88, N = 15 36.44 37.43 MIN: 22.56 / MAX: 49.05 MIN: 27.23 / MAX: 49.01
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 4 - Input: Bosphorus 4K Linux 6.8 Linux 6.9-rc2 1.3196 2.6392 3.9588 5.2784 6.598 SE +/- 0.035, N = 3 SE +/- 0.051, N = 15 5.865 5.760 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K Linux 6.8 Linux 6.9-rc2 11 22 33 44 55 SE +/- 1.10, N = 12 SE +/- 1.18, N = 15 48.94 48.57 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K Linux 6.8 Linux 6.9-rc2 30 60 90 120 150 SE +/- 5.13, N = 15 SE +/- 5.21, N = 15 127.20 126.19 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 13 - Input: Bosphorus 4K Linux 6.8 Linux 6.9-rc2 30 60 90 120 150 SE +/- 3.82, N = 12 SE +/- 5.42, N = 15 123.25 123.10 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 4 - Input: Bosphorus 1080p Linux 6.8 Linux 6.9-rc2 5 10 15 20 25 SE +/- 0.13, N = 15 SE +/- 0.21, N = 3 18.66 18.93 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast Linux 6.8 Linux 6.9-rc2 1.1106 2.2212 3.3318 4.4424 5.553 SE +/- 0.075, N = 10 SE +/- 0.077, N = 12 4.902 4.936 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.200, N = 12 SE +/- 0.172, N = 15 9.847 9.524 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast Linux 6.8 Linux 6.9-rc2 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 15.91 16.15 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.29, N = 15 SE +/- 0.36, N = 15 33.99 33.09 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/ao/real_time Linux 6.8 Linux 6.9-rc2 7 14 21 28 35 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 29.73 28.88
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/scivis/real_time Linux 6.8 Linux 6.9-rc2 7 14 21 28 35 SE +/- 0.26, N = 3 SE +/- 0.33, N = 3 29.28 29.07
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/pathtracer/real_time Linux 6.8 Linux 6.9-rc2 20 40 60 80 100 SE +/- 1.85, N = 12 SE +/- 1.98, N = 9 102.95 98.87
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/ao/real_time Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.14, N = 15 SE +/- 0.05, N = 3 10.70 10.51
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.08, N = 15 SE +/- 0.07, N = 3 10.50 10.82
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time Linux 6.8 Linux 6.9-rc2 6 12 18 24 30 SE +/- 0.41, N = 12 SE +/- 0.20, N = 3 24.28 25.17
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 16.1 Chess Benchmark Linux 6.8 Linux 6.9-rc2 14M 28M 42M 56M 70M SE +/- 2993041.97, N = 9 SE +/- 2705883.43, N = 9 61999717 64443099 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: IP Shapes 1D - Engine: CPU Linux 6.8 Linux 6.9-rc2 1.0151 2.0302 3.0453 4.0604 5.0755 SE +/- 0.04580, N = 3 SE +/- 0.05180, N = 12 4.50055 4.51166 MIN: 3.52 MIN: 3.37 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: IP Shapes 3D - Engine: CPU Linux 6.8 Linux 6.9-rc2 1.0896 2.1792 3.2688 4.3584 5.448 SE +/- 0.02352, N = 3 SE +/- 0.02357, N = 3 4.79853 4.84245 MIN: 3.98 MIN: 4.01 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Convolution Batch Shapes Auto - Engine: CPU Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.03328, N = 3 SE +/- 0.09863, N = 15 9.00621 8.81043 MIN: 8.12 MIN: 6.25 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Deconvolution Batch shapes_1d - Engine: CPU Linux 6.8 Linux 6.9-rc2 5 10 15 20 25 SE +/- 0.23, N = 15 SE +/- 0.14, N = 3 18.34 18.03 MIN: 10.09 MIN: 13.29 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Deconvolution Batch shapes_3d - Engine: CPU Linux 6.8 Linux 6.9-rc2 2 4 6 8 10 SE +/- 0.10591, N = 15 SE +/- 0.04472, N = 3 6.16576 5.78202 MIN: 3.75 MIN: 3.81 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Recurrent Neural Network Training - Engine: CPU Linux 6.8 Linux 6.9-rc2 2K 4K 6K 8K 10K SE +/- 22.69, N = 3 SE +/- 498.89, N = 13 4821.32 10195.75 MIN: 4659.99 MIN: 4568.3 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Recurrent Neural Network Inference - Engine: CPU Linux 6.8 Linux 6.9-rc2 160 320 480 640 800 SE +/- 7.46, N = 3 SE +/- 6.16, N = 9 748.50 753.36 MIN: 677.68 MIN: 652.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 400 800 1200 1600 2000 SE +/- 40.26, N = 15 SE +/- 22.91, N = 15 1801 1774
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 400 800 1200 1600 2000 SE +/- 13.93, N = 11 SE +/- 26.21, N = 3 1861 1839
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 500 1000 1500 2000 2500 SE +/- 21.99, N = 4 SE +/- 26.16, N = 13 2108 2164
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 7K 14K 21K 28K 35K SE +/- 294.16, N = 3 SE +/- 454.99, N = 3 34412 34953
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 13K 26K 39K 52K 65K SE +/- 443.96, N = 15 SE +/- 567.54, N = 7 61363 61162
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 8K 16K 24K 32K 40K SE +/- 302.37, N = 15 SE +/- 483.46, N = 3 35317 35422
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 14K 28K 42K 56K 70K SE +/- 591.53, N = 7 SE +/- 885.61, N = 3 63514 63529
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 9K 18K 27K 36K 45K SE +/- 626.71, N = 12 SE +/- 252.89, N = 3 40657 39794
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 16K 32K 48K 64K 80K SE +/- 506.90, N = 3 SE +/- 855.02, N = 15 72172 73367
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 100 200 300 400 500 SE +/- 3.71, N = 3 SE +/- 2.33, N = 3 466 478
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 110 220 330 440 550 SE +/- 5.26, N = 5 SE +/- 2.33, N = 3 501 489
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 120 240 360 480 600 SE +/- 4.36, N = 15 SE +/- 4.85, N = 15 569 570
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 1500 3000 4500 6000 7500 SE +/- 83.68, N = 3 SE +/- 15.17, N = 3 6909 6804
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 3K 6K 9K 12K 15K SE +/- 112.81, N = 9 SE +/- 180.26, N = 3 13867 13807
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 1600 3200 4800 6400 8000 SE +/- 105.66, N = 3 SE +/- 118.69, N = 15 7402 7506
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 3K 6K 9K 12K 15K SE +/- 268.32, N = 12 SE +/- 235.77, N = 15 14836 15415
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 2K 4K 6K 8K 10K SE +/- 70.21, N = 3 SE +/- 26.21, N = 3 8732 8552
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 4K 8K 12K 16K 20K SE +/- 101.18, N = 3 SE +/- 156.23, N = 6 17204 16687
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 16 - Type: Process Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 13.11 13.36 1. (CC) gcc options: -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 32 - Type: Process Linux 6.8 Linux 6.9-rc2 7 14 21 28 35 SE +/- 0.27, N = 3 SE +/- 0.39, N = 3 24.75 27.99 1. (CC) gcc options: -lpthread
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. The system/openssl test profiles relies on benchmarking the system/OS-supplied openssl binary rather than the pts/openssl test profile that uses the locally-built OpenSSL for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL Algorithm: SHA256 Linux 6.8 Linux 6.9-rc2 20000M 40000M 60000M 80000M 100000M SE +/- 767626108.54, N = 3 SE +/- 717052715.91, N = 12 84678420667 81678284176 1. OpenSSL 3.0.10 1 Aug 2023 (Library: OpenSSL 3.0.10 1 Aug 2023)
OpenBenchmarking.org byte/s, More Is Better OpenSSL Algorithm: SHA512 Linux 6.8 Linux 6.9-rc2 6000M 12000M 18000M 24000M 30000M SE +/- 92272355.83, N = 3 SE +/- 252323355.66, N = 3 29940735740 29665499053 1. OpenSSL 3.0.10 1 Aug 2023 (Library: OpenSSL 3.0.10 1 Aug 2023)
OpenBenchmarking.org sign/s, More Is Better OpenSSL Algorithm: RSA4096 Linux 6.8 Linux 6.9-rc2 8K 16K 24K 32K 40K SE +/- 46.27, N = 3 SE +/- 90.23, N = 3 38204.6 38147.4 1. OpenSSL 3.0.10 1 Aug 2023 (Library: OpenSSL 3.0.10 1 Aug 2023)
OpenBenchmarking.org verify/s, More Is Better OpenSSL Algorithm: RSA4096 Linux 6.8 Linux 6.9-rc2 300K 600K 900K 1200K 1500K SE +/- 19064.62, N = 3 SE +/- 833.27, N = 3 1358616.4 1396904.0 1. OpenSSL 3.0.10 1 Aug 2023 (Library: OpenSSL 3.0.10 1 Aug 2023)
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 12.09 11.76 1. (CXX) g++ options: -O3 -lm
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB mariadb-slap 11.5 Clients: 1024 Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 1.23, N = 9 SE +/- 1.41, N = 6 34 36 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O3 -shared -lrt -lpthread -lz -ldl -lm -lstdc++
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: AlexNet Linux 6.8 Linux 6.9-rc2 4 8 12 16 20 SE +/- 0.25, N = 12 SE +/- 0.25, N = 3 17.23 18.28
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 16 - Model: AlexNet Linux 6.8 Linux 6.9-rc2 50 100 150 200 250 SE +/- 1.58, N = 15 SE +/- 1.89, N = 15 213.35 210.71
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 32 - Model: AlexNet Linux 6.8 Linux 6.9-rc2 80 160 240 320 400 SE +/- 2.86, N = 15 SE +/- 5.16, N = 15 346.10 341.97
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: AlexNet Linux 6.8 Linux 6.9-rc2 110 220 330 440 550 SE +/- 6.05, N = 3 SE +/- 5.17, N = 15 495.72 471.88
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 Linux 6.8 Linux 6.9-rc2 0.4275 0.855 1.2825 1.71 2.1375 SE +/- 0.03, N = 15 SE +/- 0.03, N = 15 1.88 1.90
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 256 - Model: AlexNet Linux 6.8 Linux 6.9-rc2 160 320 480 640 800 SE +/- 9.01, N = 3 SE +/- 6.62, N = 7 723.16 711.74
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 16 - Model: GoogLeNet Linux 6.8 Linux 6.9-rc2 13 26 39 52 65 SE +/- 1.31, N = 12 SE +/- 1.24, N = 12 58.50 58.83
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 16 - Model: ResNet-50 Linux 6.8 Linux 6.9-rc2 6 12 18 24 30 SE +/- 0.30, N = 3 SE +/- 0.28, N = 15 24.12 24.80
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 32 - Model: GoogLeNet Linux 6.8 Linux 6.9-rc2 20 40 60 80 100 SE +/- 1.27, N = 12 SE +/- 1.10, N = 15 96.31 97.53
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 32 - Model: ResNet-50 Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.48, N = 15 SE +/- 0.42, N = 15 35.30 34.86
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: GoogLeNet Linux 6.8 Linux 6.9-rc2 30 60 90 120 150 SE +/- 0.94, N = 3 SE +/- 1.70, N = 15 135.18 128.71
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 Linux 6.8 Linux 6.9-rc2 11 22 33 44 55 SE +/- 0.54, N = 12 SE +/- 0.42, N = 3 47.03 47.96
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 256 - Model: GoogLeNet Linux 6.8 Linux 6.9-rc2 40 80 120 160 200 SE +/- 2.20, N = 12 SE +/- 1.95, N = 3 202.37 203.63
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 256 - Model: ResNet-50 Linux 6.8 Linux 6.9-rc2 16 32 48 64 80 SE +/- 0.38, N = 3 SE +/- 0.47, N = 3 70.10 70.53
Google Draco Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Lion Linux 6.8 Linux 6.9-rc2 1300 2600 3900 5200 6500 SE +/- 40.84, N = 3 SE +/- 29.28, N = 3 6129 6054 1. (CXX) g++ options: -O3
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MMAP Linux 6.8 Linux 6.9-rc2 1400 2800 4200 5600 7000 SE +/- 70.88, N = 14 SE +/- 66.09, N = 15 6169.36 6427.33 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: NUMA Linux 6.8 Linux 6.9-rc2 110 220 330 440 550 SE +/- 4.27, N = 3 SE +/- 5.89, N = 4 482.62 487.60 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pipe Linux 6.8 Linux 6.9-rc2 7M 14M 21M 28M 35M SE +/- 749025.19, N = 15 SE +/- 2106194.07, N = 15 30377557.97 29814480.67 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Poll Linux 6.8 Linux 6.9-rc2 2M 4M 6M 8M 10M SE +/- 37074.52, N = 3 SE +/- 30731.99, N = 3 10389186.02 10232390.41 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Zlib Linux 6.8 Linux 6.9-rc2 2K 4K 6K 8K 10K SE +/- 25.13, N = 3 SE +/- 124.93, N = 3 9670.37 9446.07 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MEMFD Linux 6.8 Linux 6.9-rc2 150 300 450 600 750 SE +/- 14.68, N = 15 SE +/- 5.94, N = 8 688.26 688.76 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mutex Linux 6.8 Linux 6.9-rc2 6M 12M 18M 24M 30M SE +/- 281178.86, N = 3 SE +/- 294297.65, N = 3 28490133.28 28934422.16 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Crypto Linux 6.8 Linux 6.9-rc2 30K 60K 90K 120K 150K SE +/- 1720.03, N = 15 SE +/- 2039.96, N = 15 143967.24 147978.33 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Malloc Linux 6.8 Linux 6.9-rc2 40M 80M 120M 160M 200M SE +/- 201545.00, N = 3 SE +/- 3675496.40, N = 12 163551396.40 161652700.09 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pthread Linux 6.8 Linux 6.9-rc2 8K 16K 24K 32K 40K SE +/- 49.70, N = 3 SE +/- 450.35, N = 15 39599.26 35919.64 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVL Tree Linux 6.8 Linux 6.9-rc2 200 400 600 800 1000 SE +/- 6.53, N = 3 SE +/- 2.13, N = 3 810.19 808.99 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: IO_uring Linux 6.8 Linux 6.9-rc2 700K 1400K 2100K 2800K 3500K SE +/- 31595.27, N = 3 SE +/- 43339.28, N = 15 3468957.59 3346567.80 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: SENDFILE Linux 6.8 Linux 6.9-rc2 400K 800K 1200K 1600K 2000K SE +/- 609.53, N = 3 SE +/- 5873.31, N = 3 1798203.12 1900682.08 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Stress Linux 6.8 Linux 6.9-rc2 40K 80K 120K 160K 200K SE +/- 569.84, N = 3 SE +/- 335.84, N = 3 202073.08 200671.70 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Semaphores Linux 6.8 Linux 6.9-rc2 30M 60M 90M 120M 150M SE +/- 1498916.25, N = 3 SE +/- 1330349.25, N = 3 154589461.19 150005776.47 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix Math Linux 6.8 Linux 6.9-rc2 80K 160K 240K 320K 400K SE +/- 4925.11, N = 15 SE +/- 9278.38, N = 12 384748.54 392750.09 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Floating Point Linux 6.8 Linux 6.9-rc2 6K 12K 18K 24K 30K SE +/- 632.70, N = 12 SE +/- 214.66, N = 11 29600.02 29139.60 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix 3D Math Linux 6.8 Linux 6.9-rc2 7K 14K 21K 28K 35K SE +/- 669.33, N = 15 SE +/- 437.41, N = 15 31442.69 31460.42 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Memory Copying Linux 6.8 Linux 6.9-rc2 5K 10K 15K 20K 25K SE +/- 178.25, N = 3 SE +/- 155.03, N = 10 21085.10 20605.39 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Shuffle Linux 6.8 Linux 6.9-rc2 120K 240K 360K 480K 600K SE +/- 1416.85, N = 3 SE +/- 1897.90, N = 3 576905.92 569593.00 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mixed Scheduler Linux 6.8 Linux 6.9-rc2 15K 30K 45K 60K 75K SE +/- 153.71, N = 3 SE +/- 281.29, N = 3 70407.96 70036.20 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Socket Activity Linux 6.8 Linux 6.9-rc2 10K 20K 30K 40K 50K SE +/- 79.93, N = 3 SE +/- 21.40, N = 3 47927.75 46123.32 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Wide Vector Math Linux 6.8 Linux 6.9-rc2 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 72950.92, N = 4 SE +/- 55746.31, N = 15 5874353.63 5908316.03 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Context Switching Linux 6.8 Linux 6.9-rc2 4M 8M 12M 16M 20M SE +/- 178479.09, N = 8 SE +/- 265846.30, N = 3 20343491.52 19390362.90 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Fused Multiply-Add Linux 6.8 Linux 6.9-rc2 40M 80M 120M 160M 200M SE +/- 6187723.80, N = 12 SE +/- 3824188.11, N = 15 204365411.59 200468629.32 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Floating Point Linux 6.8 Linux 6.9-rc2 50K 100K 150K 200K 250K SE +/- 5616.14, N = 12 SE +/- 8579.53, N = 12 234669.74 230503.34 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc C String Functions Linux 6.8 Linux 6.9-rc2 20M 40M 60M 80M 100M SE +/- 2138024.53, N = 13 SE +/- 1504028.14, N = 15 80643523.24 85035507.55 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc Qsort Data Sorting Linux 6.8 Linux 6.9-rc2 400 800 1200 1600 2000 SE +/- 4.40, N = 3 SE +/- 4.16, N = 3 1866.58 1858.70 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: System V Message Passing Linux 6.8 Linux 6.9-rc2 4M 8M 12M 16M 20M SE +/- 189741.33, N = 15 SE +/- 35075.39, N = 3 17859304.38 18740350.56 1. (CXX) g++ options: -lm -latomic -lc -lcrypt -ldl -lEGL -lGLESv2 -ljpeg -lpthread -lrt -lsctp -lz
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: BMW27 - Compute: CPU-Only Linux 6.8 Linux 6.9-rc2 5 10 15 20 25 SE +/- 0.33, N = 14 SE +/- 0.08, N = 3 21.82 22.02
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Junkshop - Compute: CPU-Only Linux 6.8 Linux 6.9-rc2 11 22 33 44 55 SE +/- 0.47, N = 15 SE +/- 0.40, N = 15 45.60 46.56
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Classroom - Compute: CPU-Only Linux 6.8 Linux 6.9-rc2 12 24 36 48 60 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 53.98 53.82
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Fishy Cat - Compute: CPU-Only Linux 6.8 Linux 6.9-rc2 9 18 27 36 45 SE +/- 0.38, N = 15 SE +/- 0.44, N = 4 38.09 39.37
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Barbershop - Compute: CPU-Only Linux 6.8 Linux 6.9-rc2 60 120 180 240 300 SE +/- 6.20, N = 8 SE +/- 6.67, N = 9 271.99 279.74
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Pabellon Barcelona - Compute: CPU-Only Linux 6.8 Linux 6.9-rc2 20 40 60 80 100 SE +/- 0.83, N = 5 SE +/- 0.61, N = 15 76.20 75.56
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.55, N = 3 119.92 121.93 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 70 140 210 280 350 SE +/- 0.84, N = 3 SE +/- 1.30, N = 3 307.74 302.77 MIN: 183.58 / MAX: 728.14 MIN: 181.55 / MAX: 894.73 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 100 200 300 400 500 SE +/- 2.33, N = 3 SE +/- 1.20, N = 3 451.13 466.45 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 20 40 60 80 100 SE +/- 0.43, N = 3 SE +/- 0.20, N = 3 81.90 79.23 MIN: 49.43 / MAX: 345.35 MIN: 50.1 / MAX: 290.06 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Detection FP32 - Device: CPU Linux 6.8 Linux 6.9-rc2 100 200 300 400 500 SE +/- 3.09, N = 3 SE +/- 3.72, N = 3 445.90 467.79 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Detection FP32 - Device: CPU Linux 6.8 Linux 6.9-rc2 20 40 60 80 100 SE +/- 0.57, N = 3 SE +/- 0.62, N = 3 82.87 79.00 MIN: 51.69 / MAX: 529.56 MIN: 49.24 / MAX: 319.95 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 600 1200 1800 2400 3000 SE +/- 16.72, N = 3 SE +/- 25.96, N = 3 2530.04 2656.42 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 14.58 13.89 MIN: 9.5 / MAX: 109.83 MIN: 9.12 / MAX: 90.57 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 70 140 210 280 350 SE +/- 0.75, N = 3 SE +/- 0.82, N = 3 331.23 330.73 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 70 140 210 280 350 SE +/- 0.74, N = 3 SE +/- 0.81, N = 3 337.21 337.78 MIN: 261.24 / MAX: 615.09 MIN: 253.21 / MAX: 781.06 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 2K 4K 6K 8K 10K SE +/- 23.83, N = 3 SE +/- 26.72, N = 3 11260.98 11295.92 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 9.92 9.89 MIN: 7.45 / MAX: 33.82 MIN: 7.63 / MAX: 36.57 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 200 400 600 800 1000 SE +/- 9.82, N = 3 SE +/- 6.69, N = 3 1018.47 1051.30 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 8 16 24 32 40 SE +/- 0.35, N = 3 SE +/- 0.23, N = 3 36.28 35.14 MIN: 24.78 / MAX: 205.05 MIN: 25.25 / MAX: 437.27 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 1100 2200 3300 4400 5500 SE +/- 9.46, N = 3 SE +/- 18.34, N = 3 5067.40 5052.71 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 22.06 22.12 MIN: 15.51 / MAX: 86.25 MIN: 15.73 / MAX: 71.5 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 4K 8K 12K 16K 20K SE +/- 194.69, N = 3 SE +/- 107.94, N = 3 16147.57 16349.05 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 2 4 6 8 10 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 6.85 6.76 MIN: 4.47 / MAX: 51.04 MIN: 4.36 / MAX: 70.91 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 3K 6K 9K 12K 15K SE +/- 132.85, N = 3 SE +/- 25.09, N = 3 15532.54 15589.59 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 2 4 6 8 10 SE +/- 0.07, N = 3 SE +/- 0.00, N = 3 7.17 7.14 MIN: 5.78 / MAX: 54.3 MIN: 5.79 / MAX: 44.66 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 300 600 900 1200 1500 SE +/- 1.59, N = 3 SE +/- 5.16, N = 3 1446.42 1442.21 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.28, N = 3 77.34 77.57 MIN: 56.48 / MAX: 158.67 MIN: 57.16 / MAX: 148.67 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 150 300 450 600 750 SE +/- 5.75, N = 3 SE +/- 3.16, N = 3 671.66 673.46 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 12 24 36 48 60 SE +/- 0.47, N = 3 SE +/- 0.25, N = 3 54.98 54.83 MIN: 33.9 / MAX: 423.36 MIN: 34.71 / MAX: 375.04 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 6K 12K 18K 24K 30K SE +/- 285.82, N = 4 SE +/- 155.42, N = 3 26732.88 27790.24 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Weld Porosity Detection FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 0.9113 1.8226 2.7339 3.6452 4.5565 SE +/- 0.05, N = 4 SE +/- 0.01, N = 3 4.05 3.89 MIN: 2.67 / MAX: 52.43 MIN: 2.64 / MAX: 47.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 1400 2800 4200 5600 7000 SE +/- 18.88, N = 3 SE +/- 23.77, N = 3 6336.78 6306.42 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 17.52 17.58 MIN: 11.97 / MAX: 108.98 MIN: 12.03 / MAX: 66.23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 1000 2000 3000 4000 5000 SE +/- 54.03, N = 3 SE +/- 29.05, N = 3 4621.32 4600.95 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 6 12 18 24 30 SE +/- 0.17, N = 3 SE +/- 0.20, N = 3 23.68 23.97 MIN: 11.64 / MAX: 175.69 MIN: 11.24 / MAX: 358.1 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 800 1600 2400 3200 4000 SE +/- 3.14, N = 3 SE +/- 10.89, N = 3 3626.41 3608.21 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 30.85 31.01 MIN: 24.82 / MAX: 82.03 MIN: 24.42 / MAX: 90.86 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 2K 4K 6K 8K 10K SE +/- 20.55, N = 3 SE +/- 28.95, N = 3 11638.51 11564.78 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 9.59 9.65 MIN: 7.62 / MAX: 27.98 MIN: 7.62 / MAX: 34.18 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 16K 32K 48K 64K 80K SE +/- 711.41, N = 15 SE +/- 1055.18, N = 3 67789.25 73754.74 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU Linux 6.8 Linux 6.9-rc2 0.1755 0.351 0.5265 0.702 0.8775 SE +/- 0.01, N = 15 SE +/- 0.03, N = 3 0.78 0.71 MIN: 0.3 / MAX: 138.08 MIN: 0.32 / MAX: 74.71 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 800 1600 2400 3200 4000 SE +/- 10.67, N = 3 SE +/- 8.36, N = 3 3777.56 3704.38 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 29.61 30.20 MIN: 25.51 / MAX: 92.13 MIN: 22.08 / MAX: 107.87 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 30K 60K 90K 120K 150K SE +/- 1455.69, N = 15 SE +/- 1149.56, N = 15 121702.12 119735.57 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Linux 6.8 Linux 6.9-rc2 0.0855 0.171 0.2565 0.342 0.4275 SE +/- 0.01, N = 15 SE +/- 0.00, N = 15 0.38 0.38 MIN: 0.23 / MAX: 46.5 MIN: 0.24 / MAX: 50.65 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Overwrite Linux 6.8 Linux 6.9-rc2 80K 160K 240K 320K 400K SE +/- 881.98, N = 3 SE +/- 3053.81, N = 3 388010 391968 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Random Read Linux 6.8 Linux 6.9-rc2 80M 160M 240M 320M 400M SE +/- 1573786.50, N = 3 SE +/- 1447290.34, N = 3 360498518 361586083 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Update Random Linux 6.8 Linux 6.9-rc2 80K 160K 240K 320K 400K SE +/- 1470.46, N = 3 SE +/- 1522.72, N = 3 359825 361502 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Read Random Write Random Linux 6.8 Linux 6.9-rc2 500K 1000K 1500K 2000K 2500K SE +/- 17412.72, N = 3 SE +/- 22543.45, N = 5 2120844 2068588 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Speedb Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Fill Linux 6.8 Linux 6.9-rc2 70K 140K 210K 280K 350K SE +/- 2148.02, N = 3 SE +/- 3492.28, N = 3 326499 322816 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Read Linux 6.8 Linux 6.9-rc2 80M 160M 240M 320M 400M SE +/- 1002006.26, N = 3 SE +/- 4146957.10, N = 3 380419389 375837479 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Update Random Linux 6.8 Linux 6.9-rc2 60K 120K 180K 240K 300K SE +/- 3488.80, N = 3 SE +/- 3235.16, N = 3 292180 293414 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Sequential Fill Linux 6.8 Linux 6.9-rc2 70K 140K 210K 280K 350K SE +/- 1183.38, N = 3 SE +/- 1955.03, N = 3 327637 330167 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Fill Sync Linux 6.8 Linux 6.9-rc2 60K 120K 180K 240K 300K SE +/- 1490.88, N = 3 SE +/- 3603.61, N = 3 257446 259603 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Read Random Write Random Linux 6.8 Linux 6.9-rc2 300K 600K 900K 1200K 1500K SE +/- 8674.90, N = 11 SE +/- 13609.90, N = 3 1181758 1175562 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.
Llama.cpp Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-13b.Q4_0.gguf Linux 6.8 Linux 6.9-rc2 0.3443 0.6886 1.0329 1.3772 1.7215 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 1.48 1.53 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-70b-chat.Q5_0.gguf Linux 6.8 Linux 6.9-rc2 0.0833 0.1666 0.2499 0.3332 0.4165 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.36 0.37 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
Llamafile Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.6 Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 1.0305 2.061 3.0915 4.122 5.1525 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 4.50 4.58
OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.6 Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU Linux 6.8 Linux 6.9-rc2 0.1778 0.3556 0.5334 0.7112 0.889 SE +/- 0.01, N = 4 SE +/- 0.00, N = 3 0.79 0.77
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.38.2 VGR Performance Metric Linux 6.8 Linux 6.9-rc2 900K 1800K 2700K 3600K 4500K 3906839 3991802 1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 6.0 Mode: CPU Linux 6.8 Linux 6.9-rc2 13K 26K 39K 52K 65K SE +/- 530.58, N = 15 SE +/- 557.85, N = 3 59852 58995
Linux 6.8 Processor: 2 x Intel Xeon Max 9480 @ 3.50GHz (112 Cores / 224 Threads), Motherboard: Supermicro SYS-221H-TNR X13DEM v1.10 (1.3 BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 7682GB INTEL SSDPF2KX076TZ, Graphics: ASPEED, Network: 2 x Broadcom BCM57508 NetXtreme-E 10Gb/25Gb/40Gb/50Gb/100Gb/200Gb
OS: Ubuntu 23.10, Kernel: 6.8.0-060800-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Compiler: