Intel Core i9-7980XE testing with an ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
Result identifiers: 1, 2, 3, 4, 5
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08
Graphics Notes: GLAMOR
Python Notes: Python 3.8.6
Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads), Motherboard: ASUS PRIME X299-A (2002 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 16GB, Disk: Samsung SSD 970 EVO 500GB, Graphics: Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz), Audio: Realtek ALC1220, Monitor: G237HL, Network: Intel I219-V
OS: Ubuntu 20.10, Kernel: 5.8.0-36-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 20.2.6 (LLVM 11.0.0), Vulkan: 1.2.131, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
lzbench 1.8, Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
Runs (chart order): 4, 2, 1, 5, 3
Results: 115, 115, 115, 114, 114
SE +/- 0.33, 0.33 (N = 3 each; shown for two of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
Runs (chart order): 4, 3, 2, 1, 5
Results: 496, 496, 495, 495, 494
SE +/- 3.67, 2.67, 2.00, 2.19, 3.67 (N = 3 each)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
Runs (chart order): 3, 4, 5, 2, 1
Results: 1772, 1771, 1770, 1767, 1767
SE +/- 1.76, 1.73, 3.06, 6.00 (N = 3 each; shown for four of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Crush 0 - Process: Compression (MB/s, More Is Better)
Runs (chart order): 1, 5, 3, 4, 2
Results: 111, 110, 110, 109, 109
SE +/- 0.67, 1.00, 0.58 (N = 3 each; shown for three of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
Runs (chart order): 3, 2, 5, 4, 1
Results: 484, 484, 483, 483, 483
SE +/- 0.88, 1.53, 1.86, 1.15, 0.67 (N = 3 each)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
Runs (chart order): 3, 5, 2, 4, 1
Results: 472, 470, 470, 469, 469
SE +/- 1.20, 0.88, 1.67, 1.33, 0.88 (N = 3 each)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
IOR IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.
IOR 3.3.0, Block Size: 64MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Runs (chart order): 5, 1, 4, 3, 2
Results: 493.41, 492.85, 490.66, 489.25, 480.71
SE +/- 6.10, 3.92, 4.68, 4.79, 2.46 (N = 3 each)
MIN / MAX: 246.91 / 1088.06; 402.07 / 1345.65; 348.43 / 1036.5; 374.86 / 1033.08; 301.17 / 1079.47
1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR 3.3.0, Block Size: 32MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Runs (chart order): 5, 1, 4, 2, 3
Results: 516.94, 493.82, 484.50, 474.62, 470.37
SE +/- 4.67 (N = 3), 3.89 (N = 3), 4.09 (N = 12), 4.72 (N = 8), 5.73 (N = 3)
MIN / MAX: 413.96 / 1183.64; 245.23 / 1245.29; 196.66 / 1538.21; 202.58 / 1355.46; 197.76 / 1370.54
1. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR 3.3.0, Block Size: 16MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Runs (chart order): 5, 4, 2, 1, 3
Results: 506.02, 500.39, 484.06, 481.36, 442.47
SE +/- 7.12 (N = 3), 1.65 (N = 3), 5.80 (N = 15), 6.27 (N = 3), 6.54 (N = 3)
MIN / MAX: 308.33 / 1247.46; 315.3 / 1210.23; 224.25 / 1505.62; 316.02 / 1504.02; 217.24 / 1379.75
1. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR 3.3.0, Block Size: 8MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Runs (chart order): 2, 4, 1, 5, 3
Results: 530.83, 527.45, 525.44, 510.05, 354.01
SE +/- 5.42 (N = 15), 4.21 (N = 14), 25.39 (N = 15), 6.12 (N = 5), 4.96 (N = 15)
MIN / MAX: 248.03 / 1386.96; 251.24 / 1378.27; 290.23 / 1447.59; 222.93 / 1266.18; 189.98 / 1385.03
1. (CC) gcc options: -O2 -lm -pthread -lmpi
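One run in the 8MB IOR result sits well below the other four. A minimal Python sketch of how such a run could be flagged is given here; it assumes the result values are listed in the same order as the run labels, and both the helper names and the 10% threshold are illustrative choices, not anything the Phoronix Test Suite applies.

```python
from statistics import median

# IOR 3.3.0, Block Size: 8MB (MB/s), keyed by run identifier.
# Assumption: values are listed in the same order as the run labels.
ior_8mb = {2: 530.83, 4: 527.45, 1: 525.44, 5: 510.05, 3: 354.01}


def deviation_from_median(results):
    """Return each run's percentage deviation from the median of all runs."""
    mid = median(results.values())
    return {run: 100.0 * (value - mid) / mid for run, value in results.items()}


def flag_outliers(results, threshold_pct=10.0):
    """Return the sorted runs whose deviation exceeds an arbitrary threshold."""
    deviations = deviation_from_median(results)
    return sorted(run for run, dev in deviations.items() if abs(dev) > threshold_pct)


if __name__ == "__main__":
    for run, dev in sorted(deviation_from_median(ior_8mb).items()):
        print(f"run {run}: {dev:+.1f}% vs median")
    print("flagged runs:", flag_outliers(ior_8mb))
```

The median is used as the reference rather than the mean so that a single slow run does not shift the baseline it is compared against.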
lzbench lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
lzbench 1.8, Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
Runs (chart order): 4, 2, 5, 3, 1
Results: 1803, 1801, 1800, 1795, 1792
SE +/- 3.38, 3.33, 3.71, 1.86 (N = 3 each; shown for four of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
Runs (chart order): 1, 4, 3, 2, 5
Results: 613, 612, 612, 612, 611
SE +/- 1.00, 2.73, 1.20, 0.67 (N = 3 each; shown for four of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
Runs (chart order): 5, 3, 2, 1, 4
Results: 189, 189, 189, 189, 188
SE +/- 0.67, 0.33 (N = 3 each; shown for two of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
Runs (chart order): 2, 1, 5, 3, 4
Results: 711, 711, 709, 709, 708
SE +/- 0.88 (N = 3; shown for one of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
Runs (chart order): 3, 2, 5, 4, 1
Results: 238, 238, 237, 237, 237
SE +/- 0.58 (N = 3; shown for one of the five runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Ngspice Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.
Ngspice 34, Circuit: C2670 (Seconds, Fewer Is Better)
Runs (chart order): 2, 3, 5, 4, 1
Results: 151.98, 153.06, 153.10, 154.21, 155.60
SE +/- 1.75, 1.44, 1.61, 2.23, 2.33 (N = 3 each)
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Ngspice 34, Circuit: C7552 (Seconds, Fewer Is Better)
Runs (chart order): 2, 1, 5, 4, 3
Results: 137.58, 138.66, 139.02, 139.03, 139.57
SE +/- 0.10, 0.78, 2.04, 1.21, 1.23 (N = 3 each)
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Etcpak Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.
Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better)
Runs (chart order): 2, 3, 5, 4, 1
Results: 1387.30, 1385.84, 1377.89, 1376.34, 1365.39
SE +/- 3.63, 4.38, 2.52, 3.03, 13.09 (N = 3 each)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7, Configuration: ETC1 (Mpx/s, More Is Better)
Runs (chart order): 1, 3, 4, 2, 5
Results: 315.62, 314.66, 314.27, 313.70, 313.69
SE +/- 0.30, 1.01, 0.49, 0.85, 0.49 (N = 3 each)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better)
Runs (chart order): 1, 3, 5, 2, 4
Results: 184.00, 183.42, 183.12, 182.98, 182.72
SE +/- 0.08, 0.16, 0.23, 0.28, 0.12 (N = 3 each)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7, Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
Runs (chart order): 5, 1, 2, 3, 4
Results: 303.89, 303.57, 303.51, 303.11, 290.73
SE +/- 0.09 (N = 3), 0.86 (N = 3), 0.32 (N = 3), 0.60 (N = 3), 7.07 (N = 15)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
JPEG XL The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.
JPEG XL 0.3.1, Input: PNG - Encode Speed: 5 (MP/s, More Is Better)
Runs (chart order): 5, 3, 4, 2, 1
Results: 53.69, 53.44, 53.11, 53.03, 52.50
SE +/- 0.13, 0.33, 0.16, 0.15, 0.14 (N = 3 each)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1, Input: PNG - Encode Speed: 7 (MP/s, More Is Better)
Runs (chart order): 3, 2, 4, 5, 1
Results: 7.93, 7.93, 7.92, 7.91, 7.86
SE +/- 0.02, 0.01, 0.02, 0.01, 0.02 (N = 3 each)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1, Input: PNG - Encode Speed: 8 (MP/s, More Is Better)
Runs (chart order): 1, 5, 4, 3, 2
Results: 0.71, 0.70, 0.70, 0.70, 0.70
SE +/- 0.00, 0.00, 0.00, 0.00, 0.01 (N = 3 each)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1, Input: JPEG - Encode Speed: 5 (MP/s, More Is Better)
Runs (chart order): 5, 2, 3, 4, 1
Results: 49.13, 49.08, 48.89, 48.85, 48.64
SE +/- 0.09, 0.26, 0.03, 0.07, 0.07 (N = 3 each)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1, Input: JPEG - Encode Speed: 7 (MP/s, More Is Better)
Runs (chart order): 3, 2, 5, 4, 1
Results: 49.41, 49.22, 49.18, 48.85, 48.50
SE +/- 0.07, 0.28, 0.14, 0.19, 0.11 (N = 3 each)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1, Input: JPEG - Encode Speed: 8 (MP/s, More Is Better)
Runs (chart order): 3, 5, 4, 2, 1
Results: 23.43, 23.37, 23.37, 23.25, 23.19
SE +/- 0.09, 0.14, 0.09, 0.08, 0.05 (N = 3 each)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Decoding The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to a PNG output file; the pts/jpegxl test is for encode performance. Learn more via the OpenBenchmarking.org test page.
JPEG XL Decoding 0.3.1, CPU Threads: 1 (MP/s, More Is Better)
Runs (chart order): 2, 5, 3, 4, 1
Results: 32.39, 32.30, 32.26, 32.17, 31.81
SE +/- 0.04, 0.08, 0.06, 0.04, 0.03 (N = 3 each)

JPEG XL Decoding 0.3.1, CPU Threads: All (MP/s, More Is Better)
Runs (chart order): 4, 5, 2, 3, 1
Results: 178.61, 178.47, 178.23, 177.88, 175.22
SE +/- 0.45, 0.19, 0.25, 0.10, 0.25 (N = 3 each)
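The single-threaded and all-threads decode results can be combined into a rough multi-thread scaling figure. The following Python sketch does that arithmetic per run. It assumes the result values are listed in the same order as the run labels and that "All" means the 36 hardware threads of the Core i9-7980XE; the function name is hypothetical.

```python
# JPEG XL Decoding 0.3.1 results (MP/s), keyed by run identifier.
# Assumption: values are listed in the same order as the run labels.
single_thread = {1: 31.81, 2: 32.39, 3: 32.26, 4: 32.17, 5: 32.30}
all_threads = {1: 175.22, 2: 178.23, 3: 177.88, 4: 178.61, 5: 178.47}

# Assumption: "CPU Threads: All" uses all 36 hardware threads (18 cores / 36 threads).
HARDWARE_THREADS = 36


def scaling(single, multi, threads):
    """Return {run: (speedup, parallel efficiency)} for multi vs single thread."""
    out = {}
    for run in sorted(single):
        speedup = multi[run] / single[run]
        out[run] = (speedup, speedup / threads)
    return out


if __name__ == "__main__":
    for run, (speedup, eff) in scaling(single_thread, all_threads, HARDWARE_THREADS).items():
        print(f"run {run}: speedup {speedup:.2f}x, efficiency {eff:.1%}")
```

Efficiency here is simply speedup divided by thread count, so it treats SMT threads as full cores and will understate per-core scaling.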
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
WebP2 Image Encode 20210126, Encode Settings: Default (Seconds, Fewer Is Better)
Runs (chart order): 3, 4, 5, 2, 1
Results: 3.325, 3.347, 3.350, 3.363, 3.369
SE +/- 0.027, 0.049, 0.057, 0.012, 0.034 (N = 3 each)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode 20210126, Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better)
Runs (chart order): 4, 1, 2, 5, 3
Results: 154.88, 155.74, 156.55, 156.75, 157.53
SE +/- 0.66, 1.77, 0.15, 0.73, 0.29 (N = 3 each)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode 20210126, Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
Runs (chart order): 4, 2, 1, 3, 5
Results: 279.57, 280.64, 282.05, 282.58, 283.33
SE +/- 1.22, 0.54, 1.07, 2.02, 2.02 (N = 3 each)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode 20210126, Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better)
Runs (chart order): 3, 2, 1, 5, 4
Results: 9.075, 9.110, 9.195, 9.216, 9.309
SE +/- 0.024, 0.020, 0.105, 0.149, 0.141 (N = 3 each)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode 20210126, Encode Settings: Quality 100, Lossless Compression (Seconds, Fewer Is Better)
Runs (chart order): 5, 2, 1, 4, 3
Results: 590.21, 590.29, 593.07, 593.53, 596.41
SE +/- 0.58, 0.45, 1.66, 1.92, 0.58 (N = 3 each)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
Google SynthMark 20201109, Test: VoiceMark_100 (Voices, More Is Better)
Runs (chart order): 2, 1, 3, 5, 4
Results: 555.30, 553.64, 553.55, 552.79, 548.56
SE +/- 1.36, 2.84, 3.76, 1.67, 2.23 (N = 3 each)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
IOR 3.3.0, Block Size: 4MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Runs (chart order): 1, 4, 5, 3, 2
Results: 887.94, 493.98, 478.01, 467.76, 466.30
SE +/- 1.87 (N = 3), 6.63 (N = 15), 5.81 (N = 6), 45.69 (N = 12), 5.42 (N = 15)
MIN / MAX: 629.82 / 1334.23; 240.44 / 1344.87; 236.85 / 1266.85; 188.74 / 1435.44; 212.95 / 1329.58
1. (CC) gcc options: -O2 -lm -pthread -lmpi
Gcrypt Library Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and measures the time to run the benchmark command with a cipher/mac/hash repetition count set to 50, as a simple, high-level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
Runs (chart order): 3, 4, 2, 1, 5
Results: 211.79, 211.90, 212.06, 212.80, 212.96
SE +/- 0.06, 0.49, 0.34, 0.63, 0.29 (N = 3 each)
1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
QuantLib 1.21 (MFLOPS, More Is Better)
Runs (chart order): 2, 1, 5, 4, 3
Results: 2222.6, 2222.4, 2216.0, 2215.6, 2207.1
SE +/- 19.83 (N = 12), 17.98 (N = 13), 28.27 (N = 12), 21.07 (N = 12), 27.77 (N = 12)
1. (CXX) g++ options: -O3 -march=native -rdynamic
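The "SE +/-" and "N" figures can be used for a rough check of whether two runs differ by more than their run-to-run noise. The Python sketch below assumes that SE is the standard error of the mean (sample standard deviation divided by the square root of N) and that the SE entries are listed in the same order as the run labels; neither is stated in this result file. The helper names and the factor k = 2 are illustrative.

```python
import math


def std_from_se(se, n):
    """Recover the sample standard deviation from a standard error of the mean."""
    return se * math.sqrt(n)


def within_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """True if two means differ by less than k combined standard errors.

    k = 2.0 is an arbitrary, illustrative cutoff.
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) < k * combined


# QuantLib 1.21 (MFLOPS): first and last entries of the chart, as (mean, SE, N).
# Assumption: the SE entries follow the same order as the run labels.
run_2 = (2222.6, 19.83, 12)
run_3 = (2207.1, 27.77, 12)

if __name__ == "__main__":
    print("std dev, run 2:", round(std_from_se(run_2[1], run_2[2]), 2))
    print("within noise:", within_noise(run_2[0], run_2[1], run_3[0], run_3[1]))
```

Under these assumptions the 15.5 MFLOPS gap between the best and worst run is smaller than the combined standard error, so the ordering of the QuantLib runs should not be read as meaningful.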
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
Runs (chart order): 3, 4, 2, 5, 1
Results: 84.21, 84.27, 84.88, 85.20, 85.54
SE +/- 0.18, 0.12, 0.09, 0.08, 0.18 (N = 3 each)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Mobile Neural Network MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
Mobile Neural Network 1.1.1, Model: SqueezeNetV1.0 (ms, Fewer Is Better)
Runs (chart order): 1, 2, 5, 4, 3
Results: 7.108, 7.232, 7.272, 7.281, 7.355
SE +/- 0.091 (N = 5), 0.085 (N = 3), 0.007 (N = 3), 0.070 (N = 3), 0.110 (N = 3)
MIN / MAX: 6.52 / 7.68; 6.81 / 7.85; 7 / 7.67; 6.97 / 7.74; 6.8 / 7.87
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1, Model: resnet-v2-50 (ms, Fewer Is Better)
Runs (chart order): 1, 3, 5, 2, 4
Results: 37.18, 37.28, 37.34, 37.35, 37.61
SE +/- 0.32 (N = 5), 0.11 (N = 3), 0.08 (N = 3), 0.12 (N = 3), 0.11 (N = 3)
MIN / MAX: 35.67 / 38.06; 36.85 / 38.11; 36.95 / 37.85; 36.97 / 37.99; 37.16 / 38.09
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1, Model: MobileNetV2_224 (ms, Fewer Is Better)
Runs (chart order): 1, 3, 4, 5, 2
Results: 4.201, 4.348, 4.513, 4.546, 4.634
SE +/- 0.129 (N = 5), 0.205 (N = 3), 0.041 (N = 3), 0.008 (N = 3), 0.086 (N = 3)
MIN / MAX: 3.68 / 4.8; 3.62 / 4.88; 4.22 / 4.96; 4.28 / 4.91; 4.2 / 4.99
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1, Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
Runs (chart order): 2, 5, 4, 1, 3
Results: 2.618, 2.645, 2.662, 2.676, 2.686
SE +/- 0.019 (N = 3), 0.011 (N = 3), 0.010 (N = 3), 0.029 (N = 5), 0.045 (N = 3)
MIN / MAX: 2.5 / 2.92; 2.44 / 3.04; 2.48 / 2.98; 2.44 / 3.19; 2.42 / 3.11
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1, Model: inception-v3 (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2, 4, 5
Results: 44.16, 45.97, 48.33, 48.38, 48.64
SE +/- 1.80 (N = 5), 2.44 (N = 3), 0.10 (N = 3), 0.11 (N = 3), 0.04 (N = 3)
MIN / MAX: 40.29 / 51.28; 40.58 / 48.74; 47.9 / 50.26; 47.91 / 48.97; 48.25 / 49.28
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
ONNX Runtime 1.6, Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Runs (chart order): 3, 5, 4, 2, 1
Results: 557, 555, 555, 555, 553
SE +/- 2.25, 0.88, 1.64, 0.29, 1.09 (N = 3 each)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6, Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Runs (chart order): 4, 3, 5, 2, 1
Results: 659, 658, 652, 649, 630
SE +/- 13.34 (N = 12), 13.29 (N = 12), 11.09 (N = 12), 11.30 (N = 12), 5.46 (N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6, Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Runs (chart order): 5, 2, 4, 3, 1
Results: 145, 145, 144, 144, 143
SE +/- 0.17, 0.33, 0.44, 0.87, 0.44 (N = 3 each)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6, Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Runs (chart order): 3, 4, 5, 2, 1
Results: 9541, 9533, 9519, 9471, 9436
SE +/- 36.96, 46.43, 54.82, 30.21, 35.99 (N = 3 each)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6, Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Runs (chart order): 1, 4, 2, 3, 5
Results: 7531, 7457, 7249, 7080, 6939
SE +/- 57.46 (N = 3), 109.17 (N = 3), 133.82 (N = 12), 173.63 (N = 9), 212.04 (N = 12)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
TNN 0.2.3, Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
Runs (chart order): 2, 1, 4, 3, 5
Results: 360.92, 361.24, 361.43, 361.45, 361.49
SE +/- 0.27, 0.37, 0.38, 0.39, 0.31 (N = 3 each)
MIN / MAX: 356.72 / 372.66; 357.28 / 391.58; 356.24 / 380.38; 356.32 / 386.75; 357.11 / 384.74
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
Runs (chart order): 1, 3, 4, 2, 5
Results: 316.36, 317.14, 318.21, 319.05, 319.31
SE +/- 0.46, 0.62, 0.59, 0.57, 0.43 (N = 3 each)
MIN / MAX: 313.3 / 320.26; 314.18 / 323.54; 316.15 / 320.81; 313 / 340.64; 316.94 / 338.61
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
GROMACS This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
GROMACS 2021, Input: water_GMX50_bare (Ns Per Day, More Is Better)
Runs (chart order): 5, 4, 1, 3, 2
Results: 1.703, 1.700, 1.699, 1.698, 1.698
SE +/- 0.002, 0.003, 0.003, 0.002, 0.002 (N = 3 each)
1. (CXX) g++ options: -O3 -pthread
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
Runs (chart order): 1, 3, 2, 5, 4
Results: 2081.02, 2033.28, 2001.11, 1999.52, 1997.31
SE +/- 23.92 (N = 15), 51.69 (N = 12), 51.51 (N = 15), 46.79 (N = 15), 32.10 (N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
Runs (chart order): 2, 5, 3, 1, 4
Results: 2241.41, 2214.08, 2202.89, 2140.50, 2112.54
SE +/- 3.15 (N = 3), 28.36 (N = 4), 34.24 (N = 3), 10.76 (N = 3), 35.18 (N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
Runs (chart order): 3, 2, 4, 5, 1
Results: 47778.76, 47734.40, 47482.70, 47360.36, 47323.88
SE +/- 139.20, 7.96, 376.58, 44.69, 154.44 (N = 3 each)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resampling Benchmark (tConvolve); some previous ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
ASKAP 1.0, Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
Runs (chart order): 4, 5, 2, 1, 3
Results: 1821.82, 1821.36, 1820.90, 1815.85, 1813.56
SE +/- 0.61, 0.31, 0.35, 2.23, 2.62 (N = 3 each)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0, Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
Runs (chart order): 4, 3, 2, 1, 5
Results: 2633.32, 2633.11, 2622.75, 2618.53, 2609.75
SE +/- 4.51, 7.85, 4.46, 10.95, 11.98 (N = 3 each)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0, Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
Runs (chart order): 1, 5, 3, 2, 4
Results: 2458.07, 2442.85, 2428.72, 2420.91, 2420.64
SE +/- 19.92, 12.94, 32.33, 22.21, 12.71 (N = 3 each)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0, Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
Runs (chart order): 5, 4, 3, 2, 1
Results: 3973.97, 3973.97, 3973.97, 3973.97, 3973.97
SE +/- 0.00, 0.00, 0.00, 0.00, 0.00 (N = 3 each)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0, Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
Runs (chart order): 5, 4, 3, 2, 1
Results: 378.80, 378.79, 377.84, 377.36, 376.89
SE +/- 1.66, 0.00, 0.48, 0.82, 0.47 (N = 3 each)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Pennant Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.
Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
Runs (chart order): 2, 3, 4, 5, 1
Results: 52.67, 52.71, 52.77, 52.85, 53.14
SE +/- 0.13, 0.36, 0.21, 0.26, 0.12 (N = 3 each)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
Runs (chart order): 4, 3, 2, 5, 1
Results: 33.40, 33.56, 33.69, 33.94, 34.30
SE +/- 0.17, 0.13, 0.09, 0.29, 0.03 (N = 3 each)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code, making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.10, Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
Runs (chart order): 1, 3, 5, 2, 4
Results: 38.34, 38.49, 38.62, 39.05, 39.80
SE +/- 0.09, 0.23, 0.24, 0.45, 0.68 (N = 3 each)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
Chaos Group V-RAY 5, Mode: CPU (vsamples, More Is Better)
Runs (chart order): 2, 5, 3, 1, 4
Results: 17872, 17849, 17842, 17828, 17742
SE +/- 70.27, 84.18, 6.44, 47.06, 54.08 (N = 3 each)
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 5 4 2 3 1 0.2198 0.4396 0.6594 0.8792 1.099 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.977 0.976 0.976 0.975 0.975
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 5 2 3 4 1 0.2867 0.5734 0.8601 1.1468 1.4335 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.274 1.272 1.270 1.269 1.269
rav1e 0.4 - Speed: 10 (OpenBenchmarking.org)
Frames Per Second, More Is Better
Run  Result  SE +/-  N
3    2.694   0.005   3
4    2.679   0.005   3
1    2.677   0.006   3
5    2.676   0.007   3
2    2.675   0.012   3
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Scholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
FinanceBench 2016-07-25 - Benchmark: Repo OpenMP (OpenBenchmarking.org)
ms, Fewer Is Better
Run  Result    SE +/-  N
5    39275.55  20.70   3
4    39350.22  69.89   3
2    39461.82  76.80   3
3    39681.57  151.55  3
1    39726.16  210.55  3
1. (CXX) g++ options: -O3 -march=native -fopenmp
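The FinanceBench description above names a Black-Scholes-Merton process with an analytic European option engine. As a rough illustration of the kind of closed-form evaluation such an engine performs, the following Python sketch prices a European call and put. It is not FinanceBench's code: the function names `norm_cdf` and `bs_price` and the sample inputs (spot 100, strike 100, 5% rate, 20% volatility, one year) are assumptions chosen only for illustration.

```python
import math


def norm_cdf(x: float) -> float:
    """Standard normal cumulative distribution, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_price(spot: float, strike: float, rate: float, vol: float,
             years: float, kind: str = "call") -> float:
    """Closed-form Black-Scholes price of a European option (no dividends).

    All parameter names are hypothetical; rate and vol are annualised decimals.
    """
    sqrt_t = math.sqrt(years)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol * vol) * years) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    discount = math.exp(-rate * years)  # present value of 1 paid at expiry
    if kind == "call":
        return spot * norm_cdf(d1) - strike * discount * norm_cdf(d2)
    if kind == "put":
        return strike * discount * norm_cdf(-d2) - spot * norm_cdf(-d1)
    raise ValueError("kind must be 'call' or 'put'")


if __name__ == "__main__":
    # Illustrative inputs only; they do not come from the benchmark.
    print(bs_price(100.0, 100.0, 0.05, 0.20, 1.0, "call"))
    print(bs_price(100.0, 100.0, 0.05, 0.20, 1.0, "put"))
```

A benchmark of this kind would typically evaluate such a formula over a large batch of options in parallel; the sketch prices one option at a time to keep the arithmetic visible.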
Redis Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.
Redis 6.0.9 - Test: LPOP (OpenBenchmarking.org)
Requests Per Second, More Is Better
Run  Result      SE +/-    N
1    2698309.25  29192.48  3
3    1797918.46  19169.69  3
5    1770711.21  18070.29  3
2    1769937.88  2566.99   3
4    1761464.25  15108.16  3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
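In the Redis LPOP results above, run 1 stands well apart from the other four runs. One simple way to quantify that is to compare each run with the median of all runs. The Python sketch below does this using the reported values; the helper name `flag_outliers` and the 10% tolerance are assumptions made for illustration, not part of the Phoronix Test Suite.

```python
from statistics import median

# Requests per second per run, copied from the Redis 6.0.9 LPOP results.
lpop_rps = {
    "1": 2698309.25,
    "3": 1797918.46,
    "5": 1770711.21,
    "2": 1769937.88,
    "4": 1761464.25,
}


def flag_outliers(results: dict, tolerance: float = 0.10) -> dict:
    """Return {run: relative deviation from the median} for runs beyond tolerance.

    tolerance is a fraction (0.10 means 10%); the default is an arbitrary choice.
    """
    mid = median(results.values())
    deviations = {run: value / mid - 1.0 for run, value in results.items()}
    return {run: dev for run, dev in deviations.items() if abs(dev) > tolerance}


if __name__ == "__main__":
    for run, dev in flag_outliers(lpop_rps).items():
        print(f"run {run}: {dev:+.1%} relative to the median")
```

With these figures only run 1 is flagged, at roughly 52% above the median. The sketch says nothing about the cause; it only makes the size of the gap explicit.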
Redis 6.0.9 - Test: SADD (OpenBenchmarking.org)
Requests Per Second, More Is Better
Run  Result      SE +/-    N
5    2179924.50  11360.66  3
2    2168410.90  27569.90  5
3    2159575.21  34602.53  3
4    2158064.17  20189.91  3
1    2153944.42  18441.13  3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: LPUSH (OpenBenchmarking.org)
Requests Per Second, More Is Better
Run  Result      SE +/-    N
1    1712504.00  10755.06  3
4    1699000.00  10458.55  3
3    1695317.29  9034.55   3
5    1682254.45  10827.27  3
2    1642807.13  13174.46  3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: GET (OpenBenchmarking.org)
Requests Per Second, More Is Better
Run  Result      SE +/-    N
1    2539377.42  22779.82  3
5    2443276.75  23178.69  3
4    2442355.92  18752.77  3
3    2431811.58  17482.31  3
2    2415143.75  31640.05  3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: SET (OpenBenchmarking.org)
Requests Per Second, More Is Better
Run  Result      SE +/-    N
2    1952727.25  23318.24  3
5    1923099.42  21792.45  3
1    1912809.71  15512.42  3
4    1906086.42  30239.19  3
3    1889099.97  24702.46  4
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ParaView This test runs ParaView benchmarks: an open-source data analytics and visualization application. ParaView describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (OpenBenchmarking.org)
Frames / Sec, More Is Better
Run  Result  SE +/-  N
2    4.81    0.00    3
5    4.80    0.00    3
3    4.80    0.00    3
1    4.80    0.00    3
4    4.67    0.13    11
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (OpenBenchmarking.org)
MiPolys / Sec, More Is Better
Run  Result  SE +/-  N
2    481.83  0.39    3
1    481.69  0.37    3
3    481.42  0.27    3
5    481.35  0.34    3
4    468.46  12.92   11
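The "SE +/-" and "N" figures reported with each result can be used to judge whether a gap between two runs is large relative to run-to-run noise. The Python sketch below takes the ParaView Many Spheres MiPolys / Sec figures for runs 2 and 4 and expresses their difference in units of the combined standard error. It assumes that the reported SE is the standard error of the mean and that the two runs are independent; the function names are hypothetical.

```python
import math


def gap_in_standard_errors(mean_a: float, se_a: float,
                           mean_b: float, se_b: float) -> float:
    """Difference of two means divided by their combined standard error.

    Assumes independent runs, so the standard errors add in quadrature.
    """
    return (mean_a - mean_b) / math.hypot(se_a, se_b)


def sample_std_dev(se: float, n: int) -> float:
    """Recover the sample standard deviation from SE = sd / sqrt(n)."""
    return se * math.sqrt(n)


if __name__ == "__main__":
    # Run 2: 481.83 (SE 0.39, N = 3); run 4: 468.46 (SE 12.92, N = 11).
    print(gap_in_standard_errors(481.83, 0.39, 468.46, 12.92))
    print(sample_std_dev(12.92, 11))
```

For these inputs the gap comes out at roughly one combined standard error, which suggests that run 4's lower average may reflect its high trial-to-trial variance rather than a consistent slowdown. That reading depends on the assumptions stated above.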
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (OpenBenchmarking.org)
Frames / Sec, More Is Better
Run  Result  SE +/-  N
4    72.32   0.04    3
5    72.31   0.06    3
1    72.31   0.03    3
2    72.26   0.02    3
3    72.23   0.02    3
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (OpenBenchmarking.org)
MiVoxels / Sec, More Is Better
Run  Result   SE +/-  N
4    1157.11  0.60    3
1    1156.97  0.52    3
5    1156.87  0.98    3
2    1156.22  0.36    3
3    1155.67  0.26    3
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (OpenBenchmarking.org)
Frames / Sec, More Is Better
Run  Result  SE +/-  N
4    83.01   0.04    3
2    83.00   0.02    3
3    82.98   0.01    3
5    82.97   0.03    3
1    82.97   0.05    3
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (OpenBenchmarking.org)
MiPolys / Sec, More Is Better
Run  Result  SE +/-  N
4    865.03  0.39    3
2    864.94  0.16    3
3    864.80  0.14    3
1    864.68  0.51    3
5    864.65  0.27    3
IOR IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.
IOR 3.3.0 - Block Size: 2MB - Disk Target: Default Test Directory (OpenBenchmarking.org)
MB/s, More Is Better
Run  Result  SE +/-  N   MIN     MAX
1    592.90  5.79    3   431.01  1028.65
5    437.60  7.25    15  162.6   1036.24
3    429.36  7.93    15  220.22  1149.54
4    419.08  10.65   12  152.75  1057.83
2    396.56  4.65    3   214.64  1028.95
1. (CC) gcc options: -O2 -lm -pthread -lmpi
FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP (OpenBenchmarking.org)
ms, Fewer Is Better
Run  Result    SE +/-  N
2    55622.97  37.92   3
4    55656.12  86.41   3
3    55666.72  83.09   3
1    55767.16  86.69   3
5    55876.18  53.09   3
1. (CXX) g++ options: -O3 -march=native -fopenmp
1
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08
Graphics Notes: GLAMOR
Python Notes: Python 3.8.6
Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Testing initiated at 14 February 2021 12:11 by user phoronix.
2 Kernel, compiler, disk, processor, graphics, Python, and security notes: identical to run 1.
Testing initiated at 14 February 2021 18:03 by user phoronix.
3 Kernel, compiler, disk, processor, graphics, Python, and security notes: identical to run 1.
Testing initiated at 15 February 2021 05:28 by user phoronix.
4 Kernel, compiler, disk, processor, graphics, Python, and security notes: identical to run 1.
Testing initiated at 15 February 2021 12:16 by user phoronix.
5 Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads), Motherboard: ASUS PRIME X299-A (2002 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 16GB, Disk: Samsung SSD 970 EVO 500GB, Graphics: Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz), Audio: Realtek ALC1220, Monitor: G237HL, Network: Intel I219-V
OS: Ubuntu 20.10, Kernel: 5.8.0-36-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 20.2.6 (LLVM 11.0.0), Vulkan: 1.2.131, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel, compiler, disk, processor, graphics, Python, and security notes: identical to run 1.
Testing initiated at 15 February 2021 20:01 by user phoronix.