7980XE Feb 2921 Intel Core i9-7980XE testing with a ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102162-HA-7980XEFEB73&rdt .
7980XE Feb 2921 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 5 Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads) ASUS PRIME X299-A (2002 BIOS) Intel Sky Lake-E DMI3 Registers 16GB Samsung SSD 970 EVO 500GB Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz) Realtek ALC1220 G237HL Intel I219-V Ubuntu 20.10 5.8.0-36-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 4.6 Mesa 20.2.6 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08 Graphics Details - GLAMOR Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
7980XE Feb 2921 ior: 2MB - Default Test Directory ior: 4MB - Default Test Directory ior: 8MB - Default Test Directory ior: 16MB - Default Test Directory ior: 32MB - Default Test Directory ior: 64MB - Default Test Directory paraview: Many Spheres - 1920 x 1080 paraview: Many Spheres - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 quantlib: etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering npb: EP.C npb: EP.D npb: LU.C cloverleaf: Lagrangian-Eulerian Hydrodynamics lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression pennant: sedovbig pennant: leblancbig qmcpack: simple-H2O lammps: 20k Atoms lammps: Rhodopsin Protein lulesh: jpegxl: PNG - 5 jpegxl: PNG - 7 jpegxl: PNG - 8 jpegxl: JPEG - 5 jpegxl: JPEG - 7 jpegxl: JPEG - 8 jpegxl-decode: 1 jpegxl-decode: All rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 build-godot: Time To Compile gcrypt: ngspice: C2670 ngspice: C7552 webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression synthmark: VoiceMark_100 financebench: Repo OpenMP financebench: Bonds OpenMP askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding askap: Hogbom Clean OpenMP gromacs: water_GMX50_bare redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU gnupg: 2.7GB Sample File Encryption v-ray: CPU 1 2 3 4 5 592.90 887.94 525.44 481.36 493.82 492.85 4.80 481.692 72.31 1156.973 82.97 864.682 2222.4 1365.386 315.616 183.995 303.570 2081.02 2140.50 47323.88 85.54 42 115 495 1767 85 1792 111 483 469 613 189 711 237 53.14447 34.29865 38.340 11.176 9.483 6695.0413 52.50 7.86 0.71 48.64 48.50 23.19 31.81 175.22 0.358 0.975 1.269 2.677 98.889 212.796 155.596 138.661 3.369 155.740 282.047 9.195 593.066 553.640 39726.164063 55767.164062 1815.85 2618.53 2458.07 3973.97 376.885 1.699 2698309.25 2153944.42 1712504.00 2539377.42 1912809.71 7.108 37.184 4.201 2.676 44.157 361.235 316.358 553 630 143 9436 7531 67.390 17828 396.56 466.30 530.83 484.06 474.62 480.71 4.81 481.834 72.26 1156.216 83.00 864.937 2222.6 1387.302 313.703 182.977 303.514 2001.11 2241.41 47734.40 84.88 42 115 495 1767 86 1801 109 484 470 612 189 711 238 52.66769 33.68917 39.048 11.210 9.084 6707.8596 53.03 7.93 0.70 49.08 49.22 23.25 32.39 178.23 0.359 0.976 1.272 2.675 98.932 212.057 151.978 137.582 3.363 156.550 280.636 9.110 590.293 555.295 39461.821614 55622.968750 1820.90 2622.75 2420.91 3973.97 377.362 1.698 1769937.88 2168410.90 1642807.13 2415143.75 1952727.25 7.232 37.345 4.634 2.618 48.330 360.923 319.049 555 649 145 9471 7249 67.175 17872 429.36 467.76 354.01 442.47 470.37 489.25 4.80 481.420 72.23 1155.674 82.98 864.804 2207.1 1385.842 314.662 183.424 303.105 2033.28 2202.89 47778.76 84.21 42 114 496 1772 85 1795 110 484 472 612 189 709 238 52.70973 33.55896 38.489 11.195 9.408 6678.4880 53.44 7.93 0.7 48.89 49.41 23.43 32.26 177.88 0.359 0.975 1.270 2.694 98.872 211.787 153.061 139.573 3.325 157.528 282.575 9.075 596.408 553.553 39681.566406 55666.716146 1813.56 2633.11 2428.72 3973.97 377.835 1.698 1797918.46 2159575.21 1695317.29 2431811.58 1889099.97 7.355 37.283 4.348 2.686 45.970 361.454 317.136 557 658 144 9541 7080 67.124 17842 419.08 493.98 527.45 500.39 484.50 490.66 4.67 468.456 72.32 1157.110 83.01 865.027 2215.6 1376.344 314.271 182.722 290.726 1997.31 2112.54 47482.70 84.27 42 115 496 1771 86 1803 109 483 469 612 188 708 237 52.76675 33.40175 39.795 11.205 9.099 6697.3184 53.11 7.92 0.70 48.85 48.85 23.37 32.17 178.61 0.360 0.976 1.269 2.679 98.964 211.895 154.214 139.032 3.347 154.880 279.573 9.309 593.530 548.556 39350.223958 55656.115885 1821.82 2633.32 2420.64 3973.97 378.788 1.700 1761464.25 2158064.17 1699000.00 2442355.92 1906086.42 7.281 37.607 4.513 2.662 48.375 361.425 318.208 555 659 144 9533 7457 67.151 17742 437.60 478.01 510.05 506.02 516.94 493.41 4.80 481.347 72.31 1156.869 82.97 864.654 2216.0 1377.894 313.693 183.118 303.890 1999.52 2214.08 47360.36 85.20 42 114 494 1770 85 1800 110 483 470 611 189 709 237 52.84683 33.93533 38.621 11.216 8.901 6708.2322 53.69 7.91 0.7 49.13 49.18 23.37 32.30 178.47 0.359 0.977 1.274 2.676 98.873 212.963 153.099 139.017 3.350 156.753 283.331 9.216 590.211 552.792 39275.548177 55876.180990 1821.36 2609.75 2442.85 3973.97 378.802 1.703 1770711.21 2179924.5 1682254.45 2443276.75 1923099.42 7.272 37.338 4.546 2.645 48.640 361.488 319.314 555 652 145 9519 6939 67.102 17849 OpenBenchmarking.org
IOR Block Size: 2MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory 1 2 3 4 5 130 260 390 520 650 SE +/- 5.79, N = 3 SE +/- 4.65, N = 3 SE +/- 7.93, N = 15 SE +/- 10.65, N = 12 SE +/- 7.25, N = 15 592.90 396.56 429.36 419.08 437.60 MIN: 431.01 / MAX: 1028.65 MIN: 214.64 / MAX: 1028.95 MIN: 220.22 / MAX: 1149.54 MIN: 152.75 / MAX: 1057.83 MIN: 162.6 / MAX: 1036.24 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 4MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory 1 2 3 4 5 200 400 600 800 1000 SE +/- 1.87, N = 3 SE +/- 5.42, N = 15 SE +/- 45.69, N = 12 SE +/- 6.63, N = 15 SE +/- 5.81, N = 6 887.94 466.30 467.76 493.98 478.01 MIN: 629.82 / MAX: 1334.23 MIN: 212.95 / MAX: 1329.58 MIN: 188.74 / MAX: 1435.44 MIN: 240.44 / MAX: 1344.87 MIN: 236.85 / MAX: 1266.85 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 8MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 25.39, N = 15 SE +/- 5.42, N = 15 SE +/- 4.96, N = 15 SE +/- 4.21, N = 14 SE +/- 6.12, N = 5 525.44 530.83 354.01 527.45 510.05 MIN: 290.23 / MAX: 1447.59 MIN: 248.03 / MAX: 1386.96 MIN: 189.98 / MAX: 1385.03 MIN: 251.24 / MAX: 1378.27 MIN: 222.93 / MAX: 1266.18 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 16MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 6.27, N = 3 SE +/- 5.80, N = 15 SE +/- 6.54, N = 3 SE +/- 1.65, N = 3 SE +/- 7.12, N = 3 481.36 484.06 442.47 500.39 506.02 MIN: 316.02 / MAX: 1504.02 MIN: 224.25 / MAX: 1505.62 MIN: 217.24 / MAX: 1379.75 MIN: 315.3 / MAX: 1210.23 MIN: 308.33 / MAX: 1247.46 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 32MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 3.89, N = 3 SE +/- 4.72, N = 8 SE +/- 5.73, N = 3 SE +/- 4.09, N = 12 SE +/- 4.67, N = 3 493.82 474.62 470.37 484.50 516.94 MIN: 245.23 / MAX: 1245.29 MIN: 202.58 / MAX: 1355.46 MIN: 197.76 / MAX: 1370.54 MIN: 196.66 / MAX: 1538.21 MIN: 413.96 / MAX: 1183.64 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 64MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 64MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 3.92, N = 3 SE +/- 2.46, N = 3 SE +/- 4.79, N = 3 SE +/- 4.68, N = 3 SE +/- 6.10, N = 3 492.85 480.71 489.25 490.66 493.41 MIN: 402.07 / MAX: 1345.65 MIN: 301.17 / MAX: 1079.47 MIN: 374.86 / MAX: 1033.08 MIN: 348.43 / MAX: 1036.5 MIN: 246.91 / MAX: 1088.06 1. (CC) gcc options: -O2 -lm -pthread -lmpi
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 5 1.0823 2.1646 3.2469 4.3292 5.4115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.13, N = 11 SE +/- 0.00, N = 3 4.80 4.81 4.80 4.67 4.80
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 5 100 200 300 400 500 SE +/- 0.37, N = 3 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 SE +/- 12.92, N = 11 SE +/- 0.34, N = 3 481.69 481.83 481.42 468.46 481.35
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 5 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 72.31 72.26 72.23 72.32 72.31
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 5 200 400 600 800 1000 SE +/- 0.52, N = 3 SE +/- 0.36, N = 3 SE +/- 0.26, N = 3 SE +/- 0.60, N = 3 SE +/- 0.98, N = 3 1156.97 1156.22 1155.67 1157.11 1156.87
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 5 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 82.97 83.00 82.98 83.01 82.97
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 5 200 400 600 800 1000 SE +/- 0.51, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 864.68 864.94 864.80 865.03 864.65
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 17.98, N = 13 SE +/- 19.83, N = 12 SE +/- 27.77, N = 12 SE +/- 21.07, N = 12 SE +/- 28.27, N = 12 2222.4 2222.6 2207.1 2215.6 2216.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 5 300 600 900 1200 1500 SE +/- 13.09, N = 3 SE +/- 3.63, N = 3 SE +/- 4.38, N = 3 SE +/- 3.03, N = 3 SE +/- 2.52, N = 3 1365.39 1387.30 1385.84 1376.34 1377.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 5 70 140 210 280 350 SE +/- 0.30, N = 3 SE +/- 0.85, N = 3 SE +/- 1.01, N = 3 SE +/- 0.49, N = 3 SE +/- 0.49, N = 3 315.62 313.70 314.66 314.27 313.69 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 5 40 80 120 160 200 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 SE +/- 0.16, N = 3 SE +/- 0.12, N = 3 SE +/- 0.23, N = 3 184.00 182.98 183.42 182.72 183.12 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 5 70 140 210 280 350 SE +/- 0.86, N = 3 SE +/- 0.32, N = 3 SE +/- 0.60, N = 3 SE +/- 7.07, N = 15 SE +/- 0.09, N = 3 303.57 303.51 303.11 290.73 303.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 23.92, N = 15 SE +/- 51.51, N = 15 SE +/- 51.69, N = 12 SE +/- 32.10, N = 15 SE +/- 46.79, N = 15 2081.02 2001.11 2033.28 1997.31 1999.52 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 10.76, N = 3 SE +/- 3.15, N = 3 SE +/- 34.24, N = 3 SE +/- 35.18, N = 3 SE +/- 28.36, N = 4 2140.50 2241.41 2202.89 2112.54 2214.08 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 4 5 10K 20K 30K 40K 50K SE +/- 154.44, N = 3 SE +/- 7.96, N = 3 SE +/- 139.20, N = 3 SE +/- 376.58, N = 3 SE +/- 44.69, N = 3 47323.88 47734.40 47778.76 47482.70 47360.36 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 4 5 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.12, N = 3 SE +/- 0.08, N = 3 85.54 84.88 84.21 84.27 85.20 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 4 5 10 20 30 40 50 SE +/- 0.33, N = 3 42 42 42 42 42 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 4 5 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 115 115 114 115 114 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 4 5 110 220 330 440 550 SE +/- 2.19, N = 3 SE +/- 2.00, N = 3 SE +/- 2.67, N = 3 SE +/- 3.67, N = 3 SE +/- 3.67, N = 3 495 495 496 496 494 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 6.00, N = 3 SE +/- 3.06, N = 3 SE +/- 1.76, N = 3 SE +/- 1.73, N = 3 1767 1767 1772 1771 1770 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 4 5 20 40 60 80 100 85 86 85 86 85 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 3.33, N = 3 SE +/- 1.86, N = 3 SE +/- 3.38, N = 3 SE +/- 3.71, N = 3 1792 1801 1795 1803 1800 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 4 5 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 111 109 110 109 110 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 4 5 100 200 300 400 500 SE +/- 0.67, N = 3 SE +/- 1.53, N = 3 SE +/- 0.88, N = 3 SE +/- 1.15, N = 3 SE +/- 1.86, N = 3 483 484 484 483 483 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 4 5 100 200 300 400 500 SE +/- 0.88, N = 3 SE +/- 1.67, N = 3 SE +/- 1.20, N = 3 SE +/- 1.33, N = 3 SE +/- 0.88, N = 3 469 470 472 469 470 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 4 5 130 260 390 520 650 SE +/- 1.20, N = 3 SE +/- 2.73, N = 3 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 613 612 612 612 611 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 4 5 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 189 189 189 188 189 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 4 5 150 300 450 600 750 SE +/- 0.88, N = 3 711 711 709 708 709 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 4 5 50 100 150 200 250 SE +/- 0.58, N = 3 237 238 238 237 237 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 4 5 12 24 36 48 60 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 SE +/- 0.36, N = 3 SE +/- 0.21, N = 3 SE +/- 0.26, N = 3 53.14 52.67 52.71 52.77 52.85 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 1 2 3 4 5 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.13, N = 3 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 34.30 33.69 33.56 33.40 33.94 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 4 5 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.45, N = 3 SE +/- 0.23, N = 3 SE +/- 0.68, N = 3 SE +/- 0.24, N = 3 38.34 39.05 38.49 39.80 38.62 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 1 2 3 4 5 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 11.18 11.21 11.20 11.21 11.22 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 4 5 3 6 9 12 15 SE +/- 0.370, N = 12 SE +/- 0.433, N = 15 SE +/- 0.361, N = 12 SE +/- 0.334, N = 15 SE +/- 0.306, N = 15 9.483 9.084 9.408 9.099 8.901 1. (CXX) g++ options: -O3 -pthread -lm
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 1 2 3 4 5 1400 2800 4200 5600 7000 SE +/- 21.98, N = 3 SE +/- 16.05, N = 3 SE +/- 43.78, N = 3 SE +/- 39.16, N = 3 SE +/- 30.71, N = 3 6695.04 6707.86 6678.49 6697.32 6708.23 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 5 1 2 3 4 5 12 24 36 48 60 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 SE +/- 0.33, N = 3 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 52.50 53.03 53.44 53.11 53.69 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 7 1 2 3 4 5 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 7.86 7.93 7.93 7.92 7.91 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 8 1 2 3 4 5 0.1598 0.3196 0.4794 0.6392 0.799 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.71 0.70 0.70 0.70 0.70 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 5 1 2 3 4 5 11 22 33 44 55 SE +/- 0.07, N = 3 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 48.64 49.08 48.89 48.85 49.13 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 7 1 2 3 4 5 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 48.50 49.22 49.41 48.85 49.18 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 8 1 2 3 4 5 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 23.19 23.25 23.43 23.37 23.37 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: 1 1 2 3 4 5 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 31.81 32.39 32.26 32.17 32.30
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: All 1 2 3 4 5 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.45, N = 3 SE +/- 0.19, N = 3 175.22 178.23 177.88 178.61 178.47
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 4 5 0.081 0.162 0.243 0.324 0.405 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.358 0.359 0.359 0.360 0.359
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 4 5 0.2198 0.4396 0.6594 0.8792 1.099 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.975 0.976 0.975 0.976 0.977
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 4 5 0.2867 0.5734 0.8601 1.1468 1.4335 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.269 1.272 1.270 1.269 1.274
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 4 5 0.6062 1.2124 1.8186 2.4248 3.031 SE +/- 0.006, N = 3 SE +/- 0.012, N = 3 SE +/- 0.005, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 2.677 2.675 2.694 2.679 2.676
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 3 4 5 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 98.89 98.93 98.87 98.96 98.87
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 4 5 50 100 150 200 250 SE +/- 0.63, N = 3 SE +/- 0.34, N = 3 SE +/- 0.06, N = 3 SE +/- 0.49, N = 3 SE +/- 0.29, N = 3 212.80 212.06 211.79 211.90 212.96 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 1 2 3 4 5 30 60 90 120 150 SE +/- 2.33, N = 3 SE +/- 1.75, N = 3 SE +/- 1.44, N = 3 SE +/- 2.23, N = 3 SE +/- 1.61, N = 3 155.60 151.98 153.06 154.21 153.10 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 1 2 3 4 5 30 60 90 120 150 SE +/- 0.78, N = 3 SE +/- 0.10, N = 3 SE +/- 1.23, N = 3 SE +/- 1.21, N = 3 SE +/- 2.04, N = 3 138.66 137.58 139.57 139.03 139.02 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 4 5 0.758 1.516 2.274 3.032 3.79 SE +/- 0.034, N = 3 SE +/- 0.012, N = 3 SE +/- 0.027, N = 3 SE +/- 0.049, N = 3 SE +/- 0.057, N = 3 3.369 3.363 3.325 3.347 3.350 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 4 5 30 60 90 120 150 SE +/- 1.77, N = 3 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 SE +/- 0.66, N = 3 SE +/- 0.73, N = 3 155.74 156.55 157.53 154.88 156.75 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 4 5 60 120 180 240 300 SE +/- 1.07, N = 3 SE +/- 0.54, N = 3 SE +/- 2.02, N = 3 SE +/- 1.22, N = 3 SE +/- 2.02, N = 3 282.05 280.64 282.58 279.57 283.33 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 4 5 3 6 9 12 15 SE +/- 0.105, N = 3 SE +/- 0.020, N = 3 SE +/- 0.024, N = 3 SE +/- 0.141, N = 3 SE +/- 0.149, N = 3 9.195 9.110 9.075 9.309 9.216 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 4 5 130 260 390 520 650 SE +/- 1.66, N = 3 SE +/- 0.45, N = 3 SE +/- 0.58, N = 3 SE +/- 1.92, N = 3 SE +/- 0.58, N = 3 593.07 590.29 596.41 593.53 590.21 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 4 5 120 240 360 480 600 SE +/- 2.84, N = 3 SE +/- 1.36, N = 3 SE +/- 3.76, N = 3 SE +/- 2.23, N = 3 SE +/- 1.67, N = 3 553.64 555.30 553.55 548.56 552.79 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1 2 3 4 5 9K 18K 27K 36K 45K SE +/- 210.55, N = 3 SE +/- 76.80, N = 3 SE +/- 151.55, N = 3 SE +/- 69.89, N = 3 SE +/- 20.70, N = 3 39726.16 39461.82 39681.57 39350.22 39275.55 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 1 2 3 4 5 12K 24K 36K 48K 60K SE +/- 86.69, N = 3 SE +/- 37.92, N = 3 SE +/- 83.09, N = 3 SE +/- 86.41, N = 3 SE +/- 53.09, N = 3 55767.16 55622.97 55666.72 55656.12 55876.18 1. (CXX) g++ options: -O3 -march=native -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 2.23, N = 3 SE +/- 0.35, N = 3 SE +/- 2.62, N = 3 SE +/- 0.61, N = 3 SE +/- 0.31, N = 3 1815.85 1820.90 1813.56 1821.82 1821.36 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 4 5 600 1200 1800 2400 3000 SE +/- 10.95, N = 3 SE +/- 4.46, N = 3 SE +/- 7.85, N = 3 SE +/- 4.51, N = 3 SE +/- 11.98, N = 3 2618.53 2622.75 2633.11 2633.32 2609.75 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 19.92, N = 3 SE +/- 22.21, N = 3 SE +/- 32.33, N = 3 SE +/- 12.71, N = 3 SE +/- 12.94, N = 3 2458.07 2420.91 2428.72 2420.64 2442.85 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 4 5 900 1800 2700 3600 4500 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3973.97 3973.97 3973.97 3973.97 3973.97 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 4 5 80 160 240 320 400 SE +/- 0.47, N = 3 SE +/- 0.82, N = 3 SE +/- 0.48, N = 3 SE +/- 0.00, N = 3 SE +/- 1.66, N = 3 376.89 377.36 377.84 378.79 378.80 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
GROMACS Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021 Input: water_GMX50_bare 1 2 3 4 5 0.3832 0.7664 1.1496 1.5328 1.916 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 1.699 1.698 1.698 1.700 1.703 1. (CXX) g++ options: -O3 -pthread
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 4 5 600K 1200K 1800K 2400K 3000K SE +/- 29192.48, N = 3 SE +/- 2566.99, N = 3 SE +/- 19169.69, N = 3 SE +/- 15108.16, N = 3 SE +/- 18070.29, N = 3 2698309.25 1769937.88 1797918.46 1761464.25 1770711.21 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 4 5 500K 1000K 1500K 2000K 2500K SE +/- 18441.13, N = 3 SE +/- 27569.90, N = 5 SE +/- 34602.53, N = 3 SE +/- 20189.91, N = 3 SE +/- 11360.66, N = 3 2153944.42 2168410.90 2159575.21 2158064.17 2179924.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 4 5 400K 800K 1200K 1600K 2000K SE +/- 10755.06, N = 3 SE +/- 13174.46, N = 3 SE +/- 9034.55, N = 3 SE +/- 10458.55, N = 3 SE +/- 10827.27, N = 3 1712504.00 1642807.13 1695317.29 1699000.00 1682254.45 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 4 5 500K 1000K 1500K 2000K 2500K SE +/- 22779.82, N = 3 SE +/- 31640.05, N = 3 SE +/- 17482.31, N = 3 SE +/- 18752.77, N = 3 SE +/- 23178.69, N = 3 2539377.42 2415143.75 2431811.58 2442355.92 2443276.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 4 5 400K 800K 1200K 1600K 2000K SE +/- 15512.42, N = 3 SE +/- 23318.24, N = 3 SE +/- 24702.46, N = 4 SE +/- 30239.19, N = 3 SE +/- 21792.45, N = 3 1912809.71 1952727.25 1889099.97 1906086.42 1923099.42 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 4 5 2 4 6 8 10 SE +/- 0.091, N = 5 SE +/- 0.085, N = 3 SE +/- 0.110, N = 3 SE +/- 0.070, N = 3 SE +/- 0.007, N = 3 7.108 7.232 7.355 7.281 7.272 MIN: 6.52 / MAX: 7.68 MIN: 6.81 / MAX: 7.85 MIN: 6.8 / MAX: 7.87 MIN: 6.97 / MAX: 7.74 MIN: 7 / MAX: 7.67 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 4 5 9 18 27 36 45 SE +/- 0.32, N = 5 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 37.18 37.35 37.28 37.61 37.34 MIN: 35.67 / MAX: 38.06 MIN: 36.97 / MAX: 37.99 MIN: 36.85 / MAX: 38.11 MIN: 37.16 / MAX: 38.09 MIN: 36.95 / MAX: 37.85 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 4 5 1.0427 2.0854 3.1281 4.1708 5.2135 SE +/- 0.129, N = 5 SE +/- 0.086, N = 3 SE +/- 0.205, N = 3 SE +/- 0.041, N = 3 SE +/- 0.008, N = 3 4.201 4.634 4.348 4.513 4.546 MIN: 3.68 / MAX: 4.8 MIN: 4.2 / MAX: 4.99 MIN: 3.62 / MAX: 4.88 MIN: 4.22 / MAX: 4.96 MIN: 4.28 / MAX: 4.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 4 5 0.6044 1.2088 1.8132 2.4176 3.022 SE +/- 0.029, N = 5 SE +/- 0.019, N = 3 SE +/- 0.045, N = 3 SE +/- 0.010, N = 3 SE +/- 0.011, N = 3 2.676 2.618 2.686 2.662 2.645 MIN: 2.44 / MAX: 3.19 MIN: 2.5 / MAX: 2.92 MIN: 2.42 / MAX: 3.11 MIN: 2.48 / MAX: 2.98 MIN: 2.44 / MAX: 3.04 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 4 5 11 22 33 44 55 SE +/- 1.80, N = 5 SE +/- 0.10, N = 3 SE +/- 2.44, N = 3 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 44.16 48.33 45.97 48.38 48.64 MIN: 40.29 / MAX: 51.28 MIN: 47.9 / MAX: 50.26 MIN: 40.58 / MAX: 48.74 MIN: 47.91 / MAX: 48.97 MIN: 48.25 / MAX: 49.28 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 4 5 80 160 240 320 400 SE +/- 0.37, N = 3 SE +/- 0.27, N = 3 SE +/- 0.39, N = 3 SE +/- 0.38, N = 3 SE +/- 0.31, N = 3 361.24 360.92 361.45 361.43 361.49 MIN: 357.28 / MAX: 391.58 MIN: 356.72 / MAX: 372.66 MIN: 356.32 / MAX: 386.75 MIN: 356.24 / MAX: 380.38 MIN: 357.11 / MAX: 384.74 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 4 5 70 140 210 280 350 SE +/- 0.46, N = 3 SE +/- 0.57, N = 3 SE +/- 0.62, N = 3 SE +/- 0.59, N = 3 SE +/- 0.43, N = 3 316.36 319.05 317.14 318.21 319.31 MIN: 313.3 / MAX: 320.26 MIN: 313 / MAX: 340.64 MIN: 314.18 / MAX: 323.54 MIN: 316.15 / MAX: 320.81 MIN: 316.94 / MAX: 338.61 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 4 5 120 240 360 480 600 SE +/- 1.09, N = 3 SE +/- 0.29, N = 3 SE +/- 2.25, N = 3 SE +/- 1.64, N = 3 SE +/- 0.88, N = 3 553 555 557 555 555 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 4 5 140 280 420 560 700 SE +/- 5.46, N = 3 SE +/- 11.30, N = 12 SE +/- 13.29, N = 12 SE +/- 13.34, N = 12 SE +/- 11.09, N = 12 630 649 658 659 652 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 4 5 30 60 90 120 150 SE +/- 0.44, N = 3 SE +/- 0.33, N = 3 SE +/- 0.87, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 143 145 144 144 145 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 4 5 2K 4K 6K 8K 10K SE +/- 35.99, N = 3 SE +/- 30.21, N = 3 SE +/- 36.96, N = 3 SE +/- 46.43, N = 3 SE +/- 54.82, N = 3 9436 9471 9541 9533 9519 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 4 5 1600 3200 4800 6400 8000 SE +/- 57.46, N = 3 SE +/- 133.82, N = 12 SE +/- 173.63, N = 9 SE +/- 109.17, N = 3 SE +/- 212.04, N = 12 7531 7249 7080 7457 6939 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 4 5 15 30 45 60 75 SE +/- 0.36, N = 3 SE +/- 0.12, N = 3 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 67.39 67.18 67.12 67.15 67.10 1. (CC) gcc options: -O2
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU 1 2 3 4 5 4K 8K 12K 16K 20K SE +/- 47.06, N = 3 SE +/- 70.27, N = 3 SE +/- 6.44, N = 3 SE +/- 54.08, N = 3 SE +/- 84.18, N = 3 17828 17872 17842 17742 17849
Phoronix Test Suite v10.8.5