nvidia-linux-gpu-performance-20212 AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS) and NVIDIA GeForce RTX 2080 Ti 11GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102075-PTS-NVIDIALI44&rdt&grr .
nvidia-linux-gpu-performance-20212 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution RTX 3080 RTX 3060 Ti NVIDIA GeForce RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS) AMD Starship/Matisse 32GB 2000GB Corsair Force MP600 + 2000GB NVIDIA GeForce RTX 3080 10GB (1710/9501MHz) NVIDIA Device 1aef ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.8.0-41-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.9 NVIDIA 460.39 4.6.0 OpenCL 1.2 CUDA 11.2.136 1.2.155 GCC 10.2.0 + Clang 11.0.1-1~oibaf~g ext4 3840x2160 NVIDIA GeForce RTX 3060 Ti 8GB (345/405MHz) NVIDIA Device 228b NVIDIA GeForce RTX 3060 Ti 8GB (360/405MHz) NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz) NVIDIA TU104 HD Audio NVIDIA GeForce RTX 2060 SUPER 8GB (390/405MHz) NVIDIA TU106 HD Audio NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz) NVIDIA TU104 HD Audio NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz) NVIDIA TU102 HD Audio OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - RTX 3080: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - RTX 3060 Ti: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa201009 - NVIDIA GeForce RTX 3060 Ti: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa201009 - RTX 2070 SUPER: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - RTX 2060 SUPER: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - RTX 2080 SUPER: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - RTX 2080 Ti: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 OpenCL Details - RTX 3080: GPU Compute Cores: 8704 - RTX 3060 Ti: GPU Compute Cores: 4864 - RTX 2070 SUPER: GPU Compute Cores: 2560 - RTX 2060 SUPER: GPU Compute Cores: 2176 - RTX 2080 SUPER: GPU Compute Cores: 3072 - RTX 2080 Ti: GPU Compute Cores: 4352 Python Details - RTX 3080, RTX 3060 Ti, RTX 2070 SUPER, RTX 2060 SUPER, RTX 2080 SUPER, RTX 2080 Ti: Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
nvidia-linux-gpu-performance-20212 lczero: OpenCL redshift: vkfft: ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet octanebench: Total Score fahbench: warsow: 2560 x 1440 warsow: 1920 x 1200 warsow: 1920 x 1080 warsow: 3840 x 2160 v-ray: NVIDIA CUDA GPU v-ray: NVIDIA RTX GPU indigobench: OpenCL GPU - Bedroom indigobench: OpenCL GPU - Supercar realsr-ncnn: 4x - Yes vkresample: 2x - Double rtiv: 3840 x 2160 - Lucy In One Weekend rtiv: 3840 x 2160 - Cornell Box + Lucy rtiv: 2560 x 1440 - Lucy In One Weekend rtiv: 1920 x 1200 - Lucy In One Weekend rtiv: 1920 x 1080 - Lucy In One Weekend rtiv: 2560 x 1440 - Cornell Box + Lucy rtiv: 1920 x 1200 - Cornell Box + Lucy rtiv: 1920 x 1080 - Cornell Box + Lucy rtiv: 3840 x 2160 - Planets In One Weekend rtiv: 3840 x 2160 - Ray Tracing In One Weekend rtiv: 2560 x 1440 - Planets In One Weekend rtiv: 3840 x 2160 - Cornell Box rtiv: 1920 x 1080 - Planets In One Weekend rtiv: 1920 x 1200 - Planets In One Weekend rtiv: 2560 x 1440 - Ray Tracing In One Weekend rtiv: 1920 x 1080 - Ray Tracing In One Weekend rtiv: 1920 x 1200 - Ray Tracing In One Weekend rtiv: 2560 x 1440 - Cornell Box rtiv: 1920 x 1200 - Cornell Box rtiv: 1920 x 1080 - Cornell Box paraview: Many Spheres - 3840 x 2160 paraview: Many Spheres - 3840 x 2160 paraview: Many Spheres - 1920 x 1080 paraview: Many Spheres - 1920 x 1080 paraview: Wavelet Contour - 3840 x 2160 paraview: Wavelet Contour - 3840 x 2160 realsr-ncnn: 4x - No vkresample: 2x - Single namd-cuda: ATPase Simulation - 327,506 Atoms paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Volume - 3840 x 2160 paraview: Wavelet Volume - 3840 x 2160 hashcat: TrueCrypt RIPEMD160 + XTS financebench: Monte-Carlo OpenCL paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 betsy: ETC2 RGB - Highest hashcat: SHA-512 hashcat: SHA1 hashcat: MD5 betsy: ETC1 - Highest waifu2x-ncnn: 2x - 3 - Yes darktable: Boat - OpenCL darktable: Masskrug - OpenCL hashcat: 7-Zip darktable: Server Room - OpenCL financebench: Black-Scholes OpenCL darktable: Server Rack - OpenCL RTX 3080 RTX 3060 Ti NVIDIA GeForce RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 31897 164 56583 17.85 14.48 21.38 24.97 11.12 14.43 58.83 12.82 1.85 5.41 3.99 4.44 4.21 4.48 12.20 562.818641 321.9986 984.8 985.7 963.6 830.2 1666 2183 17.650 46.405 34.114 148.005 16.4312 16.4413 33.0545 48.6939 57.8223 33.9422 48.4290 59.0920 17.5512 17.1359 36.5362 31.9668 63.7969 57.1482 35.6185 62.7312 55.5417 67.4759 97.1997 117.919 7680.539 76.61 7841.299 78.21 3478.835 333.82 6.250 11.232 0.13291 5373.698 515.65 5430.654 339.42 721867 386.046997 10606.699 662.92 4.416 2420400000 19098866667 56039666667 3.328 3.439 1.282 2.090 990567 0.706 7.307 0.095 241 35228 18.00 14.74 23.38 26.98 11.72 14.78 58.90 13.59 1.88 5.42 3.98 4.44 4.14 4.50 13.27 382.736651 237.0623 983.3 974.5 980.4 499.6 54.096 266.210 9.60781 9.39314 19.5287 29.0999 33.7196 19.5451 27.8946 34.0683 9.99652 9.76038 20.9757 18.4943 36.7365 32.5992 20.4810 35.8823 31.8033 38.5985 55.7661 68.3205 4451.593 44.40 4475.501 44.64 2251.061 216.01 8.761 17.658 0.13614 3169.404 304.13 3342.870 208.93 426700 461.676341 7578.143 473.63 6.794 1414633333 11068233333 32940366667 5.009 4.372 1.713 2.121 585967 0.777 13.138 0.101 1185 1550 11.452 33.992 348 32063 17.96 14.27 22.39 25.63 12.09 14.70 58.49 13.44 1.86 5.50 4.07 4.46 4.21 4.58 12.87 261.261302 232.7775 953.3 986.0 979.2 437.6 785 977 7.776 24.706 63.020 262.249 7.93153 6.77717 15.6550 22.7902 26.2753 14.1030 20.0635 24.5964 10.8262 10.6371 22.7453 16.7912 39.9825 35.4654 22.2612 39.1433 34.6587 35.2750 50.7312 61.7059 4407.226 43.96 4499.127 44.88 2299.463 220.65 9.901 19.469 0.14823 3691.347 354.21 3288.286 205.52 426200 816.009990 7715.261 482.20 7.330 1424200000 11121466667 35398166667 5.456 4.689 1.709 2.156 596367 0.776 12.804 0.097 379 31000 17.98 14.47 21.94 25.39 11.15 14.44 58.99 12.85 1.87 5.45 4.01 4.46 4.21 4.50 12.53 241.445325 208.7075 696.3 985.2 977.5 353.3 492 733 7.392 23.311 75.755 309.185 6.87206 5.97602 13.7568 20.1725 23.3070 12.4373 17.8450 21.8369 9.17886 8.99633 19.3343 14.1848 33.8886 30.1809 18.9526 33.2083 29.4861 29.8870 42.9372 52.1534 3748.650 37.39 3824.557 38.15 1859.637 178.45 11.501 22.234 0.15857 2862.737 274.70 2847.497 177.97 351200 752.413981 6827.820 426.74 8.394 1187933333 9341766667 29280266667 6.032 5.262 1.716 2.118 495567 0.723 16.486333 0.097 328 34719 18.65 14.67 23.07 26.63 12.26 15.15 61.11 13.57 1.87 5.57 4.09 4.57 4.25 4.61 13.31 262.843677 257.9896 983.2 985.7 979.7 494.2 742 905 7.845 25.326 54.895 217.311 8.82616 7.14784 17.1498 24.5890 28.4471 14.8708 21.1197 25.7529 13.1467 12.8719 27.5123 20.3361 48.3452 43.0463 26.9538 47.5230 42.0649 42.8800 61.6809 75.2446 5323.265 53.10 5446.212 54.32 2616.200 251.05 9.010 17.779 0.14894 4225.680 405.49 3853.392 240.83 520673 843.452657 8551.751 534.48 6.831 1708433333 13420400000 42393033333 5.296 4.347 1.591 2.169 692500 0.756 10.610 0.097 246 42152 18.00 14.41 22.08 24.98 11.20 14.51 59.02 13.29 1.84 5.45 4.03 4.47 4.19 4.50 12.97 358.549904 307.2028 985.7 985.9 979.3 553.5 926 1246 11.012 33.236 43.816 155.743 12.4673 10.5369 24.3719 35.0178 40.4178 21.8005 31.0242 37.7862 17.6370 17.3073 36.7571 27.5750 64.4970 57.4290 36.0794 63.3046 56.2395 57.9535 84.0012 102.060 7296.592 72.78 7469.292 74.50 3059.954 293.63 7.506 14.780 0.14084 4690.748 450.11 4935.743 308.48 657067 649.820658 9606.532 600.41 5.008 2239266667 17718433333 55298100000 3.822 3.776 1.485 2.084 894867 0.710 9.226 0.096 OpenBenchmarking.org
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: OpenCL RTX 3080 7K 14K 21K 28K 35K SE +/- 256.06, N = 3 31897 1. (CXX) g++ options: -flto -pthread
RedShift Demo OpenBenchmarking.org Seconds, Fewer Is Better RedShift Demo 3.0 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 80 160 240 320 400 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 164 241 348 379 328 246
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.1.1 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 12K 24K 36K 48K 60K SE +/- 23.59, N = 3 SE +/- 17.02, N = 3 SE +/- 134.01, N = 3 SE +/- 4.98, N = 3 SE +/- 9.33, N = 3 SE +/- 109.70, N = 3 56583 35228 32063 31000 34719 42152 1. (CXX) g++ options: -O3 -pthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: regnety_400m RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 5 10 15 20 25 SE +/- 0.02, N = 15 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 15 SE +/- 0.15, N = 3 SE +/- 0.07, N = 13 17.85 18.00 17.96 17.98 18.65 18.00 MIN: 17.56 / MAX: 27.39 MIN: 17.41 / MAX: 27.83 MIN: 17.61 / MAX: 19.15 MIN: 17.54 / MAX: 27.13 MIN: 17.68 / MAX: 41.61 MIN: 17.4 / MAX: 35.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: squeezenet_ssd RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.07, N = 14 SE +/- 0.21, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 15 SE +/- 0.04, N = 3 SE +/- 0.05, N = 13 14.48 14.74 14.27 14.47 14.67 14.41 MIN: 13.79 / MAX: 24.15 MIN: 14.07 / MAX: 16.1 MIN: 13.95 / MAX: 14.61 MIN: 13.92 / MAX: 16.43 MIN: 14.1 / MAX: 19.06 MIN: 13.6 / MAX: 27.43 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: yolov4-tiny RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 6 12 18 24 30 SE +/- 0.22, N = 15 SE +/- 0.09, N = 3 SE +/- 0.59, N = 3 SE +/- 0.27, N = 15 SE +/- 0.78, N = 3 SE +/- 0.31, N = 13 21.38 23.38 22.39 21.94 23.07 22.08 MIN: 20.7 / MAX: 38.06 MIN: 22.85 / MAX: 25.03 MIN: 20.98 / MAX: 23.83 MIN: 20.72 / MAX: 25.98 MIN: 20.78 / MAX: 30.08 MIN: 20.68 / MAX: 34.43 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet50 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 6 12 18 24 30 SE +/- 0.22, N = 15 SE +/- 0.14, N = 3 SE +/- 0.36, N = 3 SE +/- 0.27, N = 15 SE +/- 0.60, N = 3 SE +/- 0.25, N = 13 24.97 26.98 25.63 25.39 26.63 24.98 MIN: 24.2 / MAX: 30.79 MIN: 24.63 / MAX: 46.48 MIN: 24.31 / MAX: 35.26 MIN: 24.28 / MAX: 43.07 MIN: 24.4 / MAX: 46.26 MIN: 23.79 / MAX: 34.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: alexnet RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 3 6 9 12 15 SE +/- 0.10, N = 15 SE +/- 0.57, N = 3 SE +/- 0.61, N = 3 SE +/- 0.14, N = 15 SE +/- 0.55, N = 3 SE +/- 0.19, N = 13 11.12 11.72 12.09 11.15 12.26 11.20 MIN: 10.81 / MAX: 24.77 MIN: 10.8 / MAX: 44.26 MIN: 10.8 / MAX: 13.2 MIN: 10.78 / MAX: 13.1 MIN: 10.76 / MAX: 16.88 MIN: 10.71 / MAX: 13.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet18 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.02, N = 15 SE +/- 0.45, N = 3 SE +/- 0.36, N = 3 SE +/- 0.06, N = 15 SE +/- 0.15, N = 3 SE +/- 0.15, N = 13 14.43 14.78 14.70 14.44 15.15 14.51 MIN: 14.25 / MAX: 27.77 MIN: 14.12 / MAX: 17.41 MIN: 14.18 / MAX: 23.66 MIN: 14.17 / MAX: 32.3 MIN: 14.24 / MAX: 28.92 MIN: 14.09 / MAX: 16.91 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: vgg16 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 14 28 42 56 70 SE +/- 0.06, N = 15 SE +/- 0.18, N = 3 SE +/- 0.40, N = 3 SE +/- 0.04, N = 15 SE +/- 0.50, N = 3 SE +/- 0.05, N = 13 58.83 58.90 58.49 58.99 61.11 59.02 MIN: 57.03 / MAX: 117.59 MIN: 57.26 / MAX: 90.67 MIN: 56.35 / MAX: 60.41 MIN: 57.66 / MAX: 83.72 MIN: 57.69 / MAX: 129.41 MIN: 57.58 / MAX: 72.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: googlenet RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 3 6 9 12 15 SE +/- 0.02, N = 15 SE +/- 0.77, N = 3 SE +/- 0.68, N = 3 SE +/- 0.04, N = 15 SE +/- 0.21, N = 3 SE +/- 0.25, N = 13 12.82 13.59 13.44 12.85 13.57 13.29 MIN: 12.43 / MAX: 22.33 MIN: 12.49 / MAX: 16.77 MIN: 12.44 / MAX: 24.56 MIN: 12.46 / MAX: 21.99 MIN: 12.61 / MAX: 15.4 MIN: 12.5 / MAX: 27.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: blazeface RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.423 0.846 1.269 1.692 2.115 SE +/- 0.01, N = 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 13 1.85 1.88 1.86 1.87 1.87 1.84 MIN: 1.8 / MAX: 2.32 MIN: 1.8 / MAX: 7.07 MIN: 1.8 / MAX: 2.1 MIN: 1.79 / MAX: 2.29 MIN: 1.8 / MAX: 2.36 MIN: 1.8 / MAX: 2.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: efficientnet-b0 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1.2533 2.5066 3.7599 5.0132 6.2665 SE +/- 0.01, N = 15 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.02, N = 15 SE +/- 0.01, N = 3 SE +/- 0.04, N = 13 5.41 5.42 5.50 5.45 5.57 5.45 MIN: 5.3 / MAX: 14.74 MIN: 5.3 / MAX: 9.74 MIN: 5.31 / MAX: 9.38 MIN: 5.31 / MAX: 14.05 MIN: 5.34 / MAX: 7.4 MIN: 5.28 / MAX: 6.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mnasnet RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.9203 1.8406 2.7609 3.6812 4.6015 SE +/- 0.00, N = 15 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 15 SE +/- 0.01, N = 3 SE +/- 0.04, N = 13 3.99 3.98 4.07 4.01 4.09 4.03 MIN: 3.85 / MAX: 4.5 MIN: 3.84 / MAX: 5.19 MIN: 3.84 / MAX: 5.04 MIN: 3.85 / MAX: 6.06 MIN: 3.88 / MAX: 5.64 MIN: 3.82 / MAX: 5.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: shufflenet-v2 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1.0283 2.0566 3.0849 4.1132 5.1415 SE +/- 0.01, N = 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 15 SE +/- 0.00, N = 3 SE +/- 0.02, N = 12 4.44 4.44 4.46 4.46 4.57 4.47 MIN: 4.35 / MAX: 6.25 MIN: 4.34 / MAX: 5.43 MIN: 4.36 / MAX: 6.1 MIN: 4.36 / MAX: 19.04 MIN: 4.41 / MAX: 6 MIN: 4.32 / MAX: 5.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.9563 1.9126 2.8689 3.8252 4.7815 SE +/- 0.02, N = 15 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 13 4.21 4.14 4.21 4.21 4.25 4.19 MIN: 4.1 / MAX: 6.19 MIN: 4.06 / MAX: 5.35 MIN: 4.12 / MAX: 5.49 MIN: 4.09 / MAX: 5.44 MIN: 4.1 / MAX: 8.02 MIN: 4.08 / MAX: 5.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1.0373 2.0746 3.1119 4.1492 5.1865 SE +/- 0.00, N = 15 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.00, N = 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 13 4.48 4.50 4.58 4.50 4.61 4.50 MIN: 4.3 / MAX: 5.1 MIN: 4.28 / MAX: 5.73 MIN: 4.3 / MAX: 28.29 MIN: 4.3 / MAX: 5.72 MIN: 4.31 / MAX: 6.2 MIN: 4.29 / MAX: 6.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mobilenet RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 3 6 9 12 15 SE +/- 0.09, N = 15 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 SE +/- 0.10, N = 15 SE +/- 0.09, N = 3 SE +/- 0.09, N = 13 12.20 13.27 12.87 12.53 13.31 12.97 MIN: 11.81 / MAX: 13.49 MIN: 12.64 / MAX: 14.87 MIN: 12.38 / MAX: 13.65 MIN: 11.86 / MAX: 26 MIN: 12.41 / MAX: 30.9 MIN: 12.36 / MAX: 22.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OctaneBench Total Score OpenBenchmarking.org Score, More Is Better OctaneBench 2020.1 Total Score RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 120 240 360 480 600 562.82 382.74 261.26 241.45 262.84 358.55
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 70 140 210 280 350 SE +/- 0.54, N = 3 SE +/- 0.24, N = 3 SE +/- 0.78, N = 3 SE +/- 0.46, N = 3 SE +/- 0.56, N = 3 SE +/- 0.96, N = 3 322.00 237.06 232.78 208.71 257.99 307.20
Warsow Resolution: 2560 x 1440 OpenBenchmarking.org Frames Per Second, More Is Better Warsow 2.5 Beta Resolution: 2560 x 1440 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 200 400 600 800 1000 SE +/- 0.15, N = 3 SE +/- 0.68, N = 3 SE +/- 0.97, N = 3 SE +/- 2.47, N = 3 SE +/- 0.29, N = 3 SE +/- 0.33, N = 3 984.8 983.3 953.3 696.3 983.2 985.7
Warsow Resolution: 1920 x 1200 OpenBenchmarking.org Frames Per Second, More Is Better Warsow 2.5 Beta Resolution: 1920 x 1200 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 200 400 600 800 1000 SE +/- 0.37, N = 3 SE +/- 5.47, N = 3 SE +/- 0.46, N = 3 SE +/- 0.52, N = 3 SE +/- 0.55, N = 3 SE +/- 0.20, N = 3 985.7 974.5 986.0 985.2 985.7 985.9
Warsow Resolution: 1920 x 1080 OpenBenchmarking.org Frames Per Second, More Is Better Warsow 2.5 Beta Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 200 400 600 800 1000 SE +/- 13.18, N = 3 SE +/- 5.01, N = 3 SE +/- 6.42, N = 3 SE +/- 6.77, N = 3 SE +/- 6.93, N = 3 SE +/- 6.45, N = 3 963.6 980.4 979.2 977.5 979.7 979.3
Warsow Resolution: 3840 x 2160 OpenBenchmarking.org Frames Per Second, More Is Better Warsow 2.5 Beta Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 200 400 600 800 1000 SE +/- 2.63, N = 3 SE +/- 1.22, N = 3 SE +/- 0.30, N = 3 SE +/- 0.73, N = 3 SE +/- 0.52, N = 3 SE +/- 0.61, N = 3 830.2 499.6 437.6 353.3 494.2 553.5
Chaos Group V-RAY Mode: NVIDIA CUDA GPU OpenBenchmarking.org vpaths, More Is Better Chaos Group V-RAY 5 Mode: NVIDIA CUDA GPU RTX 3080 NVIDIA GeForce RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 400 800 1200 1600 2000 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 1666 1185 785 492 742 926
Chaos Group V-RAY Mode: NVIDIA RTX GPU OpenBenchmarking.org vrays, More Is Better Chaos Group V-RAY 5 Mode: NVIDIA RTX GPU RTX 3080 NVIDIA GeForce RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 500 1000 1500 2000 2500 SE +/- 5.69, N = 3 SE +/- 1.45, N = 3 SE +/- 1.00, N = 3 2183 1550 977 733 905 1246
IndigoBench Acceleration: OpenCL GPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom RTX 3080 NVIDIA GeForce RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.045, N = 3 SE +/- 0.011, N = 3 SE +/- 0.004, N = 3 SE +/- 0.011, N = 3 SE +/- 0.002, N = 3 SE +/- 0.010, N = 3 17.650 11.452 7.776 7.392 7.845 11.012
IndigoBench Acceleration: OpenCL GPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar RTX 3080 NVIDIA GeForce RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 46.41 33.99 24.71 23.31 25.33 33.24
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.27, N = 3 SE +/- 0.30, N = 3 SE +/- 0.16, N = 3 SE +/- 0.23, N = 3 34.11 54.10 63.02 75.76 54.90 43.82
VkResample Upscale: 2x - Precision: Double OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Double RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 70 140 210 280 350 SE +/- 0.11, N = 3 SE +/- 0.15, N = 3 SE +/- 0.63, N = 3 SE +/- 0.24, N = 3 SE +/- 0.53, N = 3 SE +/- 0.08, N = 3 148.01 266.21 262.25 309.19 217.31 155.74 1. (CXX) g++ options: -O3 -pthread
Ray Tracing In Vulkan Resolution: 3840 x 2160 - Scene: Lucy In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 3840 x 2160 - Scene: Lucy In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.01030, N = 3 SE +/- 0.00111, N = 3 SE +/- 0.00227, N = 3 SE +/- 0.00320, N = 3 SE +/- 0.00442, N = 3 SE +/- 0.00880, N = 3 16.43120 9.60781 7.93153 6.87206 8.82616 12.46730 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 3840 x 2160 - Scene: Cornell Box + Lucy OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 3840 x 2160 - Scene: Cornell Box + Lucy RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.00971, N = 3 SE +/- 0.00090, N = 3 SE +/- 0.01081, N = 3 SE +/- 0.00374, N = 3 SE +/- 0.00445, N = 3 SE +/- 0.00654, N = 3 16.44130 9.39314 6.77717 5.97602 7.14784 10.53690 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 2560 x 1440 - Scene: Lucy In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 2560 x 1440 - Scene: Lucy In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 33.05 19.53 15.66 13.76 17.15 24.37 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1200 - Scene: Lucy In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1200 - Scene: Lucy In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 11 22 33 44 55 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 48.69 29.10 22.79 20.17 24.59 35.02 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1080 - Scene: Lucy In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1080 - Scene: Lucy In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 13 26 39 52 65 SE +/- 0.17, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 57.82 33.72 26.28 23.31 28.45 40.42 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 2560 x 1440 - Scene: Cornell Box + Lucy OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 2560 x 1440 - Scene: Cornell Box + Lucy RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 33.94 19.55 14.10 12.44 14.87 21.80 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1200 - Scene: Cornell Box + Lucy OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1200 - Scene: Cornell Box + Lucy RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 11 22 33 44 55 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 48.43 27.89 20.06 17.85 21.12 31.02 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1080 - Scene: Cornell Box + Lucy OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1080 - Scene: Cornell Box + Lucy RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 13 26 39 52 65 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 59.09 34.07 24.60 21.84 25.75 37.79 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 3840 x 2160 - Scene: Planets In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 3840 x 2160 - Scene: Planets In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.00529, N = 3 SE +/- 0.00065, N = 3 SE +/- 0.00074, N = 3 SE +/- 0.00967, N = 3 SE +/- 0.00055, N = 3 SE +/- 0.00677, N = 3 17.55120 9.99652 10.82620 9.17886 13.14670 17.63700 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 3840 x 2160 - Scene: Ray Tracing In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 3840 x 2160 - Scene: Ray Tracing In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.00227, N = 3 SE +/- 0.00160, N = 3 SE +/- 0.02386, N = 3 SE +/- 0.00026, N = 3 SE +/- 0.00003, N = 3 SE +/- 0.00581, N = 3 17.13590 9.76038 10.63710 8.99633 12.87190 17.30730 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 2560 x 1440 - Scene: Planets In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 2560 x 1440 - Scene: Planets In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 36.54 20.98 22.75 19.33 27.51 36.76 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 3840 x 2160 - Scene: Cornell Box OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 3840 x 2160 - Scene: Cornell Box RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 31.97 18.49 16.79 14.18 20.34 27.58 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1080 - Scene: Planets In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1080 - Scene: Planets In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 14 28 42 56 70 SE +/- 0.10, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.22, N = 3 63.80 36.74 39.98 33.89 48.35 64.50 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1200 - Scene: Planets In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1200 - Scene: Planets In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 13 26 39 52 65 SE +/- 0.15, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 57.15 32.60 35.47 30.18 43.05 57.43 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 2560 x 1440 - Scene: Ray Tracing In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 2560 x 1440 - Scene: Ray Tracing In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 8 16 24 32 40 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 35.62 20.48 22.26 18.95 26.95 36.08 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1080 - Scene: Ray Tracing In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1080 - Scene: Ray Tracing In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 14 28 42 56 70 SE +/- 0.23, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.15, N = 3 62.73 35.88 39.14 33.21 47.52 63.30 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1200 - Scene: Ray Tracing In One Weekend OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1200 - Scene: Ray Tracing In One Weekend RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 13 26 39 52 65 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 55.54 31.80 34.66 29.49 42.06 56.24 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 2560 x 1440 - Scene: Cornell Box OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 2560 x 1440 - Scene: Cornell Box RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 15 30 45 60 75 SE +/- 0.18, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 67.48 38.60 35.28 29.89 42.88 57.95 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1200 - Scene: Cornell Box OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1200 - Scene: Cornell Box RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 97.20 55.77 50.73 42.94 61.68 84.00 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
Ray Tracing In Vulkan Resolution: 1920 x 1080 - Scene: Cornell Box OpenBenchmarking.org FPS, More Is Better Ray Tracing In Vulkan r6 Resolution: 1920 x 1080 - Scene: Cornell Box RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 30 60 90 120 150 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.13, N = 3 117.92 68.32 61.71 52.15 75.24 102.06 1. (CXX) g++ options: -O3 -lbacktrace -lstdc++fs -lm -ldl -lpthread
ParaView Test: Many Spheres - Resolution: 3840 x 2160 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1600 3200 4800 6400 8000 SE +/- 11.77, N = 3 SE +/- 5.00, N = 3 SE +/- 1.45, N = 3 SE +/- 2.20, N = 3 SE +/- 1.58, N = 3 SE +/- 7.19, N = 3 7680.54 4451.59 4407.23 3748.65 5323.27 7296.59
ParaView Test: Many Spheres - Resolution: 3840 x 2160 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 76.61 44.40 43.96 37.39 53.10 72.78
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 2K 4K 6K 8K 10K SE +/- 20.20, N = 3 SE +/- 3.55, N = 3 SE +/- 10.38, N = 3 SE +/- 9.03, N = 3 SE +/- 16.45, N = 3 SE +/- 17.40, N = 3 7841.30 4475.50 4499.13 3824.56 5446.21 7469.29
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 78.21 44.64 44.88 38.15 54.32 74.50
ParaView Test: Wavelet Contour - Resolution: 3840 x 2160 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 700 1400 2100 2800 3500 SE +/- 28.95, N = 15 SE +/- 25.69, N = 3 SE +/- 11.62, N = 3 SE +/- 12.72, N = 3 SE +/- 20.11, N = 10 SE +/- 12.62, N = 3 3478.84 2251.06 2299.46 1859.64 2616.20 3059.95
ParaView Test: Wavelet Contour - Resolution: 3840 x 2160 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 70 140 210 280 350 SE +/- 2.78, N = 15 SE +/- 2.47, N = 3 SE +/- 1.12, N = 3 SE +/- 1.22, N = 3 SE +/- 1.93, N = 10 SE +/- 1.21, N = 3 333.82 216.01 220.65 178.45 251.05 293.63
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 3 6 9 12 15 SE +/- 0.044, N = 3 SE +/- 0.053, N = 3 SE +/- 0.038, N = 3 SE +/- 0.118, N = 5 SE +/- 0.240, N = 14 SE +/- 0.058, N = 3 6.250 8.761 9.901 11.501 9.010 7.506
VkResample Upscale: 2x - Precision: Single OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Single RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 11.23 17.66 19.47 22.23 17.78 14.78 1. (CXX) g++ options: -O3 -pthread
NAMD CUDA ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.0357 0.0714 0.1071 0.1428 0.1785 SE +/- 0.00128, N = 3 SE +/- 0.00015, N = 3 SE +/- 0.00031, N = 3 SE +/- 0.00029, N = 3 SE +/- 0.00049, N = 3 SE +/- 0.00010, N = 3 0.13291 0.13614 0.14823 0.15857 0.14894 0.14084
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1200 2400 3600 4800 6000 SE +/- 37.60, N = 3 SE +/- 16.12, N = 3 SE +/- 10.22, N = 3 SE +/- 14.35, N = 3 SE +/- 36.34, N = 3 SE +/- 7.41, N = 3 5373.70 3169.40 3691.35 2862.74 4225.68 4690.75
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 110 220 330 440 550 SE +/- 3.61, N = 3 SE +/- 1.55, N = 3 SE +/- 0.98, N = 3 SE +/- 1.38, N = 3 SE +/- 3.49, N = 3 SE +/- 0.71, N = 3 515.65 304.13 354.21 274.70 405.49 450.11
ParaView Test: Wavelet Volume - Resolution: 3840 x 2160 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1200 2400 3600 4800 6000 SE +/- 21.74, N = 3 SE +/- 20.25, N = 3 SE +/- 7.19, N = 3 SE +/- 3.77, N = 3 SE +/- 14.19, N = 3 SE +/- 27.20, N = 3 5430.65 3342.87 3288.29 2847.50 3853.39 4935.74
ParaView Test: Wavelet Volume - Resolution: 3840 x 2160 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 3840 x 2160 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 70 140 210 280 350 SE +/- 1.36, N = 3 SE +/- 1.26, N = 3 SE +/- 0.45, N = 3 SE +/- 0.24, N = 3 SE +/- 0.89, N = 3 SE +/- 1.70, N = 3 339.42 208.93 205.52 177.97 240.83 308.48
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 150K 300K 450K 600K 750K SE +/- 560.75, N = 3 SE +/- 1006.64, N = 3 SE +/- 360.56, N = 3 SE +/- 173.21, N = 3 SE +/- 4356.82, N = 15 SE +/- 592.55, N = 3 721867 426700 426200 351200 520673 657067
FinanceBench Benchmark: Monte-Carlo OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Monte-Carlo OpenCL RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 200 400 600 800 1000 SE +/- 1.21, N = 3 SE +/- 1.66, N = 3 SE +/- 0.43, N = 3 SE +/- 0.57, N = 3 SE +/- 0.99, N = 3 SE +/- 0.61, N = 3 386.05 461.68 816.01 752.41 843.45 649.82 1. (CXX) g++ options: -O3 -march=native -fopenmp
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 2K 4K 6K 8K 10K SE +/- 34.46, N = 3 SE +/- 81.03, N = 5 SE +/- 22.07, N = 3 SE +/- 26.87, N = 3 SE +/- 14.53, N = 3 SE +/- 71.88, N = 3 10606.70 7578.14 7715.26 6827.82 8551.75 9606.53
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 140 280 420 560 700 SE +/- 2.15, N = 3 SE +/- 5.06, N = 5 SE +/- 1.38, N = 3 SE +/- 1.68, N = 3 SE +/- 0.91, N = 3 SE +/- 4.49, N = 3 662.92 473.63 482.20 426.74 534.48 600.41
Betsy GPU Compressor Codec: ETC2 RGB - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 2 4 6 8 10 SE +/- 0.011, N = 3 SE +/- 0.009, N = 3 SE +/- 0.011, N = 3 SE +/- 0.011, N = 3 SE +/- 0.006, N = 3 SE +/- 0.006, N = 3 4.416 6.794 7.330 8.394 6.831 5.008 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 500M 1000M 1500M 2000M 2500M SE +/- 2514623.90, N = 3 SE +/- 536449.23, N = 3 SE +/- 1276714.53, N = 3 SE +/- 633333.33, N = 3 SE +/- 1980179.57, N = 3 SE +/- 982061.32, N = 3 2420400000 1414633333 1424200000 1187933333 1708433333 2239266667
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4000M 8000M 12000M 16000M 20000M SE +/- 26464462.54, N = 3 SE +/- 7377744.31, N = 3 SE +/- 12109546.28, N = 3 SE +/- 5454763.46, N = 3 SE +/- 1527525.23, N = 3 SE +/- 29255901.59, N = 3 19098866667 11068233333 11121466667 9341766667 13420400000 17718433333
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 12000M 24000M 36000M 48000M 60000M SE +/- 64449006.54, N = 3 SE +/- 46443813.13, N = 3 SE +/- 25610045.77, N = 3 SE +/- 14634472.24, N = 3 SE +/- 44071015.92, N = 3 SE +/- 36241182.84, N = 3 56039666667 32940366667 35398166667 29280266667 42393033333 55298100000
Betsy GPU Compressor Codec: ETC1 - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 2 4 6 8 10 SE +/- 0.024, N = 3 SE +/- 0.041, N = 3 SE +/- 0.021, N = 3 SE +/- 0.016, N = 3 SE +/- 0.028, N = 3 SE +/- 0.036, N = 7 3.328 5.009 5.456 6.032 5.296 3.822 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 1.184 2.368 3.552 4.736 5.92 SE +/- 0.006, N = 3 SE +/- 0.004, N = 3 SE +/- 0.017, N = 3 SE +/- 0.066, N = 3 SE +/- 0.020, N = 3 SE +/- 0.012, N = 3 3.439 4.372 4.689 5.262 4.347 3.776
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.2.1 Test: Boat - Acceleration: OpenCL RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.3861 0.7722 1.1583 1.5444 1.9305 SE +/- 0.006, N = 3 SE +/- 0.004, N = 3 SE +/- 0.008, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 SE +/- 0.010, N = 3 1.282 1.713 1.709 1.716 1.591 1.485
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.2.1 Test: Masskrug - Acceleration: OpenCL RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.488 0.976 1.464 1.952 2.44 SE +/- 0.006, N = 3 SE +/- 0.004, N = 3 SE +/- 0.007, N = 3 SE +/- 0.003, N = 3 SE +/- 0.010, N = 3 SE +/- 0.006, N = 3 2.090 2.121 2.156 2.118 2.169 2.084
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: 7-Zip RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 200K 400K 600K 800K 1000K SE +/- 1449.52, N = 3 SE +/- 1047.75, N = 3 SE +/- 1146.49, N = 3 SE +/- 1083.72, N = 3 SE +/- 6986.65, N = 3 SE +/- 2234.08, N = 3 990567 585967 596367 495567 692500 894867
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.2.1 Test: Server Room - Acceleration: OpenCL RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.1748 0.3496 0.5244 0.6992 0.874 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 SE +/- 0.007, N = 3 0.706 0.777 0.776 0.723 0.756 0.710
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 4 8 12 16 20 SE +/- 0.014468, N = 3 SE +/- 0.179234, N = 3 SE +/- 0.083319, N = 3 SE +/- 0.196835, N = 3 SE +/- 0.084311, N = 9 SE +/- 0.016231, N = 3 7.307000 13.138000 12.804000 16.486333 10.610000 9.226000 1. (CXX) g++ options: -O3 -march=native -fopenmp
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.2.1 Test: Server Rack - Acceleration: OpenCL RTX 3080 RTX 3060 Ti RTX 2070 SUPER RTX 2060 SUPER RTX 2080 SUPER RTX 2080 Ti 0.0227 0.0454 0.0681 0.0908 0.1135 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 0.095 0.101 0.097 0.097 0.097 0.096
Phoronix Test Suite v10.8.5