NVIDIA CUDA OpenCL Compute Tests Pre-Ampere

NVIDIA GeForce compute benchmarks of GTX 1000 and RTX 2000 series. Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2008319-FI-NVIDIACOM78&grr&sor.

NVIDIA CUDA OpenCL Compute Tests Pre-AmpereProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTXAMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)NVIDIA GP104 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-42-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)ASUS NVIDIA GeForce GTX 1650 SUPER 4GB (1530/6000MHz)NVIDIA TU116 HD AudioASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz)NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)NVIDIA TU106 HD AudioNVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz)NVIDIA TU104 HD AudioZotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TU102 HD AudioNVIDIA TITAN RTX 24GB (1350/7000MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013OpenCL Details- GTX 1070: GPU Compute Cores: 1920- GTX 1080: GPU Compute Cores: 2560- GTX 1650 SUPER: GPU Compute Cores: 1280- GTX 1660: GPU Compute Cores: 1408- GTX 1660 SUPER: GPU Compute Cores: 1408- RTX 2060: GPU Compute Cores: 1920- RTX 2060 SUPER: GPU Compute Cores: 2176- RTX 2070: GPU Compute Cores: 2304- RTX 2070 SUPER: GPU Compute Cores: 2560- RTX 2080: GPU Compute Cores: 2944- RTX 2080 SUPER: GPU Compute Cores: 3072- RTX 2080 Ti: GPU Compute Cores: 4352- TITAN RTX: GPU Compute Cores: 4608Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affectedPython Details- TITAN RTX: Python 3.8.2

NVIDIA CUDA OpenCL Compute Tests Pre-Ampereblender: Barbershop - NVIDIA OptiXblender: Barbershop - CUDAblender: Pabellon Barcelona - CUDAblender: Classroom - CUDAblender: Pabellon Barcelona - NVIDIA OptiXoctanebench: Total Scoreblender: Fishy Cat - CUDAblender: Classroom - NVIDIA OptiXluxcorerender-cl: Foodluxcorerender-cl: LuxCore Benchmarkluxcorerender-cl: DLSCblender: BMW27 - CUDAblender: Fishy Cat - NVIDIA OptiXblender: BMW27 - NVIDIA OptiXdaphne: OpenCL - NDT Mappingclpeak: Double-Precision Doubledaphne: NVIDIA CUDA - NDT Mappingnamd-cuda: ATPase Simulation - 327,506 Atomsrodinia: OpenCL Particle Filterclpeak: Single-Precision Floatclpeak: Integer Compute INTarrayfire: Conjugate Gradient OpenCLclpeak: Global Memory BandwidthGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX991.22666.87327.21132.918217208.790.912.142.46104.22349.28223.960.230138.2786223.211670.534.075197.35765.18556.32256.83148.579893173.560.962.082.4085.82351.67297.590.203626.5838114.572409.963.386223.031201.48951.26415.2892.509207251.450.811.361.62135.98312.38158.07560.440.2456712.5804107.404222.584.445156.911083.24673.64319.05119.405787175.431.002.202.5690.98317.14167.30571.120.2291911.9924561.344422.504.540158.251039.36630.86303.12136.619341161.391.072.272.6484.82320.58173.47570.140.2204511.4884605.334748.952.643276.621758.681038.23565.38287.55211.07165.87791133.75145.011.192.813.3472.3267.5137.90330.31231.69578.190.193088.8355296.985279.952.653276.421736.001013.67530.35282.77195.62206.370933117.51139.841.303.564.2061.7560.3431.22337.40262.17591.340.188637.9317047.526903.902.083368.731785.391042.63541.45291.51199.91208.175753119.30143.481.303.524.1862.7760.7331.44343.97267.78594.850.187347.7877221.347047.122.098369.07990.71552.03346.63162.41137.37222.37487490.2788.571.893.714.4649.7646.8427.03341.15308.88612.400.183856.8618597.678493.382.071369.70963.51550.41345.00160.87137.52223.95792490.2685.661.873.594.3150.4244.5627.73348.74344.26611.080.182126.2188852.629030.212.080369.26905.99522.49336.59153.35131.93233.8025388.9181.161.943.634.3749.6542.1527.43352.15376.35604.180.181015.76910336.7610369.131.903406.14890.52516.75277.75143.60105.42311.11795368.6072.252.204.755.6737.5833.9820.36353.79522.05625.350.178484.42313408.5313256.971.676506.53898.32520.40273.74143.94104.08324.96206366.3672.322.204.935.8735.9633.1623.29349.64544.30625.620.179484.29514044.5913207.171.654530.38OpenBenchmarking.org

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: NVIDIA OptiXRTX 2080 TiTITAN RTXRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2060RTX 2070400800120016002000SE +/- 1.03, N = 3SE +/- 1.39, N = 3SE +/- 0.61, N = 3SE +/- 0.24, N = 3SE +/- 0.53, N = 3SE +/- 1.45, N = 3SE +/- 0.59, N = 3SE +/- 1.45, N = 3890.52898.32905.99963.51990.711736.001758.681785.39

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CUDARTX 2080 TiTITAN RTXRTX 2080 SUPERRTX 2080RTX 2070 SUPERGTX 1080GTX 1070RTX 2060 SUPERRTX 2060GTX 1660 SUPERRTX 2070GTX 1660GTX 1650 SUPER30060090012001500SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.18, N = 3SE +/- 2.44, N = 3SE +/- 0.29, N = 3SE +/- 0.05, N = 3SE +/- 0.25, N = 3516.75520.40522.49550.41552.03765.18991.221013.671038.231039.361042.631083.241201.48

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CUDATITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2070GTX 1080RTX 2060GTX 1660 SUPERGTX 1070GTX 1660GTX 1650 SUPER2004006008001000SE +/- 0.23, N = 3SE +/- 0.39, N = 3SE +/- 0.02, N = 3SE +/- 0.27, N = 3SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 3SE +/- 0.73, N = 3273.74277.75336.59345.00346.63530.35541.45556.32565.38630.86666.87673.64951.26

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CUDARTX 2080 TiTITAN RTXRTX 2080 SUPERRTX 2080RTX 2070 SUPERGTX 1080RTX 2060 SUPERRTX 2060RTX 2070GTX 1660 SUPERGTX 1660GTX 1070GTX 1650 SUPER90180270360450SE +/- 0.02, N = 3SE +/- 0.21, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.37, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.49, N = 3143.60143.94153.35160.87162.41256.83282.77287.55291.51303.12319.05327.21415.28

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2070 SUPERRTX 2080RTX 2060 SUPERRTX 2070RTX 206050100150200250SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.41, N = 3SE +/- 0.40, N = 3SE +/- 0.20, N = 3104.08105.42131.93137.37137.52195.62199.91211.07

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPERRTX 2060GTX 1080GTX 1660 SUPERGTX 1070GTX 1660GTX 1650 SUPER70140210280350324.96311.12233.80223.96222.37208.18206.37165.88148.58136.62132.92119.4192.51

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CUDATITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2070RTX 2060GTX 1660 SUPERGTX 1080GTX 1660GTX 1070GTX 1650 SUPER50100150200250SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.25, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.39, N = 366.3668.6088.9190.2690.27117.51119.30133.75161.39173.56175.43208.79251.45

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: NVIDIA OptiXRTX 2080 TiTITAN RTXRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2070RTX 2060306090120150SE +/- 0.06, N = 3SE +/- 0.47, N = 3SE +/- 0.04, N = 3SE +/- 0.16, N = 3SE +/- 0.08, N = 3SE +/- 0.26, N = 3SE +/- 0.32, N = 3SE +/- 0.15, N = 372.2572.3281.1685.6688.57139.84143.48145.01

LuxCoreRender OpenCL

Scene: Food

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: FoodTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2070 SUPERRTX 2080RTX 2070RTX 2060 SUPERRTX 2060GTX 1660 SUPERGTX 1660GTX 1080GTX 1070GTX 1650 SUPER0.4950.991.4851.982.475SE +/- 0.06, N = 12SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 12SE +/- 0.01, N = 3SE +/- 0.00, N = 32.202.201.941.891.871.301.301.191.071.000.960.910.81MIN: 0.13 / MAX: 2.7MIN: 0.29 / MAX: 2.63MIN: 0.25 / MAX: 2.32MIN: 0.29 / MAX: 2.24MIN: 0.25 / MAX: 2.24MIN: 0.29 / MAX: 1.5MIN: 0.26 / MAX: 1.53MIN: 0.25 / MAX: 1.39MIN: 0.25 / MAX: 1.26MIN: 0.24 / MAX: 1.16MIN: 0.13 / MAX: 1.15MIN: 0.23 / MAX: 1.06MIN: 0.24 / MAX: 0.93

LuxCoreRender OpenCL

Scene: LuxCore Benchmark

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: LuxCore BenchmarkTITAN RTXRTX 2080 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080RTX 2060 SUPERRTX 2070RTX 2060GTX 1660 SUPERGTX 1660GTX 1070GTX 1080GTX 1650 SUPER1.10932.21863.32794.43725.5465SE +/- 0.10, N = 12SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 12SE +/- 0.01, N = 34.934.753.713.633.593.563.522.812.272.202.142.081.36MIN: 0.17 / MAX: 5.74MIN: 0.32 / MAX: 5.38MIN: 0.27 / MAX: 4.24MIN: 0.27 / MAX: 4.15MIN: 0.27 / MAX: 4.08MIN: 0.27 / MAX: 4.03MIN: 0.38 / MAX: 4MIN: 0.32 / MAX: 3.18MIN: 0.27 / MAX: 2.55MIN: 0.32 / MAX: 2.48MIN: 0.32 / MAX: 2.39MIN: 0.14 / MAX: 2.4MIN: 0.27 / MAX: 1.51

LuxCoreRender OpenCL

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: DLSCTITAN RTXRTX 2080 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080RTX 2060 SUPERRTX 2070RTX 2060GTX 1660 SUPERGTX 1660GTX 1070GTX 1080GTX 1650 SUPER1.32082.64163.96245.28326.604SE +/- 0.13, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 12SE +/- 0.00, N = 35.875.674.464.374.314.204.183.342.642.562.462.401.62MIN: 1.56 / MAX: 6.11MIN: 3.79 / MAX: 5.74MIN: 4.11 / MAX: 4.56MIN: 4.11 / MAX: 4.5MIN: 4.11 / MAX: 4.39MIN: 4.11 / MAX: 4.37MIN: 4.09 / MAX: 4.28MIN: 3.15 / MAX: 3.41MIN: 2.53 / MAX: 2.71MIN: 2.31 / MAX: 2.63MIN: 2.3 / MAX: 2.51MIN: 0.66 / MAX: 2.49MIN: 1.43 / MAX: 1.65

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CUDATITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2070 SUPERRTX 2080RTX 2060 SUPERRTX 2070RTX 2060GTX 1660 SUPERGTX 1080GTX 1660GTX 1070GTX 1650 SUPER306090120150SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.22, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 3SE +/- 0.23, N = 3SE +/- 0.12, N = 335.9637.5849.6549.7650.4261.7562.7772.3284.8285.8290.98104.22135.98

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: NVIDIA OptiXTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2070RTX 20601530456075SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 333.1633.9842.1544.5646.8460.3460.7367.51

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: NVIDIA OptiXRTX 2080 TiTITAN RTXRTX 2070 SUPERRTX 2080 SUPERRTX 2080RTX 2060 SUPERRTX 2070RTX 2060918273645SE +/- 0.01, N = 3SE +/- 3.53, N = 15SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 320.3623.2927.0327.4327.7331.2231.4437.90

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenCL - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenCL - Kernel: NDT MappingRTX 2080 TiRTX 2080 SUPERGTX 1080TITAN RTXGTX 1070RTX 2080RTX 2070RTX 2070 SUPERRTX 2060 SUPERRTX 2060GTX 1660 SUPERGTX 1660GTX 1650 SUPER80160240320400SE +/- 0.42, N = 3SE +/- 0.12, N = 3SE +/- 0.47, N = 3SE +/- 1.50, N = 3SE +/- 2.39, N = 3SE +/- 1.02, N = 3SE +/- 1.78, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 2.04, N = 3SE +/- 0.28, N = 3SE +/- 0.11, N = 3353.79352.15351.67349.64349.28348.74343.97341.15337.40330.31320.58317.14312.381. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERGTX 1080RTX 2070RTX 2060 SUPERRTX 2060GTX 1070GTX 1660 SUPERGTX 1660GTX 1650 SUPER120240360480600SE +/- 1.45, N = 3SE +/- 1.15, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.67, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 3SE +/- 0.94, N = 3SE +/- 0.36, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3544.30522.05376.35344.26308.88297.59267.78262.17231.69223.96173.47167.30158.071. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: NVIDIA CUDA - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: NVIDIA CUDA - Kernel: NDT MappingTITAN RTXRTX 2080 TiRTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2070RTX 2060 SUPERRTX 2060GTX 1660GTX 1660 SUPERGTX 1650 SUPER140280420560700SE +/- 2.67, N = 3SE +/- 2.17, N = 3SE +/- 1.48, N = 3SE +/- 0.67, N = 3SE +/- 1.57, N = 3SE +/- 1.38, N = 3SE +/- 1.15, N = 3SE +/- 1.50, N = 3SE +/- 0.87, N = 3SE +/- 1.18, N = 3SE +/- 0.75, N = 3625.62625.35612.40611.08604.18594.85591.34578.19571.12570.14560.441. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsRTX 2080 TiTITAN RTXRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPERRTX 2060GTX 1080GTX 1660 SUPERGTX 1660GTX 1070GTX 1650 SUPER0.05530.11060.16590.22120.2765SE +/- 0.00006, N = 3SE +/- 0.00081, N = 3SE +/- 0.00010, N = 3SE +/- 0.00006, N = 3SE +/- 0.00031, N = 3SE +/- 0.00023, N = 3SE +/- 0.00047, N = 3SE +/- 0.00034, N = 3SE +/- 0.00124, N = 3SE +/- 0.00036, N = 3SE +/- 0.00052, N = 3SE +/- 0.00394, N = 3SE +/- 0.00031, N = 30.178480.179480.181010.182120.183850.187340.188630.193080.203620.220450.229190.230130.24567

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080GTX 1080RTX 2070 SUPERRTX 2070RTX 2060 SUPERGTX 1070RTX 2060GTX 1660 SUPERGTX 1660GTX 1650 SUPER3691215SE +/- 0.064, N = 3SE +/- 0.006, N = 3SE +/- 0.006, N = 3SE +/- 0.012, N = 3SE +/- 0.079, N = 3SE +/- 0.005, N = 3SE +/- 0.017, N = 3SE +/- 0.004, N = 3SE +/- 0.018, N = 3SE +/- 0.020, N = 3SE +/- 0.025, N = 3SE +/- 0.023, N = 3SE +/- 0.013, N = 34.2954.4235.7696.2186.5836.8617.7877.9318.2788.83511.48811.99212.5801. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERGTX 1080RTX 2070RTX 2060 SUPERGTX 1070RTX 2060GTX 1660 SUPERGTX 1660GTX 1650 SUPER3K6K9K12K15KSE +/- 194.88, N = 15SE +/- 163.05, N = 15SE +/- 101.58, N = 9SE +/- 10.28, N = 3SE +/- 77.32, N = 15SE +/- 112.13, N = 15SE +/- 96.05, N = 15SE +/- 91.57, N = 5SE +/- 0.24, N = 3SE +/- 83.16, N = 3SE +/- 17.25, N = 3SE +/- 46.17, N = 15SE +/- 55.18, N = 1514044.5913408.5310336.768852.628597.678114.577221.347047.526223.215296.984605.334561.344107.401. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRTX 2080 TiTITAN RTXRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPERRTX 2060GTX 1660 SUPERGTX 1660GTX 1650 SUPERGTX 1080GTX 10703K6K9K12K15KSE +/- 130.38, N = 15SE +/- 112.69, N = 3SE +/- 146.75, N = 4SE +/- 32.74, N = 3SE +/- 75.60, N = 15SE +/- 70.45, N = 15SE +/- 70.16, N = 15SE +/- 49.62, N = 3SE +/- 26.64, N = 3SE +/- 53.01, N = 3SE +/- 52.82, N = 15SE +/- 15.07, N = 3SE +/- 21.96, N = 313256.9713207.1710369.139030.218493.387047.126903.905279.954748.954422.504222.582409.961670.531. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2070 SUPERRTX 2080RTX 2060 SUPERRTX 2070GTX 1660 SUPERRTX 2060GTX 1080GTX 1070GTX 1650 SUPERGTX 16601.02152.0433.06454.0865.1075SE +/- 0.007, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 31.6541.6761.9032.0712.0802.0832.0982.6432.6533.3864.0754.4454.5401. (CXX) g++ options: -rdynamic

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthTITAN RTXRTX 2080 TiRTX 2080 SUPERRTX 2070 SUPERRTX 2080RTX 2070RTX 2060 SUPERGTX 1660 SUPERRTX 2060GTX 1080GTX 1070GTX 1660GTX 1650 SUPER110220330440550SE +/- 0.12, N = 3SE +/- 1.07, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 0.23, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3530.38506.53406.14369.70369.26369.07368.73276.62276.42223.03197.35158.25156.911. (CXX) g++ options: -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5