NVIDIA CUDA OpenCL Compute Tests Pre-Ampere

NVIDIA GeForce compute benchmarks of GTX 1000 and RTX 2000 series. Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2008319-FI-NVIDIACOM78&grr&rdt.

NVIDIA CUDA OpenCL Compute Tests Pre-AmpereProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600NVIDIA TITAN RTX 24GB (1350/7000MHz)NVIDIA TU102 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-42-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)NVIDIA TU106 HD AudioNVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz)NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz)NVIDIA TU104 HD AudioNVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TU102 HD AudioASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)NVIDIA TU106 HD AudioZotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA TU104 HD AudioNVIDIA GeForce GTX 1080 8GB (1607/5005MHz)NVIDIA GP104 HD AudioeVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz)NVIDIA TU116 HD AudioASUS NVIDIA GeForce GTX 1650 SUPER 4GB (1530/6000MHz)NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)NVIDIA GP104 HD AudioASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)NVIDIA TU116 HD AudioOpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013OpenCL Details- TITAN RTX: GPU Compute Cores: 4608- RTX 2060: GPU Compute Cores: 1920- RTX 2060 SUPER: GPU Compute Cores: 2176- RTX 2080 SUPER: GPU Compute Cores: 3072- RTX 2070 SUPER: GPU Compute Cores: 2560- RTX 2080 Ti: GPU Compute Cores: 4352- RTX 2070: GPU Compute Cores: 2304- RTX 2080: GPU Compute Cores: 2944- GTX 1080: GPU Compute Cores: 2560- GTX 1660 SUPER: GPU Compute Cores: 1408- GTX 1650 SUPER: GPU Compute Cores: 1280- GTX 1070: GPU Compute Cores: 1920- GTX 1660: GPU Compute Cores: 1408Python Details- TITAN RTX: Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NVIDIA CUDA OpenCL Compute Tests Pre-Ampereblender: Barbershop - NVIDIA OptiXblender: Barbershop - CUDAblender: Pabellon Barcelona - CUDAblender: Classroom - CUDAblender: Pabellon Barcelona - NVIDIA OptiXoctanebench: Total Scoreblender: Fishy Cat - CUDAblender: Classroom - NVIDIA OptiXluxcorerender-cl: Foodluxcorerender-cl: LuxCore Benchmarkluxcorerender-cl: DLSCblender: BMW27 - CUDAblender: Fishy Cat - NVIDIA OptiXblender: BMW27 - NVIDIA OptiXdaphne: OpenCL - NDT Mappingclpeak: Double-Precision Doubledaphne: NVIDIA CUDA - NDT Mappingnamd-cuda: ATPase Simulation - 327,506 Atomsrodinia: OpenCL Particle Filterclpeak: Single-Precision Floatclpeak: Integer Compute INTarrayfire: Conjugate Gradient OpenCLclpeak: Global Memory BandwidthTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660898.32520.40273.74143.94104.08324.96206366.3672.322.204.935.8735.9633.1623.29349.64544.30625.620.179484.29514044.5913207.171.654530.381758.681038.23565.38287.55211.07165.87791133.75145.011.192.813.3472.3267.5137.90330.31231.69578.190.193088.8355296.985279.952.653276.421736.001013.67530.35282.77195.62206.370933117.51139.841.303.564.2061.7560.3431.22337.40262.17591.340.188637.9317047.526903.902.083368.73905.99522.49336.59153.35131.93233.8025388.9181.161.943.634.3749.6542.1527.43352.15376.35604.180.181015.76910336.7610369.131.903406.14990.71552.03346.63162.41137.37222.37487490.2788.571.893.714.4649.7646.8427.03341.15308.88612.400.183856.8618597.678493.382.071369.70890.52516.75277.75143.60105.42311.11795368.6072.252.204.755.6737.5833.9820.36353.79522.05625.350.178484.42313408.5313256.971.676506.531785.391042.63541.45291.51199.91208.175753119.30143.481.303.524.1862.7760.7331.44343.97267.78594.850.187347.7877221.347047.122.098369.07963.51550.41345.00160.87137.52223.95792490.2685.661.873.594.3150.4244.5627.73348.74344.26611.080.182126.2188852.629030.212.080369.26765.18556.32256.83148.579893173.560.962.082.4085.82351.67297.590.203626.5838114.572409.963.386223.031039.36630.86303.12136.619341161.391.072.272.6484.82320.58173.47570.140.2204511.4884605.334748.952.643276.621201.48951.26415.2892.509207251.450.811.361.62135.98312.38158.07560.440.2456712.5804107.404222.584.445156.91991.22666.87327.21132.918217208.790.912.142.46104.22349.28223.960.230138.2786223.211670.534.075197.351083.24673.64319.05119.405787175.431.002.202.5690.98317.14167.30571.120.2291911.9924561.344422.504.540158.25OpenBenchmarking.org

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080400800120016002000SE +/- 1.39, N = 3SE +/- 0.59, N = 3SE +/- 1.45, N = 3SE +/- 0.61, N = 3SE +/- 0.53, N = 3SE +/- 1.03, N = 3SE +/- 1.45, N = 3SE +/- 0.24, N = 3898.321758.681736.00905.99990.71890.521785.39963.51

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166030060090012001500SE +/- 0.09, N = 3SE +/- 0.18, N = 3SE +/- 0.27, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.21, N = 3SE +/- 0.29, N = 3SE +/- 0.11, N = 3SE +/- 0.23, N = 3SE +/- 2.44, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3520.401038.231013.67522.49552.03516.751042.63550.41765.181039.361201.48991.221083.24

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16602004006008001000SE +/- 0.23, N = 3SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.39, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.73, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 3273.74565.38530.35336.59346.63277.75541.45345.00556.32630.86951.26666.87673.64

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166090180270360450SE +/- 0.21, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.37, N = 3SE +/- 0.02, N = 3SE +/- 0.18, N = 3SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.49, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3143.94287.55282.77153.35162.41143.60291.51160.87256.83303.12415.28327.21319.05

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 208050100150200250SE +/- 0.01, N = 3SE +/- 0.20, N = 3SE +/- 0.41, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.40, N = 3SE +/- 0.03, N = 3104.08211.07195.62131.93137.37105.42199.91137.52

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166070140210280350324.96165.88206.37233.80222.37311.12208.18223.96148.58136.6292.51132.92119.41

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166050100150200250SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.25, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.39, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 366.36133.75117.5188.9190.2768.60119.3090.26173.56161.39251.45208.79175.43

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080306090120150SE +/- 0.47, N = 3SE +/- 0.15, N = 3SE +/- 0.26, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.32, N = 3SE +/- 0.16, N = 372.32145.01139.8481.1688.5772.25143.4885.66

LuxCoreRender OpenCL

Scene: Food

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: FoodTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16600.4950.991.4851.982.475SE +/- 0.06, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.201.191.301.941.892.201.301.870.961.070.810.911.00MIN: 0.13 / MAX: 2.7MIN: 0.25 / MAX: 1.39MIN: 0.26 / MAX: 1.53MIN: 0.25 / MAX: 2.32MIN: 0.29 / MAX: 2.24MIN: 0.29 / MAX: 2.63MIN: 0.29 / MAX: 1.5MIN: 0.25 / MAX: 2.24MIN: 0.13 / MAX: 1.15MIN: 0.25 / MAX: 1.26MIN: 0.24 / MAX: 0.93MIN: 0.23 / MAX: 1.06MIN: 0.24 / MAX: 1.16

LuxCoreRender OpenCL

Scene: LuxCore Benchmark

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: LuxCore BenchmarkTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16601.10932.21863.32794.43725.5465SE +/- 0.10, N = 12SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.932.813.563.633.714.753.523.592.082.271.362.142.20MIN: 0.17 / MAX: 5.74MIN: 0.32 / MAX: 3.18MIN: 0.27 / MAX: 4.03MIN: 0.27 / MAX: 4.15MIN: 0.27 / MAX: 4.24MIN: 0.32 / MAX: 5.38MIN: 0.38 / MAX: 4MIN: 0.27 / MAX: 4.08MIN: 0.14 / MAX: 2.4MIN: 0.27 / MAX: 2.55MIN: 0.27 / MAX: 1.51MIN: 0.32 / MAX: 2.39MIN: 0.32 / MAX: 2.48

LuxCoreRender OpenCL

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: DLSCTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16601.32082.64163.96245.28326.604SE +/- 0.13, N = 12SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.873.344.204.374.465.674.184.312.402.641.622.462.56MIN: 1.56 / MAX: 6.11MIN: 3.15 / MAX: 3.41MIN: 4.11 / MAX: 4.37MIN: 4.11 / MAX: 4.5MIN: 4.11 / MAX: 4.56MIN: 3.79 / MAX: 5.74MIN: 4.09 / MAX: 4.28MIN: 4.11 / MAX: 4.39MIN: 0.66 / MAX: 2.49MIN: 2.53 / MAX: 2.71MIN: 1.43 / MAX: 1.65MIN: 2.3 / MAX: 2.51MIN: 2.31 / MAX: 2.63

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660306090120150SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.22, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 335.9672.3261.7549.6549.7637.5862.7750.4285.8284.82135.98104.2290.98

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 20801530456075SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 333.1667.5160.3442.1546.8433.9860.7344.56

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080918273645SE +/- 3.53, N = 15SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 323.2937.9031.2227.4327.0320.3631.4427.73

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenCL - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenCL - Kernel: NDT MappingTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166080160240320400SE +/- 1.50, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.42, N = 3SE +/- 1.78, N = 3SE +/- 1.02, N = 3SE +/- 0.47, N = 3SE +/- 2.04, N = 3SE +/- 0.11, N = 3SE +/- 2.39, N = 3SE +/- 0.28, N = 3349.64330.31337.40352.15341.15353.79343.97348.74351.67320.58312.38349.28317.141. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660120240360480600SE +/- 1.45, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 1.15, N = 3SE +/- 0.67, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.36, N = 3SE +/- 0.08, N = 3SE +/- 0.94, N = 3SE +/- 0.07, N = 3544.30231.69262.17376.35308.88522.05267.78344.26297.59173.47158.07223.96167.301. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: NVIDIA CUDA - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: NVIDIA CUDA - Kernel: NDT MappingTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1660 SUPERGTX 1650 SUPERGTX 1660140280420560700SE +/- 2.67, N = 3SE +/- 1.50, N = 3SE +/- 1.15, N = 3SE +/- 1.57, N = 3SE +/- 1.48, N = 3SE +/- 2.17, N = 3SE +/- 1.38, N = 3SE +/- 0.67, N = 3SE +/- 1.18, N = 3SE +/- 0.75, N = 3SE +/- 0.87, N = 3625.62578.19591.34604.18612.40625.35594.85611.08570.14560.44571.121. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16600.05530.11060.16590.22120.2765SE +/- 0.00081, N = 3SE +/- 0.00034, N = 3SE +/- 0.00047, N = 3SE +/- 0.00010, N = 3SE +/- 0.00031, N = 3SE +/- 0.00006, N = 3SE +/- 0.00023, N = 3SE +/- 0.00006, N = 3SE +/- 0.00124, N = 3SE +/- 0.00036, N = 3SE +/- 0.00031, N = 3SE +/- 0.00394, N = 3SE +/- 0.00052, N = 30.179480.193080.188630.181010.183850.178480.187340.182120.203620.220450.245670.230130.22919

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16603691215SE +/- 0.064, N = 3SE +/- 0.020, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.017, N = 3SE +/- 0.012, N = 3SE +/- 0.079, N = 3SE +/- 0.025, N = 3SE +/- 0.013, N = 3SE +/- 0.018, N = 3SE +/- 0.023, N = 34.2958.8357.9315.7696.8614.4237.7876.2186.58311.48812.5808.27811.9921. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16603K6K9K12K15KSE +/- 194.88, N = 15SE +/- 83.16, N = 3SE +/- 91.57, N = 5SE +/- 101.58, N = 9SE +/- 77.32, N = 15SE +/- 163.05, N = 15SE +/- 96.05, N = 15SE +/- 10.28, N = 3SE +/- 112.13, N = 15SE +/- 17.25, N = 3SE +/- 55.18, N = 15SE +/- 0.24, N = 3SE +/- 46.17, N = 1514044.595296.987047.5210336.768597.6713408.537221.348852.628114.574605.334107.406223.214561.341. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16603K6K9K12K15KSE +/- 112.69, N = 3SE +/- 49.62, N = 3SE +/- 70.16, N = 15SE +/- 146.75, N = 4SE +/- 75.60, N = 15SE +/- 130.38, N = 15SE +/- 70.45, N = 15SE +/- 32.74, N = 3SE +/- 15.07, N = 3SE +/- 26.64, N = 3SE +/- 52.82, N = 15SE +/- 21.96, N = 3SE +/- 53.01, N = 313207.175279.956903.9010369.138493.3813256.977047.129030.212409.964748.954222.581670.534422.501. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16601.02152.0433.06454.0865.1075SE +/- 0.007, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 31.6542.6532.0831.9032.0711.6762.0982.0803.3862.6434.4454.0754.5401. (CXX) g++ options: -rdynamic

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660110220330440550SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 1.07, N = 3SE +/- 0.01, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3530.38276.42368.73406.14369.70506.53369.07369.26223.03276.62156.91197.35158.251. (CXX) g++ options: -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5