NVIDIA CUDA OpenCL Compute Tests Pre-Ampere

NVIDIA GeForce compute benchmarks of GTX 1000 and RTX 2000 series. Benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2008319-FI-NVIDIACOM78
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

CPU Massive 2 Tests
HPC - High Performance Computing 3 Tests
Multi-Core 3 Tests
NVIDIA GPU Compute 6 Tests
OpenCL 2 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GTX 1070
August 30 2020
  2 Hours, 17 Minutes
GTX 1080
August 29 2020
  2 Hours, 24 Minutes
GTX 1650 SUPER
August 30 2020
  2 Hours, 53 Minutes
GTX 1660
August 30 2020
  2 Hours, 22 Minutes
GTX 1660 SUPER
August 29 2020
  2 Hours, 15 Minutes
RTX 2060
August 27 2020
  4 Hours
RTX 2060 SUPER
August 28 2020
  3 Hours, 54 Minutes
RTX 2070
August 28 2020
  3 Hours, 59 Minutes
RTX 2070 SUPER
August 28 2020
  2 Hours, 30 Minutes
RTX 2080
August 29 2020
  2 Hours, 27 Minutes
RTX 2080 SUPER
August 28 2020
  2 Hours, 21 Minutes
RTX 2080 Ti
August 28 2020
  2 Hours, 14 Minutes
TITAN RTX
August 27 2020
  2 Hours, 49 Minutes
Invert Hiding All Results Option
  2 Hours, 48 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA CUDA OpenCL Compute Tests Pre-AmpereOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600NVIDIA TITAN RTX 24GB (1350/7000MHz)NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)NVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz)NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz)NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz)ASUS NVIDIA GeForce GTX 1650 SUPER 4GB (1530/6000MHz)NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)ASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)NVIDIA TU102 HD AudioNVIDIA TU106 HD AudioNVIDIA TU104 HD AudioNVIDIA GP104 HD AudioNVIDIA TU116 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-42-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudiosMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionNVIDIA CUDA OpenCL Compute Tests Pre-Ampere BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013- TITAN RTX: GPU Compute Cores: 4608- RTX 2060: GPU Compute Cores: 1920- RTX 2060 SUPER: GPU Compute Cores: 2176- RTX 2080 SUPER: GPU Compute Cores: 3072- RTX 2070 SUPER: GPU Compute Cores: 2560- RTX 2080 Ti: GPU Compute Cores: 4352- RTX 2070: GPU Compute Cores: 2304- RTX 2080: GPU Compute Cores: 2944- GTX 1080: GPU Compute Cores: 2560- GTX 1660 SUPER: GPU Compute Cores: 1408- GTX 1650 SUPER: GPU Compute Cores: 1280- GTX 1070: GPU Compute Cores: 1920- GTX 1660: GPU Compute Cores: 1408- TITAN RTX: Python 3.8.2- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

TITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660Result OverviewPhoronix Test Suite100%163%226%288%351%OctaneBenchclpeakLuxCoreRender OpenCLBlenderRodiniaArrayFireNAMD CUDADarmstadt Automotive Parallel Heterogeneous Suite

NVIDIA CUDA OpenCL Compute Tests Pre-Ampereblender: Barbershop - NVIDIA OptiXblender: Barbershop - CUDAblender: Pabellon Barcelona - CUDAblender: Classroom - CUDAblender: Pabellon Barcelona - NVIDIA OptiXoctanebench: Total Scoreblender: Fishy Cat - CUDAblender: Classroom - NVIDIA OptiXluxcorerender-cl: Foodluxcorerender-cl: LuxCore Benchmarkluxcorerender-cl: DLSCblender: BMW27 - CUDAblender: Fishy Cat - NVIDIA OptiXblender: BMW27 - NVIDIA OptiXdaphne: OpenCL - NDT Mappingclpeak: Double-Precision Doubledaphne: NVIDIA CUDA - NDT Mappingnamd-cuda: ATPase Simulation - 327,506 Atomsrodinia: OpenCL Particle Filterclpeak: Single-Precision Floatclpeak: Integer Compute INTarrayfire: Conjugate Gradient OpenCLclpeak: Global Memory BandwidthTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660898.32520.40273.74143.94104.08324.96206366.3672.322.204.935.8735.9633.1623.29349.64544.30625.620.179484.29514044.5913207.171.654530.381758.681038.23565.38287.55211.07165.87791133.75145.011.192.813.3472.3267.5137.90330.31231.69578.190.193088.8355296.985279.952.653276.421736.001013.67530.35282.77195.62206.370933117.51139.841.303.564.2061.7560.3431.22337.40262.17591.340.188637.9317047.526903.902.083368.73905.99522.49336.59153.35131.93233.8025388.9181.161.943.634.3749.6542.1527.43352.15376.35604.180.181015.76910336.7610369.131.903406.14990.71552.03346.63162.41137.37222.37487490.2788.571.893.714.4649.7646.8427.03341.15308.88612.400.183856.8618597.678493.382.071369.70890.52516.75277.75143.60105.42311.11795368.6072.252.204.755.6737.5833.9820.36353.79522.05625.350.178484.42313408.5313256.971.676506.531785.391042.63541.45291.51199.91208.175753119.30143.481.303.524.1862.7760.7331.44343.97267.78594.850.187347.7877221.347047.122.098369.07963.51550.41345.00160.87137.52223.95792490.2685.661.873.594.3150.4244.5627.73348.74344.26611.080.182126.2188852.629030.212.080369.26765.18556.32256.83148.579893173.560.962.082.4085.82351.67297.590.203626.5838114.572409.963.386223.031039.36630.86303.12136.619341161.391.072.272.6484.82320.58173.47570.140.2204511.4884605.334748.952.643276.621201.48951.26415.2892.509207251.450.811.361.62135.98312.38158.07560.440.2456712.5804107.404222.584.445156.91991.22666.87327.21132.918217208.790.912.142.46104.22349.28223.960.230138.2786223.211670.534.075197.351083.24673.64319.05119.405787175.431.002.202.5690.98317.14167.30571.120.2291911.9924561.344422.504.540158.25OpenBenchmarking.org

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080400800120016002000SE +/- 1.39, N = 3SE +/- 0.59, N = 3SE +/- 1.45, N = 3SE +/- 0.61, N = 3SE +/- 0.53, N = 3SE +/- 1.03, N = 3SE +/- 1.45, N = 3SE +/- 0.24, N = 3898.321758.681736.00905.99990.71890.521785.39963.51

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166030060090012001500SE +/- 0.09, N = 3SE +/- 0.18, N = 3SE +/- 0.27, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.21, N = 3SE +/- 0.29, N = 3SE +/- 0.11, N = 3SE +/- 0.23, N = 3SE +/- 2.44, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3520.401038.231013.67522.49552.03516.751042.63550.41765.181039.361201.48991.221083.24

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16602004006008001000SE +/- 0.23, N = 3SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.39, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.73, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 3273.74565.38530.35336.59346.63277.75541.45345.00556.32630.86951.26666.87673.64

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166090180270360450SE +/- 0.21, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.37, N = 3SE +/- 0.02, N = 3SE +/- 0.18, N = 3SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.49, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3143.94287.55282.77153.35162.41143.60291.51160.87256.83303.12415.28327.21319.05

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 208050100150200250SE +/- 0.01, N = 3SE +/- 0.20, N = 3SE +/- 0.41, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.40, N = 3SE +/- 0.03, N = 3104.08211.07195.62131.93137.37105.42199.91137.52

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166070140210280350324.96165.88206.37233.80222.37311.12208.18223.96148.58136.6292.51132.92119.41

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166050100150200250SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.25, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.39, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 366.36133.75117.5188.9190.2768.60119.3090.26173.56161.39251.45208.79175.43

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080306090120150SE +/- 0.47, N = 3SE +/- 0.15, N = 3SE +/- 0.26, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.32, N = 3SE +/- 0.16, N = 372.32145.01139.8481.1688.5772.25143.4885.66

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: FoodTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16600.4950.991.4851.982.475SE +/- 0.06, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.201.191.301.941.892.201.301.870.961.070.810.911.00MIN: 0.13 / MAX: 2.7MIN: 0.25 / MAX: 1.39MIN: 0.26 / MAX: 1.53MIN: 0.25 / MAX: 2.32MIN: 0.29 / MAX: 2.24MIN: 0.29 / MAX: 2.63MIN: 0.29 / MAX: 1.5MIN: 0.25 / MAX: 2.24MIN: 0.13 / MAX: 1.15MIN: 0.25 / MAX: 1.26MIN: 0.24 / MAX: 0.93MIN: 0.23 / MAX: 1.06MIN: 0.24 / MAX: 1.16

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: LuxCore BenchmarkTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16601.10932.21863.32794.43725.5465SE +/- 0.10, N = 12SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.932.813.563.633.714.753.523.592.082.271.362.142.20MIN: 0.17 / MAX: 5.74MIN: 0.32 / MAX: 3.18MIN: 0.27 / MAX: 4.03MIN: 0.27 / MAX: 4.15MIN: 0.27 / MAX: 4.24MIN: 0.32 / MAX: 5.38MIN: 0.38 / MAX: 4MIN: 0.27 / MAX: 4.08MIN: 0.14 / MAX: 2.4MIN: 0.27 / MAX: 2.55MIN: 0.27 / MAX: 1.51MIN: 0.32 / MAX: 2.39MIN: 0.32 / MAX: 2.48

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: DLSCTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16601.32082.64163.96245.28326.604SE +/- 0.13, N = 12SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.873.344.204.374.465.674.184.312.402.641.622.462.56MIN: 1.56 / MAX: 6.11MIN: 3.15 / MAX: 3.41MIN: 4.11 / MAX: 4.37MIN: 4.11 / MAX: 4.5MIN: 4.11 / MAX: 4.56MIN: 3.79 / MAX: 5.74MIN: 4.09 / MAX: 4.28MIN: 4.11 / MAX: 4.39MIN: 0.66 / MAX: 2.49MIN: 2.53 / MAX: 2.71MIN: 1.43 / MAX: 1.65MIN: 2.3 / MAX: 2.51MIN: 2.31 / MAX: 2.63

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CUDATITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660306090120150SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.22, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 335.9672.3261.7549.6549.7637.5862.7750.4285.8284.82135.98104.2290.98

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 20801530456075SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 333.1667.5160.3442.1546.8433.9860.7344.56

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: NVIDIA OptiXTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080918273645SE +/- 3.53, N = 15SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 323.2937.9031.2227.4327.0320.3631.4427.73

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenCL - Kernel: NDT MappingTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 166080160240320400SE +/- 1.50, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.42, N = 3SE +/- 1.78, N = 3SE +/- 1.02, N = 3SE +/- 0.47, N = 3SE +/- 2.04, N = 3SE +/- 0.11, N = 3SE +/- 2.39, N = 3SE +/- 0.28, N = 3349.64330.31337.40352.15341.15353.79343.97348.74351.67320.58312.38349.28317.141. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660120240360480600SE +/- 1.45, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 1.15, N = 3SE +/- 0.67, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.36, N = 3SE +/- 0.08, N = 3SE +/- 0.94, N = 3SE +/- 0.07, N = 3544.30231.69262.17376.35308.88522.05267.78344.26297.59173.47158.07223.96167.301. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: NVIDIA CUDA - Kernel: NDT MappingTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1660 SUPERGTX 1650 SUPERGTX 1660140280420560700SE +/- 2.67, N = 3SE +/- 1.50, N = 3SE +/- 1.15, N = 3SE +/- 1.57, N = 3SE +/- 1.48, N = 3SE +/- 2.17, N = 3SE +/- 1.38, N = 3SE +/- 0.67, N = 3SE +/- 1.18, N = 3SE +/- 0.75, N = 3SE +/- 0.87, N = 3625.62578.19591.34604.18612.40625.35594.85611.08570.14560.44571.121. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16600.05530.11060.16590.22120.2765SE +/- 0.00081, N = 3SE +/- 0.00034, N = 3SE +/- 0.00047, N = 3SE +/- 0.00010, N = 3SE +/- 0.00031, N = 3SE +/- 0.00006, N = 3SE +/- 0.00023, N = 3SE +/- 0.00006, N = 3SE +/- 0.00124, N = 3SE +/- 0.00036, N = 3SE +/- 0.00031, N = 3SE +/- 0.00394, N = 3SE +/- 0.00052, N = 30.179480.193080.188630.181010.183850.178480.187340.182120.203620.220450.245670.230130.22919

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16603691215SE +/- 0.064, N = 3SE +/- 0.020, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.017, N = 3SE +/- 0.012, N = 3SE +/- 0.079, N = 3SE +/- 0.025, N = 3SE +/- 0.013, N = 3SE +/- 0.018, N = 3SE +/- 0.023, N = 34.2958.8357.9315.7696.8614.4237.7876.2186.58311.48812.5808.27811.9921. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16603K6K9K12K15KSE +/- 194.88, N = 15SE +/- 83.16, N = 3SE +/- 91.57, N = 5SE +/- 101.58, N = 9SE +/- 77.32, N = 15SE +/- 163.05, N = 15SE +/- 96.05, N = 15SE +/- 10.28, N = 3SE +/- 112.13, N = 15SE +/- 17.25, N = 3SE +/- 55.18, N = 15SE +/- 0.24, N = 3SE +/- 46.17, N = 1514044.595296.987047.5210336.768597.6713408.537221.348852.628114.574605.334107.406223.214561.341. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16603K6K9K12K15KSE +/- 112.69, N = 3SE +/- 49.62, N = 3SE +/- 70.16, N = 15SE +/- 146.75, N = 4SE +/- 75.60, N = 15SE +/- 130.38, N = 15SE +/- 70.45, N = 15SE +/- 32.74, N = 3SE +/- 15.07, N = 3SE +/- 26.64, N = 3SE +/- 52.82, N = 15SE +/- 21.96, N = 3SE +/- 53.01, N = 313207.175279.956903.9010369.138493.3813256.977047.129030.212409.964748.954222.581670.534422.501. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 16601.02152.0433.06454.0865.1075SE +/- 0.007, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 31.6542.6532.0831.9032.0711.6762.0982.0803.3862.6434.4454.0754.5401. (CXX) g++ options: -rdynamic

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthTITAN RTXRTX 2060RTX 2060 SUPERRTX 2080 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2070RTX 2080GTX 1080GTX 1660 SUPERGTX 1650 SUPERGTX 1070GTX 1660110220330440550SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 1.07, N = 3SE +/- 0.01, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3530.38276.42368.73406.14369.70506.53369.07369.26223.03276.62156.91197.35158.251. (CXX) g++ options: -O3 -rdynamic -lOpenCL