NVIDIA RTX 2000 / 4000 / 6000 Ada Generation

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409136-PTS-NVIDIART50
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

CPU Massive 3 Tests
Creator Workloads 4 Tests
Multi-Core 5 Tests
NVIDIA GPU Compute 9 Tests
OpenCL 3 Tests
Renderers 4 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 2000 Ada Generation
September 13
  6 Hours, 38 Minutes
RTX 4000 Ada Generation
September 13
  5 Hours, 25 Minutes
RTX 6000 Ada Generation
September 11
  4 Hours, 47 Minutes
Invert Hiding All Results Option
  5 Hours, 37 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA RTX 2000 / 4000 / 6000 Ada GenerationOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 9950X 16-Core @ 8.18GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2308 BIOS)AMD Device 14d82 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32Western Digital WD_BLACK SN850X 2000GBNVIDIA RTX 6000 Ada Generation 48GBNVIDIA RTX 2000 Ada Generation 16GBNVIDIA RTX 4000 Ada Generation 20GBNVIDIA AD102 HD AudioNVIDIA Device 22beNVIDIA Device 22bcDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.8.0-41-generic (x86_64)GNOME Shell 46.0X Server 1.21.1.11NVIDIA 560.35.034.6.0OpenCL 3.0 CUDA 12.6.65GCC 13.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudiosMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionNVIDIA RTX 2000 / 4000 / 6000 Ada Generation BenchmarksSystem Logs- nouveau.modeset=0 - Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xb40401c - RTX 6000 Ada Generation: BAR1 / Visible vRAM Size: 65536 MiB - vBIOS Version: 95.02.3a.00.01 - RTX 2000 Ada Generation: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.07.47.00.05 - RTX 4000 Ada Generation: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.04.5c.00.0d - RTX 6000 Ada Generation: GPU Compute Cores: 18176- RTX 2000 Ada Generation: GPU Compute Cores: 2816- RTX 4000 Ada Generation: GPU Compute Cores: 6144- Python 3.12.3- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada GenerationResult OverviewPhoronix Test Suite100%245%391%536%682%vkpeakGpuOwlclpeakProjectPhysX OpenCL-BenchmarkRodiniaChaos Group V-RAYFluidX3DFinanceBenchBlenderParaViewLuxCoreRenderIndigoBenchSPECViewPerf 2020FAHBench

RTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada GenerationPer Watt Result OverviewPhoronix Test Suite100%117%135%152%169%FAHBenchParaViewclpeakIndigoBenchSPECViewPerf 2020vkpeakGpuOwlChaos Group V-RAYLuxCoreRenderProjectPhysX OpenCL-BenchmarkFluidX3DP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

NVIDIA RTX 2000 / 4000 / 6000 Ada Generationspecviewperf2020: 2560 x 1440 - CREO-03gpuowl: 77936867gpuowl: 332220523vkpeak: int16-vec4vkpeak: int16-scalarvkpeak: int32-vec4vkpeak: int32-scalarvkpeak: fp64-vec4vkpeak: fp64-scalarvkpeak: fp16-vec4vkpeak: fp16-scalarvkpeak: fp32-vec4vkpeak: fp32-scalarspecviewperf2020: 2560 x 1440 - MAYA-06specviewperf2020: 2560 x 1440 - MEDICAL-O3gpuowl: 57885161specviewperf2020: 2560 x 1440 - SOLIDWORKS-07specviewperf2020: 2560 x 1440 - SNX-04specviewperf2020: 2560 x 1440 - CATIA-06specviewperf2020: 2560 x 1440 - ENERGY-03blender: Barbershop - NVIDIA CUDAparaview: Many Spheres - 3000 - 3840 x 2160paraview: Many Spheres - 3000 - 3840 x 2160paraview: Many Spheres - 3000 - 2560 x 1440paraview: Many Spheres - 3000 - 2560 x 1440fahbench: fluidx3d: FP32-FP32v-ray: NVIDIA CUDA GPUv-ray: NVIDIA RTX GPUblender: Barbershop - NVIDIA OptiXparaview: Wavelet Contour - 3000 - 3840 x 2160paraview: Wavelet Contour - 3000 - 3840 x 2160paraview: Wavelet Volume - 3000 - 3840 x 2160paraview: Wavelet Volume - 3000 - 3840 x 2160luxcorerender: LuxCore Benchmark - GPUparaview: Wavelet Contour - 3000 - 2560 x 1440paraview: Wavelet Contour - 3000 - 2560 x 1440indigobench: OpenCL GPU - Bedroomluxcorerender: Orange Juice - GPUblender: Pabellon Barcelona - NVIDIA CUDAluxcorerender: Danish Mood - GPUindigobench: OpenCL GPU - Supercarluxcorerender: DLSC - GPUparaview: Wavelet Volume - 3000 - 2560 x 1440paraview: Wavelet Volume - 3000 - 2560 x 1440fluidx3d: FP32-FP16Sfluidx3d: FP32-FP16Cblender: Fishy Cat - NVIDIA CUDAblender: Classroom - NVIDIA CUDAblender: Junkshop - NVIDIA CUDAblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - NVIDIA OptiXblender: Classroom - NVIDIA OptiXopencl-benchmark: Memory Bandwidth Coalesced Writeopencl-benchmark: Memory Bandwidth Coalesced Readopencl-benchmark: INT8 Computeopencl-benchmark: INT16 Computeopencl-benchmark: INT32 Computeopencl-benchmark: INT64 Computeopencl-benchmark: FP32 Computeopencl-benchmark: FP64 Computeblender: Junkshop - NVIDIA OptiXblender: BMW27 - NVIDIA CUDAluxcorerender: Rainbow Colors and Prism - GPUclpeak: Double-Precision Computeblender: BMW27 - NVIDIA OptiXfinancebench: Monte-Carlo OpenCLrodinia: OpenCL Particle Filterclpeak: Global Memory Bandwidthclpeak: Integer 24-bit Computeclpeak: Integer Computeclpeak: Single-Precision Computefinancebench: Black-Scholes OpenCLRTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada Generation256.461490.31322.4238953.7131027.6648137.8548838.831533.641538.9493562.4746258.4362508.5448174.33813.18211.982043.60578.101026.56236.96156.1745.6515887.667158.4716804.138167.61462.313152525984817432.916521.152625.7610234.762639.6717.788453.567811.1929.28215.8621.3017.7273.90518.9513658.905853.68102081058510.6710.4411.085.488.867.92850.04865.5721.79232.29744.1613.77490.8051.517.345.5735.651534.703.6199.2392872.091815.9140310.5840360.7583188.792.710113.82226.6447.856167.154634.076927.176959.28218.82218.8213745.306968.179186.816944.94276.4465.60308.39184.73285.2283.6931.73188.764339.65743.294739.46647.27207.8500133114192023114.451518.356145.702413.335150.834.632021.821194.018.1615.21108.954.0924.3994.614571.635285.732434253147.2645.8738.4022.7431.0728.55211.81207.714.7875.9476.8231.73513.4390.21522.3022.5311.92217.6110.45266.0351439.206195.005965.805932.1211596.8216.228143170.61449.5195.3012381.119328.0113940.9414000.71440.01440.8727650.7013985.4718498.6413990.27430.1097.35612.37308.72423.66142.9562.74107.858926.80689.049682.32196.58281.646420471960292772.362792.654267.984304.514269.038.033613.711346.7712.8787.7059.206.9236.9157.967467.090466.703770403926.1326.5622.9612.9019.4017.86333.78326.859.69211.48313.9272.82426.9610.43314.6213.2617.62439.327.19199.9268604.991307.5413209.0613188.2525785.687.693OpenBenchmarking.org

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: CREO-03RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation60120180240300SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.47, N = 3170.61113.82256.46

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 77936867RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3449.51226.641490.311. (CXX) g++ options: -O3 -lgmp -lOpenCL

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 332220523RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation70140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.45, N = 395.3047.85322.421. (CXX) g++ options: -O3 -lgmp -lOpenCL

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-vec4RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation8K16K24K32K40KSE +/- 0.51, N = 3SE +/- 0.02, N = 3SE +/- 89.71, N = 312381.116167.1538953.71

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-scalarRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation7K14K21K28K35KSE +/- 11.44, N = 3SE +/- 8.93, N = 3SE +/- 5.73, N = 39328.014634.0731027.66

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-vec4RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 1.65, N = 3SE +/- 0.02, N = 3SE +/- 34.51, N = 313940.946927.1748137.85

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-scalarRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 1.03, N = 3SE +/- 0.02, N = 3SE +/- 4.83, N = 314000.716959.2848838.83

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-vec4RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3440.01218.821533.64

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-scalarRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.14, N = 3440.87218.821538.94

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-vec4RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20K40K60K80K100KSE +/- 16.63, N = 3SE +/- 27.78, N = 3SE +/- 692.18, N = 327650.7013745.3093562.47

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-scalarRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 9.85, N = 3SE +/- 14.03, N = 3SE +/- 153.56, N = 313985.476968.1746258.43

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-vec4RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation13K26K39K52K65KSE +/- 31.57, N = 3SE +/- 18.50, N = 3SE +/- 485.03, N = 318498.649186.8162508.54

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-scalarRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 16.39, N = 3SE +/- 12.68, N = 3SE +/- 370.02, N = 313990.276944.9448174.33

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: MAYA-06RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.09, N = 3SE +/- 0.26, N = 3SE +/- 0.66, N = 3430.10276.44813.18

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: MEDICAL-O3RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation50100150200250SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.21, N = 397.3565.60211.98

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 57885161RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation400800120016002000SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 1.39, N = 3612.37308.392043.601. (CXX) g++ options: -O3 -lgmp -lOpenCL

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: SOLIDWORKS-07RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation120240360480600SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 3308.72184.73578.10

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: SNX-04RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.27, N = 3SE +/- 0.62, N = 3SE +/- 5.90, N = 3423.66285.221026.56

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: CATIA-06RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation50100150200250SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 1.01, N = 3142.9583.69236.96

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: ENERGY-03RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation306090120150SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.57, N = 362.7431.73156.17

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4080120160200SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3107.85188.7645.65

ParaView

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 3840 x 2160RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation3K6K9K12K15KSE +/- 12.03, N = 3SE +/- 1.58, N = 3SE +/- 6.39, N = 38926.814339.6615887.67

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 3840 x 2160RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4080120160200SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 389.0443.29158.47

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 2560 x 1440RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4K8K12K16K20KSE +/- 5.38, N = 3SE +/- 11.59, N = 3SE +/- 73.74, N = 39682.324739.4716804.14

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 2560 x 1440RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4080120160200SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.74, N = 396.5847.27167.61

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation100200300400500SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.94, N = 3281.65207.85462.31

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP32RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation11002200330044005500SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 16.33, N = 3204713315252

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA CUDA GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation13002600390052006500SE +/- 0.00, N = 3SE +/- 3.67, N = 3SE +/- 24.01, N = 3196014195984

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA RTX GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 13.86, N = 3SE +/- 0.00, N = 3SE +/- 8.95, N = 3292720238174

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation306090120150SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 372.36114.4532.91

ParaView

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 3840 x 2160RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation14002800420056007000SE +/- 1.68, N = 3SE +/- 1.09, N = 3SE +/- 4.67, N = 32792.651518.366521.15

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 3840 x 2160RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation140280420560700SE +/- 0.16, N = 3SE +/- 0.11, N = 3SE +/- 0.45, N = 3267.98145.70625.76

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 3840 x 2160RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 1.76, N = 3SE +/- 3.59, N = 3SE +/- 104.89, N = 34304.512413.3410234.76

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 3840 x 2160RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation140280420560700SE +/- 0.11, N = 3SE +/- 0.22, N = 3SE +/- 6.56, N = 3269.03150.83639.67

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 38.034.6317.78MIN: 3.12 / MAX: 9.08MIN: 1.89 / MAX: 5.26MIN: 7.98 / MAX: 20.82

ParaView

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 2560 x 1440RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 3.94, N = 3SE +/- 4.26, N = 3SE +/- 6.54, N = 33613.712021.828453.57

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 2560 x 1440RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.38, N = 3SE +/- 0.41, N = 3SE +/- 0.63, N = 3346.77194.01811.19

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation714212835SE +/- 0.009, N = 3SE +/- 0.002, N = 3SE +/- 0.029, N = 312.8788.16129.282

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 37.705.2115.86MIN: 6.01 / MAX: 9.92MIN: 4.19 / MAX: 6.51MIN: 13.68 / MAX: 21.62

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 359.20108.9521.30

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.17, N = 36.924.0917.72MIN: 3.18 / MAX: 7.9MIN: 1.83 / MAX: 4.63MIN: 6.81 / MAX: 20.39

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1632486480SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 336.9224.4073.91

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 37.964.6118.95MIN: 7.71 / MAX: 8.17MIN: 4.41 / MAX: 4.76MIN: 18.15 / MAX: 19.18

ParaView

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 2560 x 1440RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation3K6K9K12K15KSE +/- 15.84, N = 3SE +/- 5.85, N = 3SE +/- 40.90, N = 37467.094571.6413658.91

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 2560 x 1440RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.99, N = 3SE +/- 0.37, N = 3SE +/- 2.56, N = 3466.70285.73853.68

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP16SRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 2.00, N = 33770243410208

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP16CRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 2.03, N = 3SE +/- 1.67, N = 3SE +/- 3.50, N = 44039253110585

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1122334455SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 526.1347.2610.67

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 526.5645.8710.44

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation918273645SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 422.9638.4011.08

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.01, N = 4SE +/- 0.01, N = 3SE +/- 0.05, N = 1512.9022.745.48

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation714212835SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 519.4031.078.86

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation714212835SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 617.8628.557.92

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced WriteRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.02, N = 4SE +/- 0.01, N = 3SE +/- 0.26, N = 6333.78211.81850.041. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced ReadRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.00, N = 4SE +/- 0.00, N = 3SE +/- 0.06, N = 6326.85207.71865.571. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT8 ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.017, N = 4SE +/- 0.011, N = 3SE +/- 0.068, N = 69.6924.78721.7921. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT16 ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation816243240SE +/- 0.004, N = 4SE +/- 0.011, N = 3SE +/- 0.127, N = 611.4835.94732.2971. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT32 ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1020304050SE +/- 0.001, N = 4SE +/- 0.000, N = 3SE +/- 0.291, N = 613.9276.82344.1611. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT64 ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation0.84921.69842.54763.39684.246SE +/- 0.060, N = 4SE +/- 0.009, N = 3SE +/- 0.021, N = 62.8241.7353.7741. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP32 ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20406080100SE +/- 0.00, N = 4SE +/- 0.00, N = 3SE +/- 0.08, N = 626.9613.4490.811. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP64 ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation0.33980.67961.01941.35921.699SE +/- 0.000, N = 4SE +/- 0.000, N = 3SE +/- 0.000, N = 60.4330.2151.5101. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.02, N = 4SE +/- 0.04, N = 3SE +/- 0.01, N = 614.6222.307.34

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.02, N = 4SE +/- 0.02, N = 3SE +/- 0.01, N = 713.2622.535.57

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation816243240SE +/- 0.04, N = 5SE +/- 0.02, N = 4SE +/- 0.09, N = 717.6211.9235.65MIN: 16.21 / MAX: 18.4MIN: 10.89 / MAX: 12.56MIN: 31.82 / MAX: 39.14

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.36, N = 5SE +/- 0.22, N = 5SE +/- 2.12, N = 6439.32217.611534.701. (CXX) g++ options: -O3

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation3691215SE +/- 0.01, N = 6SE +/- 0.01, N = 5SE +/- 0.00, N = 87.1910.453.61

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Monte-Carlo OpenCLRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation60120180240300SE +/- 0.04, N = 7SE +/- 0.11, N = 7SE +/- 0.04, N = 7199.93266.0499.241. (CXX) g++ options: -O3 -march=native -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation3691215SE +/- 0.005, N = 7SE +/- 0.009, N = 5SE +/- 0.004, N = 114.9919.2062.0911. (CXX) g++ options: -O2 -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.05, N = 8SE +/- 0.06, N = 7SE +/- 0.03, N = 11307.54195.00815.911. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation9K18K27K36K45KSE +/- 6.66, N = 13SE +/- 17.53, N = 13SE +/- 289.03, N = 1513209.065965.8040310.581. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation9K18K27K36K45KSE +/- 9.21, N = 13SE +/- 3.31, N = 13SE +/- 462.74, N = 1513188.255932.1240360.751. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20K40K60K80K100KSE +/- 26.99, N = 13SE +/- 14.33, N = 15SE +/- 359.84, N = 1325785.6811596.8283188.791. (CXX) g++ options: -O3

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.041867, N = 15SE +/- 0.001644, N = 14SE +/- 0.003806, N = 157.69300016.2281432.7100001. (CXX) g++ options: -O3 -march=native -fopenmp

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1632486480Min: 29 / Avg: 63.59 / Max: 80Min: 36 / Avg: 66.43 / Max: 79Min: 41 / Avg: 72.97 / Max: 87

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation50100150200250Min: 6.46 / Avg: 78.35 / Max: 130.8Min: 7.09 / Avg: 47.88 / Max: 70Min: 21.72 / Avg: 191 / Max: 303.57

75 Results Shown

SPECViewPerf 2020
GpuOwl:
  77936867
  332220523
vkpeak:
  int16-vec4
  int16-scalar
  int32-vec4
  int32-scalar
  fp64-vec4
  fp64-scalar
  fp16-vec4
  fp16-scalar
  fp32-vec4
  fp32-scalar
SPECViewPerf 2020:
  2560 x 1440 - MAYA-06
  2560 x 1440 - MEDICAL-O3
GpuOwl
SPECViewPerf 2020:
  2560 x 1440 - SOLIDWORKS-07
  2560 x 1440 - SNX-04
  2560 x 1440 - CATIA-06
  2560 x 1440 - ENERGY-03
Blender
ParaView:
  Many Spheres - 3000 - 3840 x 2160:
    MiPolys / Sec
    Frames / Sec
  Many Spheres - 3000 - 2560 x 1440:
    MiPolys / Sec
    Frames / Sec
FAHBench
FluidX3D
Chaos Group V-RAY:
  NVIDIA CUDA GPU
  NVIDIA RTX GPU
Blender
ParaView:
  Wavelet Contour - 3000 - 3840 x 2160:
    MiPolys / Sec
    Frames / Sec
  Wavelet Volume - 3000 - 3840 x 2160:
    MiVoxels / Sec
    Frames / Sec
LuxCoreRender
ParaView:
  Wavelet Contour - 3000 - 2560 x 1440:
    MiPolys / Sec
    Frames / Sec
IndigoBench
LuxCoreRender
Blender
LuxCoreRender
IndigoBench
LuxCoreRender
ParaView:
  Wavelet Volume - 3000 - 2560 x 1440:
    MiVoxels / Sec
    Frames / Sec
FluidX3D:
  FP32-FP16S
  FP32-FP16C
Blender:
  Fishy Cat - NVIDIA CUDA
  Classroom - NVIDIA CUDA
  Junkshop - NVIDIA CUDA
  Fishy Cat - NVIDIA OptiX
  Pabellon Barcelona - NVIDIA OptiX
  Classroom - NVIDIA OptiX
ProjectPhysX OpenCL-Benchmark:
  Memory Bandwidth Coalesced Write
  Memory Bandwidth Coalesced Read
  INT8 Compute
  INT16 Compute
  INT32 Compute
  INT64 Compute
  FP32 Compute
  FP64 Compute
Blender:
  Junkshop - NVIDIA OptiX
  BMW27 - NVIDIA CUDA
LuxCoreRender
clpeak
Blender
FinanceBench
Rodinia
clpeak:
  Global Memory Bandwidth
  Integer 24-bit Compute
  Integer Compute
  Single-Precision Compute
FinanceBench
GPU Temperature Monitor:
  Phoronix Test Suite System Monitoring:
    Celsius
    Watts