NVIDIA GPU Compute Benchmarks
Benchmarks for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2106089-IB-NVIDIACOM84&grs&sro.
Blender
Blend File: Classroom - Compute: CUDA
Blender
Blend File: Barbershop - Compute: NVIDIA OptiX
clpeak
OpenCL Test: Single-Precision Float
Blender
Blend File: Classroom - Compute: NVIDIA OptiX
Blender
Blend File: Pabellon Barcelona - Compute: CUDA
Blender
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Chaos Group V-RAY
Mode: NVIDIA CUDA GPU
Chaos Group V-RAY
Mode: NVIDIA RTX GPU
LuxCoreRender
Scene: DLSC - Acceleration: GPU
Blender
Blend File: Fishy Cat - Compute: CUDA
Blender
Blend File: BMW27 - Compute: CUDA
LuxCoreRender
Scene: Danish Mood - Acceleration: GPU
vkpeak
fp32-vec4
ArrayFire
Test: BLAS OpenCL
Blender
Blend File: BMW27 - Compute: NVIDIA OptiX
Blender
Blend File: Fishy Cat - Compute: NVIDIA OptiX
LuxCoreRender
Scene: LuxCore Benchmark - Acceleration: GPU
vkpeak
fp64-vec4
vkpeak
fp32-scalar
vkpeak
fp64-scalar
clpeak
OpenCL Test: Double-Precision Double
vkpeak
int32-scalar
vkpeak
int16-scalar
vkpeak
fp16-scalar
vkpeak
int32-vec4
vkpeak
fp16-vec4
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: GEMM SGEMM_N
Blender
Blend File: Barbershop - Compute: CUDA
ViennaCL
Test: OpenCL BLAS - dGEMM-NN
Hashcat
Benchmark: SHA1
ViennaCL
Test: OpenCL BLAS - dGEMM-TN
Hashcat
Benchmark: MD5
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: MD5 Hash
Hashcat
Benchmark: SHA-512
clpeak
OpenCL Test: Integer Compute INT
vkpeak
int16-vec4
Hashcat
Benchmark: TrueCrypt RIPEMD160 + XTS
IndigoBench
Acceleration: OpenCL GPU - Scene: Bedroom
LuxCoreRender
Scene: Rainbow Colors and Prism - Acceleration: GPU
Mixbench
Backend: NVIDIA CUDA - Benchmark: Half Precision
OctaneBench
Total Score
LuxCoreRender
Scene: Orange Juice - Acceleration: GPU
LeelaChessZero
Backend: OpenCL
RedShift Demo
RealSR-NCNN
Scale: 4x - TAA: Yes
Hashcat
Benchmark: 7-Zip
PlaidML
FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL
PlaidML
FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL
cl-mem
Benchmark: Read
clpeak
OpenCL Test: Global Memory Bandwidth
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: FFT SP
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: S3D
VkResample
Upscale: 2x - Precision: Single
ViennaCL
Test: OpenCL BLAS - dGEMM-NT
cl-mem
Benchmark: Write
PlaidML
FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL
IndigoBench
Acceleration: OpenCL GPU - Scene: Supercar
ViennaCL
Test: OpenCL BLAS - dAXPY
Betsy GPU Compressor
Codec: ETC2 RGB - Quality: Highest
RealSR-NCNN
Scale: 4x - TAA: No
ViennaCL
Test: OpenCL BLAS - dGEMV-N
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Texture Read Bandwidth
Betsy GPU Compressor
Codec: ETC1 - Quality: Highest
ViennaCL
Test: OpenCL BLAS - dDOT
ViennaCL
Test: OpenCL BLAS - dCOPY
FAHBench
ViennaCL
Test: OpenCL BLAS - dGEMM-TT
ArrayFire
Test: Conjugate Gradient OpenCL
Waifu2x-NCNN Vulkan
Scale: 2x - Denoise: 3 - TAA: Yes
ViennaCL
Test: OpenCL BLAS - sAXPY
ViennaCL
Test: OpenCL BLAS - sDOT
ViennaCL
Test: OpenCL BLAS - sCOPY
cl-mem
Benchmark: Copy
NAMD CUDA
ATPase Simulation - 327,506 Atoms
ViennaCL
Test: OpenCL BLAS - dGEMV-T
ViennaCL
Test: OpenCL BLAS - dDOT
vkpeak
fp64-vec4
ViennaCL
Test: OpenCL BLAS - dGEMM-NT
ViennaCL
Test: OpenCL BLAS - dGEMM-TT
RedShift Demo
GPU Power Consumption Monitor
Betsy GPU Compressor
GPU Power Consumption Monitor
Betsy GPU Compressor
GPU Power Consumption Monitor
Waifu2x-NCNN Vulkan
GPU Power Consumption Monitor
RealSR-NCNN
GPU Power Consumption Monitor
RealSR-NCNN
GPU Power Consumption Monitor
VkResample
GPU Power Consumption Monitor
Chaos Group V-RAY
GPU Power Consumption Monitor
Chaos Group V-RAY
Mode: NVIDIA RTX GPU
Chaos Group V-RAY
GPU Power Consumption Monitor
Chaos Group V-RAY
Mode: NVIDIA CUDA GPU
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
Blender
GPU Power Consumption Monitor
IndigoBench
GPU Power Consumption Monitor
IndigoBench
Acceleration: OpenCL GPU - Scene: Bedroom
IndigoBench
GPU Power Consumption Monitor
IndigoBench
Acceleration: OpenCL GPU - Scene: Supercar
SHOC Scalable HeterOgeneous Computing
GPU Power Consumption Monitor
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: S3D
SHOC Scalable HeterOgeneous Computing
GPU Power Consumption Monitor
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: MD5 Hash
SHOC Scalable HeterOgeneous Computing
GPU Power Consumption Monitor
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: GEMM SGEMM_N
SHOC Scalable HeterOgeneous Computing
GPU Power Consumption Monitor
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: FFT SP
SHOC Scalable HeterOgeneous Computing
GPU Power Consumption Monitor
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Texture Read Bandwidth
ViennaCL
GPU Power Consumption Monitor
ViennaCL
Test: OpenCL BLAS - dGEMM-TN
cl-mem
GPU Power Consumption Monitor
cl-mem
Benchmark: Copy
cl-mem
GPU Power Consumption Monitor
cl-mem
Benchmark: Write
cl-mem
GPU Power Consumption Monitor
cl-mem
Benchmark: Read
LeelaChessZero
GPU Power Consumption Monitor
LeelaChessZero
Backend: OpenCL
PlaidML
GPU Power Consumption Monitor
PlaidML
FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL
PlaidML
GPU Power Consumption Monitor
PlaidML
FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL
PlaidML
GPU Power Consumption Monitor
PlaidML
FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL
vkpeak
GPU Power Consumption Monitor
vkpeak
int16-vec4
clpeak
GPU Power Consumption Monitor
clpeak
OpenCL Test: Integer Compute INT
clpeak
GPU Power Consumption Monitor
clpeak
OpenCL Test: Double-Precision Double
clpeak
GPU Power Consumption Monitor
clpeak
OpenCL Test: Single-Precision Float
clpeak
GPU Power Consumption Monitor
clpeak
OpenCL Test: Global Memory Bandwidth
ArrayFire
GPU Power Consumption Monitor
ArrayFire
GPU Power Consumption Monitor
ArrayFire
Test: BLAS OpenCL
LuxCoreRender
GPU Power Consumption Monitor
LuxCoreRender
Scene: Danish Mood - Acceleration: GPU
LuxCoreRender
GPU Power Consumption Monitor
LuxCoreRender
Scene: Orange Juice - Acceleration: GPU
LuxCoreRender
GPU Power Consumption Monitor
LuxCoreRender
Scene: LuxCore Benchmark - Acceleration: GPU
LuxCoreRender
GPU Power Consumption Monitor
LuxCoreRender
Scene: Rainbow Colors and Prism - Acceleration: GPU
LuxCoreRender
GPU Power Consumption Monitor
LuxCoreRender
Scene: DLSC - Acceleration: GPU
OctaneBench
GPU Power Consumption Monitor
OctaneBench
Total Score
Mixbench
GPU Power Consumption Monitor
Mixbench
Backend: NVIDIA CUDA - Benchmark: Integer
Mixbench
Backend: NVIDIA CUDA - Benchmark: Integer
Mixbench
GPU Power Consumption Monitor
Mixbench
Backend: NVIDIA CUDA - Benchmark: Half Precision
Mixbench
GPU Power Consumption Monitor
Mixbench
Backend: NVIDIA CUDA - Benchmark: Double Precision
Mixbench
Backend: NVIDIA CUDA - Benchmark: Double Precision
Mixbench
GPU Power Consumption Monitor
Mixbench
Backend: NVIDIA CUDA - Benchmark: Single Precision
Mixbench
Backend: NVIDIA CUDA - Benchmark: Single Precision
NAMD CUDA
GPU Power Consumption Monitor
FAHBench
GPU Power Consumption Monitor
FAHBench
Hashcat
GPU Power Consumption Monitor
Hashcat
Benchmark: TrueCrypt RIPEMD160 + XTS
Hashcat
GPU Power Consumption Monitor
Hashcat
Benchmark: 7-Zip
Hashcat
GPU Power Consumption Monitor
Hashcat
Benchmark: SHA-512
Hashcat
GPU Power Consumption Monitor
Hashcat
Benchmark: SHA1
Hashcat
GPU Power Consumption Monitor
Hashcat
Benchmark: MD5
Phoronix Test Suite v10.8.5