ripper-nvidia-gpu-compute-4060ti-545driver AMD Ryzen Threadripper 7960X 24-Cores testing with a ASRock TRX50 WS (7.09 BIOS) and NVIDIA GeForce RTX 4060 Ti 16GB on Ubuntu 22.04 via the Phoronix Test Suite. 4060ti: Processor: AMD Ryzen Threadripper 7960X 24-Cores @ 8.23GHz (24 Cores / 48 Threads), Motherboard: ASRock TRX50 WS (7.09 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2000GB Samsung SSD 980 PRO with Heatsink 2TB, Graphics: NVIDIA GeForce RTX 4060 Ti 16GB, Audio: Realtek ALC1220, Monitor: SyncMaster, Network: Aquantia Device 04c0 + Realtek RTL8125 2.5GbE + MEDIATEK Device 0616 OS: Ubuntu 22.04, Kernel: 6.5.0-26-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 545.29.06, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.3.99, Vulkan: 1.3.260, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1680x1050 vkpeak 20230730 fp32-scalar GFLOPS > Higher Is Better 4060ti . 11960.03 |============================================================ vkpeak 20230730 fp32-vec4 GFLOPS > Higher Is Better 4060ti . 15811.86 |============================================================ vkpeak 20230730 fp16-scalar GFLOPS > Higher Is Better 4060ti . 11993.98 |============================================================ vkpeak 20230730 fp16-vec4 GFLOPS > Higher Is Better 4060ti . 23640.22 |============================================================ vkpeak 20230730 fp64-scalar GFLOPS > Higher Is Better 4060ti . 375.82 |============================================================== vkpeak 20230730 fp64-vec4 GFLOPS > Higher Is Better 4060ti . 377.17 |============================================================== vkpeak 20230730 int32-scalar GIOPS > Higher Is Better 4060ti . 11939.79 |============================================================ vkpeak 20230730 int32-vec4 GIOPS > Higher Is Better 4060ti . 11867.31 |============================================================ vkpeak 20230730 int16-scalar GIOPS > Higher Is Better 4060ti . 7950.11 |============================================================= vkpeak 20230730 int16-vec4 GIOPS > Higher Is Better 4060ti . 10588.24 |============================================================ RealSR-NCNN 20200818 Scale: 4x - TAA: No Seconds < Lower Is Better 4060ti . 9.523 |=============================================================== RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Seconds < Lower Is Better 4060ti . 58.12 |=============================================================== Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Seconds < Lower Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Seconds < Lower Is Better 4060ti . 4.107 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT R2C / C2R Benchmark Score > Higher Is Better 4060ti . 41115 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in half precision Benchmark Score > Higher Is Better 4060ti . 85657 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT C2C Bluestein in single precision Benchmark Score > Higher Is Better 4060ti . 12276 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in double precision Benchmark Score > Higher Is Better 4060ti . 15278 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in single precision Benchmark Score > Higher Is Better 4060ti . 42649 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT C2C multidimensional in single precision Benchmark Score > Higher Is Better 4060ti . 41830 |=============================================================== VkFFT 1.3.4 Test: FFT + iFFT C2C Bluestein benchmark in double precision Benchmark Score > Higher Is Better 4060ti . 3031 |================================================================ VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling Benchmark Score > Higher Is Better 4060ti . 43158 |=============================================================== Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better 4060ti . 41680900000 |========================================================= Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better 4060ti . 13286566667 |========================================================= Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better 4060ti . 749867 |============================================================== Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better 4060ti . 1698666667 |========================================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better 4060ti . 498133 |============================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better 4060ti . 168.00 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better 4060ti . 12.95 |=============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better 4060ti . 728.82 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better 4060ti . 26.53 |=============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better 4060ti . 263.31 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better 4060ti . 6863.98 |============================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better 4060ti . 23959.8 |============================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better 4060ti . 13.38 |=============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better 4060ti . 13.55 |=============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better 4060ti . 2717.91 |============================================================= Libplacebo 6.338.2 FPS > Higher Is Better cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better 4060ti . 225.8 |=============================================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better 4060ti . 254.8 |=============================================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better 4060ti . 252.9 |=============================================================== NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better 4060ti . 0.08821 |============================================================= Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest Seconds < Lower Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest Seconds < Lower Is Better VkResample 1.0 Upscale: 2x - Precision: Double ms < Lower Is Better 4060ti . 500.01 |============================================================== VkResample 1.0 Upscale: 2x - Precision: Single ms < Lower Is Better 4060ti . 31.59 |=============================================================== OctaneBench 2020.1 Total Score Score > Higher Is Better 4060ti . 424.29 |============================================================== RedShift Demo 3.0 Seconds < Lower Is Better FAHBench 2.3.2 Ns Per Day > Higher Is Better 4060ti . 282.24 |============================================================== clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better 4060ti . 10889.48 |============================================================ clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better 4060ti . 21249.47 |============================================================ clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better 4060ti . 374.42 |============================================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better 4060ti . 252.29 |============================================================== LeelaChessZero 0.30 Backend: OpenCL Nodes Per Second > Higher Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better 4060ti . 5.499 |=============================================================== LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better 4060ti . 6.76 |================================================================ LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better 4060ti . 5.99 |================================================================ LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better 4060ti . 6.54 |================================================================ LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better 4060ti . 6.75 |================================================================ LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better 4060ti . 15.12 |=============================================================== ArrayFire 3.9 Test: Conjugate Gradient OpenCL ms < Lower Is Better 4060ti . 2.981 |=============================================================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better 4060ti . 9.144 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better 4060ti . 1650 |================================================================ ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better 4060ti . 2193 |================================================================ ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better 4060ti . 541 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better 4060ti . 421 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better 4060ti . 628 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better 4060ti . 715 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better 4060ti . 387 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better 4060ti . 548 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better 4060ti . 106 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better 4060ti . 101 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better 4060ti . 111 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better 4060ti . 105 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better 4060ti . 239 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better 4060ti . 260 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better 4060ti . 258 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better 4060ti . 242 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better 4060ti . 248 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better 4060ti . 268 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better 4060ti . 198 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better 4060ti . 266 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better 4060ti . 353 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better 4060ti . 352 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better 4060ti . 365 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better 4060ti . 369 |================================================================= GROMACS 2024 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better 4060ti . 12.53 |=============================================================== NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better 4060ti . 5.65 |================================================================ NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better 4060ti . 5.43 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better 4060ti . 7.29 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better 4060ti . 5.38 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better 4060ti . 7.25 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better 4060ti . 2.80 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better 4060ti . 14.33 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better 4060ti . 21.52 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better 4060ti . 6.93 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better 4060ti . 4.70 |================================================================ NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better 4060ti . 11.27 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better 4060ti . 21.19 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better 4060ti . 12.09 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better 4060ti . 15.82 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better 4060ti . 37.30 |=============================================================== NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better 4060ti . 9.26 |================================================================ PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better Blender 4.0 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better 4060ti . 8.03 |================================================================ Blender 4.0 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better 4060ti . 19.67 |=============================================================== Blender 4.0 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better 4060ti . 14.72 |=============================================================== Blender 4.0 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better 4060ti . 82.34 |=============================================================== Blender 4.0 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better 4060ti . 22.01 |=============================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom M samples/s > Higher Is Better 4060ti . 10.98 |=============================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar M samples/s > Higher Is Better 4060ti . 32.86 |=============================================================== MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better 4060ti . 437179932.9 |========================================================= NeatBench 5 Acceleration: GPU FPS > Higher Is Better 4060ti . 4060 |================================================================ Chaos Group V-RAY 6.0 Mode: NVIDIA RTX GPU Vrays > Higher Is Better Chaos Group V-RAY 6.0 Mode: NVIDIA CUDA GPU Vrays > Higher Is Better