NVK Vulkan

Some early NVK Vulkan benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2308119-PTS-NVKVULKA25&rdt&grw.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionRTX 3070 TiRTX 3060 TiRTX 2060RTX 3060 TiRTX 3070 Ti NVK NVK NVK NVIDIA 535 NVIDIA 535 NVIDIA 535AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GB + 4001GBNVIDIA NV174 8GBNVIDIA GA104 HD AudioASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.046.5.0-rc2-phx-nvk (x86_64)GNOME Shell 44.3X Server 1.21.1.7 + Waylandnouveau4.3 Mesa 23.3~git2308100600.81cae3~oibaf~l (git-81cae3d 2023-08-10 lunar-oibaf-ppa)GCC 12.3.0ext43840x2160NVIDIA NV166 6GBNVIDIA TU106 HD AudioAMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)NVIDIA GeForce RTX 2060 6GB6.2.0-27-generic (x86_64)X Server 1.21.1.7NVIDIA 535.984.6.0NVIDIA GeForce RTX 3060 Ti 8GBNVIDIA GA104 HD AudioNVIDIA GeForce RTX 3070 Ti 8GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-DAPbBt/gcc-12-12.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-DAPbBt/gcc-12-12.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- RTX 3070 Ti: NVK: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- RTX 3060 Ti: NVK: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- RTX 2060: NVK: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- RTX 2060: NVIDIA 535: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- RTX 3060 Ti: NVIDIA 535: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- RTX 3070 Ti: NVIDIA 535: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203Vbios Version Details- RTX 3070 Ti: NVK, RTX 3060 Ti: NVK, RTX 2060: NVK: 102-RAPHAEL-008Security Details- RTX 3070 Ti: NVK: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - RTX 3060 Ti: NVK: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - RTX 2060: NVK: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - RTX 2060: NVIDIA 535: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - RTX 3060 Ti: NVIDIA 535: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - RTX 3070 Ti: NVIDIA 535: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected Graphics Details- RTX 2060: NVIDIA 535: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.06.2e.00.05- RTX 3060 Ti: NVIDIA 535: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c- RTX 3070 Ti: NVIDIA 535: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02

yquake2: Vulkan - On - Off - 2560 x 1440yquake2: Vulkan - On - Off - 3840 x 2160yquake2: Vulkan - On - Off - 1920 x 1080yquake2: Vulkan - Off - On - 2560 x 1440yquake2: Vulkan - Off - On - 1920 x 1080yquake2: Vulkan - Off - Off - 1920 x 1080yquake2: Vulkan - Off - Off - 2560 x 1440vkfft: FFT + iFFT R2C / C2Rvkpeak: fp32-scalaryquake2: Vulkan - Off - Off - 3840 x 2160vkpeak: fp32-vec4vkpeak: int32-scalarvkpeak: int32-vec4vkmark: 2560 x 1440 - Mailboxvkmark: 1920 x 1080 - Mailboxyquake2: Vulkan - On - On - 2560 x 1440vkpeak: fp16-scalarvkpeak: fp16-vec4yquake2: Vulkan - On - On - 1920 x 1080RTX 3070 TiRTX 3060 TiRTX 2060RTX 3060 TiRTX 3070 TiRTX 3060 Ti NVK NVK NVK NVIDIA 535 NVIDIA 535 NVIDIA 535RTX 3060 Ti31.216.648.410.217.348.631.3105370.8016.6110.3072.8386.968213910.217.03115.949.61017.149.931.1110456.9415.989.9259.2471.028315110.016.926.21538.78.113.738.926.381662.701575.2363.1475.4454778.113.41713.4810.62279.1550.7885.72418.81708.9233407369.72806.67350.02525.57323.4314432.17844.62142.41093.52972.4827.31314.02991.62212.9339229680.861092.812800.887979710.0119169.711264.22391.01193.23292.11000.61581.33241.22396.94253411642.721203.815425.47975.311661.4023026.621508.59693.4719134.98OpenBenchmarking.org

yquake2

Renderer: Vulkan - AF: On - MSAA: Off - Resolution: 2560 x 1440

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: On - MSAA: Off - Resolution: 2560 x 1440NVKNVIDIA 5355001000150020002500SE +/- 0.03, N = 3SE +/- 10.91, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 23.66, N = 331.22391.031.02142.426.21713.41. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Vulkan - AF: On - MSAA: Off - Resolution: 3840 x 2160

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: On - MSAA: Off - Resolution: 3840 x 2160NVKNVIDIA 53530060090012001500SE +/- 0.00, N = 3SE +/- 5.25, N = 3SE +/- 0.00, N = 3SE +/- 6.67, N = 3SE +/- 0.00, N = 3SE +/- 1.51, N = 316.61193.215.91093.515.0810.61. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Vulkan - AF: On - MSAA: Off - Resolution: 1920 x 1080

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: On - MSAA: Off - Resolution: 1920 x 1080NVKNVIDIA 5357001400210028003500SE +/- 0.21, N = 3SE +/- 26.37, N = 3SE +/- 0.06, N = 3SE +/- 30.28, N = 3SE +/- 0.22, N = 3SE +/- 16.65, N = 348.43292.149.62972.438.72279.11. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Vulkan - AF: Off - MSAA: On - Resolution: 2560 x 1440

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: Off - MSAA: On - Resolution: 2560 x 1440NVKNVIDIA 5352004006008001000SE +/- 0.00, N = 3SE +/- 2.79, N = 3SE +/- 0.00, N = 3SE +/- 3.27, N = 3SE +/- 0.00, N = 3SE +/- 2.57, N = 310.21000.610.0827.38.1550.71. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Vulkan - AF: Off - MSAA: On - Resolution: 1920 x 1080

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: Off - MSAA: On - Resolution: 1920 x 1080NVKNVIDIA 53530060090012001500SE +/- 0.03, N = 3SE +/- 6.57, N = 3SE +/- 0.03, N = 3SE +/- 7.88, N = 3SE +/- 0.00, N = 3SE +/- 7.68, N = 317.31581.317.11314.013.7885.71. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Vulkan - AF: Off - MSAA: Off - Resolution: 1920 x 1080

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: Off - MSAA: Off - Resolution: 1920 x 1080NVKNVIDIA 5357001400210028003500SE +/- 0.17, N = 3SE +/- 19.20, N = 3SE +/- 0.12, N = 3SE +/- 31.93, N = 15SE +/- 0.25, N = 3SE +/- 24.71, N = 1548.63241.249.92991.638.92418.81. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Vulkan - AF: Off - MSAA: Off - Resolution: 2560 x 1440

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: Off - MSAA: Off - Resolution: 2560 x 1440NVKNVIDIA 5355001000150020002500SE +/- 0.03, N = 3SE +/- 3.03, N = 3SE +/- 0.03, N = 3SE +/- 24.36, N = 3SE +/- 0.09, N = 3SE +/- 18.42, N = 531.32396.931.12212.926.31708.91. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

VkFFT

Test: FFT + iFFT R2C / C2R

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT R2C / C2RNVKNVIDIA 5359K18K27K36K45KSE +/- 1.76, N = 3SE +/- 264.61, N = 3SE +/- 0.58, N = 3SE +/- 12.44, N = 3SE +/- 9.50, N = 2SE +/- 15.52, N = 3105342534110433922816233401. (CXX) g++ options: -O3

vkpeak

fp32-scalar

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-scalarNVKNVIDIA 5352K4K6K8K10KSE +/- 0.01, N = 3SE +/- 22.64, N = 3SE +/- 0.01, N = 3SE +/- 9.07, N = 3SE +/- 0.16, N = 3SE +/- 60.39, N = 370.8011642.7256.949680.8662.707369.72

yquake2

Renderer: Vulkan - AF: Off - MSAA: Off - Resolution: 3840 x 2160

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: Off - MSAA: Off - Resolution: 3840 x 2160NVKNVIDIA 53530060090012001500SE +/- 0.03, N = 3SE +/- 2.30, N = 3SE +/- 0.00, N = 3SE +/- 1.64, N = 3SE +/- 0.00, N = 3SE +/- 7.55, N = 316.61203.815.91092.815.0806.61. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

vkpeak

fp32-vec4

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-vec4NVKNVIDIA 5353K6K9K12K15KSE +/- 0.05, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 1.16, N = 3SE +/- 0.08, N = 3SE +/- 33.15, N = 3110.3015425.4789.9212800.8875.237350.02

vkpeak

int32-scalar

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-scalarNVK1632486480SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 372.8359.2463.14

vkpeak

int32-vec4

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-vec4NVK20406080100SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 386.9671.0275.44

VKMark

Resolution: 2560 x 1440 - Present Mode: Mailbox

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2022-05-16Resolution: 2560 x 1440 - Present Mode: MailboxNVK20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 38283541. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

VKMark

Resolution: 1920 x 1080 - Present Mode: Mailbox

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2022-05-16Resolution: 1920 x 1080 - Present Mode: MailboxNVK306090120150SE +/- 0.00, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3139151771. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

yquake2

Renderer: Vulkan - AF: On - MSAA: On - Resolution: 2560 x 1440

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: On - MSAA: On - Resolution: 2560 x 1440NVKNVIDIA 5352004006008001000SE +/- 0.00, N = 3SE +/- 1.73, N = 3SE +/- 0.03, N = 3SE +/- 2.31, N = 3SE +/- 0.00, N = 3SE +/- 5.55, N = 310.2975.310.0797.08.1525.51. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-scalarRTX 2060: NVIDIA 535RTX 3060 TiRTX 3070 Ti: NVIDIA 535RTX 3060 Ti: NVIDIA 5353K6K9K12K15KSE +/- 8.12, N = 3SE +/- 0.89, N = 3SE +/- 1.66, N = 3SE +/- 16.04, N = 37254.969693.9211659.899710.01

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-vec4RTX 2060: NVIDIA 535RTX 3060 TiRTX 3070 Ti: NVIDIA 535RTX 3060 Ti: NVIDIA 5355K10K15K20K25KSE +/- 0.13, N = 3SE +/- 0.27, N = 3SE +/- 0.74, N = 3SE +/- 32.39, N = 314319.1319135.8523027.2619169.71

yquake2

Renderer: Vulkan - AF: On - MSAA: On - Resolution: 1920 x 1080

RTX 3070 TiRTX 3060 TiRTX 2060OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Vulkan - AF: On - MSAA: On - Resolution: 1920 x 1080NVKNVIDIA 53530060090012001500SE +/- 0.17, N = 3SE +/- 8.31, N = 3SE +/- 0.07, N = 3SE +/- 11.86, N = 3SE +/- 0.10, N = 11SE +/- 3.63, N = 317.01508.516.91264.213.4844.61. (CC) gcc options: -shared -lm -ldl -rdynamic -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC


Phoronix Test Suite v10.8.5