nvidia rtx 5090 compute benchmarks

Intel Core Ultra 9 285K testing with an ASUS ROG MAXIMUS Z890 HERO (1203 BIOS) and ASUS NVIDIA GeForce RTX 5090 32GB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2501241-PTS-NVIDIART86&grr&sor.

System Details
Result identifiers: rtx 5090, NVIDIA 5090, GeForce RTX 5090
Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores)
Motherboard: ASUS ROG MAXIMUS Z890 HERO (1203 BIOS)
Chipset: Intel Device ae7f
Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB
Graphics: ASUS NVIDIA GeForce RTX 5090 32GB
Audio: Intel Device 7f50
Monitor: ASUS VP28U
Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7
OS: Ubuntu 24.10
Kernel: 6.11.0-13-generic (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server 1.21.1.13
Display Driver: NVIDIA 570.86.10
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.8.51 + OpenCL 3.0
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details: nouveau.modeset=0 - Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate performance (EPP: default) - CPU Microcode: 0x114 - Thermald 2.5.8
Graphics Details: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 98.02.2e.00.03
OpenCL Details: GPU Compute Cores: 21760
Security Details: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected


SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
GeForce RTX 5090: 124646
rtx 5090: 124615
NVIDIA 5090: 124556
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 8.68 (MIN: 5.07 / MAX: 85.5)
GeForce RTX 5090: 9.29 (MIN: 5.01 / MAX: 89.47)
rtx 5090: 11.01 (MIN: 5.09 / MAX: 92.36)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 62.76 (MIN: 40.3 / MAX: 105.61)
GeForce RTX 5090: 62.83 (MIN: 41.21 / MAX: 109.15)
NVIDIA 5090: 62.86 (MIN: 42.12 / MAX: 106.46)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 36.20 (MIN: 21.98 / MAX: 458.49)
rtx 5090: 47.90 (MIN: 21.96 / MAX: 421.33)
NVIDIA 5090: 52.59 (MIN: 21.91 / MAX: 425.58)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 22.02 (MIN: 7.41 / MAX: 92.66)
GeForce RTX 5090: 25.57 (MIN: 7.22 / MAX: 94.51)
NVIDIA 5090: 26.39 (MIN: 7.39 / MAX: 95.48)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 36.86 (MIN: 11.12 / MAX: 47.83)
NVIDIA 5090: 38.91 (MIN: 15.12 / MAX: 48.75)
rtx 5090: 39.34 (MIN: 15.92 / MAX: 49.04)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenetv2-yolov3

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 38.91 (MIN: 8.9 / MAX: 75.48)
NVIDIA 5090: 39.74 (MIN: 8.16 / MAX: 76.72)
rtx 5090: 42.44 (MIN: 8.34 / MAX: 76.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 28.58 (MIN: 10.02 / MAX: 89.48)
GeForce RTX 5090: 29.47 (MIN: 10.07 / MAX: 90.59)
NVIDIA 5090: 30.10 (MIN: 10.13 / MAX: 90.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 8.55 (MIN: 3.18 / MAX: 21.75)
rtx 5090: 8.74 (MIN: 3.21 / MAX: 22.23)
GeForce RTX 5090: 8.99 (MIN: 3.19 / MAX: 21.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 9.41 (MIN: 4.47 / MAX: 43.1)
rtx 5090: 10.93 (MIN: 4.48 / MAX: 44.35)
GeForce RTX 5090: 11.01 (MIN: 4.51 / MAX: 42.89)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 37.05 (MIN: 22.53 / MAX: 46.22)
GeForce RTX 5090: 37.49 (MIN: 23.61 / MAX: 45.51)
NVIDIA 5090: 38.73 (MIN: 20.99 / MAX: 46.14)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 12.78 (MIN: 7.62 / MAX: 95.87)
rtx 5090: 13.92 (MIN: 7.49 / MAX: 98.37)
GeForce RTX 5090: 18.06 (MIN: 7.48 / MAX: 98.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 2.67 (MIN: 2.4 / MAX: 41.21)
GeForce RTX 5090: 2.67 (MIN: 2.38 / MAX: 25.86)
NVIDIA 5090: 3.05 (MIN: 2.38 / MAX: 49.95)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 19.79 (MIN: 6.3 / MAX: 110.05)
NVIDIA 5090: 20.78 (MIN: 6.36 / MAX: 109.98)
rtx 5090: 27.44 (MIN: 6.34 / MAX: 109.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 4.77 (MIN: 3.69 / MAX: 54.04)
GeForce RTX 5090: 6.55 (MIN: 3.69 / MAX: 59.21)
rtx 5090: 9.80 (MIN: 3.75 / MAX: 63.5)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 4.04 (MIN: 3.9 / MAX: 5.77)
rtx 5090: 4.53 (MIN: 3.88 / MAX: 57.59)
GeForce RTX 5090: 5.12 (MIN: 3.91 / MAX: 67.5)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet-v3

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 4.28 (MIN: 4.05 / MAX: 5.15)
GeForce RTX 5090: 4.29 (MIN: 4.06 / MAX: 5.93)
rtx 5090: 4.68 (MIN: 4.08 / MAX: 57.94)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet-v2

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 8.37 (MIN: 3.84 / MAX: 63.9)
rtx 5090: 10.21 (MIN: 3.85 / MAX: 64.73)
NVIDIA 5090: 11.25 (MIN: 3.82 / MAX: 63.94)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 38.91 (MIN: 8.9 / MAX: 75.48)
NVIDIA 5090: 39.74 (MIN: 8.16 / MAX: 76.72)
rtx 5090: 42.44 (MIN: 8.34 / MAX: 76.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
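
Some NCNN models above differ noticeably between the three runs while others barely move. A minimal Python sketch of one way to quantify that run-to-run spread, using the averages reported above for FastestDet and vision_transformer. The helper name spread_percent is hypothetical and not part of the Phoronix Test Suite, and (max - min) / min is only one possible definition of spread.

```python
def spread_percent(values):
    """Return (max - min) / min as a percentage for a list of run results."""
    lo, hi = min(values), max(values)
    return (hi - lo) / lo * 100.0


# Average latency in ms for the three runs, copied from the results above.
ncnn_ms = {
    "FastestDet": [8.68, 9.29, 11.01],
    "vision_transformer": [62.76, 62.83, 62.86],
}

for model, runs in ncnn_ms.items():
    print(f"{model}: {spread_percent(runs):.2f}% spread across runs")
```

A large spread on a short-running model suggests the individual runs are noisy, so small differences between the three identifiers should be read with care.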

vkpeak

int16-vec4

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 43806.13
GeForce RTX 5090: 43803.60
NVIDIA 5090: 43799.96

vkpeak

int16-scalar

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 40006.60
GeForce RTX 5090: 39998.42
NVIDIA 5090: 39989.70

vkpeak

int32-vec4

vkpeak 20240505 - GIOPS, More Is Better
GeForce RTX 5090: 61914.51
NVIDIA 5090: 61894.95
rtx 5090: 61885.37

vkpeak

int32-scalar

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 62142.56
NVIDIA 5090: 62142.07
GeForce RTX 5090: 62141.02

vkpeak

fp64-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 1965.70
NVIDIA 5090: 1965.39
GeForce RTX 5090: 1965.32

vkpeak

fp64-scalar

vkpeak 20240505 - GFLOPS, More Is Better
NVIDIA 5090: 1967.49
GeForce RTX 5090: 1967.43
rtx 5090: 1967.37

vkpeak

fp16-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 72592.93
NVIDIA 5090: 72578.62
GeForce RTX 5090: 72575.33

vkpeak

fp16-scalar

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 62611.78
NVIDIA 5090: 62611.17
GeForce RTX 5090: 62597.51

vkpeak

fp32-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 83296.84
NVIDIA 5090: 83290.22
GeForce RTX 5090: 83257.59

vkpeak

fp32-scalar

vkpeak 20240505 - GFLOPS, More Is Better
GeForce RTX 5090: 63035.62
NVIDIA 5090: 63035.62
rtx 5090: 63013.32

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

Chaos Group V-RAY 6.0 - vpaths, More Is Better
GeForce RTX 5090: 4882
NVIDIA 5090: 4882
rtx 5090: 4851

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

Chaos Group V-RAY 6.0 - vpaths, More Is Better
GeForce RTX 5090: 11923
NVIDIA 5090: 11923
rtx 5090: 11923

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 63738
GeForce RTX 5090: 63637
NVIDIA 5090: 62773
1. (CXX) g++ options: -O3

NAMD CUDA

ATPase Simulation - 327,506 Atoms

NAMD CUDA 2.14 - days/ns, Fewer Is Better
rtx 5090: 0.05810
NVIDIA 5090: 0.05851
GeForce RTX 5090: 0.05943
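
NAMD reports days/ns here, where lower is better. Many readers are more used to the reciprocal, ns/day, where higher is better. A minimal Python sketch of the conversion, assuming the three days/ns values above; the function name ns_per_day is hypothetical.

```python
def ns_per_day(days_per_ns):
    """Convert a NAMD days/ns figure into ns/day (its reciprocal)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# days/ns for the three runs, copied from the result above.
namd_days_per_ns = {
    "rtx 5090": 0.05810,
    "NVIDIA 5090": 0.05851,
    "GeForce RTX 5090": 0.05943,
}

for run, value in namd_days_per_ns.items():
    print(f"{run}: {ns_per_day(value):.2f} ns/day")
```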

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

IndigoBench 4.4 - M samples/s, More Is Better
GeForce RTX 5090: 42.92
NVIDIA 5090: 42.85
rtx 5090: 42.76

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

IndigoBench 4.4 - M samples/s, More Is Better
GeForce RTX 5090: 92.71
rtx 5090: 92.70
NVIDIA 5090: 92.52

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
GeForce RTX 5090: 239575
rtx 5090: 237717
NVIDIA 5090: 235884
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 243937
NVIDIA 5090: 243913
GeForce RTX 5090: 241913
1. (CXX) g++ options: -O3

Blender

Blend File: Barbershop - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
GeForce RTX 5090: 35.10
rtx 5090: 35.14
NVIDIA 5090: 35.28

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
NVIDIA 5090: 9940
GeForce RTX 5090: 9934
rtx 5090: 9931
1. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0 - ms, Fewer Is Better
GeForce RTX 5090: 103.45
NVIDIA 5090: 103.48
rtx 5090: 103.51
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

clpeak 1.1.2 - GBPS, More Is Better
NVIDIA 5090: 18.42
rtx 5090: 18.41
GeForce RTX 5090: 18.33
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

clpeak 1.1.2 - GBPS, More Is Better
NVIDIA 5090: 13.89
rtx 5090: 13.83
GeForce RTX 5090: 13.78
1. (CXX) g++ options: -O3

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 24.33
GeForce RTX 5090: 24.50
NVIDIA 5090: 24.60

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
NVIDIA 5090: 305671
rtx 5090: 302221
GeForce RTX 5090: 300737
1. (CXX) g++ options: -O3

FluidX3D

Test: FP32-FP32

FluidX3D 3.0 - MLUPs/s, More Is Better
NVIDIA 5090: 9527
GeForce RTX 5090: 9525
rtx 5090: 9524

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
NVIDIA 5090: 17.34
rtx 5090: 17.35
GeForce RTX 5090: 17.44

Hashcat

Benchmark: MD5

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 106848250000 (SE +/- 102551750000.00, N = 2)
GeForce RTX 5090: 106544000000 (SE +/- 102256000000.00, N = 2)
NVIDIA 5090: 106216550000 (SE +/- 101883450000.00, N = 2)

Hashcat

Benchmark: 7-Zip

Hashcat 6.2.4 - H/s, More Is Better
NVIDIA 5090: 3276600
rtx 5090: 3272300
GeForce RTX 5090: 3264300

Hashcat

Benchmark: SHA1

Hashcat 6.2.4 - H/s, More Is Better
GeForce RTX 5090: 69104300000
NVIDIA 5090: 69072700000
rtx 5090: 68852500000

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 36054
NVIDIA 5090: 36026
GeForce RTX 5090: 35954
1. (CXX) g++ options: -O3

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
rtx 5090: 13.48
GeForce RTX 5090: 13.49
NVIDIA 5090: 13.50

Hashcat

Benchmark: SHA-512

Hashcat 6.2.4 - H/s, More Is Better
NVIDIA 5090: 8901200000
rtx 5090: 8900400000
GeForce RTX 5090: 8895900000

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
GeForce RTX 5090: 144695
rtx 5090: 144624
NVIDIA 5090: 144600
1. (CXX) g++ options: -O3

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.2.4 - H/s, More Is Better
GeForce RTX 5090: 2777600
rtx 5090: 2776000
NVIDIA 5090: 2770700

FluidX3D

Test: FP32-FP16S

FluidX3D 3.0 - MLUPs/s, More Is Better
GeForce RTX 5090: 18500
rtx 5090: 18499
NVIDIA 5090: 18496

Blender

Blend File: Junkshop - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
NVIDIA 5090: 8.98
rtx 5090: 8.99
GeForce RTX 5090: 9.00

Blender

Blend File: Fishy Cat - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.92
NVIDIA 5090: 8.92
GeForce RTX 5090: 8.92

FluidX3D

Test: FP32-FP16C

FluidX3D 3.0 - MLUPs/s, More Is Better
rtx 5090: 19140
GeForce RTX 5090: 19135
NVIDIA 5090: 19121

Blender

Blend File: Classroom - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.38
NVIDIA 5090: 8.38
GeForce RTX 5090: 8.38

clpeak

OpenCL Test: Double-Precision Compute

clpeak 1.1.2 - GFLOPS, More Is Better
NVIDIA 5090: 1977.26
rtx 5090: 1976.90
GeForce RTX 5090: 1976.78
1. (CXX) g++ options: -O3

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 7.00
GeForce RTX 5090: 7.04
NVIDIA 5090: 7.06

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0 - ms, Fewer Is Better
rtx 5090: 5.648
GeForce RTX 5090: 5.648
NVIDIA 5090: 5.649
1. (CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

ProjectPhysX OpenCL-Benchmark 1.6 - GB/s, More Is Better
rtx 5090: 1687.49
GeForce RTX 5090: 1680.23
NVIDIA 5090: 1679.44
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

ProjectPhysX OpenCL-Benchmark 1.6 - GB/s, More Is Better
NVIDIA 5090: 1603.93
GeForce RTX 5090: 1596.88
rtx 5090: 1596.24
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
rtx 5090: 41.80
GeForce RTX 5090: 41.76
NVIDIA 5090: 41.72
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
NVIDIA 5090: 54.04
rtx 5090: 54.02
GeForce RTX 5090: 53.95
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
GeForce RTX 5090: 61.77
NVIDIA 5090: 61.76
rtx 5090: 61.76
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
NVIDIA 5090: 4.400
rtx 5090: 4.396
GeForce RTX 5090: 4.392
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP16 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
NVIDIA 5090: 122.94
GeForce RTX 5090: 122.94
rtx 5090: 122.91
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
GeForce RTX 5090: 117.88
NVIDIA 5090: 117.86
rtx 5090: 117.85
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
GeForce RTX 5090: 1.951
NVIDIA 5090: 1.950
rtx 5090: 1.950
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
NVIDIA 5090: 6.14
rtx 5090: 6.16
GeForce RTX 5090: 6.17

VkFFT

Test: FFT + iFFT R2C / C2R

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 164933
GeForce RTX 5090: 164873
NVIDIA 5090: 164801
1. (CXX) g++ options: -O3

Blender

Blend File: Junkshop - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 5.66
GeForce RTX 5090: 5.66
NVIDIA 5090: 5.67

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
GeForce RTX 5090: 2875.50
NVIDIA 5090: 2872.53
rtx 5090: 2870.68
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Blender

Blend File: BMW27 - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
GeForce RTX 5090: 4.65
rtx 5090: 4.72
NVIDIA 5090: 4.72

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 4.55
GeForce RTX 5090: 4.55
NVIDIA 5090: 4.56

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
NVIDIA 5090: 4.400
GeForce RTX 5090: 4.446
rtx 5090: 4.630

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 2.92
NVIDIA 5090: 2.97
GeForce RTX 5090: 2.98
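
Every Blender scene in this result file was rendered with both the CUDA and the OptiX back end. A minimal Python sketch, assuming the render times reported for the "rtx 5090" run, of how the OptiX speedup per scene could be derived as CUDA seconds divided by OptiX seconds. The dictionary layout and variable names are hypothetical and chosen only for illustration.

```python
# Render times in seconds for the "rtx 5090" run, copied from the
# Blender 4.3 results in this file: scene -> (CUDA, OptiX).
blender_seconds = {
    "Barbershop": (35.14, 24.33),
    "Pabellon Barcelona": (17.35, 7.00),
    "Junkshop": (8.99, 5.66),
    "Fishy Cat": (8.92, 4.55),
    "Classroom": (8.38, 6.16),
    "BMW27": (4.72, 2.92),
}

# Speedup > 1 means OptiX finished the scene faster than CUDA.
speedups = {
    scene: cuda / optix for scene, (cuda, optix) in blender_seconds.items()
}

for scene, factor in sorted(speedups.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{scene}: OptiX is {factor:.2f}x faster than CUDA")
```

Using a ratio of times keeps the comparison independent of scene length, so short scenes such as BMW27 can be compared with long ones such as Barbershop.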

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Waifu2x-NCNN Vulkan 20200818 - Seconds, Fewer Is Better
NVIDIA 5090: 2.267
GeForce RTX 5090: 2.278
rtx 5090: 2.299

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak 1.1.2 - GBPS, More Is Better
GeForce RTX 5090: 1564.61
NVIDIA 5090: 1564.49
rtx 5090: 1562.97
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
NVIDIA 5090: 4400.85
rtx 5090: 4398.39
GeForce RTX 5090: 4375.95
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 27.83
NVIDIA 5090: 27.81
GeForce RTX 5090: 27.75
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 28.69
NVIDIA 5090: 28.59
GeForce RTX 5090: 28.14
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
NVIDIA 5090: 36016.5
GeForce RTX 5090: 35961.3
rtx 5090: 35937.2
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
GeForce RTX 5090: 1120.88
NVIDIA 5090: 1120.51
rtx 5090: 1117.54
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
GeForce RTX 5090: 28.79
NVIDIA 5090: 28.79
rtx 5090: 28.79
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
GeForce RTX 5090: 837.24
rtx 5090: 837.21
NVIDIA 5090: 836.98
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Single-Precision Compute

clpeak 1.1.2 - GFLOPS, More Is Better
NVIDIA 5090: 121438.41
GeForce RTX 5090: 121419.57
rtx 5090: 121415.53
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer 24-bit Compute

clpeak 1.1.2 - GIOPS, More Is Better
GeForce RTX 5090: 61903.93
NVIDIA 5090: 61866.86
rtx 5090: 61843.11
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

clpeak 1.1.2 - GIOPS, More Is Better
GeForce RTX 5090: 62178.05
rtx 5090: 62151.94
NVIDIA 5090: 62119.51
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GHash/s, More Is Better
GeForce RTX 5090: 142.53
rtx 5090: 142.41
NVIDIA 5090: 142.24
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Kernel Latency

clpeak 1.1.2 - us, Fewer Is Better
rtx 5090: 5.15
GeForce RTX 5090: 5.15
NVIDIA 5090: 5.16
1. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5