more gpu comp

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 4080 16GB on Ubuntu 23.10 via the Phoronix Test Suite.

4080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
OpenCL Notes: GPU Compute Cores: 9728
Python Notes: Python 3.11.6
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

a

c

d

Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GB, Graphics: NVIDIA GeForce RTX 4080 16GB, Audio: NVIDIA Device 22bb, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

OS: Ubuntu 23.10, Kernel: 6.7.0-060700-generic (x86_64), Desktop: GNOME Shell 45.2, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

Blender

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

Libplacebo

VkFFT

VkFFT is a Fast Fourier Transform (FFT) Library that is GPU accelerated by means of the Vulkan API. The VkFFT benchmark runs FFT performance differences of many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

38 Results Shown

Blender:
BMW27 - NVIDIA CUDA
BMW27 - NVIDIA OptiX
Classroom - NVIDIA CUDA
Fishy Cat - NVIDIA CUDA
Barbershop - NVIDIA CUDA
Classroom - NVIDIA OptiX
Fishy Cat - NVIDIA OptiX
Barbershop - NVIDIA OptiX
Pabellon Barcelona - NVIDIA CUDA
ProjectPhysX OpenCL-Benchmark
Blender
ProjectPhysX OpenCL-Benchmark:
INT8 Compute
Memory Bandwidth Coalesced Read
INT32 Compute
INT16 Compute
FP32 Compute
GpuOwl
ProjectPhysX OpenCL-Benchmark
GpuOwl:
77936867
57885161
Libplacebo:
deband_heavy
polar_nocompute
hdr_peakdetect
ProjectPhysX OpenCL-Benchmark
Libplacebo:
hdr_lut
av1_grain_lap
gaussian
VkFFT:
FFT + iFFT R2C / C2R
FFT + iFFT C2C 1D batched in half precision
FFT + iFFT C2C Bluestein in single precision
FFT + iFFT C2C 1D batched in double precision
FFT + iFFT C2C 1D batched in single precision
FFT + iFFT C2C multidimensional in single precision
FFT + iFFT C2C Bluestein benchmark in double precision
FFT + iFFT C2C 1D batched in single precision, no reshuffling
FluidX3D:
FP32-FP32
FP32-FP16C
FP32-FP16S

4080

Testing initiated at 25 February 2024 20:36 by user pts.

a

Testing initiated at 25 February 2024 21:02 by user pts.

c

Testing initiated at 25 February 2024 21:26 by user pts.

d

Testing initiated at 25 February 2024 22:34 by user pts.

more gpu comp

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

4080

a

c

d

Blender

ProjectPhysX OpenCL-Benchmark

Blender

ProjectPhysX OpenCL-Benchmark

GpuOwl

ProjectPhysX OpenCL-Benchmark

GpuOwl

Libplacebo

ProjectPhysX OpenCL-Benchmark

Libplacebo

VkFFT

FluidX3D

38 Results Shown

4080

a

c

d