Project-PhysX-OpenCL-Benchmark-iGPU-vs-dGPU-tests AMD Ryzen 9 7945HX testing with a Alienware 0DWD2H (1.13.1 BIOS) and NVIDIA GeForce RTX 4090 Laptop GPU 16GB on cachyos rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2409210-EIRI-240308551 .
Project-PhysX-OpenCL-Benchmark-iGPU-vs-dGPU-tests Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenGL OpenCL Compiler File-System Screen Resolution Display Driver Intel HD 4600 HAS GT2 nVidia RTX 4090 mobile RTX 4090 mobile Intel Core i5-4300M @ 3.30GHz (2 Cores / 4 Threads) Dell 0VWNW8 (A26 BIOS) Intel Xeon E3-1200 v3/4th 8GB 128GB SAMSUNG SSD PM85 Intel HD 4600 HSW GT2 2GB (1250MHz) Intel Xeon E3-1200 v3/4th Intel I217-LM + Intel Centrino Ultimate-N 6300 cachyos rolling 6.7.9-1-cachyos-rt-bore-lto (x86_64) KDE Plasma 6.0.1 X Server 1.21.1.11 4.6 Mesa 24.0.2-arch1.2 OpenCL 2.0 beignet 1.4 (git-f72309a5) GCC 13.2.1 20230801 + Clang 17.0.6 + LLVM 17.0.6 xfs 1920x1080 AMD Ryzen 9 7945HX @ 5.46GHz (16 Cores / 32 Threads) Alienware 0DWD2H (1.13.1 BIOS) AMD Device 14d8 62GB PC SN810 NVMe WDC 2048GB + 4001GB CT4000P3SSD8 NVIDIA GeForce RTX 4090 Laptop GPU 16GB NVIDIA Device 22bb Realtek RTL8125 2.5GbE + Qualcomm QCNFA765 6.11.0-5-cachyos-lto (x86_64) GNOME Shell 47.0 X Server 1.21.1.13 NVIDIA 560.35.03 4.6.0 OpenCL 3.0 + OpenCL 2.1 AMD-APP.dbg (3602.0) + OpenCL 3.0 CUDA 12.6.65 + OpenCL 2.0 AMD-APP (1800.8) GCC 14.2.1 20240910 + Clang 18.1.8 + LLVM 18.1.8 + CUDA 12.6 zfs 2560x1600 OpenCL 3.0 CUDA 12.6.65 OpenBenchmarking.org Compiler Details - Intel HD 4600 HAS GT2: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - nVidia RTX 4090 mobile: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++,rust --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - RTX 4090 mobile: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++,rust --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details - Intel HD 4600 HAS GT2: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28 - nVidia RTX 4090 mobile: Scaling Governor: amd-pstate-epp performance (Boost: Enabled EPP: performance) - CPU Microcode: 0xa601206 - RTX 4090 mobile: Scaling Governor: amd-pstate-epp performance (Boost: Enabled EPP: performance) - CPU Microcode: 0xa601206 Graphics Details - Intel HD 4600 HAS GT2: GLAMOR - nVidia RTX 4090 mobile: GLAMOR - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.2a.00.4c - RTX 4090 mobile: GLAMOR - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.2a.00.4c Security Details - Intel HD 4600 HAS GT2: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected - nVidia RTX 4090 mobile: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - RTX 4090 mobile: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected Kernel Details - nVidia RTX 4090 mobile, RTX 4090 mobile: Transparent Huge Pages: always Environment Details - nVidia RTX 4090 mobile, RTX 4090 mobile: MUTTER_DEBUG_KMS_THREAD_TYPE=user OpenCL Details - nVidia RTX 4090 mobile, RTX 4090 mobile: GPU Compute Cores: 9728
Project-PhysX-OpenCL-Benchmark-iGPU-vs-dGPU-tests opencl-benchmark: FP32 Compute opencl-benchmark: INT64 Compute opencl-benchmark: INT32 Compute opencl-benchmark: INT16 Compute opencl-benchmark: INT8 Compute opencl-benchmark: Memory Bandwidth Coalesced Read opencl-benchmark: Memory Bandwidth Coalesced Write opencl-benchmark: FP64 Compute Intel HD 4600 HAS GT2 nVidia RTX 4090 mobile RTX 4090 mobile 0.07 0.007 0.033 0.101 0.134 17.82 21.62 42.728 3.454 18.074 14.934 11.731 548.59 561.76 0.69 OpenBenchmarking.org
ProjectPhysX OpenCL-Benchmark Operation: FP32 Compute OpenBenchmarking.org TFLOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: FP32 Compute Intel HD 4600 HAS GT2 RTX 4090 mobile 10 20 30 40 50 SE +/- 0.000, N = 3 SE +/- 0.016, N = 3 0.070 42.728 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT64 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT64 Compute Intel HD 4600 HAS GT2 RTX 4090 mobile 0.7772 1.5544 2.3316 3.1088 3.886 SE +/- 0.000, N = 3 SE +/- 0.009, N = 3 0.007 3.454 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT32 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT32 Compute Intel HD 4600 HAS GT2 RTX 4090 mobile 4 8 12 16 20 SE +/- 0.000, N = 3 SE +/- 0.095, N = 3 0.033 18.074 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT16 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT16 Compute Intel HD 4600 HAS GT2 RTX 4090 mobile 4 8 12 16 20 SE +/- 0.000, N = 3 SE +/- 0.179, N = 3 0.101 14.934 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: INT8 Compute OpenBenchmarking.org TIOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT8 Compute Intel HD 4600 HAS GT2 RTX 4090 mobile 3 6 9 12 15 SE +/- 0.000, N = 3 SE +/- 0.062, N = 3 0.134 11.731 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: Memory Bandwidth Coalesced Read OpenBenchmarking.org GB/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: Memory Bandwidth Coalesced Read Intel HD 4600 HAS GT2 RTX 4090 mobile 120 240 360 480 600 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 17.82 548.59 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: Memory Bandwidth Coalesced Write OpenBenchmarking.org GB/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: Memory Bandwidth Coalesced Write Intel HD 4600 HAS GT2 RTX 4090 mobile 120 240 360 480 600 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 21.62 561.76 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
ProjectPhysX OpenCL-Benchmark Operation: FP64 Compute OpenBenchmarking.org TFLOPs/s, More Is Better ProjectPhysX OpenCL-Benchmark 1.2 Operation: FP64 Compute RTX 4090 mobile 0.1553 0.3106 0.4659 0.6212 0.7765 SE +/- 0.00, N = 3 0.69 1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
Phoronix Test Suite v10.8.5