CLPEAK AMD Ryzen 9 7945HX testing with a Alienware 0DWD2H (1.13.1 BIOS) and NVIDIA GeForce RTX 4090 Laptop GPU 16GB on cachyos rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2409218-EIRI-240504151&sro&grs .
CLPEAK Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenGL OpenCL Compiler File-System Screen Resolution Display Driver Radeon HD 8790M Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 AMD Radeon HD 8730M AMD Radeon HD 8730M dGPU mobile Mars series Intel HD Graphics HSW ULT GT2 Dual-Channel RAM AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM nVidia RTX 4090 mobile Intel Core i5-4300M @ 3.30GHz (2 Cores / 4 Threads) Dell 0VWNW8 (A26 BIOS) Intel Xeon E3-1200 v3/4th 8GB 128GB SAMSUNG SSD PM85 AMD Radeon HD 8790M (1250MHz) Intel Xeon E3-1200 v3/4th Intel I217-LM + Intel Centrino Ultimate-N 6300 cachyos rolling 6.6.2-4-cachyos-lto (x86_64) GNOME Shell 45.1 X Server 1.21.1.9 4.6 Mesa 24.0.0-devel (git-023fa0aa5d) (LLVM 16.0.6 DRM 3.54) OpenCL 1.1 Mesa 24.0.0-devel (git-023fa0aa5d) GCC 13.2.1 20231110 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3 xfs 1920x1080 Intel HD 4600 HSW GT2 2GB (1250MHz) 6.7.9-1-cachyos-rt-bore-lto (x86_64) KDE Plasma 6.0.1 X Server 1.21.1.11 4.6 Mesa 24.0.2-arch1.2 OpenCL 2.0 beignet 1.4 (git-f72309a5) Clang 17.0.6 + GCC 13.2.1 20230801 + LLVM 17.0.6 Intel Core i5-4300U @ 2.90GHz (2 Cores / 4 Threads) HP 198F (L71 Ver. 01.49 BIOS) Intel Haswell-ULT DRAM 256GB SanDisk SD7SB3Q- AMD Radeon HD 8500M / 8700M (1100MHz) Intel Haswell-ULT HD Audio Intel I218-LM + Intel 7260 6.8.7-3-cachyos-lto (x86_64) KDE Plasma 6.0.4 X Server 1.21.1.13 + Wayland 4.6 Mesa 24.0.6-arch1.2 (LLVM 17.0.6 DRM 3.57) OpenCL 2.0 beignet 1.4 (git-419c0417) + OpenCL 3.0 GCC 13.2.1 20240417 + Clang 17.0.6 + LLVM 17.0.6 + CUDA 12.4 OpenCL 3.0 16GB 6.8.9-2-cachyos-lto (x86_64) OpenCL 2.0 beignet 1.4 (git-419c0417) OpenCL 3.0 AMD Ryzen 9 7945HX @ 5.46GHz (16 Cores / 32 Threads) Alienware 0DWD2H (1.13.1 BIOS) AMD Device 14d8 62GB PC SN810 NVMe WDC 2048GB + 4001GB CT4000P3SSD8 NVIDIA GeForce RTX 4090 Laptop GPU 16GB NVIDIA Device 22bb Realtek RTL8125 2.5GbE + Qualcomm QCNFA765 6.11.0-5-cachyos-lto (x86_64) GNOME Shell 47.0 X Server 1.21.1.13 NVIDIA 560.35.03 4.6.0 OpenCL 3.0 CUDA 12.6.65 GCC 14.2.1 20240910 + Clang 18.1.8 + LLVM 18.1.8 + CUDA 12.6 zfs 2560x1600 OpenBenchmarking.org Kernel Details - Radeon HD 8790M: cfg80211.cfg80211_disable_40mhz_24ghz=1 mac80211.minstrel_vht_only=1 - Transparent Huge Pages: always - Intel HD Graphics HSW ULT GT2: cfg80211.cfg80211_disable_40mhz_24ghz=1 mac80211.minstrel_vht_only=1 rfkill.default_state=1 kvm.nx_huge_pages=force - Transparent Huge Pages: always - AMD Radeon HD 8730M: cfg80211.cfg80211_disable_40mhz_24ghz=1 mac80211.minstrel_vht_only=1 rfkill.default_state=1 kvm.nx_huge_pages=force - Transparent Huge Pages: always - AMD Radeon HD 8730M dGPU mobile Mars series: cfg80211.cfg80211_disable_40mhz_24ghz=1 mac80211.minstrel_vht_only=1 rfkill.default_state=1 kvm.nx_huge_pages=force - Transparent Huge Pages: always - Intel HD Graphics HSW ULT GT2 Dual-Channel RAM: amdgpu.si_support=1 amdgpu.cik_support=1 radeon.si_support=0 radeon.cik_support=0 amdgpu.bapm=0 amdgpu.dc=1 amdgpu.dcfeaturemask=0xffffffff amdgpu.cg_mask=0xffffffff amdgpu.modeset=1 amdgpu.mes=1 - Transparent Huge Pages: always - AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM: amdgpu.si_support=1 amdgpu.cik_support=1 radeon.si_support=0 radeon.cik_support=0 amdgpu.bapm=0 amdgpu.dc=1 amdgpu.dcfeaturemask=0xffffffff amdgpu.cg_mask=0xffffffff amdgpu.modeset=1 amdgpu.mes=1 - Transparent Huge Pages: always - nVidia RTX 4090 mobile: Transparent Huge Pages: always Environment Details - Radeon HD 8790M: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" - Intel HD Graphics HSW ULT GT2: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" - AMD Radeon HD 8730M: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" - AMD Radeon HD 8730M dGPU mobile Mars series: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" - Intel HD Graphics HSW ULT GT2 Dual-Channel RAM: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" - AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" - nVidia RTX 4090 mobile: MUTTER_DEBUG_KMS_THREAD_TYPE=user Compiler Details - Radeon HD 8790M: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - Intel HD Graphics HSW ULT GT2: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - AMD Radeon HD 8730M: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - AMD Radeon HD 8730M dGPU mobile Mars series: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - Intel HD Graphics HSW ULT GT2 Dual-Channel RAM: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - nVidia RTX 4090 mobile: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++,rust --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details - Radeon HD 8790M: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28 - Intel HD Graphics 4600 HSW GT2: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28 - Intel HD Graphics HSW ULT GT2: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x26 - AMD Radeon HD 8730M: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x26 - AMD Radeon HD 8730M dGPU mobile Mars series: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x26 - Intel HD Graphics HSW ULT GT2 Dual-Channel RAM: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x26 - AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x26 - nVidia RTX 4090 mobile: Scaling Governor: amd-pstate-epp performance (Boost: Enabled EPP: performance) - CPU Microcode: 0xa601206 Graphics Details - Radeon HD 8790M: SNA - BAR1 / Visible vRAM Size: 256 MB - Intel HD Graphics 4600 HSW GT2: GLAMOR - Intel HD Graphics HSW ULT GT2: SNA - BAR1 / Visible vRAM Size: 256 MB - AMD Radeon HD 8730M: SNA - BAR1 / Visible vRAM Size: 256 MB - AMD Radeon HD 8730M dGPU mobile Mars series: SNA - BAR1 / Visible vRAM Size: 256 MB - Intel HD Graphics HSW ULT GT2 Dual-Channel RAM: SNA - BAR1 / Visible vRAM Size: 256 MB - AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM: SNA - BAR1 / Visible vRAM Size: 256 MB - nVidia RTX 4090 mobile: GLAMOR - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.2a.00.4c Security Details - Radeon HD 8790M: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected - Intel HD Graphics 4600 HSW GT2: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected - Intel HD Graphics HSW ULT GT2: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable; IBPB: disabled; STIBP: disabled; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected - AMD Radeon HD 8730M: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable; IBPB: disabled; STIBP: disabled; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected - AMD Radeon HD 8730M dGPU mobile Mars series: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable; IBPB: disabled; STIBP: disabled; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected - Intel HD Graphics HSW ULT GT2 Dual-Channel RAM: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected - AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected - nVidia RTX 4090 mobile: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected OpenCL Details - nVidia RTX 4090 mobile: GPU Compute Cores: 9728
CLPEAK clpeak: Global Memory Bandwidth clpeak: Integer Compute clpeak: Integer 24-bit Compute clpeak: Single-Precision Compute clpeak: Kernel Latency clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer clpeak: Double-Precision Compute Radeon HD 8790M Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 AMD Radeon HD 8730M AMD Radeon HD 8730M dGPU mobile Mars series Intel HD Graphics HSW ULT GT2 Dual-Channel RAM AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM nVidia RTX 4090 mobile 5.65 34.45 170.38 86.09 49.08 9.67 8.81 43.1 17.98 46.89 46.88 217.54 18.31 8.93 8.54 10.73 41.29 41.35 186.98 38.39 5.10 5.22 39.99 102.07 102.06 458.21 49.12 5.17 5.20 19.18 41.35 41.35 186.94 33.61 10.16 10.25 39.25 102.03 102.01 458.22 49.15 9.92 9.93 495.65 18078.99 18356.72 32811.70 13.04 12.92 12.54 650.97 OpenBenchmarking.org
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 110 220 330 440 550 SE +/- 0.22, N = 3 SE +/- 0.23, N = 3 SE +/- 0.17, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 8.26, N = 12 39.99 39.25 17.98 10.73 19.18 5.65 495.65 g++
clpeak OpenCL Test: Integer Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 4K 8K 12K 16K 20K SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 116.90, N = 3 102.07 102.03 46.89 41.29 41.35 34.45 18078.99 g++
clpeak OpenCL Test: Integer 24-bit Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 4K 8K 12K 16K 20K SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 4.64, N = 3 102.06 102.01 46.88 41.35 41.35 170.38 18356.72 g++
clpeak OpenCL Test: Single-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Compute AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 7K 14K 21K 28K 35K SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.18, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 626.22, N = 15 458.21 458.22 217.54 186.98 186.94 86.09 32811.70 g++
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak 1.1.2 OpenCL Test: Kernel Latency AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 11 22 33 44 55 SE +/- 0.48, N = 3 SE +/- 0.66, N = 15 SE +/- 0.18, N = 15 SE +/- 0.47, N = 15 SE +/- 0.25, N = 15 SE +/- 0.20, N = 3 SE +/- 2.71, N = 13 49.12 49.15 18.31 38.39 33.61 49.08 13.04 g++ g++ clang++ g++ g++ g++ g++
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.09, N = 15 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 5.17 9.92 8.93 5.10 10.16 9.67 12.92 g++ g++ clang++ g++ g++ g++ g++
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer AMD Radeon HD 8730M dGPU mobile Mars series AMD Radeon HD 8730M dGPU mobile Mars series w/dual-channel sys RAM Intel HD Graphics 4600 HSW GT2 Intel HD Graphics HSW ULT GT2 Intel HD Graphics HSW ULT GT2 Dual-Channel RAM Radeon HD 8790M nVidia RTX 4090 mobile 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.09, N = 15 SE +/- 0.09, N = 3 SE +/- 0.04, N = 15 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 5.20 9.93 8.54 5.22 10.25 8.81 12.54 g++ g++ clang++ g++ g++ g++ g++
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute Radeon HD 8790M nVidia RTX 4090 mobile 140 280 420 560 700 SE +/- 0.00, N = 3 SE +/- 4.04, N = 13 43.10 650.97 1. (CXX) g++ options: -O3
Phoronix Test Suite v10.8.5