nogaPytorchResults

AMD Ryzen 9 9950X 16-Core testing with an ASUS PRIME B650M-A II (3201 BIOS) and NVIDIA GeForce RTX 4060 Ti 16GB on Ubuntu 24.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2501235-NE-NOGAPYTOR33&grt .
nogaPytorchResults: system details (run: ptRun1)

  Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)
  Motherboard: ASUS PRIME B650M-A II (3201 BIOS)
  Chipset: AMD Device 14d8
  Memory: 4 x 48GB DDR5-3600MT/s G Skill F5-6800J3446F48G
  Disk: 2000GB Samsung SSD 980 PRO 2TB
  Graphics: NVIDIA GeForce RTX 4060 Ti 16GB
  Audio: NVIDIA Device 22bd
  Network: 2 x Intel 10-Gigabit X540-AT2 + Realtek RTL8125 2.5GbE
  OS: Ubuntu 24.04
  Kernel: 6.8.0-51-generic (x86_64)
  Display Server: X Server 1.21.1.11
  Display Driver: NVIDIA
  Compiler: GCC 13.3.0 + CUDA 12.4
  File-System: ext4
  Screen Resolution: 1920x1080

System notes:

  - Transparent Huge Pages: madvise
  - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
  - CPU Microcode: 0xb404023
  - Python 3.12.3
  - Security mitigations:
      gather_data_sampling: Not affected
      itlb_multihit: Not affected
      l1tf: Not affected
      mds: Not affected
      meltdown: Not affected
      mmio_stale_data: Not affected
      reg_file_data_sampling: Not affected
      retbleed: Not affected
      spec_rstack_overflow: Not affected
      spec_store_bypass: Mitigation of SSB disabled via prctl
      spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
      spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
      srbds: Not affected
      tsx_async_abort: Not affected
nogaPytorchResults: result overview (run: ptRun1)

  pytorch: CPU - 64 - ResNet-50: 52.79
  pytorch: NVIDIA CUDA GPU - 64 - ResNet-50: 402.98

PyTorch 2.2.1
Device: CPU - Batch Size: 64 - Model: ResNet-50
batches/sec, More Is Better

  ptRun1: 52.79 (SE +/- 0.21, N = 3; MIN: 47.86 / MAX: 54)

PyTorch 2.2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-50
batches/sec, More Is Better

  ptRun1: 402.98 (SE +/- 0.33, N = 3; MIN: 337.14 / MAX: 405.91)
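
For readers who want to compare the two ResNet-50 results directly, the following is a minimal Python sketch that derives the GPU-to-CPU throughput ratio and the standard error as a share of each mean. The numbers are copied from the ptRun1 results above; the names `results`, `relative_se_percent`, `gpu_over_cpu`, and `rel_se` are illustrative only and are not part of the Phoronix Test Suite or its export.

```python
# Values copied from the ptRun1 results above (batches/sec, more is better).
# The dictionary layout and all names here are illustrative assumptions.
results = {
    "CPU": {"mean": 52.79, "se": 0.21},
    "NVIDIA CUDA GPU": {"mean": 402.98, "se": 0.33},
}


def relative_se_percent(mean: float, se: float) -> float:
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


# Ratio of GPU throughput to CPU throughput for the same model and batch size.
gpu_over_cpu = results["NVIDIA CUDA GPU"]["mean"] / results["CPU"]["mean"]

# Standard error relative to the mean, per device, as a rough noise indicator.
rel_se = {
    device: relative_se_percent(r["mean"], r["se"])
    for device, r in results.items()
}

print(f"GPU / CPU throughput ratio: {gpu_over_cpu:.2f}x")
for device, pct in rel_se.items():
    print(f"{device}: SE is {pct:.2f}% of the mean")
```

The ratio only compares these two reported averages; with N = 3 runs per test it should be read as indicative of this one system configuration rather than as a general CPU-versus-GPU figure.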
Phoronix Test Suite v10.8.5