2024-10-17-ncnn-gpu

AMD Ryzen Threadripper 3960X 24-Core testing with an ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS) and AMD Radeon RX 6900 XT 16GB on Fedora Linux 40 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2410170-ENB-2024101798&gru.

System: AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX

Processor: AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)
Motherboard: ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: 2 x 2000GB Sabrent Rocket Q
Graphics: AMD Radeon RX 6900 XT 16GB
Audio: AMD Navi 21/23
Monitor: MPCP28UHD + MP Monitor
Network: Aquantia AQtion AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Fedora Linux 40
Kernel: 6.11.3-200.fc40.x86_64 (x86_64)
Desktop: GNOME Shell 46.5
Display Server: X Server 1.20.14 + Wayland
OpenGL: 4.6 Mesa 24.1.7 (LLVM 18.1.6 DRM 3.59)
OpenCL: OpenCL 2.1 AMD-APP (3614.0) + OpenCL 3.0 PoCL 5.0 Linux RELOC SPIR LLVM 17.0.6 SLEEF DISTRO POCL_DEBUG
Compiler: GCC 14.2.1 20240912 + Clang 18.1.8 + LLVM 18.1.8
File-System: btrfs
Screen Resolution: 7680x2160

- Transparent Huge Pages: madvise
- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x830107c
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX (ms, fewer is better)

ncnn: Vulkan GPU - mobilenet: 18.78
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2: 7.71
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3: 6.19
ncnn: Vulkan GPU - shufflenet-v2: 8.78
ncnn: Vulkan GPU - mnasnet: 6.45
ncnn: Vulkan GPU - efficientnet-b0: 9.27
ncnn: Vulkan GPU - blazeface: 3.70
ncnn: Vulkan GPU - googlenet: 25.04
ncnn: Vulkan GPU - vgg16: 43.28
ncnn: Vulkan GPU - resnet18: 16.01
ncnn: Vulkan GPU - alexnet: 11.78
ncnn: Vulkan GPU - resnet50: 27.64
ncnn: Vulkan GPU - yolov4-tiny: 32.25
ncnn: Vulkan GPU - squeezenet_ssd: 18.58
ncnn: Vulkan GPU - regnety_400m: 22.28
ncnn: Vulkan GPU - vision_transformer: 73.13
ncnn: Vulkan GPU - FastestDet: 9.71

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 18.78 (SE +/- 0.36, N = 12; MIN: 15.72 / MAX: 589.09)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 7.71 (SE +/- 0.71, N = 12; MIN: 6.19 / MAX: 556.12)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 6.19 (SE +/- 0.24, N = 12; MIN: 5.46 / MAX: 262.23)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 8.78 (SE +/- 0.50, N = 12; MIN: 7.24 / MAX: 504)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 6.45 (SE +/- 0.36, N = 12; MIN: 5.4 / MAX: 916.71)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 9.27 (SE +/- 0.50, N = 12; MIN: 7.86 / MAX: 315.35)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 3.70 (SE +/- 0.22, N = 12; MIN: 2.87 / MAX: 278.5)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 25.04 (SE +/- 1.88, N = 12; MIN: 16.8 / MAX: 988.79)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 43.28 (SE +/- 1.27, N = 12; MIN: 33.71 / MAX: 423.2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 16.01 (SE +/- 0.65, N = 12; MIN: 11.05 / MAX: 586.57)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 11.78 (SE +/- 0.34, N = 12; MIN: 8.2 / MAX: 373.71)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 27.64 (SE +/- 0.86, N = 12; MIN: 19.54 / MAX: 1095.65)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 32.25 (SE +/- 0.74, N = 12; MIN: 25.08 / MAX: 650.3)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 18.58 (SE +/- 0.61, N = 11; MIN: 15.49 / MAX: 1036.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 22.28 (SE +/- 0.66, N = 12; MIN: 18.51 / MAX: 1014.4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 73.13 (SE +/- 1.76, N = 12; MIN: 62.69 / MAX: 1079.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20230517 (ms, fewer is better)
AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX: 9.71 (SE +/- 0.35, N = 12; MIN: 8.07 / MAX: 424.44)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.5