test_002 AMD Ryzen 9 7900X3D 12-Core testing with a ASUS ProArt B650-CREATOR (2007 BIOS) and ASUS NVIDIA GeForce RTX 3070 8GB on Ubuntu 24.04 via the Phoronix Test Suite. test_002: Processor: AMD Ryzen 9 7900X3D 12-Core @ 5.66GHz (12 Cores / 24 Threads), Motherboard: ASUS ProArt B650-CREATOR (2007 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32 GB DDR5-4800MT/s Kingston KF556C36-32, Disk: 2000GB Samsung SSD 990 PRO 2TB, Graphics: ASUS NVIDIA GeForce RTX 3070 8GB, Audio: NVIDIA GA104 HD Audio, Monitor: HP E242, Network: Realtek RTL8111/8168/8211/8411 + Realtek RTL8125 2.5GbE OS: Ubuntu 24.04, Kernel: 6.8.0-47-generic (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 560.35.03, Compiler: GCC 13.2.0 + CUDA 12.6, File-System: ext4, Screen Resolution: 1920x1200 ArrayFire 3.9 Test: Conjugate Gradient OpenCL Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest Seconds < Lower Is Better test_002 . 166.58 |============================================================ Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest Seconds < Lower Is Better test_002 . 165.42 |============================================================ Blender 4.2 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better test_002 . 10.44 |============================================================= Blender 4.2 Blend File: Junkshop - Compute: NVIDIA OptiX Seconds < Lower Is Better test_002 . 17.86 |============================================================= Blender 4.2 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better test_002 . 26.28 |============================================================= Blender 4.2 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better test_002 . 18.01 |============================================================= Blender 4.2 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better test_002 . 89.69 |============================================================= Blender 4.2 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better test_002 . 29.06 |============================================================= Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better cl-mem 2017-01-13 Benchmark: Copy cl-mem 2017-01-13 Benchmark: Read cl-mem 2017-01-13 Benchmark: Write clpeak 1.1.2 OpenCL Test: Integer Compute INT clpeak 1.1.2 OpenCL Test: Single-Precision Float clpeak 1.1.2 OpenCL Test: Double-Precision Double clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth FAHBench 2.3.2 Ns Per Day > Higher Is Better test_002 . 269.62 |============================================================ FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better GROMACS 2024 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better test_002 . 15.11 |============================================================= Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better test_002 . 41067200000 |======================================================= Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better test_002 . 12576133333 |======================================================= Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better test_002 . 655300 |============================================================ Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better test_002 . 1869466667 |======================================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better test_002 . 483967 |============================================================ IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom M samples/s > Higher Is Better test_002 . 13.18 |============================================================= IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar M samples/s > Higher Is Better test_002 . 36.85 |============================================================= LeelaChessZero 0.31.1 Backend: OpenCL Nodes Per Second > Higher Is Better Libplacebo 6.338.2 FPS > Higher Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better test_002 . 8.48 |============================================================== LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better test_002 . 5.89 |============================================================== LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better test_002 . 8.40 |============================================================== LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better test_002 . 7.47 |============================================================== LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better test_002 . 23.13 |============================================================= MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer GIOPS > Higher Is Better test_002 . 9956.87 |=========================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision GFLOPS > Higher Is Better test_002 . 22007.25 |========================================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision GFLOPS > Higher Is Better test_002 . 301.61 |============================================================ Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision GFLOPS > Higher Is Better test_002 . 21228.42 |========================================================== NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better test_002 . 0.09577 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better test_002 . 11.97 |============================================================= NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better test_002 . 4.58 |============================================================== NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better test_002 . 4.59 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better test_002 . 4.84 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better test_002 . 4.27 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better test_002 . 5.97 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better test_002 . 2.00 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better test_002 . 11.72 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better test_002 . 35.81 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better test_002 . 7.46 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better test_002 . 5.74 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better test_002 . 15.15 |============================================================= NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 ms < Lower Is Better test_002 . 11.97 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better test_002 . 18.60 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better test_002 . 9.80 |============================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better test_002 . 11.48 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better test_002 . 54.22 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better test_002 . 5.50 |============================================================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better test_002 . 3070 |============================================================== OctaneBench 2020.1 Total Score Score > Higher Is Better test_002 . 399.09 |============================================================ PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Seconds < Lower Is Better test_002 . 8.993 |============================================================= RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Seconds < Lower Is Better test_002 . 48.78 |============================================================= RedShift Demo 3.0 Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better test_002 . 261 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better test_002 . 381 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better test_002 . 245 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better test_002 . 81.6 |============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better test_002 . 125 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better test_002 . 119.1 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better test_002 . 132 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better test_002 . 154 |=============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better test_002 . 59.5 |============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better test_002 . 56.9 |============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better test_002 . 63.4 |============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better test_002 . 60.7 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS GFLOPS > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT R2C / C2R Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in half precision Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C Bluestein in single precision Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in double precision Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in single precision Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C multidimensional in single precision Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C Bluestein benchmark in double precision Benchmark Score > Higher Is Better VkFFT 1.3.4 Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling Benchmark Score > Higher Is Better vkpeak 20230730 GFLOPS > Higher Is Better VkResample 1.0 Upscale: 2x - Precision: Double ms < Lower Is Better VkResample 1.0 Upscale: 2x - Precision: Single ms < Lower Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Seconds < Lower Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Seconds < Lower Is Better test_002 . 4.661 |=============================================================