Test 2 x Intel Xeon E5-2682 v4 testing with a Supermicro SYS-7048GR-TR X10DRG-Q v1.10 (3.2 BIOS) and MSI NVIDIA GeForce RTX 3090 24GB on Ubuntu 22.04 via the Phoronix Test Suite. test: Processor: 2 x Intel Xeon E5-2682 v4 @ 3.00GHz (32 Cores / 64 Threads), Motherboard: Supermicro X10DRG-Q v1.10 (3.2 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 32GB, Disk: 1000GB Samsung SSD 870 + 4001GB Western Digital WD40EFPX-68C, Graphics: MSI NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC888-VD, Network: 2 x Intel I350 OS: Ubuntu 22.04, Kernel: 5.15.0-102-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, Compiler: GCC 11.4.0 + Clang 14.0.0-1ubuntu1.1 + LLVM 14.0.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1024x768 Test: Processor: 2 x Intel Xeon E5-2682 v4 @ 3.00GHz (32 Cores / 64 Threads), Motherboard: Supermicro SYS-7048GR-TR X10DRG-Q v1.10 (3.2 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 2 x 16GB DDR4-2400MT/s, Disk: 1000GB Samsung SSD 870 + 4001GB Western Digital WD40EFPX-68C, Graphics: MSI NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC888-VD, Network: 2 x Intel I350 OS: Ubuntu 22.04, Kernel: 5.15.0-102-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.89, Compiler: GCC 11.4.0 + Clang 14.0.0-1ubuntu1.1 + LLVM 14.0.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1024x768 MSI NVIDIA GeForce RTX 3090: Processor: 2 x Intel Xeon E5-2682 v4 @ 3.00GHz (32 Cores / 64 Threads), Motherboard: Supermicro SYS-7048GR-TR X10DRG-Q v1.10 (3.2 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 2 x 16GB DDR4-2400MT/s, Disk: 1000GB Samsung SSD 870 + 4001GB Western Digital WD40EFPX-68C, Graphics: MSI NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC888-VD, Network: 2 x Intel I350 OS: Ubuntu 22.04, Kernel: 5.15.0-102-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.89, Vulkan: 1.3.277, Compiler: GCC 11.4.0 + Clang 14.0.0-1ubuntu1.1 + LLVM 14.0.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1024x768 SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better NeatBench 5 Acceleration: GPU FPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 3090 |=========================================== cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 359.5 |========================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 827.5 |========================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 753.6 |========================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 69.3 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 97.5 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 110.9 |========================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 16.8 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 23.6 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 25.3 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 29.6 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 37.1 |=========================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 359 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 490 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 359 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 596 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 713 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 641 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 186 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 371 |============================================ clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 816.71 |========================================= Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 526.43 |========================================= Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 37433.27 |======================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 33947.25 |======================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 497.73 |========================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 31838.27 |======================================= clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 34559.24 |======================================= clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 636.06 |========================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 57.5 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 52.8 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 60.0 |=========================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 58.7 |=========================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 590 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 593 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 591 |============================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 590 |============================================ Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer GIOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 19101.89 |======================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer GIOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 15908.05 |======================================= clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 17619.75 |======================================= Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 102516368750 |=================================== Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 43537200000 |==================================== Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 2122900 |======================================== Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 6285933333 |===================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 1629667 |======================================== LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 28.18 |========================================== LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 17.48 |========================================== LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 21.68 |========================================== LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 16.71 |========================================== LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 52.13 |========================================== LeelaChessZero 0.30 Backend: OpenCL Nodes Per Second > Higher Is Better FAHBench 2.3.2 Ns Per Day > Higher Is Better MSI NVIDIA GeForce RTX 3090 . 291.24 |========================================= GROMACS 2024 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better AI Benchmark Alpha 0.1.2 Score > Higher Is Better Chaos Group V-RAY 6.0 Mode: NVIDIA RTX GPU Vrays > Higher Is Better Chaos Group V-RAY 6.0 Mode: NVIDIA CUDA GPU Vrays > Higher Is Better ArrayFire 3.9 Test: Conjugate Gradient OpenCL Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 6.060 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 52.65 |========================================== NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 22.24 |========================================== NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 16.97 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 18.41 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 16.64 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 26.40 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 11.31 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 49.15 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 103.35 |========================================= NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 33.80 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 31.16 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 71.49 |========================================== NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 52.65 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 64.14 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 49.93 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 90.84 |========================================== NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 134.18 |========================================= NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 28.33 |========================================== RedShift Demo 3.0 Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 4.031 |========================================== Blender 4.1 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 4.89 |=========================================== Blender 4.1 Blend File: Junkshop - Compute: NVIDIA OptiX Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 14.33 |========================================== Blender 4.1 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 9.71 |=========================================== Blender 4.1 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 8.45 |=========================================== Blender 4.1 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 37.99 |========================================== Blender 4.1 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better MSI NVIDIA GeForce RTX 3090 . 11.11 |==========================================