KVM testing on Ubuntu 24.04 via the Phoronix Test Suite.
ASPEED - 2 x Intel Xeon Gold 6226R Processor: 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads), Motherboard: (5.14 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 512GB, Disk: 2 x 8002GB INTEL SSDPE2KX080T8, Graphics: ASPEED 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: 27B2G5, Network: 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb
OS: Ubuntu 24.04, Kernel: 6.8.0-38-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08Python Notes: Python 3.8.13Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
5x A5000 kw-dl580-3-4 NVIDIA Processor: 4 x Intel Xeon E7-4880 v2 (60 Cores / 120 Threads) , Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (edk2-20240813-1.fc40 BIOS) , Chipset: Intel 82G33/G31/P35/P31 + ICH9 , Memory: 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 4 GB RAM , Disk: 21GB VIRTUAL-DISK , Graphics: Red Hat QXL paravirtual graphic card 22GB , Audio: QEMU Generic , Network: 2 x Red Hat Virtio 1.0 device
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.0, File-System: ext4, Screen Resolution: 1024x768, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0x715Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.6d.00.0dPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Mitigation of PTE Inversion; VMX: flush not necessary SMT vulnerable + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Retpoline + srbds: Not affected + tsx_async_abort: Not affected
Hashcat Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: MD5 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 30000M 60000M 90000M 120000M 150000M SE +/- 23688235109.70, N = 16 SE +/- 31239876948.73, N = 16 137067950000 156178112500
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA1 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20000M 40000M 60000M 80000M 100000M SE +/- 133865807.60, N = 3 SE +/- 113680610.09, N = 3 75698633333 91940033333
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: 7-Zip 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 900K 1800K 2700K 3600K 4500K SE +/- 9837.57, N = 3 SE +/- 9462.73, N = 3 3519467 4224700
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA-512 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 3000M 6000M 9000M 12000M 15000M SE +/- 27986087.81, N = 3 SE +/- 26463244.95, N = 3 10923433333 13308000000
OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 700K 1400K 2100K 2800K 3500K SE +/- 4272.52, N = 3 SE +/- 1039.23, N = 3 2853933 3454200
OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 90 180 270 360 450 SE +/- 4.99, N = 15 SE +/- 0.92, N = 3 391.91 309.39 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 6K 12K 18K 24K 30K SE +/- 327.84, N = 15 SE +/- 27.29, N = 3 27753.40 18670.28 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
SHOC Scalable HeterOgeneous Computing The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D ASPEED - 2 x Intel Xeon Gold 6226R 50 100 150 200 250 SE +/- 0.07, N = 3 211.90 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: S3D
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.00, N = 3 12.12 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: Triad
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP ASPEED - 2 x Intel Xeon Gold 6226R 200 400 600 800 1000 SE +/- 0.17, N = 3 1094.66 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: FFT SP
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash ASPEED - 2 x Intel Xeon Gold 6226R 5 10 15 20 25 SE +/- 0.00, N = 3 22.57 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: MD5 Hash
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.27, N = 3 324.18 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: Reduction
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N ASPEED - 2 x Intel Xeon Gold 6226R 800 1600 2400 3200 4000 SE +/- 44.25, N = 4 3630.55 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: GEMM SGEMM_N
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops ASPEED - 2 x Intel Xeon Gold 6226R 5K 10K 15K 20K 25K SE +/- 305.65, N = 3 21619.9 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: Max SP Flops
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.00, N = 3 12.33 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: Bus Speed Download
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.00, N = 3 13.15 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: Bus Speed Readback
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth ASPEED - 2 x Intel Xeon Gold 6226R 400 800 1200 1600 2000 SE +/- 4.68, N = 3 1998.58 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Target: OpenCL - Benchmark: Texture Read Bandwidth
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found
OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 130 260 390 520 650 SE +/- 0.43, N = 3 SE +/- 0.03, N = 3 584.4 380.1 1. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 120 240 360 480 600 SE +/- 0.17, N = 3 SE +/- 0.10, N = 3 547.4 376.4 1. (CC) gcc options: -O2 -flto -lOpenCL
RedShift Demo This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./redshift: 3: /usr/redshift/bin/redshiftBenchmark: not found
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./redshift: 3: /usr/redshift/bin/redshiftBenchmark: not found
OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Float 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 6K 12K 18K 24K 30K SE +/- 0.40, N = 3 SE +/- 76.05, N = 3 26836.38 18602.65 1. (CXX) g++ options: -O3
OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Double 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 100 200 300 400 500 SE +/- 1.64, N = 3 SE +/- 0.36, N = 3 483.57 365.83 1. (CXX) g++ options: -O3
OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 130 260 390 520 650 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 582.46 377.04 1. (CXX) g++ options: -O3
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
Backend: OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 2 4 6 8 10 SE +/- 0.079, N = 15 SE +/- 0.096, N = 3 6.694 7.105 -O2 -lOpenCL -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl 1. (CXX) g++ options:
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 0.60, N = 3 SE +/- 5.22, N = 12 48.60 57.44 MIN: 29.6 / MAX: 52.28 MAX: 65.49
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.26, N = 3 24.72 35.82 MIN: 2.11 / MAX: 34.96 MIN: 12.8 / MAX: 46.64
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 11 22 33 44 55 SE +/- 0.29, N = 15 SE +/- 0.09, N = 3 34.46 50.64 MIN: 0.45 / MAX: 47.01 MIN: 44.98 / MAX: 64.06
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 6 12 18 24 30 SE +/- 1.54, N = 12 SE +/- 2.44, N = 12 16.69 26.70 MAX: 33.09 MAX: 43.01
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 30 60 90 120 150 SE +/- 2.42, N = 12 SE +/- 0.62, N = 3 76.39 122.06 MIN: 61.59 / MAX: 129.05 MIN: 106.98 / MAX: 141.43
ArrayFire Test: Conjugate Gradient OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: ./arrayfire: 7: ./cg_opencl: not found
5x A5000 kw-dl580-3-4 NVIDIA: The test run did not produce a result. E: ./arrayfire: 7: ./cg_opencl: not found
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 3 6 9 12 15 SE +/- 0.107, N = 3 SE +/- 0.016, N = 3 8.049 10.460 1. (CXX) g++ options: -O3 -march=native -fopenmp
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 34.09, N = 12 SE +/- 3.89, N = 15 299.3 228.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 90 180 270 360 450 SE +/- 53.50, N = 12 SE +/- 5.93, N = 15 392.9 382.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 60 120 180 240 300 SE +/- 23.95, N = 12 SE +/- 1.23, N = 15 199.8 261.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 30 60 90 120 150 SE +/- 1.21, N = 12 SE +/- 2.79, N = 15 46.1 117.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 2.44, N = 12 SE +/- 1.77, N = 15 67.8 183.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 2.95, N = 12 SE +/- 1.40, N = 15 56.6 175.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 1.70, N = 12 SE +/- 0.64, N = 15 51.6 108.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 20.15, N = 12 SE +/- 1.22, N = 15 191.3 196.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 0.22, N = 12 SE +/- 1.20, N = 15 57.0 59.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 0.35, N = 12 SE +/- 1.11, N = 14 56.1 59.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 14 28 42 56 70 SE +/- 0.13, N = 12 SE +/- 1.35, N = 14 58.3 62.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 0.14, N = 12 SE +/- 1.46, N = 15 57.2 58.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 309 266 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 90 180 270 360 450 SE +/- 1.53, N = 3 SE +/- 0.33, N = 3 403 345 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 291 312 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 100 200 300 400 500 SE +/- 1.45, N = 3 SE +/- 0.58, N = 3 473 359 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 120 240 360 480 600 SE +/- 1.15, N = 3 SE +/- 0.00, N = 3 534 385 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 100 200 300 400 500 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 477 383 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 164 170 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 70 140 210 280 350 SE +/- 0.88, N = 3 SE +/- 1.76, N = 3 324 317 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 100 200 300 400 500 SE +/- 1.15, N = 3 SE +/- 1.45, N = 3 440 344 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 100 200 300 400 500 SE +/- 2.03, N = 3 443 348 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 100 200 300 400 500 SE +/- 2.03, N = 3 442 340 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 2.12, N = 9 SE +/- 0.22, N = 12 77.05 18.93 MIN: 37.56 / MAX: 842.19 MIN: 17.49 / MAX: 22.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 9 18 27 36 45 SE +/- 1.67, N = 9 SE +/- 0.10, N = 12 40.16 8.37 MIN: 19.36 / MAX: 731.71 MIN: 7.43 / MAX: 30.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 9 18 27 36 45 SE +/- 1.76, N = 9 SE +/- 0.06, N = 12 40.31 8.52 MIN: 19.09 / MAX: 864.81 MIN: 7.93 / MAX: 87.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 11 22 33 44 55 SE +/- 2.22, N = 9 SE +/- 0.08, N = 12 47.98 9.72 MIN: 22.19 / MAX: 949.82 MIN: 8.97 / MAX: 16.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 9 18 27 36 45 SE +/- 1.96, N = 9 SE +/- 0.12, N = 12 39.58 7.30 MIN: 17.71 / MAX: 715.88 MIN: 6.57 / MAX: 71.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 13 26 39 52 65 SE +/- 1.20, N = 9 SE +/- 0.13, N = 12 57.12 11.02 MIN: 26.93 / MAX: 1232.95 MIN: 9.86 / MAX: 77.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 5 10 15 20 25 SE +/- 1.40, N = 9 SE +/- 0.07, N = 12 21.06 4.13 MIN: 10.19 / MAX: 621.35 MIN: 3.7 / MAX: 4.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 3.17, N = 9 SE +/- 0.33, N = 12 90.21 18.15 MIN: 41.1 / MAX: 1221.73 MIN: 15.73 / MAX: 36.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 30 60 90 120 150 SE +/- 1.79, N = 9 SE +/- 0.42, N = 12 140.86 45.73 MIN: 71.64 / MAX: 393.59 MIN: 41.81 / MAX: 716.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 10 20 30 40 50 SE +/- 1.32, N = 9 SE +/- 0.11, N = 12 43.28 10.92 MIN: 20.79 / MAX: 504.17 MIN: 10.17 / MAX: 12.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 7 14 21 28 35 SE +/- 0.75, N = 9 SE +/- 0.10, N = 12 28.16 7.97 MIN: 13.44 / MAX: 256.84 MIN: 7.31 / MAX: 10.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 2.01, N = 9 SE +/- 0.24, N = 12 93.89 21.90 MIN: 44.98 / MAX: 1038.18 MIN: 20.15 / MAX: 31.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 1.69, N = 9 SE +/- 0.49, N = 12 106.30 33.43 MIN: 52.73 / MAX: 505.17 MIN: 29.11 / MAX: 256.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 20 40 60 80 100 SE +/- 3.32, N = 9 SE +/- 0.34, N = 12 77.37 20.32 MIN: 35.36 / MAX: 1176.72 MIN: 18.17 / MAX: 30.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 50 100 150 200 250 SE +/- 27.31, N = 9 SE +/- 0.23, N = 12 227.14 32.77 MIN: 93.69 / MAX: 5948.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 50 100 150 200 250 SE +/- 3.18, N = 9 SE +/- 0.72, N = 12 228.16 58.46 MIN: 122.48 / MAX: 1174.54 MIN: 52.56 / MAX: 125.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet 5x A5000 kw-dl580-3-4 NVIDIA ASPEED - 2 x Intel Xeon Gold 6226R 11 22 33 44 55 SE +/- 1.46, N = 9 SE +/- 0.41, N = 11 50.17 10.39 MIN: 22.35 / MAX: 948.14 MIN: 8.52 / MAX: 30.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
PlaidML This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 160 320 480 640 800 SE +/- 1.13, N = 3 751.93
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 400 800 1200 1600 2000 SE +/- 2.99, N = 3 1898.70
FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 500 1000 1500 2000 2500 SE +/- 0.40, N = 3 2201.61
FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL ASPEED - 2 x Intel Xeon Gold 6226R 40 80 120 160 200 SE +/- 0.34, N = 3 179.21
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'
MandelGPU MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
OpenCL Device: GPU
ASPEED - 2 x Intel Xeon Gold 6226R: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.
5x A5000 kw-dl580-3-4 NVIDIA: The test quit with a non-zero exit status.
NeatBench NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.
Acceleration: GPU
ASPEED - 2 x Intel Xeon Gold 6226R: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.
5x A5000 kw-dl580-3-4 NVIDIA: The test run did not produce a result.
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN 5x A5000 kw-dl580-3-4 NVIDIA 100 200 300 400 500 SE +/- 1.76, N = 3 443 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 5x A5000 kw-dl580-3-4 NVIDIA 20 40 60 80 100 SE +/- 2.12, N = 9 77.05 MIN: 37.56 / MAX: 842.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
ASPEED - 2 x Intel Xeon Gold 6226R Processor: 2 x Intel Xeon Gold 6226R @ 3.90GHz (32 Cores / 64 Threads), Motherboard: (5.14 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 512GB, Disk: 2 x 8002GB INTEL SSDPE2KX080T8, Graphics: ASPEED 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: 27B2G5, Network: 2 x Intel X722 for 1GbE + 2 x Broadcom BCM57414 NetXtreme-E 10Gb/25Gb
OS: Ubuntu 24.04, Kernel: 6.8.0-38-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08Python Notes: Python 3.8.13Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 19 July 2024 23:42 by user malogica.
5x A5000 kw-dl580-3-4 NVIDIA Processor: 4 x Intel Xeon E7-4880 v2 (60 Cores / 120 Threads), Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (edk2-20240813-1.fc40 BIOS), Chipset: Intel 82G33/G31/P35/P31 + ICH9, Memory: 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 4 GB RAM, Disk: 21GB VIRTUAL-DISK, Graphics: Red Hat QXL paravirtual graphic card 22GB, Audio: QEMU Generic, Network: 2 x Red Hat Virtio 1.0 device
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.131, Compiler: GCC 13.2.0 + CUDA 12.0, File-System: ext4, Screen Resolution: 1024x768, System Layer: KVM
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: CPU Microcode: 0x715Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.6d.00.0dPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Mitigation of PTE Inversion; VMX: flush not necessary SMT vulnerable + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Retpoline + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 7 October 2024 22:58 by user root.