2023-02-26-0032

Docker testing on Ubuntu 20.04.4 LTS via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2302257-NE-20230226027&grt.

2023-02-26-0032ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelOpenCLCompilerFile-SystemScreen ResolutionSystem LayerASPEED - AMD Ryzen Threadripper PRO 3955WX 16-CoresAMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads)ASUS Pro WS WRX80E-SAGE SE WIFI (0602 BIOS)AMD Starship/Matisse256GB4 x 2000GB Samsung SSD 970 EVO Plus 2TB + 1000GB Samsung SSD 970 EVO Plus 1TB + 500GB Western Digital WDS500G2B0BASPEED 16GB (2450MHz)Intel Device 4f922 x DELL U2312HM + 28E8502 x Intel 10G X550TUbuntu 20.04.4 LTS6.2.0-1275.native (x86_64)OpenCL 3.0 + OpenCL 2.1 AMD-APP (3513.0)GCC 11.3.0 + LLVM 15.0.3overlayfs1920x1080DockerOpenBenchmarking.org- amdgpu.ppfeaturemask=0xffffffff amdgpu.vm_block_size=10 amdgpu.vm_size=1024 - Transparent Huge Pages: always- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301055- GLAMOR- Python 3.10.6- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Vulnerable + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

2023-02-26-0032cl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Kernel Latencyclpeak: Single-Precision Floatclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLfluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Srodinia: OpenCL Myocyterodinia: OpenCL Leukocyteshoc: OpenCL - S3Dshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTxsbench-cl: ASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores132.7108.9113.8110.732502.28116.141.9810.546.4422.7410.2751.847425050915222443.9264.88978.4619418.2106.1742111.999609.51581032011.20802.9990440.04412619115856.585.476.371.881.082.576.095.380.312412996.7OpenBenchmarking.org

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.03, N = 3132.71. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.00, N = 3108.91. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.00, N = 3113.81. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 2.58, N = 12110.731. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores5001000150020002500SE +/- 0.20, N = 32502.281. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.04, N = 3116.141. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores0.44550.8911.33651.7822.2275SE +/- 0.00, N = 31.981. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores3691215SE +/- 0.02, N = 310.541. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Boat - Acceleration: OpenCLASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores246810SE +/- 0.003, N = 36.442

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Masskrug - Acceleration: OpenCLASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores0.61671.23341.85012.46683.0835SE +/- 0.002, N = 32.741

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Rack - Acceleration: OpenCLASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores0.06190.12380.18570.24760.3095SE +/- 0.000, N = 30.275

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Room - Acceleration: OpenCLASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores0.41560.83121.24681.66242.078SE +/- 0.003, N = 31.847

FluidX3D

Test: FP32-FP32

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP32ASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores9001800270036004500SE +/- 4.10, N = 34250

FluidX3D

Test: FP32-FP16C

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16CASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores11002200330044005500SE +/- 7.33, N = 35091

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16SASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores11002200330044005500SE +/- 14.47, N = 35222

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores100200300400500SE +/- 1.26, N = 3443.931. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL LeukocyteASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores1.12.23.34.45.5SE +/- 0.007, N = 34.8891. (CXX) g++ options: -O2 -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.09, N = 378.461. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores90180270360450SE +/- 0.95, N = 3418.211. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores246810SE +/- 0.0003, N = 36.17421. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.02, N = 3112.001. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores130260390520650SE +/- 0.10, N = 3609.521. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores200K400K600K800K1000KSE +/- 19871.31, N = 158103201. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores3691215SE +/- 0.00, N = 311.211. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores0.67481.34962.02442.69923.374SE +/- 0.0371, N = 32.99901. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores100200300400500SE +/- 2.25, N = 3440.041. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.33, N = 31261. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores4080120160200SE +/- 0.58, N = 31911. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 1.73, N = 31581. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores1326395265SE +/- 0.22, N = 356.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.37, N = 385.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.29, N = 376.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores1632486480SE +/- 0.92, N = 371.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.30, N = 381.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.96, N = 382.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 1.61, N = 376.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.38, N = 395.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.91, N = 380.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.58, N = 31241. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores306090120150SE +/- 0.00, N = 31291. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTASPEED - AMD Ryzen Threadripper PRO 3955WX 16-Cores20406080100SE +/- 0.48, N = 396.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5