TR 3960X WK

AMD Ryzen Threadripper 3960X 24-Core testing with an MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009286-PTS-TR3960XW65&sro&grw .
System details (runs 1, 2, 3)

Processor: AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)
Motherboard: MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32GB
Disk: 1000GB Sabrent Rocket 4.0 1TB
Graphics: Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB (1900/875MHz)
Audio: AMD Navi 10 HDMI Audio
Monitor: ASUS MG28U
Network: Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04
Kernel: 5.9.0-rc5-14sep-patch (x86_64) 20200914
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 4.6 Mesa 20.0.8 (LLVM 10.0.0)
Vulkan: 1.2.128
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 3840x2160

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand
- CPU Microcode: 0x8301025

Python Details
- Python 3.8.2

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected
Results overview

Test                                          1                2                3
hint: FLOAT                                   387229070.97652  387185459.42906  388597973.06793
hmmer: Pfam Database Search                   131.076          131.428          131.508
mafft: Multiple Sequence Alignment - LSU RNA  8.119            8.243            8.187
dolfyn: Computational Fluid Dynamics          15.815           15.860           15.705
caffe: AlexNet - CPU - 100                    63797            63525            63700
caffe: AlexNet - CPU - 200                    127209           127488           127527
caffe: AlexNet - CPU - 1000                   634925           636843           638848
caffe: GoogleNet - CPU - 100                  158702           158891           159254
caffe: GoogleNet - CPU - 200                  318121           319129           317413
caffe: GoogleNet - CPU - 1000                 1585367          1589890          1586757
ncnn: CPU - squeezenet                        16.40            16.39            16.43
ncnn: CPU - mobilenet                         17.46            17.24            17.22
ncnn: CPU-v2-v2 - mobilenet-v2                7.28             7.24             7.33
ncnn: CPU-v3-v3 - mobilenet-v3                6.82             6.78             6.86
ncnn: CPU - shufflenet-v2                     7.31             7.28             7.34
ncnn: CPU - mnasnet                           6.58             6.53             6.57
ncnn: CPU - efficientnet-b0                   8.79             8.64             8.69
ncnn: CPU - blazeface                         2.97             2.93             2.94
ncnn: CPU - googlenet                         17.76            17.75            17.78
ncnn: CPU - vgg16                             41.10            42.30            41.33
ncnn: CPU - resnet18                          13.89            13.94            13.96
ncnn: CPU - alexnet                           11.49            11.56            11.53
ncnn: CPU - resnet50                          23.37            23.55            23.53
ncnn: CPU - yolov4-tiny                       27.88            28.20            28.07
ncnn: Vulkan GPU - squeezenet                 6.22             6.27             6.13
ncnn: Vulkan GPU - mobilenet                  9.60             9.79             10.22
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2         4.35             4.35             4.35
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3         8.35             8.23             8.46
ncnn: Vulkan GPU - shufflenet-v2              3.20             3.20             3.20
ncnn: Vulkan GPU - mnasnet                    4.44             4.44             4.44
ncnn: Vulkan GPU - efficientnet-b0            11.93            12.20            12.25
ncnn: Vulkan GPU - blazeface                  0.90             0.89             0.89
ncnn: Vulkan GPU - googlenet                  8.63             8.25             8.41
ncnn: Vulkan GPU - vgg16                      80.33            80.68            80.10
ncnn: Vulkan GPU - resnet18                   3.83             3.82             3.81
ncnn: Vulkan GPU - alexnet                    28.12            28.66            28.36
ncnn: Vulkan GPU - resnet50                   11.79            11.85            11.92
ncnn: Vulkan GPU - yolov4-tiny                21.70            21.32            21.05
mlpack: scikit_ica                            53.53            52.58            54.33
mlpack: scikit_qda                            44.98            45.91            45.89
mlpack: scikit_svm                            19.75            19.69            19.67
mlpack: scikit_linearridgeregression          1.44             1.43             1.43
gromacs: Water Benchmark                      2.529            2.528            2.527
ffte: N=256, 3D Complex FFT Routine           83303.755210131  83979.875954596  83465.842136051
couchdb: 100 - 1000 - 24                      108.154          107.556          108.106
byte: Dhrystone 2                             46013391.3       45773092.9       46091339.8
Hierarchical INTegration 1.0 - Test: FLOAT
QUIPs, More Is Better
  1: 387229070.98 (SE +/- 224641.82, N = 3)
  2: 387185459.43 (SE +/- 145428.42, N = 3)
  3: 388597973.07 (SE +/- 333985.70, N = 3)
  (CC) gcc options: -O3 -march=native -lm

Timed HMMer Search 3.3.1 - Pfam Database Search
Seconds, Fewer Is Better
  1: 131.08 (SE +/- 0.15, N = 3)
  2: 131.43 (SE +/- 0.06, N = 3)
  3: 131.51 (SE +/- 0.20, N = 3)
  (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA
Seconds, Fewer Is Better
  1: 8.119 (SE +/- 0.058, N = 3)
  2: 8.243 (SE +/- 0.032, N = 3)
  3: 8.187 (SE +/- 0.029, N = 3)
  (CC) gcc options: -std=c99 -O3 -lm -lpthread

Dolfyn 0.527 - Computational Fluid Dynamics
Seconds, Fewer Is Better
  1: 15.82 (SE +/- 0.01, N = 3)
  2: 15.86 (SE +/- 0.01, N = 3)
  3: 15.71 (SE +/- 0.06, N = 3)
Caffe 2020-02-13
Milli-Seconds, Fewer Is Better
(CXX) g++ options for all Caffe tests: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Model: AlexNet - Acceleration: CPU - Iterations: 100
  1: 63797 (SE +/- 143.57, N = 3)
  2: 63525 (SE +/- 97.00, N = 3)
  3: 63700 (SE +/- 154.26, N = 3)

Model: AlexNet - Acceleration: CPU - Iterations: 200
  1: 127209 (SE +/- 174.78, N = 3)
  2: 127488 (SE +/- 239.20, N = 3)
  3: 127527 (SE +/- 241.08, N = 3)

Model: AlexNet - Acceleration: CPU - Iterations: 1000
  1: 634925 (SE +/- 490.79, N = 3)
  2: 636843 (SE +/- 1432.41, N = 3)
  3: 638848 (SE +/- 563.79, N = 3)

Model: GoogleNet - Acceleration: CPU - Iterations: 100
  1: 158702 (SE +/- 235.22, N = 3)
  2: 158891 (SE +/- 34.07, N = 3)
  3: 159254 (SE +/- 63.49, N = 3)

Model: GoogleNet - Acceleration: CPU - Iterations: 200
  1: 318121 (SE +/- 136.00, N = 3)
  2: 319129 (SE +/- 472.70, N = 3)
  3: 317413 (SE +/- 800.55, N = 3)

Model: GoogleNet - Acceleration: CPU - Iterations: 1000
  1: 1585367 (SE +/- 339.92, N = 3)
  2: 1589890 (SE +/- 2111.28, N = 3)
  3: 1586757 (SE +/- 2136.95, N = 3)
NCNN 20200916, CPU targets
ms, Fewer Is Better
(CXX) g++ options for all NCNN CPU tests: -O3 -rdynamic -lgomp -lpthread

Target: CPU - Model: squeezenet
  1: 16.40 (SE +/- 0.07, N = 3; MIN: 16.11 / MAX: 17.71)
  2: 16.39 (SE +/- 0.13, N = 3; MIN: 15.99 / MAX: 17.08)
  3: 16.43 (SE +/- 0.13, N = 3; MIN: 16.01 / MAX: 17.7)

Target: CPU - Model: mobilenet
  1: 17.46 (SE +/- 0.15, N = 3; MIN: 16.88 / MAX: 98.56)
  2: 17.24 (SE +/- 0.19, N = 3; MIN: 16.79 / MAX: 18.18)
  3: 17.22 (SE +/- 0.08, N = 3; MIN: 16.94 / MAX: 18.34)

Target: CPU-v2-v2 - Model: mobilenet-v2
  1: 7.28 (SE +/- 0.01, N = 3; MIN: 7.03 / MAX: 9.71)
  2: 7.24 (SE +/- 0.04, N = 3; MIN: 6.86 / MAX: 8.46)
  3: 7.33 (SE +/- 0.03, N = 3; MIN: 7.03 / MAX: 12.26)

Target: CPU-v3-v3 - Model: mobilenet-v3
  1: 6.82 (SE +/- 0.02, N = 3; MIN: 6.6 / MAX: 8.32)
  2: 6.78 (SE +/- 0.03, N = 3; MIN: 6.64 / MAX: 12.14)
  3: 6.86 (SE +/- 0.04, N = 3; MIN: 6.67 / MAX: 9.17)

Target: CPU - Model: shufflenet-v2
  1: 7.31 (SE +/- 0.04, N = 3; MIN: 7.1 / MAX: 9.34)
  2: 7.28 (SE +/- 0.01, N = 3; MIN: 7.02 / MAX: 8.28)
  3: 7.34 (SE +/- 0.02, N = 3; MIN: 7.06 / MAX: 8.58)

Target: CPU - Model: mnasnet
  1: 6.58 (SE +/- 0.03, N = 3; MIN: 6.37 / MAX: 7.69)
  2: 6.53 (SE +/- 0.02, N = 3; MIN: 6.37 / MAX: 7.64)
  3: 6.57 (SE +/- 0.02, N = 3; MIN: 6.42 / MAX: 7.23)

Target: CPU - Model: efficientnet-b0
  1: 8.79 (SE +/- 0.05, N = 3; MIN: 8.52 / MAX: 9.56)
  2: 8.64 (SE +/- 0.04, N = 3; MIN: 8.42 / MAX: 13.4)
  3: 8.69 (SE +/- 0.05, N = 3; MIN: 8.49 / MAX: 9.24)

Target: CPU - Model: blazeface
  1: 2.97 (SE +/- 0.02, N = 3; MIN: 2.8 / MAX: 3.43)
  2: 2.93 (SE +/- 0.01, N = 3; MIN: 2.78 / MAX: 4.11)
  3: 2.94 (SE +/- 0.01, N = 3; MIN: 2.79 / MAX: 3.29)

Target: CPU - Model: googlenet
  1: 17.76 (SE +/- 0.22, N = 3; MIN: 17.14 / MAX: 19.41)
  2: 17.75 (SE +/- 0.33, N = 3; MIN: 16.84 / MAX: 18.95)
  3: 17.78 (SE +/- 0.34, N = 3; MIN: 17.02 / MAX: 54.98)

Target: CPU - Model: vgg16
  1: 41.10 (SE +/- 0.19, N = 3; MIN: 40.58 / MAX: 42.83)
  2: 42.30 (SE +/- 0.52, N = 3; MIN: 40.29 / MAX: 44.44)
  3: 41.33 (SE +/- 0.34, N = 3; MIN: 40.29 / MAX: 125.07)

Target: CPU - Model: resnet18
  1: 13.89 (SE +/- 0.05, N = 3; MIN: 13.68 / MAX: 15.18)
  2: 13.94 (SE +/- 0.15, N = 3; MIN: 13.53 / MAX: 14.99)
  3: 13.96 (SE +/- 0.16, N = 3; MIN: 13.5 / MAX: 15.28)

Target: CPU - Model: alexnet
  1: 11.49 (SE +/- 0.07, N = 3; MIN: 11.33 / MAX: 12)
  2: 11.56 (SE +/- 0.12, N = 3; MIN: 11.26 / MAX: 12.58)
  3: 11.53 (SE +/- 0.13, N = 3; MIN: 11.25 / MAX: 15.96)

Target: CPU - Model: resnet50
  1: 23.37 (SE +/- 0.11, N = 3; MIN: 23.04 / MAX: 24.13)
  2: 23.55 (SE +/- 0.02, N = 3; MIN: 23.36 / MAX: 28.08)
  3: 23.53 (SE +/- 0.12, N = 3; MIN: 23.13 / MAX: 25.63)

Target: CPU - Model: yolov4-tiny
  1: 27.88 (SE +/- 0.04, N = 3; MIN: 27.63 / MAX: 32.81)
  2: 28.20 (SE +/- 0.14, N = 3; MIN: 27.77 / MAX: 40.69)
  3: 28.07 (SE +/- 0.10, N = 3; MIN: 27.72 / MAX: 29.15)
NCNN 20200916, Vulkan GPU targets
ms, Fewer Is Better
(CXX) g++ options for all NCNN Vulkan GPU tests: -O3 -rdynamic -lgomp -lpthread

Target: Vulkan GPU - Model: squeezenet
  1: 6.22 (SE +/- 0.08, N = 3; MIN: 5.94 / MAX: 30.39)
  2: 6.27 (SE +/- 0.09, N = 3; MIN: 5.92 / MAX: 16.22)
  3: 6.13 (SE +/- 0.02, N = 3; MIN: 5.93 / MAX: 9.53)

Target: Vulkan GPU - Model: mobilenet
  1: 9.60 (SE +/- 0.66, N = 3; MIN: 7.7 / MAX: 35.6)
  2: 9.79 (SE +/- 0.43, N = 3; MIN: 7.28 / MAX: 27.36)
  3: 10.22 (SE +/- 0.43, N = 3; MIN: 6.48 / MAX: 44.78)

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
  1: 4.35 (SE +/- 0.01, N = 3; MIN: 4.18 / MAX: 4.71)
  2: 4.35 (SE +/- 0.00, N = 3; MIN: 4.16 / MAX: 4.71)
  3: 4.35 (SE +/- 0.01, N = 3; MIN: 4.18 / MAX: 4.92)

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
  1: 8.35 (SE +/- 0.06, N = 3; MIN: 7 / MAX: 36)
  2: 8.23 (SE +/- 0.24, N = 3; MIN: 7 / MAX: 39.95)
  3: 8.46 (SE +/- 0.18, N = 3; MIN: 7 / MAX: 32.09)

Target: Vulkan GPU - Model: shufflenet-v2
  1: 3.20 (SE +/- 0.00, N = 3; MIN: 3.14 / MAX: 3.51)
  2: 3.20 (SE +/- 0.00, N = 3; MIN: 3.14 / MAX: 4.02)
  3: 3.20 (SE +/- 0.00, N = 3; MIN: 3.15 / MAX: 4.67)

Target: Vulkan GPU - Model: mnasnet
  1: 4.44 (SE +/- 0.00, N = 3; MIN: 4.29 / MAX: 5.12)
  2: 4.44 (SE +/- 0.01, N = 3; MIN: 4.29 / MAX: 4.8)
  3: 4.44 (SE +/- 0.01, N = 3; MIN: 4.29 / MAX: 9.71)

Target: Vulkan GPU - Model: efficientnet-b0
  1: 11.93 (SE +/- 0.25, N = 3; MIN: 10.02 / MAX: 43.66)
  2: 12.20 (SE +/- 0.27, N = 3; MIN: 10.04 / MAX: 36.91)
  3: 12.25 (SE +/- 0.07, N = 3; MIN: 9.98 / MAX: 38.67)

Target: Vulkan GPU - Model: blazeface
  1: 0.90 (SE +/- 0.00, N = 3; MIN: 0.88 / MAX: 1.78)
  2: 0.89 (SE +/- 0.00, N = 3; MIN: 0.88 / MAX: 1.08)
  3: 0.89 (SE +/- 0.01, N = 3; MIN: 0.87 / MAX: 1.05)

Target: Vulkan GPU - Model: googlenet
  1: 8.63 (SE +/- 0.10, N = 3; MIN: 6.94 / MAX: 36.76)
  2: 8.25 (SE +/- 0.38, N = 3; MIN: 6.93 / MAX: 44.59)
  3: 8.41 (SE +/- 0.41, N = 3; MIN: 6.92 / MAX: 34.2)

Target: Vulkan GPU - Model: vgg16
  1: 80.33 (SE +/- 0.26, N = 3; MIN: 70.05 / MAX: 121.13)
  2: 80.68 (SE +/- 0.21, N = 3; MIN: 70.8 / MAX: 121.57)
  3: 80.10 (SE +/- 0.31, N = 3; MIN: 70.02 / MAX: 120.26)

Target: Vulkan GPU - Model: resnet18
  1: 3.83 (SE +/- 0.01, N = 3; MIN: 3.75 / MAX: 4.35)
  2: 3.82 (SE +/- 0.00, N = 3; MIN: 3.75 / MAX: 4.28)
  3: 3.81 (SE +/- 0.00, N = 3; MIN: 3.74 / MAX: 4.33)

Target: Vulkan GPU - Model: alexnet
  1: 28.12 (SE +/- 0.27, N = 3; MIN: 25.02 / MAX: 62.51)
  2: 28.66 (SE +/- 0.29, N = 3; MIN: 24.96 / MAX: 55.15)
  3: 28.36 (SE +/- 0.13, N = 3; MIN: 24.67 / MAX: 55.99)

Target: Vulkan GPU - Model: resnet50
  1: 11.79 (SE +/- 0.24, N = 3; MIN: 10.07 / MAX: 37.47)
  2: 11.85 (SE +/- 0.39, N = 3; MIN: 10.05 / MAX: 35.67)
  3: 11.92 (SE +/- 0.32, N = 3; MIN: 10.07 / MAX: 40.48)

Target: Vulkan GPU - Model: yolov4-tiny
  1: 21.70 (SE +/- 0.02, N = 3; MIN: 12.05 / MAX: 42.59)
  2: 21.32 (SE +/- 0.10, N = 3; MIN: 13.93 / MAX: 46.1)
  3: 21.05 (SE +/- 0.29, N = 3; MIN: 11.08 / MAX: 40.17)
Mlpack Benchmark
Seconds, Fewer Is Better

Benchmark: scikit_ica
  1: 53.53 (SE +/- 0.37, N = 3)
  2: 52.58 (SE +/- 0.64, N = 3)
  3: 54.33 (SE +/- 0.31, N = 3)

Benchmark: scikit_qda
  1: 44.98 (SE +/- 0.32, N = 3)
  2: 45.91 (SE +/- 0.12, N = 3)
  3: 45.89 (SE +/- 0.04, N = 3)

Benchmark: scikit_svm
  1: 19.75 (SE +/- 0.05, N = 3)
  2: 19.69 (SE +/- 0.07, N = 3)
  3: 19.67 (SE +/- 0.05, N = 3)

Benchmark: scikit_linearridgeregression
  1: 1.44 (SE +/- 0.01, N = 3)
  2: 1.43 (SE +/- 0.01, N = 3)
  3: 1.43 (SE +/- 0.01, N = 3)
GROMACS 2020.3 - Water Benchmark
Ns Per Day, More Is Better
  1: 2.529 (SE +/- 0.003, N = 3)
  2: 2.528 (SE +/- 0.002, N = 3)
  3: 2.527 (SE +/- 0.002, N = 3)
  (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

FFTE 7.0 - N=256, 3D Complex FFT Routine
MFLOPS, More Is Better
  1: 83303.76 (SE +/- 411.46, N = 3)
  2: 83979.88 (SE +/- 111.80, N = 3)
  3: 83465.84 (SE +/- 308.33, N = 3)
  (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

Apache CouchDB 3.1.1 - Bulk Size: 100 - Inserts: 1000 - Rounds: 24
Seconds, Fewer Is Better
  1: 108.15 (SE +/- 0.55, N = 3)
  2: 107.56 (SE +/- 0.44, N = 3)
  3: 108.11 (SE +/- 0.15, N = 3)
  (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD

BYTE Unix Benchmark 3.6 - Computational Test: Dhrystone 2
LPS, More Is Better
  1: 46013391.3 (SE +/- 536306.21, N = 6)
  2: 45773092.9 (SE +/- 211256.10, N = 3)
  3: 46091339.8 (SE +/- 715922.77, N = 3)
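
Each result above is reported as a mean with a standard error (SE) over N trials. As a rough reading aid, the sketch below expresses an SE as a percentage of its mean and compares it with the gap between the three runs, using the Timed HMMer Search figures from this export. The helper names `relative_se` and `run_spread` are made up for illustration and are not part of the Phoronix Test Suite.

```python
def relative_se(value, se):
    """Standard error expressed as a percentage of the reported mean."""
    return 100.0 * se / value


def run_spread(values):
    """Gap between the largest and smallest run, as a percentage of the smallest."""
    return 100.0 * (max(values) - min(values)) / min(values)


# Timed HMMer Search, runs 1-3: (reported mean in seconds, reported SE),
# copied from the results in this export.
hmmer = [(131.08, 0.15), (131.43, 0.06), (131.51, 0.20)]

for run, (value, se) in enumerate(hmmer, start=1):
    print(f"run {run}: {value:.2f} s, SE = {relative_se(value, se):.2f}% of mean")

print(f"spread across runs: {run_spread([v for v, _ in hmmer]):.2f}%")
```

When the spread across runs is of the same order as the per-run SE percentages, the differences between runs 1, 2 and 3 are probably within measurement noise; this is an informal check, not a significance test.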
Phoronix Test Suite v10.8.5