TR 3960X WK
AMD Ryzen Threadripper 3960X 24-Core testing with an MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009286-PTS-TR3960XW65&sro&grr .
TR 3960X WK system details (runs 1, 2, 3)
  Processor: AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)
  Motherboard: MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 32GB
  Disk: 1000GB Sabrent Rocket 4.0 1TB
  Graphics: Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB (1900/875MHz)
  Audio: AMD Navi 10 HDMI Audio
  Monitor: ASUS MG28U
  Network: Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200
  OS: Ubuntu 20.04
  Kernel: 5.9.0-rc5-14sep-patch (x86_64) 20200914
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.8
  Display Driver: modesetting 1.20.8
  OpenGL: 4.6 Mesa 20.0.8 (LLVM 10.0.0)
  Vulkan: 1.2.128
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 3840x2160
Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details
  Scaling Governor: acpi-cpufreq ondemand
  CPU Microcode: 0x8301025
Python Details
  Python 3.8.2
Security Details
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Not affected
TR 3960X WK results overview
Test                                          Run 1            Run 2            Run 3
caffe: GoogleNet - CPU - 1000                 1585367          1589890          1586757
caffe: AlexNet - CPU - 1000                   634925           636843           638848
caffe: GoogleNet - CPU - 200                  318121           319129           317413
hint: FLOAT                                   387229070.97652  387185459.42906  388597973.06793
caffe: GoogleNet - CPU - 100                  158702           158891           159254
byte: Dhrystone 2                             46013391.3       45773092.9       46091339.8
hmmer: Pfam Database Search                   131.076          131.428          131.508
caffe: AlexNet - CPU - 200                    127209           127488           127527
couchdb: 100 - 1000 - 24                      108.154          107.556          108.106
mlpack: scikit_qda                            44.98            45.91            45.89
gromacs: Water Benchmark                      2.529            2.528            2.527
ncnn: CPU - yolov4-tiny                       27.88            28.20            28.07
ncnn: CPU - resnet50                          23.37            23.55            23.53
ncnn: CPU - alexnet                           11.49            11.56            11.53
ncnn: CPU - resnet18                          13.89            13.94            13.96
ncnn: CPU - vgg16                             41.10            42.30            41.33
ncnn: CPU - googlenet                         17.76            17.75            17.78
ncnn: CPU - blazeface                         2.97             2.93             2.94
ncnn: CPU - efficientnet-b0                   8.79             8.64             8.69
ncnn: CPU - mnasnet                           6.58             6.53             6.57
ncnn: CPU - shufflenet-v2                     7.31             7.28             7.34
ncnn: CPU-v3-v3 - mobilenet-v3                6.82             6.78             6.86
ncnn: CPU-v2-v2 - mobilenet-v2                7.28             7.24             7.33
ncnn: CPU - mobilenet                         17.46            17.24            17.22
ncnn: CPU - squeezenet                        16.40            16.39            16.43
caffe: AlexNet - CPU - 100                    63797            63525            63700
mlpack: scikit_ica                            53.53            52.58            54.33
ncnn: Vulkan GPU - yolov4-tiny                21.70            21.32            21.05
ncnn: Vulkan GPU - resnet50                   11.79            11.85            11.92
ncnn: Vulkan GPU - alexnet                    28.12            28.66            28.36
ncnn: Vulkan GPU - resnet18                   3.83             3.82             3.81
ncnn: Vulkan GPU - vgg16                      80.33            80.68            80.10
ncnn: Vulkan GPU - googlenet                  8.63             8.25             8.41
ncnn: Vulkan GPU - blazeface                  0.90             0.89             0.89
ncnn: Vulkan GPU - efficientnet-b0            11.93            12.20            12.25
ncnn: Vulkan GPU - mnasnet                    4.44             4.44             4.44
ncnn: Vulkan GPU - shufflenet-v2              3.20             3.20             3.20
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3         8.35             8.23             8.46
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2         4.35             4.35             4.35
ncnn: Vulkan GPU - mobilenet                  9.60             9.79             10.22
ncnn: Vulkan GPU - squeezenet                 6.22             6.27             6.13
mlpack: scikit_linearridgeregression          1.44             1.43             1.43
mlpack: scikit_svm                            19.75            19.69            19.67
dolfyn: Computational Fluid Dynamics          15.815           15.860           15.705
mafft: Multiple Sequence Alignment - LSU RNA  8.119            8.243            8.187
ffte: N=256, 3D Complex FFT Routine           83303.755210131  83979.875954596  83465.842136051
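To gauge how closely the three runs agree, one option is to compute each test's spread relative to its mean. The sketch below is illustrative only: the values are copied from the overview table above, while the `relative_spread` helper and the choice of (max - min) / mean as the metric are assumptions for this example, not something the Phoronix Test Suite reports.

```python
# Values copied from the overview table (runs 1, 2, 3) for a few tests.
results = {
    "caffe: GoogleNet - CPU - 1000": (1585367, 1589890, 1586757),
    "mlpack: scikit_ica": (53.53, 52.58, 54.33),
    "ncnn: Vulkan GPU - mobilenet": (9.60, 9.79, 10.22),
    "gromacs: Water Benchmark": (2.529, 2.528, 2.527),
}


def relative_spread(values):
    """Hypothetical helper: (max - min) / mean, expressed as a percentage."""
    mean = sum(values) / len(values)
    return (max(values) - min(values)) / mean * 100.0


spreads = {name: relative_spread(vals) for name, vals in results.items()}

# Print the tests from least to most consistent across the three runs.
for name, pct in sorted(spreads.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {pct:.2f}% spread across runs")
```

A spread that is small compared with the reported standard errors suggests the differences between runs are within noise; this metric ignores the per-run SE, so treat it as a rough screen only.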
Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 (Milli-Seconds, Fewer Is Better)
  Run 1: 1585367 (SE +/- 339.92, N = 3)
  Run 2: 1589890 (SE +/- 2111.28, N = 3)
  Run 3: 1586757 (SE +/- 2136.95, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 (Milli-Seconds, Fewer Is Better)
  Run 1: 634925 (SE +/- 490.79, N = 3)
  Run 2: 636843 (SE +/- 1432.41, N = 3)
  Run 3: 638848 (SE +/- 563.79, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 (Milli-Seconds, Fewer Is Better)
  Run 1: 318121 (SE +/- 136.00, N = 3)
  Run 2: 319129 (SE +/- 472.70, N = 3)
  Run 3: 317413 (SE +/- 800.55, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Hierarchical INTegration 1.0 Test: FLOAT (QUIPs, More Is Better)
  Run 1: 387229070.98 (SE +/- 224641.82, N = 3)
  Run 2: 387185459.43 (SE +/- 145428.42, N = 3)
  Run 3: 388597973.07 (SE +/- 333985.70, N = 3)
  1. (CC) gcc options: -O3 -march=native -lm
Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 (Milli-Seconds, Fewer Is Better)
  Run 1: 158702 (SE +/- 235.22, N = 3)
  Run 2: 158891 (SE +/- 34.07, N = 3)
  Run 3: 159254 (SE +/- 63.49, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 (LPS, More Is Better)
  Run 1: 46013391.3 (SE +/- 536306.21, N = 6)
  Run 2: 45773092.9 (SE +/- 211256.10, N = 3)
  Run 3: 46091339.8 (SE +/- 715922.77, N = 3)
Timed HMMer Search 3.3.1 Pfam Database Search (Seconds, Fewer Is Better)
  Run 1: 131.08 (SE +/- 0.15, N = 3)
  Run 2: 131.43 (SE +/- 0.06, N = 3)
  Run 3: 131.51 (SE +/- 0.20, N = 3)
  1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 (Milli-Seconds, Fewer Is Better)
  Run 1: 127209 (SE +/- 174.78, N = 3)
  Run 2: 127488 (SE +/- 239.20, N = 3)
  Run 3: 127527 (SE +/- 241.08, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 (Seconds, Fewer Is Better)
  Run 1: 108.15 (SE +/- 0.55, N = 3)
  Run 2: 107.56 (SE +/- 0.44, N = 3)
  Run 3: 108.11 (SE +/- 0.15, N = 3)
  1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
Mlpack Benchmark Benchmark: scikit_qda (Seconds, Fewer Is Better)
  Run 1: 44.98 (SE +/- 0.32, N = 3)
  Run 2: 45.91 (SE +/- 0.12, N = 3)
  Run 3: 45.89 (SE +/- 0.04, N = 3)
GROMACS 2020.3 Water Benchmark (Ns Per Day, More Is Better)
  Run 1: 2.529 (SE +/- 0.003, N = 3)
  Run 2: 2.528 (SE +/- 0.002, N = 3)
  Run 3: 2.527 (SE +/- 0.002, N = 3)
  1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
NCNN 20200916 Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
  Run 1: 27.88 (SE +/- 0.04, N = 3; MIN: 27.63 / MAX: 32.81)
  Run 2: 28.20 (SE +/- 0.14, N = 3; MIN: 27.77 / MAX: 40.69)
  Run 3: 28.07 (SE +/- 0.10, N = 3; MIN: 27.72 / MAX: 29.15)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: resnet50 (ms, Fewer Is Better)
  Run 1: 23.37 (SE +/- 0.11, N = 3; MIN: 23.04 / MAX: 24.13)
  Run 2: 23.55 (SE +/- 0.02, N = 3; MIN: 23.36 / MAX: 28.08)
  Run 3: 23.53 (SE +/- 0.12, N = 3; MIN: 23.13 / MAX: 25.63)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: alexnet (ms, Fewer Is Better)
  Run 1: 11.49 (SE +/- 0.07, N = 3; MIN: 11.33 / MAX: 12)
  Run 2: 11.56 (SE +/- 0.12, N = 3; MIN: 11.26 / MAX: 12.58)
  Run 3: 11.53 (SE +/- 0.13, N = 3; MIN: 11.25 / MAX: 15.96)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: resnet18 (ms, Fewer Is Better)
  Run 1: 13.89 (SE +/- 0.05, N = 3; MIN: 13.68 / MAX: 15.18)
  Run 2: 13.94 (SE +/- 0.15, N = 3; MIN: 13.53 / MAX: 14.99)
  Run 3: 13.96 (SE +/- 0.16, N = 3; MIN: 13.5 / MAX: 15.28)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: vgg16 (ms, Fewer Is Better)
  Run 1: 41.10 (SE +/- 0.19, N = 3; MIN: 40.58 / MAX: 42.83)
  Run 2: 42.30 (SE +/- 0.52, N = 3; MIN: 40.29 / MAX: 44.44)
  Run 3: 41.33 (SE +/- 0.34, N = 3; MIN: 40.29 / MAX: 125.07)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: googlenet (ms, Fewer Is Better)
  Run 1: 17.76 (SE +/- 0.22, N = 3; MIN: 17.14 / MAX: 19.41)
  Run 2: 17.75 (SE +/- 0.33, N = 3; MIN: 16.84 / MAX: 18.95)
  Run 3: 17.78 (SE +/- 0.34, N = 3; MIN: 17.02 / MAX: 54.98)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: blazeface (ms, Fewer Is Better)
  Run 1: 2.97 (SE +/- 0.02, N = 3; MIN: 2.8 / MAX: 3.43)
  Run 2: 2.93 (SE +/- 0.01, N = 3; MIN: 2.78 / MAX: 4.11)
  Run 3: 2.94 (SE +/- 0.01, N = 3; MIN: 2.79 / MAX: 3.29)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  Run 1: 8.79 (SE +/- 0.05, N = 3; MIN: 8.52 / MAX: 9.56)
  Run 2: 8.64 (SE +/- 0.04, N = 3; MIN: 8.42 / MAX: 13.4)
  Run 3: 8.69 (SE +/- 0.05, N = 3; MIN: 8.49 / MAX: 9.24)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: mnasnet (ms, Fewer Is Better)
  Run 1: 6.58 (SE +/- 0.03, N = 3; MIN: 6.37 / MAX: 7.69)
  Run 2: 6.53 (SE +/- 0.02, N = 3; MIN: 6.37 / MAX: 7.64)
  Run 3: 6.57 (SE +/- 0.02, N = 3; MIN: 6.42 / MAX: 7.23)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  Run 1: 7.31 (SE +/- 0.04, N = 3; MIN: 7.1 / MAX: 9.34)
  Run 2: 7.28 (SE +/- 0.01, N = 3; MIN: 7.02 / MAX: 8.28)
  Run 3: 7.34 (SE +/- 0.02, N = 3; MIN: 7.06 / MAX: 8.58)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  Run 1: 6.82 (SE +/- 0.02, N = 3; MIN: 6.6 / MAX: 8.32)
  Run 2: 6.78 (SE +/- 0.03, N = 3; MIN: 6.64 / MAX: 12.14)
  Run 3: 6.86 (SE +/- 0.04, N = 3; MIN: 6.67 / MAX: 9.17)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  Run 1: 7.28 (SE +/- 0.01, N = 3; MIN: 7.03 / MAX: 9.71)
  Run 2: 7.24 (SE +/- 0.04, N = 3; MIN: 6.86 / MAX: 8.46)
  Run 3: 7.33 (SE +/- 0.03, N = 3; MIN: 7.03 / MAX: 12.26)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: mobilenet (ms, Fewer Is Better)
  Run 1: 17.46 (SE +/- 0.15, N = 3; MIN: 16.88 / MAX: 98.56)
  Run 2: 17.24 (SE +/- 0.19, N = 3; MIN: 16.79 / MAX: 18.18)
  Run 3: 17.22 (SE +/- 0.08, N = 3; MIN: 16.94 / MAX: 18.34)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: CPU - Model: squeezenet (ms, Fewer Is Better)
  Run 1: 16.40 (SE +/- 0.07, N = 3; MIN: 16.11 / MAX: 17.71)
  Run 2: 16.39 (SE +/- 0.13, N = 3; MIN: 15.99 / MAX: 17.08)
  Run 3: 16.43 (SE +/- 0.13, N = 3; MIN: 16.01 / MAX: 17.7)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 (Milli-Seconds, Fewer Is Better)
  Run 1: 63797 (SE +/- 143.57, N = 3)
  Run 2: 63525 (SE +/- 97.00, N = 3)
  Run 3: 63700 (SE +/- 154.26, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Mlpack Benchmark Benchmark: scikit_ica (Seconds, Fewer Is Better)
  Run 1: 53.53 (SE +/- 0.37, N = 3)
  Run 2: 52.58 (SE +/- 0.64, N = 3)
  Run 3: 54.33 (SE +/- 0.31, N = 3)
NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
  Run 1: 21.70 (SE +/- 0.02, N = 3; MIN: 12.05 / MAX: 42.59)
  Run 2: 21.32 (SE +/- 0.10, N = 3; MIN: 13.93 / MAX: 46.1)
  Run 3: 21.05 (SE +/- 0.29, N = 3; MIN: 11.08 / MAX: 40.17)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
  Run 1: 11.79 (SE +/- 0.24, N = 3; MIN: 10.07 / MAX: 37.47)
  Run 2: 11.85 (SE +/- 0.39, N = 3; MIN: 10.05 / MAX: 35.67)
  Run 3: 11.92 (SE +/- 0.32, N = 3; MIN: 10.07 / MAX: 40.48)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
  Run 1: 28.12 (SE +/- 0.27, N = 3; MIN: 25.02 / MAX: 62.51)
  Run 2: 28.66 (SE +/- 0.29, N = 3; MIN: 24.96 / MAX: 55.15)
  Run 3: 28.36 (SE +/- 0.13, N = 3; MIN: 24.67 / MAX: 55.99)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
  Run 1: 3.83 (SE +/- 0.01, N = 3; MIN: 3.75 / MAX: 4.35)
  Run 2: 3.82 (SE +/- 0.00, N = 3; MIN: 3.75 / MAX: 4.28)
  Run 3: 3.81 (SE +/- 0.00, N = 3; MIN: 3.74 / MAX: 4.33)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
  Run 1: 80.33 (SE +/- 0.26, N = 3; MIN: 70.05 / MAX: 121.13)
  Run 2: 80.68 (SE +/- 0.21, N = 3; MIN: 70.8 / MAX: 121.57)
  Run 3: 80.10 (SE +/- 0.31, N = 3; MIN: 70.02 / MAX: 120.26)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
  Run 1: 8.63 (SE +/- 0.10, N = 3; MIN: 6.94 / MAX: 36.76)
  Run 2: 8.25 (SE +/- 0.38, N = 3; MIN: 6.93 / MAX: 44.59)
  Run 3: 8.41 (SE +/- 0.41, N = 3; MIN: 6.92 / MAX: 34.2)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
  Run 1: 0.90 (SE +/- 0.00, N = 3; MIN: 0.88 / MAX: 1.78)
  Run 2: 0.89 (SE +/- 0.00, N = 3; MIN: 0.88 / MAX: 1.08)
  Run 3: 0.89 (SE +/- 0.01, N = 3; MIN: 0.87 / MAX: 1.05)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  Run 1: 11.93 (SE +/- 0.25, N = 3; MIN: 10.02 / MAX: 43.66)
  Run 2: 12.20 (SE +/- 0.27, N = 3; MIN: 10.04 / MAX: 36.91)
  Run 3: 12.25 (SE +/- 0.07, N = 3; MIN: 9.98 / MAX: 38.67)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
  Run 1: 4.44 (SE +/- 0.00, N = 3; MIN: 4.29 / MAX: 5.12)
  Run 2: 4.44 (SE +/- 0.01, N = 3; MIN: 4.29 / MAX: 4.8)
  Run 3: 4.44 (SE +/- 0.01, N = 3; MIN: 4.29 / MAX: 9.71)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  Run 1: 3.20 (SE +/- 0.00, N = 3; MIN: 3.14 / MAX: 3.51)
  Run 2: 3.20 (SE +/- 0.00, N = 3; MIN: 3.14 / MAX: 4.02)
  Run 3: 3.20 (SE +/- 0.00, N = 3; MIN: 3.15 / MAX: 4.67)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  Run 1: 8.35 (SE +/- 0.06, N = 3; MIN: 7 / MAX: 36)
  Run 2: 8.23 (SE +/- 0.24, N = 3; MIN: 7 / MAX: 39.95)
  Run 3: 8.46 (SE +/- 0.18, N = 3; MIN: 7 / MAX: 32.09)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  Run 1: 4.35 (SE +/- 0.01, N = 3; MIN: 4.18 / MAX: 4.71)
  Run 2: 4.35 (SE +/- 0.00, N = 3; MIN: 4.16 / MAX: 4.71)
  Run 3: 4.35 (SE +/- 0.01, N = 3; MIN: 4.18 / MAX: 4.92)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
  Run 1: 9.60 (SE +/- 0.66, N = 3; MIN: 7.7 / MAX: 35.6)
  Run 2: 9.79 (SE +/- 0.43, N = 3; MIN: 7.28 / MAX: 27.36)
  Run 3: 10.22 (SE +/- 0.43, N = 3; MIN: 6.48 / MAX: 44.78)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 Target: Vulkan GPU - Model: squeezenet (ms, Fewer Is Better)
  Run 1: 6.22 (SE +/- 0.08, N = 3; MIN: 5.94 / MAX: 30.39)
  Run 2: 6.27 (SE +/- 0.09, N = 3; MIN: 5.92 / MAX: 16.22)
  Run 3: 6.13 (SE +/- 0.02, N = 3; MIN: 5.93 / MAX: 9.53)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
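The NCNN results cover the same models on both the CPU and Vulkan GPU targets, so a per-model ratio gives a rough view of where the GPU path helps. The sketch below uses run 1 averages copied from the NCNN results above for a handful of models; the `speedup` helper and the dictionary layout are assumptions made for illustration, and the ratio ignores the wide MIN/MAX ranges reported for the Vulkan runs.

```python
# (CPU ms, Vulkan GPU ms) averages from run 1 of the NCNN results.
run1_ms = {
    "resnet18": (13.89, 3.83),
    "resnet50": (23.37, 11.79),
    "alexnet": (11.49, 28.12),
    "vgg16": (41.10, 80.33),
    "blazeface": (2.97, 0.90),
}


def speedup(cpu_ms, gpu_ms):
    """Hypothetical helper: values above 1.0 mean the Vulkan GPU target was faster."""
    return cpu_ms / gpu_ms


ratios = {model: speedup(cpu, gpu) for model, (cpu, gpu) in run1_ms.items()}

for model, ratio in ratios.items():
    faster = "Vulkan GPU" if ratio > 1.0 else "CPU"
    print(f"{model}: CPU/GPU time ratio {ratio:.2f} ({faster} faster)")
```

On this sample, the Vulkan target is faster for resnet18, resnet50 and blazeface, while the CPU is faster for alexnet and vgg16.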
Mlpack Benchmark Benchmark: scikit_linearridgeregression (Seconds, Fewer Is Better)
  Run 1: 1.44 (SE +/- 0.01, N = 3)
  Run 2: 1.43 (SE +/- 0.01, N = 3)
  Run 3: 1.43 (SE +/- 0.01, N = 3)
Mlpack Benchmark Benchmark: scikit_svm (Seconds, Fewer Is Better)
  Run 1: 19.75 (SE +/- 0.05, N = 3)
  Run 2: 19.69 (SE +/- 0.07, N = 3)
  Run 3: 19.67 (SE +/- 0.05, N = 3)
Dolfyn 0.527 Computational Fluid Dynamics (Seconds, Fewer Is Better)
  Run 1: 15.82 (SE +/- 0.01, N = 3)
  Run 2: 15.86 (SE +/- 0.01, N = 3)
  Run 3: 15.71 (SE +/- 0.06, N = 3)
Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
  Run 1: 8.119 (SE +/- 0.058, N = 3)
  Run 2: 8.243 (SE +/- 0.032, N = 3)
  Run 3: 8.187 (SE +/- 0.029, N = 3)
  1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
FFTE 7.0 N=256, 3D Complex FFT Routine (MFLOPS, More Is Better)
  Run 1: 83303.76 (SE +/- 411.46, N = 3)
  Run 2: 83979.88 (SE +/- 111.80, N = 3)
  Run 3: 83465.84 (SE +/- 308.33, N = 3)
  1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Phoronix Test Suite v10.8.5