AMD Ryzen 9 3950X Ubuntu Linux AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce RTX 2080 Ti 11GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009247-FI-AMDRYZEN924&grw&sro .
AMD Ryzen 9 3950X Ubuntu Linux Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Run 1 Run 2 Run 3 AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz) NVIDIA TU102 HD Audio DELL P2415Q Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-47-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 450.66 4.6.0 OpenCL 1.2 CUDA 11.0.228 + OpenCL 2.0 AMD-APP (3182.0) 1.2.133 GCC 9.3.0 + CUDA 11.0 ext4 3840x2160 NVIDIA GeForce RTX 2080 Ti 11GB (420/405MHz) NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013 OpenCL Details - GPU Compute Cores: 4352 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
AMD Ryzen 9 3950X Ubuntu Linux libraw: Post-Processing Benchmark webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression espeak: Text-To-Speech Synthesis plaidml: No - Training - Mobilenet - OpenCL plaidml: No - Inference - IMDB LSTM - OpenCL plaidml: No - Inference - Mobilenet - OpenCL plaidml: Yes - Inference - Mobilenet - OpenCL plaidml: No - Inference - DenseNet 201 - OpenCL lczero: OpenCL opencv: Features 2D opencv: Object Detection rodinia: OpenCL Particle Filter arrayfire: Conjugate Gradient OpenCL osbench: Create Files osbench: Create Threads osbench: Launch Programs osbench: Create Processes osbench: Memory Allocations perf-bench: Epoll Wait perf-bench: Futex Hash perf-bench: Memcpy 1MB perf-bench: Memset 1MB perf-bench: Sched Pipe mpv: Big Buck Bunny Sunflower 1080p - Software Only perf-bench: Futex Lock-Pi mpv: Big Buck Bunny Sunflower 4K - Software Only perf-bench: Syscall Basic blender: BMW27 - CUDA blender: Classroom - CUDA blender: Fishy Cat - CUDA blender: Barbershop - CUDA blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Barbershop - NVIDIA OptiX blender: Pabellon Barcelona - CUDA blender: Pabellon Barcelona - NVIDIA OptiX fahbench: hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS mixbench: NVIDIA CUDA - Integer mixbench: NVIDIA CUDA - Half Precision mixbench: NVIDIA CUDA - Double Precision mixbench: NVIDIA CUDA - Single Precision namd-cuda: ATPase Simulation - 327,506 Atoms octanebench: Total Score redshift: financebench: Black-Scholes OpenCL cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth mandelgpu: GPU viennacl: OpenCL LU Factorization influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 influxdb: 1024 - 10000 - 2,5000,1 - 10000 Run 1 Run 2 Run 3 35.30 1.456 2.225 15.505 6.718 32.337 26.779 187.74 749.41 2409.86 2750.18 213.43 11693 149676 37391 4.485 1.676 11.187511 13.888200 37.233829 28.947194 65.836350 33012 5042694 15.340630 73.239641 392388 1236.99 451 372.29 21570020 40.73 151.94 73.15 538.72 20.16 73.37 33.10 896.83 292.14 103.93 287.4210 56554000000 17962833333 880967 2469700000 650967 14651.01 32630.52 440.80 14126.10 0.17955 308.84294 247 6.030 325.3 545.4 447.7 13318.75 13379.77 522.54 507.56 450731366.0 79.4030 1349119.4 1503297.6 1534587.9 34.98 1.424 2.212 15.876 6.811 32.879 26.605 187.63 751.39 2421.41 2763.05 212.51 11673 147838 36473 4.459 1.671 10.591332 13.912996 37.083626 28.053125 64.730247 34159 5021169 15.362763 73.369746 397063 1249.00 454 374.39 22900058 40.77 152.47 73.27 538.60 20.25 73.33 33.05 893.03 292.45 104.28 288.8771 56137133333 17822700000 877133 2451866667 646367 14263.82 32596.27 419.44 13791.73 0.17951 310.026199 248 6.037 325.2 545.4 449.8 13258.65 13387.08 521.13 507.87 455807158.0 75.7065 1372772.2 1530852.5 1562422.1 35.05 1.426 2.213 15.466 6.912 32.847 26.491 187.84 747.07 2414.96 2756.92 212.36 11657 148206 34895 4.466 1.662 10.791248 14.068445 37.697156 28.186639 65.164725 34049 5028714 15.094659 71.418354 391536 1242.68 443 372.48 22735445 40.73 152.60 73.17 538.81 20.24 73.58 33.05 893.64 292.50 104.43 288.9235 56145600000 17812100000 879667 2450466667 646600 14510.11 32409.08 428.03 15130.47 0.18004 307.443811 248 6.033 324.5 544.9 447.7 13234.84 13313.50 518.21 507.91 448875915.0 76.8259 1376801.6 1534746.9 1563580.1 OpenBenchmarking.org
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.19, N = 3 SE +/- 0.09, N = 3 35.30 34.98 35.05 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default Run 1 Run 2 Run 3 0.3276 0.6552 0.9828 1.3104 1.638 SE +/- 0.024, N = 3 SE +/- 0.024, N = 3 SE +/- 0.019, N = 3 1.456 1.424 1.426 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 Run 1 Run 2 Run 3 0.5006 1.0012 1.5018 2.0024 2.503 SE +/- 0.026, N = 3 SE +/- 0.026, N = 3 SE +/- 0.037, N = 3 2.225 2.212 2.213 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.19, N = 5 SE +/- 0.11, N = 3 SE +/- 0.23, N = 3 15.51 15.88 15.47 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.032, N = 3 SE +/- 0.071, N = 3 SE +/- 0.091, N = 3 6.718 6.811 6.912 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.39, N = 3 SE +/- 0.30, N = 3 SE +/- 0.16, N = 3 32.34 32.88 32.85 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis Run 1 Run 2 Run 3 6 12 18 24 30 SE +/- 0.08, N = 4 SE +/- 0.09, N = 4 SE +/- 0.26, N = 4 26.78 26.61 26.49 1. (CC) gcc options: -O2 -std=c99
PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Run 1 Run 2 Run 3 40 80 120 160 200 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 187.74 187.63 187.84
PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Run 1 Run 2 Run 3 160 320 480 640 800 SE +/- 2.47, N = 3 SE +/- 3.71, N = 3 SE +/- 1.84, N = 3 749.41 751.39 747.07
PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Run 1 Run 2 Run 3 500 1000 1500 2000 2500 SE +/- 3.69, N = 3 SE +/- 2.19, N = 3 SE +/- 3.60, N = 3 2409.86 2421.41 2414.96
PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Run 1 Run 2 Run 3 600 1200 1800 2400 3000 SE +/- 2.56, N = 3 SE +/- 2.19, N = 3 SE +/- 0.87, N = 3 2750.18 2763.05 2756.92
PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Run 1 Run 2 Run 3 50 100 150 200 250 SE +/- 0.02, N = 3 SE +/- 0.13, N = 3 SE +/- 0.21, N = 3 213.43 212.51 212.36
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: OpenCL Run 1 Run 2 Run 3 3K 6K 9K 12K 15K SE +/- 82.15, N = 3 SE +/- 69.91, N = 3 SE +/- 41.53, N = 3 11693 11673 11657 1. (CXX) g++ options: -flto -pthread
OpenCV Test: Features 2D OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.4 Test: Features 2D Run 1 Run 2 Run 3 30K 60K 90K 120K 150K SE +/- 1997.08, N = 12 SE +/- 2777.13, N = 9 SE +/- 3470.33, N = 12 149676 147838 148206 1. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt
OpenCV Test: Object Detection OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.4 Test: Object Detection Run 1 Run 2 Run 3 8K 16K 24K 32K 40K SE +/- 382.04, N = 3 SE +/- 484.41, N = 3 SE +/- 588.74, N = 3 37391 36473 34895 1. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter Run 1 Run 2 Run 3 1.0091 2.0182 3.0273 4.0364 5.0455 SE +/- 0.027, N = 3 SE +/- 0.016, N = 3 SE +/- 0.017, N = 3 4.485 4.459 4.466 1. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL Run 1 Run 2 Run 3 0.3771 0.7542 1.1313 1.5084 1.8855 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.006, N = 3 1.676 1.671 1.662 1. (CXX) g++ options: -rdynamic
OSBench Test: Create Files OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Files Run 1 Run 2 Run 3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 11.19 10.59 10.79 1. (CC) gcc options: -lm
OSBench Test: Create Threads OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Threads Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.26, N = 15 SE +/- 0.29, N = 15 SE +/- 0.30, N = 15 13.89 13.91 14.07 1. (CC) gcc options: -lm
OSBench Test: Launch Programs OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Launch Programs Run 1 Run 2 Run 3 9 18 27 36 45 SE +/- 0.28, N = 3 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 37.23 37.08 37.70 1. (CC) gcc options: -lm
OSBench Test: Create Processes OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Processes Run 1 Run 2 Run 3 7 14 21 28 35 SE +/- 0.34, N = 3 SE +/- 0.25, N = 3 SE +/- 0.25, N = 3 28.95 28.05 28.19 1. (CC) gcc options: -lm
OSBench Test: Memory Allocations OpenBenchmarking.org Ns Per Event, Fewer Is Better OSBench Test: Memory Allocations Run 1 Run 2 Run 3 15 30 45 60 75 SE +/- 0.84, N = 3 SE +/- 0.11, N = 3 SE +/- 0.43, N = 3 65.84 64.73 65.16 1. (CC) gcc options: -lm
perf-bench Benchmark: Epoll Wait OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Epoll Wait Run 1 Run 2 Run 3 7K 14K 21K 28K 35K SE +/- 204.50, N = 3 SE +/- 308.84, N = 3 SE +/- 438.57, N = 4 33012 34159 34049 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
perf-bench Benchmark: Futex Hash OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Hash Run 1 Run 2 Run 3 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 7897.00, N = 3 SE +/- 7280.95, N = 3 SE +/- 6643.91, N = 3 5042694 5021169 5028714 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
perf-bench Benchmark: Memcpy 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memcpy 1MB Run 1 Run 2 Run 3 4 8 12 16 20 SE +/- 0.20, N = 3 SE +/- 0.10, N = 3 SE +/- 0.18, N = 5 15.34 15.36 15.09 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
perf-bench Benchmark: Memset 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memset 1MB Run 1 Run 2 Run 3 16 32 48 64 80 SE +/- 0.94, N = 4 SE +/- 1.08, N = 4 SE +/- 1.13, N = 3 73.24 73.37 71.42 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
perf-bench Benchmark: Sched Pipe OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Sched Pipe Run 1 Run 2 Run 3 90K 180K 270K 360K 450K SE +/- 3949.47, N = 3 SE +/- 4833.02, N = 3 SE +/- 3250.27, N = 3 392388 397063 391536 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only Run 1 Run 2 Run 3 300 600 900 1200 1500 SE +/- 2.06, N = 3 SE +/- 3.00, N = 3 SE +/- 0.55, N = 3 1236.99 1249.00 1242.68 MIN: 818.67 / MAX: 1669.1 MIN: 823.81 / MAX: 1678.43 MIN: 824.09 / MAX: 1672.6 1. mpv 0.32.0
perf-bench Benchmark: Futex Lock-Pi OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Lock-Pi Run 1 Run 2 Run 3 100 200 300 400 500 SE +/- 2.08, N = 3 SE +/- 3.89, N = 15 SE +/- 5.86, N = 3 451 454 443 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only Run 1 Run 2 Run 3 80 160 240 320 400 SE +/- 0.73, N = 3 SE +/- 0.92, N = 3 SE +/- 1.32, N = 3 372.29 374.39 372.48 MIN: 286.67 / MAX: 441.69 MIN: 288.04 / MAX: 445.07 MIN: 286.84 / MAX: 432.21 1. mpv 0.32.0
perf-bench Benchmark: Syscall Basic OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Syscall Basic Run 1 Run 2 Run 3 5M 10M 15M 20M 25M SE +/- 81316.08, N = 3 SE +/- 98896.75, N = 3 SE +/- 164860.89, N = 3 21570020 22900058 22735445 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
Blender Blend File: BMW27 - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: CUDA Run 1 Run 2 Run 3 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 40.73 40.77 40.73
Blender Blend File: Classroom - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: CUDA Run 1 Run 2 Run 3 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.24, N = 3 SE +/- 0.39, N = 3 151.94 152.47 152.60
Blender Blend File: Fishy Cat - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: CUDA Run 1 Run 2 Run 3 16 32 48 64 80 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 73.15 73.27 73.17
Blender Blend File: Barbershop - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Barbershop - Compute: CUDA Run 1 Run 2 Run 3 120 240 360 480 600 SE +/- 0.16, N = 3 SE +/- 0.40, N = 3 SE +/- 0.06, N = 3 538.72 538.60 538.81
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: NVIDIA OptiX Run 1 Run 2 Run 3 5 10 15 20 25 SE +/- 0.20, N = 3 SE +/- 0.30, N = 3 SE +/- 0.26, N = 3 20.16 20.25 20.24
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: NVIDIA OptiX Run 1 Run 2 Run 3 16 32 48 64 80 SE +/- 0.25, N = 3 SE +/- 0.28, N = 3 SE +/- 0.18, N = 3 73.37 73.33 73.58
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: NVIDIA OptiX Run 1 Run 2 Run 3 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 33.10 33.05 33.05
Blender Blend File: Barbershop - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Barbershop - Compute: NVIDIA OptiX Run 1 Run 2 Run 3 200 400 600 800 1000 SE +/- 0.44, N = 3 SE +/- 0.32, N = 3 SE +/- 1.68, N = 3 896.83 893.03 893.64
Blender Blend File: Pabellon Barcelona - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: CUDA Run 1 Run 2 Run 3 60 120 180 240 300 SE +/- 0.05, N = 3 SE +/- 0.19, N = 3 SE +/- 0.04, N = 3 292.14 292.45 292.50
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 103.93 104.28 104.43
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 Run 1 Run 2 Run 3 60 120 180 240 300 SE +/- 0.87, N = 3 SE +/- 0.56, N = 3 SE +/- 0.37, N = 3 287.42 288.88 288.92
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 Run 1 Run 2 Run 3 12000M 24000M 36000M 48000M 60000M SE +/- 78954438.34, N = 3 SE +/- 48887501.24, N = 3 SE +/- 62943175.43, N = 3 56554000000 56137133333 56145600000
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 Run 1 Run 2 Run 3 4000M 8000M 12000M 16000M 20000M SE +/- 28555054.04, N = 3 SE +/- 28850361.06, N = 3 SE +/- 18200000.00, N = 3 17962833333 17822700000 17812100000
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: 7-Zip Run 1 Run 2 Run 3 200K 400K 600K 800K 1000K SE +/- 1637.41, N = 3 SE +/- 1278.45, N = 3 SE +/- 1848.72, N = 3 880967 877133 879667
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 Run 1 Run 2 Run 3 500M 1000M 1500M 2000M 2500M SE +/- 3523256.07, N = 3 SE +/- 3868390.42, N = 3 SE +/- 2796624.95, N = 3 2469700000 2451866667 2450466667
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS Run 1 Run 2 Run 3 140K 280K 420K 560K 700K SE +/- 648.93, N = 3 SE +/- 433.33, N = 3 SE +/- 346.41, N = 3 650967 646367 646600
Mixbench Backend: NVIDIA CUDA - Benchmark: Integer OpenBenchmarking.org GIOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer Run 1 Run 2 Run 3 3K 6K 9K 12K 15K SE +/- 20.03, N = 3 SE +/- 207.34, N = 15 SE +/- 28.60, N = 3 14651.01 14263.82 14510.11 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Half Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision Run 1 Run 2 Run 3 7K 14K 21K 28K 35K SE +/- 6.67, N = 3 SE +/- 18.28, N = 3 SE +/- 29.77, N = 3 32630.52 32596.27 32409.08 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Double Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision Run 1 Run 2 Run 3 100 200 300 400 500 SE +/- 0.03, N = 3 SE +/- 7.42, N = 15 SE +/- 4.74, N = 15 440.80 419.44 428.03 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Single Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision Run 1 Run 2 Run 3 3K 6K 9K 12K 15K SE +/- 646.92, N = 15 SE +/- 646.88, N = 15 SE +/- 510.15, N = 15 14126.10 13791.73 15130.47 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
NAMD CUDA ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms Run 1 Run 2 Run 3 0.0405 0.081 0.1215 0.162 0.2025 SE +/- 0.00010, N = 3 SE +/- 0.00020, N = 3 SE +/- 0.00039, N = 3 0.17955 0.17951 0.18004
OctaneBench Total Score OpenBenchmarking.org Score, More Is Better OctaneBench 4.00c Total Score Run 1 Run 2 Run 3 70 140 210 280 350 308.84 310.03 307.44
RedShift Demo OpenBenchmarking.org Seconds, Fewer Is Better RedShift Demo 3.0 Run 1 Run 2 Run 3 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 247 248 248
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-06-06 Benchmark: Black-Scholes OpenCL Run 1 Run 2 Run 3 2 4 6 8 10 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 6.030 6.037 6.033 1. (CXX) g++ options: -O3 -lOpenCL
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Run 1 Run 2 Run 3 70 140 210 280 350 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 SE +/- 0.50, N = 3 325.3 325.2 324.5 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Run 1 Run 2 Run 3 120 240 360 480 600 SE +/- 0.32, N = 3 SE +/- 0.27, N = 3 SE +/- 1.08, N = 3 545.4 545.4 544.9 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Run 1 Run 2 Run 3 100 200 300 400 500 SE +/- 0.66, N = 3 SE +/- 0.26, N = 3 SE +/- 1.25, N = 3 447.7 449.8 447.7 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT Run 1 Run 2 Run 3 3K 6K 9K 12K 15K SE +/- 159.82, N = 15 SE +/- 137.42, N = 15 SE +/- 159.10, N = 6 13318.75 13258.65 13234.84 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Run 1 Run 2 Run 3 3K 6K 9K 12K 15K SE +/- 169.22, N = 15 SE +/- 184.15, N = 15 SE +/- 178.75, N = 15 13379.77 13387.08 13313.50 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Run 1 Run 2 Run 3 110 220 330 440 550 SE +/- 1.65, N = 3 SE +/- 0.32, N = 3 SE +/- 1.44, N = 3 522.54 521.13 518.21 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Run 1 Run 2 Run 3 110 220 330 440 550 SE +/- 0.68, N = 3 SE +/- 0.76, N = 3 SE +/- 0.44, N = 3 507.56 507.87 507.91 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU Run 1 Run 2 Run 3 100M 200M 300M 400M 500M SE +/- 6814468.33, N = 3 SE +/- 5831583.77, N = 3 SE +/- 1749343.37, N = 3 450731366.0 455807158.0 448875915.0 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
ViennaCL OpenCL LU Factorization OpenBenchmarking.org GFLOPS, More Is Better ViennaCL 1.4.2 OpenCL LU Factorization Run 1 Run 2 Run 3 20 40 60 80 100 SE +/- 1.01, N = 3 SE +/- 0.98, N = 3 SE +/- 1.21, N = 3 79.40 75.71 76.83 1. (CXX) g++ options: -rdynamic -lOpenCL
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Run 1 Run 2 Run 3 300K 600K 900K 1200K 1500K SE +/- 2562.57, N = 3 SE +/- 2002.03, N = 3 SE +/- 2246.17, N = 3 1349119.4 1372772.2 1376801.6
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Run 1 Run 2 Run 3 300K 600K 900K 1200K 1500K SE +/- 691.55, N = 3 SE +/- 1988.25, N = 3 SE +/- 2016.10, N = 3 1503297.6 1530852.5 1534746.9
InfluxDB Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Run 1 Run 2 Run 3 300K 600K 900K 1200K 1500K SE +/- 2777.14, N = 3 SE +/- 2819.29, N = 3 SE +/- 929.80, N = 3 1534587.9 1562422.1 1563580.1
Phoronix Test Suite v10.8.5