AMD Ryzen 9 3950X Ubuntu Linux

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce RTX 2080 Ti 11GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2009247-FI-AMDRYZEN924&grs&sor.

AMD Ryzen 9 3950X Ubuntu LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRun 1Run 2Run 3AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TU102 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-47-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.228 + OpenCL 2.0 AMD-APP (3182.0)1.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160NVIDIA GeForce RTX 2080 Ti 11GB (420/405MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013OpenCL Details- GPU Compute Cores: 4352Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD Ryzen 9 3950X Ubuntu Linuxopencv: Object Detectionperf-bench: Syscall Basicosbench: Create Filesviennacl: OpenCL LU Factorizationperf-bench: Epoll Waitosbench: Create Processeswebp: Quality 100, Highest Compressionperf-bench: Memset 1MBmixbench: NVIDIA CUDA - Integerwebp: Quality 100, Losslessperf-bench: Futex Lock-Piwebp: Defaultinfluxdb: 64 - 10000 - 2,5000,1 - 10000influxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 1024 - 10000 - 2,5000,1 - 10000perf-bench: Memcpy 1MBosbench: Memory Allocationswebp: Quality 100, Lossless, Highest Compressionosbench: Launch Programsmandelgpu: GPUperf-bench: Sched Pipeespeak: Text-To-Speech Synthesismpv: Big Buck Bunny Sunflower 1080p - Software Onlylibraw: Post-Processing Benchmarkhashcat: SHA1arrayfire: Conjugate Gradient OpenCLoctanebench: Total Scoreclpeak: Double-Precision Doublehashcat: SHA-512hashcat: MD5hashcat: TrueCrypt RIPEMD160 + XTSmixbench: NVIDIA CUDA - Half Precisionclpeak: Integer Compute INTwebp: Quality 100rodinia: OpenCL Particle Filterplaidml: No - Inference - IMDB LSTM - OpenCLmpv: Big Buck Bunny Sunflower 4K - Software Onlyclpeak: Single-Precision Floatfahbench: plaidml: No - Inference - DenseNet 201 - OpenCLblender: Pabellon Barcelona - NVIDIA OptiXplaidml: No - Inference - Mobilenet - OpenCLcl-mem: Writeplaidml: Yes - Inference - Mobilenet - OpenCLblender: BMW27 - NVIDIA OptiXhashcat: 7-Zipblender: Classroom - CUDAperf-bench: Futex Hashblender: Barbershop - NVIDIA OptiXredshift: blender: Classroom - NVIDIA OptiXlczero: OpenCLnamd-cuda: ATPase Simulation - 327,506 Atomscl-mem: Copyblender: Fishy Cat - CUDAblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - CUDAfinancebench: Black-Scholes OpenCLplaidml: No - Training - Mobilenet - OpenCLblender: BMW27 - CUDAcl-mem: Readclpeak: Global Memory Bandwidthblender: Barbershop - CUDAopencv: Features 2Dosbench: Create Threadsmixbench: NVIDIA CUDA - Single Precisionmixbench: NVIDIA CUDA - Double PrecisionRun 1Run 2Run 3373912157002011.18751179.40303301228.9471946.71873.23964114651.0115.5054511.4561503297.61349119.41534587.915.34063065.83635032.33737.233829450731366.039238826.7791236.9935.30179628333331.676308.84294522.5424697000005655400000065096732630.5213318.752.2254.485749.41372.2913379.77287.4210213.43103.932409.86447.72750.1820.16880967151.945042694896.8324773.37116930.17955325.373.1533.10292.146.030187.7440.73545.4507.56538.7214967613.88820014126.10440.80364732290005810.59133275.70653415928.0531256.81173.36974614263.8215.8764541.4241530852.51372772.21562422.115.36276364.73024732.87937.083626455807158.039706326.6051249.0034.98178227000001.671310.026199521.1324518666675613713333364636732596.2713258.652.2124.459751.39374.3913387.08288.8771212.51104.282421.41449.82763.0520.25877133152.475021169893.0324873.33116730.17951325.273.2733.05292.456.037187.6340.77545.4507.87538.6014783813.91299613791.73419.44348952273544510.79124876.82593404928.1866396.91271.41835414510.1115.4664431.4261534746.91376801.61563580.115.09465965.16472532.84737.697156448875915.039153626.4911242.6835.05178121000001.662307.443811518.2124504666675614560000064660032409.0813234.842.2134.466747.07372.4813313.50288.9235212.36104.432414.96447.72756.9220.24879667152.605028714893.6424873.58116570.18004324.573.1733.05292.506.033187.8440.73544.9507.91538.8114820614.06844515130.47428.03OpenBenchmarking.org

OpenCV

Test: Object Detection

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Object DetectionRun 3Run 2Run 18K16K24K32K40KSE +/- 588.74, N = 3SE +/- 484.41, N = 3SE +/- 382.04, N = 33489536473373911. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

perf-bench

Benchmark: Syscall Basic

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall BasicRun 2Run 3Run 15M10M15M20M25MSE +/- 98896.75, N = 3SE +/- 164860.89, N = 3SE +/- 81316.08, N = 32290005822735445215700201. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OSBench

Test: Create Files

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create FilesRun 2Run 3Run 13691215SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 310.5910.7911.191. (CC) gcc options: -lm

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationRun 1Run 3Run 220406080100SE +/- 1.01, N = 3SE +/- 1.21, N = 3SE +/- 0.98, N = 379.4076.8375.711. (CXX) g++ options: -rdynamic -lOpenCL

perf-bench

Benchmark: Epoll Wait

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll WaitRun 2Run 3Run 17K14K21K28K35KSE +/- 308.84, N = 3SE +/- 438.57, N = 4SE +/- 204.50, N = 33415934049330121. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OSBench

Test: Create Processes

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ProcessesRun 2Run 3Run 1714212835SE +/- 0.25, N = 3SE +/- 0.25, N = 3SE +/- 0.34, N = 328.0528.1928.951. (CC) gcc options: -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionRun 1Run 2Run 3246810SE +/- 0.032, N = 3SE +/- 0.071, N = 3SE +/- 0.091, N = 36.7186.8116.9121. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

perf-bench

Benchmark: Memset 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBRun 2Run 1Run 31632486480SE +/- 1.08, N = 4SE +/- 0.94, N = 4SE +/- 1.13, N = 373.3773.2471.421. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

Mixbench

Backend: NVIDIA CUDA - Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: IntegerRun 1Run 3Run 23K6K9K12K15KSE +/- 20.03, N = 3SE +/- 28.60, N = 3SE +/- 207.34, N = 1514651.0114510.1114263.821. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessRun 3Run 1Run 248121620SE +/- 0.23, N = 3SE +/- 0.19, N = 5SE +/- 0.11, N = 315.4715.5115.881. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

perf-bench

Benchmark: Futex Lock-Pi

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-PiRun 2Run 1Run 3100200300400500SE +/- 3.89, N = 15SE +/- 2.08, N = 3SE +/- 5.86, N = 34544514431. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultRun 2Run 3Run 10.32760.65520.98281.31041.638SE +/- 0.024, N = 3SE +/- 0.019, N = 3SE +/- 0.024, N = 31.4241.4261.4561. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

InfluxDB

Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 2016.10, N = 3SE +/- 1988.25, N = 3SE +/- 691.55, N = 31534746.91530852.51503297.6

InfluxDB

Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 2246.17, N = 3SE +/- 2002.03, N = 3SE +/- 2562.57, N = 31376801.61372772.21349119.4

InfluxDB

Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 929.80, N = 3SE +/- 2819.29, N = 3SE +/- 2777.14, N = 31563580.11562422.11534587.9

perf-bench

Benchmark: Memcpy 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBRun 2Run 1Run 348121620SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 0.18, N = 515.3615.3415.091. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OSBench

Test: Memory Allocations

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory AllocationsRun 2Run 3Run 11530456075SE +/- 0.11, N = 3SE +/- 0.43, N = 3SE +/- 0.84, N = 364.7365.1665.841. (CC) gcc options: -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionRun 1Run 3Run 2816243240SE +/- 0.39, N = 3SE +/- 0.16, N = 3SE +/- 0.30, N = 332.3432.8532.881. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OSBench

Test: Launch Programs

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Launch ProgramsRun 2Run 1Run 3918273645SE +/- 0.25, N = 3SE +/- 0.28, N = 3SE +/- 0.19, N = 337.0837.2337.701. (CC) gcc options: -lm

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURun 2Run 1Run 3100M200M300M400M500MSE +/- 5831583.77, N = 3SE +/- 6814468.33, N = 3SE +/- 1749343.37, N = 3455807158.0450731366.0448875915.01. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched PipeRun 2Run 1Run 390K180K270K360K450KSE +/- 4833.02, N = 3SE +/- 3949.47, N = 3SE +/- 3250.27, N = 33970633923883915361. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisRun 3Run 2Run 1612182430SE +/- 0.26, N = 4SE +/- 0.09, N = 4SE +/- 0.08, N = 426.4926.6126.781. (CC) gcc options: -O2 -std=c99

MPV

Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only

OpenBenchmarking.orgFPS, More Is BetterMPVVideo Input: Big Buck Bunny Sunflower 1080p - Decode: Software OnlyRun 2Run 3Run 130060090012001500SE +/- 3.00, N = 3SE +/- 0.55, N = 3SE +/- 2.06, N = 31249.001242.681236.99MIN: 823.81 / MAX: 1678.43MIN: 824.09 / MAX: 1672.6MIN: 818.67 / MAX: 1669.11. mpv 0.32.0

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkRun 1Run 3Run 2816243240SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 335.3035.0534.981. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1Run 1Run 2Run 34000M8000M12000M16000M20000MSE +/- 28555054.04, N = 3SE +/- 28850361.06, N = 3SE +/- 18200000.00, N = 3179628333331782270000017812100000

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLRun 3Run 2Run 10.37710.75421.13131.50841.8855SE +/- 0.006, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 31.6621.6711.6761. (CXX) g++ options: -rdynamic

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreRun 2Run 1Run 370140210280350310.03308.84307.44

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleRun 1Run 2Run 3110220330440550SE +/- 1.65, N = 3SE +/- 0.32, N = 3SE +/- 1.44, N = 3522.54521.13518.211. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-512Run 1Run 2Run 3500M1000M1500M2000M2500MSE +/- 3523256.07, N = 3SE +/- 3868390.42, N = 3SE +/- 2796624.95, N = 3246970000024518666672450466667

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD5Run 1Run 3Run 212000M24000M36000M48000M60000MSE +/- 78954438.34, N = 3SE +/- 62943175.43, N = 3SE +/- 48887501.24, N = 3565540000005614560000056137133333

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTSRun 1Run 3Run 2140K280K420K560K700KSE +/- 648.93, N = 3SE +/- 346.41, N = 3SE +/- 433.33, N = 3650967646600646367

Mixbench

Backend: NVIDIA CUDA - Benchmark: Half Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Half PrecisionRun 1Run 2Run 37K14K21K28K35KSE +/- 6.67, N = 3SE +/- 18.28, N = 3SE +/- 29.77, N = 332630.5232596.2732409.081. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRun 1Run 2Run 33K6K9K12K15KSE +/- 159.82, N = 15SE +/- 137.42, N = 15SE +/- 159.10, N = 613318.7513258.6513234.841. (CXX) g++ options: -O3 -rdynamic -lOpenCL

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100Run 2Run 3Run 10.50061.00121.50182.00242.503SE +/- 0.026, N = 3SE +/- 0.037, N = 3SE +/- 0.026, N = 32.2122.2132.2251. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRun 2Run 3Run 11.00912.01823.02734.03645.0455SE +/- 0.016, N = 3SE +/- 0.017, N = 3SE +/- 0.027, N = 34.4594.4664.4851. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLRun 2Run 1Run 3160320480640800SE +/- 3.71, N = 3SE +/- 2.47, N = 3SE +/- 1.84, N = 3751.39749.41747.07

MPV

Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only

OpenBenchmarking.orgFPS, More Is BetterMPVVideo Input: Big Buck Bunny Sunflower 4K - Decode: Software OnlyRun 2Run 3Run 180160240320400SE +/- 0.92, N = 3SE +/- 1.32, N = 3SE +/- 0.73, N = 3374.39372.48372.29MIN: 288.04 / MAX: 445.07MIN: 286.84 / MAX: 432.21MIN: 286.67 / MAX: 441.691. mpv 0.32.0

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRun 2Run 1Run 33K6K9K12K15KSE +/- 184.15, N = 15SE +/- 169.22, N = 15SE +/- 178.75, N = 1513387.0813379.7713313.501. (CXX) g++ options: -O3 -rdynamic -lOpenCL

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2Run 3Run 2Run 160120180240300SE +/- 0.37, N = 3SE +/- 0.56, N = 3SE +/- 0.87, N = 3288.92288.88287.42

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCLRun 1Run 2Run 350100150200250SE +/- 0.02, N = 3SE +/- 0.13, N = 3SE +/- 0.21, N = 3213.43212.51212.36

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRun 1Run 2Run 320406080100SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3103.93104.28104.43

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLRun 2Run 3Run 15001000150020002500SE +/- 2.19, N = 3SE +/- 3.60, N = 3SE +/- 3.69, N = 32421.412414.962409.86

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRun 2Run 3Run 1100200300400500SE +/- 0.26, N = 3SE +/- 1.25, N = 3SE +/- 0.66, N = 3449.8447.7447.71. (CC) gcc options: -O2 -flto -lOpenCL

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLRun 2Run 3Run 16001200180024003000SE +/- 2.19, N = 3SE +/- 0.87, N = 3SE +/- 2.56, N = 32763.052756.922750.18

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiXRun 1Run 3Run 2510152025SE +/- 0.20, N = 3SE +/- 0.26, N = 3SE +/- 0.30, N = 320.1620.2420.25

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-ZipRun 1Run 3Run 2200K400K600K800K1000KSE +/- 1637.41, N = 3SE +/- 1848.72, N = 3SE +/- 1278.45, N = 3880967879667877133

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CUDARun 1Run 2Run 3306090120150SE +/- 0.01, N = 3SE +/- 0.24, N = 3SE +/- 0.39, N = 3151.94152.47152.60

perf-bench

Benchmark: Futex Hash

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex HashRun 1Run 3Run 21.1M2.2M3.3M4.4M5.5MSE +/- 7897.00, N = 3SE +/- 6643.91, N = 3SE +/- 7280.95, N = 35042694502871450211691. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiXRun 2Run 3Run 12004006008001000SE +/- 0.32, N = 3SE +/- 1.68, N = 3SE +/- 0.44, N = 3893.03893.64896.83

RedShift Demo

OpenBenchmarking.orgSeconds, Fewer Is BetterRedShift Demo 3.0Run 1Run 2Run 350100150200250SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3247248248

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiXRun 2Run 1Run 31632486480SE +/- 0.28, N = 3SE +/- 0.25, N = 3SE +/- 0.18, N = 373.3373.3773.58

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCLRun 1Run 2Run 33K6K9K12K15KSE +/- 82.15, N = 3SE +/- 69.91, N = 3SE +/- 41.53, N = 31169311673116571. (CXX) g++ options: -flto -pthread

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsRun 2Run 1Run 30.04050.0810.12150.1620.2025SE +/- 0.00020, N = 3SE +/- 0.00010, N = 3SE +/- 0.00039, N = 30.179510.179550.18004

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRun 1Run 2Run 370140210280350SE +/- 0.19, N = 3SE +/- 0.15, N = 3SE +/- 0.50, N = 3325.3325.2324.51. (CC) gcc options: -O2 -flto -lOpenCL

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CUDARun 1Run 3Run 21632486480SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 373.1573.1773.27

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiXRun 2Run 3Run 1816243240SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 333.0533.0533.10

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CUDARun 1Run 2Run 360120180240300SE +/- 0.05, N = 3SE +/- 0.19, N = 3SE +/- 0.04, N = 3292.14292.45292.50

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLRun 1Run 3Run 2246810SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 36.0306.0336.0371. (CXX) g++ options: -O3 -lOpenCL

PlaidML

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLRun 3Run 1Run 24080120160200SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3187.84187.74187.63

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CUDARun 1Run 3Run 2918273645SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 340.7340.7340.77

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRun 2Run 1Run 3120240360480600SE +/- 0.27, N = 3SE +/- 0.32, N = 3SE +/- 1.08, N = 3545.4545.4544.91. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRun 3Run 2Run 1110220330440550SE +/- 0.44, N = 3SE +/- 0.76, N = 3SE +/- 0.68, N = 3507.91507.87507.561. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CUDARun 2Run 1Run 3120240360480600SE +/- 0.40, N = 3SE +/- 0.16, N = 3SE +/- 0.06, N = 3538.60538.72538.81

OpenCV

Test: Features 2D

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Features 2DRun 2Run 3Run 130K60K90K120K150KSE +/- 2777.13, N = 9SE +/- 3470.33, N = 12SE +/- 1997.08, N = 121478381482061496761. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OSBench

Test: Create Threads

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ThreadsRun 1Run 2Run 348121620SE +/- 0.26, N = 15SE +/- 0.29, N = 15SE +/- 0.30, N = 1513.8913.9114.071. (CC) gcc options: -lm

Mixbench

Backend: NVIDIA CUDA - Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Single PrecisionRun 3Run 1Run 23K6K9K12K15KSE +/- 510.15, N = 15SE +/- 646.92, N = 15SE +/- 646.88, N = 1515130.4714126.1013791.731. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Double Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Double PrecisionRun 1Run 3Run 2100200300400500SE +/- 0.03, N = 3SE +/- 4.74, N = 15SE +/- 7.42, N = 15440.80428.03419.441. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2


Phoronix Test Suite v10.8.5