AMD Ryzen 9 3950X Ubuntu Linux

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce RTX 2080 Ti 11GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2009247-FI-AMDRYZEN924
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
CPU Massive 4 Tests
Creator Workloads 4 Tests
HPC - High Performance Computing 5 Tests
Imaging 2 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 3 Tests
Multi-Core 3 Tests
NVIDIA GPU Compute 16 Tests
OpenCL 5 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
Run 1
September 23 2020
  3 Hours, 47 Minutes
Run 2
September 24 2020
  3 Hours, 43 Minutes
Run 3
September 24 2020
  3 Hours, 44 Minutes
Invert Hiding All Results Option
  3 Hours, 45 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD Ryzen 9 3950X Ubuntu LinuxOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA GeForce RTX 2080 Ti 11GB (420/405MHz)NVIDIA TU102 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-47-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.228 + OpenCL 2.0 AMD-APP (3182.0)1.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionAMD Ryzen 9 3950X Ubuntu Linux BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013- GPU Compute Cores: 4352- Python 3.8.2- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Run 1Run 2Run 3Result OverviewPhoronix Test Suite100%101%102%104%105%ViennaCLMixbenchOSBenchInfluxDBperf-benchMandelGPUeSpeak-NG Speech EngineLibRawArrayFireOctaneBenchMPVHashcatRodiniaFAHBenchWebP Image EncodeRedShift DemoclpeakLeelaChessZeroNAMD CUDAcl-memPlaidMLFinanceBenchBlenderOpenCV

AMD Ryzen 9 3950X Ubuntu Linuxarrayfire: Conjugate Gradient OpenCLblender: BMW27 - CUDAblender: Classroom - CUDAblender: Fishy Cat - CUDAblender: Barbershop - CUDAblender: BMW27 - NVIDIA OptiXblender: Classroom - NVIDIA OptiXblender: Fishy Cat - NVIDIA OptiXblender: Barbershop - NVIDIA OptiXblender: Pabellon Barcelona - CUDAblender: Pabellon Barcelona - NVIDIA OptiXcl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthespeak: Text-To-Speech Synthesisfahbench: financebench: Black-Scholes OpenCLhashcat: MD5hashcat: SHA1hashcat: 7-Ziphashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTSinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000influxdb: 1024 - 10000 - 2,5000,1 - 10000lczero: OpenCLlibraw: Post-Processing Benchmarkmandelgpu: GPUmixbench: NVIDIA CUDA - Integermixbench: NVIDIA CUDA - Half Precisionmixbench: NVIDIA CUDA - Double Precisionmixbench: NVIDIA CUDA - Single Precisionmpv: Big Buck Bunny Sunflower 4K - Software Onlympv: Big Buck Bunny Sunflower 1080p - Software Onlynamd-cuda: ATPase Simulation - 327,506 Atomsoctanebench: Total Scoreopencv: Features 2Dopencv: Object Detectionosbench: Create Filesosbench: Create Threadsosbench: Launch Programsosbench: Create Processesosbench: Memory Allocationsperf-bench: Epoll Waitperf-bench: Futex Hashperf-bench: Memcpy 1MBperf-bench: Memset 1MBperf-bench: Sched Pipeperf-bench: Futex Lock-Piperf-bench: Syscall Basicplaidml: No - Training - Mobilenet - OpenCLplaidml: No - Inference - IMDB LSTM - OpenCLplaidml: No - Inference - Mobilenet - OpenCLplaidml: Yes - Inference - Mobilenet - OpenCLplaidml: No - Inference - DenseNet 201 - OpenCLredshift: rodinia: OpenCL Particle Filterviennacl: OpenCL LU Factorizationwebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest CompressionRun 1Run 2Run 31.67640.73151.9473.15538.7220.1673.3733.10896.83292.14103.93325.3545.4447.713318.7513379.77522.54507.5626.779287.42106.030565540000001796283333388096724697000006509671349119.41503297.61534587.91169335.30450731366.014651.0132630.52440.8014126.10372.291236.990.17955308.842941496763739111.18751113.88820037.23382928.94719465.83635033012504269415.34063073.23964139238845121570020187.74749.412409.862750.18213.432474.48579.40301.4562.22515.5056.71832.3371.67140.77152.4773.27538.6020.2573.3333.05893.03292.45104.28325.2545.4449.813258.6513387.08521.13507.8726.605288.87716.037561371333331782270000087713324518666676463671372772.21530852.51562422.11167334.98455807158.014263.8232596.27419.4413791.73374.391249.000.17951310.0261991478383647310.59133213.91299637.08362628.05312564.73024734159502116915.36276373.36974639706345422900058187.63751.392421.412763.05212.512484.45975.70651.4242.21215.8766.81132.8791.66240.73152.6073.17538.8120.2473.5833.05893.64292.50104.43324.5544.9447.713234.8413313.50518.21507.9126.491288.92356.033561456000001781210000087966724504666676466001376801.61534746.91563580.11165735.05448875915.014510.1132409.08428.0315130.47372.481242.680.18004307.4438111482063489510.79124814.06844537.69715628.18663965.16472534049502871415.09465971.41835439153644322735445187.84747.072414.962756.92212.362484.46676.82591.4262.21315.4666.91232.847OpenBenchmarking.org

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLRun 3Run 2Run 10.37710.75421.13131.50841.8855SE +/- 0.006, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 31.6621.6711.6761. (CXX) g++ options: -rdynamic

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CUDARun 3Run 2Run 1918273645SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 340.7340.7740.73

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CUDARun 3Run 2Run 1306090120150SE +/- 0.39, N = 3SE +/- 0.24, N = 3SE +/- 0.01, N = 3152.60152.47151.94

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CUDARun 3Run 2Run 11632486480SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 373.1773.2773.15

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CUDARun 3Run 2Run 1120240360480600SE +/- 0.06, N = 3SE +/- 0.40, N = 3SE +/- 0.16, N = 3538.81538.60538.72

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiXRun 3Run 2Run 1510152025SE +/- 0.26, N = 3SE +/- 0.30, N = 3SE +/- 0.20, N = 320.2420.2520.16

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiXRun 3Run 2Run 11632486480SE +/- 0.18, N = 3SE +/- 0.28, N = 3SE +/- 0.25, N = 373.5873.3373.37

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiXRun 3Run 2Run 1816243240SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 333.0533.0533.10

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiXRun 3Run 2Run 12004006008001000SE +/- 1.68, N = 3SE +/- 0.32, N = 3SE +/- 0.44, N = 3893.64893.03896.83

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CUDARun 3Run 2Run 160120180240300SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 3292.50292.45292.14

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRun 3Run 2Run 120406080100SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3104.43104.28103.93

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRun 3Run 2Run 170140210280350SE +/- 0.50, N = 3SE +/- 0.15, N = 3SE +/- 0.19, N = 3324.5325.2325.31. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRun 3Run 2Run 1120240360480600SE +/- 1.08, N = 3SE +/- 0.27, N = 3SE +/- 0.32, N = 3544.9545.4545.41. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRun 3Run 2Run 1100200300400500SE +/- 1.25, N = 3SE +/- 0.26, N = 3SE +/- 0.66, N = 3447.7449.8447.71. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRun 3Run 2Run 13K6K9K12K15KSE +/- 159.10, N = 6SE +/- 137.42, N = 15SE +/- 159.82, N = 1513234.8413258.6513318.751. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRun 3Run 2Run 13K6K9K12K15KSE +/- 178.75, N = 15SE +/- 184.15, N = 15SE +/- 169.22, N = 1513313.5013387.0813379.771. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleRun 3Run 2Run 1110220330440550SE +/- 1.44, N = 3SE +/- 0.32, N = 3SE +/- 1.65, N = 3518.21521.13522.541. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRun 3Run 2Run 1110220330440550SE +/- 0.44, N = 3SE +/- 0.76, N = 3SE +/- 0.68, N = 3507.91507.87507.561. (CXX) g++ options: -O3 -rdynamic -lOpenCL

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisRun 3Run 2Run 1612182430SE +/- 0.26, N = 4SE +/- 0.09, N = 4SE +/- 0.08, N = 426.4926.6126.781. (CC) gcc options: -O2 -std=c99

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2Run 3Run 2Run 160120180240300SE +/- 0.37, N = 3SE +/- 0.56, N = 3SE +/- 0.87, N = 3288.92288.88287.42

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLRun 3Run 2Run 1246810SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 36.0336.0376.0301. (CXX) g++ options: -O3 -lOpenCL

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD5Run 3Run 2Run 112000M24000M36000M48000M60000MSE +/- 62943175.43, N = 3SE +/- 48887501.24, N = 3SE +/- 78954438.34, N = 3561456000005613713333356554000000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1Run 3Run 2Run 14000M8000M12000M16000M20000MSE +/- 18200000.00, N = 3SE +/- 28850361.06, N = 3SE +/- 28555054.04, N = 3178121000001782270000017962833333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-ZipRun 3Run 2Run 1200K400K600K800K1000KSE +/- 1848.72, N = 3SE +/- 1278.45, N = 3SE +/- 1637.41, N = 3879667877133880967

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-512Run 3Run 2Run 1500M1000M1500M2000M2500MSE +/- 2796624.95, N = 3SE +/- 3868390.42, N = 3SE +/- 3523256.07, N = 3245046666724518666672469700000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTSRun 3Run 2Run 1140K280K420K560K700KSE +/- 346.41, N = 3SE +/- 433.33, N = 3SE +/- 648.93, N = 3646600646367650967

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 2246.17, N = 3SE +/- 2002.03, N = 3SE +/- 2562.57, N = 31376801.61372772.21349119.4

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 2016.10, N = 3SE +/- 1988.25, N = 3SE +/- 691.55, N = 31534746.91530852.51503297.6

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 929.80, N = 3SE +/- 2819.29, N = 3SE +/- 2777.14, N = 31563580.11562422.11534587.9

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCLRun 3Run 2Run 13K6K9K12K15KSE +/- 41.53, N = 3SE +/- 69.91, N = 3SE +/- 82.15, N = 31165711673116931. (CXX) g++ options: -flto -pthread

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkRun 3Run 2Run 1816243240SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.02, N = 335.0534.9835.301. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURun 3Run 2Run 1100M200M300M400M500MSE +/- 1749343.37, N = 3SE +/- 5831583.77, N = 3SE +/- 6814468.33, N = 3448875915.0455807158.0450731366.01. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: IntegerRun 3Run 2Run 13K6K9K12K15KSE +/- 28.60, N = 3SE +/- 207.34, N = 15SE +/- 20.03, N = 314510.1114263.8214651.011. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Half PrecisionRun 3Run 2Run 17K14K21K28K35KSE +/- 29.77, N = 3SE +/- 18.28, N = 3SE +/- 6.67, N = 332409.0832596.2732630.521. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Double PrecisionRun 3Run 2Run 1100200300400500SE +/- 4.74, N = 15SE +/- 7.42, N = 15SE +/- 0.03, N = 3428.03419.44440.801. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Single PrecisionRun 3Run 2Run 13K6K9K12K15KSE +/- 510.15, N = 15SE +/- 646.88, N = 15SE +/- 646.92, N = 1515130.4713791.7314126.101. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

MPV

MPV is an open-source, cross-platform media player. This test profile tests the frame-rate that can be achieved unsynchronized in a desynchronized mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterMPVVideo Input: Big Buck Bunny Sunflower 4K - Decode: Software OnlyRun 3Run 2Run 180160240320400SE +/- 1.32, N = 3SE +/- 0.92, N = 3SE +/- 0.73, N = 3372.48374.39372.29MIN: 286.84 / MAX: 432.21MIN: 288.04 / MAX: 445.07MIN: 286.67 / MAX: 441.691. mpv 0.32.0

OpenBenchmarking.orgFPS, More Is BetterMPVVideo Input: Big Buck Bunny Sunflower 1080p - Decode: Software OnlyRun 3Run 2Run 130060090012001500SE +/- 0.55, N = 3SE +/- 3.00, N = 3SE +/- 2.06, N = 31242.681249.001236.99MIN: 824.09 / MAX: 1672.6MIN: 823.81 / MAX: 1678.43MIN: 818.67 / MAX: 1669.11. mpv 0.32.0

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsRun 3Run 2Run 10.04050.0810.12150.1620.2025SE +/- 0.00039, N = 3SE +/- 0.00020, N = 3SE +/- 0.00010, N = 30.180040.179510.17955

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreRun 3Run 2Run 170140210280350307.44310.03308.84

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Features 2DRun 3Run 2Run 130K60K90K120K150KSE +/- 3470.33, N = 12SE +/- 2777.13, N = 9SE +/- 1997.08, N = 121482061478381496761. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Object DetectionRun 3Run 2Run 18K16K24K32K40KSE +/- 588.74, N = 3SE +/- 484.41, N = 3SE +/- 382.04, N = 33489536473373911. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OSBench

OSBench is a collection of micro-benchmarks for measuring operating system primitives like time to create threads/processes, launching programs, creating files, and memory allocation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create FilesRun 3Run 2Run 13691215SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 310.7910.5911.191. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ThreadsRun 3Run 2Run 148121620SE +/- 0.30, N = 15SE +/- 0.29, N = 15SE +/- 0.26, N = 1514.0713.9113.891. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Launch ProgramsRun 3Run 2Run 1918273645SE +/- 0.19, N = 3SE +/- 0.25, N = 3SE +/- 0.28, N = 337.7037.0837.231. (CC) gcc options: -lm

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ProcessesRun 3Run 2Run 1714212835SE +/- 0.25, N = 3SE +/- 0.25, N = 3SE +/- 0.34, N = 328.1928.0528.951. (CC) gcc options: -lm

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory AllocationsRun 3Run 2Run 11530456075SE +/- 0.43, N = 3SE +/- 0.11, N = 3SE +/- 0.84, N = 365.1664.7365.841. (CC) gcc options: -lm

perf-bench

This test profile is used for running Linux perf-bench, the benchmark support within the Linux kernel's perf tool. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll WaitRun 3Run 2Run 17K14K21K28K35KSE +/- 438.57, N = 4SE +/- 308.84, N = 3SE +/- 204.50, N = 33404934159330121. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex HashRun 3Run 2Run 11.1M2.2M3.3M4.4M5.5MSE +/- 6643.91, N = 3SE +/- 7280.95, N = 3SE +/- 7897.00, N = 35028714502116950426941. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBRun 3Run 2Run 148121620SE +/- 0.18, N = 5SE +/- 0.10, N = 3SE +/- 0.20, N = 315.0915.3615.341. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBRun 3Run 2Run 11632486480SE +/- 1.13, N = 3SE +/- 1.08, N = 4SE +/- 0.94, N = 471.4273.3773.241. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched PipeRun 3Run 2Run 190K180K270K360K450KSE +/- 3250.27, N = 3SE +/- 4833.02, N = 3SE +/- 3949.47, N = 33915363970633923881. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-PiRun 3Run 2Run 1100200300400500SE +/- 5.86, N = 3SE +/- 3.89, N = 15SE +/- 2.08, N = 34434544511. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall BasicRun 3Run 2Run 15M10M15M20M25MSE +/- 164860.89, N = 3SE +/- 98896.75, N = 3SE +/- 81316.08, N = 32273544522900058215700201. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLRun 3Run 2Run 14080120160200SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3187.84187.63187.74

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLRun 3Run 2Run 1160320480640800SE +/- 1.84, N = 3SE +/- 3.71, N = 3SE +/- 2.47, N = 3747.07751.39749.41

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLRun 3Run 2Run 15001000150020002500SE +/- 3.60, N = 3SE +/- 2.19, N = 3SE +/- 3.69, N = 32414.962421.412409.86

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLRun 3Run 2Run 16001200180024003000SE +/- 0.87, N = 3SE +/- 2.19, N = 3SE +/- 2.56, N = 32756.922763.052750.18

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCLRun 3Run 2Run 150100150200250SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3212.36212.51213.43

RedShift Demo

This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRedShift Demo 3.0Run 3Run 2Run 150100150200250SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3248248247

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRun 3Run 2Run 11.00912.01823.02734.03645.0455SE +/- 0.017, N = 3SE +/- 0.016, N = 3SE +/- 0.027, N = 34.4664.4594.4851. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationRun 3Run 2Run 120406080100SE +/- 1.21, N = 3SE +/- 0.98, N = 3SE +/- 1.01, N = 376.8375.7179.401. (CXX) g++ options: -rdynamic -lOpenCL

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultRun 3Run 2Run 10.32760.65520.98281.31041.638SE +/- 0.019, N = 3SE +/- 0.024, N = 3SE +/- 0.024, N = 31.4261.4241.4561. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100Run 3Run 2Run 10.50061.00121.50182.00242.503SE +/- 0.037, N = 3SE +/- 0.026, N = 3SE +/- 0.026, N = 32.2132.2122.2251. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessRun 3Run 2Run 148121620SE +/- 0.23, N = 3SE +/- 0.11, N = 3SE +/- 0.19, N = 515.4715.8815.511. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionRun 3Run 2Run 1246810SE +/- 0.091, N = 3SE +/- 0.071, N = 3SE +/- 0.032, N = 36.9126.8116.7181. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionRun 3Run 2Run 1816243240SE +/- 0.16, N = 3SE +/- 0.30, N = 3SE +/- 0.39, N = 332.8532.8832.341. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

67 Results Shown

ArrayFire
Blender:
  BMW27 - CUDA
  Classroom - CUDA
  Fishy Cat - CUDA
  Barbershop - CUDA
  BMW27 - NVIDIA OptiX
  Classroom - NVIDIA OptiX
  Fishy Cat - NVIDIA OptiX
  Barbershop - NVIDIA OptiX
  Pabellon Barcelona - CUDA
  Pabellon Barcelona - NVIDIA OptiX
cl-mem:
  Copy
  Read
  Write
clpeak:
  Integer Compute INT
  Single-Precision Float
  Double-Precision Double
  Global Memory Bandwidth
eSpeak-NG Speech Engine
FAHBench
FinanceBench
Hashcat:
  MD5
  SHA1
  7-Zip
  SHA-512
  TrueCrypt RIPEMD160 + XTS
InfluxDB:
  4 - 10000 - 2,5000,1 - 10000
  64 - 10000 - 2,5000,1 - 10000
  1024 - 10000 - 2,5000,1 - 10000
LeelaChessZero
LibRaw
MandelGPU
Mixbench:
  NVIDIA CUDA - Integer
  NVIDIA CUDA - Half Precision
  NVIDIA CUDA - Double Precision
  NVIDIA CUDA - Single Precision
MPV:
  Big Buck Bunny Sunflower 4K - Software Only
  Big Buck Bunny Sunflower 1080p - Software Only
NAMD CUDA
OctaneBench
OpenCV:
  Features 2D
  Object Detection
OSBench:
  Create Files
  Create Threads
  Launch Programs
  Create Processes
  Memory Allocations
perf-bench:
  Epoll Wait
  Futex Hash
  Memcpy 1MB
  Memset 1MB
  Sched Pipe
  Futex Lock-Pi
  Syscall Basic
PlaidML:
  No - Training - Mobilenet - OpenCL
  No - Inference - IMDB LSTM - OpenCL
  No - Inference - Mobilenet - OpenCL
  Yes - Inference - Mobilenet - OpenCL
  No - Inference - DenseNet 201 - OpenCL
RedShift Demo
Rodinia
ViennaCL
WebP Image Encode:
  Default
  Quality 100
  Quality 100, Lossless
  Quality 100, Highest Compression
  Quality 100, Lossless, Highest Compression