Radeon ROCm vs. NVIDIA OpenCL August 2017

Radeon ROCm and NVIDIA OpenCL Linux testing by Michael Larabel for a future article on Phoronix.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1708107-TY-OPENCLVEG85
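
For scripted comparisons, the command above can be wrapped in a few lines of Python. This is only a sketch: it assumes the phoronix-test-suite executable is installed and on PATH, and the helper names build_command and run_comparison are hypothetical, not part of the Phoronix Test Suite.

```python
import shutil
import subprocess

# Result file identifier quoted in the command above.
RESULT_ID = "1708107-TY-OPENCLVEG85"


def build_command(result_id):
    """Return the argument list for 'phoronix-test-suite benchmark <id>'."""
    return ["phoronix-test-suite", "benchmark", result_id]


def run_comparison(result_id=RESULT_ID):
    """Launch the comparison run and return the process exit code."""
    # Assumption: the executable is named 'phoronix-test-suite' and is on PATH.
    if shutil.which("phoronix-test-suite") is None:
        raise FileNotFoundError("phoronix-test-suite was not found on PATH")
    return subprocess.run(build_command(result_id), check=False).returncode


if __name__ == "__main__":
    print(" ".join(build_command(RESULT_ID)))
```

Calling run_comparison() starts the same benchmark run as the command line does; the test suite may still prompt interactively.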

Run Management

Result Identifier: Date Run
Radeon R9 285: August 08 2017
Radeon R9 290: August 08 2017
Radeon RX 480: August 08 2017
Radeon RX 560: August 08 2017
Radeon RX 580: August 08 2017
Radeon R9 Fury: August 08 2017
GeForce GTX 780 Ti: August 02 2017
GeForce GTX 960: August 06 2017
GeForce GTX 970: August 02 2017
GeForce GTX 980: August 01 2017
GeForce GTX 980 Ti: August 01 2017
GeForce GTX 1050: August 01 2017
GeForce GTX 1060: August 01 2017
GeForce GTX 1070: August 01 2017
GeForce GTX 1080: August 01 2017
GeForce GTX 1080 Ti: August 02 2017

Radeon ROCm vs. NVIDIA OpenCL August 2017: System Details

Configuration listed for the first system (GeForce GTX 980):
Processor: Intel Core i7-7740K @ 4.50GHz (8 Cores)
Motherboard: ASUS PRIME X299-A
Chipset: Intel Device 591f
Memory: 16384MB
Disk: 525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GB
Graphics: NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)
Audio: Realtek ALC1220
Network: Intel Connection
OS: Ubuntu 16.04
Kernel: 4.13.0-999-generic (x86_64) 20170730
Desktop: Unity 7.4.0
Display Driver: NVIDIA 384.59
OpenGL: 4.5.0
OpenCL: OpenCL 1.2 CUDA 9.0.130
Vulkan: 1.0.42
Compiler: GCC 5.4.0 20160609
File-System: ext4
Screen Resolution: 3840x2160

Graphics listed for the remaining NVIDIA systems:
GeForce GTX 980 Ti: NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
GeForce GTX 1060: NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)
GeForce GTX 1070: NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)
GeForce GTX 1080: NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)
GeForce GTX 1050: Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)
GeForce GTX 1080 Ti: NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)
GeForce GTX 780 Ti: NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
GeForce GTX 970: eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
GeForce GTX 960: eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)

Fields listed for the Radeon systems, starting at the Radeon R9 290:
Audio: Realtek Generic
Monitor: Acer B286HK
Kernel: 4.11.0-kfd-compute-rocm-rel-1.6-127 (x86_64)
Display Driver: modesetting 1.19.3
OpenGL: 4.5 Mesa 17.3.0-devel- padoka PPA (LLVM 6.0.0)
OpenCL: OpenCL 2.0 AMD-APP (2450.0)

Graphics listed for the Radeon systems:
Radeon R9 290: XFX AMD HAWAII 4096MB
Radeon R9 Fury: Sapphire AMD FIJI 4096MB
Radeon RX 580: MSI AMD POLARIS10 8192MB
Radeon RX 560: AMD POLARIS11 4096MB (Display Driver listed as amdgpu 1.3.0)
Radeon RX 480: AMD POLARIS10 8192MB
Radeon R9 285: XFX AMD TONGA 2048MB (Display Driver listed as modesetting 1.19.3)

Compiler Details
--build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
Scaling Governor: intel_pstate performance

OpenCL Details (GPU Compute Cores)
GeForce GTX 980: 2048
GeForce GTX 980 Ti: 2816
GeForce GTX 1060: 1280
GeForce GTX 1070: 1920
GeForce GTX 1080: 2560
GeForce GTX 1050: 640
GeForce GTX 1080 Ti: 3584
GeForce GTX 780 Ti: 2880
GeForce GTX 970: 1664
GeForce GTX 960: 1024

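Radeon ROCm vs. NVIDIA OpenCL August 2017: Result Overview

NVIDIA results
Test | GTX 980 | GTX 980 Ti | GTX 1060 | GTX 1070 | GTX 1080 | GTX 1050 | GTX 1080 Ti | GTX 780 Ti | GTX 970 | GTX 960
mixbench: Integer (GIOPS) | 1402.27 | 1717.51 | 1366.84 | 2027.96 | 2662.46 | 614.52 | 27.88 | 968.93 | 1220.80 | 835.01
mixbench: Single Precision (GFLOPS) | 4700.91 | 5563.69 | 4389.15 | 6402.60 | 8493.48 | 2040.46 | 89.23 | 4263.71 | 4125.24 | 2765.13
clpeak: Double-Precision Double (GFLOPS) | 159.68 | 195.67 | 150.41 | 225.45 | 295.22 | 66.88 | 415.06 | 246.21 | 137.56 | 92.44
mixbench: Double Precision (GFLOPS) | 159.86 | 194.72 | 152.16 | 223.87 | 295.34 | 66.76 | 6.39 | 245.11 | 136.18 | 92.74
luxmark: GPU - Hotel (Score) | 1756 | 2142 | 1742 | 2286 | 2725 | 1022 | 3532 | 1199 | 1649 | 1130
shoc: OpenCL - Max SP Flops (GFLOPS) | 5051.89 | 6208.65 | 4829.99 | 7125.88 | 9446.74 | 2125.78 | 13274.17 | 4944.72 | 4361.65 | 2960.49
darktable: Boat - OpenCL (Seconds) | 15.23 | 4.19 | 4.66 | 3.78 | 3.67 | 18.18 | 3.13 | 15.24 | 16.26 | 19.30
clpeak: Integer Compute INT (GIOPS) | 1296.11 | 1583.85 | 1223.58 | 1631.03 | 2349.86 | 568.64 | 3200.30 | 961.13 | 1128.64 | 781.16
shoc: OpenCL - MD5 Hash (GHash/s) | 7.54 | 9.27 | 7.36 | 10.61 | 14.29 | 3.24 | 19.74 | 4.66 | 6.54 | 4.46
clpeak: Single-Precision Float (GFLOPS) | 4288.05 | 5292.12 | 4157.55 | 6359.20 | 8249.62 | 1937.00 | 11780.26 | 3847.88 | 3728.55 | 2429.10
cl-mem: Write (GB/s) | 152.30 | 238.40 | 138.70 | 191.43 | 213.13 | 85.73 | 335.47 | 252.00 | 129.70 | 70.80
clpeak: Global Memory Bandwidth (GBPS) | 164.10 | 263.32 | 146.23 | 196.36 | 218.15 | 92.43 | 329.43 | 252.98 | 143.41 | 81.12
viennacl: OpenCL LU Factorization (GFLOPS) | 54.96 | 56.91 | 54.03 | 58.95 | 61.26 | 41.80 | 63.67 | 54.59 | 53.03 | 47.64
shoc: OpenCL - Texture Read Bandwidth (GB/s) | 332.11 | 351.45 | 382.21 | 454.47 | 520.13 | 274.91 | 596.82 | 287.39 | 288.87 | 277.11
clpeak: Transfer Bandwidth enqueueWriteBuffer (GBPS) | 12.48 | 12.21 | 12.37 | 12.61 | 12.59 | 6.44 | 12.61 | 12.36 | 12.49 | 12.46
shoc: OpenCL - FFT SP (GFLOPS) | 447.62 | 693.44 | 322.16 | 470.12 | 597.69 | 204.36 | 974.37 | 429.87 | 382.02 | 209.15
cl-mem: Copy (GB/s) | 142.60 | 216.37 | 137.70 | 186.63 | 206.53 | 87.50 | 316.80 | 237.10 | 125.27 | 70.60
luxmark: GPU - Luxball HDR (Score) | 11959 | 14811 | 11572 | 16186 | 12776 | 6569 | 19662 | 9516 | 10704 | 6148
cl-mem: Read (GB/s) | 164.47 | 266.17 | 151.60 | 205.43 | 227.20 | 95 | 338.07 | 271.67 | 143.55 | 81.40
fahbench | 97.38 | 108.15 | 97.24 | 132.79 | 145.38 | 49.78 | 186.74 | 72.77 | 85.45 | 58.35
shoc: OpenCL - Triad | 12.03 | 12.29 | 11.96 | 12.20 | 12.31 | 6.12 | 12.51 | 12.35 | 11.91 | 11.24
darktable: Server Room - OpenCL | 0.21 | 0.22 | 0.19 | 0.19 | 0.18 | 0.23 | 0.18 | 0.25 | 0.22 | 0.25
clpeak: Kernel Latency | 4.07 | 4.33 | 4.10 | 3.55 | 4.06 | 3.82 | 3.55 | 5.56 | 4.05 | 3.88
darktable: Masskrug - OpenCL | 0.21 | 0.22 | 0.18 | 0.18 | 0.17 | 0.22 | 0.17 | 0.27 | 0.21 | 0.24

Radeon results (a dash marks a test with no listed result; no values are listed for the Radeon R9 285)
Test | R9 290 | R9 Fury | RX 580 | RX 560 | RX 480
mixbench: Integer (GIOPS) | - | 1386.50 | 1227.32 | 508.69 | 1140.24
mixbench: Single Precision (GFLOPS) | - | 6508.34 | 5863.73 | 2465.62 | 5450.06
clpeak: Double-Precision Double (GFLOPS) | 599.06 | 447.26 | 391.77 | 164.50 | 364.04
mixbench: Double Precision (GFLOPS) | - | 440.85 | 389.97 | 164.00 | 362.29
luxmark: GPU - Hotel (Score) | 1013 | 1367 | 1218 | 509 | 1070
shoc: OpenCL - Max SP Flops (GFLOPS) | 4790.31 | 7144.88 | 6260.48 | 2634.53 | 5812.64
darktable: Boat - OpenCL (Seconds) | 3.85 | 3.42 | 4.08 | 6.91 | 4.16
clpeak: Integer Compute INT (GIOPS) | 1595.41 | 1429.32 | 1251.63 | 524.70 | 1162.88
shoc: OpenCL - MD5 Hash (GHash/s) | 6.11 | 9.09 | 7.96 | 3.34 | 7.33
clpeak: Single-Precision Float (GFLOPS) | 4755.89 | 7068.81 | 6175.41 | 2588.66 | 5737.62
cl-mem: Write (GB/s) | 217.70 | 391.50 | 180.70 | 80.90 | 179.70
clpeak: Global Memory Bandwidth (GBPS) | 272.17 | 431.26 | 208.96 | 89.25 | 209.39
viennacl: OpenCL LU Factorization (GFLOPS) | 20.97 | 21.76 | 11.98 | 12.08 | 12.05
shoc: OpenCL - Texture Read Bandwidth (GB/s) | 253.64 | 245.68 | 207.97 | 115.81 | 193.43
clpeak: Transfer Bandwidth enqueueWriteBuffer (GBPS) | 30.33 | 30.74 | 30.38 | 30.38 | 30.64
shoc: OpenCL - FFT SP (GFLOPS) | - | 550.79 | 498.10 | 209.76 | 462.35
cl-mem: Copy (GB/s) | 192.40 | 206.13 | 183.47 | 81.03 | 185.37
luxmark: GPU - Luxball HDR (Score) | 10400 | 13212 | 10400 | 4468 | 9885
cl-mem: Read (GB/s) | 123.00 | 123.30 | 159.73 | 93.50 | 157.87
fahbench | - | - | - | - | -
shoc: OpenCL - Triad | - | 10.17 | 9.68 | 4.95 | 9.61
darktable: Server Room - OpenCL | 0.15 | 0.13 | 0.17 | 0.27 | 0.13
clpeak: Kernel Latency | 6.67 | 5.94 | 5.63 | 5.82 | 5.86
darktable: Masskrug - OpenCL | 0.17 | 0.13 | 0.18 | 0.27 | 0.14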

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Mixbench 2016-06-06, Benchmark: Integer (GIOPS, More Is Better)
Radeon RX 480: 1140.24 (SE +/- 0.04, N = 3)
Radeon RX 560: 508.69 (SE +/- 0.12, N = 3)
Radeon RX 580: 1227.32 (SE +/- 0.04, N = 3)
Radeon R9 Fury: 1386.50 (SE +/- 0.06, N = 3)
GeForce GTX 960: 835.01 (SE +/- 2.57, N = 3)
GeForce GTX 970: 1220.80 (SE +/- 0.50, N = 3)
GeForce GTX 780 Ti: 968.93 (SE +/- 0.88, N = 3)
GeForce GTX 1080 Ti: 27.88 (SE +/- 1.30, N = 3)
GeForce GTX 1050: 614.52 (SE +/- 0.97, N = 3)
GeForce GTX 1080: 2662.46 (SE +/- 3.22, N = 3)
GeForce GTX 1070: 2027.96 (SE +/- 2.60, N = 3)
GeForce GTX 1060: 1366.84 (SE +/- 0.54, N = 3)
GeForce GTX 980 Ti: 1717.51 (SE +/- 1.33, N = 3)
GeForce GTX 980: 1402.27 (SE +/- 0.90, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench 2016-06-06, Benchmark: Single Precision (GFLOPS, More Is Better)
Radeon RX 480: 5450.06 (SE +/- 2.06, N = 3)
Radeon RX 560: 2465.62 (SE +/- 0.14, N = 3)
Radeon RX 580: 5863.73 (SE +/- 4.20, N = 3)
Radeon R9 Fury: 6508.34 (SE +/- 1.46, N = 3)
GeForce GTX 960: 2765.13 (SE +/- 0.91, N = 3)
GeForce GTX 970: 4125.24 (SE +/- 2.66, N = 3)
GeForce GTX 780 Ti: 4263.71 (SE +/- 5.00, N = 3)
GeForce GTX 1080 Ti: 89.23 (SE +/- 3.63, N = 3)
GeForce GTX 1050: 2040.46 (SE +/- 0.81, N = 3)
GeForce GTX 1080: 8493.48 (SE +/- 48.08, N = 3)
GeForce GTX 1070: 6402.60 (SE +/- 54.41, N = 3)
GeForce GTX 1060: 4389.15 (SE +/- 2.95, N = 3)
GeForce GTX 980 Ti: 5563.69 (SE +/- 255.36, N = 3)
GeForce GTX 980: 4700.91 (SE +/- 5.25, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
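
The "SE +/- x, N = 3" annotations give a standard error over three runs. As a rough sketch of how such a figure can be derived, the snippet below assumes the usual definition (sample standard deviation divided by the square root of N); whether the Phoronix Test Suite computes it exactly this way is an assumption, and the run values are hypothetical because the individual runs are not part of this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("at least two samples are needed")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical GFLOPS readings from three runs (not taken from this result file).
runs = [100.0, 102.0, 104.0]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A large SE relative to the mean, as on some of the GeForce results, suggests noticeable run-to-run variation.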

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
Radeon RX 480: 364.04 (SE +/- 0.00, N = 3)
Radeon RX 560: 164.50 (SE +/- 0.02, N = 3)
Radeon RX 580: 391.77 (SE +/- 0.07, N = 3)
Radeon R9 Fury: 447.26 (SE +/- 0.00, N = 3)
Radeon R9 290: 599.06 (SE +/- 0.00, N = 3)
GeForce GTX 960: 92.44 (SE +/- 0.01, N = 3)
GeForce GTX 970: 137.56 (SE +/- 0.02, N = 3)
GeForce GTX 780 Ti: 246.21 (SE +/- 0.29, N = 3)
GeForce GTX 1080 Ti: 415.06 (SE +/- 1.65, N = 3)
GeForce GTX 1050: 66.88 (SE +/- 0.08, N = 3)
GeForce GTX 1080: 295.22 (SE +/- 0.76, N = 3)
GeForce GTX 1070: 225.45 (SE +/- 1.12, N = 3)
GeForce GTX 1060: 150.41 (SE +/- 0.30, N = 3)
GeForce GTX 980 Ti: 195.67 (SE +/- 0.19, N = 3)
GeForce GTX 980: 159.68 (SE +/- 0.07, N = 3)

clpeak, System Power Consumption Monitor (Watts, Fewer Is Better)
Systems charted: Radeon RX 480, Radeon RX 560, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 960, GeForce GTX 970, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050, GeForce GTX 1080, GeForce GTX 1070, GeForce GTX 1060, GeForce GTX 980 Ti, GeForce GTX 980
Series as listed (13 series for 15 systems):
Min: 74.5 / Avg: 139.09 / Max: 198.1
Min: 57.7 / Avg: 83.39 / Max: 110.6
Min: 62.1 / Avg: 135.37 / Max: 215.7
Min: 91.2 / Avg: 171.7 / Max: 293.2
Min: 158.9 / Avg: 196.69 / Max: 311.4
Min: 53.3 / Avg: 128.25 / Max: 203.2
Min: 82.9 / Avg: 130.65 / Max: 178.4
Min: 59.7 / Avg: 109 / Max: 158.3
Min: 44.6 / Avg: 55.5 / Max: 66.4
Min: 51.8 / Avg: 159 / Max: 266.2
Min: 50.7 / Avg: 88 / Max: 125.3
Min: 63.5 / Avg: 152.7 / Max: 241.9
Min: 254.1 / Avg: 254.4 / Max: 254.7

Mixbench

Mixbench 2016-06-06, Benchmark: Double Precision (GFLOPS, More Is Better)
Radeon RX 480: 362.29 (SE +/- 0.01, N = 3)
Radeon RX 560: 164.00 (SE +/- 0.00, N = 3)
Radeon RX 580: 389.97 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 440.85 (SE +/- 0.01, N = 3)
GeForce GTX 960: 92.74 (SE +/- 0.25, N = 3)
GeForce GTX 970: 136.18 (SE +/- 0.71, N = 3)
GeForce GTX 780 Ti: 245.11 (SE +/- 0.15, N = 3)
GeForce GTX 1080 Ti: 6.39 (SE +/- 0.04, N = 3)
GeForce GTX 1050: 66.76 (SE +/- 0.05, N = 3)
GeForce GTX 1080: 295.34 (SE +/- 0.92, N = 3)
GeForce GTX 1070: 223.87 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 152.16 (SE +/- 0.12, N = 3)
GeForce GTX 980 Ti: 194.72 (SE +/- 2.32, N = 3)
GeForce GTX 980: 159.86 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender / SLG2. It supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.0, OpenCL Device: GPU - Scene: Hotel (Score, More Is Better)
Radeon RX 480: 1070 (SE +/- 0.67, N = 3)
Radeon RX 560: 509 (SE +/- 2.33, N = 3)
Radeon RX 580: 1218 (SE +/- 3.33, N = 3)
Radeon R9 Fury: 1367 (SE +/- 3.84, N = 3)
Radeon R9 290: 1013 (SE +/- 0.67, N = 3)
GeForce GTX 960: 1130 (SE +/- 7.64, N = 3)
GeForce GTX 970: 1649 (SE +/- 36.71, N = 3)
GeForce GTX 780 Ti: 1199 (SE +/- 5.33, N = 3)
GeForce GTX 1080 Ti: 3532 (SE +/- 54.49, N = 3)
GeForce GTX 1050: 1022 (SE +/- 1.00, N = 3)
GeForce GTX 1080: 2725 (SE +/- 28.15, N = 3)
GeForce GTX 1070: 2286 (SE +/- 6.67, N = 3)
GeForce GTX 1060: 1742 (SE +/- 9.21, N = 3)
GeForce GTX 980 Ti: 2142 (SE +/- 36.99, N = 3)
GeForce GTX 980: 1756 (SE +/- 55.00, N = 3)

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, System Power Consumption Monitor (Watts, Fewer Is Better)
Radeon RX 480: Min 66.7 / Avg 159.05 / Max 205.4
Radeon RX 560: Min 50.6 / Avg 92.86 / Max 123.4
Radeon RX 580: Min 61.4 / Avg 165.94 / Max 235
Radeon R9 Fury: Min 121.6 / Avg 215.47 / Max 344.8
Radeon R9 290: Min 109.8 / Avg 204.67 / Max 296.3
GeForce GTX 960: Min 117.1 / Avg 124.83 / Max 188.6
GeForce GTX 970: Min 57.8 / Avg 149.11 / Max 234.2
GeForce GTX 780 Ti: Min 59.2 / Avg 226.01 / Max 336.8
GeForce GTX 1080 Ti: Min 52.6 / Avg 203.91 / Max 304.8
GeForce GTX 1050: Min 65.4 / Avg 91.36 / Max 114.7
GeForce GTX 1080: Min 127.9 / Avg 156.13 / Max 262.7
GeForce GTX 1070: Min 78.5 / Avg 140.86 / Max 210.3
GeForce GTX 1060: Min 73.6 / Avg 118.38 / Max 167.3
GeForce GTX 980 Ti: Min 151.8 / Avg 198.76 / Max 304.5
GeForce GTX 980: Min 55.9 / Avg 163.16 / Max 251

LuxMark

LuxMark 3.0, System Power Consumption Monitor (Watts, Fewer Is Better)
Systems charted: Radeon R9 285, Radeon RX 480, Radeon RX 560, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 960, GeForce GTX 970, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050, GeForce GTX 1080, GeForce GTX 1070, GeForce GTX 1060, GeForce GTX 980 Ti, GeForce GTX 980
Series as listed (15 series for 16 systems):
Min: 103.4 / Avg: 161.24 / Max: 183.2
Min: 51.4 / Avg: 94.7 / Max: 102.8
Min: 62.8 / Avg: 167.57 / Max: 197.1
Min: 87.9 / Avg: 202.29 / Max: 248.6
Min: 164 / Avg: 223.01 / Max: 246.2
Min: 68.3 / Avg: 154.46 / Max: 158.8
Min: 57.1 / Avg: 191.79 / Max: 197.1
Min: 79.8 / Avg: 279.87 / Max: 294.8
Min: 75.8 / Avg: 261.7 / Max: 278.5
Min: 44.6 / Avg: 107.9 / Max: 115.3
Min: 61.4 / Avg: 203.35 / Max: 209.7
Min: 52.1 / Avg: 183.39 / Max: 188.4
Min: 61.8 / Avg: 152.59 / Max: 156.6
Min: 61 / Avg: 242.75 / Max: 254.4
Min: 56.4 / Avg: 203.63 / Max: 213.1

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, System Power Consumption Monitor (Watts, Fewer Is Better)
Radeon RX 480: Min 69.5 / Avg 134.82 / Max 175
Radeon RX 560: Min 51.3 / Avg 83.15 / Max 94.2
Radeon RX 580: Min 62.8 / Avg 129.51 / Max 166.9
Radeon R9 Fury: Min 167.4 / Avg 178.62 / Max 202
Radeon R9 290: Min 111.2 / Avg 180.14 / Max 214.7
GeForce GTX 960: Min 96.7 / Avg 124.57 / Max 141.9
GeForce GTX 970: Min 58 / Avg 146.29 / Max 167.8
GeForce GTX 780 Ti: Min 94.2 / Avg 240.33 / Max 287.5
GeForce GTX 1080 Ti: Min 66.8 / Avg 190.32 / Max 229.1
GeForce GTX 1050: Min 44.8 / Avg 89.24 / Max 101.5
GeForce GTX 1080: Min 52.2 / Avg 141.33 / Max 169.7
GeForce GTX 1070: Min 50.7 / Avg 138.56 / Max 159.1
GeForce GTX 1060: Min 49.9 / Avg 117.9 / Max 137.8
GeForce GTX 980 Ti: Min 143.3 / Avg 204.53 / Max 231.7
GeForce GTX 980: Min 55.7 / Avg 159.95 / Max 185.7

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)
Radeon RX 480: 5812.64 (SE +/- 1.92, N = 3)
Radeon RX 560: 2634.53 (SE +/- 0.11, N = 3)
Radeon RX 580: 6260.48 (SE +/- 1.35, N = 3)
Radeon R9 Fury: 7144.88 (SE +/- 0.03, N = 3)
Radeon R9 290: 4790.31 (SE +/- 0.03, N = 3)
GeForce GTX 960: 2960.49 (SE +/- 7.79, N = 3)
GeForce GTX 970: 4361.65 (SE +/- 1.82, N = 3)
GeForce GTX 780 Ti: 4944.72 (SE +/- 19.58, N = 3)
GeForce GTX 1080 Ti: 13274.17 (SE +/- 65.79, N = 3)
GeForce GTX 1050: 2125.78 (SE +/- 0.09, N = 3)
GeForce GTX 1080: 9446.74 (SE +/- 38.82, N = 3)
GeForce GTX 1070: 7125.88 (SE +/- 32.37, N = 3)
GeForce GTX 1060: 4829.99 (SE +/- 6.12, N = 3)
GeForce GTX 980 Ti: 6208.65 (SE +/- 15.98, N = 3)
GeForce GTX 980: 5051.89 (SE +/- 3.16, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

LuxMark

LuxMark 3.0, System Power Consumption Monitor (Watts, Fewer Is Better)
Systems charted: Radeon R9 285, Radeon RX 480, Radeon RX 560, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 960, GeForce GTX 970, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050, GeForce GTX 1080, GeForce GTX 1070, GeForce GTX 1060, GeForce GTX 980 Ti, GeForce GTX 980
Series as listed (15 series for 16 systems):
Min: 71.5 / Avg: 186.43 / Max: 197.7
Min: 50.4 / Avg: 104.54 / Max: 107.3
Min: 61.7 / Avg: 198.14 / Max: 205.1
Min: 60.8 / Avg: 231.32 / Max: 244.7
Min: 108.1 / Avg: 253.37 / Max: 262.7
Min: 73.2 / Avg: 145.52 / Max: 149.6
Min: 88.4 / Avg: 191.17 / Max: 194.5
Min: 127.9 / Avg: 295.03 / Max: 310.7
Min: 103.3 / Avg: 258.5 / Max: 265.6
Min: 62.1 / Avg: 109.06 / Max: 119.8
Min: 81.4 / Avg: 184.66 / Max: 189.3
Min: 78.2 / Avg: 186.9 / Max: 196.9
Min: 50 / Avg: 151.57 / Max: 157.1
Min: 120.1 / Avg: 253.78 / Max: 259
Min: 136.6 / Avg: 207.25 / Max: 212.8

Darktable

Darktable 2.0.3, Test: Boat - Acceleration: OpenCL (Seconds, Fewer Is Better)
Radeon RX 480: 4.16 (SE +/- 0.02, N = 3)
Radeon RX 560: 6.91 (SE +/- 0.03, N = 3)
Radeon RX 580: 4.08 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 3.42 (SE +/- 0.05, N = 3)
Radeon R9 290: 3.85 (SE +/- 0.01, N = 3)
GeForce GTX 960: 19.30 (SE +/- 0.01, N = 3)
GeForce GTX 970: 16.26 (SE +/- 0.03, N = 3)
GeForce GTX 780 Ti: 15.24 (SE +/- 0.02, N = 3)
GeForce GTX 1080 Ti: 3.13 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 18.18 (SE +/- 0.02, N = 3)
GeForce GTX 1080: 3.67 (SE +/- 0.01, N = 3)
GeForce GTX 1070: 3.78 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 4.66 (SE +/- 0.01, N = 3)
GeForce GTX 980 Ti: 4.19 (SE +/- 0.00, N = 3)
GeForce GTX 980: 15.23 (SE +/- 0.02, N = 3)

clpeak

clpeak, OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
Radeon RX 480: 1162.88 (SE +/- 0.03, N = 3)
Radeon RX 560: 524.70 (SE +/- 0.03, N = 3)
Radeon RX 580: 1251.63 (SE +/- 0.01, N = 3)
Radeon R9 Fury: 1429.32 (SE +/- 0.05, N = 3)
Radeon R9 290: 1595.41 (SE +/- 0.05, N = 3)
GeForce GTX 960: 781.16 (SE +/- 6.09, N = 3)
GeForce GTX 970: 1128.64 (SE +/- 22.00, N = 3)
GeForce GTX 780 Ti: 961.13 (SE +/- 17.95, N = 3)
GeForce GTX 1080 Ti: 3200.30 (SE +/- 117.22, N = 3)
GeForce GTX 1050: 568.64 (SE +/- 15.64, N = 3)
GeForce GTX 1080: 2349.86 (SE +/- 29.30, N = 3)
GeForce GTX 1070: 1631.03 (SE +/- 19.23, N = 3)
GeForce GTX 1060: 1223.58 (SE +/- 52.05, N = 3)
GeForce GTX 980 Ti: 1583.85 (SE +/- 16.78, N = 3)
GeForce GTX 980: 1296.11 (SE +/- 0.62, N = 3)

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)
Radeon RX 480: 7.33 (SE +/- 0.01, N = 3)
Radeon RX 560: 3.34 (SE +/- 0.00, N = 3)
Radeon RX 580: 7.96 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 9.09 (SE +/- 0.00, N = 3)
Radeon R9 290: 6.11 (SE +/- 0.00, N = 3)
GeForce GTX 960: 4.46 (SE +/- 0.00, N = 3)
GeForce GTX 970: 6.54 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 4.66 (SE +/- 0.01, N = 3)
GeForce GTX 1080 Ti: 19.74 (SE +/- 0.06, N = 3)
GeForce GTX 1050: 3.24 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 14.29 (SE +/- 0.03, N = 3)
GeForce GTX 1070: 10.61 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 7.36 (SE +/- 0.01, N = 3)
GeForce GTX 980 Ti: 9.27 (SE +/- 0.00, N = 3)
GeForce GTX 980: 7.54 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

clpeak, OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
Radeon RX 480: 5737.62 (SE +/- 0.06, N = 3)
Radeon RX 560: 2588.66 (SE +/- 0.20, N = 3)
Radeon RX 580: 6175.41 (SE +/- 0.38, N = 3)
Radeon R9 Fury: 7068.81 (SE +/- 0.51, N = 3)
Radeon R9 290: 4755.89 (SE +/- 0.30, N = 3)
GeForce GTX 960: 2429.10 (SE +/- 70.89, N = 3)
GeForce GTX 970: 3728.55 (SE +/- 0.43, N = 3)
GeForce GTX 780 Ti: 3847.88 (SE +/- 0.62, N = 3)
GeForce GTX 1080 Ti: 11780.26 (SE +/- 0.82, N = 3)
GeForce GTX 1050: 1937.00 (SE +/- 0.29, N = 3)
GeForce GTX 1080: 8249.62 (SE +/- 157.37, N = 3)
GeForce GTX 1070: 6359.20 (SE +/- 0.57, N = 3)
GeForce GTX 1060: 4157.55 (SE +/- 74.08, N = 3)
GeForce GTX 980 Ti: 5292.12 (SE +/- 13.41, N = 3)
GeForce GTX 980: 4288.05 (SE +/- 0.64, N = 3)

cl-mem

cl-mem 2017-01-13, System Power Consumption Monitor (Watts, Fewer Is Better)
Radeon RX 480: Min 67.2 / Avg 149.58 / Max 172.7
Radeon RX 560: Min 61.8 / Avg 94.24 / Max 99.4
Radeon RX 580: Min 61.5 / Avg 128.3 / Max 177.9
Radeon R9 Fury: Min 111.7 / Avg 167.53 / Max 194.9
Radeon R9 290: Min 208.1 / Avg 219.88 / Max 227.1
GeForce GTX 960: Min 53 / Avg 112.52 / Max 126.6
GeForce GTX 970: Min 86.1 / Avg 140.75 / Max 165.4
GeForce GTX 780 Ti: Min 149 / Avg 212.7 / Max 266.1
GeForce GTX 1080 Ti: Min 213.3 / Avg 215.1 / Max 216.9
GeForce GTX 1050: Min 44.6 / Avg 91.7 / Max 98.4
GeForce GTX 1080: Min 99.7 / Avg 126.93 / Max 164.8
GeForce GTX 1070: Min 50.4 / Avg 132.24 / Max 154.3
GeForce GTX 1060: Min 77.6 / Avg 112.65 / Max 130.4
GeForce GTX 980 Ti: Min 63.5 / Avg 169.7 / Max 225.2
GeForce GTX 980: Min 102 / Avg 140.4 / Max 165.1

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, System Power Consumption Monitor (Watts, Fewer Is Better)
Systems charted: Radeon RX 480, Radeon RX 560, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 960, GeForce GTX 970, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050, GeForce GTX 1080, GeForce GTX 1070, GeForce GTX 1060, GeForce GTX 980 Ti, GeForce GTX 980
Series as listed (11 series for 15 systems):
Min: 112.9 / Avg: 157.65 / Max: 202.4
Min: 114.5 / Avg: 114.7 / Max: 114.9
Min: 61.6 / Avg: 65.85 / Max: 70.1
Min: 63.4 / Avg: 204.7 / Max: 346
Min: 109.8 / Avg: 199.5 / Max: 289.2
Min: 168.7 / Avg: 174.7 / Max: 180.7
Min: 98.4 / Avg: 109.4 / Max: 120.4
Min: 130.1 / Avg: 172.7 / Max: 215.3
Min: 116.8 / Avg: 117.17 / Max: 117.4
Min: 82.3 / Avg: 132.4 / Max: 182.5
Min: 104 / Avg: 105.7 / Max: 107.4

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)
Radeon RX 480: 179.70 (SE +/- 0.10, N = 3)
Radeon RX 560: 80.90 (SE +/- 0.12, N = 3)
Radeon RX 580: 180.70 (SE +/- 0.80, N = 3)
Radeon R9 Fury: 391.50 (SE +/- 4.06, N = 3)
Radeon R9 290: 217.70 (SE +/- 1.08, N = 3)
GeForce GTX 960: 70.80 (SE +/- 0.00, N = 2)
GeForce GTX 970: 129.70 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 252.00 (SE +/- 0.12, N = 3)
GeForce GTX 1080 Ti: 335.47 (SE +/- 0.22, N = 3)
GeForce GTX 1050: 85.73 (SE +/- 0.03, N = 3)
GeForce GTX 1080: 213.13 (SE +/- 0.87, N = 3)
GeForce GTX 1070: 191.43 (SE +/- 0.09, N = 3)
GeForce GTX 1060: 138.70 (SE +/- 0.31, N = 3)
GeForce GTX 980 Ti: 238.40 (SE +/- 0.00, N = 3)
GeForce GTX 980: 152.30 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

clpeak, OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
Radeon RX 480: 209.39 (SE +/- 0.03, N = 3)
Radeon RX 560: 89.25 (SE +/- 0.01, N = 3)
Radeon RX 580: 208.96 (SE +/- 0.10, N = 3)
Radeon R9 Fury: 431.26 (SE +/- 0.90, N = 3)
Radeon R9 290: 272.17 (SE +/- 0.02, N = 3)
GeForce GTX 960: 81.12 (SE +/- 0.01, N = 3)
GeForce GTX 970: 143.41 (SE +/- 0.03, N = 3)
GeForce GTX 780 Ti: 252.98 (SE +/- 10.22, N = 3)
GeForce GTX 1080 Ti: 329.43 (SE +/- 0.82, N = 3)
GeForce GTX 1050: 92.43 (SE +/- 0.18, N = 3)
GeForce GTX 1080: 218.15 (SE +/- 3.85, N = 3)
GeForce GTX 1070: 196.36 (SE +/- 0.15, N = 3)
GeForce GTX 1060: 146.23 (SE +/- 0.45, N = 3)
GeForce GTX 980 Ti: 263.32 (SE +/- 0.10, N = 3)
GeForce GTX 980: 164.10 (SE +/- 0.05, N = 3)

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.4.2, OpenCL LU Factorization (GFLOPS, More Is Better)
Radeon RX 480: 12.05 (SE +/- 0.06, N = 3)
Radeon RX 560: 12.08 (SE +/- 0.12, N = 3)
Radeon RX 580: 11.98 (SE +/- 0.05, N = 3)
Radeon R9 Fury: 21.76 (SE +/- 0.75, N = 3)
Radeon R9 290: 20.97 (SE +/- 0.27, N = 3)
GeForce GTX 960: 47.64 (SE +/- 0.00, N = 3)
GeForce GTX 970: 53.03 (SE +/- 0.12, N = 3)
GeForce GTX 780 Ti: 54.59 (SE +/- 2.80, N = 3)
GeForce GTX 1080 Ti: 63.67 (SE +/- 0.04, N = 3)
GeForce GTX 1050: 41.80 (SE +/- 0.01, N = 3)
GeForce GTX 1080: 61.26 (SE +/- 0.14, N = 3)
GeForce GTX 1070: 58.95 (SE +/- 0.02, N = 3)
GeForce GTX 1060: 54.03 (SE +/- 0.37, N = 3)
GeForce GTX 980 Ti: 56.91 (SE +/- 0.34, N = 3)
GeForce GTX 980: 54.96 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

cl-mem 2017-01-13, System Power Consumption Monitor (Watts, Fewer Is Better)
Radeon RX 480: Min 158.5 / Avg 168.35 / Max 173.8
Radeon RX 560: Min 50.7 / Avg 89.22 / Max 99.8
Radeon RX 580: Min 61.6 / Avg 134.4 / Max 178.4
Radeon R9 Fury: Min 63.2 / Avg 162.7 / Max 196.7
Radeon R9 290: Min 110.3 / Avg 194.92 / Max 227.7
GeForce GTX 960: Min 75.2 / Avg 119.81 / Max 126.4
GeForce GTX 970: Min 145.8 / Avg 161.02 / Max 165.8
GeForce GTX 780 Ti: Min 205.2 / Avg 232.33 / Max 267.5
GeForce GTX 1080 Ti: Min 131.4 / Avg 163.5 / Max 225.5
GeForce GTX 1050: Min 51 / Avg 88.33 / Max 98.2
GeForce GTX 1080: Min 120.4 / Avg 154.88 / Max 168.9
GeForce GTX 1070: Min 50.8 / Avg 133.6 / Max 156.7
GeForce GTX 1060: Min 113.9 / Avg 126.18 / Max 129.8
GeForce GTX 980 Ti: Min 164 / Avg 209.78 / Max 231
GeForce GTX 980: Min 55.9 / Avg 137.83 / Max 165.7

cl-mem 2017-01-13, System Power Consumption Monitor (Watts, Fewer Is Better)
Radeon RX 480: Min 73.8 / Avg 137.13 / Max 171.7
Radeon RX 560: Min 51.1 / Avg 91.53 / Max 100.1
Radeon RX 580: Min 126.3 / Avg 165.33 / Max 179.6
Radeon R9 Fury: Min 63.6 / Avg 161.35 / Max 231
Radeon R9 290: Min 110.7 / Avg 173.45 / Max 211.5
GeForce GTX 960: Min 75.3 / Avg 117.34 / Max 125.9
GeForce GTX 970: Min 57.9 / Avg 143.07 / Max 165.3
GeForce GTX 780 Ti: Min 59.1 / Avg 193.83 / Max 264
GeForce GTX 1080 Ti: Min 215.6 / Avg 216.75 / Max 217.9
GeForce GTX 1050: Min 97.2 / Avg 97.62 / Max 98.3
GeForce GTX 1080: Min 99.9 / Avg 143.28 / Max 167.2
GeForce GTX 1070: Min 51.8 / Avg 133.92 / Max 156.5
GeForce GTX 1060: Min 123.2 / Avg 128.26 / Max 130.1
GeForce GTX 980 Ti: Min 136.5 / Avg 208.38 / Max 235
GeForce GTX 980: Min 110.7 / Avg 156.45 / Max 166

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)
Radeon RX 480: 193.43 (SE +/- 0.23, N = 3)
Radeon RX 560: 115.81 (SE +/- 0.14, N = 3)
Radeon RX 580: 207.97 (SE +/- 0.08, N = 3)
Radeon R9 Fury: 245.68 (SE +/- 0.89, N = 3)
Radeon R9 290: 253.64 (SE +/- 1.41, N = 3)
GeForce GTX 960: 277.11 (SE +/- 0.45, N = 3)
GeForce GTX 970: 288.87 (SE +/- 0.24, N = 3)
GeForce GTX 780 Ti: 287.39 (SE +/- 0.11, N = 3)
GeForce GTX 1080 Ti: 596.82 (SE +/- 0.23, N = 3)
GeForce GTX 1050: 274.91 (SE +/- 1.17, N = 3)
GeForce GTX 1080: 520.13 (SE +/- 0.88, N = 3)
GeForce GTX 1070: 454.47 (SE +/- 0.27, N = 3)
GeForce GTX 1060: 382.21 (SE +/- 0.59, N = 3)
GeForce GTX 980 Ti: 351.45 (SE +/- 0.74, N = 3)
GeForce GTX 980: 332.11 (SE +/- 1.03, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

clpeak, System Power Consumption Monitor (Watts, Fewer Is Better)
Radeon RX 480: Min 73.8 / Avg 140.82 / Max 162.3
Radeon RX 560: Min 51 / Avg 87.81 / Max 93.9
Radeon RX 580: Min 95.5 / Avg 138.34 / Max 170
Radeon R9 Fury: Min 63.9 / Avg 177.34 / Max 225.5
Radeon R9 290: Min 110.3 / Avg 178.68 / Max 199.3
GeForce GTX 960: Min 85.3 / Avg 121.06 / Max 124.6
GeForce GTX 970: Min 61.8 / Avg 134.1 / Max 144.6
GeForce GTX 780 Ti: Min 220.2 / Avg 220.42 / Max 221
GeForce GTX 1080 Ti: Min 57.5 / Avg 179.46 / Max 201.8
GeForce GTX 1050: Min 44.7 / Avg 85.97 / Max 90.4
GeForce GTX 1080: Min 52.7 / Avg 144.57 / Max 160.3
GeForce GTX 1070: Min 133.6 / Avg 138.18 / Max 139.4
GeForce GTX 1060: Min 49.1 / Avg 105.58 / Max 118
GeForce GTX 980 Ti: Min 139.5 / Avg 189.78 / Max 196.9
GeForce GTX 980: Min 57 / Avg 152.7 / Max 162.3

clpeak, OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS Per Watt, More Is Better)
Radeon RX 480: 0.31
Radeon RX 560: 0.39
Radeon RX 580: 0.32
Radeon R9 Fury: 0.29
Radeon R9 290: 0.18
GeForce GTX 970: 0.12
GeForce GTX 1080 Ti: 0.08
GeForce GTX 1050: 0.08
GeForce GTX 1080: 0.11
GeForce GTX 1070: 0.11
GeForce GTX 1060: 0.13
GeForce GTX 980 Ti: 0.08
GeForce GTX 980: 0.11

clpeak, OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
Radeon RX 480: 30.64 (SE +/- 0.33, N = 3)
Radeon RX 560: 30.38 (SE +/- 0.00, N = 3)
Radeon RX 580: 30.38 (SE +/- 0.01, N = 3)
Radeon R9 Fury: 30.74 (SE +/- 0.33, N = 3)
Radeon R9 290: 30.33 (SE +/- 0.07, N = 3)
GeForce GTX 960: 12.46 (SE +/- 0.01, N = 3)
GeForce GTX 970: 12.49 (SE +/- 0.01, N = 3)
GeForce GTX 780 Ti: 12.36 (SE +/- 0.07, N = 3)
GeForce GTX 1080 Ti: 12.61 (SE +/- 0.02, N = 3)
GeForce GTX 1050: 6.44 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.59 (SE +/- 0.01, N = 3)
GeForce GTX 1070: 12.61 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 12.37 (SE +/- 0.03, N = 3)
GeForce GTX 980 Ti: 12.21 (SE +/- 0.09, N = 3)
GeForce GTX 980: 12.48 (SE +/- 0.03, N = 3)

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)
Radeon RX 480: 462.35 (SE +/- 0.10, N = 3)
Radeon RX 560: 209.76 (SE +/- 0.06, N = 3)
Radeon RX 580: 498.10 (SE +/- 0.07, N = 3)
Radeon R9 Fury: 550.79 (SE +/- 0.06, N = 3)
GeForce GTX 960: 209.15 (SE +/- 0.45, N = 3)
GeForce GTX 970: 382.02 (SE +/- 6.83, N = 3)
GeForce GTX 780 Ti: 429.87 (SE +/- 21.22, N = 3)
GeForce GTX 1080 Ti: 974.37 (SE +/- 5.16, N = 3)
GeForce GTX 1050: 204.36 (SE +/- 12.49, N = 3)
GeForce GTX 1080: 597.69 (SE +/- 6.81, N = 3)
GeForce GTX 1070: 470.12 (SE +/- 8.94, N = 3)
GeForce GTX 1060: 322.16 (SE +/- 7.21, N = 3)
GeForce GTX 980 Ti: 693.44 (SE +/- 21.12, N = 3)
GeForce GTX 980: 447.62 (SE +/- 0.99, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

cl-mem 2017-01-13, Benchmark: Copy (GB/s, More Is Better)
Radeon RX 480: 185.37 (SE +/- 0.03, N = 3)
Radeon RX 560: 81.03 (SE +/- 0.03, N = 3)
Radeon RX 580: 183.47 (SE +/- 0.38, N = 3)
Radeon R9 Fury: 206.13 (SE +/- 0.48, N = 3)
Radeon R9 290: 192.40 (SE +/- 0.64, N = 3)
GeForce GTX 960: 70.60 (SE +/- 0.00, N = 2)
GeForce GTX 970: 125.27 (SE +/- 0.03, N = 3)
GeForce GTX 780 Ti: 237.10 (SE +/- 0.00, N = 3)
GeForce GTX 1080 Ti: 316.80 (SE +/- 0.10, N = 3)
GeForce GTX 1050: 87.50 (SE +/- 0.15, N = 3)
GeForce GTX 1080: 206.53 (SE +/- 0.84, N = 3)
GeForce GTX 1070: 186.63 (SE +/- 0.03, N = 3)
GeForce GTX 1060: 137.70 (SE +/- 0.40, N = 3)
GeForce GTX 980 Ti: 216.37 (SE +/- 0.03, N = 3)
GeForce GTX 980: 142.60 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

clpeak, OpenCL Test: Double-Precision Double (GFLOPS Per Watt, More Is Better)
Radeon RX 480: 2.59
Radeon RX 560: 1.87
Radeon RX 580: 2.83
Radeon R9 Fury: 2.52
Radeon R9 290: 3.35
GeForce GTX 960: 0.76
GeForce GTX 970: 1.03
GeForce GTX 780 Ti: 1.12
GeForce GTX 1080 Ti: 2.31
GeForce GTX 1050: 0.78
GeForce GTX 1080: 2.04
GeForce GTX 1070: 1.63
GeForce GTX 1060: 1.42
GeForce GTX 980 Ti: 1.03
GeForce GTX 980: 1.05
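
The per-Watt figures appear to be the benchmark result divided by the average system power drawn during the run. The sketch below works under that assumption: it pairs the clpeak Double-Precision results with the average Watts from the clpeak System Power Consumption Monitor chart that precedes the Transfer Bandwidth results. The pairing of that power chart with this test is an inference, and the per_watt helper is hypothetical.

```python
def per_watt(result, avg_watts):
    """Performance per Watt: benchmark result divided by average system power."""
    if avg_watts <= 0:
        raise ValueError("average power must be positive")
    return result / avg_watts


# clpeak Double-Precision results in GFLOPS, copied from this result file.
clpeak_dp_gflops = {
    "Radeon RX 480": 364.04,
    "Radeon RX 560": 164.50,
    "Radeon R9 290": 599.06,
    "GeForce GTX 960": 92.44,
}

# Average system power in Watts, copied from the clpeak power monitor chart.
# Assumption: this is the chart that belongs to the Double-Precision run.
clpeak_avg_watts = {
    "Radeon RX 480": 140.82,
    "Radeon RX 560": 87.81,
    "Radeon R9 290": 178.68,
    "GeForce GTX 960": 121.06,
}

for gpu, gflops in clpeak_dp_gflops.items():
    ratio = per_watt(gflops, clpeak_avg_watts[gpu])
    print(f"{gpu}: {ratio:.2f} GFLOPS per Watt")
```

Because the power reading is for the whole system, the ratio reflects platform efficiency rather than the graphics card alone.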

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.0, OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
Radeon RX 480: 9885
Radeon RX 560: 4468
Radeon RX 580: 10400
Radeon R9 Fury: 13212
Radeon R9 290: 10400
GeForce GTX 960: 6148
GeForce GTX 970: 10704
GeForce GTX 780 Ti: 9516
GeForce GTX 1080 Ti: 19662
GeForce GTX 1050: 6569
GeForce GTX 1080: 12776
GeForce GTX 1070: 16186
GeForce GTX 1060: 11572
GeForce GTX 980 Ti: 14811
GeForce GTX 980: 11959
SE +/- (N = 3 each, in source order; 14 entries for 15 results): 3.67, 8.69, 3.33, 66.33, 3.00, 20.00, 21.08, 13.57, 15.33, 51.02, 3.84, 38.33, 118.82, 26.67

cl-mem


cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)
Radeon RX 480: 157.87
Radeon RX 560: 93.50
Radeon RX 580: 159.73
Radeon R9 Fury: 123.30
Radeon R9 290: 123.00
GeForce GTX 960: 81.40
GeForce GTX 970: 143.55
GeForce GTX 780 Ti: 271.67
GeForce GTX 1080 Ti: 338.07
GeForce GTX 1050: 95.00
GeForce GTX 1080: 227.20
GeForce GTX 1070: 205.43
GeForce GTX 1060: 151.60
GeForce GTX 980 Ti: 266.17
GeForce GTX 980: 164.47
SE +/- (in source order; 14 entries for 15 results): 2.42 (N = 3), 0.00 (N = 3), 0.41 (N = 3), 0.12 (N = 3), 0.30 (N = 3), 0.00 (N = 2), 0.05 (N = 2), 0.03 (N = 3), 0.22 (N = 3), 1.03 (N = 3), 0.03 (N = 3), 0.21 (N = 3), 0.03 (N = 3), 0.03 (N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13, Benchmark: Write (GB/s Per Watt, More Is Better)
Radeon RX 480: 1.07
Radeon RX 560: 0.91
Radeon RX 580: 1.34
Radeon R9 Fury: 2.41
Radeon R9 290: 1.12
GeForce GTX 960: 0.59
GeForce GTX 970: 0.81
GeForce GTX 780 Ti: 1.08
GeForce GTX 1080 Ti: 2.05
GeForce GTX 1050: 0.97
GeForce GTX 1080: 1.38
GeForce GTX 1070: 1.43
GeForce GTX 1060: 1.10
GeForce GTX 980 Ti: 1.14
GeForce GTX 980: 1.11

FAHBench

FAHBench 2.3.2 (Ns Per Day, More Is Better)
GeForce GTX 960: 58.35 (SE +/- 0.02, N = 3)
GeForce GTX 970: 85.45 (SE +/- 0.05, N = 3)
GeForce GTX 780 Ti: 72.77 (SE +/- 0.09, N = 3)
GeForce GTX 1080 Ti: 186.74 (SE +/- 0.26, N = 3)
GeForce GTX 1050: 49.78 (SE +/- 0.01, N = 3)
GeForce GTX 1080: 145.38 (SE +/- 0.08, N = 3)
GeForce GTX 1070: 132.79 (SE +/- 0.17, N = 3)
GeForce GTX 1060: 97.24 (SE +/- 0.11, N = 3)
GeForce GTX 980 Ti: 108.15 (SE +/- 0.09, N = 3)
GeForce GTX 980: 97.38 (SE +/- 0.05, N = 3)

clpeak

clpeak, System Power Consumption Monitor (Watts, Fewer Is Better)
Devices, in legend order: Radeon RX 480, Radeon RX 560, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 960, GeForce GTX 970, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050, GeForce GTX 1080, GeForce GTX 1070, GeForce GTX 1060, GeForce GTX 980 Ti, GeForce GTX 980
Readings, in source order (14 readings for 15 devices):
Min: 76 / Avg: 99.48 / Max: 105.5
Min: 51.2 / Avg: 76.94 / Max: 84.1
Min: 61.8 / Avg: 95.67 / Max: 107
Min: 63.2 / Avg: 106.3 / Max: 120.3
Min: 146.4 / Avg: 167.81 / Max: 175
Min: 97.8 / Avg: 103.53 / Max: 108
Min: 170.7 / Avg: 175.55 / Max: 180.4
Min: 149.6 / Avg: 155.28 / Max: 157.1
Min: 65.2 / Avg: 76.8 / Max: 89.4
Min: 99 / Avg: 118.32 / Max: 126.1
Min: 103.2 / Avg: 114.13 / Max: 119.3
Min: 75.5 / Avg: 92.25 / Max: 105.6
Min: 134.8 / Avg: 156.02 / Max: 164.4
Min: 90.6 / Avg: 117.27 / Max: 130.7

ViennaCL

ViennaCL 1.4.2, System Power Consumption Monitor (Watts, Fewer Is Better)
Devices, in legend order: Radeon RX 480, Radeon RX 560, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 960, GeForce GTX 970, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050, GeForce GTX 1080, GeForce GTX 1070, GeForce GTX 1060, GeForce GTX 980 Ti, GeForce GTX 980
Readings, in source order (14 readings for 15 devices):
Min: 72.7 / Avg: 93.3 / Max: 104.2
Min: 50.6 / Avg: 72.03 / Max: 82.1
Min: 62.2 / Avg: 78.77 / Max: 96.1
Min: 63.1 / Avg: 86.03 / Max: 126.9
Min: 109.8 / Avg: 132.4 / Max: 158.7
Min: 85.8 / Avg: 108.05 / Max: 130.3
Min: 98.7 / Avg: 125.25 / Max: 151.8
Min: 58.3 / Avg: 105.5 / Max: 152.7
Min: 130.6 / Avg: 146.5 / Max: 162.4
Min: 79.6 / Avg: 86.2 / Max: 92.8
Min: 91.5 / Avg: 111.55 / Max: 131.6
Min: 68.9 / Avg: 80.05 / Max: 91.2
Min: 134.4 / Avg: 142.1 / Max: 149.8
Min: 56 / Avg: 80.3 / Max: 104.6

cl-mem

cl-mem 2017-01-13, Benchmark: Read (GB/s Per Watt, More Is Better)
Radeon RX 480: 1.06
Radeon RX 560: 0.99
Radeon RX 580: 1.24
Radeon R9 Fury: 0.74
Radeon R9 290: 0.56
GeForce GTX 960: 0.72
GeForce GTX 970: 1.02
GeForce GTX 780 Ti: 1.28
GeForce GTX 1050: 1.04
GeForce GTX 1080: 1.79
GeForce GTX 1070: 1.55
GeForce GTX 1060: 1.35
GeForce GTX 980 Ti: 1.57
GeForce GTX 980: 1.17

LuxMark

LuxMark 3.0, OpenCL Device: GPU - Scene: Hotel (Score Per Watt, More Is Better)
Radeon RX 480: 6.64
Radeon RX 560: 5.37
Radeon RX 580: 7.27
Radeon R9 Fury: 6.76
Radeon R9 290: 4.54
GeForce GTX 960: 7.32
GeForce GTX 970: 8.60
GeForce GTX 780 Ti: 4.28
GeForce GTX 1080 Ti: 13.50
GeForce GTX 1050: 9.47
GeForce GTX 1080: 13.40
GeForce GTX 1070: 12.47
GeForce GTX 1060: 11.42
GeForce GTX 980 Ti: 8.82
GeForce GTX 980: 8.62

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s Per Watt, More Is Better)
Radeon RX 480: 1.43
Radeon RX 560: 1.39
Radeon RX 580: 1.61
Radeon R9 Fury: 1.38
Radeon R9 290: 1.41
GeForce GTX 960: 2.22
GeForce GTX 970: 1.97
GeForce GTX 780 Ti: 1.20
GeForce GTX 1080 Ti: 3.14
GeForce GTX 1050: 3.08
GeForce GTX 1080: 3.68
GeForce GTX 1070: 3.28
GeForce GTX 1060: 3.24
GeForce GTX 980 Ti: 1.72
GeForce GTX 980: 2.08

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS Per Watt, More Is Better)
Radeon RX 480: 36.55
Radeon RX 560: 28.37
Radeon RX 580: 37.73
Radeon R9 Fury: 33.16
Radeon R9 290: 23.40
GeForce GTX 960: 23.72
GeForce GTX 970: 29.25
GeForce GTX 780 Ti: 21.88
GeForce GTX 1080 Ti: 65.10
GeForce GTX 1050: 23.27
GeForce GTX 1080: 60.50
GeForce GTX 1070: 50.59
GeForce GTX 1060: 40.80
GeForce GTX 980 Ti: 31.24
GeForce GTX 980: 30.96

FAHBench

FAHBench 2.3.2 (Ns Per Day Per Watt, More Is Better)
GeForce GTX 960: 0.49
GeForce GTX 970: 0.60
GeForce GTX 780 Ti: 0.36
GeForce GTX 1080 Ti: 1.03
GeForce GTX 1050: 0.55
GeForce GTX 1080: 0.98
GeForce GTX 1070: 0.95
GeForce GTX 1060: 0.82
GeForce GTX 980 Ti: 0.61
GeForce GTX 980: 0.64
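
The per-watt figures can be related back to the raw results. The sketch below assumes a per-watt value is the test result divided by the average system power drawn during that test; that definition and both function names are assumptions made for illustration, not something this result file states. The inputs are the FAHBench numbers reported for the GeForce GTX 1080 Ti (186.74 Ns Per Day and 1.03 Ns Per Day Per Watt, the fourth entry in each FAHBench chart).

```python
def per_watt(result, avg_watts):
    """Efficiency figure: benchmark result divided by average power draw."""
    if avg_watts <= 0:
        raise ValueError("average power must be positive")
    return result / avg_watts


def implied_avg_watts(result, result_per_watt):
    """Invert per_watt(): the average power implied by a result and its per-watt figure."""
    if result_per_watt <= 0:
        raise ValueError("per-watt figure must be positive")
    return result / result_per_watt


# FAHBench figures for the GeForce GTX 1080 Ti, taken from the charts.
ns_per_day = 186.74
ns_per_day_per_watt = 1.03

watts = implied_avg_watts(ns_per_day, ns_per_day_per_watt)
print(f"Implied average system power: {watts:.1f} W")
```

Because the published per-watt values are rounded to two decimals, the implied wattage is only approximate.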

LuxMark

LuxMark 3.0, OpenCL Device: GPU - Scene: Luxball HDR (Score Per Watt, More Is Better)
Radeon RX 480: 53.02
Radeon RX 560: 42.74
Radeon RX 580: 52.49
Radeon R9 Fury: 57.12
Radeon R9 290: 41.05
GeForce GTX 960: 42.25
GeForce GTX 970: 55.99
GeForce GTX 780 Ti: 32.25
GeForce GTX 1080 Ti: 76.06
GeForce GTX 1050: 60.24
GeForce GTX 1080: 69.19
GeForce GTX 1070: 86.60
GeForce GTX 1060: 76.35
GeForce GTX 980 Ti: 58.36
GeForce GTX 980: 57.70

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
Radeon RX 480: 9.61 (SE +/- 0.04, N = 3)
Radeon RX 560: 4.95 (SE +/- 0.01, N = 3)
Radeon RX 580: 9.68 (SE +/- 0.04, N = 3)
Radeon R9 Fury: 10.17 (SE +/- 0.22, N = 3)
GeForce GTX 960: 11.24 (SE +/- 0.00, N = 3)
GeForce GTX 970: 11.91 (SE +/- 0.01, N = 3)
GeForce GTX 780 Ti: 12.35 (SE +/- 0.00, N = 3)
GeForce GTX 1080 Ti: 12.51 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 6.12 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.31 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 12.20 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 11.96 (SE +/- 0.01, N = 3)
GeForce GTX 980 Ti: 12.29 (SE +/- 0.00, N = 3)
GeForce GTX 980: 12.03 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

cl-mem 2017-01-13, Benchmark: Copy (GB/s Per Watt, More Is Better)
Radeon RX 480: 1.35
Radeon RX 560: 0.89
Radeon RX 580: 1.11
Radeon R9 Fury: 1.28
Radeon R9 290: 1.11
GeForce GTX 960: 0.60
GeForce GTX 970: 0.88
GeForce GTX 780 Ti: 1.22
GeForce GTX 1050: 0.90
GeForce GTX 1080: 1.44
GeForce GTX 1070: 1.39
GeForce GTX 1060: 1.07
GeForce GTX 980 Ti: 1.04
GeForce GTX 980: 0.91

Mixbench

Mixbench 2016-06-06, Benchmark: Single Precision (GFLOPS Per Watt, More Is Better)
Radeon RX 480: 53.37
Radeon RX 560: 30.69
Radeon RX 580: 70.48
Radeon R9 Fury: 58.60
GeForce GTX 780 Ti: 31.19
GeForce GTX 1060: 49.63
GeForce GTX 980: 45.63

Darktable

Darktable 2.0.3, Test: Server Room - Acceleration: OpenCL (Seconds, Fewer Is Better)
Radeon RX 480: 0.13 (SE +/- 0.00, N = 3)
Radeon RX 560: 0.27 (SE +/- 0.01, N = 3)
Radeon RX 580: 0.17 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 0.13 (SE +/- 0.00, N = 3)
Radeon R9 290: 0.15 (SE +/- 0.02, N = 3)
GeForce GTX 960: 0.25 (SE +/- 0.01, N = 3)
GeForce GTX 970: 0.22 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 0.25 (SE +/- 0.00, N = 3)
GeForce GTX 1080 Ti: 0.18 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 0.23 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 0.18 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 0.19 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 0.19 (SE +/- 0.00, N = 3)
GeForce GTX 980 Ti: 0.22 (SE +/- 0.00, N = 3)
GeForce GTX 980: 0.21 (SE +/- 0.00, N = 3)

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Kernel Latency (us, Fewer Is Better)
Radeon RX 480: 5.86 (SE +/- 0.01, N = 3)
Radeon RX 560: 5.82 (SE +/- 0.01, N = 3)
Radeon RX 580: 5.63 (SE +/- 0.01, N = 3)
Radeon R9 Fury: 5.94 (SE +/- 0.01, N = 3)
Radeon R9 290: 6.67 (SE +/- 0.39, N = 3)
GeForce GTX 960: 3.88 (SE +/- 0.01, N = 3)
GeForce GTX 970: 4.05 (SE +/- 0.05, N = 3)
GeForce GTX 780 Ti: 5.56 (SE +/- 0.04, N = 3)
GeForce GTX 1080 Ti: 3.55 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 3.82 (SE +/- 0.01, N = 3)
GeForce GTX 1080: 4.06 (SE +/- 0.27, N = 3)
GeForce GTX 1070: 3.55 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 4.10 (SE +/- 0.36, N = 3)
GeForce GTX 980 Ti: 4.33 (SE +/- 0.14, N = 3)
GeForce GTX 980: 4.07 (SE +/- 0.03, N = 3)

System Power Consumption Monitor

System Power Consumption Monitor, Phoronix Test Suite System Monitoring (Watts)
Radeon RX 480: Min: 66.5 / Avg: 142.76 / Max: 205.4
Radeon RX 560: Min: 50.2 / Avg: 92.38 / Max: 145.4
Radeon RX 580: Min: 61.1 / Avg: 144.94 / Max: 235
Radeon R9 Fury: Min: 60.8 / Avg: 177.77 / Max: 346
Radeon R9 290: Min: 108.1 / Avg: 170.72 / Max: 311.4
GeForce GTX 960: Min: 52.1 / Avg: 123.58 / Max: 210.4
GeForce GTX 970: Min: 55.7 / Avg: 145.83 / Max: 234.2
GeForce GTX 780 Ti: Min: 57.6 / Avg: 202.67 / Max: 336.8
GeForce GTX 1080 Ti: Min: 51.8 / Avg: 191.22 / Max: 330.8
GeForce GTX 1050: Min: 44.2 / Avg: 92.91 / Max: 132
GeForce GTX 1080: Min: 48.9 / Avg: 149.85 / Max: 266.2
GeForce GTX 1070: Min: 47.8 / Avg: 138.1 / Max: 211.9
GeForce GTX 1060: Min: 48.3 / Avg: 116.84 / Max: 190.4
GeForce GTX 980 Ti: Min: 60.5 / Avg: 188.98 / Max: 336.2
GeForce GTX 980: Min: 54.8 / Avg: 152.53 / Max: 254.7

Darktable

Darktable 2.0.3, Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
Radeon RX 480: 0.14 (SE +/- 0.01, N = 3)
Radeon RX 560: 0.27 (SE +/- 0.01, N = 3)
Radeon RX 580: 0.18 (SE +/- 0.02, N = 3)
Radeon R9 Fury: 0.13 (SE +/- 0.00, N = 3)
Radeon R9 290: 0.17 (SE +/- 0.01, N = 3)
GeForce GTX 960: 0.24 (SE +/- 0.00, N = 3)
GeForce GTX 970: 0.21 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 0.27 (SE +/- 0.01, N = 3)
GeForce GTX 1080 Ti: 0.17 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 0.22 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 0.17 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 0.18 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 0.18 (SE +/- 0.00, N = 3)
GeForce GTX 980 Ti: 0.22 (SE +/- 0.01, N = 3)
GeForce GTX 980: 0.21 (SE +/- 0.00, N = 3)