OpenCL January 2018 Linux Radeon ROCm NVIDIA

Tests by Michael Larabekl for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1801186-PTS-OPENCLJA59
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

NVIDIA GPU Compute 4 Tests
OpenCL 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 1060
January 18 2018
 
GeForce GTX 1070
January 17 2018
 
GeForce GTX 1070 Ti
January 17 2018
 
GeForce GTX 1080
January 17 2018
 
GeForce GTX 1080 Ti
January 17 2018
 
GeForce GTX 680
January 18 2018
 
GeForce GTX 780 Ti
January 18 2018
 
GeForce GTX 960
January 18 2018
 
GeForce GTX 970
January 17 2018
 
GeForce GTX 980 Ti
January 18 2018
 
Radeon R9 285
January 18 2018
 
Radeon R9 290
January 18 2018
 
Radeon R9 Fury
January 18 2018
 
Radeon RX 580
January 18 2018
 
Radeon RX Vega 56
January 18 2018
 
Radeon RX Vega 64
January 18 2018
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL January 2018 Linux Radeon ROCm NVIDIAOpenBenchmarking.orgPhoronix Test SuiteIntel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-A (0606 BIOS)Intel Device 3ec216384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1607/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)XFX AMD Radeon R9 200 2048MBXFX AMD Radeon R9 200 4096MBSapphire AMD Radeon 4096MBMSI AMD Radeon RX 580 8192MBAMD Radeon RX Vega 8192MBRealtek ALC1220DELL P2415QIntel ConnectionUbuntu 17.104.15.0-999-generic (x86_64) 201801144.13.0-25-generic (x86_64)GNOME Shell 3.26.2NVIDIA 390.12modesetting 1.19.54.5.04.5 Mesa 17.4.0-devel- padoka PPA (LLVM 7.0.0)GCC 7.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelsDesktopDisplay DriversOpenGLsCompilerFile-SystemScreen ResolutionOpenCL January 2018 Linux Radeon ROCm NVIDIA BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980 Ti: GPU Compute Cores: 2816

GeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64Result OverviewPhoronix Test Suite100%211%322%433%544%LuxMarkcl-memcl-memcl-memGPU - Luxball HDRWriteCopyRead

OpenCL January 2018 Linux Radeon ROCm NVIDIAcl-mem: Copycl-mem: Readcl-mem: Writedarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLfahbench: juliagpu: GPUluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRmandelgpu: GPUshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read BandwidthGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64140.80154.47145.573.914.161.3599.46121183382.372107553511407115144009.93402.877.374803.55400.12188.53206.53198.573.124.021.14154016267.602939773615773168963001.60556.1110.737126.65449.34188.30206.60196.403.024.021.15148.85171841227.502901792315247207141915.30566.8213.839057.20501.38211.33229.902222.944.011.15178980675.832986654212227218025195.33660.2514.429441.76519.76319.37340.23343.702.453.971.06190.12204310038.9737301029418898289975392.23988.2020.1413280.77592.11121.50135.73150.139.447.324.5941.1456403401.007472569459548480538.00271.462.26246.64239.20272.70257.707.187.715.1973.6084426813.3713135243962277073285.63441.474.684948.60285.8472.3081.9075.5716.5320.2611.6860.4885979128.7012593263592466365537.10223.454.492959.58283.62126.80144.67134.904.677.504.8687.52112700768.90199860611050495506772.47408.536.544362.55296.38219266.80244.403.444.181.54111.10139018187.932342854914755131281594.13718.319.346216.24351.12126.07110.80136.077258988381.674.183281.791090.99187.43116.80209.13198849223.6011911584564943031.83209.80111.90385.73208047859.87165422455102371755.73841.819.117145.18251.10184.10145.70181.63191808662.2314081471279323817.30562.888.006263.25213.02205.20151.90318.83241899430.33159124192155441242.77952.4913.6810681.50373.30223.77160.87372.23254843524.6025012174577802.601103.2316.0812794.87426.79OpenBenchmarking.org

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6470140210280350SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.00, N = 3SE +/- 0.50, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.43, N = 3SE +/- 0.45, N = 3SE +/- 0.45, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3140.80188.53188.30211.33319.37121.50239.2072.30126.80219.00126.07187.43209.80184.10205.20223.771. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6470140210280350SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.27, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.40, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 2.10, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3154.47206.53206.60229.90340.23135.73272.7081.90144.67266.80110.80116.80111.90145.70151.90160.871. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6480160240320400SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 1.25, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.58, N = 3SE +/- 0.22, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3145.57198.57196.40222.00343.70150.13257.7075.57134.90244.40136.07209.13385.73181.63318.83372.231. (CC) gcc options: -O2 -flto -lOpenCL

Darktable

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Boat - Acceleration: OpenCLGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 Ti48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 33.913.123.022.942.459.447.1816.534.673.44

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Masskrug - Acceleration: OpenCLGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 Ti510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.30, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.164.024.024.013.977.327.7120.267.504.18

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Server Room - Acceleration: OpenCLGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 Ti3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.351.141.151.151.064.595.1911.684.861.54

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2GeForce GTX 1060GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 Ti4080120160200SE +/- 0.05, N = 3SE +/- 0.60, N = 3SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.25, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 399.46148.85190.1241.1473.6060.4887.52111.10

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6450M100M150M200M250MSE +/- 196419.50, N = 3SE +/- 613736.81, N = 3SE +/- 1031171.70, N = 3SE +/- 235273.88, N = 3SE +/- 843495.42, N = 3SE +/- 188888.98, N = 3SE +/- 303705.53, N = 3SE +/- 85942.56, N = 3SE +/- 625651.47, N = 3SE +/- 1085388.09, N = 3SE +/- 517038.05, N = 3SE +/- 639092.02, N = 3SE +/- 577899.40, N = 3SE +/- 621968.09, N = 3SE +/- 415881.11, N = 3121183382.37154016267.60171841227.50178980675.83204310038.9756403401.0084426813.3785979128.70112700768.90139018187.93198849223.60208047859.87191808662.23241899430.33254843524.601. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 568001600240032004000SE +/- 0.58, N = 3SE +/- 26.84, N = 3SE +/- 13.32, N = 3SE +/- 9.84, N = 3SE +/- 1.15, N = 3SE +/- 3.67, N = 3SE +/- 0.67, N = 3SE +/- 17.23, N = 3SE +/- 12.55, N = 3SE +/- 3.00, N = 3SE +/- 1.00, N = 3SE +/- 1.86, N = 3SE +/- 4.00, N = 3SE +/- 3.33, N = 3SE +/- 4.33, N = 32107293929012986373074713131259199823427251191165414081591

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 Ti2K4K6K8K10KSE +/- 4.73, N = 3SE +/- 3.51, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 2.19, N = 3SE +/- 2.00, N = 3SE +/- 6.06, N = 3SE +/- 7.13, N = 3SE +/- 1.53, N = 3SE +/- 2.31, N = 355357736792365421029425695243326360618549

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 645K10K15K20K25KSE +/- 2.33, N = 3SE +/- 61.00, N = 3SE +/- 1.15, N = 3SE +/- 3.18, N = 3SE +/- 90.23, N = 3SE +/- 16.17, N = 3SE +/- 36.83, N = 3SE +/- 21.33, N = 3SE +/- 19.10, N = 3SE +/- 53.12, N = 3SE +/- 1.33, N = 3SE +/- 0.88, N = 3SE +/- 123.17, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 373.21, N = 31140715773152471222718898459596225924105041475589881584522455147122419225012

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6460M120M180M240M300MSE +/- 503535.67, N = 3SE +/- 1151240.82, N = 3SE +/- 1459753.08, N = 3SE +/- 1913943.64, N = 3SE +/- 3984585.32, N = 3SE +/- 25083.83, N = 3SE +/- 258135.31, N = 3SE +/- 121075.12, N = 3SE +/- 431906.66, N = 3SE +/- 510200.05, N = 3SE +/- 83691.30, N = 3SE +/- 120193.62, N = 3SE +/- 115711.88, N = 3SE +/- 794414.93, N = 3SE +/- 741807.19, N = 3115144009.93168963001.60207141915.30218025195.33289975392.2348480538.0077073285.6366365537.1095506772.47131281594.1364943031.83102371755.7379323817.30155441242.77174577802.601. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 642004006008001000SE +/- 2.48, N = 3SE +/- 3.52, N = 3SE +/- 0.30, N = 3SE +/- 1.31, N = 3SE +/- 0.82, N = 3SE +/- 3.93, N = 5SE +/- 7.87, N = 3SE +/- 1.16, N = 3SE +/- 0.66, N = 3SE +/- 12.20, N = 6SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 3.87, N = 3SE +/- 0.36, N = 3SE +/- 1.19, N = 3402.87556.11566.82660.25988.20271.46441.47223.45408.53718.31381.67841.81562.88952.491103.231. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.3710.7313.8314.4220.142.264.684.496.549.344.189.118.0013.6816.081. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 643K6K9K12K15KSE +/- 25.86, N = 3SE +/- 31.44, N = 3SE +/- 3.79, N = 3SE +/- 47.55, N = 3SE +/- 67.22, N = 3SE +/- 19.88, N = 3SE +/- 6.86, N = 3SE +/- 1.35, N = 3SE +/- 12.88, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 72.77, N = 3SE +/- 48.47, N = 34803.557126.659057.209441.7613280.774948.602959.584362.556216.243281.797145.186263.2510681.5012794.871. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 642004006008001000SE +/- 2.04, N = 3SE +/- 1.87, N = 3SE +/- 0.48, N = 3SE +/- 1.52, N = 3SE +/- 0.55, N = 3SE +/- 4.62, N = 3SE +/- 1.27, N = 3SE +/- 1.64, N = 3SE +/- 1.19, N = 3SE +/- 1.13, N = 3SE +/- 0.02, N = 3SE +/- 0.99, N = 3SE +/- 0.03, N = 3SE +/- 1.23, N = 3SE +/- 2.39, N = 3400.12449.34501.38519.76592.11246.64285.84283.62296.38351.121090.99251.10213.02373.30426.791. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi