Linux GPU Compute EOY 2024

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2412100-PTS-GPUCOMP547
Run Management

Result Identifier | Date | Test Duration
Arc A580 | December 10 | 51 Minutes
Arc A750 | December 10 | 47 Minutes
Arc A770 | December 10 | 55 Minutes
RTX 4060 | December 06 | 1 Hour, 49 Minutes
RTX 4070 | December 05 | 1 Hour, 11 Minutes
RTX 4070 SUPER | December 05 | 1 Hour, 4 Minutes
RX 7600 XT | December 10 | 1 Hour, 34 Minutes
RX 7700 XT | December 10 | 1 Hour, 13 Minutes
RX 7800 XT | December 10 | 1 Hour, 14 Minutes
Average | | 1 Hour, 11 Minutes

Linux GPU Compute EOY 2024: System Details

RTX 4070 SUPER (full configuration):
  Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores)
  Motherboard: ASUS ROG MAXIMUS Z890 HERO (1101 BIOS)
  Chipset: Intel Device ae7f
  Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1
  Disk: 4001GB Western Digital WD_BLACK SN850X 4000GB + 1000GB Western Digital WDS100T1X0E-00AFY0
  Graphics: ASUS NVIDIA GeForce RTX 4070 SUPER 12GB
  Audio: Intel Device 7f50
  Monitor: ASUS VP28U
  Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7
  OS: Ubuntu 24.10
  Kernel: 6.11.0-9-generic (x86_64)
  Desktop: GNOME Shell 47.0
  Display Server: X Server 1.21.1.13
  Display Driver: NVIDIA 565.57.01
  OpenGL: 4.6.0
  OpenCL: OpenCL 3.0 CUDA 12.7.33
  Compiler: GCC 14.2.0
  File-System: ext4
  Screen Resolution: 3840x2160

For the remaining configurations, only the fields the result file lists for each of them are shown.

RTX 4070:
  Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB
  Graphics: ASUS NVIDIA GeForce RTX 4070 12GB

RTX 4060:
  Graphics: ASUS NVIDIA GeForce RTX 4060 8GB

Arc A580:
  Disk: 4001GB Western Digital WD_BLACK SN850X 4000GB + 1000GB Western Digital WDS100T1X0E-00AFY0
  Graphics: Intel Arc A580 DG2 8GB
  Kernel: 6.13.0-rc1-phx (x86_64)
  OpenGL: 4.6 Mesa 25.0~git2412080600.8dda40~oibaf~o (git-8dda40c 2024-12-08 oracular-oibaf-pp
  OpenCL: OpenCL 3.0

Arc A750:
  Graphics: Intel Arc A750 DG2 8GB

Arc A770:
  Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB
  Graphics: Intel Arc A770 DG2 16GB

RX 7600 XT:
  Disk: 4001GB Western Digital WD_BLACK SN850X 4000GB + 1000GB Western Digital WDS100T1X0E-00AFY0
  Graphics: XFX AMD Radeon RX 7600 XT 16GB
  OpenGL: 4.6 Mesa 24.3.0-devel (LLVM 19.1.2 DRM 3.59)
  OpenCL: OpenCL 2.1 AMD-APP (3635.0)

RX 7700 XT:
  Graphics: XFX AMD Radeon RX 7700 XT 12GB

RX 7800 XT:
  Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB
  Graphics: AMD Radeon RX 7800 XT 16GB

Kernel Details
- RTX 4070 SUPER: nouveau.modeset=0 - Transparent Huge Pages: madvise
- RTX 4070: nouveau.modeset=0 - Transparent Huge Pages: madvise
- RTX 4060: nouveau.modeset=0 - Transparent Huge Pages: madvise
- Arc A580: Transparent Huge Pages: madvise
- Arc A750: Transparent Huge Pages: madvise
- Arc A770: Transparent Huge Pages: madvise
- RX 7600 XT: Transparent Huge Pages: madvise
- RX 7700 XT: Transparent Huge Pages: madvise
- RX 7800 XT: Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_pstate performance (EPP: default) - CPU Microcode: 0x113 - Thermald 2.5.8

Graphics Details
- RTX 4070 SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.69.00.01
- RTX 4070: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.49.00.03
- RTX 4060: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3
- RX 7600 XT: BAR1 / Visible vRAM Size: 16368 MB
- RX 7700 XT: BAR1 / Visible vRAM Size: 12272 MB
- RX 7800 XT: BAR1 / Visible vRAM Size: 16368 MB

OpenCL Details
- RTX 4070 SUPER: GPU Compute Cores: 7168
- RTX 4070: GPU Compute Cores: 5888
- RTX 4060: GPU Compute Cores: 3072

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

[Chart: Result Overview (Phoronix Test Suite). Relative performance of the RTX 4070 SUPER, RTX 4070, RTX 4060, Arc A580, Arc A750, Arc A770, RX 7600 XT, RX 7700 XT and RX 7800 XT across Hashcat, clpeak, SHOC Scalable HeterOgeneous Computing, FluidX3D, cl-mem and Darktable; axis from 100% to 363%.]
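
An overview chart of this kind condenses many per-test results into one relative figure per GPU. A common way to do that is the geometric mean of per-test ratios. The sketch below applies it to the four Darktable timings reported in this result file for the RTX 4060 and RTX 4070 SUPER. Whether the overview chart was computed exactly this way is an assumption, and the helper names `geometric_mean` and `relative_speed` are illustrative only.

```python
import math

# Darktable OpenCL times in seconds (fewer is better), from this result file.
# Order: Boat, Masskrug, Server Room, Server Rack.
rtx_4060 = [2.259, 1.667, 0.999, 0.132]
rtx_4070_super = [1.647, 1.563, 0.883, 0.116]


def geometric_mean(values):
    """Geometric mean, computed in log space for numerical stability."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def relative_speed(baseline_times, candidate_times):
    """Per-test speedup of the candidate over the baseline for time-based results."""
    return [b / c for b, c in zip(baseline_times, candidate_times)]


speedups = relative_speed(rtx_4060, rtx_4070_super)
overall = geometric_mean(speedups)
print("per-test speedups:", [round(s, 3) for s in speedups])
print(f"geometric mean speedup: {overall:.3f}")
```

For "More Is Better" metrics the ratio would be inverted (candidate divided by baseline) before averaging.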

Linux GPU Compute EOY 2024: Result Table

A dash marks a test with no result for that GPU.

Test | RTX 4070 SUPER | RTX 4070 | RTX 4060 | Arc A580 | Arc A750 | Arc A770 | RX 7600 XT | RX 7700 XT | RX 7800 XT
darktable: Boat - OpenCL | 1.647 | 1.717 | 2.259 | 2.679 | 2.674 | 2.691 | 2.453 | 1.942 | 1.774
darktable: Masskrug - OpenCL | 1.563 | 1.594 | 1.667 | 1.629 | 1.623 | 1.628 | 1.624 | 1.547 | 1.533
darktable: Server Room - OpenCL | 0.883 | 0.907 | 0.999 | 0.884 | 0.879 | 0.874 | 0.952 | 0.862 | 0.822
darktable: Server Rack - OpenCL | 0.116 | 0.123 | 0.132 | 0.111 | 0.111 | 0.112 | 0.130 | 0.121 | 0.116
shoc: OpenCL - Texture Read Bandwidth | 2870.01 | 2824.70 | 1761.66 | 724.353 | 883.838 | 991.236 | 795.413 | 1023.845 | 1101.66
shoc: OpenCL - FFT SP | 1284.24 | 1333.87 | 721.364 | 1191.77 | 1174.22 | 1226.81 | 778.524 | 1169.06 | 1751.85
shoc: OpenCL - GEMM SGEMM_N | 11512.1 | 9693.12 | 4525.11 | 1778.83 | 2202.80 | 2512.05 | 2247.59 | 4314.50 | 4704.98
shoc: OpenCL - MD5 Hash | 43.8498 | 36.2607 | 18.8736 | 18.4234 | 22.7955 | 25.8160 | 15.0274 | 27.1628 | 29.1842
shoc: OpenCL - Reduction | 379.623 | 379.781 | 246.881 | 69.7189 | 78.1959 | 134.2608 | 260.615 | 350.024 | 543.158
shoc: OpenCL - S3D | 295.203 | 287.507 | 155.074 | 215.036 | 228.222 | 228.446 | 74.9588 | 165.449 | 213.109
blender: BMW27 - NVIDIA CUDA | 10.18 | 12.18 | 19.25 | - | - | - | - | - | -
blender: BMW27 - NVIDIA OptiX | 5.78 | 6.64 | 9.30 | - | - | - | - | - | -
blender: Classroom - NVIDIA CUDA | 19.50 | 24.44 | 38.89 | - | - | - | - | - | -
blender: Classroom - NVIDIA OptiX | 13.36 | 16.96 | 24.51 | - | - | - | - | - | -
blender: Fishy Cat - NVIDIA CUDA | 19.57 | 25.39 | 39.56 | - | - | - | - | - | -
blender: Fishy Cat - NVIDIA OptiX | 10.09 | 12.79 | 19.61 | - | - | - | - | - | -
blender: Pabellon Barcelona - NVIDIA CUDA | 43.42 | 51.93 | 91.41 | - | - | - | - | - | -
blender: Pabellon Barcelona - NVIDIA OptiX | 14.86 | 17.32 | 26.58 | - | - | - | - | - | -
blender: Barbershop - NVIDIA CUDA | 80.43 | 103.62 | 163.52 | - | - | - | - | - | -
blender: Barbershop - NVIDIA OptiX | 54.63 | 69.02 | 101.48 | - | - | - | - | - | -
blender: Junkshop - NVIDIA CUDA | 17.36 | 26.44 | 32.14 | - | - | - | - | - | -
blender: Junkshop - NVIDIA OptiX | 10.86 | 17.54 | 19.69 | - | - | - | - | - | -
blender: BMW27 - Intel oneAPI | - | - | - | 15.71 | 13.66 | 13.22 | - | - | -
blender: Classroom - Intel oneAPI | - | - | - | 44.04 | 37.54 | 36.03 | - | - | -
blender: Fishy Cat - Intel oneAPI | - | - | - | 20.15 | 17.33 | 16.45 | - | - | -
blender: Pabellon Barcelona - Intel oneAPI | - | - | - | 50.16 | 43.24 | 44.78 | - | - | -
blender: Barbershop - Intel oneAPI | - | - | - | 199.86 | 172.98 | 167.35 | - | - | -
blender: Junkshop - Intel oneAPI | - | - | - | 29.31 | 25.82 | 25.97 | - | - | -
blender: BMW27 - Radeon HIP | - | - | - | - | - | - | 30.18 | 21.14 | 18.52
blender: Classroom - Radeon HIP | - | - | - | - | - | - | 55.29 | 36.27 | 31.91
blender: Fishy Cat - Radeon HIP | - | - | - | - | - | - | 50.46 | 35.26 | 31.78
blender: Pabellon Barcelona - Radeon HIP | - | - | - | - | - | - | 116.64 | 79.48 | 74.01
blender: Barbershop - Radeon HIP | - | - | - | - | - | - | 231.01 | 151.86 | 130.12
blender: Junkshop - Radeon HIP | - | - | - | - | - | - | 44.33 | 29.93 | 26.11
indigobench: OpenCL GPU - Supercar | 51.946 | 48.006 | 28.894 | - | - | - | - | 38.553 | 45.681
indigobench: OpenCL GPU - Bedroom | 19.519 | 18.032 | 9.707 | - | - | - | - | 13.670 | 17.198
hashcat: MD5 | 67447333333 | 56943850000 | 29787116667 | 26249000000 | 31441700000 | 35490683333 | 23838216667 | 39745083333 | 40131733333
gpuowl: 57885161 | 853.24 | 711.91 | 375.99 | - | - | - | 571.86 | 899.82 | 960.92
hashcat: SHA1 | 21927516667 | 18244733333 | 9512350000 | 4574160000 | 5495680000 | 6209080000 | 10250600000 | 16936050000 | 17327983333
hashcat: SHA-512 | 2778583333 | 2324350000 | 1213550000 | 787833333 | 943050000 | 1064533333 | 1156500000 | 1891250000 | 1903750000
hashcat: 7-Zip | 1199963 | 1028375 | 538750 | 248000 | 247175 | 245133 | 492987 | 781988 | 840613
gpuowl: 77936867 | 634.38 | 527.98 | 277.29 | - | - | - | 424.99 | 669.64 | 716.85
gpuowl: 332220523 | 134.75 | 112.10 | 58.57 | - | - | - | 86.01 | 137.04 | 147.08
hashcat: TrueCrypt RIPEMD160 + XTS | 812346 | 690588 | 357007 | 271100 | 325500 | 368263 | 317914 | 401347 | 478200
cl-mem: Read | 442.8 | 443.2 | 238.0 | 143.7 | 157.8 | 242.9 | 260.9 | 390.7 | 560.5
cl-mem: Write | 397.6 | 391.1 | 225.1 | 294.4 | 281.1 | 428.2 | 256.7 | 386.5 | 536.1
cl-mem: Copy | 328.7 | 328.4 | 213.5 | 259.6 | 270.1 | 310.6 | 236.5 | 353.4 | 489.1
clpeak: Global Memory Bandwidth | 432.53 | 435.77 | 236.19 | 387.92 | 396.94 | 396.50 | 226.32 | 358.37 | 489.35
clpeak: Single-Precision Compute | 34179.19 | 27884.65 | 14678.32 | 9755.80 | 11384.24 | 13005.74 | 8495.58 | 15382.86 | 16593.53
clpeak: Double-Precision Compute | 619.64 | 510.86 | 265.89 | - | - | - | 342.10 | 570.14 | 627.85
clpeak: Integer Compute | 17337.73 | 14278.03 | 7511.14 | 4047.25 | 4841.89 | 5521.58 | 1801.61 | 3382.56 | 3633.89
clpeak: Integer 24-bit Compute | 17376.90 | 14325.53 | 7530.06 | 4046.13 | 4844.46 | 5510.61 | 7378.90 | 12911.62 | 13703.17
clpeak: Kernel Latency | 4.66 | 4.62 | 4.68 | 35.16 | 32.48 | 34.50 | 17.24 | 18.77 | 18.63
fluidx3d: FP32-FP32 | 2783 | 2878 | 1603 | 2447 | 2416 | 2441 | 1169 | 1388 | 1709
fluidx3d: FP32-FP16S | 5133 | 4631 | 2960 | 3624 | 3814 | 4098 | 2175 | 2588 | 3126
fluidx3d: FP32-FP16C | 5595 | 5381 | 3106 | 3304 | 3759 | 3893 | 2341 | 2667 | 3113

Darktable

Darktable is an open-source photography workflow application. This test uses any system-installed Darktable program or, on Windows, automatically downloads the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 4.8.1 - Test: Boat - Acceleration: OpenCL (Seconds, Fewer Is Better)
RX 7800 XT: 1.774 (SE +/- 0.002, N = 10)
RX 7700 XT: 1.942 (SE +/- 0.002, N = 9)
RX 7600 XT: 2.453 (SE +/- 0.005, N = 8)
Arc A770: 2.691 (SE +/- 0.004, N = 8)
Arc A750: 2.674 (SE +/- 0.005, N = 8)
Arc A580: 2.679 (SE +/- 0.005, N = 8)
RTX 4060: 2.259 (SE +/- 0.004, N = 9)
RTX 4070: 1.717 (SE +/- 0.003, N = 10)
RTX 4070 SUPER: 1.647 (SE +/- 0.004, N = 9)

Darktable 4.8.1 - Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
RX 7800 XT: 1.533 (SE +/- 0.007, N = 10)
RX 7700 XT: 1.547 (SE +/- 0.006, N = 10)
RX 7600 XT: 1.624 (SE +/- 0.007, N = 10)
Arc A770: 1.628 (SE +/- 0.005, N = 10)
Arc A750: 1.623 (SE +/- 0.007, N = 10)
Arc A580: 1.629 (SE +/- 0.006, N = 10)
RTX 4060: 1.667 (SE +/- 0.003, N = 10)
RTX 4070: 1.594 (SE +/- 0.005, N = 10)
RTX 4070 SUPER: 1.563 (SE +/- 0.004, N = 10)

Darktable 4.8.1 - Test: Server Room - Acceleration: OpenCL (Seconds, Fewer Is Better)
RX 7800 XT: 0.822 (SE +/- 0.006, N = 11)
RX 7700 XT: 0.862 (SE +/- 0.005, N = 11)
RX 7600 XT: 0.952 (SE +/- 0.006, N = 11)
Arc A770: 0.874 (SE +/- 0.001, N = 11)
Arc A750: 0.879 (SE +/- 0.001, N = 11)
Arc A580: 0.884 (SE +/- 0.002, N = 11)
RTX 4060: 0.999 (SE +/- 0.001, N = 11)
RTX 4070: 0.907 (SE +/- 0.002, N = 11)
RTX 4070 SUPER: 0.883 (SE +/- 0.002, N = 11)

Darktable 4.8.1 - Test: Server Rack - Acceleration: OpenCL (Seconds, Fewer Is Better)
RX 7800 XT: 0.116 (SE +/- 0.001, N = 15)
RX 7700 XT: 0.121 (SE +/- 0.001, N = 13)
RX 7600 XT: 0.130 (SE +/- 0.001, N = 15)
Arc A770: 0.112 (SE +/- 0.001, N = 14)
Arc A750: 0.111 (SE +/- 0.001, N = 14)
Arc A580: 0.111 (SE +/- 0.001, N = 14)
RTX 4060: 0.132 (SE +/- 0.000, N = 13)
RTX 4070: 0.123 (SE +/- 0.000, N = 13)
RTX 4070 SUPER: 0.116 (SE +/- 0.000, N = 13)
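
Every result in this file carries an "SE +/- x, N = y" annotation: the standard error of the reported mean over N runs. The sketch below shows the textbook calculation (sample standard deviation divided by the square root of N). The run times in `runs` are hypothetical, not taken from this result file, and whether the Phoronix Test Suite uses the sample or the population standard deviation is an assumption here.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run timings in seconds, for illustration only.
runs = [1.650, 1.645, 1.652, 1.641, 1.647]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.3f} (SE +/- {se:.3f}, N = {len(runs)})")
```

A small SE relative to the mean, as in the Darktable results, indicates that the runs were closely repeatable.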

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)
RX 7800 XT: 1101.66 (SE +/- 3.86, N = 8)
RX 7700 XT: 1023.85 (SE +/- 6.85, N = 8)
RX 7600 XT: 795.41 (SE +/- 1.56, N = 8)
Arc A770: 991.24 (SE +/- 3.42, N = 3)
Arc A750: 883.84 (SE +/- 0.13, N = 3)
Arc A580: 724.35 (SE +/- 1.95, N = 3)
RTX 4060: 1761.66 (SE +/- 1.58, N = 6)
RTX 4070: 2824.70 (SE +/- 1.12, N = 6)
RTX 4070 SUPER: 2870.01 (SE +/- 1.71, N = 6)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)
RX 7800 XT: 1751.85 (SE +/- 2.09, N = 12)
RX 7700 XT: 1169.06 (SE +/- 1.68, N = 12)
RX 7600 XT: 778.52 (SE +/- 0.32, N = 11)
Arc A770: 1226.81 (SE +/- 6.21, N = 14)
Arc A750: 1174.22 (SE +/- 6.19, N = 14)
Arc A580: 1191.77 (SE +/- 4.67, N = 14)
RTX 4060: 721.36 (SE +/- 0.97, N = 11)
RTX 4070: 1333.87 (SE +/- 0.39, N = 11)
RTX 4070 SUPER: 1284.24 (SE +/- 0.73, N = 11)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS, More Is Better)
RX 7800 XT: 4704.98 (SE +/- 5.84, N = 10)
RX 7700 XT: 4314.50 (SE +/- 5.03, N = 10)
RX 7600 XT: 2247.59 (SE +/- 3.33, N = 9)
Arc A770: 2512.05 (SE +/- 31.44, N = 15)
Arc A750: 2202.80 (SE +/- 5.35, N = 11)
Arc A580: 1778.83 (SE +/- 11.95, N = 10)
RTX 4060: 4525.11 (SE +/- 24.39, N = 9)
RTX 4070: 9693.12 (SE +/- 6.25, N = 10)
RTX 4070 SUPER: 11512.10 (SE +/- 19.61, N = 11)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)
RX 7800 XT: 29.18 (SE +/- 0.05, N = 13)
RX 7700 XT: 27.16 (SE +/- 0.00, N = 13)
RX 7600 XT: 15.03 (SE +/- 0.03, N = 12)
Arc A770: 25.82 (SE +/- 0.03, N = 15)
Arc A750: 22.80 (SE +/- 0.01, N = 13)
Arc A580: 18.42 (SE +/- 0.01, N = 13)
RTX 4060: 18.87 (SE +/- 0.01, N = 12)
RTX 4070: 36.26 (SE +/- 0.02, N = 13)
RTX 4070 SUPER: 43.85 (SE +/- 0.03, N = 13)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Reduction (GB/s, More Is Better)
RX 7800 XT: 543.16 (SE +/- 2.78, N = 12)
RX 7700 XT: 350.02 (SE +/- 0.53, N = 11)
RX 7600 XT: 260.62 (SE +/- 0.08, N = 11)
Arc A770: 134.26 (SE +/- 6.54, N = 12)
Arc A750: 78.20 (SE +/- 0.13, N = 10)
Arc A580: 69.72 (SE +/- 0.18, N = 9)
RTX 4060: 246.88 (SE +/- 0.02, N = 11)
RTX 4070: 379.78 (SE +/- 0.88, N = 15)
RTX 4070 SUPER: 379.62 (SE +/- 0.08, N = 12)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: S3D (GFLOPS, More Is Better)
RX 7800 XT: 213.11 (SE +/- 1.91, N = 15)
RX 7700 XT: 165.45 (SE +/- 1.18, N = 3)
RX 7600 XT: 74.96 (SE +/- 0.62, N = 9)
Arc A770: 228.45 (SE +/- 0.22, N = 15)
Arc A750: 228.22 (SE +/- 0.31, N = 15)
Arc A580: 215.04 (SE +/- 0.23, N = 15)
RTX 4060: 155.07 (SE +/- 0.15, N = 15)
RTX 4070: 287.51 (SE +/- 0.18, N = 13)
RTX 4070 SUPER: 295.20 (SE +/- 0.08, N = 12)

All SHOC results: (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Blender

Blender 4.3 - Blend File: BMW27 - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4060: 19.25 (SE +/- 0.02, N = 3)
RTX 4070: 12.18 (SE +/- 0.05, N = 4)
RTX 4070 SUPER: 10.18 (SE +/- 0.03, N = 5)

Blender 4.3 - Blend File: BMW27 - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4060: 9.30 (SE +/- 0.09, N = 5)
RTX 4070: 6.64 (SE +/- 0.03, N = 6)
RTX 4070 SUPER: 5.78 (SE +/- 0.03, N = 15)

Blender 4.3 - Blend File: Classroom - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4060: 38.89 (SE +/- 0.02, N = 3)
RTX 4070: 24.44 (SE +/- 0.00, N = 3)
RTX 4070 SUPER: 19.50 (SE +/- 0.01, N = 3)

Blender 4.3 - Blend File: Classroom - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4060: 24.51 (SE +/- 0.03, N = 3)
RTX 4070: 16.96 (SE +/- 0.06, N = 3)
RTX 4070 SUPER: 13.36 (SE +/- 0.00, N = 4)

Blender 4.3 - Blend File: Fishy Cat - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4060: 39.56 (SE +/- 0.02, N = 3)
RTX 4070: 25.39 (SE +/- 0.07, N = 3)
RTX 4070 SUPER: 19.57 (SE +/- 0.01, N = 3)

Blender 4.3 - Blend File: Fishy Cat - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4060: 19.61 (SE +/- 0.15, N = 3)
RTX 4070: 12.79 (SE +/- 0.07, N = 4)
RTX 4070 SUPER: 10.09 (SE +/- 0.10, N = 5)

Blender 4.3 - Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4060: 91.41 (SE +/- 0.03, N = 3)
RTX 4070: 51.93 (SE +/- 0.04, N = 3)
RTX 4070 SUPER: 43.42 (SE +/- 0.03, N = 3)

Blender 4.3 - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4060: 26.58 (SE +/- 0.02, N = 3)
RTX 4070: 17.32 (SE +/- 0.03, N = 3)
RTX 4070 SUPER: 14.86 (SE +/- 0.02, N = 4)

Blender 4.3 - Blend File: Barbershop - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4060: 163.52 (SE +/- 0.37, N = 3)
RTX 4070: 103.62 (SE +/- 0.36, N = 3)
RTX 4070 SUPER: 80.43 (SE +/- 0.03, N = 3)

Blender 4.3 - Blend File: Barbershop - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4060: 101.48 (SE +/- 0.04, N = 3)
RTX 4070: 69.02 (SE +/- 0.23, N = 3)
RTX 4070 SUPER: 54.63 (SE +/- 0.05, N = 3)

Blender 4.3 - Blend File: Junkshop - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4060: 32.14 (SE +/- 0.01, N = 3)
RTX 4070: 26.44 (SE +/- 0.11, N = 3)
RTX 4070 SUPER: 17.36 (SE +/- 0.07, N = 3)

Blender 4.3 - Blend File: Junkshop - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4060: 19.69 (SE +/- 0.01, N = 3)
RTX 4070: 17.54 (SE +/- 0.25, N = 3)
RTX 4070 SUPER: 10.86 (SE +/- 0.01, N = 5)
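
Because each scene was rendered on the same card with both the CUDA and OptiX back ends, the OptiX advantage can be read off as a simple ratio of render times. The sketch below does this for the RTX 4070 SUPER using the Blender 4.3 times reported above; the dictionary names are illustrative only.

```python
# Blender 4.3 render times in seconds for the RTX 4070 SUPER (fewer is better),
# copied from the CUDA and OptiX results in this file.
cuda_seconds = {
    "BMW27": 10.18,
    "Classroom": 19.50,
    "Fishy Cat": 19.57,
    "Pabellon Barcelona": 43.42,
    "Barbershop": 80.43,
    "Junkshop": 17.36,
}
optix_seconds = {
    "BMW27": 5.78,
    "Classroom": 13.36,
    "Fishy Cat": 10.09,
    "Pabellon Barcelona": 14.86,
    "Barbershop": 54.63,
    "Junkshop": 10.86,
}

# Speedup > 1 means OptiX finished the scene faster than CUDA.
speedup = {scene: cuda_seconds[scene] / optix_seconds[scene] for scene in cuda_seconds}

for scene, ratio in sorted(speedup.items(), key=lambda item: item[1], reverse=True):
    print(f"{scene:20s} OptiX is {ratio:.2f}x faster than CUDA")
```

The same ratio can be computed for the RTX 4060 and RTX 4070 by substituting their times.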

Blender 4.3 - Blend File: BMW27 - Compute: Intel oneAPI (Seconds, Fewer Is Better)
Arc A770: 13.22 (SE +/- 0.36, N = 15)
Arc A750: 13.66 (SE +/- 0.36, N = 15)
Arc A580: 15.71 (SE +/- 0.36, N = 15)

Blend File: BMW27 - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

Blender 4.3 - Blend File: Classroom - Compute: Intel oneAPI (Seconds, Fewer Is Better)
Arc A770: 36.03 (SE +/- 0.04, N = 3)
Arc A750: 37.54 (SE +/- 0.04, N = 3)
Arc A580: 44.04 (SE +/- 0.05, N = 3)

Blend File: Classroom - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

Blender 4.3 - Blend File: Fishy Cat - Compute: Intel oneAPI (Seconds, Fewer Is Better)
Arc A770: 16.45 (SE +/- 0.58, N = 15)
Arc A750: 17.33 (SE +/- 0.58, N = 15)
Arc A580: 20.15 (SE +/- 0.58, N = 15)

Blend File: Fishy Cat - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

Blender 4.3 - Blend File: Pabellon Barcelona - Compute: Intel oneAPI (Seconds, Fewer Is Better)
Arc A770: 44.78 (SE +/- 0.28, N = 3)
Arc A750: 43.24 (SE +/- 0.29, N = 3)
Arc A580: 50.16 (SE +/- 0.28, N = 3)

Blend File: Pabellon Barcelona - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

Blender 4.3 - Blend File: Barbershop - Compute: Intel oneAPI (Seconds, Fewer Is Better)
Arc A770: 167.35 (SE +/- 0.30, N = 3)
Arc A750: 172.98 (SE +/- 0.32, N = 3)
Arc A580: 199.86 (SE +/- 0.39, N = 3)

Blend File: Barbershop - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 2, total unfreed memory 0.000107 MB

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 2, total unfreed memory 0.000107 MB

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 2, total unfreed memory 0.000107 MB

Blender 4.3 - Blend File: Junkshop - Compute: Intel oneAPI (Seconds, Fewer Is Better)
Arc A770: 25.97 (SE +/- 0.04, N = 3)
Arc A750: 25.82 (SE +/- 0.02, N = 3)
Arc A580: 29.31 (SE +/- 0.10, N = 3)

Blend File: Junkshop - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

Blender 4.3 - Blend File: BMW27 - Compute: Radeon HIP (Seconds, Fewer Is Better)
RX 7800 XT: 18.52 (SE +/- 0.04, N = 3)
RX 7700 XT: 21.14 (SE +/- 0.23, N = 3)
RX 7600 XT: 30.18 (SE +/- 0.15, N = 3)

Blender 4.3 - Blend File: Classroom - Compute: Radeon HIP (Seconds, Fewer Is Better)
RX 7800 XT: 31.91 (SE +/- 0.12, N = 3)
RX 7700 XT: 36.27 (SE +/- 0.43, N = 4)
RX 7600 XT: 55.29 (SE +/- 0.04, N = 3)

Blender 4.3 - Blend File: Fishy Cat - Compute: Radeon HIP (Seconds, Fewer Is Better)
RX 7800 XT: 31.78 (SE +/- 0.19, N = 3)
RX 7700 XT: 35.26 (SE +/- 0.17, N = 3)
RX 7600 XT: 50.46 (SE +/- 0.01, N = 3)

Blender 4.3 - Blend File: Pabellon Barcelona - Compute: Radeon HIP (Seconds, Fewer Is Better)
RX 7800 XT: 74.01 (SE +/- 0.06, N = 3)
RX 7700 XT: 79.48 (SE +/- 0.33, N = 3)
RX 7600 XT: 116.64 (SE +/- 0.50, N = 3)

Blender 4.3 - Blend File: Barbershop - Compute: Radeon HIP (Seconds, Fewer Is Better)
RX 7800 XT: 130.12 (SE +/- 0.50, N = 3)
RX 7700 XT: 151.86 (SE +/- 0.09, N = 3)
RX 7600 XT: 231.01 (SE +/- 0.19, N = 3)

Blender 4.3 - Blend File: Junkshop - Compute: Radeon HIP (Seconds, Fewer Is Better)
RX 7800 XT: 26.11 (SE +/- 0.02, N = 3)
RX 7700 XT: 29.93 (SE +/- 0.04, N = 3)
RX 7600 XT: 44.33 (SE +/- 0.12, N = 3)

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: OpenCL GPU - Scene: Supercar (M samples/s, More Is Better)
RX 7800 XT: 45.68 (SE +/- 0.09, N = 3)
RX 7700 XT: 38.55 (SE +/- 0.04, N = 3)
RTX 4060: 28.89 (SE +/- 0.05, N = 3)
RTX 4070: 48.01 (SE +/- 0.17, N = 3)
RTX 4070 SUPER: 51.95 (SE +/- 0.15, N = 3)

IndigoBench 4.4 - Acceleration: OpenCL GPU - Scene: Bedroom (M samples/s, More Is Better)
RX 7800 XT: 17.198 (SE +/- 0.011, N = 3)
RX 7700 XT: 13.670 (SE +/- 0.018, N = 3)
RTX 4060: 9.707 (SE +/- 0.002, N = 3)
RTX 4070: 18.032 (SE +/- 0.005, N = 3)
RTX 4070 SUPER: 19.519 (SE +/- 0.006, N = 3)

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.
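
The Hashcat results are reported in H/s, that is, candidate hashes computed per second. As a rough illustration of what that unit measures, the sketch below times MD5 over a batch of candidate strings with Python's standard `hashlib` on a single CPU thread. It is not how Hashcat works internally and its numbers are not comparable with the GPU figures in this file; the function name `md5_rate` and the candidate format are assumptions made for the example.

```python
import hashlib
import time


def md5_rate(n_candidates: int = 100_000) -> float:
    """Hash n_candidates short strings with MD5 and return hashes per second."""
    start = time.perf_counter()
    for i in range(n_candidates):
        # Each candidate is just the decimal counter; real tools generate
        # candidates from wordlists, masks, or rules.
        hashlib.md5(str(i).encode("ascii")).digest()
    elapsed = time.perf_counter() - start
    return n_candidates / elapsed


rate = md5_rate()
print(f"CPU, single thread: about {rate:,.0f} H/s")
```

The gap between a figure like this and the tens of billions of H/s in the Hashcat MD5 results reflects the massive parallelism of GPU hashing.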

Hashcat 6.2.4 - Benchmark: MD5 (H/s, More Is Better)
RX 7800 XT: 40131733333 (SE +/- 51306033.87, N = 6)
RX 7700 XT: 39745083333 (SE +/- 39002994.90, N = 6)
RX 7600 XT: 23838216667 (SE +/- 68956476.21, N = 6)
Arc A770: 35490683333 (SE +/- 20961320.20, N = 6)
Arc A750: 31441700000 (SE +/- 8077829.74, N = 6)
Arc A580: 26249000000 (SE +/- 8638479.80, N = 6)
RTX 4060: 29787116667 (SE +/- 24181996.29, N = 6)
RTX 4070: 56943850000 (SE +/- 35399027.76, N = 6)
RTX 4070 SUPER: 67447333333 (SE +/- 49647307.19, N = 6)

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.
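
For readers unfamiliar with Mersenne primality testing, the sketch below shows the classic Lucas-Lehmer test, whose core is a long chain of modular squarings; the "Iterations / Second" metric in the GpuOwl results counts iterations of that general kind. Which test GpuOwl 7.5 actually runs in this benchmark is not stated in this result file, so treat Lucas-Lehmer here as an illustrative stand-in. The pure-Python version is only practical for small exponents, not the ones benchmarked.

```python
def lucas_lehmer(p: int) -> bool:
    """Return True if the Mersenne number 2**p - 1 is prime.

    p must itself be a prime number; the test is not valid otherwise.
    """
    if p == 2:
        return True  # 2**2 - 1 = 3 is prime; the loop below needs p > 2
    m = (1 << p) - 1  # the Mersenne number 2**p - 1
    s = 4
    for _ in range(p - 2):
        # One iteration: square, subtract 2, reduce modulo 2**p - 1.
        s = (s * s - 2) % m
    return s == 0


small_prime_exponents = [3, 5, 7, 11, 13, 17, 19, 23, 31]
mersenne_exponents = [p for p in small_prime_exponents if lucas_lehmer(p)]
print(mersenne_exponents)
```

GPU implementations speed up the squaring step with FFT-based multiplication, which is why exponent size strongly affects the iteration rate.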

GpuOwl 7.5 - Exponent: 57885161 (Iterations / Second, More Is Better)
RX 7800 XT: 960.92 (SE +/- 0.81, N = 3)
RX 7700 XT: 899.82 (SE +/- 0.27, N = 3)
RX 7600 XT: 571.86 (SE +/- 0.11, N = 3)
RTX 4060: 375.99 (SE +/- 0.54, N = 3)
RTX 4070: 711.91 (SE +/- 0.17, N = 3)
RTX 4070 SUPER: 853.24 (SE +/- 0.42, N = 3)
(CXX) g++ options: -O3 -lgmp -lOpenCL

Exponent: 57885161

Arc A580: The test run did not produce a result. E: ./open 20241210 04:35:52 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A750: The test run did not produce a result. E: ./open 20241210 05:51:51 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A770: The test run did not produce a result. E: ./open 20241210 08:19:46 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Hashcat

Hashcat 6.2.4 - Benchmark: SHA1 (H/s, More Is Better)
RX 7800 XT: 17327983333 (SE +/- 22128976.73, N = 6)
RX 7700 XT: 16936050000 (SE +/- 2610204.33, N = 6)
RX 7600 XT: 10250600000 (SE +/- 20629558.08, N = 6)
Arc A770: 6209080000 (SE +/- 2281315.41, N = 5)
Arc A750: 5495680000 (SE +/- 1758806.41, N = 5)
Arc A580: 4574160000 (SE +/- 2199454.48, N = 5)
RTX 4060: 9512350000 (SE +/- 9883142.21, N = 6)
RTX 4070: 18244733333 (SE +/- 13909341.39, N = 6)
RTX 4070 SUPER: 21927516667 (SE +/- 22178749.84, N = 6)

Hashcat 6.2.4 - Benchmark: SHA-512 (H/s, More Is Better)
RX 7800 XT: 1903750000 (SE +/- 9450282.18, N = 6)
RX 7700 XT: 1891250000 (SE +/- 549393.61, N = 6)
RX 7600 XT: 1156500000 (SE +/- 2013123.61, N = 6)
Arc A770: 1064533333 (SE +/- 624855.54, N = 6)
Arc A750: 943050000 (SE +/- 375721.53, N = 6)
Arc A580: 787833333 (SE +/- 202758.75, N = 6)
RTX 4060: 1213550000 (SE +/- 1194082.63, N = 6)
RTX 4070: 2324350000 (SE +/- 1068566.02, N = 6)
RTX 4070 SUPER: 2778583333 (SE +/- 2352079.46, N = 6)

Hashcat 6.2.4 - Benchmark: 7-Zip (H/s, More Is Better)
RX 7800 XT: 840613 (SE +/- 1329.67, N = 8)
RX 7700 XT: 781988 (SE +/- 1268.64, N = 8)
RX 7600 XT: 492987 (SE +/- 3699.93, N = 15)
Arc A770: 245133 (SE +/- 66.67, N = 3)
Arc A750: 247175 (SE +/- 125.00, N = 4)
Arc A580: 248000 (SE +/- 122.47, N = 4)
RTX 4060: 538750 (SE +/- 985.43, N = 8)
RTX 4070: 1028375 (SE +/- 5954.67, N = 8)
RTX 4070 SUPER: 1199963 (SE +/- 2113.56, N = 8)

GpuOwl

GpuOwl 7.5 - Exponent: 77936867 (Iterations / Second, More Is Better)
RX 7800 XT: 716.85 (SE +/- 0.00, N = 3)
RX 7700 XT: 669.64 (SE +/- 0.40, N = 3)
RX 7600 XT: 424.99 (SE +/- 0.00, N = 3)
RTX 4060: 277.29 (SE +/- 0.37, N = 3)
RTX 4070: 527.98 (SE +/- 0.16, N = 3)
RTX 4070 SUPER: 634.38 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3 -lgmp -lOpenCL

Exponent: 77936867

Arc A580: The test run did not produce a result. E: ./open 20241210 04:36:17 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A750: The test run did not produce a result. E: ./open 20241210 05:52:16 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A770: The test run did not produce a result. E: ./open 20241210 08:20:11 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

GpuOwl 7.5 - Exponent: 332220523 (Iterations / Second, More Is Better)
RX 7800 XT: 147.08 (SE +/- 0.19, N = 3)
RX 7700 XT: 137.04 (SE +/- 0.04, N = 3)
RX 7600 XT: 86.01 (SE +/- 0.03, N = 3)
RTX 4060: 58.57 (SE +/- 0.09, N = 3)
RTX 4070: 112.10 (SE +/- 0.01, N = 3)
RTX 4070 SUPER: 134.75 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -lgmp -lOpenCL

Exponent: 332220523

Arc A580: The test run did not produce a result. E: ./open 20241210 04:36:41 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A750: The test run did not produce a result. E: ./open 20241210 05:52:40 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A770: The test run did not produce a result. E: ./open 20241210 08:20:35 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Hashcat

Hashcat 6.2.4 - Benchmark: TrueCrypt RIPEMD160 + XTS (H/s, More Is Better)
RX 7800 XT: 478200 (SE +/- 446.01, N = 8)
RX 7700 XT: 401347 (SE +/- 47254.23, N = 15)
RX 7600 XT: 317914 (SE +/- 147.08, N = 7)
Arc A770: 368263 (SE +/- 212.08, N = 8)
Arc A750: 325500 (SE +/- 190.86, N = 8)
Arc A580: 271100 (SE +/- 198.81, N = 7)
RTX 4060: 357007 (SE +/- 2610.06, N = 15)
RTX 4070: 690588 (SE +/- 2512.78, N = 8)
RTX 4070 SUPER: 812346 (SE +/- 5423.39, N = 13)

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13 - Benchmark: Read (GB/s, More Is Better)
RX 7800 XT: 560.5 (SE +/- 0.54, N = 9)
RX 7700 XT: 390.7 (SE +/- 0.17, N = 8)
RX 7600 XT: 260.9 (SE +/- 0.15, N = 6)
Arc A770: 242.9 (SE +/- 0.09, N = 7)
Arc A750: 157.8 (SE +/- 0.04, N = 6)
Arc A580: 143.7 (SE +/- 0.00, N = 6)
RTX 4060: 238.0 (SE +/- 0.45, N = 6)
RTX 4070: 443.2 (SE +/- 0.80, N = 8)
RTX 4070 SUPER: 442.8 (SE +/- 1.17, N = 8)

cl-mem 2017-01-13 - Benchmark: Write (GB/s, More Is Better)
RX 7800 XT: 536.1 (SE +/- 1.15, N = 9)
RX 7700 XT: 386.5 (SE +/- 0.18, N = 8)
RX 7600 XT: 256.7 (SE +/- 0.10, N = 6)
Arc A770: 428.2 (SE +/- 0.13, N = 7)
Arc A750: 281.1 (SE +/- 0.02, N = 6)
Arc A580: 294.4 (SE +/- 0.03, N = 6)
RTX 4060: 225.1 (SE +/- 0.39, N = 6)
RTX 4070: 391.1 (SE +/- 0.43, N = 8)
RTX 4070 SUPER: 397.6 (SE +/- 0.80, N = 8)

cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
RX 7800 XT: 489.1 (SE +/- 1.30, N = 9)
RX 7700 XT: 353.4 (SE +/- 0.13, N = 8)
RX 7600 XT: 236.5 (SE +/- 0.04, N = 6)
Arc A770: 310.6 (SE +/- 0.03, N = 7)
Arc A750: 270.1 (SE +/- 0.13, N = 6)
Arc A580: 259.6 (SE +/- 0.02, N = 6)
RTX 4060: 213.5 (SE +/- 0.37, N = 6)
RTX 4070: 328.4 (SE +/- 0.39, N = 8)
RTX 4070 SUPER: 328.7 (SE +/- 0.51, N = 8)

All cl-mem results: (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak 1.1.2 - OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
RX 7800 XT: 489.35 (SE +/- 2.08, N = 8)
RX 7700 XT: 358.37 (SE +/- 0.64, N = 8)
RX 7600 XT: 226.32 (SE +/- 0.32, N = 6)
Arc A770: 396.50 (SE +/- 0.19, N = 8)
Arc A750: 396.94 (SE +/- 0.11, N = 9)
Arc A580: 387.92 (SE +/- 0.18, N = 9)
RTX 4060: 236.19 (SE +/- 0.66, N = 10)
RTX 4070: 435.77 (SE +/- 0.06, N = 10)
RTX 4070 SUPER: 432.53 (SE +/- 1.92, N = 10)
(CXX) g++ options: -O3

clpeak 1.1.2 - OpenCL Test: Single-Precision Compute (GFLOPS, More Is Better)
RX 7800 XT: 16593.53 (SE +/- 31.09, N = 12)
RX 7700 XT: 15382.86 (SE +/- 23.93, N = 12)
RX 7600 XT: 8495.58 (SE +/- 12.54, N = 12)
Arc A770: 13005.74 (SE +/- 1.39, N = 4)
Arc A750: 11384.24 (SE +/- 0.54, N = 4)
Arc A580: 9755.80 (SE +/- 0.86, N = 4)
RTX 4060: 14678.32 (SE +/- 70.91, N = 13)
RTX 4070: 27884.65 (SE +/- 141.03, N = 13)
RTX 4070 SUPER: 34179.19 (SE +/- 17.53, N = 13)
(CXX) g++ options: -O3

clpeak 1.1.2 - OpenCL Test: Double-Precision Compute (GFLOPS, More Is Better)
RX 7800 XT: 627.85 (SE +/- 2.78, N = 7)
RX 7700 XT: 570.14 (SE +/- 3.09, N = 7)
RX 7600 XT: 342.10 (SE +/- 0.32, N = 7)
RTX 4060: 265.89 (SE +/- 0.70, N = 6)
RTX 4070: 510.86 (SE +/- 2.22, N = 6)
RTX 4070 SUPER: 619.64 (SE +/- 2.39, N = 6)
(CXX) g++ options: -O3

OpenCL Test: Double-Precision Compute

Arc A580: The test run did not produce a result.

Arc A750: The test run did not produce a result.

Arc A770: The test run did not produce a result.

clpeak 1.1.2 - OpenCL Test: Integer Compute (GIOPS, More Is Better)
RX 7800 XT: 3633.89 (SE +/- 6.45, N = 11)
RX 7700 XT: 3382.56 (SE +/- 2.50, N = 11)
RX 7600 XT: 1801.61 (SE +/- 2.25, N = 11)
Arc A770: 5521.58 (SE +/- 4.18, N = 4)
Arc A750: 4841.89 (SE +/- 3.31, N = 4)
Arc A580: 4047.25 (SE +/- 0.96, N = 4)
RTX 4060: 7511.14 (SE +/- 36.28, N = 13)
RTX 4070: 14278.03 (SE +/- 75.61, N = 13)
RTX 4070 SUPER: 17337.73 (SE +/- 102.12, N = 13)

clpeak 1.1.2 - OpenCL Test: Integer 24-bit Compute (GIOPS, More Is Better)
RX 7800 XT: 13703.17 (SE +/- 66.84, N = 13)
RX 7700 XT: 12911.62 (SE +/- 46.29, N = 13)
RX 7600 XT: 7378.90 (SE +/- 30.00, N = 15)
Arc A770: 5510.61 (SE +/- 5.95, N = 4)
Arc A750: 4844.46 (SE +/- 1.88, N = 4)
Arc A580: 4046.13 (SE +/- 0.30, N = 4)
RTX 4060: 7530.06 (SE +/- 33.56, N = 13)
RTX 4070: 14325.53 (SE +/- 70.70, N = 14)
RTX 4070 SUPER: 17376.90 (SE +/- 93.44, N = 13)

clpeak 1.1.2 - OpenCL Test: Kernel Latency (us, Fewer Is Better)
RX 7800 XT: 18.63 (SE +/- 0.03, N = 12)
RX 7700 XT: 18.77 (SE +/- 0.05, N = 12)
RX 7600 XT: 17.24 (SE +/- 0.18, N = 15)
Arc A770: 34.50 (SE +/- 0.10, N = 12)
Arc A750: 32.48 (SE +/- 0.08, N = 12)
Arc A580: 35.16 (SE +/- 0.02, N = 12)
RTX 4060: 4.68 (SE +/- 0.05, N = 15)
RTX 4070: 4.62 (SE +/- 0.04, N = 15)
RTX 4070 SUPER: 4.66 (SE +/- 0.04, N = 15)

All three clpeak results above: (CXX) g++ options: -O3

FluidX3D

FluidX3D 3.0 - Test: FP32-FP32 (MLUPs/s, More Is Better)
RX 7800 XT: 1709 (SE +/- 8.84, N = 3)
RX 7700 XT: 1388 (SE +/- 3.18, N = 3)
RX 7600 XT: 1169 (SE +/- 4.58, N = 3)
Arc A770: 2441 (SE +/- 7.26, N = 3)
Arc A750: 2416 (SE +/- 10.07, N = 3)
Arc A580: 2447 (SE +/- 17.46, N = 3)
RTX 4060: 1603 (SE +/- 0.00, N = 3)
RTX 4070: 2878 (SE +/- 0.33, N = 3)
RTX 4070 SUPER: 2783 (SE +/- 0.88, N = 3)

FluidX3D 3.0 - Test: FP32-FP16S (MLUPs/s, More Is Better)
RX 7800 XT: 3126 (SE +/- 12.70, N = 3)
RX 7700 XT: 2588 (SE +/- 3.18, N = 3)
RX 7600 XT: 2175 (SE +/- 4.63, N = 3)
Arc A770: 4098 (SE +/- 27.93, N = 15)
Arc A750: 3814 (SE +/- 6.74, N = 3)
Arc A580: 3624 (SE +/- 13.01, N = 3)
RTX 4060: 2960 (SE +/- 0.58, N = 3)
RTX 4070: 4631 (SE +/- 0.88, N = 3)
RTX 4070 SUPER: 5133 (SE +/- 0.67, N = 3)

FluidX3D 3.0 - Test: FP32-FP16C (MLUPs/s, More Is Better)
RX 7800 XT: 3113 (SE +/- 12.98, N = 3)
RX 7700 XT: 2667 (SE +/- 12.25, N = 3)
RX 7600 XT: 2341 (SE +/- 4.26, N = 3)
Arc A770: 3893 (SE +/- 53.38, N = 3)
Arc A750: 3759 (SE +/- 6.03, N = 3)
Arc A580: 3304 (SE +/- 11.85, N = 3)
RTX 4060: 3106 (SE +/- 0.00, N = 3)
RTX 4070: 5381 (SE +/- 0.67, N = 3)
RTX 4070 SUPER: 5595 (SE +/- 3.51, N = 3)