Linux GPU Compute EOY 2024

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2412100-PTS-GPUCOMP547
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Sensor Monitoring

Show Accumulated Sensor Monitoring Data For Displayed Results
Generate Power Efficiency / Performance Per Watt Results

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Arc A580
December 10
  51 Minutes
Arc A750
December 10
  47 Minutes
Arc A770
December 10
  55 Minutes
RTX 4060
December 06
  1 Hour, 49 Minutes
RTX 4070
December 05
  1 Hour, 11 Minutes
RTX 4070 SUPER
December 05
  1 Hour, 4 Minutes
RX 7600 XT
December 10
  1 Hour, 34 Minutes
RX 7700 XT
December 10
  1 Hour, 13 Minutes
RX 7800 XT
December 10
  1 Hour, 14 Minutes
Invert Behavior (Only Show Selected Data)
  1 Hour, 11 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Linux GPU Compute EOY 2024ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionDisplay DriverArc A580Arc A750Arc A770RTX 4060RTX 4070RTX 4070 SUPERRX 7600 XTRX 7700 XTRX 7800 XTIntel Core Ultra 9 285K @ 5.10GHz (24 Cores)ASUS ROG MAXIMUS Z890 HERO (1101 BIOS)Intel Device ae7f2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D14001GB Western Digital WD_BLACK SN850X 4000GB + 1000GB Western Digital WDS100T1X0E-00AFY0Intel Arc A580 DG2 8GBIntel Device 7f50ASUS VP28URealtek Device 8126 + Intel I226-V + Intel Wi-Fi 7Ubuntu 24.106.13.0-rc1-phx (x86_64)GNOME Shell 47.0X Server 1.21.1.134.6 Mesa 25.0~git2412080600.8dda40~oibaf~o (git-8dda40c 2024-12-08 oracular-oibaf-ppOpenCL 3.0GCC 14.2.0ext43840x2160Intel Arc A750 DG2 8GB1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GBIntel Arc A770 DG2 16GBASUS NVIDIA GeForce RTX 4060 8GB6.11.0-9-generic (x86_64)NVIDIA 565.57.014.6.0OpenCL 3.0 CUDA 12.7.33ASUS NVIDIA GeForce RTX 4070 12GB4001GB Western Digital WD_BLACK SN850X 4000GB + 1000GB Western Digital WDS100T1X0E-00AFY0ASUS NVIDIA GeForce RTX 4070 SUPER 12GBXFX AMD Radeon RX 7600 XT 16GB6.13.0-rc1-phx (x86_64)4.6 Mesa 24.3.0-devel (LLVM 19.1.2 DRM 3.59)OpenCL 2.1 AMD-APP (3635.0)XFX AMD Radeon RX 7700 XT 12GB1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GBAMD Radeon RX 7800 XT 16GBOpenBenchmarking.orgKernel Details- Arc A580: Transparent Huge Pages: madvise- Arc A750: Transparent Huge Pages: madvise- Arc A770: Transparent Huge Pages: madvise- RTX 4060: nouveau.modeset=0 - Transparent Huge Pages: madvise- RTX 4070: nouveau.modeset=0 - Transparent Huge Pages: madvise- RTX 4070 SUPER: nouveau.modeset=0 - Transparent Huge Pages: madvise- RX 7600 XT: Transparent Huge Pages: madvise- RX 7700 XT: Transparent Huge Pages: madvise- RX 7800 XT: Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate performance (EPP: default) - CPU Microcode: 0x113 - Thermald 2.5.8Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affectedGraphics Details- RTX 4060: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- RTX 4070: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.49.00.03- RTX 4070 SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.69.00.01- RX 7600 XT: BAR1 / Visible vRAM Size: 16368 MB- RX 7700 XT: BAR1 / Visible vRAM Size: 12272 MB- RX 7800 XT: BAR1 / Visible vRAM Size: 16368 MBOpenCL Details- RTX 4060: GPU Compute Cores: 3072- RTX 4070: GPU Compute Cores: 5888- RTX 4070 SUPER: GPU Compute Cores: 7168

Arc A580Arc A750Arc A770RTX 4060RTX 4070RTX 4070 SUPERRX 7600 XTRX 7700 XTRX 7800 XTResult OverviewPhoronix Test Suite100%166%232%297%363%HashcatclpeakSHOC Scalable HeterOgeneous ComputingFluidX3Dcl-memDarktable

Linux GPU Compute EOY 2024blender: BMW27 - NVIDIA CUDAblender: BMW27 - NVIDIA OptiXblender: Classroom - NVIDIA CUDAblender: Classroom - NVIDIA OptiXblender: Fishy Cat - NVIDIA CUDAblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - NVIDIA CUDAblender: Pabellon Barcelona - NVIDIA OptiXblender: Barbershop - NVIDIA CUDAblender: Barbershop - NVIDIA OptiXblender: Junkshop - NVIDIA CUDAblender: Junkshop - NVIDIA OptiXblender: BMW27 - Intel oneAPIblender: Classroom - Intel oneAPIblender: Fishy Cat - Intel oneAPIblender: Pabellon Barcelona - Intel oneAPIblender: Barbershop - Intel oneAPIblender: Junkshop - Intel oneAPIblender: BMW27 - Radeon HIPblender: Classroom - Radeon HIPblender: Fishy Cat - Radeon HIPblender: Pabellon Barcelona - Radeon HIPblender: Barbershop - Radeon HIPblender: Junkshop - Radeon HIPcl-mem: Readcl-mem: Writecl-mem: Copyclpeak: Global Memory Bandwidthclpeak: Single-Precision Computeclpeak: Double-Precision Computeclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Kernel Latencydarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLdarktable: Server Rack - OpenCLfluidx3d: FP32-FP32fluidx3d: FP32-FP16Sfluidx3d: FP32-FP16Cgpuowl: 57885161gpuowl: 77936867gpuowl: 332220523hashcat: MD5hashcat: SHA1hashcat: SHA-512hashcat: 7-Ziphashcat: TrueCrypt RIPEMD160 + XTSindigobench: OpenCL GPU - Supercarindigobench: OpenCL GPU - Bedroomshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - S3DArc A580Arc A750Arc A770RTX 4060RTX 4070RTX 4070 SUPERRX 7600 XTRX 7700 XTRX 7800 XT15.7144.0420.1550.16199.8629.31143.7294.4259.6387.929755.804047.254046.1335.162.6791.6290.8840.111244736243304262490000004574160000787833333248000271100724.3531191.771778.8318.423469.7189215.03613.6637.5417.3343.24172.9825.82157.8281.1270.1396.9411384.244841.894844.4632.482.6741.6230.8790.111241638143759314417000005495680000943050000247175325500883.8381174.222202.8022.795578.1959228.22213.2236.0316.4544.78167.3525.97242.9428.2310.6396.5013005.745521.585510.6134.502.6911.6280.8740.1122441409838933549068333362090800001064533333245133368263991.2361226.812512.0525.8160134.2608228.44619.259.3038.8924.5139.5619.6191.4126.58163.52101.4832.1419.69238.0225.1213.5236.1914678.32265.897511.147530.064.682.2591.6670.9990.132160329603106375.99277.2958.57297871166679512350000121355000053875035700728.8949.7071761.66721.3644525.1118.8736246.881155.07412.186.6424.4416.9625.3912.7951.9317.32103.6269.0226.4417.54443.2391.1328.4435.7727884.65510.8614278.0314325.534.621.7171.5940.9070.123287846315381711.91527.98112.1056943850000182447333332324350000102837569058848.00618.0322824.701333.879693.1236.2607379.781287.50710.185.7819.5013.3619.5710.0943.4214.8680.4354.6317.3610.86442.8397.6328.7432.5334179.19619.6417337.7317376.904.661.6471.5630.8830.116278351335595853.24634.38134.7567447333333219275166672778583333119996381234651.94619.5192870.011284.2411512.143.8498379.623295.20330.1855.2950.46116.64231.0144.33260.9256.7236.5226.328495.58342.101801.617378.9017.242.4531.6240.9520.130116921752341571.86424.9986.0123838216667102506000001156500000492987317914795.413778.5242247.5915.0274260.61574.958821.1436.2735.2679.48151.8629.93390.7386.5353.4358.3715382.86570.143382.5612911.6218.771.9421.5470.8620.121138825882667899.82669.64137.043974508333316936050000189125000078198840134738.55313.6701023.8451169.064314.5027.1628350.024165.44918.5231.9131.7874.01130.1226.11560.5536.1489.1489.3516593.53627.853633.8913703.1718.631.7741.5330.8220.116170931263113960.92716.85147.084013173333317327983333190375000084061347820045.68117.1981101.661751.854704.9829.1842543.158213.109OpenBenchmarking.org

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: BMW27 - Compute: NVIDIA CUDARTX 4060RTX 4070RTX 4070 SUPER510152025SE +/- 0.02, N = 3SE +/- 0.05, N = 4SE +/- 0.03, N = 519.2512.1810.18

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: BMW27 - Compute: NVIDIA OptiXRTX 4060RTX 4070RTX 4070 SUPER3691215SE +/- 0.09, N = 5SE +/- 0.03, N = 6SE +/- 0.03, N = 159.306.645.78

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: NVIDIA CUDARTX 4060RTX 4070RTX 4070 SUPER918273645SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 338.8924.4419.50

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: NVIDIA OptiXRTX 4060RTX 4070RTX 4070 SUPER612182430SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 424.5116.9613.36

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Fishy Cat - Compute: NVIDIA CUDARTX 4060RTX 4070RTX 4070 SUPER918273645SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 339.5625.3919.57

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 4060RTX 4070RTX 4070 SUPER510152025SE +/- 0.15, N = 3SE +/- 0.07, N = 4SE +/- 0.10, N = 519.6112.7910.09

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: NVIDIA CUDARTX 4060RTX 4070RTX 4070 SUPER20406080100SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 391.4151.9343.42

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 4060RTX 4070RTX 4070 SUPER612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 426.5817.3214.86

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: NVIDIA CUDARTX 4060RTX 4070RTX 4070 SUPER4080120160200SE +/- 0.37, N = 3SE +/- 0.36, N = 3SE +/- 0.03, N = 3163.52103.6280.43

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: NVIDIA OptiXRTX 4060RTX 4070RTX 4070 SUPER20406080100SE +/- 0.04, N = 3SE +/- 0.23, N = 3SE +/- 0.05, N = 3101.4869.0254.63

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: NVIDIA CUDARTX 4060RTX 4070RTX 4070 SUPER714212835SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.07, N = 332.1426.4417.36

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: NVIDIA OptiXRTX 4060RTX 4070RTX 4070 SUPER510152025SE +/- 0.01, N = 3SE +/- 0.25, N = 3SE +/- 0.01, N = 519.6917.5410.86

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: BMW27 - Compute: Intel oneAPIArc A580Arc A750Arc A77048121620SE +/- 0.36, N = 15SE +/- 0.36, N = 15SE +/- 0.36, N = 1515.7113.6613.22

Blend File: BMW27 - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: Intel oneAPIArc A580Arc A750Arc A7701020304050SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 344.0437.5436.03

Blend File: Classroom - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Fishy Cat - Compute: Intel oneAPIArc A580Arc A750Arc A770510152025SE +/- 0.58, N = 15SE +/- 0.58, N = 15SE +/- 0.58, N = 1520.1517.3316.45

Blend File: Fishy Cat - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: Intel oneAPIArc A580Arc A770Arc A7501122334455SE +/- 0.28, N = 3SE +/- 0.28, N = 3SE +/- 0.29, N = 350.1644.7843.24

Blend File: Pabellon Barcelona - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: Intel oneAPIArc A580Arc A750Arc A7704080120160200SE +/- 0.39, N = 3SE +/- 0.32, N = 3SE +/- 0.30, N = 3199.86172.98167.35

Blend File: Barbershop - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 2, total unfreed memory 0.000107 MB

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 2, total unfreed memory 0.000107 MB

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 2, total unfreed memory 0.000107 MB

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: Intel oneAPIArc A580Arc A770Arc A750714212835SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 329.3125.9725.82

Blend File: Junkshop - Compute: Intel oneAPI

RX 7600 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7700 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

RX 7800 XT: The test quit with a non-zero exit status. E: Error: Found no Cycles device of the specified type

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: BMW27 - Compute: Radeon HIPRX 7600 XTRX 7700 XTRX 7800 XT714212835SE +/- 0.15, N = 3SE +/- 0.23, N = 3SE +/- 0.04, N = 330.1821.1418.52

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: Radeon HIPRX 7600 XTRX 7700 XTRX 7800 XT1224364860SE +/- 0.04, N = 3SE +/- 0.43, N = 4SE +/- 0.12, N = 355.2936.2731.91

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Fishy Cat - Compute: Radeon HIPRX 7600 XTRX 7700 XTRX 7800 XT1122334455SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.19, N = 350.4635.2631.78

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: Radeon HIPRX 7600 XTRX 7700 XTRX 7800 XT306090120150SE +/- 0.50, N = 3SE +/- 0.33, N = 3SE +/- 0.06, N = 3116.6479.4874.01

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: Radeon HIPRX 7600 XTRX 7700 XTRX 7800 XT50100150200250SE +/- 0.19, N = 3SE +/- 0.09, N = 3SE +/- 0.50, N = 3231.01151.86130.12

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: Radeon HIPRX 7600 XTRX 7700 XTRX 7800 XT1020304050SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 344.3329.9326.11

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadArc A580Arc A750RTX 4060Arc A770RX 7600 XTRX 7700 XTRTX 4070 SUPERRTX 4070RX 7800 XT120240360480600SE +/- 0.00, N = 6SE +/- 0.04, N = 6SE +/- 0.45, N = 6SE +/- 0.09, N = 7SE +/- 0.15, N = 6SE +/- 0.17, N = 8SE +/- 1.17, N = 8SE +/- 0.80, N = 8SE +/- 0.54, N = 9143.7157.8238.0242.9260.9390.7442.8443.2560.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRTX 4060RX 7600 XTArc A750Arc A580RX 7700 XTRTX 4070RTX 4070 SUPERArc A770RX 7800 XT120240360480600SE +/- 0.39, N = 6SE +/- 0.10, N = 6SE +/- 0.02, N = 6SE +/- 0.03, N = 6SE +/- 0.18, N = 8SE +/- 0.43, N = 8SE +/- 0.80, N = 8SE +/- 0.13, N = 7SE +/- 1.15, N = 9225.1256.7281.1294.4386.5391.1397.6428.2536.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRTX 4060RX 7600 XTArc A580Arc A750Arc A770RTX 4070RTX 4070 SUPERRX 7700 XTRX 7800 XT110220330440550SE +/- 0.37, N = 6SE +/- 0.04, N = 6SE +/- 0.02, N = 6SE +/- 0.13, N = 6SE +/- 0.03, N = 7SE +/- 0.39, N = 8SE +/- 0.51, N = 8SE +/- 0.13, N = 8SE +/- 1.30, N = 9213.5236.5259.6270.1310.6328.4328.7353.4489.11. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRX 7600 XTRTX 4060RX 7700 XTArc A580Arc A770Arc A750RTX 4070 SUPERRTX 4070RX 7800 XT110220330440550SE +/- 0.32, N = 6SE +/- 0.66, N = 10SE +/- 0.64, N = 8SE +/- 0.18, N = 9SE +/- 0.19, N = 8SE +/- 0.11, N = 9SE +/- 1.92, N = 10SE +/- 0.06, N = 10SE +/- 2.08, N = 8226.32236.19358.37387.92396.50396.94432.53435.77489.351. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRX 7600 XTArc A580Arc A750Arc A770RTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER7K14K21K28K35KSE +/- 12.54, N = 12SE +/- 0.86, N = 4SE +/- 0.54, N = 4SE +/- 1.39, N = 4SE +/- 70.91, N = 13SE +/- 23.93, N = 12SE +/- 31.09, N = 12SE +/- 141.03, N = 13SE +/- 17.53, N = 138495.589755.8011384.2413005.7414678.3215382.8616593.5327884.6534179.191. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRTX 4060RX 7600 XTRTX 4070RX 7700 XTRTX 4070 SUPERRX 7800 XT140280420560700SE +/- 0.70, N = 6SE +/- 0.32, N = 7SE +/- 2.22, N = 6SE +/- 3.09, N = 7SE +/- 2.39, N = 6SE +/- 2.78, N = 7265.89342.10510.86570.14619.64627.851. (CXX) g++ options: -O3

OpenCL Test: Double-Precision Compute

Arc A580: The test run did not produce a result.

Arc A750: The test run did not produce a result.

Arc A770: The test run did not produce a result.

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRX 7600 XTRX 7700 XTRX 7800 XTArc A580Arc A750Arc A770RTX 4060RTX 4070RTX 4070 SUPER4K8K12K16K20KSE +/- 2.25, N = 11SE +/- 2.50, N = 11SE +/- 6.45, N = 11SE +/- 0.96, N = 4SE +/- 3.31, N = 4SE +/- 4.18, N = 4SE +/- 36.28, N = 13SE +/- 75.61, N = 13SE +/- 102.12, N = 131801.613382.563633.894047.254841.895521.587511.1414278.0317337.731. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeArc A580Arc A750Arc A770RX 7600 XTRTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER4K8K12K16K20KSE +/- 0.30, N = 4SE +/- 1.88, N = 4SE +/- 5.95, N = 4SE +/- 30.00, N = 15SE +/- 33.56, N = 13SE +/- 46.29, N = 13SE +/- 66.84, N = 13SE +/- 70.70, N = 14SE +/- 93.44, N = 134046.134844.465510.617378.907530.0612911.6213703.1714325.5317376.901. (CXX) g++ options: -O3

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyArc A580Arc A770Arc A750RX 7700 XTRX 7800 XTRX 7600 XTRTX 4060RTX 4070 SUPERRTX 4070816243240SE +/- 0.02, N = 12SE +/- 0.10, N = 12SE +/- 0.08, N = 12SE +/- 0.05, N = 12SE +/- 0.03, N = 12SE +/- 0.18, N = 15SE +/- 0.05, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 1535.1634.5032.4818.7718.6317.244.684.664.621. (CXX) g++ options: -O3

Darktable

OpenBenchmarking.orgWatts, Fewer Is BetterDarktableGPU Power Consumption MonitorRTX 4070 SUPER20406080100Min: 8.83 / Avg: 38.51 / Max: 123.72

OpenBenchmarking.orgCelsius, Fewer Is BetterDarktableGPU Temperature MonitorRTX 4070 SUPER918273645Min: 40 / Avg: 42.64 / Max: 46

OpenBenchmarking.orgWatts, Fewer Is BetterDarktableGPU Power Consumption MonitorRTX 4070 SUPER1020304050Min: 10.24 / Avg: 27.52 / Max: 50.46

OpenBenchmarking.orgCelsius, Fewer Is BetterDarktableGPU Temperature MonitorRTX 4070 SUPER918273645Min: 43 / Avg: 44.44 / Max: 46

OpenBenchmarking.orgWatts, Fewer Is BetterDarktableGPU Power Consumption MonitorRTX 4070 SUPER1122334455Min: 10.41 / Avg: 28.71 / Max: 54.66

OpenBenchmarking.orgCelsius, Fewer Is BetterDarktableGPU Temperature MonitorRTX 4070 SUPER1020304050Min: 44 / Avg: 45.69 / Max: 49

OpenBenchmarking.orgWatts, Fewer Is BetterDarktableGPU Power Consumption MonitorRTX 4070 SUPER816243240Min: 10.43 / Avg: 20.37 / Max: 37.12

OpenBenchmarking.orgCelsius, Fewer Is BetterDarktableGPU Temperature MonitorRTX 4070 SUPER1020304050Min: 45 / Avg: 45.53 / Max: 47

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Boat - Acceleration: OpenCLArc A770Arc A580Arc A750RX 7600 XTRTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER0.60551.2111.81652.4223.0275SE +/- 0.004, N = 8SE +/- 0.005, N = 8SE +/- 0.005, N = 8SE +/- 0.005, N = 8SE +/- 0.004, N = 9SE +/- 0.002, N = 9SE +/- 0.002, N = 10SE +/- 0.003, N = 10SE +/- 0.004, N = 92.6912.6792.6742.4532.2591.9421.7741.7171.647

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Masskrug - Acceleration: OpenCLRTX 4060Arc A580Arc A770RX 7600 XTArc A750RTX 4070RTX 4070 SUPERRX 7700 XTRX 7800 XT0.37510.75021.12531.50041.8755SE +/- 0.003, N = 10SE +/- 0.006, N = 10SE +/- 0.005, N = 10SE +/- 0.007, N = 10SE +/- 0.007, N = 10SE +/- 0.005, N = 10SE +/- 0.004, N = 10SE +/- 0.006, N = 10SE +/- 0.007, N = 101.6671.6291.6281.6241.6231.5941.5631.5471.533

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Server Room - Acceleration: OpenCLRTX 4060RX 7600 XTRTX 4070Arc A580RTX 4070 SUPERArc A750Arc A770RX 7700 XTRX 7800 XT0.22480.44960.67440.89921.124SE +/- 0.001, N = 11SE +/- 0.006, N = 11SE +/- 0.002, N = 11SE +/- 0.002, N = 11SE +/- 0.002, N = 11SE +/- 0.001, N = 11SE +/- 0.001, N = 11SE +/- 0.005, N = 11SE +/- 0.006, N = 110.9990.9520.9070.8840.8830.8790.8740.8620.822

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Server Rack - Acceleration: OpenCLRTX 4060RX 7600 XTRTX 4070RX 7700 XTRX 7800 XTRTX 4070 SUPERArc A770Arc A750Arc A5800.02970.05940.08910.11880.1485SE +/- 0.000, N = 13SE +/- 0.001, N = 15SE +/- 0.000, N = 13SE +/- 0.001, N = 13SE +/- 0.001, N = 15SE +/- 0.000, N = 13SE +/- 0.001, N = 14SE +/- 0.001, N = 14SE +/- 0.001, N = 140.1320.1300.1230.1210.1160.1160.1120.1110.111

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 3.0Test: FP32-FP32RX 7600 XTRX 7700 XTRTX 4060RX 7800 XTArc A750Arc A770Arc A580RTX 4070 SUPERRTX 40706001200180024003000SE +/- 4.58, N = 3SE +/- 3.18, N = 3SE +/- 0.00, N = 3SE +/- 8.84, N = 3SE +/- 10.07, N = 3SE +/- 7.26, N = 3SE +/- 17.46, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3116913881603170924162441244727832878

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 3.0Test: FP32-FP16SRX 7600 XTRX 7700 XTRTX 4060RX 7800 XTArc A580Arc A750Arc A770RTX 4070RTX 4070 SUPER11002200330044005500SE +/- 4.63, N = 3SE +/- 3.18, N = 3SE +/- 0.58, N = 3SE +/- 12.70, N = 3SE +/- 13.01, N = 3SE +/- 6.74, N = 3SE +/- 27.93, N = 15SE +/- 0.88, N = 3SE +/- 0.67, N = 3217525882960312636243814409846315133

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 3.0Test: FP32-FP16CRX 7600 XTRX 7700 XTRTX 4060RX 7800 XTArc A580Arc A750Arc A770RTX 4070RTX 4070 SUPER12002400360048006000SE +/- 4.26, N = 3SE +/- 12.25, N = 3SE +/- 0.00, N = 3SE +/- 12.98, N = 3SE +/- 11.85, N = 3SE +/- 6.03, N = 3SE +/- 53.38, N = 3SE +/- 0.67, N = 3SE +/- 3.51, N = 3234126673106311333043759389353815595

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 57885161RTX 4060RX 7600 XTRTX 4070RTX 4070 SUPERRX 7700 XTRX 7800 XT2004006008001000SE +/- 0.54, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3SE +/- 0.42, N = 3SE +/- 0.27, N = 3SE +/- 0.81, N = 3375.99571.86711.91853.24899.82960.921. (CXX) g++ options: -O3 -lgmp -lOpenCL

Exponent: 57885161

Arc A580: The test run did not produce a result. E: ./open 20241210 04:35:52 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A750: The test run did not produce a result. E: ./open 20241210 05:51:51 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A770: The test run did not produce a result. E: ./open 20241210 08:19:46 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 77936867RTX 4060RX 7600 XTRTX 4070RTX 4070 SUPERRX 7700 XTRX 7800 XT150300450600750SE +/- 0.37, N = 3SE +/- 0.00, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 3SE +/- 0.40, N = 3SE +/- 0.00, N = 3277.29424.99527.98634.38669.64716.851. (CXX) g++ options: -O3 -lgmp -lOpenCL

Exponent: 77936867

Arc A580: The test run did not produce a result. E: ./open 20241210 04:36:17 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A750: The test run did not produce a result. E: ./open 20241210 05:52:16 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A770: The test run did not produce a result. E: ./open 20241210 08:20:11 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 332220523RTX 4060RX 7600 XTRTX 4070RTX 4070 SUPERRX 7700 XTRX 7800 XT306090120150SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.19, N = 358.5786.01112.10134.75137.04147.081. (CXX) g++ options: -O3 -lgmp -lOpenCL

Exponent: 332220523

Arc A580: The test run did not produce a result. E: ./open 20241210 04:36:41 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A750: The test run did not produce a result. E: ./open 20241210 05:52:40 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Arc A770: The test run did not produce a result. E: ./open 20241210 08:20:35 Exception gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at gpuowl-7.5/src/clwrap.cpp:245 build

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RX 7600 XTArc A580RTX 4060Arc A750Arc A770RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER14000M28000M42000M56000M70000MSE +/- 68956476.21, N = 6SE +/- 8638479.80, N = 6SE +/- 24181996.29, N = 6SE +/- 8077829.74, N = 6SE +/- 20961320.20, N = 6SE +/- 39002994.90, N = 6SE +/- 51306033.87, N = 6SE +/- 35399027.76, N = 6SE +/- 49647307.19, N = 6238382166672624900000029787116667314417000003549068333339745083333401317333335694385000067447333333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1Arc A580Arc A750Arc A770RTX 4060RX 7600 XTRX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER5000M10000M15000M20000M25000MSE +/- 2199454.48, N = 5SE +/- 1758806.41, N = 5SE +/- 2281315.41, N = 5SE +/- 9883142.21, N = 6SE +/- 20629558.08, N = 6SE +/- 2610204.33, N = 6SE +/- 22128976.73, N = 6SE +/- 13909341.39, N = 6SE +/- 22178749.84, N = 645741600005495680000620908000095123500001025060000016936050000173279833331824473333321927516667

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512Arc A580Arc A750Arc A770RX 7600 XTRTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER600M1200M1800M2400M3000MSE +/- 202758.75, N = 6SE +/- 375721.53, N = 6SE +/- 624855.54, N = 6SE +/- 2013123.61, N = 6SE +/- 1194082.63, N = 6SE +/- 549393.61, N = 6SE +/- 9450282.18, N = 6SE +/- 1068566.02, N = 6SE +/- 2352079.46, N = 67878333339430500001064533333115650000012135500001891250000190375000023243500002778583333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipArc A770Arc A750Arc A580RX 7600 XTRTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER300K600K900K1200K1500KSE +/- 66.67, N = 3SE +/- 125.00, N = 4SE +/- 122.47, N = 4SE +/- 3699.93, N = 15SE +/- 985.43, N = 8SE +/- 1268.64, N = 8SE +/- 1329.67, N = 8SE +/- 5954.67, N = 8SE +/- 2113.56, N = 824513324717524800049298753875078198884061310283751199963

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSArc A580RX 7600 XTArc A750RTX 4060Arc A770RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER200K400K600K800K1000KSE +/- 198.81, N = 7SE +/- 147.08, N = 7SE +/- 190.86, N = 8SE +/- 2610.06, N = 15SE +/- 212.08, N = 8SE +/- 47254.23, N = 15SE +/- 446.01, N = 8SE +/- 2512.78, N = 8SE +/- 5423.39, N = 13271100317914325500357007368263401347478200690588812346

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER1224364860SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.15, N = 328.8938.5545.6848.0151.95

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRTX 4060RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER510152025SE +/- 0.002, N = 3SE +/- 0.018, N = 3SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 39.70713.67017.19818.03219.519

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthArc A580RX 7600 XTArc A750Arc A770RX 7700 XTRX 7800 XTRTX 4060RTX 4070RTX 4070 SUPER6001200180024003000SE +/- 1.95, N = 3SE +/- 1.56, N = 8SE +/- 0.13, N = 3SE +/- 3.42, N = 3SE +/- 6.85, N = 8SE +/- 3.86, N = 8SE +/- 1.58, N = 6SE +/- 1.12, N = 6SE +/- 1.71, N = 6724.35795.41883.84991.241023.851101.661761.662824.702870.011. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRTX 4060RX 7600 XTRX 7700 XTArc A750Arc A580Arc A770RTX 4070 SUPERRTX 4070RX 7800 XT400800120016002000SE +/- 0.97, N = 11SE +/- 0.32, N = 11SE +/- 1.68, N = 12SE +/- 6.19, N = 14SE +/- 4.67, N = 14SE +/- 6.21, N = 14SE +/- 0.73, N = 11SE +/- 0.39, N = 11SE +/- 2.09, N = 12721.36778.521169.061174.221191.771226.811284.241333.871751.851. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NArc A580Arc A750RX 7600 XTArc A770RX 7700 XTRTX 4060RX 7800 XTRTX 4070RTX 4070 SUPER2K4K6K8K10KSE +/- 11.95, N = 10SE +/- 5.35, N = 11SE +/- 3.33, N = 9SE +/- 31.44, N = 15SE +/- 5.03, N = 10SE +/- 24.39, N = 9SE +/- 5.84, N = 10SE +/- 6.25, N = 10SE +/- 19.61, N = 111778.832202.802247.592512.054314.504525.114704.989693.1211512.101. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRX 7600 XTArc A580RTX 4060Arc A750Arc A770RX 7700 XTRX 7800 XTRTX 4070RTX 4070 SUPER1020304050SE +/- 0.03, N = 12SE +/- 0.01, N = 13SE +/- 0.01, N = 12SE +/- 0.01, N = 13SE +/- 0.03, N = 15SE +/- 0.00, N = 13SE +/- 0.05, N = 13SE +/- 0.02, N = 13SE +/- 0.03, N = 1315.0318.4218.8722.8025.8227.1629.1836.2643.851. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionArc A580Arc A750Arc A770RTX 4060RX 7600 XTRX 7700 XTRTX 4070 SUPERRTX 4070RX 7800 XT120240360480600SE +/- 0.18, N = 9SE +/- 0.13, N = 10SE +/- 6.54, N = 12SE +/- 0.02, N = 11SE +/- 0.08, N = 11SE +/- 0.53, N = 11SE +/- 0.08, N = 12SE +/- 0.88, N = 15SE +/- 2.78, N = 1269.7278.20134.26246.88260.62350.02379.62379.78543.161. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 7600 XTRTX 4060RX 7700 XTRX 7800 XTArc A580Arc A750Arc A770RTX 4070RTX 4070 SUPER60120180240300SE +/- 0.62, N = 9SE +/- 0.15, N = 15SE +/- 1.18, N = 3SE +/- 1.91, N = 15SE +/- 0.23, N = 15SE +/- 0.31, N = 15SE +/- 0.22, N = 15SE +/- 0.18, N = 13SE +/- 0.08, N = 1274.96155.07165.45213.11215.04228.22228.45287.51295.201. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

64 Results Shown

Blender:
  BMW27 - NVIDIA CUDA
  BMW27 - NVIDIA OptiX
  Classroom - NVIDIA CUDA
  Classroom - NVIDIA OptiX
  Fishy Cat - NVIDIA CUDA
  Fishy Cat - NVIDIA OptiX
  Pabellon Barcelona - NVIDIA CUDA
  Pabellon Barcelona - NVIDIA OptiX
  Barbershop - NVIDIA CUDA
  Barbershop - NVIDIA OptiX
  Junkshop - NVIDIA CUDA
  Junkshop - NVIDIA OptiX
  BMW27 - Intel oneAPI
  Classroom - Intel oneAPI
  Fishy Cat - Intel oneAPI
  Pabellon Barcelona - Intel oneAPI
  Barbershop - Intel oneAPI
  Junkshop - Intel oneAPI
  BMW27 - Radeon HIP
  Classroom - Radeon HIP
  Fishy Cat - Radeon HIP
  Pabellon Barcelona - Radeon HIP
  Barbershop - Radeon HIP
  Junkshop - Radeon HIP
cl-mem:
  Read
  Write
  Copy
clpeak:
  Global Memory Bandwidth
  Single-Precision Compute
  Double-Precision Compute
  Integer Compute
  Integer 24-bit Compute
  Kernel Latency
Darktable:
  GPU Power Consumption Monitor
  GPU Temp Monitor
  GPU Power Consumption Monitor
  GPU Temp Monitor
  GPU Power Consumption Monitor
  GPU Temp Monitor
  GPU Power Consumption Monitor
  GPU Temp Monitor
Darktable:
  Boat - OpenCL
  Masskrug - OpenCL
  Server Room - OpenCL
  Server Rack - OpenCL
FluidX3D:
  FP32-FP32
  FP32-FP16S
  FP32-FP16C
GpuOwl:
  57885161
  77936867
  332220523
Hashcat:
  MD5
  SHA1
  SHA-512
  7-Zip
  TrueCrypt RIPEMD160 + XTS
IndigoBench:
  OpenCL GPU - Supercar
  OpenCL GPU - Bedroom
SHOC Scalable HeterOgeneous Computing:
  OpenCL - Texture Read Bandwidth
  OpenCL - FFT SP
  OpenCL - GEMM SGEMM_N
  OpenCL - MD5 Hash
  OpenCL - Reduction
  OpenCL - S3D