Intel Arc Graphics vs. NVIDIA Linux Compute - October 2023

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310268-NE-COMPUTEAR41

Result Identifier   Date Run           Test Duration
Arc A380            October 16 2023    44 Minutes
Arc A580            October 16 2023    21 Minutes
Arc A750            October 15 2023    33 Minutes
Arc A770            October 14 2023    20 Minutes
RTX 3060            October 25 2023    16 Minutes
RTX 3060 Ti         October 25 2023    13 Minutes
RTX 3070            October 24 2023    13 Minutes
RTX 3070 Ti         October 25 2023    11 Minutes
RTX 3080            October 26 2023    10 Minutes
RTX 4060            October 24 2023    14 Minutes

System Details

Common to all results:
  Processor:          Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads)
  Motherboard:        ASUS PRIME Z790-P WIFI (0812 BIOS)
  Chipset:            Intel Device 7a27
  Memory:             32GB
  Disk:               Western Digital WD_BLACK SN850X 4000GB + Western Digital WD_BLACK SN850X 1000GB
  Audio:              Realtek ALC897
  Monitor:            ASUS VP28U
  OS:                 Ubuntu 23.10
  Kernel:             6.6.0-060600rc5-generic (x86_64)
  Desktop:            GNOME Shell 45.0
  Compiler:           GCC 13.2.0
  File-System:        ext4
  Screen Resolution:  1920x1080

Graphics:
  Arc A380:           Intel Arc A380 DG2 6GB (2450MHz)
  Arc A580:           Intel Arc A580 DG2 8GB (2400MHz)
  Arc A750:           Intel Arc A750 DG2 8GB (2400MHz)
  Arc A770:           Intel Arc A770 DG2 16GB (2400MHz)
  RTX 3060:           eVGA NVIDIA GeForce RTX 3060 12GB
  RTX 3060 Ti:        NVIDIA GeForce RTX 3060 Ti 8GB
  RTX 3070:           NVIDIA GeForce RTX 3070 8GB
  RTX 3070 Ti:        NVIDIA GeForce RTX 3070 Ti 8GB
  RTX 3080:           NVIDIA GeForce RTX 3080 10GB
  RTX 4060:           MSI NVIDIA GeForce RTX 4060 8GB

Arc A380, Arc A580, Arc A750, Arc A770:
  Display Server:     X Server + Wayland
  OpenGL:             4.6 Mesa 23.3~git2310140600.29e2e9~oibaf~m (git-29e2e92 2023-10-14 mantic-oibaf-ppa)
  OpenCL:             OpenCL 3.0

RTX 3060, RTX 3060 Ti, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 4060:
  Display Server:     X Server 1.21.1.7
  Display Driver:     NVIDIA 545.23.06
  OpenGL:             4.6.0
  OpenCL:             OpenCL 3.0 CUDA 12.3.68

Kernel Details
  Transparent Huge Pages: madvise

Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  Scaling Governor: intel_pstate performance (EPP: performance)
  CPU Microcode: 0x119
  Thermald 2.5.4

Python Details
  Python 3.11.6

Security Details
  gather_data_sampling: Not affected
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  mmio_stale_data: Not affected
  retbleed: Not affected
  spec_rstack_overflow: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence
  srbds: Not affected
  tsx_async_abort: Not affected

Graphics Details and OpenCL Details (NVIDIA cards)
  GPU           BAR1 / Visible vRAM Size   vBIOS Version     GPU Compute Cores
  RTX 3060      256 MiB                    94.06.14.40.46    3584
  RTX 3060 Ti   256 MiB                    94.04.25.00.2c    4864
  RTX 3070      256 MiB                    94.04.25.00.2b    5888
  RTX 3070 Ti   256 MiB                    94.04.5b.00.02    6144
  RTX 3080      256 MiB                    94.02.20.00.07    8704
  RTX 4060      256 MiB                    95.07.31.00.e3    3072

[Graph: Result Overview (Phoronix Test Suite). Relative performance of the ten cards on a scale from 100% to 795%, across clpeak, FluidX3D, Hashcat, VkResample, and SHOC Scalable HeterOgeneous Computing.]

Result Summary

Test                                   Arc A380     Arc A580     Arc A750     Arc A770     RTX 3060     RTX 3060 Ti  RTX 3070     RTX 3070 Ti  RTX 3080     RTX 4060
hashcat: MD5                           10088883333  26186933333  30086833333  35155133333  24797700000  34650957143  41589700000  42332871429  58402900000  29970466667
hashcat: SHA-512                       284500000    797816667    900383333    1078400000   977616667    1355383333   1647616667   1659233333   2300950000   1217933333
hashcat: TrueCrypt RIPEMD160 + XTS     101025       281583       312767       383757       290620       402138       494963       498788       676288       365767
shoc: OpenCL - S3D                     78.3701      214.647      225.254      226.190      172.713      212.442      219.880      286.627      339.347      156.105
shoc: OpenCL - Triad                   10.4720      19.9910      20.5050      20.0219      23.1156      23.4018      23.3807      23.7489      23.9196      12.3434
shoc: OpenCL - FFT SP                  422.112      1170.37      1192.04      1219.91      886.872      1123.75      1135.73      1509.24      1919.33      722.942
shoc: OpenCL - MD5 Hash                6.4378       18.5424      22.3360      26.0882      15.1053      21.1964      25.3311      26.2263      36.5157      18.9249
shoc: OpenCL - Bus Speed Download      11.7717      23.4529      23.2452      23.4700      24.0267      23.9987      24.0007      24.0167      24.0222      12.8371
shoc: OpenCL - Bus Speed Readback      11.2342      22.4241      22.3922      22.4374      26.3447      26.3373      26.3432      26.3427      26.3418      13.1928
shoc: OpenCL - Texture Read Bandwidth  454.745      731.217      766.314      983.093      1270.57      2107.37      2124.53      2076.61      2205.28      1790.40
vkresample: 2x - Single                62.113       20.611       19.173       17.318       26.320       21.091       20.229       15.812       12.761       32.537
fluidx3d: FP32-FP32                    622          2532         2470         2616         2141         2616         2628         3515         4182         1631
fluidx3d: FP32-FP16C                   1115         3483         3952         4271         3543         4666         5027         5921         7758         3158
fluidx3d: FP32-FP16S                   1097         3882         4212         4422         4143         5116         5191         6837         8226         3085
clpeak: Integer Compute                1484.66      4065.04      4833.88      5530.71      6311.44      7798.68      10067.05     10643.44     14227.55     7559.91
clpeak: Integer 24-bit Compute         1484.04      4068.45      4834.19      5519.11      6336.87      8000.27      10095.67     10700.73     14210.91     7587.93
clpeak: Global Memory Bandwidth        120.43       387.87       396.80       397.62       315.05       379.99       390.44       529.79       638.22       238.39
clpeak: Single-Precision Compute       3315.82      9748.26      11371.11     13008.68     12309.21     15298.76     19674.25     20722.73     27261.61     14783.09
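
One way to condense results like these into a single figure per card is to normalize each test to a baseline card and take the geometric mean of the ratios. The sketch below does this for three rows and three cards, using the Arc A380 as the baseline. The choice of rows, cards, and baseline is illustrative only, and this is not necessarily how the Phoronix Test Suite computes its own overview. For a "fewer is better" test such as VkResample the ratio is inverted so that a larger number always means faster.

```python
from math import prod

# Results copied from the summary data above:
# (test name, higher_is_better, {card: value}).
RESULTS = [
    ("hashcat: SHA-512", True,
     {"Arc A380": 284500000, "Arc A770": 1078400000, "RTX 3060": 977616667}),
    ("vkresample: 2x - Single", False,
     {"Arc A380": 62.113, "Arc A770": 17.318, "RTX 3060": 26.320}),
    ("clpeak: Global Memory Bandwidth", True,
     {"Arc A380": 120.43, "Arc A770": 397.62, "RTX 3060": 315.05}),
]


def relative_score(card, baseline="Arc A380"):
    """Geometric mean of the per-test speedups of `card` over `baseline`."""
    ratios = []
    for _name, higher_is_better, values in RESULTS:
        if higher_is_better:
            ratios.append(values[card] / values[baseline])
        else:
            # Lower is better (e.g. milliseconds): invert the ratio.
            ratios.append(values[baseline] / values[card])
    return prod(ratios) ** (1.0 / len(ratios))


if __name__ == "__main__":
    for card in ("Arc A380", "Arc A770", "RTX 3060"):
        print(f"{card}: {relative_score(card):.2f}x the Arc A380")
```

The geometric mean is used here rather than the arithmetic mean so that a single test with a very large ratio does not dominate the summary.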

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.2.4 - Benchmark: MD5 (H/s, More Is Better)

GPU           Avg               SE +/-         N    Min            Max
RTX 3080      58402900000       62354336.92    6    58178200000    58591400000
RTX 3070 Ti   42332871428.57    42136771.15    7    42253900000    42531400000
RTX 3070      41589700000       25896432.92    7    41511900000    41704000000
Arc A770      35155133333.33    14282919.09    6    35111600000    35199500000
RTX 3060 Ti   34650957142.86    39649619.33    7    34547500000    34818900000
Arc A750      30086833333.33    32959759.98    6    30019800000    30191700000
RTX 4060      29970466666.67    48062540.28    3    29876100000    30033500000
Arc A580      26186933333.33    52502537.50    6    26088100000    26361100000
RTX 3060      24797700000       40085550.18    6    24678100000    24952800000
Arc A380      10088883333.33    10038904.88    6    10065000000    10120100000

Hashcat 6.2.4 - Benchmark: SHA-512 (H/s, More Is Better)

GPU           Avg              SE +/-        N    Min           Max
RTX 3080      2300950000       6840650.55    6    2267800000    2311900000
RTX 3070 Ti   1659233333.33    2377767.39    6    1649700000    1664500000
RTX 3070      1647616666.67    2876620.32    6    1640200000    1656000000
RTX 3060 Ti   1355383333.33    3243497.77    6    1343000000    1364900000
RTX 4060      1217933333.33    88191.71      3    1217800000    1218100000
Arc A770      1078400000       594138.03     5    1077000000    1080400000
RTX 3060      977616666.67     852610.37     6    974400000     979400000
Arc A750      900383333.33     6115031.57    6    874800000     914400000
Arc A580      797816666.67     871174.18     6    794200000     800200000
Arc A380      284500000        862167.81     6    282000000     286600000

Hashcat 6.2.4 - Benchmark: TrueCrypt RIPEMD160 + XTS (H/s, More Is Better)

GPU           Avg          SE +/-     N     Min       Max
RTX 3080      676287.5     2376.78    8     667600    685700
RTX 3070 Ti   498787.5     1315.08    8     495300    505000
RTX 3070      494962.5     403.09     8     493400    496600
RTX 3060 Ti   402137.5     2383.72    8     393600    411600
Arc A770      383757.14    178.43     7     383200    384400
RTX 4060      365766.67    2383.51    3     361000    368200
Arc A750      312766.67    1857.90    6     305200    316500
RTX 3060      290620       2070.36    15    269000    297600
Arc A580      281583.33    319.81     6     280400    282400
Arc A380      101025       149.30     4     100700    101400
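
The "SE +/-" figures are hard to compare across cards because the hash rates themselves differ so much. A rough way to judge run-to-run noise is to express each standard error as a percentage of its mean. The sketch below does that for three of the TrueCrypt RIPEMD160 + XTS results. The `implied_std_dev` helper additionally assumes the usual definition SE = s / sqrt(N), which the result page does not state, so treat that part as an assumption.

```python
from math import sqrt

# (card, mean H/s, standard error, N) taken from the
# TrueCrypt RIPEMD160 + XTS results above.
RUNS = [
    ("RTX 3080", 676288, 2376.78, 8),
    ("Arc A770", 383757, 178.43, 7),
    ("RTX 3060", 290620, 2070.36, 15),
]


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def implied_std_dev(se, n):
    """Sample standard deviation, ASSUMING SE = s / sqrt(N)."""
    return se * sqrt(n)


if __name__ == "__main__":
    for card, mean, se, n in RUNS:
        print(f"{card}: SE is {relative_se_percent(mean, se):.2f}% of the mean, "
              f"implied std dev {implied_std_dev(se, n):.0f} H/s over {n} runs")
```

By this measure all three cards vary by well under 1% between runs, even though the RTX 3060 needed 15 runs.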

SHOC Scalable HeterOgeneous Computing

This test runs the CUDA and OpenCL versions of Vetter's Scalable HeterOgeneous Computing (SHOC) benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: S3D (GFLOPS, More Is Better)

GPU           Avg       SE +/-   N     Min       Max
RTX 3080      339.35    0.05     13    339.02    339.63
RTX 3070 Ti   286.63    0.07     13    286.27    287.09
Arc A770      226.19    0.84     3     225.12    227.86
Arc A750      225.25    1.04     3     223.36    226.94
RTX 3070      219.88    0.05     13    219.57    220.18
Arc A580      214.65    0.72     3     213.2     215.44
RTX 3060 Ti   212.44    0.03     13    212.28    212.62
RTX 3060      172.71    0.01     13    172.62    172.78
RTX 4060      156.11    0.02     3     156.08    156.13
Arc A380      78.37     0.55     3     77.27     79.06

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)

GPU           Avg      SE +/-   N     Min      Max
RTX 3080      23.92    0.01     13    23.88    23.97
RTX 3070 Ti   23.75    0.00     13    23.73    23.78
RTX 3060 Ti   23.40    0.00     13    23.37    23.43
RTX 3070      23.38    0.00     13    23.36    23.4
RTX 3060      23.12    0.01     13    23.07    23.14
Arc A750      20.51    0.07     12    20.09    20.81
Arc A770      20.02    0.06     15    19.14    20.12
Arc A580      19.99    0.07     12    19.32    20.44
RTX 4060      12.34    0.01     3     12.33    12.36
Arc A380      10.47    0.01     12    10.43    10.51

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)

GPU           Avg        SE +/-   N     Min        Max
RTX 3080      1919.33    0.31     12    1916.88    1920.78
RTX 3070 Ti   1509.24    0.06     12    1508.89    1509.56
Arc A770      1219.91    9.02     15    1175.44    1258.29
Arc A750      1192.04    7.59     13    1138.91    1214.54
Arc A580      1170.37    9.38     15    1128.41    1216.68
RTX 3070      1135.73    0.28     12    1133.76    1137.34
RTX 3060 Ti   1123.75    0.05     12    1123.34    1124.06
RTX 3060      886.87     0.26     12    885.51     887.59
RTX 4060      722.94     0.20     3     722.62     723.3
Arc A380      422.11     0.22     12    420.21     423

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)

GPU           Avg        SE +/-    N     Min      Max
RTX 3080      36.5157    0.1231    14    35.37    37.39
RTX 3070 Ti   26.2263    0.0040    13    26.19    26.24
Arc A770      26.0882    0.0148    15    25.91    26.14
RTX 3070      25.3311    0.0181    13    25.18    25.46
Arc A750      22.3360    0.0328    13    22.17    22.57
RTX 3060 Ti   21.1964    0.0208    13    21.07    21.31
RTX 4060      18.9249    0.0125    3     18.91    18.95
Arc A580      18.5424    0.0057    13    18.51    18.59
RTX 3060      15.1053    0.0089    12    15.07    15.17
Arc A380      6.4378     0.0096    10    6.37     6.47

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better)

GPU           Avg      SE +/-   N     Min      Max
RTX 3060      24.03    0.00     13    24.02    24.04
RTX 3080      24.02    0.00     13    24.01    24.04
RTX 3070 Ti   24.02    0.00     13    24       24.04
RTX 3070      24.00    0.00     13    23.99    24.01
RTX 3060 Ti   24.00    0.00     13    23.99    24.01
Arc A770      23.47    0.00     14    23.46    23.48
Arc A580      23.45    0.00     14    23.43    23.46
Arc A750      23.25    0.02     14    23.14    23.32
RTX 4060      12.84    0.00     3     12.83    12.84
Arc A380      11.77    0.00     12    11.77    11.78

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better)

GPU           Avg      SE +/-   N     Min      Max
RTX 3060      26.34    0.00     13    26.34    26.35
RTX 3070      26.34    0.00     13    26.33    26.35
RTX 3070 Ti   26.34    0.00     13    26.33    26.35
RTX 3080      26.34    0.00     13    26.33    26.35
RTX 3060 Ti   26.34    0.00     13    26.33    26.34
Arc A770      22.44    0.00     13    22.43    22.44
Arc A580      22.42    0.00     13    22.41    22.43
Arc A750      22.39    0.01     13    22.35    22.42
RTX 4060      13.19    0.00     3     13.19    13.19
Arc A380      11.23    0.00     12    11.23    11.24

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)

GPU           Avg        SE +/-   N     Min        Max
RTX 3080      2205.28    4.76     3     2195.84    2211.09
RTX 3070      2124.53    4.90     4     2111.84    2134.27
RTX 3060 Ti   2107.37    3.45     4     2100.69    2114.96
RTX 3070 Ti   2076.61    5.97     4     2058.72    2083.26
RTX 4060      1790.40    2.12     3     1787.35    1794.48
RTX 3060      1270.57    2.22     4     1265.09    1275.64
Arc A770      983.09     1.66     3     980.65     986.26
Arc A750      766.31     21.20    15    701.03     867.87
Arc A580      731.22     0.06     3     731.1      731.32
Arc A380      454.75     2.94     3     451.67     460.63

(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

VkResample

VkResample is a Vulkan-based image upscaling library built on VkFFT. The sample workload upscales a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

VkResample 1.0 - Upscale: 2x - Precision: Single (ms, Fewer Is Better)

GPU           Avg      SE +/-   N    Min      Max
RTX 3080      12.76    0.02     5    12.72    12.81
RTX 3070 Ti   15.81    0.00     5    15.8     15.82
Arc A770      17.32    0.00     6    17.31    17.32
Arc A750      19.17    0.01     6    19.14    19.19
RTX 3070      20.23    0.00     5    20.22    20.24
Arc A580      20.61    0.00     6    20.6     20.62
RTX 3060 Ti   21.09    0.01     5    21.06    21.13
RTX 3060      26.32    0.00     5    26.31    26.33
RTX 4060      32.54    0.00     3    32.53    32.54
Arc A380      62.11    0.04     3    62.08    62.18

(CXX) g++ options: -O3

FluidX3D

FluidX3D is a fast and memory-efficient lattice Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and is free for non-commercial use. This test profile measures the system's OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

FluidX3D 2.9 - Test: FP32-FP32 (MLUPs/s, More Is Better)

GPU           Avg        SE +/-   N    Min     Max
RTX 3080      4182       2.52     3    4179    4187
RTX 3070 Ti   3515       0.00     3    3515    3515
RTX 3070      2628       0.00     3    2628    2628
RTX 3060 Ti   2616.33    1.45     3    2614    2619
Arc A770      2616       5.57     3    2609    2627
Arc A580      2531.67    2.73     3    2528    2537
Arc A750      2470       8.02     3    2461    2486
RTX 3060      2141       0.00     3    2141    2141
RTX 4060      1631       0.00     3    1631    1631
Arc A380      622        0.00     3    622     622

FluidX3D 2.9 - Test: FP32-FP16C (MLUPs/s, More Is Better)

GPU           Avg        SE +/-   N    Min     Max
RTX 3080      7758.33    7.80     3    7745    7772
RTX 3070 Ti   5921.33    3.53     3    5916    5928
RTX 3070      5027       1.00     3    5026    5029
RTX 3060 Ti   4666       7.51     3    4658    4681
Arc A770      4270.67    45.67    3    4184    4339
Arc A750      3952       14.29    3    3924    3971
RTX 3060      3542.67    2.73     3    3539    3548
Arc A580      3482.67    20.04    3    3446    3515
RTX 4060      3158.33    0.67     3    3157    3159
Arc A380      1115.33    0.33     3    1115    1116

FluidX3D 2.9 - Test: FP32-FP16S (MLUPs/s, More Is Better)

GPU           Avg        SE +/-   N    Min     Max
RTX 3080      8226       3.06     3    8220    8230
RTX 3070 Ti   6836.67    0.88     3    6835    6838
RTX 3070      5190.67    0.33     3    5190    5191
RTX 3060 Ti   5116.33    2.33     3    5112    5120
Arc A770      4422       30.66    3    4378    4481
Arc A750      4211.67    0.88     3    4210    4213
RTX 3060      4143.33    0.33     3    4143    4144
Arc A580      3881.67    8.41     3    3865    3892
RTX 4060      3085       0.00     3    3085    3085
Arc A380      1097.33    10.37    3    1077    1111
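
The three FluidX3D tests can be compared per card by dividing the FP32-FP16C and FP32-FP16S results by the FP32-FP32 result. The result page does not explain the test names; the sketch below assumes the second part of each name refers to a 16-bit storage format used in place of FP32, so the ratio is read here as the gain from the smaller format. Only four of the ten cards are included, and the rounded MLUPs/s figures are taken from the results above.

```python
# MLUPs/s per card: (FP32-FP32, FP32-FP16C, FP32-FP16S).
FLUIDX3D = {
    "RTX 3080": (4182, 7758, 8226),
    "Arc A770": (2616, 4271, 4422),
    "RTX 4060": (1631, 3158, 3085),
    "Arc A380": (622, 1115, 1097),
}


def speedups(fp32, fp16c, fp16s):
    """Return (FP16C, FP16S) throughput relative to the FP32-FP32 run."""
    return fp16c / fp32, fp16s / fp32


if __name__ == "__main__":
    for card, results in FLUIDX3D.items():
        c, s = speedups(*results)
        print(f"{card}: FP16C {c:.2f}x, FP16S {s:.2f}x over FP32-FP32")
```

On these four cards both 16-bit modes land between roughly 1.6x and 2x the FP32-FP32 throughput.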

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak 1.1.2 - OpenCL Test: Integer Compute (GIOPS, More Is Better)

GPU           Avg         SE +/-   N     Min         Max
RTX 3080      14227.55    73.29    13    13942.66    14785.54
RTX 3070 Ti   10643.44    47.35    13    10301.26    10891.72
RTX 3070      10067.05    5.08     13    10017.6     10086.2
RTX 3060 Ti   7798.68     13.43    13    7746.07     7889.66
RTX 4060      7559.91     13.40    3     7533.12     7573.86
RTX 3060      6311.44     3.38     13    6298.44     6331.69
Arc A770      5530.71     7.88     3     5517.54     5544.78
Arc A750      4833.88     3.74     3     4827.06     4839.93
Arc A580      4065.04     3.59     3     4057.87     4068.93
Arc A380      1484.66     0.22     4     1484.36     1485.3

(CXX) g++ options: -O3

clpeak 1.1.2 - OpenCL Test: Integer 24-bit Compute (GIOPS, More Is Better)

GPU           Avg         SE +/-   N     Min         Max
RTX 3080      14210.91    48.38    13    14036.56    14648.4
RTX 3070 Ti   10700.73    41.98    13    10360.48    10931.3
RTX 3070      10095.67    17.10    13    10024.85    10210.89
RTX 3060 Ti   8000.27     42.07    13    7751.57     8194.31
RTX 4060      7587.93     0.58     3     7586.79     7588.65
RTX 3060      6336.87     8.59     13    6293.21     6396.53
Arc A770      5519.11     3.66     3     5513.68     5526.09
Arc A750      4834.19     2.06     3     4830.09     4836.45
Arc A580      4068.45     0.41     3     4067.63     4068.91
Arc A380      1484.04     0.13     4     1483.85     1484.42

(CXX) g++ options: -O3

clpeak 1.1.2 - OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)

GPU           Avg       SE +/-   N     Min       Max
RTX 3080      638.22    0.92     11    631.47    642.91
RTX 3070 Ti   529.79    0.68     15    520.29    530.56
Arc A770      397.62    0.13     8     397       398.15
Arc A750      396.80    0.15     8     396.21    397.39
RTX 3070      390.44    0.02     11    390.35    390.52
Arc A580      387.87    0.07     8     387.49    388.08
RTX 3060 Ti   379.99    0.53     11    377.79    382.89
RTX 3060      315.05    0.84     9     308.4     316
RTX 4060      238.39    0.06     3     238.27    238.45
Arc A380      120.43    0.08     5     120.25    120.63

(CXX) g++ options: -O3

clpeak 1.1.2 - OpenCL Test: Single-Precision Compute (GFLOPS, More Is Better)

GPU           Avg         SE +/-    N     Min         Max
RTX 3080      27261.61    181.88    13    25620       27891.71
RTX 3070 Ti   20722.73    137.23    13    19887.82    21283.14
RTX 3070      19674.25    38.39     13    19519.49    20026.68
RTX 3060 Ti   15298.76    67.44     13    14975.48    15904.81
RTX 4060      14783.09    0.10      3     14782.9     14783.19
Arc A770      13008.68    0.78      3     13007.73    13010.23
RTX 3060      12309.21    13.72     13    12227.17    12409.19
Arc A750      11371.11    2.25      3     11366.66    11373.91
Arc A580      9748.26     1.76      3     9745.03     9751.1
Arc A380      3315.82     0.38      3     3315.34     3316.58

(CXX) g++ options: -O3
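
Dividing clpeak's single-precision compute result by its global memory bandwidth result gives a rough FLOPs-per-byte balance point for each card. As a back-of-envelope, roofline-style reading, a kernel that performs fewer floating-point operations per byte of memory traffic than this figure is likely to be limited by bandwidth rather than compute on that card. This estimate is not part of the result file; the sketch below simply applies the division to four of the cards, treating GBPS as GB/s.

```python
# (single-precision GFLOPS, global memory bandwidth in GB/s)
# taken from the clpeak results above.
CLPEAK = {
    "RTX 3080": (27261.61, 638.22),
    "RTX 4060": (14783.09, 238.39),
    "Arc A770": (13008.68, 397.62),
    "Arc A380": (3315.82, 120.43),
}


def flops_per_byte(gflops, gb_per_s):
    """Peak FLOPs available per byte of global memory traffic."""
    return gflops / gb_per_s


if __name__ == "__main__":
    # Print the cards from most compute-heavy to most bandwidth-rich.
    ranked = sorted(CLPEAK, key=lambda c: flops_per_byte(*CLPEAK[c]), reverse=True)
    for card in ranked:
        print(f"{card}: {flops_per_byte(*CLPEAK[card]):.1f} FLOPs per byte")
```

Of these four, the RTX 4060 has the highest ratio: its compute throughput is comparatively high relative to its measured memory bandwidth.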