Intel Arc Graphics vs. NVIDIA Linux Compute - October 2023

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310268-NE-COMPUTEAR41
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

NVIDIA GPU Compute 4 Tests
OpenCL 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Arc A380
October 16 2023
  44 Minutes
Arc A580
October 16 2023
  21 Minutes
Arc A750
October 15 2023
  33 Minutes
Arc A770
October 14 2023
  20 Minutes
RTX 3060
October 25 2023
  16 Minutes
RTX 3060 Ti
October 25 2023
  13 Minutes
RTX 3070
October 24 2023
  13 Minutes
RTX 3070 Ti
October 25 2023
  11 Minutes
RTX 3080
October 26 2023
  10 Minutes
RTX 4060
October 24 2023
  14 Minutes
Invert Hiding All Results Option
  20 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel Arc Graphics vs. NVIDIA Linux Compute - October 2023ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionDisplay DriverArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads)ASUS PRIME Z790-P WIFI (0812 BIOS)Intel Device 7a2732GBWestern Digital WD_BLACK SN850X 4000GB + Western Digital WD_BLACK SN850X 1000GBIntel Arc A380 DG2 6GB (2450MHz)Realtek ALC897ASUS VP28UUbuntu 23.106.6.0-060600rc5-generic (x86_64)GNOME Shell 45.0X Server + Wayland4.6 Mesa 23.3~git2310140600.29e2e9~oibaf~m (git-29e2e92 2023-10-14 mantic-oibaf-ppa)OpenCL 3.0GCC 13.2.0ext41920x1080Intel Arc A580 DG2 8GB (2400MHz)Intel Arc A750 DG2 8GB (2400MHz)Intel Arc A770 DG2 16GB (2400MHz)eVGA NVIDIA GeForce RTX 3060 12GBX Server 1.21.1.7NVIDIA 545.23.064.6.0OpenCL 3.0 CUDA 12.3.68NVIDIA GeForce RTX 3060 Ti 8GBNVIDIA GeForce RTX 3070 8GBNVIDIA GeForce RTX 3070 Ti 8GBNVIDIA GeForce RTX 3080 10GBMSI NVIDIA GeForce RTX 4060 8GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x119 - Thermald 2.5.4Python Details- Python 3.11.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected Graphics Details- RTX 3060: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.06.14.40.46- RTX 3060 Ti: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.25.00.2c- RTX 3070: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.25.00.2b- RTX 3070 Ti: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.5b.00.02- RTX 3080: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.20.00.07- RTX 4060: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 95.07.31.00.e3OpenCL Details- RTX 3060: GPU Compute Cores: 3584- RTX 3060 Ti: GPU Compute Cores: 4864- RTX 3070: GPU Compute Cores: 5888- RTX 3070 Ti: GPU Compute Cores: 6144- RTX 3080: GPU Compute Cores: 8704- RTX 4060: GPU Compute Cores: 3072

Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060Result OverviewPhoronix Test Suite100%274%448%621%795%clpeakFluidX3DHashcatVkResampleSHOC Scalable HeterOgeneous Computing

Intel Arc Graphics vs. NVIDIA Linux Compute - October 2023hashcat: MD5hashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTSshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthvkresample: 2x - Singlefluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Sclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Single-Precision ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40601008888333328450000010102578.370110.4720422.1126.437811.771711.2342454.74562.113622111510971484.661484.04120.433315.8226186933333797816667281583214.64719.99101170.3718.542423.452922.4241731.21720.6112532348338824065.044068.45387.879748.2630086833333900383333312767225.25420.50501192.0422.336023.245222.3922766.31419.1732470395242124833.884834.19396.8011371.11351551333331078400000383757226.19020.02191219.9126.088223.470022.4374983.09317.3182616427144225530.715519.11397.6213008.6824797700000977616667290620172.71323.1156886.87215.105324.026726.34471270.5726.3202141354341436311.446336.87315.0512309.21346509571431355383333402138212.44223.40181123.7521.196423.998726.33732107.3721.0912616466651167798.688000.27379.9915298.76415897000001647616667494963219.88023.38071135.7325.331124.000726.34322124.5320.22926285027519110067.0510095.67390.4419674.25423328714291659233333498788286.62723.74891509.2426.226324.016726.34272076.6115.81235155921683710643.4410700.73529.7920722.73584029000002300950000676288339.34723.91961919.3336.515724.022226.34182205.2812.76141827758822614227.5514210.91638.2227261.61299704666671217933333365767156.10512.3434722.94218.924912.837113.19281790.4032.5371631315830857559.917587.93238.3914783.09OpenBenchmarking.org

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406013000M26000M39000M52000M65000MSE +/- 10038904.88, N = 6SE +/- 52502537.50, N = 6SE +/- 32959759.98, N = 6SE +/- 14282919.09, N = 6SE +/- 40085550.18, N = 6SE +/- 39649619.33, N = 7SE +/- 25896432.92, N = 7SE +/- 42136771.15, N = 7SE +/- 62354336.92, N = 6SE +/- 48062540.28, N = 310088883333261869333333008683333335155133333247977000003465095714341589700000423328714295840290000029970466667
OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406010000M20000M30000M40000M50000MMin: 10065000000 / Avg: 10088883333.33 / Max: 10120100000Min: 26088100000 / Avg: 26186933333.33 / Max: 26361100000Min: 30019800000 / Avg: 30086833333.33 / Max: 30191700000Min: 35111600000 / Avg: 35155133333.33 / Max: 35199500000Min: 24678100000 / Avg: 24797700000 / Max: 24952800000Min: 34547500000 / Avg: 34650957142.86 / Max: 34818900000Min: 41511900000 / Avg: 41589700000 / Max: 41704000000Min: 42253900000 / Avg: 42332871428.57 / Max: 42531400000Min: 58178200000 / Avg: 58402900000 / Max: 58591400000Min: 29876100000 / Avg: 29970466666.67 / Max: 30033500000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060500M1000M1500M2000M2500MSE +/- 862167.81, N = 6SE +/- 871174.18, N = 6SE +/- 6115031.57, N = 6SE +/- 594138.03, N = 5SE +/- 852610.37, N = 6SE +/- 3243497.77, N = 6SE +/- 2876620.32, N = 6SE +/- 2377767.39, N = 6SE +/- 6840650.55, N = 6SE +/- 88191.71, N = 3284500000797816667900383333107840000097761666713553833331647616667165923333323009500001217933333
OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060400M800M1200M1600M2000MMin: 282000000 / Avg: 284500000 / Max: 286600000Min: 794200000 / Avg: 797816666.67 / Max: 800200000Min: 874800000 / Avg: 900383333.33 / Max: 914400000Min: 1077000000 / Avg: 1078400000 / Max: 1080400000Min: 974400000 / Avg: 977616666.67 / Max: 979400000Min: 1343000000 / Avg: 1355383333.33 / Max: 1364900000Min: 1640200000 / Avg: 1647616666.67 / Max: 1656000000Min: 1649700000 / Avg: 1659233333.33 / Max: 1664500000Min: 2267800000 / Avg: 2300950000 / Max: 2311900000Min: 1217800000 / Avg: 1217933333.33 / Max: 1218100000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060140K280K420K560K700KSE +/- 149.30, N = 4SE +/- 319.81, N = 6SE +/- 1857.90, N = 6SE +/- 178.43, N = 7SE +/- 2070.36, N = 15SE +/- 2383.72, N = 8SE +/- 403.09, N = 8SE +/- 1315.08, N = 8SE +/- 2376.78, N = 8SE +/- 2383.51, N = 3101025281583312767383757290620402138494963498788676288365767
OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060120K240K360K480K600KMin: 100700 / Avg: 101025 / Max: 101400Min: 280400 / Avg: 281583.33 / Max: 282400Min: 305200 / Avg: 312766.67 / Max: 316500Min: 383200 / Avg: 383757.14 / Max: 384400Min: 269000 / Avg: 290620 / Max: 297600Min: 393600 / Avg: 402137.5 / Max: 411600Min: 493400 / Avg: 494962.5 / Max: 496600Min: 495300 / Avg: 498787.5 / Max: 505000Min: 667600 / Avg: 676287.5 / Max: 685700Min: 361000 / Avg: 365766.67 / Max: 368200

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406070140210280350SE +/- 0.55, N = 3SE +/- 0.72, N = 3SE +/- 1.04, N = 3SE +/- 0.84, N = 3SE +/- 0.01, N = 13SE +/- 0.03, N = 13SE +/- 0.05, N = 13SE +/- 0.07, N = 13SE +/- 0.05, N = 13SE +/- 0.02, N = 378.37214.65225.25226.19172.71212.44219.88286.63339.35156.111. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406060120180240300Min: 77.27 / Avg: 78.37 / Max: 79.06Min: 213.2 / Avg: 214.65 / Max: 215.44Min: 223.36 / Avg: 225.25 / Max: 226.94Min: 225.12 / Avg: 226.19 / Max: 227.86Min: 172.62 / Avg: 172.71 / Max: 172.78Min: 212.28 / Avg: 212.44 / Max: 212.62Min: 219.57 / Avg: 219.88 / Max: 220.18Min: 286.27 / Avg: 286.63 / Max: 287.09Min: 339.02 / Avg: 339.35 / Max: 339.63Min: 156.08 / Avg: 156.11 / Max: 156.131. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060612182430SE +/- 0.01, N = 12SE +/- 0.07, N = 12SE +/- 0.07, N = 12SE +/- 0.06, N = 15SE +/- 0.01, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.01, N = 13SE +/- 0.01, N = 310.4719.9920.5120.0223.1223.4023.3823.7523.9212.341. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060612182430Min: 10.43 / Avg: 10.47 / Max: 10.51Min: 19.32 / Avg: 19.99 / Max: 20.44Min: 20.09 / Avg: 20.51 / Max: 20.81Min: 19.14 / Avg: 20.02 / Max: 20.12Min: 23.07 / Avg: 23.12 / Max: 23.14Min: 23.37 / Avg: 23.4 / Max: 23.43Min: 23.36 / Avg: 23.38 / Max: 23.4Min: 23.73 / Avg: 23.75 / Max: 23.78Min: 23.88 / Avg: 23.92 / Max: 23.97Min: 12.33 / Avg: 12.34 / Max: 12.361. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060400800120016002000SE +/- 0.22, N = 12SE +/- 9.38, N = 15SE +/- 7.59, N = 13SE +/- 9.02, N = 15SE +/- 0.26, N = 12SE +/- 0.05, N = 12SE +/- 0.28, N = 12SE +/- 0.06, N = 12SE +/- 0.31, N = 12SE +/- 0.20, N = 3422.111170.371192.041219.91886.871123.751135.731509.241919.33722.941. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406030060090012001500Min: 420.21 / Avg: 422.11 / Max: 423Min: 1128.41 / Avg: 1170.37 / Max: 1216.68Min: 1138.91 / Avg: 1192.04 / Max: 1214.54Min: 1175.44 / Avg: 1219.91 / Max: 1258.29Min: 885.51 / Avg: 886.87 / Max: 887.59Min: 1123.34 / Avg: 1123.75 / Max: 1124.06Min: 1133.76 / Avg: 1135.73 / Max: 1137.34Min: 1508.89 / Avg: 1509.24 / Max: 1509.56Min: 1916.88 / Avg: 1919.33 / Max: 1920.78Min: 722.62 / Avg: 722.94 / Max: 723.31. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060816243240SE +/- 0.0096, N = 10SE +/- 0.0057, N = 13SE +/- 0.0328, N = 13SE +/- 0.0148, N = 15SE +/- 0.0089, N = 12SE +/- 0.0208, N = 13SE +/- 0.0181, N = 13SE +/- 0.0040, N = 13SE +/- 0.1231, N = 14SE +/- 0.0125, N = 36.437818.542422.336026.088215.105321.196425.331126.226336.515718.92491. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060816243240Min: 6.37 / Avg: 6.44 / Max: 6.47Min: 18.51 / Avg: 18.54 / Max: 18.59Min: 22.17 / Avg: 22.34 / Max: 22.57Min: 25.91 / Avg: 26.09 / Max: 26.14Min: 15.07 / Avg: 15.11 / Max: 15.17Min: 21.07 / Avg: 21.2 / Max: 21.31Min: 25.18 / Avg: 25.33 / Max: 25.46Min: 26.19 / Avg: 26.23 / Max: 26.24Min: 35.37 / Avg: 36.52 / Max: 37.39Min: 18.91 / Avg: 18.92 / Max: 18.951. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060612182430SE +/- 0.00, N = 12SE +/- 0.00, N = 14SE +/- 0.02, N = 14SE +/- 0.00, N = 14SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 311.7723.4523.2523.4724.0324.0024.0024.0224.0212.841. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060612182430Min: 11.77 / Avg: 11.77 / Max: 11.78Min: 23.43 / Avg: 23.45 / Max: 23.46Min: 23.14 / Avg: 23.25 / Max: 23.32Min: 23.46 / Avg: 23.47 / Max: 23.48Min: 24.02 / Avg: 24.03 / Max: 24.04Min: 23.99 / Avg: 24 / Max: 24.01Min: 23.99 / Avg: 24 / Max: 24.01Min: 24 / Avg: 24.02 / Max: 24.04Min: 24.01 / Avg: 24.02 / Max: 24.04Min: 12.83 / Avg: 12.84 / Max: 12.841. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060612182430SE +/- 0.00, N = 12SE +/- 0.00, N = 13SE +/- 0.01, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 311.2322.4222.3922.4426.3426.3426.3426.3426.3413.191. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060612182430Min: 11.23 / Avg: 11.23 / Max: 11.24Min: 22.41 / Avg: 22.42 / Max: 22.43Min: 22.35 / Avg: 22.39 / Max: 22.42Min: 22.43 / Avg: 22.44 / Max: 22.44Min: 26.34 / Avg: 26.34 / Max: 26.35Min: 26.33 / Avg: 26.34 / Max: 26.34Min: 26.33 / Avg: 26.34 / Max: 26.35Min: 26.33 / Avg: 26.34 / Max: 26.35Min: 26.33 / Avg: 26.34 / Max: 26.35Min: 13.19 / Avg: 13.19 / Max: 13.191. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40605001000150020002500SE +/- 2.94, N = 3SE +/- 0.06, N = 3SE +/- 21.20, N = 15SE +/- 1.66, N = 3SE +/- 2.22, N = 4SE +/- 3.45, N = 4SE +/- 4.90, N = 4SE +/- 5.97, N = 4SE +/- 4.76, N = 3SE +/- 2.12, N = 3454.75731.22766.31983.091270.572107.372124.532076.612205.281790.401. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060400800120016002000Min: 451.67 / Avg: 454.75 / Max: 460.63Min: 731.1 / Avg: 731.22 / Max: 731.32Min: 701.03 / Avg: 766.31 / Max: 867.87Min: 980.65 / Avg: 983.09 / Max: 986.26Min: 1265.09 / Avg: 1270.57 / Max: 1275.64Min: 2100.69 / Avg: 2107.37 / Max: 2114.96Min: 2111.84 / Avg: 2124.53 / Max: 2134.27Min: 2058.72 / Avg: 2076.61 / Max: 2083.26Min: 2195.84 / Avg: 2205.28 / Max: 2211.09Min: 1787.35 / Avg: 1790.4 / Max: 1794.481. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40601428425670SE +/- 0.04, N = 3SE +/- 0.00, N = 6SE +/- 0.01, N = 6SE +/- 0.00, N = 6SE +/- 0.00, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 362.1120.6119.1717.3226.3221.0920.2315.8112.7632.541. (CXX) g++ options: -O3
OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40601224364860Min: 62.08 / Avg: 62.11 / Max: 62.18Min: 20.6 / Avg: 20.61 / Max: 20.62Min: 19.14 / Avg: 19.17 / Max: 19.19Min: 17.31 / Avg: 17.32 / Max: 17.32Min: 26.31 / Avg: 26.32 / Max: 26.33Min: 21.06 / Avg: 21.09 / Max: 21.13Min: 20.22 / Avg: 20.23 / Max: 20.24Min: 15.8 / Avg: 15.81 / Max: 15.82Min: 12.72 / Avg: 12.76 / Max: 12.81Min: 32.53 / Avg: 32.54 / Max: 32.541. (CXX) g++ options: -O3

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP32Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40609001800270036004500SE +/- 0.00, N = 3SE +/- 2.73, N = 3SE +/- 8.02, N = 3SE +/- 5.57, N = 3SE +/- 0.00, N = 3SE +/- 1.45, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 2.52, N = 3SE +/- 0.00, N = 3622253224702616214126162628351541821631
OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP32Arc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40607001400210028003500Min: 622 / Avg: 622 / Max: 622Min: 2528 / Avg: 2531.67 / Max: 2537Min: 2461 / Avg: 2470 / Max: 2486Min: 2609 / Avg: 2616 / Max: 2627Min: 2141 / Avg: 2141 / Max: 2141Min: 2614 / Avg: 2616.33 / Max: 2619Min: 2628 / Avg: 2628 / Max: 2628Min: 3515 / Avg: 3515 / Max: 3515Min: 4179 / Avg: 4182 / Max: 4187Min: 1631 / Avg: 1631 / Max: 1631

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16CArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406017003400510068008500SE +/- 0.33, N = 3SE +/- 20.04, N = 3SE +/- 14.29, N = 3SE +/- 45.67, N = 3SE +/- 2.73, N = 3SE +/- 7.51, N = 3SE +/- 1.00, N = 3SE +/- 3.53, N = 3SE +/- 7.80, N = 3SE +/- 0.67, N = 31115348339524271354346665027592177583158
OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16CArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406013002600390052006500Min: 1115 / Avg: 1115.33 / Max: 1116Min: 3446 / Avg: 3482.67 / Max: 3515Min: 3924 / Avg: 3952 / Max: 3971Min: 4184 / Avg: 4270.67 / Max: 4339Min: 3539 / Avg: 3542.67 / Max: 3548Min: 4658 / Avg: 4666 / Max: 4681Min: 5026 / Avg: 5027 / Max: 5029Min: 5916 / Avg: 5921.33 / Max: 5928Min: 7745 / Avg: 7758.33 / Max: 7772Min: 3157 / Avg: 3158.33 / Max: 3159

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16SArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40602K4K6K8K10KSE +/- 10.37, N = 3SE +/- 8.41, N = 3SE +/- 0.88, N = 3SE +/- 30.66, N = 3SE +/- 0.33, N = 3SE +/- 2.33, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 3.06, N = 3SE +/- 0.00, N = 31097388242124422414351165191683782263085
OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16SArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 406014002800420056007000Min: 1077 / Avg: 1097.33 / Max: 1111Min: 3865 / Avg: 3881.67 / Max: 3892Min: 4210 / Avg: 4211.67 / Max: 4213Min: 4378 / Avg: 4422 / Max: 4481Min: 4143 / Avg: 4143.33 / Max: 4144Min: 5112 / Avg: 5116.33 / Max: 5120Min: 5190 / Avg: 5190.67 / Max: 5191Min: 6835 / Avg: 6836.67 / Max: 6838Min: 8220 / Avg: 8226 / Max: 8230Min: 3085 / Avg: 3085 / Max: 3085

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40603K6K9K12K15KSE +/- 0.22, N = 4SE +/- 3.59, N = 3SE +/- 3.74, N = 3SE +/- 7.88, N = 3SE +/- 3.38, N = 13SE +/- 13.43, N = 13SE +/- 5.08, N = 13SE +/- 47.35, N = 13SE +/- 73.29, N = 13SE +/- 13.40, N = 31484.664065.044833.885530.716311.447798.6810067.0510643.4414227.557559.911. (CXX) g++ options: -O3
OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40602K4K6K8K10KMin: 1484.36 / Avg: 1484.66 / Max: 1485.3Min: 4057.87 / Avg: 4065.04 / Max: 4068.93Min: 4827.06 / Avg: 4833.88 / Max: 4839.93Min: 5517.54 / Avg: 5530.71 / Max: 5544.78Min: 6298.44 / Avg: 6311.44 / Max: 6331.69Min: 7746.07 / Avg: 7798.68 / Max: 7889.66Min: 10017.6 / Avg: 10067.05 / Max: 10086.2Min: 10301.26 / Avg: 10643.44 / Max: 10891.72Min: 13942.66 / Avg: 14227.55 / Max: 14785.54Min: 7533.12 / Avg: 7559.91 / Max: 7573.861. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40603K6K9K12K15KSE +/- 0.13, N = 4SE +/- 0.41, N = 3SE +/- 2.06, N = 3SE +/- 3.66, N = 3SE +/- 8.59, N = 13SE +/- 42.07, N = 13SE +/- 17.10, N = 13SE +/- 41.98, N = 13SE +/- 48.38, N = 13SE +/- 0.58, N = 31484.044068.454834.195519.116336.878000.2710095.6710700.7314210.917587.931. (CXX) g++ options: -O3
OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40602K4K6K8K10KMin: 1483.85 / Avg: 1484.04 / Max: 1484.42Min: 4067.63 / Avg: 4068.45 / Max: 4068.91Min: 4830.09 / Avg: 4834.19 / Max: 4836.45Min: 5513.68 / Avg: 5519.11 / Max: 5526.09Min: 6293.21 / Avg: 6336.87 / Max: 6396.53Min: 7751.57 / Avg: 8000.27 / Max: 8194.31Min: 10024.85 / Avg: 10095.67 / Max: 10210.89Min: 10360.48 / Avg: 10700.73 / Max: 10931.3Min: 14036.56 / Avg: 14210.91 / Max: 14648.4Min: 7586.79 / Avg: 7587.93 / Max: 7588.651. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060140280420560700SE +/- 0.08, N = 5SE +/- 0.07, N = 8SE +/- 0.15, N = 8SE +/- 0.13, N = 8SE +/- 0.84, N = 9SE +/- 0.53, N = 11SE +/- 0.02, N = 11SE +/- 0.68, N = 15SE +/- 0.92, N = 11SE +/- 0.06, N = 3120.43387.87396.80397.62315.05379.99390.44529.79638.22238.391. (CXX) g++ options: -O3
OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 4060110220330440550Min: 120.25 / Avg: 120.43 / Max: 120.63Min: 387.49 / Avg: 387.87 / Max: 388.08Min: 396.21 / Avg: 396.8 / Max: 397.39Min: 397 / Avg: 397.62 / Max: 398.15Min: 308.4 / Avg: 315.05 / Max: 316Min: 377.79 / Avg: 379.99 / Max: 382.89Min: 390.35 / Avg: 390.44 / Max: 390.52Min: 520.29 / Avg: 529.79 / Max: 530.56Min: 631.47 / Avg: 638.22 / Max: 642.91Min: 238.27 / Avg: 238.39 / Max: 238.451. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40606K12K18K24K30KSE +/- 0.38, N = 3SE +/- 1.76, N = 3SE +/- 2.25, N = 3SE +/- 0.78, N = 3SE +/- 13.72, N = 13SE +/- 67.44, N = 13SE +/- 38.39, N = 13SE +/- 137.23, N = 13SE +/- 181.88, N = 13SE +/- 0.10, N = 33315.829748.2611371.1113008.6812309.2115298.7619674.2520722.7327261.6114783.091. (CXX) g++ options: -O3
OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeArc A380Arc A580Arc A750Arc A770RTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 40605K10K15K20K25KMin: 3315.34 / Avg: 3315.82 / Max: 3316.58Min: 9745.03 / Avg: 9748.26 / Max: 9751.1Min: 11366.66 / Avg: 11371.11 / Max: 11373.91Min: 13007.73 / Avg: 13008.68 / Max: 13010.23Min: 12227.17 / Avg: 12309.21 / Max: 12409.19Min: 14975.48 / Avg: 15298.76 / Max: 15904.81Min: 19519.49 / Avg: 19674.25 / Max: 20026.68Min: 19887.82 / Avg: 20722.73 / Max: 21283.14Min: 25620 / Avg: 27261.61 / Max: 27891.71Min: 14782.9 / Avg: 14783.09 / Max: 14783.191. (CXX) g++ options: -O3