gpu comp

Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2402256-PTS-GPUCOMP691&gru&rdt&rro.

gpu comp: system details

Runs: 4080 super a, b, c, d, RTX 4060, e, f, g, 4070 TI SUPER, h, i, 4090, j, k, l, m

Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G
Disk: 2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GB
Graphics: NVIDIA GeForce RTX 4080 SUPER 16GB
Audio: NVIDIA Device 22bb
Monitor: DELL U2723QE
Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Ubuntu 23.10
Kernel: 6.7.0-060700-generic (x86_64)
Desktop: GNOME Shell 45.2
Display Server: X Server 1.21.1.7
Display Driver: NVIDIA 550.40.07
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.4.74
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

Entries that change in later configurations, in the order listed by the export:
- Graphics: MSI NVIDIA GeForce RTX 4060 8GB; Audio: NVIDIA Device 22be; Disk: 4001GB Western Digital WD_BLACK SN850X 4000GB + 2000GB Samsung SSD 980 PRO 2TB
- Graphics: ASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GB; Audio: NVIDIA Device 22bb; Disk: 2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GB
- Graphics: NVIDIA GeForce RTX 4090 24GB; Audio: NVIDIA AD102 HD Audio

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- 4080 super a, b, c, d, RTX 4060, e, f, g, 4070 TI SUPER, h, i: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203
- 4090, j, k, l, m: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203

Graphics Details
- 4080 super a, b, c, d: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01
- RTX 4060, e, f, g: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3
- 4070 TI SUPER, h, i: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.45.00.9c
- 4090, j, k, l, m: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01

OpenCL Details
- 4080 super a, b, c, d: GPU Compute Cores: 10240
- RTX 4060, e, f, g: GPU Compute Cores: 3072
- 4070 TI SUPER, h, i: GPU Compute Cores: 8448
- 4090, j, k, l, m: GPU Compute Cores: 16384

Python Details
- Python 3.11.6

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Vulnerable: Safe RET no microcode
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

gpu comp: results overview

Runs: 4080 super a, b, c, d, RTX 4060, e, f, g, 4070 TI SUPER, h, i, 4090, j, k, l, m

Tests covered (per-run values appear in the individual result sections):
- vkfft: FFT + iFFT R2C / C2R
- vkfft: FFT + iFFT C2C 1D batched in half precision
- vkfft: FFT + iFFT C2C Bluestein in single precision
- vkfft: FFT + iFFT C2C 1D batched in double precision
- vkfft: FFT + iFFT C2C 1D batched in single precision
- vkfft: FFT + iFFT C2C multidimensional in single precision
- vkfft: FFT + iFFT C2C Bluestein benchmark in double precision
- vkfft: FFT + iFFT C2C 1D batched in single precision, no reshuffling
- libplacebo: deband_heavy
- libplacebo: polar_nocompute
- libplacebo: hdr_peakdetect
- libplacebo: hdr_lut
- libplacebo: av1_grain_lap
- libplacebo: gaussian
- opencl-benchmark: Memory Bandwidth Coalesced Read
- opencl-benchmark: Memory Bandwidth Coalesced Write
- gpuowl: 57885161
- gpuowl: 77936867
- gpuowl: 332220523
- fluidx3d: FP32-FP32
- fluidx3d: FP32-FP16C
- fluidx3d: FP32-FP16S
- opencl-benchmark: FP64 Compute
- opencl-benchmark: FP32 Compute
- opencl-benchmark: INT64 Compute
- opencl-benchmark: INT32 Compute
- opencl-benchmark: INT16 Compute
- opencl-benchmark: INT8 Compute

VkFFT

Test: FFT + iFFT R2C / C2R

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 24.98, N = 3.
m: 75912 | l: 78780 | k: 71755 | j: 76372 | 4090: 79195
i: 66874 | h: 66849 | 4070 TI SUPER: 67558
g: 35219 | f: 35167 | e: 35201 | RTX 4060: 35601
d: 67647 | c: 63887 | b: 63956 | 4080 super a: 67257
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 194.61, N = 3.
m: 206803 | l: 205813 | k: 197629 | j: 214418 | 4090: 214165
i: 141164 | h: 127806 | 4070 TI SUPER: 138716
g: 83065 | f: 82997 | e: 82725 | RTX 4060: 82823
d: 147059 | c: 139429 | b: 146963 | 4080 super a: 143183
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 43.25, N = 3.
m: 18619 | l: 18369 | k: 18205 | j: 19398 | 4090: 16151
i: 16357 | h: 16243 | 4070 TI SUPER: 14512
g: 10461 | f: 10593 | e: 10635 | RTX 4060: 10715
d: 16831 | c: 16838 | b: 16963 | 4080 super a: 16388
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 8.95, N = 3.
m: 45665 | l: 50684 | k: 46461 | j: 37573 | 4090: 37812
i: 24125 | h: 23199 | 4070 TI SUPER: 30677
g: 12103 | f: 12096 | e: 12070 | RTX 4060: 12025
d: 34356 | c: 34346 | b: 31775 | 4080 super a: 31464
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 0.67, N = 3.
m: 151234 | l: 150378 | k: 150527 | j: 149674 | 4090: 151611
i: 102682 | h: 102692 | 4070 TI SUPER: 102606
g: 42345 | f: 42341 | e: 42338 | RTX 4060: 42347
d: 106300 | c: 106208 | b: 106348 | 4080 super a: 106172
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 51.67, N = 3.
m: 74587 | l: 84200 | k: 80257 | j: 85349 | 4090: 80489
i: 68137 | h: 67587 | 4070 TI SUPER: 66454
g: 38609 | f: 38064 | e: 38684 | RTX 4060: 37049
d: 74674 | c: 68642 | b: 75029 | 4080 super a: 73948
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 2.40, N = 3.
m: 8208 | l: 8210 | k: 8223 | j: 8211 | 4090: 8183
i: 4983 | h: 4999 | 4070 TI SUPER: 4850
g: 2373 | f: 2385 | e: 2373 | RTX 4060: 2371
d: 5687 | c: 5712 | b: 5706 | 4080 super a: 5686
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

VkFFT 1.3.4. Benchmark Score, More Is Better. SE +/- 2.03, N = 3.
m: 152451 | l: 152986 | k: 152066 | j: 152220 | 4090: 152245
i: 104266 | h: 104294 | 4070 TI SUPER: 104271
g: 43071 | f: 43079 | e: 43067 | RTX 4060: 43062
d: 107754 | c: 107838 | b: 107741 | 4080 super a: 107604
1. (CXX) g++ options: -O3

Libplacebo

Test: deband_heavy

Libplacebo 6.338.2. FPS, More Is Better. SE +/- 0.15, N = 3.
m: 2293.81 | l: 2295.23 | k: 2293.54 | j: 2270.65 | 4090: 2294.94
i: 1324.84 | h: 1324.36 | 4070 TI SUPER: 1323.80
g: 514.70 | f: 514.52 | e: 517.52 | RTX 4060: 514.76
d: 1530.76 | c: 1528.48 | b: 1527.65 | 4080 super a: 1529.33
1. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: polar_nocompute

Libplacebo 6.338.2. FPS, More Is Better. SE +/- 0.04, N = 3.
m: 2855.33 | l: 2851.23 | k: 2856.18 | j: 2845.61 | 4090: 2848.16
i: 1672.08 | h: 1671.91 | 4070 TI SUPER: 1672.46
g: 656.20 | f: 655.72 | e: 659.34 | RTX 4060: 655.96
d: 1924.72 | c: 1918.63 | b: 1918.79 | 4080 super a: 1919.91
1. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

Libplacebo 6.338.2. FPS, More Is Better. SE +/- 19.72, N = 3.
m: 4000.04 | l: 3616.57 | k: 3990.18 | j: 3586.11 | 4090: 3880.87
i: 2872.21 | h: 2882.56 | 4070 TI SUPER: 2874.75
g: 1503.72 | f: 1506.93 | e: 1546.47 | RTX 4060: 1570.77
d: 3251.95 | c: 3184.60 | b: 3260.62 | 4080 super a: 3002.98
1. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: hdr_lut

Libplacebo 6.338.2. FPS, More Is Better. SE +/- 0.11, N = 3.
m: 3543.51 | l: 3536.98 | k: 3545.36 | j: 3546.09 | 4090: 3542.89
i: 2772.40 | h: 2789.73 | 4070 TI SUPER: 2789.69
g: 1526.73 | f: 1522.30 | e: 1525.75 | RTX 4060: 1525.07
d: 2860.74 | c: 2856.35 | b: 2858.87 | 4080 super a: 2858.68
1. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: av1_grain_lap

Libplacebo 6.338.2. FPS, More Is Better. SE +/- 0.40, N = 3.
m: 3453.74 | l: 3462.39 | k: 3503.94 | j: 3490.13 | 4090: 3470.09
i: 3487.01 | h: 3516.56 | 4070 TI SUPER: 3538.35
g: 1724.71 | f: 1722.45 | e: 1725.79 | RTX 4060: 1723.15
d: 3189.61 | c: 3476.23 | b: 3476.35 | 4080 super a: 3475.58
1. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: gaussian

Libplacebo 6.338.2. FPS, More Is Better. SE +/- 0.37, N = 3.
m: 4407.96 | l: 4405.21 | k: 4407.68 | j: 4364.76 | 4090: 4406.25
i: 3120.90 | h: 3120.65 | 4070 TI SUPER: 3118.52
g: 1511.91 | f: 1511.72 | e: 1512.65 | RTX 4060: 1512.45
d: 3412.08 | c: 3413.41 | b: 3411.13 | 4080 super a: 3407.97
1. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

ProjectPhysX OpenCL-Benchmark 1.2. GB/s, More Is Better. SE +/- 0.01, N = 3.
m: 928.10 | l: 927.98 | k: 928.04 | j: 927.85 | 4090: 927.96
i: 619.29 | h: 619.28 | 4070 TI SUPER: 619.31
g: 252.89 | f: 252.89 | e: 252.89 | RTX 4060: 252.90
d: 680.67 | c: 680.77 | b: 680.56 | 4080 super a: 680.51
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

ProjectPhysX OpenCL-Benchmark 1.2. GB/s, More Is Better. SE +/- 0.01, N = 3.
m: 902.16 | l: 903.86 | k: 902.92 | j: 902.58 | 4090: 896.46
i: 607.19 | h: 606.59 | 4070 TI SUPER: 607.15
g: 258.36 | f: 258.40 | e: 258.40 | RTX 4060: 258.47
d: 629.10 | c: 629.88 | b: 631.65 | 4080 super a: 632.69
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

GpuOwl

Exponent: 57885161

GpuOwl 7.5. Iterations / Second, More Is Better. SE +/- 0.00, N = 3.
m: 1934.24 | l: 1926.78 | k: 1937.98 | j: 1937.98 | 4090: 1926.78
i: 1003.01 | h: 1004.02 | 4070 TI SUPER: 1003.01
g: 376.65 | f: 376.65 | e: 378.36 | RTX 4060: 376.65
d: 1191.90 | c: 1191.90 | b: 1190.48 | 4080 super a: 1191.90
1. (CXX) g++ options: -O3 -lgmp -lOpenCL

GpuOwl

Exponent: 77936867

GpuOwl 7.5. Iterations / Second, More Is Better. SE +/- 0.00, N = 3.
m: 1440.92 | l: 1434.72 | k: 1434.72 | j: 1440.92 | 4090: 1432.66
i: 745.16 | h: 745.16 | 4070 TI SUPER: 744.60
g: 278.55 | f: 277.32 | e: 278.55 | RTX 4060: 278.55
d: 896.06 | c: 894.45 | b: 896.06 | 4080 super a: 896.06
1. (CXX) g++ options: -O3 -lgmp -lOpenCL

GpuOwl

Exponent: 332220523

GpuOwl 7.5. Iterations / Second, More Is Better. SE +/- 0.00, N = 3.
m: 305.34 | l: 304.60 | k: 304.51 | j: 304.79 | 4090: 304.51
i: 159.82 | h: 159.87 | 4070 TI SUPER: 159.82
g: 58.85 | f: 58.58 | e: 58.85 | RTX 4060: 58.85
d: 189.83 | c: 189.61 | b: 189.83 | 4080 super a: 189.68
1. (CXX) g++ options: -O3 -lgmp -lOpenCL

FluidX3D

Test: FP32-FP32

FluidX3D 2.9. MLUPs/s, More Is Better. SE +/- 0.00, N = 3.
m: 5737 | l: 5739 | k: 5737 | j: 5733 | 4090: 5736
i: 3830 | h: 3830 | 4070 TI SUPER: 3831
g: 1609 | f: 1608 | e: 1609 | RTX 4060: 1609
d: 3967 | c: 3967 | b: 3968 | 4080 super a: 3972

FluidX3D

Test: FP32-FP16C

FluidX3D 2.9. MLUPs/s, More Is Better. SE +/- 0.33, N = 3.
m: 10983 | l: 10982 | k: 10985 | j: 10882 | 4090: 11063
i: 7421 | h: 7420 | 4070 TI SUPER: 7420
g: 3114 | f: 3114 | e: 3113 | RTX 4060: 3114
d: 8128 | c: 8145 | b: 8121 | 4080 super a: 8127

FluidX3D

Test: FP32-FP16S

FluidX3D 2.9. MLUPs/s, More Is Better. SE +/- 0.33, N = 3.
m: 9809 | l: 9800 | k: 9806 | j: 9785 | 4090: 9906
i: 6683 | h: 6681 | 4070 TI SUPER: 6678
g: 3039 | f: 3039 | e: 3039 | RTX 4060: 3038
d: 8078 | c: 8079 | b: 8079 | 4080 super a: 8077

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TFLOPs/s, More Is Better. SE +/- 0.000, N = 3.
m: 1.389 | l: 1.389 | k: 1.389 | j: 1.389 | 4090: 1.389
i: 0.725 | h: 0.725 | 4070 TI SUPER: 0.725
g: 0.264 | f: 0.264 | e: 0.264 | RTX 4060: 0.264
d: 0.864 | c: 0.864 | b: 0.864 | 4080 super a: 0.861
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TFLOPs/s, More Is Better. SE +/- 0.00, N = 3.
m: 85.81 | l: 85.85 | k: 85.80 | j: 85.87 | 4090: 85.83
i: 45.07 | h: 45.07 | 4070 TI SUPER: 45.07
g: 16.51 | f: 16.51 | e: 16.51 | RTX 4060: 16.51
d: 53.66 | c: 53.63 | b: 53.49 | 4080 super a: 53.50
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, More Is Better. SE +/- 0.002, N = 3.
m: 4.349 | l: 4.349 | k: 4.347 | j: 4.344 | 4090: 4.346
i: 4.308 | h: 4.272 | 4070 TI SUPER: 4.280
g: 2.091 | f: 2.089 | e: 2.094 | RTX 4060: 2.087
d: 4.239 | c: 4.239 | b: 4.252 | 4080 super a: 4.237
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, More Is Better. SE +/- 0.000, N = 3.
m: 44.330 | l: 44.348 | k: 44.398 | j: 44.329 | 4090: 44.411
i: 23.202 | h: 23.207 | 4070 TI SUPER: 23.186
g: 8.491 | f: 8.491 | e: 8.491 | RTX 4060: 8.491
d: 27.630 | c: 27.630 | b: 27.581 | 4080 super a: 27.629
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, More Is Better. SE +/- 0.001, N = 3.
m: 38.341 | l: 38.361 | k: 38.518 | j: 38.361 | 4090: 38.335
i: 20.198 | h: 20.212 | 4070 TI SUPER: 20.217
g: 7.419 | f: 7.417 | e: 7.417 | RTX 4060: 7.414
d: 23.982 | c: 23.984 | b: 23.978 | 4080 super a: 23.984
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, More Is Better. SE +/- 0.012, N = 3.
m: 33.229 | l: 33.232 | k: 33.220 | j: 33.224 | 4090: 33.234
i: 17.134 | h: 17.135 | 4070 TI SUPER: 17.135
g: 6.214 | f: 6.215 | e: 6.225 | RTX 4060: 6.179
d: 20.822 | c: 20.825 | b: 20.819 | 4080 super a: 20.822
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
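
Since each card was run several times, one way to summarize a test is to average its runs per card and express each mean relative to a baseline. The sketch below does this for the OpenCL-Benchmark FP32 Compute results reported above. The grouping of runs by card (a to d with the 4080 SUPER, e to g with the RTX 4060, h and i with the 4070 Ti SUPER, j to m with the 4090) is an assumption read from the Graphics Details listing, and the helper name relative_to is hypothetical.

```python
from statistics import mean

# FP32 Compute results in TFLOPs/s, copied from the OpenCL-Benchmark
# FP32 Compute result in this export. Grouping runs under each card is an
# assumption based on the Graphics Details listing, not stated per run.
fp32_tflops = {
    "RTX 4080 SUPER": [53.50, 53.49, 53.63, 53.66],      # runs "4080 super a", b, c, d
    "RTX 4060": [16.51, 16.51, 16.51, 16.51],            # runs "RTX 4060", e, f, g
    "RTX 4070 Ti SUPER": [45.07, 45.07, 45.07],          # runs "4070 TI SUPER", h, i
    "RTX 4090": [85.83, 85.87, 85.80, 85.85, 85.81],     # runs "4090", j, k, l, m
}


def relative_to(results, baseline):
    """Return each card's mean result divided by the baseline card's mean."""
    base = mean(results[baseline])
    return {card: mean(runs) / base for card, runs in results.items()}


if __name__ == "__main__":
    ratios = relative_to(fp32_tflops, "RTX 4060")
    for card, ratio in ratios.items():
        print(f"{card}: {mean(fp32_tflops[card]):.2f} TFLOPs/s, {ratio:.2f}x RTX 4060")
```

The same dictionary shape works for any other test in this export; only the values and the unit in the print statement change.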


Phoronix Test Suite v10.8.5