gpu comp

Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2402256-PTS-GPUCOMP691&grr&sro.

gpu compProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen Resolution4080 super abcdRTX 4060efg4070 TI SUPERhi4090jklmAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)AMD Device 14d82 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GBNVIDIA GeForce RTX 4080 SUPER 16GBNVIDIA Device 22bbDELL U2723QEIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.106.7.0-060700-generic (x86_64)GNOME Shell 45.2X Server 1.21.1.7NVIDIA 550.40.074.6.0OpenCL 3.0 CUDA 12.4.74GCC 13.2.0ext43840x2160MSI NVIDIA GeForce RTX 4060 8GBNVIDIA Device 22be4001GB Western Digital WD_BLACK SN850X 4000GB + 2000GB Samsung SSD 980 PRO 2TBASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GBNVIDIA Device 22bb2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GBNVIDIA GeForce RTX 4090 24GBNVIDIA AD102 HD AudioOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- 4080 super a: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- b: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- c: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- d: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- RTX 4060: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- e: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- f: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- g: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- 4070 TI SUPER: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- h: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- i: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203- 4090: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- j: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- k: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- l: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203- m: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Graphics Details- 4080 super a: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01- b: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01- c: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01- d: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01- RTX 4060: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- e: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- f: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- g: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- 4070 TI SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.45.00.9c- h: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.45.00.9c- i: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.45.00.9c- 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01- j: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01- k: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01- l: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01- m: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01OpenCL Details- 4080 super a: GPU Compute Cores: 10240- b: GPU Compute Cores: 10240- c: GPU Compute Cores: 10240- d: GPU Compute Cores: 10240- RTX 4060: GPU Compute Cores: 3072- e: GPU Compute Cores: 3072- f: GPU Compute Cores: 3072- g: GPU Compute Cores: 3072- 4070 TI SUPER: GPU Compute Cores: 8448- h: GPU Compute Cores: 8448- i: GPU Compute Cores: 8448- 4090: GPU Compute Cores: 16384- j: GPU Compute Cores: 16384- k: GPU Compute Cores: 16384- l: GPU Compute Cores: 16384- m: GPU Compute Cores: 16384Python Details- Python 3.11.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

gpu compgpuowl: 77936867gpuowl: 332220523gpuowl: 57885161vkfft: FFT + iFFT C2C 1D batched in double precisionvkfft: FFT + iFFT C2C Bluestein benchmark in double precisionvkfft: FFT + iFFT C2C 1D batched in single precisionvkfft: FFT + iFFT C2C 1D batched in single precision, no reshufflingvkfft: FFT + iFFT C2C Bluestein in single precisionfluidx3d: FP32-FP32libplacebo: gaussianlibplacebo: av1_grain_laplibplacebo: hdr_lutlibplacebo: hdr_peakdetectlibplacebo: polar_nocomputelibplacebo: deband_heavyvkfft: FFT + iFFT C2C 1D batched in half precisionfluidx3d: FP32-FP16Sfluidx3d: FP32-FP16Cvkfft: FFT + iFFT C2C multidimensional in single precisionvkfft: FFT + iFFT R2C / C2Ropencl-benchmark: Memory Bandwidth Coalesced Writeopencl-benchmark: Memory Bandwidth Coalesced Readopencl-benchmark: INT8 Computeopencl-benchmark: INT16 Computeopencl-benchmark: INT32 Computeopencl-benchmark: INT64 Computeopencl-benchmark: FP32 Computeopencl-benchmark: FP64 Compute4080 super abcdRTX 4060efg4070 TI SUPERhi4090jklm896.06189.681191.895113233146456861061721076041638839723407.973475.582858.683002.981919.911529.33143183807781277394867257632.69680.5120.82223.98427.6294.23753.5040.861896.06189.831190.483177557061063481077411696339683411.133476.352858.873260.621918.791527.65146963807981217502963956631.65680.5620.81923.97827.5814.25253.4890.864894.45189.611191.895113233434657121062081078381683839673413.413476.232856.353184.61918.631528.48139429807981456864263887629.88680.7720.82523.98427.634.23953.6310.864896.06189.831191.895113233435656871063001077541683139673412.083189.612860.743251.951924.721530.76147059807881287467467647629.1680.6720.82223.98227.634.23953.6550.864278.5558.85376.6512025237142347430621071516091512.451723.151525.071570.77655.96514.7682823303831143704935601258.47252.96.1797.4148.4912.08716.5050.264278.5558.85378.3612070237342338430671063516091512.651725.791525.751546.47659.34517.5282725303931133868435201258.40252.896.2257.4178.4912.09416.5060.264277.3258.58376.6512096238542341430791059316081511.721722.451522.31506.93655.72514.5282997303931143806435167258.4252.896.2157.4178.4912.08916.5050.264278.5558.85376.6512103237342345430711046116091511.911724.711526.731503.72656.2514.783065303931143860935219258.36252.896.2147.4198.4912.09116.5050.264744.60159.821003.013067748501026061042711451238313118.523538.352789.692874.751672.461323.8138716667874206645467558607.15619.3117.13520.21723.1864.2845.0680.725745.16159.871004.0160642572319949991026921042941624338303120.653516.562789.732882.561671.911324.36127806668174206758766849606.59619.2817.13520.21223.2074.27245.0670.725745.16159.821003.012412549831026821042661635738303120.93487.012772.42872.211672.081324.84141164668374216813766874607.19619.2917.13420.19823.2024.30845.0710.7251432.664756447304.511926.783781281831516111522451615157364406.253470.093542.893880.872848.162294.942141659906110638048979195896.46927.9633.23438.33544.4114.34685.8311.3891440.92304.791937.9844961243757382111496741522201939857334364.763490.133546.093586.112845.612270.652144189785108828534976372902.58927.8533.22438.36144.3294.34485.8711.3891434.72304.511937.9844961244646182231505271520661820557374407.683503.943545.363990.182856.182293.541976299806109858025771755902.92928.0433.2238.51844.3984.34785.7971.3891434.72304.601926.785068482101503781529861836957394405.213462.393536.983616.572851.232295.232058139800109828420078780903.86927.9833.23238.36144.3484.34985.8491.3891440.92305.341934.244566582081512341524511861957374407.963453.743543.514000.042855.332293.812068039809109837458775912902.16928.133.22938.34144.334.34985.8091.389OpenBenchmarking.org

GpuOwl

Exponent: 77936867

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 779368674070 TI SUPER4080 super a4090RTX 4060bcdefghijklm30060090012001500SE +/- 0.00, N = 3744.60896.061432.66278.55896.06894.45896.06278.55277.32278.55745.16745.161440.921434.721434.721440.921. (CXX) g++ options: -O3 -lgmp -lOpenCL

GpuOwl

Exponent: 332220523

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 3322205234070 TI SUPER4080 super a4090RTX 4060bcdefghijklm70140210280350SE +/- 0.00, N = 3159.82189.68304.5158.85189.83189.61189.8358.8558.5858.85159.87159.82304.79304.51304.60305.341. (CXX) g++ options: -O3 -lgmp -lOpenCL

GpuOwl

Exponent: 57885161

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 578851614070 TI SUPER4080 super a4090RTX 4060bcdefghijklm400800120016002000SE +/- 0.00, N = 31003.011191.901926.78376.651190.481191.901191.90378.36376.65376.651004.021003.011937.981937.981926.781934.241. (CXX) g++ options: -O3 -lgmp -lOpenCL

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C 1D batched in double precision4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm11K22K33K44K55KSE +/- 8.95, N = 3306773146437812120253177534346343561207012096121032319924125375734646150684456651. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C Bluestein benchmark in double precision4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm2K4K6K8K10KSE +/- 2.40, N = 348505686818323715706571256872373238523734999498382118223821082081. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C 1D batched in single precision4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm30K60K90K120K150KSE +/- 0.67, N = 3102606106172151611423471063481062081063004233842341423451026921026821496741505271503781512341. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm30K60K90K120K150KSE +/- 2.03, N = 3104271107604152245430621077411078381077544306743079430711042941042661522201520661529861524511. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C Bluestein in single precision4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm4K8K12K16K20KSE +/- 43.25, N = 3145121638816151107151696316838168311063510593104611624316357193981820518369186191. (CXX) g++ options: -O3

FluidX3D

Test: FP32-FP32

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP324070 TI SUPER4080 super a4090RTX 4060bcdefghijklm12002400360048006000SE +/- 0.00, N = 33831397257361609396839673967160916081609383038305733573757395737

Libplacebo

Test: gaussian

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 6.338.2Test: gaussian4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm9001800270036004500SE +/- 0.37, N = 33118.523407.974406.251512.453411.133413.413412.081512.651511.721511.913120.653120.904364.764407.684405.214407.961. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 6.338.2Test: av1_grain_lap4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm8001600240032004000SE +/- 0.40, N = 33538.353475.583470.091723.153476.353476.233189.611725.791722.451724.713516.563487.013490.133503.943462.393453.741. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: hdr_lut

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 6.338.2Test: hdr_lut4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm8001600240032004000SE +/- 0.11, N = 32789.692858.683542.891525.072858.872856.352860.741525.751522.301526.732789.732772.403546.093545.363536.983543.511. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 6.338.2Test: hdr_peakdetect4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm9001800270036004500SE +/- 19.72, N = 32874.753002.983880.871570.773260.623184.603251.951546.471506.931503.722882.562872.213586.113990.183616.574000.041. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 6.338.2Test: polar_nocompute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm6001200180024003000SE +/- 0.04, N = 31672.461919.912848.16655.961918.791918.631924.72659.34655.72656.201671.911672.082845.612856.182851.232855.331. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 6.338.2Test: deband_heavy4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm5001000150020002500SE +/- 0.15, N = 31323.801529.332294.94514.761527.651528.481530.76517.52514.52514.701324.361324.842270.652293.542295.232293.811. (CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C 1D batched in half precision4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm50K100K150K200K250KSE +/- 194.61, N = 3138716143183214165828231469631394291470598272582997830651278061411642144181976292058132068031. (CXX) g++ options: -O3

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16S4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm2K4K6K8K10KSE +/- 0.33, N = 36678807799063038807980798078303930393039668166839785980698009809

FluidX3D

Test: FP32-FP16C

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16C4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm2K4K6K8K10KSE +/- 0.33, N = 3742081271106331148121814581283113311431147420742110882109851098210983

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT C2C multidimensional in single precision4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm20K40K60K80K100KSE +/- 51.67, N = 3664547394880489370497502968642746743868438064386096758768137853498025784200745871. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT R2C / C2R

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.3.4Test: FFT + iFFT R2C / C2R4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm20K40K60K80K100KSE +/- 24.98, N = 3675586725779195356016395663887676473520135167352196684966874763727175578780759121. (CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced Write4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm2004006008001000SE +/- 0.01, N = 3607.15632.69896.46258.47631.65629.88629.10258.40258.40258.36606.59607.19902.58902.92903.86902.161. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced Read4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm2004006008001000SE +/- 0.01, N = 3619.31680.51927.96252.90680.56680.77680.67252.89252.89252.89619.28619.29927.85928.04927.98928.101. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT8 Compute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm816243240SE +/- 0.012, N = 317.13520.82233.2346.17920.81920.82520.8226.2256.2156.21417.13517.13433.22433.22033.23233.2291. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT16 Compute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm918273645SE +/- 0.001, N = 320.21723.98438.3357.41423.97823.98423.9827.4177.4177.41920.21220.19838.36138.51838.36138.3411. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT32 Compute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm1020304050SE +/- 0.000, N = 323.18627.62944.4118.49127.58127.63027.6308.4918.4918.49123.20723.20244.32944.39844.34844.3301. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT64 Compute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm0.97851.9572.93553.9144.8925SE +/- 0.002, N = 34.2804.2374.3462.0874.2524.2394.2392.0942.0892.0914.2724.3084.3444.3474.3494.3491. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP32 Compute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm20406080100SE +/- 0.00, N = 345.0753.5085.8316.5153.4953.6353.6616.5116.5116.5145.0745.0785.8785.8085.8585.811. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP64 Compute4070 TI SUPER4080 super a4090RTX 4060bcdefghijklm0.31250.6250.93751.251.5625SE +/- 0.000, N = 30.7250.8611.3890.2640.8640.8640.8640.2640.2640.2640.7250.7251.3891.3891.3891.3891. (CXX) g++ options: -std=c++17 -pthread -lOpenCL


Phoronix Test Suite v10.8.4