gpu comp

Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2402256-PTS-GPUCOMP691&grs.

gpu comp: system details

Run identifiers: 4080 super a, b, c, d; RTX 4060, e, f, g; 4070 TI SUPER, h, i; 4090, j, k, l, m.

Common to all runs:
Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G
Monitor: DELL U2723QE
Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Ubuntu 23.10
Kernel: 6.7.0-060700-generic (x86_64)
Desktop: GNOME Shell 45.2
Display Server: X Server 1.21.1.7
Display Driver: NVIDIA 550.40.07
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.4.74
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

Graphics and audio device by run group:
4080 super a, b, c, d: NVIDIA GeForce RTX 4080 SUPER 16GB; Audio: NVIDIA Device 22bb
RTX 4060, e, f, g: MSI NVIDIA GeForce RTX 4060 8GB; Audio: NVIDIA Device 22be
4070 TI SUPER, h, i: ASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GB; Audio: NVIDIA Device 22bb
4090, j, k, l, m: NVIDIA GeForce RTX 4090 24GB; Audio: NVIDIA AD102 HD Audio

Disk: 2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GB. Some of the later runs report the same two drives in the opposite order (4001GB Western Digital WD_BLACK SN850X 4000GB + 2000GB Samsung SSD 980 PRO 2TB).

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details:
4080 super a, b, c, d, RTX 4060, e, f, g, 4070 TI SUPER, h, i: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203
4090, j, k, l, m: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203

Graphics Details:
4080 super a, b, c, d: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01
RTX 4060, e, f, g: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3
4070 TI SUPER, h, i: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.45.00.9c
4090, j, k, l, m: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01

OpenCL Details:
4080 super a, b, c, d: GPU Compute Cores: 10240
RTX 4060, e, f, g: GPU Compute Cores: 3072
4070 TI SUPER, h, i: GPU Compute Cores: 8448
4090, j, k, l, m: GPU Compute Cores: 16384

Python Details: Python 3.11.6

Security Details:
gather_data_sampling: Not affected
itlb_multihit: Not affected
l1tf: Not affected
mds: Not affected
meltdown: Not affected
mmio_stale_data: Not affected
retbleed: Not affected
spec_rstack_overflow: Vulnerable: Safe RET no microcode
spec_store_bypass: Mitigation of SSB disabled via prctl
spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
srbds: Not affected
tsx_async_abort: Not affected

gpu comp: result overview

Tests in this result file, in the order of the exported overview table (per-run values are listed under each test below):
opencl-benchmark: INT8 Compute
opencl-benchmark: FP64 Compute
opencl-benchmark: INT32 Compute
gpuowl: 332220523
opencl-benchmark: FP32 Compute
gpuowl: 77936867
opencl-benchmark: INT16 Compute
gpuowl: 57885161
libplacebo: deband_heavy
libplacebo: polar_nocompute
vkfft: FFT + iFFT C2C 1D batched in double precision
opencl-benchmark: Memory Bandwidth Coalesced Read
vkfft: FFT + iFFT C2C 1D batched in single precision
fluidx3d: FP32-FP32
fluidx3d: FP32-FP16C
vkfft: FFT + iFFT C2C 1D batched in single precision, no reshuffling
opencl-benchmark: Memory Bandwidth Coalesced Write
vkfft: FFT + iFFT C2C Bluestein benchmark in double precision
fluidx3d: FP32-FP16S
libplacebo: gaussian
libplacebo: hdr_peakdetect
vkfft: FFT + iFFT C2C 1D batched in half precision
libplacebo: hdr_lut
vkfft: FFT + iFFT C2C multidimensional in single precision
vkfft: FFT + iFFT R2C / C2R
opencl-benchmark: INT64 Compute
libplacebo: av1_grain_lap
vkfft: FFT + iFFT C2C Bluestein in single precision

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, more is better. SE +/- 0.012, N = 3.
4080 super a, b, c, d: 20.822, 20.819, 20.825, 20.822
RTX 4060, e, f, g: 6.179, 6.225, 6.215, 6.214
4070 TI SUPER, h, i: 17.135, 17.135, 17.134
4090, j, k, l, m: 33.234, 33.224, 33.220, 33.232, 33.229
(CXX) g++ options: -std=c++17 -pthread -lOpenCL
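One way to read per-run values like the INT8 Compute numbers above is to average the runs for each GPU and express the result relative to the slowest card. The following is a minimal sketch of that arithmetic, not part of the exported result: the grouping of run labels into GPUs follows the system details, and the names `int8_tiops` and `relative_to` are hypothetical helpers introduced for illustration.

```python
from statistics import mean

# INT8 Compute results (TIOPs/s) copied from the listing above.
# Assumption: runs are grouped by GPU as described in the system details.
int8_tiops = {
    "RTX 4080 SUPER": [20.822, 20.819, 20.825, 20.822],
    "RTX 4060": [6.179, 6.225, 6.215, 6.214],
    "RTX 4070 Ti SUPER": [17.135, 17.135, 17.134],
    "RTX 4090": [33.234, 33.224, 33.220, 33.232, 33.229],
}


def relative_to(results, baseline):
    """Return each GPU's mean result divided by the baseline GPU's mean."""
    base = mean(results[baseline])
    return {gpu: mean(runs) / base for gpu, runs in results.items()}


if __name__ == "__main__":
    # Print each GPU's INT8 throughput as a multiple of the RTX 4060.
    for gpu, ratio in relative_to(int8_tiops, "RTX 4060").items():
        print(f"{gpu}: {ratio:.2f}x")
```

The same helper can be pointed at any other test in this file by swapping in that test's values. All tests here are "more is better", so a ratio above 1 means faster than the baseline.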

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TFLOPs/s, more is better. SE +/- 0.000, N = 3.
4080 super a, b, c, d: 0.861, 0.864, 0.864, 0.864
RTX 4060, e, f, g: 0.264, 0.264, 0.264, 0.264
4070 TI SUPER, h, i: 0.725, 0.725, 0.725
4090, j, k, l, m: 1.389, 1.389, 1.389, 1.389, 1.389
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, more is better. SE +/- 0.000, N = 3.
4080 super a, b, c, d: 27.629, 27.581, 27.630, 27.630
RTX 4060, e, f, g: 8.491, 8.491, 8.491, 8.491
4070 TI SUPER, h, i: 23.186, 23.207, 23.202
4090, j, k, l, m: 44.411, 44.329, 44.398, 44.348, 44.330
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

GpuOwl

Exponent: 332220523

GpuOwl 7.5. Iterations / Second, more is better. SE +/- 0.00, N = 3.
4080 super a, b, c, d: 189.68, 189.83, 189.61, 189.83
RTX 4060, e, f, g: 58.85, 58.85, 58.58, 58.85
4070 TI SUPER, h, i: 159.82, 159.87, 159.82
4090, j, k, l, m: 304.51, 304.79, 304.51, 304.60, 305.34
(CXX) g++ options: -O3 -lgmp -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TFLOPs/s, more is better. SE +/- 0.00, N = 3.
4080 super a, b, c, d: 53.50, 53.49, 53.63, 53.66
RTX 4060, e, f, g: 16.51, 16.51, 16.51, 16.51
4070 TI SUPER, h, i: 45.07, 45.07, 45.07
4090, j, k, l, m: 85.83, 85.87, 85.80, 85.85, 85.81
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

GpuOwl

Exponent: 77936867

GpuOwl 7.5. Iterations / Second, more is better. SE +/- 0.00, N = 3.
4080 super a, b, c, d: 896.06, 896.06, 894.45, 896.06
RTX 4060, e, f, g: 278.55, 278.55, 277.32, 278.55
4070 TI SUPER, h, i: 744.60, 745.16, 745.16
4090, j, k, l, m: 1432.66, 1440.92, 1434.72, 1434.72, 1440.92
(CXX) g++ options: -O3 -lgmp -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, more is better. SE +/- 0.001, N = 3.
4080 super a, b, c, d: 23.984, 23.978, 23.984, 23.982
RTX 4060, e, f, g: 7.414, 7.417, 7.417, 7.419
4070 TI SUPER, h, i: 20.217, 20.212, 20.198
4090, j, k, l, m: 38.335, 38.361, 38.518, 38.361, 38.341
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

GpuOwl

Exponent: 57885161

GpuOwl 7.5. Iterations / Second, more is better. SE +/- 0.00, N = 3.
4080 super a, b, c, d: 1191.90, 1190.48, 1191.90, 1191.90
RTX 4060, e, f, g: 376.65, 378.36, 376.65, 376.65
4070 TI SUPER, h, i: 1003.01, 1004.02, 1003.01
4090, j, k, l, m: 1926.78, 1937.98, 1937.98, 1926.78, 1934.24
(CXX) g++ options: -O3 -lgmp -lOpenCL

Libplacebo

Test: deband_heavy

Libplacebo 6.338.2. FPS, more is better. SE +/- 0.15, N = 3.
4080 super a, b, c, d: 1529.33, 1527.65, 1528.48, 1530.76
RTX 4060, e, f, g: 514.76, 517.52, 514.52, 514.70
4070 TI SUPER, h, i: 1323.80, 1324.36, 1324.84
4090, j, k, l, m: 2294.94, 2270.65, 2293.54, 2295.23, 2293.81
(CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: polar_nocompute

Libplacebo 6.338.2. FPS, more is better. SE +/- 0.04, N = 3.
4080 super a, b, c, d: 1919.91, 1918.79, 1918.63, 1924.72
RTX 4060, e, f, g: 655.96, 659.34, 655.72, 656.20
4070 TI SUPER, h, i: 1672.46, 1671.91, 1672.08
4090, j, k, l, m: 2848.16, 2845.61, 2856.18, 2851.23, 2855.33
(CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 8.95, N = 3.
4080 super a, b, c, d: 31464, 31775, 34346, 34356
RTX 4060, e, f, g: 12025, 12070, 12096, 12103
4070 TI SUPER, h, i: 30677, 23199, 24125
4090, j, k, l, m: 37812, 37573, 46461, 50684, 45665
(CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

ProjectPhysX OpenCL-Benchmark 1.2. GB/s, more is better. SE +/- 0.01, N = 3.
4080 super a, b, c, d: 680.51, 680.56, 680.77, 680.67
RTX 4060, e, f, g: 252.90, 252.89, 252.89, 252.89
4070 TI SUPER, h, i: 619.31, 619.28, 619.29
4090, j, k, l, m: 927.96, 927.85, 928.04, 927.98, 928.10
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 0.67, N = 3.
4080 super a, b, c, d: 106172, 106348, 106208, 106300
RTX 4060, e, f, g: 42347, 42338, 42341, 42345
4070 TI SUPER, h, i: 102606, 102692, 102682
4090, j, k, l, m: 151611, 149674, 150527, 150378, 151234
(CXX) g++ options: -O3

FluidX3D

Test: FP32-FP32

FluidX3D 2.9. MLUPs/s, more is better. SE +/- 0.00, N = 3.
4080 super a, b, c, d: 3972, 3968, 3967, 3967
RTX 4060, e, f, g: 1609, 1609, 1608, 1609
4070 TI SUPER, h, i: 3831, 3830, 3830
4090, j, k, l, m: 5736, 5733, 5737, 5739, 5737

FluidX3D

Test: FP32-FP16C

FluidX3D 2.9. MLUPs/s, more is better. SE +/- 0.33, N = 3.
4080 super a, b, c, d: 8127, 8121, 8145, 8128
RTX 4060, e, f, g: 3114, 3113, 3114, 3114
4070 TI SUPER, h, i: 7420, 7420, 7421
4090, j, k, l, m: 11063, 10882, 10985, 10982, 10983

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 2.03, N = 3.
4080 super a, b, c, d: 107604, 107741, 107838, 107754
RTX 4060, e, f, g: 43062, 43067, 43079, 43071
4070 TI SUPER, h, i: 104271, 104294, 104266
4090, j, k, l, m: 152245, 152220, 152066, 152986, 152451
(CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

ProjectPhysX OpenCL-Benchmark 1.2. GB/s, more is better. SE +/- 0.01, N = 3.
4080 super a, b, c, d: 632.69, 631.65, 629.88, 629.10
RTX 4060, e, f, g: 258.47, 258.40, 258.40, 258.36
4070 TI SUPER, h, i: 607.15, 606.59, 607.19
4090, j, k, l, m: 896.46, 902.58, 902.92, 903.86, 902.16
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 2.40, N = 3.
4080 super a, b, c, d: 5686, 5706, 5712, 5687
RTX 4060, e, f, g: 2371, 2373, 2385, 2373
4070 TI SUPER, h, i: 4850, 4999, 4983
4090, j, k, l, m: 8183, 8211, 8223, 8210, 8208
(CXX) g++ options: -O3

FluidX3D

Test: FP32-FP16S

FluidX3D 2.9. MLUPs/s, more is better. SE +/- 0.33, N = 3.
4080 super a, b, c, d: 8077, 8079, 8079, 8078
RTX 4060, e, f, g: 3038, 3039, 3039, 3039
4070 TI SUPER, h, i: 6678, 6681, 6683
4090, j, k, l, m: 9906, 9785, 9806, 9800, 9809

Libplacebo

Test: gaussian

Libplacebo 6.338.2. FPS, more is better. SE +/- 0.37, N = 3.
4080 super a, b, c, d: 3407.97, 3411.13, 3413.41, 3412.08
RTX 4060, e, f, g: 1512.45, 1512.65, 1511.72, 1511.91
4070 TI SUPER, h, i: 3118.52, 3120.65, 3120.90
4090, j, k, l, m: 4406.25, 4364.76, 4407.68, 4405.21, 4407.96
(CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

Libplacebo 6.338.2. FPS, more is better. SE +/- 19.72, N = 3.
4080 super a, b, c, d: 3002.98, 3260.62, 3184.60, 3251.95
RTX 4060, e, f, g: 1570.77, 1546.47, 1506.93, 1503.72
4070 TI SUPER, h, i: 2874.75, 2882.56, 2872.21
4090, j, k, l, m: 3880.87, 3586.11, 3990.18, 3616.57, 4000.04
(CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 194.61, N = 3.
4080 super a, b, c, d: 143183, 146963, 139429, 147059
RTX 4060, e, f, g: 82823, 82725, 82997, 83065
4070 TI SUPER, h, i: 138716, 127806, 141164
4090, j, k, l, m: 214165, 214418, 197629, 205813, 206803
(CXX) g++ options: -O3

Libplacebo

Test: hdr_lut

Libplacebo 6.338.2. FPS, more is better. SE +/- 0.11, N = 3.
4080 super a, b, c, d: 2858.68, 2858.87, 2856.35, 2860.74
RTX 4060, e, f, g: 1525.07, 1525.75, 1522.30, 1526.73
4070 TI SUPER, h, i: 2789.69, 2789.73, 2772.40
4090, j, k, l, m: 3542.89, 3546.09, 3545.36, 3536.98, 3543.51
(CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 51.67, N = 3.
4080 super a, b, c, d: 73948, 75029, 68642, 74674
RTX 4060, e, f, g: 37049, 38684, 38064, 38609
4070 TI SUPER, h, i: 66454, 67587, 68137
4090, j, k, l, m: 80489, 85349, 80257, 84200, 74587
(CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT R2C / C2R

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 24.98, N = 3.
4080 super a, b, c, d: 67257, 63956, 63887, 67647
RTX 4060, e, f, g: 35601, 35201, 35167, 35219
4070 TI SUPER, h, i: 67558, 66849, 66874
4090, j, k, l, m: 79195, 76372, 71755, 78780, 75912
(CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

ProjectPhysX OpenCL-Benchmark 1.2. TIOPs/s, more is better. SE +/- 0.002, N = 3.
4080 super a, b, c, d: 4.237, 4.252, 4.239, 4.239
RTX 4060, e, f, g: 2.087, 2.094, 2.089, 2.091
4070 TI SUPER, h, i: 4.280, 4.272, 4.308
4090, j, k, l, m: 4.346, 4.344, 4.347, 4.349, 4.349
(CXX) g++ options: -std=c++17 -pthread -lOpenCL

Libplacebo

Test: av1_grain_lap

Libplacebo 6.338.2. FPS, more is better. SE +/- 0.40, N = 3.
4080 super a, b, c, d: 3475.58, 3476.35, 3476.23, 3189.61
RTX 4060, e, f, g: 1723.15, 1725.79, 1722.45, 1724.71
4070 TI SUPER, h, i: 3538.35, 3516.56, 3487.01
4090, j, k, l, m: 3470.09, 3490.13, 3503.94, 3462.39, 3453.74
(CXX) g++ options: -fvisibility=hidden -std=c++20 -O2 -fno-math-errno -fPIC -pthread -MD -MQ -MF

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

VkFFT 1.3.4. Benchmark Score, more is better. SE +/- 43.25, N = 3.
4080 super a, b, c, d: 16388, 16963, 16838, 16831
RTX 4060, e, f, g: 10715, 10635, 10593, 10461
4070 TI SUPER, h, i: 14512, 16243, 16357
4090, j, k, l, m: 16151, 19398, 18205, 18369, 18619
(CXX) g++ options: -O3


Phoronix Test Suite v10.8.4