docker testing on Ubuntu 20.04.4 LTS via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
    phoronix-test-suite benchmark 2307179-NE-EGEO015GP80
HTML result view exported from: https://openbenchmarking.org/result/2307179-NE-EGEO015GP80&export=txt&sor&grw
egeo-015-gpu

System details (egeo-015-gpu.conf):
  Processor:          2 x Intel Xeon Silver 4208 @ 3.20GHz (16 Cores / 32 Threads)
  Motherboard:        Dell 0DY2X0 (2.16.2 BIOS)
  Memory:             64GB
  Disk:               2000GB TOSHIBA DT01ACA2
  Graphics:           NVIDIA Quadro RTX 5000 15GB
  OS:                 Ubuntu 20.04.4 LTS
  Kernel:             5.10.0-23-amd64 (x86_64)
  Display Driver:     NVIDIA
  Vulkan:             1.1.182
  Compiler:           GCC 9.4.0
  File-System:        ext4
  Screen Resolution:  1024x768
  System Layer:       docker
  OpenCL:             OpenCL 3.0 CUDA 12.0.151

System details (egeo-015-gpu_1.conf, differing fields only):
  Compiler:           GCC 9.4.0 + CUDA 10.1

Kernel Details:
  Transparent Huge Pages: always

Compiler Details:
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details:
  egeo-015-gpu.conf:    Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003302
  egeo-015-gpu_1.conf:  Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x5003302

Graphics Details:
  BAR1 / Visible vRAM Size: 256 MiB

Python Details:
  Python 3.8.10

Security Details:
  itlb_multihit:      KVM: Mitigation of VMX disabled
  l1tf:               Not affected
  mds:                Not affected
  meltdown:           Not affected
  mmio_stale_data:    Mitigation of Clear buffers; SMT vulnerable
  retbleed:           Mitigation of Enhanced IBRS
  spec_store_bypass:  Mitigation of SSB disabled via prctl and seccomp
  spectre_v1:         Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2:         Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence
  srbds:              Not affected
  tsx_async_abort:    Mitigation of TSX disabled
egeo-015-gpu: result overview

All values listed are for egeo-015-gpu_1.conf; no values are listed for egeo-015-gpu.conf.

  Test                                              egeo-015-gpu_1.conf
  shoc: OpenCL - Bus Speed Download                 8.1957
  shoc: OpenCL - Bus Speed Readback                 9.3123
  ncnn: Vulkan GPU - mobilenet                      622.66
  ncnn: Vulkan GPU-v2-v2 - mobilenet-v2             201.11
  ncnn: Vulkan GPU-v3-v3 - mobilenet-v3             186.57
  ncnn: Vulkan GPU - shufflenet-v2                  126.85
  ncnn: Vulkan GPU - mnasnet                        208.00
  ncnn: Vulkan GPU - efficientnet-b0                336.65
  ncnn: Vulkan GPU - blazeface                      55.28
  ncnn: Vulkan GPU - googlenet                      560.47
  ncnn: Vulkan GPU - vgg16                          2955.35
  ncnn: Vulkan GPU - resnet18                       517.81
  ncnn: Vulkan GPU - alexnet                        555.82
  ncnn: Vulkan GPU - resnet50                       1260.25
  ncnn: Vulkan GPU - yolov4-tiny                    930.24
  ncnn: Vulkan GPU - squeezenet_ssd                 639.69
  ncnn: Vulkan GPU - regnety_400m                   261.12
  ncnn: Vulkan GPU - vision_transformer             7630.70
  ncnn: Vulkan GPU - FastestDet                     133.94
  blender: BMW27 - NVIDIA OptiX                     15.25
  blender: Classroom - NVIDIA OptiX                 36.99
  blender: Fishy Cat - NVIDIA OptiX                 25.73
  blender: Barbershop - NVIDIA OptiX                144.62
  blender: Pabellon Barcelona - NVIDIA OptiX        46.92
  neatbench: GPU                                    32.1
  luxcorerender: DLSC - GPU                         4.36
  luxcorerender: Rainbow Colors and Prism - GPU     7.93
  hashcat: MD5                                      35015366667
  hashcat: SHA1                                     12259133333
  hashcat: 7-Zip                                    546133
  hashcat: SHA-512                                  1746666667
  hashcat: TrueCrypt RIPEMD160 + XTS                438933
  mixbench: NVIDIA CUDA - Integer                   10231.17
  mixbench: NVIDIA CUDA - Half Precision            22951.08
  mixbench: NVIDIA CUDA - Double Precision          307.92
  mixbench: NVIDIA CUDA - Single Precision          11495.48
  financebench: Black-Scholes OpenCL                0.506
  viennacl: CPU BLAS - sCOPY                        54.9
  viennacl: CPU BLAS - sAXPY                        83.3
  viennacl: CPU BLAS - sDOT                         89.5
  viennacl: CPU BLAS - dCOPY                        43.5
  viennacl: CPU BLAS - dAXPY                        65.3
  viennacl: CPU BLAS - dDOT                         71.3
  viennacl: CPU BLAS - dGEMV-N                      68.8
  viennacl: CPU BLAS - dGEMV-T                      80.6
  viennacl: CPU BLAS - dGEMM-NN                     39.8
  viennacl: CPU BLAS - dGEMM-NT                     37.9
  viennacl: CPU BLAS - dGEMM-TN                     40.7
  viennacl: CPU BLAS - dGEMM-TT                     39.6
  viennacl: OpenCL BLAS                             (no value listed)
SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL. Unit: GB/s, More Is Better. System: egeo-015-gpu_1.conf
(CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

  Benchmark            Result   SE +/-   N
  Bus Speed Download   8.1957   0.0365   3
  Bus Speed Readback   9.3123   0.0735   10
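
The standard errors above are easier to compare across tests when expressed relative to the mean. The following is a minimal sketch of that arithmetic using the two SHOC results from this file. The helper name `relative_se_percent` and the 1% "noisy" threshold are illustrative assumptions, not something the Phoronix Test Suite defines.

```python
# Relative standard error (SE as a percentage of the mean).
# The (mean, SE) pairs are copied from the SHOC results in this file;
# the helper name and the 1% threshold are illustrative assumptions.

def relative_se_percent(mean: float, se: float) -> float:
    """Return the standard error as a percentage of the mean."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return 100.0 * se / abs(mean)


shoc_results = {
    "Bus Speed Download": (8.1957, 0.0365),
    "Bus Speed Readback": (9.3123, 0.0735),
}

for name, (mean, se) in shoc_results.items():
    pct = relative_se_percent(mean, se)
    label = "noisy" if pct > 1.0 else "stable"  # hypothetical cut-off
    print(f"{name}: {pct:.2f}% of mean ({label})")
```

The same function can be applied to any other row in this file that reports both a result and an SE value.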
NCNN 20220729
Unit: ms, Fewer Is Better. System: egeo-015-gpu_1.conf
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

  Target             Model                Result    SE +/-   N   MIN       MAX
  Vulkan GPU         mobilenet            622.66    1.29     3   592.6     667.14
  Vulkan GPU-v2-v2   mobilenet-v2         201.11    0.30     3   199.81    209.73
  Vulkan GPU-v3-v3   mobilenet-v3         186.57    0.36     3   184.92    193.59
  Vulkan GPU         shufflenet-v2        126.85    0.70     3   124.67    132.7
  Vulkan GPU         mnasnet              208.00    0.28     3   206.86    215.85
  Vulkan GPU         efficientnet-b0      336.65    0.63     3   334.17    352.45
  Vulkan GPU         blazeface            55.28     0.08     3   46.28     65.32
  Vulkan GPU         googlenet            560.47    2.85     3   554.54    624.9
  Vulkan GPU         vgg16                2955.35   4.93     3   2945.27   2991.56
  Vulkan GPU         resnet18             517.81    3.53     3   511.99    544.76
  Vulkan GPU         alexnet              555.82    2.15     3   551.5     586.03
  Vulkan GPU         resnet50             1260.25   8.10     3   1249.21   1318.89
  Vulkan GPU         yolov4-tiny          930.24    0.59     3   910.08    995.85
  Vulkan GPU         squeezenet_ssd       639.69    7.48     3   605.02    702.34
  Vulkan GPU         regnety_400m         261.12    0.05     3   258.8     272.05
  Vulkan GPU         vision_transformer   7630.70   32.42    3   7421.11   8206.57
  Vulkan GPU         FastestDet           133.94    0.30     3   132.45    139.32
Blender 3.6
Compute: NVIDIA OptiX. Unit: Seconds, Fewer Is Better. System: egeo-015-gpu_1.conf

  Blend File           Result   SE +/-   N
  BMW27                15.25    0.14     13
  Classroom            36.99    0.00     3
  Fishy Cat            25.73    0.14     15
  Barbershop           144.62   0.11     3
  Pabellon Barcelona   46.92    0.02     3
NeatBench 5
Acceleration: GPU. Unit: FPS, More Is Better. System: egeo-015-gpu_1.conf

  Result   SE +/-   N
  32.1     0.00     3
LuxCoreRender 2.6
Acceleration: GPU. Unit: M samples/sec, More Is Better. System: egeo-015-gpu_1.conf

  Scene                      Result   SE +/-   N   MIN    MAX
  DLSC                       4.36     0.01     3   3.37   4.49
  Rainbow Colors and Prism   7.93     0.06     3   5.02   9.08
Hashcat 6.2.4
Unit: H/s, More Is Better. System: egeo-015-gpu_1.conf

  Benchmark                   Result        SE +/-         N
  MD5                         35015366667   174159020.57   3
  SHA1                        12259133333   52192634.64    3
  7-Zip                       546133        896.91         3
  SHA-512                     1746666667    10325427.08    3
  TrueCrypt RIPEMD160 + XTS   438933        592.55         3
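
The Hashcat results span five orders of magnitude in raw H/s, which makes them hard to read at a glance. As a rough sketch, the raw values can be rescaled with decimal prefixes (kH/s, MH/s, GH/s). The function name `format_hash_rate` is a hypothetical helper written for this example; it is not part of Hashcat or the Phoronix Test Suite.

```python
# Rescale raw H/s figures to decimal-prefixed units for readability.
# Input values are copied from the Hashcat results in this file;
# format_hash_rate is a hypothetical helper, not a Hashcat API.

UNITS = ["H/s", "kH/s", "MH/s", "GH/s", "TH/s"]


def format_hash_rate(hashes_per_second: float) -> str:
    """Format a hash rate using the largest unit that keeps the value below 1000."""
    value = float(hashes_per_second)
    for unit in UNITS[:-1]:
        if value < 1000.0:
            return f"{value:.2f} {unit}"
        value /= 1000.0
    return f"{value:.2f} {UNITS[-1]}"


hashcat_results = {
    "MD5": 35015366667,
    "SHA1": 12259133333,
    "7-Zip": 546133,
    "SHA-512": 1746666667,
    "TrueCrypt RIPEMD160 + XTS": 438933,
}

for name, rate in hashcat_results.items():
    print(f"{name}: {format_hash_rate(rate)}")
```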
Mixbench 2020-06-23
Backend: NVIDIA CUDA. More Is Better. System: egeo-015-gpu_1.conf
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

  Benchmark          Unit     Result     SE +/-   N
  Integer            GIOPS    10231.17   21.26    3
  Half Precision     GFLOPS   22951.08   8.86     3
  Double Precision   GFLOPS   307.92     0.00     3
  Single Precision   GFLOPS   11495.48   0.14     3
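
The floating-point Mixbench results are most informative as ratios between precisions. A small sketch of that arithmetic, using only the GFLOPS values reported above, follows. The dictionary keys and the helper `throughput_ratio` are illustrative names chosen for this example.

```python
# Throughput ratios between the Mixbench CUDA floating-point results.
# Values (GFLOPS) are copied from this result file; the dictionary keys
# and helper name are illustrative, not part of Mixbench or the
# Phoronix Test Suite.

MIXBENCH_GFLOPS = {
    "half": 22951.08,
    "single": 11495.48,
    "double": 307.92,
}


def throughput_ratio(results: dict, numerator: str, denominator: str) -> float:
    """Return how many times faster `numerator` is than `denominator`."""
    return results[numerator] / results[denominator]


half_vs_single = throughput_ratio(MIXBENCH_GFLOPS, "half", "single")
single_vs_double = throughput_ratio(MIXBENCH_GFLOPS, "single", "double")

print(f"half / single:   {half_vs_single:.2f}x")
print(f"single / double: {single_vs_double:.2f}x")
```

On these numbers, half precision runs at roughly twice the single-precision rate, and single precision at roughly 37 times the double-precision rate.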
FinanceBench 2016-07-25
Benchmark: Black-Scholes OpenCL. Unit: ms, Fewer Is Better. System: egeo-015-gpu_1.conf
(CXX) g++ options: -O3 -march=native -fopenmp

  Result   SE +/-   N
  0.506    0.000    3
ViennaCL 1.7.1
Test: CPU BLAS. More Is Better. System: egeo-015-gpu_1.conf
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

  Test       Unit       Result   SE +/-   N
  sCOPY      GB/s       54.9     0.56     3
  sAXPY      GB/s       83.3     0.59     3
  sDOT       GB/s       89.5     1.18     3
  dCOPY      GB/s       43.5     0.26     3
  dAXPY      GB/s       65.3     0.60     3
  dDOT       GB/s       71.3     1.05     3
  dGEMV-N    GB/s       68.8     0.75     3
  dGEMV-T    GB/s       80.6     0.56     3
  dGEMM-NN   GFLOPs/s   39.8     0.06     3
  dGEMM-NT   GFLOPs/s   37.9     0.26     3
  dGEMM-TN   GFLOPs/s   40.7     0.18     3
  dGEMM-TT   GFLOPs/s   39.6     0.05     2
Phoronix Test Suite v10.8.4