Latitude System 1/5 Test

2 x AMD EPYC 9354 32-Core testing with a Supermicro H13DSG-O-CPU v1.10 (1.5 BIOS) and ASPEED 80GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310102-NE-LATITUDES95&grr.

Latitude System 1/5 TestProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionSYS01 Full Test2 x AMD EPYC 9354 32-Core @ 3.25GHz (64 Cores / 128 Threads)Supermicro H13DSG-O-CPU v1.10 (1.5 BIOS)AMD Device 14a424 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N4 x 3841GB SAMSUNG MZQL23T8HCLS-00A07ASPEED 80GB (210/1512MHz)2 x Intel X710 for 10GBASE-TUbuntu 22.046.2.0-34-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.4NVIDIA 535.113.014.5 Mesa 23.0.4-0ubuntu1~22.04.1 (LLVM 15.0.7 256 bits)OpenCL 3.0 CUDA 12.2.1461.3.242GCC 11.4.0ext41024x768OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa10113e - BAR1 / Visible vRAM Size: 131072 MiB - vBIOS Version: 92.00.68.00.01- GPU Compute Cores: 6912- Python 3.10.12- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Latitude System 1/5 Testindigobench: OpenCL GPU - Bedroomcaffe: GoogleNet - CPU - 1000lczero: BLASncnn: Vulkan GPU - FastestDetncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetcaffe: AlexNet - CPU - 1000lczero: Eigenlczero: OpenCLluxcorerender: Orange Juice - CPUluxcorerender: DLSC - CPUluxcorerender: DLSC - GPUcaffe: GoogleNet - CPU - 200indigobench: OpenCL GPU - Supercarluxcorerender: Danish Mood - GPUncnn: CPU - efficientnet-b0ncnn: CPU - FastestDetncnn: CPU - vision_transformerncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetoctanebench: Total Scorecaffe: GoogleNet - CPU - 100luxcorerender: LuxCore Benchmark - GPUfahbench: rodinia: OpenMP Leukocyterodinia: OpenMP HotSpot3Dnamd-cuda: ATPase Simulation - 327,506 Atomscaffe: AlexNet - CPU - 200shoc: OpenCL - Max SP Flopsarrayfire: BLAS CPUluxcorerender: Orange Juice - GPUhashcat: SHA1indigobench: CPU - Bedroomhashcat: MD5luxcorerender: Danish Mood - CPUindigobench: CPU - Supercarluxcorerender: LuxCore Benchmark - CPUfinancebench: Bonds OpenMPcaffe: AlexNet - CPU - 100luxcorerender: Rainbow Colors and Prism - CPUfinancebench: Repo OpenMProdinia: OpenMP LavaMDrodinia: OpenCL Myocytemandelgpu: CPU+GPUclpeak: Transfer Bandwidth enqueueWriteBuffergromacs: MPI CPU - water_GMX50_bareclpeak: Transfer Bandwidth enqueueReadBufferhashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTShashcat: 7-Zipviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sCOPYviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYblender: BMW27 - CPU-Onlyarrayfire: BLAS OpenCLarrayfire: Conjugate Gradient CPUshoc: OpenCL - Texture Read Bandwidthvkresample: 2x - Singlefinancebench: Monte-Carlo OpenCLrodinia: OpenMP CFD Solverarrayfire: Conjugate Gradient OpenCLluxcorerender: Rainbow Colors and Prism - GPUrodinia: OpenCL Particle Filterrodinia: OpenMP Streamclustershoc: OpenCL - S3Dshoc: OpenCL - Bus Speed Readbackclpeak: Single-Precision Computeclpeak: Global Memory Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writeshoc: OpenCL - FFT SPshoc: OpenCL - Reductionshoc: OpenCL - Triadclpeak: Integer 24-bit Computeclpeak: Integer Computeclpeak: Double-Precision Computeshoc: OpenCL - GEMM SGEMM_Nclpeak: Kernel Latencyshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - MD5 Hashfinancebench: Black-Scholes OpenCLredshift: SYS01 Full Test22.1531221397787619.7157.8954.4527.3734.5125.189.3815.2942.9627.817.4018.0111.8817.4614.1312.8622.904537418460567815.2911.1576.0524673949.90347.8218.0418.9557.7753.5625.3835.1824.289.1814.7039.8826.556.9411.3016.6713.9712.9021.54516.55984512561239.23263.121037.55474.8130.023479188019428.81337.7160.109883682500013.8903059141000006.5030.4496.5447810.0703134633024.7330890.75911535.49729.817116018825.922.309.62422.312244006666765591009608167428042374667425724568.443857544122531323311512110511081576.71008222717905361950139020.049644.2525.281581.666.947103.2580018.2772.611254.593.8366.173827.58727.124119346.291494.78235796.51402.44438.11241.63926.071519300.8419264.199719.8013584.95.3826.862042.97161.090OpenBenchmarking.org

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomSYS01 Full Test510152025SE +/- 0.17, N = 322.15

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 1000SYS01 Full Test300K600K900K1200K1500KSE +/- 5008.83, N = 312213971. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASSYS01 Full Test2K4K6K8K10KSE +/- 96.28, N = 978761. (CXX) g++ options: -flto -pthread

NCNN

Target: Vulkan GPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: FastestDetSYS01 Full Test510152025SE +/- 0.47, N = 1219.71MIN: 17.07 / MAX: 611.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vision_transformerSYS01 Full Test1326395265SE +/- 0.91, N = 1257.89MIN: 49.87 / MAX: 1053.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: regnety_400mSYS01 Full Test1224364860SE +/- 0.38, N = 1254.45MIN: 51.03 / MAX: 184.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: squeezenet_ssdSYS01 Full Test612182430SE +/- 0.48, N = 1227.37MIN: 24.43 / MAX: 139.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: yolov4-tinySYS01 Full Test816243240SE +/- 0.56, N = 1234.51MIN: 27.91 / MAX: 830.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet50SYS01 Full Test612182430SE +/- 0.35, N = 1225.18MIN: 21.4 / MAX: 379.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: alexnetSYS01 Full Test3691215SE +/- 0.14, N = 129.38MIN: 8.39 / MAX: 147.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet18SYS01 Full Test48121620SE +/- 0.30, N = 1215.29MIN: 13.56 / MAX: 113.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vgg16SYS01 Full Test1020304050SE +/- 0.82, N = 1242.96MIN: 36.28 / MAX: 149.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: googlenetSYS01 Full Test714212835SE +/- 0.50, N = 1227.81MIN: 23.45 / MAX: 137.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: blazefaceSYS01 Full Test246810SE +/- 0.11, N = 127.40MIN: 6.79 / MAX: 170.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: efficientnet-b0SYS01 Full Test48121620SE +/- 0.20, N = 1218.01MIN: 16.34 / MAX: 140.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mnasnetSYS01 Full Test3691215SE +/- 0.14, N = 1211.88MIN: 10.88 / MAX: 131.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: shufflenet-v2SYS01 Full Test48121620SE +/- 0.13, N = 1217.46MIN: 16.33 / MAX: 145.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3SYS01 Full Test48121620SE +/- 0.11, N = 1214.13MIN: 12.31 / MAX: 138.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2SYS01 Full Test3691215SE +/- 0.05, N = 1212.86MIN: 11.91 / MAX: 103.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mobilenetSYS01 Full Test510152025SE +/- 0.32, N = 1222.90MIN: 20.32 / MAX: 166.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1000SYS01 Full Test100K200K300K400K500KSE +/- 456.94, N = 34537411. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenSYS01 Full Test2K4K6K8K10KSE +/- 70.63, N = 384601. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLSYS01 Full Test12002400360048006000SE +/- 76.93, N = 356781. (CXX) g++ options: -flto -pthread

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPUSYS01 Full Test48121620SE +/- 0.11, N = 1515.29MIN: 12.88 / MAX: 19.25

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUSYS01 Full Test3691215SE +/- 0.16, N = 1511.15MIN: 10.28 / MAX: 13.37

LuxCoreRender

Scene: DLSC - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPUSYS01 Full Test20406080100SE +/- 6.91, N = 1276.05MAX: 84.61

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 200SYS01 Full Test50K100K150K200K250KSE +/- 546.79, N = 32467391. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarSYS01 Full Test1122334455SE +/- 0.61, N = 349.90

LuxCoreRender

Scene: Danish Mood - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPUSYS01 Full Test1122334455SE +/- 0.58, N = 1547.82MIN: 15.56 / MAX: 66.11

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0SYS01 Full Test48121620SE +/- 0.29, N = 218.04MIN: 17.37 / MAX: 33.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetSYS01 Full Test510152025SE +/- 0.67, N = 318.95MIN: 17.53 / MAX: 79.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerSYS01 Full Test1326395265SE +/- 3.08, N = 357.77MIN: 51.81 / MAX: 962.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mSYS01 Full Test1224364860SE +/- 0.64, N = 353.56MIN: 50.47 / MAX: 136.621. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssdSYS01 Full Test612182430SE +/- 0.29, N = 325.38MIN: 22.75 / MAX: 117.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinySYS01 Full Test816243240SE +/- 1.81, N = 335.18MIN: 31.03 / MAX: 125.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50SYS01 Full Test612182430SE +/- 0.40, N = 324.28MIN: 21.91 / MAX: 253.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetSYS01 Full Test3691215SE +/- 0.08, N = 39.18MIN: 8.69 / MAX: 77.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18SYS01 Full Test48121620SE +/- 0.33, N = 314.70MIN: 14.01 / MAX: 172.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16SYS01 Full Test918273645SE +/- 1.57, N = 339.88MIN: 37.18 / MAX: 124.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetSYS01 Full Test612182430SE +/- 0.36, N = 326.55MIN: 23.53 / MAX: 140.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefaceSYS01 Full Test246810SE +/- 0.04, N = 36.94MIN: 6.76 / MAX: 8.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetSYS01 Full Test3691215SE +/- 0.42, N = 311.30MIN: 10.51 / MAX: 91.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2SYS01 Full Test48121620SE +/- 0.32, N = 316.67MIN: 15.93 / MAX: 24.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3SYS01 Full Test48121620SE +/- 0.08, N = 313.97MIN: 13.55 / MAX: 15.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2SYS01 Full Test3691215SE +/- 0.09, N = 312.90MIN: 12.19 / MAX: 86.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetSYS01 Full Test510152025SE +/- 0.12, N = 321.54MIN: 20.26 / MAX: 130.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total ScoreSYS01 Full Test110220330440550516.56

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 100SYS01 Full Test30K60K90K120K150KSE +/- 315.86, N = 31256121. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPUSYS01 Full Test918273645SE +/- 3.59, N = 1239.23MAX: 64.43

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2SYS01 Full Test60120180240300SE +/- 0.75, N = 3263.12

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteSYS01 Full Test918273645SE +/- 0.31, N = 837.551. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DSYS01 Full Test20406080100SE +/- 0.87, N = 474.811. (CXX) g++ options: -O2 -lOpenCL

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsSYS01 Full Test0.00530.01060.01590.02120.0265SE +/- 0.00009, N = 30.02347

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 200SYS01 Full Test20K40K60K80K100KSE +/- 478.91, N = 3918801. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsSYS01 Full Test4K8K12K16K20KSE +/- 1.53, N = 319428.81. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ArrayFire

Test: BLAS CPU

OpenBenchmarking.orgGFLOPS, More Is BetterArrayFire 3.7Test: BLAS CPUSYS01 Full Test30060090012001500SE +/- 95.37, N = 151337.711. (CXX) g++ options: -rdynamic

LuxCoreRender

Scene: Orange Juice - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPUSYS01 Full Test1326395265SE +/- 0.33, N = 360.10MIN: 52.34 / MAX: 78.16

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1SYS01 Full Test20000M40000M60000M80000M100000MSE +/- 19826623117.19, N = 1698836825000

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomSYS01 Full Test48121620SE +/- 0.03, N = 313.89

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5SYS01 Full Test70000M140000M210000M280000M350000MSE +/- 61357328264.01, N = 16305914100000

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUSYS01 Full Test246810SE +/- 0.06, N = 36.50MIN: 2.82 / MAX: 7.62

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarSYS01 Full Test714212835SE +/- 0.04, N = 330.45

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPUSYS01 Full Test246810SE +/- 0.04, N = 36.54MIN: 2.32 / MAX: 7.7

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPSYS01 Full Test10K20K30K40K50KSE +/- 541.76, N = 347810.071. (CXX) g++ options: -O3 -march=native -fopenmp

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 100SYS01 Full Test10K20K30K40K50KSE +/- 43.65, N = 3463301. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPUSYS01 Full Test612182430SE +/- 0.61, N = 1524.73MIN: 22.72 / MAX: 28.52

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPSYS01 Full Test7K14K21K28K35KSE +/- 62.00, N = 330890.761. (CXX) g++ options: -O3 -march=native -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDSYS01 Full Test816243240SE +/- 0.06, N = 335.501. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteSYS01 Full Test714212835SE +/- 0.20, N = 329.821. (CXX) g++ options: -O2 -lOpenCL

MandelGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: CPU+GPUSYS01 Full Test20M40M60M80M100MSE +/- 3830318.56, N = 15116018825.91. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferSYS01 Full Test510152025SE +/- 0.03, N = 322.301. (CXX) g++ options: -O3

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_bareSYS01 Full Test3691215SE +/- 0.009, N = 39.6241. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferSYS01 Full Test510152025SE +/- 0.04, N = 322.311. (CXX) g++ options: -O3

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512SYS01 Full Test5000M10000M15000M20000M25000MSE +/- 4139779.92, N = 322440066667

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSSYS01 Full Test1.4M2.8M4.2M5.6M7MSE +/- 709.46, N = 36559100

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipSYS01 Full Test2M4M6M8M10MSE +/- 13528.04, N = 39608167

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTSYS01 Full Test9001800270036004500SE +/- 0.00, N = 342801. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNSYS01 Full Test9001800270036004500SE +/- 3.33, N = 342371. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTSYS01 Full Test10002000300040005000SE +/- 3.33, N = 346671. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNSYS01 Full Test9001800270036004500SE +/- 3.33, N = 342571. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TSYS01 Full Test50100150200250SE +/- 0.00, N = 32451. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NSYS01 Full Test1530456075SE +/- 0.03, N = 368.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTSYS01 Full Test90180270360450SE +/- 0.00, N = 34381. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYSYS01 Full Test120240360480600SE +/- 0.00, N = 35751. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYSYS01 Full Test100200300400500SE +/- 0.33, N = 34411. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTSYS01 Full Test50100150200250SE +/- 0.00, N = 32251. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYSYS01 Full Test70140210280350SE +/- 0.00, N = 33131. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYSYS01 Full Test50100150200250SE +/- 0.00, N = 32331. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTSYS01 Full Test306090120150SE +/- 0.58, N = 31151. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNSYS01 Full Test306090120150SE +/- 2.33, N = 31211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTSYS01 Full Test20406080100SE +/- 1.76, N = 31051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNSYS01 Full Test20406080100SE +/- 2.40, N = 31101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TSYS01 Full Test2004006008001000SE +/- 6.17, N = 38151. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NSYS01 Full Test20406080100SE +/- 1.01, N = 376.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTSYS01 Full Test2004006008001000SE +/- 7.57, N = 310081. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYSYS01 Full Test5001000150020002500SE +/- 210.74, N = 322271. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYSYS01 Full Test400800120016002000SE +/- 52.92, N = 317901. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTSYS01 Full Test120240360480600SE +/- 10.40, N = 35361. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYSYS01 Full Test400800120016002000SE +/- 0.00, N = 319501. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYSYS01 Full Test30060090012001500SE +/- 0.00, N = 313901. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-OnlySYS01 Full Test510152025SE +/- 0.02, N = 320.04

ArrayFire

Test: BLAS OpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterArrayFire 3.7Test: BLAS OpenCLSYS01 Full Test2K4K6K8K10KSE +/- 0.39, N = 39644.251. (CXX) g++ options: -rdynamic

ArrayFire

Test: Conjugate Gradient CPU

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient CPUSYS01 Full Test612182430SE +/- 0.67, N = 1525.281. (CXX) g++ options: -rdynamic

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthSYS01 Full Test30060090012001500SE +/- 0.55, N = 31581.661. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleSYS01 Full Test246810SE +/- 0.001, N = 36.9471. (CXX) g++ options: -O3

FinanceBench

Benchmark: Monte-Carlo OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Monte-Carlo OpenCLSYS01 Full Test20406080100SE +/- 0.22, N = 3103.261. (CXX) g++ options: -O3 -march=native -fopenmp

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverSYS01 Full Test246810SE +/- 0.031, N = 38.2771. (CXX) g++ options: -O2 -lOpenCL

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLSYS01 Full Test0.58751.1751.76252.352.9375SE +/- 0.002, N = 32.6111. (CXX) g++ options: -rdynamic

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPUSYS01 Full Test60120180240300SE +/- 2.01, N = 3254.59MIN: 216.89 / MAX: 289.85

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterSYS01 Full Test0.86311.72622.58933.45244.3155SE +/- 0.042, N = 53.8361. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterSYS01 Full Test246810SE +/- 0.020, N = 36.1731. (CXX) g++ options: -O2 -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DSYS01 Full Test2004006008001000SE +/- 0.37, N = 3827.591. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackSYS01 Full Test612182430SE +/- 0.00, N = 327.121. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Single-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeSYS01 Full Test4K8K12K16K20KSE +/- 7.61, N = 319346.291. (CXX) g++ options: -O3

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthSYS01 Full Test30060090012001500SE +/- 0.20, N = 31494.781. (CXX) g++ options: -O3

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopySYS01 Full Test50100150200250SE +/- 0.00, N = 32351. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadSYS01 Full Test2004006008001000SE +/- 0.17, N = 3796.51. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteSYS01 Full Test30060090012001500SE +/- 0.82, N = 31402.41. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPSYS01 Full Test10002000300040005000SE +/- 2.55, N = 34438.111. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionSYS01 Full Test50100150200250SE +/- 0.03, N = 3241.641. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadSYS01 Full Test612182430SE +/- 0.02, N = 326.071. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Integer 24-bit Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeSYS01 Full Test4K8K12K16K20KSE +/- 8.76, N = 319300.841. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeSYS01 Full Test4K8K12K16K20KSE +/- 10.96, N = 319264.191. (CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeSYS01 Full Test2K4K6K8K10KSE +/- 0.15, N = 39719.801. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NSYS01 Full Test3K6K9K12K15KSE +/- 1.31, N = 313584.91. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencySYS01 Full Test1.21052.4213.63154.8426.0525SE +/- 0.01, N = 35.381. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadSYS01 Full Test612182430SE +/- 0.00, N = 326.861. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashSYS01 Full Test1020304050SE +/- 0.00, N = 342.971. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLSYS01 Full Test0.24530.49060.73590.98121.2265SE +/- 0.007, N = 31.0901. (CXX) g++ options: -O3 -march=native -fopenmp


Phoronix Test Suite v10.8.5