mi100-1

KVM testing on AlmaLinux 8.5 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2111222-TJ-2105265IB60
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 3 Tests
Creator Workloads 2 Tests
HPC - High Performance Computing 2 Tests
Multi-Core 2 Tests
NVIDIA GPU Compute 5 Tests
OpenCL 6 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
mi100
May 26
  1 Hour, 2 Minutes
V100
May 26
  1 Hour, 35 Minutes
P40
November 22
  1 Hour, 14 Minutes
Invert Hiding All Results Option
  1 Hour, 17 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):


mi100-1ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelOpenCLCompilerFile-SystemScreen ResolutionSystem LayerDisplay DriverVulkanmi100V100P4016 x Intel Core (Haswell no TSX) (16 Cores)RDO OpenStack Compute (1.11.0-2.el7 BIOS)Intel 82G33/G31/P35/P31 + ICH964GB21GB QEMU HDD + 107GB QEMU HDDCirrus Logic GD 5446 32GBRed Hat Virtio deviceUbuntu 18.045.4.0-64-generic (x86_64)OpenCL 2.0 AMD-APP (3275.0)GCC 7.5.0ext41024x768KVM2 x Intel Xeon (Skylake IBRS) (2 Cores)8GB21GB QEMU HDD + 53GB QEMU HDDCirrus Logic GD 5446 8GBUbuntu 20.045.4.0-67-generic (x86_64)NVIDIAOpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.24 x Intel Xeon (Cascadelake) (4 Cores)Red Hat RHEL-AV (0.0.0 BIOS)16GB21GB QEMU HDD + 54GB QEMU HDDCirrus Logic GD 5446 6GBAlmaLinux 8.54.18.0-305.19.1.el8_4.x86_64 (x86_64)GCC 8.5.0 20210514xfs1024x768OpenBenchmarking.orgKernel Details- mi100: Transparent Huge Pages: madvise- V100: Transparent Huge Pages: madvise- P40: Transparent Huge Pages: alwaysCompiler Details- mi100: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - V100: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - P40: --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- CPU Microcode: 0x1Python Details- mi100: Python 2.7.17 + Python 3.6.9Security Details- mi100: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Unknown: Dependent on hypervisor status + tsx_async_abort: Not affected - V100: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown - P40: SELinux + itlb_multihit: KVM: Vulnerable + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Graphics Details- P40: BAR1 / Visible vRAM Size: 8192 MiB

mi100V100P40Logarithmic Result OverviewPhoronix Test Suite 10.6.1SHOC Scalable HeterOgeneous ComputingclpeakBlendercl-mem

mi100-1shoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writerodinia: OpenCL Myocyterodinia: OpenCL Heartwalldarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLblender: BMW27 - OpenCLclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLmi100V100P4012.27402783.5127.89102194303313.669414.0831706.109286.8916.8730.0132.6003.1332.0085.0750.1770.86453.7617.877487.8422813.5511439.47960.154.8610.9612.26492278.0931.092814052.712.344113.17091470.52268.5780.2736.7115.4792.9191281.465.5113899.1714073.617003.99769.524.046.645.56618.1560.4141.81011.8020800.24617.658811756.012.336013.1668503.539240.3292.2289.9549.4757.183140.6910130.74368.87282.735.787.54OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triadmi100V100P403691215SE +/- 0.15, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 312.2712.2611.801. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triadmi100V100P4048121620Min: 12.09 / Avg: 12.27 / Max: 12.58Min: 12.26 / Avg: 12.26 / Max: 12.27Min: 11.78 / Avg: 11.8 / Max: 11.831. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPmi100V100P406001200180024003000SE +/- 2.72, N = 3SE +/- 7.23, N = 3SE +/- 0.45, N = 32783.512278.09800.251. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPmi100V100P405001000150020002500Min: 2778.13 / Avg: 2783.51 / Max: 2786.93Min: 2263.66 / Avg: 2278.09 / Max: 2286.14Min: 799.59 / Avg: 800.25 / Max: 801.111. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hashmi100V100P40714212835SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 327.8931.0917.661. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hashmi100V100P40714212835Min: 27.89 / Avg: 27.89 / Max: 27.9Min: 31.08 / Avg: 31.09 / Max: 31.1Min: 17.66 / Avg: 17.66 / Max: 17.661. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flopsmi100V100P405M10M15M20M25MSE +/- 89939.99, N = 3SE +/- 6.33, N = 3SE +/- 2.39, N = 321943033.014052.711756.01. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flopsmi100V100P404M8M12M16M20MMin: 21785500 / Avg: 21943033.33 / Max: 22097000Min: 14040.6 / Avg: 14052.67 / Max: 14062Min: 11751.6 / Avg: 11756 / Max: 11759.81. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Downloadmi100V100P4048121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 313.6712.3412.341. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Downloadmi100V100P4048121620Min: 13.67 / Avg: 13.67 / Max: 13.67Min: 12.34 / Avg: 12.34 / Max: 12.34Min: 12.34 / Avg: 12.34 / Max: 12.341. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readbackmi100V100P4048121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 314.0813.1713.171. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readbackmi100V100P4048121620Min: 14.08 / Avg: 14.08 / Max: 14.09Min: 13.17 / Avg: 13.17 / Max: 13.17Min: 13.17 / Avg: 13.17 / Max: 13.171. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidthmi100V100P4030060090012001500SE +/- 0.38, N = 3SE +/- 1.76, N = 3SE +/- 0.70, N = 3706.111470.52503.541. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidthmi100V100P4030060090012001500Min: 705.43 / Avg: 706.11 / Max: 706.73Min: 1468.18 / Avg: 1470.52 / Max: 1473.96Min: 502.3 / Avg: 503.54 / Max: 504.741. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copymi100V100P4060120180240300SE +/- 1.71, N = 3SE +/- 0.47, N = 3SE +/- 0.03, N = 3286.8268.5240.31. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copymi100V100P4050100150200250Min: 284.3 / Avg: 286.83 / Max: 290.1Min: 267.6 / Avg: 268.53 / Max: 269.1Min: 240.2 / Avg: 240.27 / Max: 240.31. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readmi100V100P402004006008001000SE +/- 1.78, N = 3SE +/- 1.72, N = 3SE +/- 0.12, N = 3916.8780.2292.21. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readmi100V100P40160320480640800Min: 914.5 / Avg: 916.8 / Max: 920.3Min: 778 / Avg: 780.2 / Max: 783.6Min: 292 / Avg: 292.2 / Max: 292.41. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writemi100V100P40160320480640800SE +/- 0.52, N = 3SE +/- 0.59, N = 3SE +/- 0.07, N = 3730.0736.7289.91. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writemi100V100P40130260390520650Min: 729.2 / Avg: 730.03 / Max: 731Min: 735.5 / Avg: 736.67 / Max: 737.4Min: 289.8 / Avg: 289.87 / Max: 2901. (CC) gcc options: -O2 -flto -lOpenCL

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocytemi100V100306090120150SE +/- 3.88, N = 12SE +/- 0.98, N = 3132.60115.48-O2 -lOpenCL-m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl1. (CXX) g++ options:
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocytemi100V10020406080100Min: 112.71 / Avg: 132.6 / Max: 149.47Min: 113.58 / Avg: 115.48 / Max: 116.831. (CXX) g++ options:

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Heartwallmi100V1000.70491.40982.11472.81963.5245SE +/- 0.012, N = 33.1332.919-O2 -lOpenCL-m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl1. (CXX) g++ options:
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Heartwallmi100V100246810Min: 3.12 / Avg: 3.13 / Max: 3.161. (CXX) g++ options:

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: OpenCLmi1000.45180.90361.35541.80722.259SE +/- 0.014, N = 152.008
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: OpenCLmi100246810Min: 1.9 / Avg: 2.01 / Max: 2.11

Test: Boat - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: OpenCLmi1001.14192.28383.42574.56765.7095SE +/- 0.051, N = 35.075
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: OpenCLmi100246810Min: 4.98 / Avg: 5.08 / Max: 5.15

Test: Masskrug - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: OpenCLmi1000.03980.07960.11940.15920.199SE +/- 0.005, N = 150.177
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: OpenCLmi10012345Min: 0.15 / Avg: 0.18 / Max: 0.22

Test: Server Rack - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: OpenCLmi1000.19440.38880.58320.77760.972SE +/- 0.001, N = 30.864
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: OpenCLmi100246810Min: 0.86 / Avg: 0.86 / Max: 0.87

Test: Server Room - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: OpenCLmi100V100P4030060090012001500SE +/- 2.10, N = 15SE +/- 2.77, N = 3SE +/- 3.70, N = 353.761281.46549.47
OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: OpenCLmi100V100P402004006008001000Min: 50.61 / Avg: 53.76 / Max: 83.16Min: 1276.81 / Avg: 1281.46 / Max: 1286.4Min: 542.58 / Avg: 549.47 / Max: 555.26

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latencymi100V100P401326395265SE +/- 0.64, N = 12SE +/- 0.06, N = 3SE +/- 0.14, N = 317.875.5157.181. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latencymi100V100P401122334455Min: 13.13 / Avg: 17.87 / Max: 20.01Min: 5.44 / Avg: 5.51 / Max: 5.63Min: 56.99 / Avg: 57.18 / Max: 57.461. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTmi100V100P403K6K9K12K15KSE +/- 5.18, N = 3SE +/- 168.65, N = 3SE +/- 24.17, N = 157487.8413899.173140.691. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTmi100V100P402K4K6K8K10KMin: 7479.63 / Avg: 7487.84 / Max: 7497.43Min: 13574.98 / Avg: 13899.17 / Max: 14141.9Min: 2903.67 / Avg: 3140.69 / Max: 3177.171. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Floatmi100V100P405K10K15K20K25KSE +/- 6.31, N = 3SE +/- 50.95, N = 3SE +/- 127.29, N = 1522813.5514073.6110130.741. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Floatmi100V100P404K8K12K16K20KMin: 22800.97 / Avg: 22813.55 / Max: 22820.79Min: 13971.73 / Avg: 14073.61 / Max: 14126.05Min: 8784.21 / Avg: 10130.74 / Max: 10324.231. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Doublemi100V100P402K4K6K8K10KSE +/- 3.65, N = 3SE +/- 57.57, N = 3SE +/- 0.52, N = 311439.477003.99368.871. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Doublemi100V100P402K4K6K8K10KMin: 11434.38 / Avg: 11439.47 / Max: 11446.54Min: 6891.88 / Avg: 7003.99 / Max: 7082.74Min: 367.83 / Avg: 368.87 / Max: 369.391. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidthmi100V100P402004006008001000SE +/- 0.94, N = 3SE +/- 0.50, N = 3SE +/- 0.32, N = 3960.15769.52282.731. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidthmi100V100P402004006008001000Min: 958.27 / Avg: 960.15 / Max: 961.14Min: 768.84 / Avg: 769.52 / Max: 770.49Min: 282.38 / Avg: 282.73 / Max: 283.371. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBuffermi100V100P401.30052.6013.90155.2026.5025SE +/- 0.05, N = 6SE +/- 0.02, N = 3SE +/- 0.43, N = 124.864.045.781. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBuffermi100V100P40246810Min: 4.78 / Avg: 4.86 / Max: 5.08Min: 4 / Avg: 4.04 / Max: 4.08Min: 4.63 / Avg: 5.78 / Max: 9.131. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBuffermi100V100P403691215SE +/- 1.90, N = 15SE +/- 0.19, N = 15SE +/- 0.37, N = 1510.966.647.541. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBuffermi100V100P403691215Min: 7.11 / Avg: 10.96 / Max: 25.11Min: 5.87 / Avg: 6.64 / Max: 8.24Min: 6.22 / Avg: 7.54 / Max: 11.051. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Boat - Acceleration: OpenCLV1001.25242.50483.75725.00966.262SE +/- 0.038, N = 35.566
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Boat - Acceleration: OpenCLV100246810Min: 5.5 / Avg: 5.57 / Max: 5.63

Test: Boat - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Masskrug - Acceleration: OpenCLV10048121620SE +/- 0.18, N = 318.16
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Masskrug - Acceleration: OpenCLV100510152025Min: 17.91 / Avg: 18.16 / Max: 18.52

Test: Masskrug - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Rack - Acceleration: OpenCLV1000.09320.18640.27960.37280.466SE +/- 0.008, N = 150.414
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Rack - Acceleration: OpenCLV10012345Min: 0.37 / Avg: 0.41 / Max: 0.48

Test: Server Rack - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Room - Acceleration: OpenCLV1000.40730.81461.22191.62922.0365SE +/- 0.021, N = 151.810
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.1Test: Server Room - Acceleration: OpenCLV100246810Min: 1.67 / Avg: 1.81 / Max: 1.95

Test: Server Room - Acceleration: OpenCL

P40: The test quit with a non-zero exit status.