first_m100_run

2 x Intel Xeon Gold 6132 testing with a Dell PowerEdge R740 [0M27WY] (2.14.2 BIOS) and Matrox G200eW3 32GB on AlmaLinux 8.7 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2211303-NE-FIRSTM10000&grs.

first_m100_runProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDisplay ServerOpenCLCompilerFile-SystemScreen Resolutionfirst m100 run on r7402 x Intel Xeon Gold 6132 (28 Cores / 56 Threads)Dell PowerEdge R740 [0M27WY] (2.14.2 BIOS)Intel Sky Lake-E DMI3 Registers12 x 16 GB DDR4-2666MT/s M393A2K43BB1-CTD240GB INTEL SSDSC2KB24Matrox G200eW3 32GBDELL U2412M4 x Intel X710 for 10GbE SFP+AlmaLinux 8.74.18.0-425.3.1.el8.x86_64 (x86_64)X Server 1.20.11OpenCL 2.1 AMD-APP (3486.0)GCC 8.5.0 20210514xfs1920x1200OpenBenchmarking.org- Transparent Huge Pages: always- --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - CPU Microcode: 0x2006e05- Python 2.7.18 + Python 3.6.8- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

first_m100_runclpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Global Memory Bandwidthclpeak: Double-Precision Doubleclpeak: Single-Precision Floatclpeak: Integer Compute INTclpeak: Kernel Latencydarktable: Server Room - OpenCLdarktable: Server Rack - OpenCLdarktable: Masskrug - OpenCLdarktable: Boat - OpenCLrodinia: OpenCL Heartwallcl-mem: Writecl-mem: Readcl-mem: Copyshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Max SP Flopsshoc: OpenCL - MD5 Hashshoc: OpenCL - FFT SPshoc: OpenCL - Triadrodinia: OpenCL Myocyteparboil: OpenCL BFSfirst m100 run on r7405.634.82938.9211255.7822702.1410328.265.570.8080.2652.4521.4063.060728.0913.0281.9696.90713.137413.74871882001327.92752802.3311.842040.576OpenBenchmarking.org

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferfirst m100 run on r7401.26682.53363.80045.06726.334SE +/- 0.01, N = 35.631. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferfirst m100 run on r7401.08452.1693.25354.3385.4225SE +/- 0.01, N = 34.821. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidthfirst m100 run on r7402004006008001000SE +/- 0.51, N = 3938.921. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Doublefirst m100 run on r7402K4K6K8K10KSE +/- 9.15, N = 311255.781. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Floatfirst m100 run on r7405K10K15K20K25KSE +/- 7.79, N = 322702.141. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTfirst m100 run on r7402K4K6K8K10KSE +/- 9.71, N = 310328.261. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latencyfirst m100 run on r7401.25332.50663.75995.01326.2665SE +/- 0.03, N = 35.571. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Room - Acceleration: OpenCLfirst m100 run on r7400.18180.36360.54540.72720.909SE +/- 0.007, N = 30.808

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Rack - Acceleration: OpenCLfirst m100 run on r7400.05960.11920.17880.23840.298SE +/- 0.003, N = 40.265

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Masskrug - Acceleration: OpenCLfirst m100 run on r7400.55171.10341.65512.20682.7585SE +/- 0.023, N = 32.452

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Boat - Acceleration: OpenCLfirst m100 run on r7400.31640.63280.94921.26561.582SE +/- 0.011, N = 101.406

Rodinia

Test: OpenCL Heartwall

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Heartwallfirst m100 run on r7400.68851.3772.06552.7543.4425SE +/- 0.010, N = 33.0601. (CXX) g++ options: -O2 -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writefirst m100 run on r740160320480640800SE +/- 0.46, N = 3728.01. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readfirst m100 run on r7402004006008001000SE +/- 0.92, N = 3913.01. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copyfirst m100 run on r74060120180240300SE +/- 0.38, N = 3281.91. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidthfirst m100 run on r740150300450600750SE +/- 0.63, N = 3696.911. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readbackfirst m100 run on r7403691215SE +/- 0.00, N = 313.141. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Downloadfirst m100 run on r74048121620SE +/- 0.00, N = 313.751. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flopsfirst m100 run on r7404M8M12M16M20MSE +/- 157796.98, N = 15188200131. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hashfirst m100 run on r740714212835SE +/- 0.00, N = 327.931. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPfirst m100 run on r7406001200180024003000SE +/- 1.97, N = 32802.331. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triadfirst m100 run on r7403691215SE +/- 0.01, N = 311.841. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocytefirst m100 run on r740918273645SE +/- 10.32, N = 1540.581. (CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.4