mi100-1 KVM testing on Ubuntu 18.04 via the Phoronix Test Suite. mi100: Processor: 16 x Intel Core (Haswell no TSX) (16 Cores), Motherboard: RDO OpenStack Compute (1.11.0-2.el7 BIOS), Chipset: Intel 82G33/G31/P35/P31 + ICH9, Memory: 64GB, Disk: 21GB QEMU HDD + 107GB QEMU HDD, Graphics: Cirrus Logic GD 5446 32GB, Network: Red Hat Virtio device OS: Ubuntu 18.04, Kernel: 5.4.0-64-generic (x86_64), OpenCL: OpenCL 2.0 AMD-APP (3275.0), Compiler: GCC 7.5.0, File-System: ext4, Screen Resolution: 1024x768, System Layer: KVM SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better mi100 . 12.27 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better mi100 . 2783.51 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better mi100 . 27.89 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better mi100 . 21943033 |============================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better mi100 . 13.67 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better mi100 . 14.08 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better mi100 . 706.11 |=============================================================== cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better mi100 . 286.8 |================================================================ cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better mi100 . 916.8 |================================================================ cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better mi100 . 730.0 |================================================================ Rodinia 3.1 Test: OpenCL Myocyte Seconds < Lower Is Better mi100 . 132.60 |=============================================================== Rodinia 3.1 Test: OpenCL Heartwall Seconds < Lower Is Better mi100 . 3.133 |================================================================ Darktable 2.4.2 Test: Boat - Acceleration: OpenCL Seconds < Lower Is Better mi100 . 2.008 |================================================================ Darktable 2.4.2 Test: Masskrug - Acceleration: OpenCL Seconds < Lower Is Better mi100 . 5.075 |================================================================ Darktable 2.4.2 Test: Server Rack - Acceleration: OpenCL Seconds < Lower Is Better mi100 . 0.177 |================================================================ Darktable 2.4.2 Test: Server Room - Acceleration: OpenCL Seconds < Lower Is Better mi100 . 0.864 |================================================================ Blender 2.92 Blend File: BMW27 - Compute: OpenCL Seconds < Lower Is Better mi100 . 53.76 |================================================================ clpeak OpenCL Test: Kernel Latency us < Lower Is Better mi100 . 17.87 |================================================================ clpeak OpenCL Test: Integer Compute INT GIOPS > Higher Is Better mi100 . 7487.84 |============================================================== clpeak OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better mi100 . 22813.55 |============================================================= clpeak OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better mi100 . 11439.47 |============================================================= clpeak OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better mi100 . 960.15 |=============================================================== clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer GBPS > Higher Is Better mi100 . 4.86 |================================================================= clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer GBPS > Higher Is Better mi100 . 10.96 |================================================================