m-40cpu-662mem-8v100 m-40cpu-662mem-8v100 m-40cpu-662mem-8v100: Processor: 2 x Intel Xeon Platinum 8168 (40 Cores), Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.0 BIOS), Memory: 662GB, Disk: 32GB Virtual Disk + 3114GB Virtual Disk, Graphics: Tesla V100-SXM2-32GB OS: Ubuntu 20.04, Kernel: 5.4.0-1039-azure (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.9, Display Driver: NVIDIA, OpenCL: OpenCL 1.2 CUDA 11.2.109, Vulkan: 1.2.155, Compiler: GCC 9.3.0 + CUDA 10.1, File-System: ext4, Screen Resolution: 1152x864, System Layer: microsoft SQLite 3.30.1 Threads / Copies: 1 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 3.608 |================================================= SQLite 3.30.1 Threads / Copies: 8 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 5.800 |================================================= SQLite 3.30.1 Threads / Copies: 32 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 11.92 |================================================= SQLite 3.30.1 Threads / Copies: 64 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 20.83 |================================================= SQLite 3.30.1 Threads / Copies: 128 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 44.31 |================================================= Flexible IO Tester 3.25 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 8914 |================================================== Flexible IO Tester 3.25 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 4454 |================================================== Flexible IO Tester 3.25 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 614 |=================================================== Flexible IO Tester 3.25 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 157000 |================================================ Flexible IO Tester 3.25 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 5734 |================================================== Flexible IO Tester 3.25 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 2863 |================================================== Flexible IO Tester 3.25 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 684 |=================================================== Flexible IO Tester 3.25 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 174800 |================================================ Flexible IO Tester 3.25 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 9823 |================================================== Flexible IO Tester 3.25 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 4908 |================================================== Flexible IO Tester 3.25 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 619 |=================================================== Flexible IO Tester 3.25 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 158667 |================================================ Flexible IO Tester 3.25 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 5790 |================================================== Flexible IO Tester 3.25 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 2892 |================================================== Flexible IO Tester 3.25 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 582 |=================================================== Flexible IO Tester 3.25 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better m-40cpu-662mem-8v100 . 149000 |================================================ FS-Mark 3.3 Test: 1000 Files, 1MB Size Files/s > Higher Is Better m-40cpu-662mem-8v100 . 775.1 |================================================= FS-Mark 3.3 Test: 5000 Files, 1MB Size, 4 Threads Files/s > Higher Is Better m-40cpu-662mem-8v100 . 1950.5 |================================================ FS-Mark 3.3 Test: 4000 Files, 32 Sub Dirs, 1MB Size Files/s > Higher Is Better m-40cpu-662mem-8v100 . 809.8 |================================================= FS-Mark 3.3 Test: 1000 Files, 1MB Size, No Sync/FSync Files/s > Higher Is Better m-40cpu-662mem-8v100 . 1583.5 |================================================ Dbench 4.0 12 Clients MB/s > Higher Is Better m-40cpu-662mem-8v100 . 4479.04 |=============================================== Dbench 4.0 1 Clients MB/s > Higher Is Better m-40cpu-662mem-8v100 . 730.32 |================================================ IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 370.88 |================================================ IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 468.01 |================================================ IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 544.68 |================================================ IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 665.96 |================================================ IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 779.43 |================================================ IOR 3.3.0 Block Size: 64MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 821.17 |================================================ IOR 3.3.0 Block Size: 256MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 728.24 |================================================ IOR 3.3.0 Block Size: 512MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 712.34 |================================================ IOR 3.3.0 Block Size: 1024MB - Disk Target: Default Test Directory MB/s > Higher Is Better m-40cpu-662mem-8v100 . 698.63 |================================================ PostMark 1.51 Disk Transaction Performance TPS > Higher Is Better m-40cpu-662mem-8v100 . 4335 |================================================== Hashcat 6.1.1 Benchmark: MD5 H/s > Higher Is Better m-40cpu-662mem-8v100 . 204328975000 |========================================== Hashcat 6.1.1 Benchmark: SHA1 H/s > Higher Is Better m-40cpu-662mem-8v100 . 65941850000 |=========================================== Hashcat 6.1.1 Benchmark: 7-Zip H/s > Higher Is Better m-40cpu-662mem-8v100 . 7323840 |=============================================== Hashcat 6.1.1 Benchmark: SHA-512 H/s > Higher Is Better m-40cpu-662mem-8v100 . 15675466667 |=========================================== Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better m-40cpu-662mem-8v100 . 4579200 |=============================================== ViennaCL 1.4.2 OpenCL LU Factorization GFLOPS > Higher Is Better m-40cpu-662mem-8v100 . 50.44 |================================================= cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better m-40cpu-662mem-8v100 . 255.2 |================================================= cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better m-40cpu-662mem-8v100 . 707.3 |================================================= cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better m-40cpu-662mem-8v100 . 743.9 |================================================= FAHBench 2.3.2 Ns Per Day > Higher Is Better m-40cpu-662mem-8v100 . 262.18 |================================================ RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer MB/s > Higher Is Better m-40cpu-662mem-8v100 . 30093.25 |============================================== RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer MB/s > Higher Is Better m-40cpu-662mem-8v100 . 30377.03 |============================================== RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer MB/s > Higher Is Better m-40cpu-662mem-8v100 . 25238.35 |============================================== RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer MB/s > Higher Is Better m-40cpu-662mem-8v100 . 28989.88 |============================================== RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer MB/s > Higher Is Better m-40cpu-662mem-8v100 . 29144.21 |============================================== RAMspeed SMP 3.5.0 Type: Add - Benchmark: Floating Point MB/s > Higher Is Better m-40cpu-662mem-8v100 . 25556.49 |============================================== RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Floating Point MB/s > Higher Is Better m-40cpu-662mem-8v100 . 30257.56 |============================================== RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Floating Point MB/s > Higher Is Better m-40cpu-662mem-8v100 . 23498.35 |============================================== RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Floating Point MB/s > Higher Is Better m-40cpu-662mem-8v100 . 29364.52 |============================================== RAMspeed SMP 3.5.0 Type: Average - Benchmark: Floating Point MB/s > Higher Is Better m-40cpu-662mem-8v100 . 27200.16 |============================================== Stream 2013-01-17 Type: Copy MB/s > Higher Is Better m-40cpu-662mem-8v100 . 155286.8 |============================================== Stream 2013-01-17 Type: Scale MB/s > Higher Is Better m-40cpu-662mem-8v100 . 132988.1 |============================================== Stream 2013-01-17 Type: Triad MB/s > Higher Is Better m-40cpu-662mem-8v100 . 149751.7 |============================================== Stream 2013-01-17 Type: Add MB/s > Higher Is Better m-40cpu-662mem-8v100 . 149961.6 |============================================== Tinymembench 2018-05-28 Standard Memcpy MB/s > Higher Is Better m-40cpu-662mem-8v100 . 5366.2 |================================================ Tinymembench 2018-05-28 Standard Memset MB/s > Higher Is Better m-40cpu-662mem-8v100 . 12223.2 |=============================================== MBW 2018-09-08 Test: Memory Copy - Array Size: 1024 MiB MiB/s > Higher Is Better m-40cpu-662mem-8v100 . 4865.86 |=============================================== MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB MiB/s > Higher Is Better m-40cpu-662mem-8v100 . 4855.67 |=============================================== t-test1 2017-01-13 Threads: 1 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 26.33 |================================================= t-test1 2017-01-13 Threads: 2 Seconds < Lower Is Better m-40cpu-662mem-8v100 . 9.062 |================================================= Rodinia 2.4 Test: OpenMP LavaMD Seconds < Lower Is Better m-40cpu-662mem-8v100 . 11.07 |================================================= Rodinia 2.4 Test: OpenMP CFD Solver Seconds < Lower Is Better m-40cpu-662mem-8v100 . 9.312 |================================================= Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better m-40cpu-662mem-8v100 . 4.146 |================================================= NAMD 2.13b1 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better m-40cpu-662mem-8v100 . 0.56489 |=============================================== CacheBench Read Cache MB/s > Higher Is Better m-40cpu-662mem-8v100 . 2872.51 |=============================================== CacheBench Write Cache MB/s > Higher Is Better m-40cpu-662mem-8v100 . 24034.99 |============================================== ArrayFire 3.7 Test: Conjugate Gradient OpenCL ms < Lower Is Better m-40cpu-662mem-8v100 . 2.508 |================================================= Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 11.82 |================================================= Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 12.14 |================================================= Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 37.02 |================================================= Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 38.18 |================================================= Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 27.78 |================================================= Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 46.43 |================================================= Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 83.86 |================================================= Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 141.57 |================================================ x264 2018-09-25 H.264 Video Encoding Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 181.42 |================================================ x265 3.4 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 23.40 |================================================= x265 3.4 Video Input: Bosphorus 1080p Frames Per Second > Higher Is Better m-40cpu-662mem-8v100 . 65.25 |================================================= 7-Zip Compression 16.02 Compress Speed Test MIPS > Higher Is Better m-40cpu-662mem-8v100 . 154757 |================================================ Stockfish 9 Total Time Nodes Per Second > Higher Is Better m-40cpu-662mem-8v100 . 72738220 |============================================== asmFish 2018-07-23 1024 Hash Memory, 26 Depth Nodes/second > Higher Is Better m-40cpu-662mem-8v100 . 80000341 |============================================== Timed Linux Kernel Compilation 4.18 Time To Compile Seconds < Lower Is Better m-40cpu-662mem-8v100 . 32.68 |================================================= POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better m-40cpu-662mem-8v100 . 18.80 |================================================= Radiance Benchmark 5.0 Test: Serial Seconds < Lower Is Better m-40cpu-662mem-8v100 . 841.21 |================================================ Radiance Benchmark 5.0 Test: SMP Parallel Seconds < Lower Is Better m-40cpu-662mem-8v100 . 255.02 |================================================ OpenSSL 1.1.1 RSA 4096-bit Performance Signs Per Second > Higher Is Better m-40cpu-662mem-8v100 . 9857.8 |================================================ FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better m-40cpu-662mem-8v100 . 1.271 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better m-40cpu-662mem-8v100 . 17.61 |================================================= NCNN 20201218 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better m-40cpu-662mem-8v100 . 7.47 |================================================== NCNN 20201218 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better m-40cpu-662mem-8v100 . 6.74 |================================================== NCNN 20201218 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better m-40cpu-662mem-8v100 . 7.37 |================================================== NCNN 20201218 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better m-40cpu-662mem-8v100 . 6.91 |================================================== NCNN 20201218 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better m-40cpu-662mem-8v100 . 8.71 |================================================== NCNN 20201218 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better m-40cpu-662mem-8v100 . 3.30 |================================================== NCNN 20201218 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better m-40cpu-662mem-8v100 . 19.21 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better m-40cpu-662mem-8v100 . 44.37 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better m-40cpu-662mem-8v100 . 13.36 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better m-40cpu-662mem-8v100 . 8.60 |================================================== NCNN 20201218 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better m-40cpu-662mem-8v100 . 24.73 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better m-40cpu-662mem-8v100 . 29.66 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better m-40cpu-662mem-8v100 . 22.21 |================================================= NCNN 20201218 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better m-40cpu-662mem-8v100 . 43.61 |================================================= PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL FPS > Higher Is Better m-40cpu-662mem-8v100 . 706.02 |================================================ PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL FPS > Higher Is Better m-40cpu-662mem-8v100 . 3076.83 |=============================================== PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL FPS > Higher Is Better m-40cpu-662mem-8v100 . 3397.98 |=============================================== PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL FPS > Higher Is Better m-40cpu-662mem-8v100 . 247.70 |================================================ ctx_clock Context Switch Time Clocks < Lower Is Better m-40cpu-662mem-8v100 . 1010 |================================================== Sysbench 2018-07-28 Test: CPU Events Per Second > Higher Is Better m-40cpu-662mem-8v100 . 44619.40 |============================================== Blender 2.79a Blend File: Barbershop - Compute: CPU-Only Seconds < Lower Is Better m-40cpu-662mem-8v100 . 369.58 |================================================ clpeak OpenCL Test: Integer Compute INT GIOPS > Higher Is Better m-40cpu-662mem-8v100 . 15474.54 |============================================== clpeak OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better m-40cpu-662mem-8v100 . 15290.39 |============================================== clpeak OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better m-40cpu-662mem-8v100 . 7768.67 |=============================================== clpeak OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better m-40cpu-662mem-8v100 . 713.27 |================================================