KVM testing on CentOS Linux 8 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2203291-IB-TBNVIDIAA71 tb-nvidia-a5000-amd-vm-test - Phoronix Test Suite tb-nvidia-a5000-amd-vm-test KVM testing on CentOS Linux 8 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2203291-IB-TBNVIDIAA71&rdt&grw .
tb-nvidia-a5000-amd-vm-test Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Display Driver Compiler File-System Screen Resolution System Layer only for performance test Cirrus Logic GD 5446 4 x AMD EPYC (with IBPB) (12 Cores / 24 Threads) QEMU Standard PC (i440FX + PIIX 1996) (1.13.0-1ubuntu1.1 BIOS) Intel 440FX 82441FX PMC 16 GB + 16 GB + 8 GB RAM 215GB QEMU HDD Cirrus Logic GD 5446 24GB NVIDIA GA102 HD Audio 2 x Red Hat Virtio device CentOS Linux 8 4.18.0-193.el8.x86_64 (x86_64) NVIDIA GCC 8.5.0 20210514 + CUDA 11.0 ext4 1024x768 KVM OpenBenchmarking.org Kernel Details - Transparent Huge Pages: always Compiler Details - --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details - CPU Microcode: 0x1000065 Python Details - Python 3.6.8 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + tsx_async_abort: Not affected
tb-nvidia-a5000-amd-vm-test lczero: OpenCL rodinia: OpenCL Particle Filter arrayfire: Conjugate Gradient OpenCL neatbench: GPU fahbench: mixbench: OpenCL - Integer mixbench: OpenCL - Double Precision mixbench: OpenCL - Single Precision financebench: Black-Scholes OpenCL cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - sDOT only for performance test Cirrus Logic GD 5446 15324.11 405.71 29457.90 338.6 663.2 659.9 10501 5.518 1.673 53.1 279.0689 15282.43 277.82 29032.96 8.380 338.1 663.2 659.3 13872.87 26635.82 486.16 656.86 14.8 21.9 21.5 10.07 14.6 16.7 16.3 18.1 34.3 34.1 35.4 34.7 332 449 534 616 565 218 343 446 446 444 333 OpenBenchmarking.org
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: OpenCL Cirrus Logic GD 5446 2K 4K 6K 8K 10K SE +/- 58.00, N = 3 10501 1. (CXX) g++ options: -flto -pthread
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter Cirrus Logic GD 5446 1.2416 2.4832 3.7248 4.9664 6.208 SE +/- 0.004, N = 3 5.518 1. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL Cirrus Logic GD 5446 0.3821 0.7642 1.1463 1.5284 1.9105 SE +/- 0.006, N = 3 1.695 1. (CXX) g++ options: -rdynamic
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU Cirrus Logic GD 5446 12 24 36 48 60 SE +/- 0.54, N = 3 53.1
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 Cirrus Logic GD 5446 60 120 180 240 300 SE +/- 0.44, N = 3 279.07
Mixbench Backend: OpenCL - Benchmark: Integer OpenBenchmarking.org GIOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Cirrus Logic GD 5446 only for performance test 3K 6K 9K 12K 15K SE +/- 206.50, N = 3 SE +/- 113.43, N = 12 14357.13 15324.11 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: OpenCL - Benchmark: Double Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Cirrus Logic GD 5446 only for performance test 90 180 270 360 450 SE +/- 1.14, N = 3 SE +/- 4.09, N = 15 412.78 405.71 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: OpenCL - Benchmark: Single Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Cirrus Logic GD 5446 only for performance test 6K 12K 18K 24K 30K SE +/- 285.94, N = 6 SE +/- 34.92, N = 3 29198.81 29457.90 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL Cirrus Logic GD 5446 2 4 6 8 10 SE +/- 0.027, N = 3 8.380 1. (CXX) g++ options: -O3 -march=native -fopenmp
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Cirrus Logic GD 5446 only for performance test 70 140 210 280 350 SE +/- 0.10, N = 3 SE +/- 0.18, N = 3 338.7 338.6 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Cirrus Logic GD 5446 only for performance test 140 280 420 560 700 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 663.3 663.2 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Cirrus Logic GD 5446 only for performance test 140 280 420 560 700 SE +/- 0.23, N = 3 SE +/- 0.21, N = 3 659.1 659.9 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT Cirrus Logic GD 5446 3K 6K 9K 12K 15K SE +/- 50.98, N = 3 13878.94 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Cirrus Logic GD 5446 6K 12K 18K 24K 30K SE +/- 15.97, N = 3 26640.06 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Cirrus Logic GD 5446 110 220 330 440 550 SE +/- 1.95, N = 3 486.12 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Cirrus Logic GD 5446 140 280 420 560 700 SE +/- 0.03, N = 3 656.82 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY Cirrus Logic GD 5446 4 8 12 16 20 SE +/- 0.26, N = 12 12.20 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY Cirrus Logic GD 5446 6 12 18 24 30 SE +/- 0.41, N = 12 18.10 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT Cirrus Logic GD 5446 6 12 18 24 30 SE +/- 0.47, N = 12 19.10 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY Cirrus Logic GD 5446 3 6 9 12 15 SE +/- 0.11, N = 12 10.30 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY Cirrus Logic GD 5446 4 8 12 16 20 SE +/- 0.17, N = 12 15.60 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT Cirrus Logic GD 5446 5 10 15 20 25 SE +/- 0.28, N = 12 16.00 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N Cirrus Logic GD 5446 5 10 15 20 25 SE +/- 0.30, N = 15 19.40 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T Cirrus Logic GD 5446 5 10 15 20 25 SE +/- 0.22, N = 12 17.90 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN Cirrus Logic GD 5446 8 16 24 32 40 SE +/- 0.23, N = 12 34.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT Cirrus Logic GD 5446 8 16 24 32 40 SE +/- 0.30, N = 12 32.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN Cirrus Logic GD 5446 8 16 24 32 40 SE +/- 0.43, N = 12 34.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT Cirrus Logic GD 5446 8 16 24 32 40 SE +/- 0.21, N = 15 34.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY Cirrus Logic GD 5446 70 140 210 280 350 SE +/- 0.58, N = 3 333 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY Cirrus Logic GD 5446 100 200 300 400 500 SE +/- 0.88, N = 3 450 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY Cirrus Logic GD 5446 120 240 360 480 600 SE +/- 0.33, N = 3 533 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY Cirrus Logic GD 5446 130 260 390 520 650 SE +/- 0.33, N = 3 615 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT Cirrus Logic GD 5446 120 240 360 480 600 SE +/- 0.67, N = 3 566 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N Cirrus Logic GD 5446 50 100 150 200 250 218 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T Cirrus Logic GD 5446 70 140 210 280 350 SE +/- 0.67, N = 3 342 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN Cirrus Logic GD 5446 100 200 300 400 500 445 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT Cirrus Logic GD 5446 100 200 300 400 500 SE +/- 1.00, N = 3 447 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN Cirrus Logic GD 5446 100 200 300 400 500 SE +/- 1.33, N = 3 446 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT Cirrus Logic GD 5446 70 140 210 280 350 335 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4