machine-learning-1
AMD Ryzen 7 3700X 8-Core testing with an ASUS ROG STRIX X570-E GAMING (3001 BIOS) and Sapphire AMD Radeon RX 6600 XT 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2203204-PAXR-MACHINE72&grt
machine-learning-1 system details (result identifier: Sapphire AMD Radeon RX 6600 XT)
  Processor: AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads)
  Motherboard: ASUS ROG STRIX X570-E GAMING (3001 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 32GB
  Disk: 256GB HS-SSD-E1000 256G + 3 x 4001GB Western Digital WD40EFRX-68N + 4001GB Western Digital WD40EFZX-68A
  Graphics: Sapphire AMD Radeon RX 6600 XT 8GB (2900/1000MHz)
  Audio: AMD Device ab28
  Network: Realtek RTL8125 2.5GbE + Intel I211
  OS: Ubuntu 20.04
  Kernel: 5.4.0-104-generic (x86_64)
  Desktop: GNOME Shell 3.36.9
  Display Server: X Server 1.20.13
  OpenGL: 4.6 Mesa 22.0.0-devel (LLVM 13.0.1 DRM 3.44)
  OpenCL: OpenCL 2.2 AMD-APP (3406.0)
  Vulkan: 1.2.197
  Compiler: GCC 9.4.0 + Clang 10.0.0-4ubuntu1
  File-System: xfs
  Screen Resolution: 1920x1080
OpenBenchmarking.org notes
  - Transparent Huge Pages: madvise
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-yTrUTS/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
  - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
  - CPU Microcode: 0x8701021
  - BAR1 / Visible vRAM Size: 256 MB
  - Python 2.7.18 + Python 3.8.10
  - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of LFENCE IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
machine-learning-1 result overview (Sapphire AMD Radeon RX 6600 XT)
  ai-benchmark: Device Inference Score: 1040
  ai-benchmark: Device Training Score: 912
  ai-benchmark: Device AI Score: 1952
  caffe: AlexNet - CPU - 100: 51271
  caffe: AlexNet - CPU - 200: 101263
  caffe: AlexNet - CPU - 1000: 522588
  caffe: GoogleNet - CPU - 100: 129917
  caffe: GoogleNet - CPU - 200: 255985
  caffe: GoogleNet - CPU - 1000: 1283400
  deepspeech: CPU: 63.61859
  lczero: BLAS: 755
  mlpack: scikit_ica: 55.38
  mlpack: scikit_qda: 73.21
  mlpack: scikit_svm: 20.17
  mlpack: scikit_linearridgeregression: 2.27
  mnn: mobilenetV3: 2.944
  mnn: squeezenetv1.1: 4.690
  mnn: resnet-v2-50: 28.463
  mnn: SqueezeNetV1.0: 7.641
  mnn: MobileNetV2_224: 4.539
  mnn: mobilenet-v1-1.0: 3.545
  mnn: inception-v3: 38.556
  ncnn: CPU - mobilenet: 20.07
  ncnn: CPU-v2-v2 - mobilenet-v2: 6.02
  ncnn: CPU-v3-v3 - mobilenet-v3: 5.25
  ncnn: CPU - shufflenet-v2: 5.04
  ncnn: CPU - mnasnet: 5.24
  ncnn: CPU - efficientnet-b0: 10.03
  ncnn: CPU - blazeface: 2.09
  ncnn: CPU - googlenet: 17.61
  ncnn: CPU - vgg16: 71.56
  ncnn: CPU - resnet18: 19.21
  ncnn: CPU - alexnet: 15.56
  ncnn: CPU - resnet50: 34.07
  ncnn: CPU - yolov4-tiny: 31.27
  ncnn: CPU - squeezenet_ssd: 24.74
  ncnn: CPU - regnety_400m: 12.02
  numenta-nab: EXPoSE: 677.483
  numenta-nab: Relative Entropy: 20.665
  numenta-nab: Windowed Gaussian: 10.926
  numenta-nab: Earthgecko Skyline: 118.927
  numenta-nab: Bayesian Changepoint: 34.439
  numpy: 399.27
  onednn: IP Shapes 1D - f32 - CPU: 5.81323
  onednn: IP Shapes 3D - f32 - CPU: 10.9607
  onednn: IP Shapes 1D - u8s8f32 - CPU: 2.79836
  onednn: IP Shapes 3D - u8s8f32 - CPU: 2.68005
  onednn: Convolution Batch Shapes Auto - f32 - CPU: 24.4029
  onednn: Deconvolution Batch shapes_1d - f32 - CPU: 9.89324
  onednn: Deconvolution Batch shapes_3d - f32 - CPU: 6.96396
  onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU: 23.1182
  onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU: 3.83276
  onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU: 5.00423
  onednn: Recurrent Neural Network Training - f32 - CPU: 4372.26
  onednn: Recurrent Neural Network Inference - f32 - CPU: 2952.77
  onednn: Recurrent Neural Network Training - u8s8f32 - CPU: 4347.90
  onednn: Recurrent Neural Network Inference - u8s8f32 - CPU: 2965.30
  onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU: 5.26409
  onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU: 4409.83
  onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU: 2968.64
  onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU: 3.43890
  opencv: DNN - Deep Neural Network: 35232
  openvino: Face Detection 0106 FP16 - CPU (FPS): 1.98
  openvino: Face Detection 0106 FP16 - CPU (ms): 2025.22
  openvino: Face Detection 0106 FP32 - CPU (FPS): 2.00
  openvino: Face Detection 0106 FP32 - CPU (ms): 1983.98
  openvino: Person Detection 0106 FP16 - CPU (FPS): 1.49
  openvino: Person Detection 0106 FP16 - CPU (ms): 2656.17
  openvino: Person Detection 0106 FP32 - CPU (FPS): 1.48
  openvino: Person Detection 0106 FP32 - CPU (ms): 2658.87
  openvino: Age Gender Recognition Retail 0013 FP16 - CPU (FPS): 6345.96
  openvino: Age Gender Recognition Retail 0013 FP16 - CPU (ms): 0.6
  openvino: Age Gender Recognition Retail 0013 FP32 - CPU (FPS): 6366.83
  openvino: Age Gender Recognition Retail 0013 FP32 - CPU (ms): 0.60
  plaidml: No - Inference - VGG16 - CPU: 14.50
  plaidml: No - Inference - ResNet 50 - CPU: 6.34
  rbenchmark: 0.1552
  rnnoise: 18.672
  scikit-learn: 8.906
  shoc: OpenCL - S3D: 63.3701
  shoc: OpenCL - Triad: 11.5509
  shoc: OpenCL - FFT SP: 644.060
  shoc: OpenCL - MD5 Hash: 13.3172
  shoc: OpenCL - Reduction: 226.948
  shoc: OpenCL - GEMM SGEMM_N: 458.634
  shoc: OpenCL - Max SP Flops: 1367870
  shoc: OpenCL - Bus Speed Download: 14.1539
  shoc: OpenCL - Bus Speed Readback: 14.0851
  shoc: OpenCL - Texture Read Bandwidth: 555.289
  tensorflow-lite: SqueezeNet: 195961
  tensorflow-lite: Inception V4: 2848230
  tensorflow-lite: NASNet Mobile: 184144
  tensorflow-lite: Mobilenet Float: 132751
  tensorflow-lite: Mobilenet Quant: 137838
  tensorflow-lite: Inception ResNet V2: 2585110
  tnn: CPU - DenseNet: 2930.581
  tnn: CPU - MobileNet v2: 265.872
  tnn: CPU - SqueezeNet v2: 65.850
  tnn: CPU - SqueezeNet v1.1: 240.431
AI Benchmark Alpha 0.1.2 (Score, More Is Better)
  Device Inference Score - Result: 1040
  Device Training Score - Result: 912
  Device AI Score - Result: 1952
Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
  Model: AlexNet - Acceleration: CPU - Iterations: 100 - Result: 51271 (SE +/- 628.42, N = 3)
  Model: AlexNet - Acceleration: CPU - Iterations: 200 - Result: 101263 (SE +/- 703.95, N = 3)
  Model: AlexNet - Acceleration: CPU - Iterations: 1000 - Result: 522588 (SE +/- 4457.63, N = 3)
  Model: GoogleNet - Acceleration: CPU - Iterations: 100 - Result: 129917 (SE +/- 1340.99, N = 5)
  Model: GoogleNet - Acceleration: CPU - Iterations: 200 - Result: 255985 (SE +/- 859.86, N = 3)
  Model: GoogleNet - Acceleration: CPU - Iterations: 1000 - Result: 1283400 (SE +/- 3594.46, N = 3)
  (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
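The Caffe totals above are easier to compare once they are divided by the iteration count. The following Python sketch does that arithmetic on the six reported values. It assumes each reported time covers only the listed iterations, which this export does not state; the names caffe_total_ms and ms_per_iteration are illustrative and not part of the Phoronix Test Suite.

```python
# Caffe CPU results copied from this export: (model, iterations) -> total time in ms.
caffe_total_ms = {
    ("AlexNet", 100): 51271,
    ("AlexNet", 200): 101263,
    ("AlexNet", 1000): 522588,
    ("GoogleNet", 100): 129917,
    ("GoogleNet", 200): 255985,
    ("GoogleNet", 1000): 1283400,
}


def ms_per_iteration(total_ms, iterations):
    """Average time per iteration.

    Assumption: the reported total covers only the iterations themselves
    (no fixed setup cost), which the export does not confirm.
    """
    return total_ms / iterations


per_iteration = {
    key: ms_per_iteration(total, key[1]) for key, total in caffe_total_ms.items()
}

for (model, iterations), value in per_iteration.items():
    print(f"{model:9s} {iterations:5d} iterations: {value:8.2f} ms/iteration")
```

Under that assumption AlexNet works out to roughly 506 to 523 ms per iteration and GoogleNet to roughly 1280 to 1299 ms per iteration, so the totals scale close to linearly with the iteration count.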
DeepSpeech 0.6 (Seconds, Fewer Is Better)
  Acceleration: CPU - Result: 63.62 (SE +/- 0.15, N = 3)
LeelaChessZero 0.28 (Nodes Per Second, More Is Better)
  Backend: BLAS - Result: 755 (SE +/- 8.57, N = 3)
  (CXX) g++ options: -flto -pthread
Mlpack Benchmark (Seconds, Fewer Is Better)
  Benchmark: scikit_ica - Result: 55.38 (SE +/- 0.66, N = 4)
  Benchmark: scikit_qda - Result: 73.21 (SE +/- 0.13, N = 3)
  Benchmark: scikit_svm - Result: 20.17 (SE +/- 0.10, N = 3)
  Benchmark: scikit_linearridgeregression - Result: 2.27 (SE +/- 0.01, N = 3)
Mobile Neural Network 1.2 (ms, Fewer Is Better)
  Model: mobilenetV3 - Result: 2.944 (SE +/- 0.032, N = 4; MIN: 2.57 / MAX: 31.92)
  Model: squeezenetv1.1 - Result: 4.690 (SE +/- 0.045, N = 4; MIN: 4.19 / MAX: 44.08)
  Model: resnet-v2-50 - Result: 28.46 (SE +/- 0.18, N = 4; MIN: 25.55 / MAX: 105.02)
  Model: SqueezeNetV1.0 - Result: 7.641 (SE +/- 0.176, N = 4; MIN: 6.8 / MAX: 73.02)
  Model: MobileNetV2_224 - Result: 4.539 (SE +/- 0.251, N = 4; MIN: 3.74 / MAX: 126.01)
  Model: mobilenet-v1-1.0 - Result: 3.545 (SE +/- 0.027, N = 4; MIN: 3.21 / MAX: 36.78)
  Model: inception-v3 - Result: 38.56 (SE +/- 0.25, N = 4; MIN: 35.19 / MAX: 135.51)
  (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN 20210720 (ms, Fewer Is Better)
  Target: CPU - Model: mobilenet - Result: 20.07 (SE +/- 0.17, N = 3; MIN: 17.66 / MAX: 48.4)
  Target: CPU-v2-v2 - Model: mobilenet-v2 - Result: 6.02 (SE +/- 0.11, N = 3; MIN: 5.4 / MAX: 25.96)
  Target: CPU-v3-v3 - Model: mobilenet-v3 - Result: 5.25 (SE +/- 0.12, N = 3; MIN: 4.64 / MAX: 50.18)
  Target: CPU - Model: shufflenet-v2 - Result: 5.04 (SE +/- 0.24, N = 3; MIN: 4.4 / MAX: 145.63)
  Target: CPU - Model: mnasnet - Result: 5.24 (SE +/- 0.09, N = 3; MIN: 4.6 / MAX: 67.12)
  Target: CPU - Model: efficientnet-b0 - Result: 10.03 (SE +/- 1.79, N = 3; MIN: 7.43 / MAX: 971.49)
  Target: CPU - Model: blazeface - Result: 2.09 (SE +/- 0.03, N = 3; MIN: 1.91 / MAX: 21.03)
  Target: CPU - Model: googlenet - Result: 17.61 (SE +/- 0.04, N = 3; MIN: 16.12 / MAX: 37.13)
  Target: CPU - Model: vgg16 - Result: 71.56 (SE +/- 0.61, N = 3; MIN: 67.2 / MAX: 118.58)
  Target: CPU - Model: resnet18 - Result: 19.21 (SE +/- 0.20, N = 3; MIN: 17.9 / MAX: 58.54)
  Target: CPU - Model: alexnet - Result: 15.56 (SE +/- 0.15, N = 3; MIN: 14.66 / MAX: 47.71)
  Target: CPU - Model: resnet50 - Result: 34.07 (SE +/- 0.96, N = 3; MIN: 30.66 / MAX: 202.43)
  Target: CPU - Model: yolov4-tiny - Result: 31.27 (SE +/- 1.44, N = 3; MIN: 28.05 / MAX: 170.39)
  Target: CPU - Model: squeezenet_ssd - Result: 24.74 (SE +/- 0.22, N = 3; MIN: 22.79 / MAX: 118.36)
  Target: CPU - Model: regnety_400m - Result: 12.02 (SE +/- 0.37, N = 3; MIN: 10.63 / MAX: 69.61)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
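One way to read the "SE +/-" figures in the NCNN results is to express each standard error as a percentage of its mean, which makes noisy runs stand out. The Python sketch below does this with the values reported above. The names ncnn_results and relative_se_percent are illustrative and not part of the Phoronix Test Suite, and the ranking is only an aid for reading the export, not a statistic the suite reports.

```python
# NCNN CPU results copied from this export:
# model -> (mean latency in ms, standard error in ms).
ncnn_results = {
    "mobilenet": (20.07, 0.17),
    "mobilenet-v2": (6.02, 0.11),
    "mobilenet-v3": (5.25, 0.12),
    "shufflenet-v2": (5.04, 0.24),
    "mnasnet": (5.24, 0.09),
    "efficientnet-b0": (10.03, 1.79),
    "blazeface": (2.09, 0.03),
    "googlenet": (17.61, 0.04),
    "vgg16": (71.56, 0.61),
    "resnet18": (19.21, 0.20),
    "alexnet": (15.56, 0.15),
    "resnet50": (34.07, 0.96),
    "yolov4-tiny": (31.27, 1.44),
    "squeezenet_ssd": (24.74, 0.22),
    "regnety_400m": (12.02, 0.37),
}


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


# Sort from noisiest to most stable result.
ranked = sorted(
    ((model, relative_se_percent(mean, se)) for model, (mean, se) in ncnn_results.items()),
    key=lambda item: item[1],
    reverse=True,
)

for model, rel in ranked:
    print(f"{model:16s} {rel:6.2f} %")
```

With these numbers efficientnet-b0 ranks as the noisiest result (its reported maximum of 971.49 ms points to at least one outlier run), while googlenet is the most stable.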
Numenta Anomaly Benchmark 1.1 (Seconds, Fewer Is Better)
  Detector: EXPoSE - Result: 677.48 (SE +/- 3.88, N = 3)
  Detector: Relative Entropy - Result: 20.67 (SE +/- 0.26, N = 3)
  Detector: Windowed Gaussian - Result: 10.93 (SE +/- 0.06, N = 3)
  Detector: Earthgecko Skyline - Result: 118.93 (SE +/- 0.91, N = 3)
  Detector: Bayesian Changepoint - Result: 34.44 (SE +/- 0.42, N = 4)
Numpy Benchmark (Score, More Is Better)
  Result: 399.27 (SE +/- 0.08, N = 3)
oneDNN 2.1.2 (ms, Fewer Is Better)
  Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU - Result: 5.81323 (SE +/- 0.04729, N = 9; MIN: 5.24)
  Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU - Result: 10.96 (SE +/- 0.07, N = 3; MIN: 10.28)
  Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU - Result: 2.79836 (SE +/- 0.02844, N = 6; MIN: 2.54)
  Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU - Result: 2.68005 (SE +/- 0.01333, N = 3; MIN: 2.42)
  Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU - Result: 24.40 (SE +/- 0.10, N = 3; MIN: 22.76)
  Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU - Result: 9.89324 (SE +/- 0.11461, N = 4; MIN: 5.43)
  Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU - Result: 6.96396 (SE +/- 0.04387, N = 3; MIN: 6.65)
  Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU - Result: 23.12 (SE +/- 0.17, N = 3; MIN: 21.56)
  Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU - Result: 3.83276 (SE +/- 0.03404, N = 3; MIN: 3.46)
  Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU - Result: 5.00423 (SE +/- 0.08018, N = 15; MIN: 4.54)
  Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU - Result: 4372.26 (SE +/- 31.52, N = 15; MIN: 4139.47)
  Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU - Result: 2952.77 (SE +/- 16.34, N = 3; MIN: 2821.28)
  Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU - Result: 4347.90 (SE +/- 36.82, N = 3; MIN: 4150.56)
  Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU - Result: 2965.30 (SE +/- 8.58, N = 3; MIN: 2852.87)
  Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU - Result: 5.26409 (SE +/- 0.06121, N = 4; MIN: 4.48)
  Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU - Result: 4409.83 (SE +/- 26.42, N = 3; MIN: 4165.49)
  Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU - Result: 2968.64 (SE +/- 10.48, N = 3; MIN: 2848.36)
  Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU - Result: 3.43890 (SE +/- 0.12256, N = 12; MIN: 2.9)
  (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenCV 4.5.4 (ms, Fewer Is Better)
  Test: DNN - Deep Neural Network - Result: 35232 (SE +/- 846.55, N = 15)
  (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenVINO 2021.1 (FPS, More Is Better; ms, Fewer Is Better)
  Model: Face Detection 0106 FP16 - Device: CPU - Result: 1.98 FPS (SE +/- 0.02, N = 3)
  Model: Face Detection 0106 FP16 - Device: CPU - Result: 2025.22 ms (SE +/- 12.35, N = 3)
  Model: Face Detection 0106 FP32 - Device: CPU - Result: 2.00 FPS (SE +/- 0.03, N = 3)
  Model: Face Detection 0106 FP32 - Device: CPU - Result: 1983.98 ms (SE +/- 12.90, N = 3)
  Model: Person Detection 0106 FP16 - Device: CPU - Result: 1.49 FPS (SE +/- 0.01, N = 3)
  Model: Person Detection 0106 FP16 - Device: CPU - Result: 2656.17 ms (SE +/- 4.17, N = 3)
  Model: Person Detection 0106 FP32 - Device: CPU - Result: 1.48 FPS (SE +/- 0.01, N = 3)
  Model: Person Detection 0106 FP32 - Device: CPU - Result: 2658.87 ms (SE +/- 2.59, N = 3)
  Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU - Result: 6345.96 FPS (SE +/- 28.17, N = 3)
  Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU - Result: 0.6 ms (SE +/- 0.00, N = 3)
  Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU - Result: 6366.83 FPS (SE +/- 69.46, N = 3)
  Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU - Result: 0.60 ms (SE +/- 0.00, N = 3)
  (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
PlaidML (FPS, More Is Better)
  FP16: No - Mode: Inference - Network: VGG16 - Device: CPU - Result: 14.50 (SE +/- 0.20, N = 3)
  FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU - Result: 6.34 (SE +/- 0.01, N = 3)
R Benchmark (Seconds, Fewer Is Better)
  Result: 0.1552 (SE +/- 0.0014, N = 3)
  R scripting front-end version 3.6.3 (2020-02-29)
RNNoise 2020-06-28 (Seconds, Fewer Is Better)
  Result: 18.67 (SE +/- 0.17, N = 6)
  (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Scikit-Learn 0.22.1 (Seconds, Fewer Is Better)
  Result: 8.906 (SE +/- 0.009, N = 3)
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: S3D (GFLOPS, More Is Better). Sapphire AMD Radeon RX 6600 XT: 63.37, SE +/- 0.33, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 11.55, SE +/- 0.06, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better). Sapphire AMD Radeon RX 6600 XT: 644.06, SE +/- 0.55, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 13.32, SE +/- 0.00, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Reduction (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 226.95, SE +/- 0.14, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS, More Is Better). Sapphire AMD Radeon RX 6600 XT: 458.63, SE +/- 16.50, N = 15. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better). Sapphire AMD Radeon RX 6600 XT: 1367870, SE +/- 1637.11, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 14.15, SE +/- 0.00, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 14.09, SE +/- 0.00, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 555.29, SE +/- 5.64, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
TensorFlow Lite 2020-08-23 - Model: SqueezeNet (Microseconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 195961, SE +/- 654.02, N = 3.
TensorFlow Lite 2020-08-23 - Model: Inception V4 (Microseconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 2848230, SE +/- 8539.62, N = 3.
TensorFlow Lite 2020-08-23 - Model: NASNet Mobile (Microseconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 184144, SE +/- 2290.48, N = 3.
TensorFlow Lite 2020-08-23 - Model: Mobilenet Float (Microseconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 132751, SE +/- 383.43, N = 3.
TensorFlow Lite 2020-08-23 - Model: Mobilenet Quant (Microseconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 137838, SE +/- 479.84, N = 3.
TensorFlow Lite 2020-08-23 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 2585110, SE +/- 12212.78, N = 3.
TNN 0.3 - Target: CPU - Model: DenseNet (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 2930.58, SE +/- 2.09, N = 3, MIN: 2768.75 / MAX: 3109.15. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN 0.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 265.87, SE +/- 0.32, N = 3, MIN: 260.08 / MAX: 314.74. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN 0.3 - Target: CPU - Model: SqueezeNet v2 (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 65.85, SE +/- 0.20, N = 3, MIN: 64 / MAX: 67.74. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN 0.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 240.43, SE +/- 0.15, N = 3, MIN: 237.39 / MAX: 245.4. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
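Each result above reports a mean together with a standard error (SE) and a run count (N). One way to compare run-to-run noise across tests with different units is the relative standard error, SE divided by the mean. The sketch below is illustrative only and is not part of the Phoronix Test Suite: the function names and the 2% cutoff are assumptions, and the numbers are copied from a handful of the results listed above.

```python
# Illustrative sketch: relative standard error (SE / mean) for a few results.
# The (mean, SE, N) triples are copied from the results above; the helper
# names and the threshold are hypothetical, chosen only for this example.

results = {
    "SHOC GEMM SGEMM_N (GFLOPS)": (458.63, 16.50, 15),
    "SHOC FFT SP (GFLOPS)": (644.06, 0.55, 3),
    "OpenVINO Age Gender Recognition FP32 (FPS)": (6366.83, 69.46, 3),
    "TensorFlow Lite NASNet Mobile (Microseconds)": (184144, 2290.48, 3),
}


def relative_se_percent(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


def noisy_results(data, threshold_percent):
    """Return the names of results whose relative SE exceeds the threshold."""
    return [
        name
        for name, (mean, se, _runs) in data.items()
        if relative_se_percent(mean, se) > threshold_percent
    ]


if __name__ == "__main__":
    for name, (mean, se, runs) in results.items():
        print(f"{name}: {relative_se_percent(mean, se):.2f}% of mean, N = {runs}")
    # 2.0 is an arbitrary cutoff used only to illustrate the comparison.
    print("Above 2%:", noisy_results(results, 2.0))
```

With these inputs, only the SGEMM_N result exceeds the assumed 2% cutoff, and it is also the result with the highest run count (N = 15) in this section.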
Phoronix Test Suite v10.8.4