machine-learning-1
AMD Ryzen 7 3700X 8-Core testing with an ASUS ROG STRIX X570-E GAMING (3001 BIOS) and Sapphire AMD Radeon RX 6600 XT 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2203204-PAXR-MACHINE72&grr .
machine-learning-1 system details: Sapphire AMD Radeon RX 6600 XT
Processor: AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads)
Motherboard: ASUS ROG STRIX X570-E GAMING (3001 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32GB
Disk: 256GB HS-SSD-E1000 256G + 3 x 4001GB Western Digital WD40EFRX-68N + 4001GB Western Digital WD40EFZX-68A
Graphics: Sapphire AMD Radeon RX 6600 XT 8GB (2900/1000MHz)
Audio: AMD Device ab28
Network: Realtek RTL8125 2.5GbE + Intel I211
OS: Ubuntu 20.04
Kernel: 5.4.0-104-generic (x86_64)
Desktop: GNOME Shell 3.36.9
Display Server: X Server 1.20.13
OpenGL: 4.6 Mesa 22.0.0-devel (LLVM 13.0.1 DRM 3.44)
OpenCL: OpenCL 2.2 AMD-APP (3406.0)
Vulkan: 1.2.197
Compiler: GCC 9.4.0 + Clang 10.0.0-4ubuntu1
File-System: xfs
Screen Resolution: 1920x1080
- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-yTrUTS/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
- CPU Microcode: 0x8701021
- BAR1 / Visible vRAM Size: 256 MB
- Python 2.7.18 + Python 3.8.10
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of LFENCE IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
machine-learning-1 results summary
Test | Sapphire AMD Radeon RX 6600 XT
caffe: GoogleNet - CPU - 1000 | 1283400
numenta-nab: EXPoSE | 677.483
caffe: AlexNet - CPU - 1000 | 522588
ai-benchmark: Device AI Score | 1952
ai-benchmark: Device Training Score | 912
ai-benchmark: Device Inference Score | 1040
onednn: Recurrent Neural Network Training - f32 - CPU | 4372.26
lczero: BLAS | 755
plaidml: No - Inference - ResNet 50 - CPU | 6.34
caffe: GoogleNet - CPU - 200 | 255985
numpy: | 399.27
caffe: GoogleNet - CPU - 100 | 129917
tnn: CPU - DenseNet | 2930.581
plaidml: No - Inference - VGG16 - CPU | 14.50
opencv: DNN - Deep Neural Network | 35232
mnn: inception-v3 | 38.556
mnn: mobilenet-v1-1.0 | 3.545
mnn: MobileNetV2_224 | 4.539
mnn: SqueezeNetV1.0 | 7.641
mnn: resnet-v2-50 | 28.463
mnn: squeezenetv1.1 | 4.690
mnn: mobilenetV3 | 2.944
tensorflow-lite: Inception V4 | 2848230
tensorflow-lite: Inception ResNet V2 | 2585110
mlpack: scikit_qda | 73.21
shoc: OpenCL - Max SP Flops | 1367870
numenta-nab: Earthgecko Skyline | 118.927
caffe: AlexNet - CPU - 200 | 101263
ncnn: CPU - regnety_400m | 12.02
ncnn: CPU - squeezenet_ssd | 24.74
ncnn: CPU - yolov4-tiny | 31.27
ncnn: CPU - resnet50 | 34.07
ncnn: CPU - alexnet | 15.56
ncnn: CPU - resnet18 | 19.21
ncnn: CPU - vgg16 | 71.56
ncnn: CPU - googlenet | 17.61
ncnn: CPU - blazeface | 2.09
ncnn: CPU - efficientnet-b0 | 10.03
ncnn: CPU - mnasnet | 5.24
ncnn: CPU - shufflenet-v2 | 5.04
ncnn: CPU-v3-v3 - mobilenet-v3 | 5.25
ncnn: CPU-v2-v2 - mobilenet-v2 | 6.02
ncnn: CPU - mobilenet | 20.07
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 4409.83
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 4347.90
shoc: OpenCL - S3D | 63.3701
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 2968.64
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 2965.30
onednn: Recurrent Neural Network Inference - f32 - CPU | 2952.77
mlpack: scikit_ica | 55.38
openvino: Person Detection 0106 FP16 - CPU | 2656.17
openvino: Person Detection 0106 FP16 - CPU | 1.49
openvino: Face Detection 0106 FP16 - CPU | 2025.22
openvino: Face Detection 0106 FP16 - CPU | 1.98
openvino: Person Detection 0106 FP32 - CPU | 2658.87
openvino: Person Detection 0106 FP32 - CPU | 1.48
openvino: Face Detection 0106 FP32 - CPU | 1983.98
openvino: Face Detection 0106 FP32 - CPU | 2.00
tensorflow-lite: SqueezeNet | 195961
tensorflow-lite: NASNet Mobile | 184144
tensorflow-lite: Mobilenet Quant | 137838
tensorflow-lite: Mobilenet Float | 132751
openvino: Age Gender Recognition Retail 0013 FP16 - CPU | 0.6
openvino: Age Gender Recognition Retail 0013 FP16 - CPU | 6345.96
openvino: Age Gender Recognition Retail 0013 FP32 - CPU | 0.60
openvino: Age Gender Recognition Retail 0013 FP32 - CPU | 6366.83
mlpack: scikit_linearridgeregression | 2.27
caffe: AlexNet - CPU - 100 | 51271
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 3.43890
deepspeech: CPU | 63.61859
numenta-nab: Bayesian Changepoint | 34.439
onednn: IP Shapes 1D - f32 - CPU | 5.81323
rnnoise: | 18.672
shoc: OpenCL - GEMM SGEMM_N | 458.634
onednn: IP Shapes 1D - u8s8f32 - CPU | 2.79836
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 9.89324
mlpack: scikit_svm | 20.17
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 3.83276
rbenchmark: | 0.1552
numenta-nab: Relative Entropy | 20.665
tnn: CPU - MobileNet v2 | 265.872
tnn: CPU - SqueezeNet v1.1 | 240.431
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 5.26409
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 5.00423
numenta-nab: Windowed Gaussian | 10.926
onednn: IP Shapes 3D - f32 - CPU | 10.9607
onednn: IP Shapes 3D - u8s8f32 - CPU | 2.68005
scikit-learn: | 8.906
shoc: OpenCL - Texture Read Bandwidth | 555.289
onednn: Convolution Batch Shapes Auto - f32 - CPU | 24.4029
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 23.1182
tnn: CPU - SqueezeNet v2 | 65.850
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 6.96396
shoc: OpenCL - Triad | 11.5509
shoc: OpenCL - Reduction | 226.948
shoc: OpenCL - FFT SP | 644.060
shoc: OpenCL - MD5 Hash | 13.3172
shoc: OpenCL - Bus Speed Readback | 14.0851
shoc: OpenCL - Bus Speed Download | 14.1539
onednn: IP Shapes 1D - bf16bf16bf16 - CPU | (no value listed)
Caffe 2020-02-13, Model: GoogleNet - Acceleration: CPU - Iterations: 1000. Milli-Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 1283400 (SE +/- 3594.46, N = 3). (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
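Each result is reported as a mean with "SE +/- x, N = y". As a minimal sketch, assuming SE is the standard error of the mean (sample standard deviation divided by the square root of N), the relative error and the implied run-to-run standard deviation can be estimated from the Caffe GoogleNet figures. The function names are hypothetical and the individual run times are not part of this export.

```python
import math

def relative_standard_error(mean: float, se: float) -> float:
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean

def implied_sample_stddev(se: float, n: int) -> float:
    """Invert SE = s / sqrt(n) to estimate the sample standard deviation s.

    Assumption: the report uses the standard error of the mean.
    """
    return se * math.sqrt(n)

# Figures reported for Caffe, GoogleNet, CPU, 1000 iterations.
mean_ms, se_ms, runs = 1283400.0, 3594.46, 3

rse = relative_standard_error(mean_ms, se_ms)
stddev = implied_sample_stddev(se_ms, runs)
print(f"relative SE: {rse:.2f}%  implied std dev: {stddev:.0f} ms")
```

A relative SE well under 1% suggests the runs were closely repeatable; results with a much larger ratio deserve more caution.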
Numenta Anomaly Benchmark 1.1, Detector: EXPoSE. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 677.48 (SE +/- 3.88, N = 3).
Caffe 2020-02-13, Model: AlexNet - Acceleration: CPU - Iterations: 1000. Milli-Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 522588 (SE +/- 4457.63, N = 3). (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
AI Benchmark Alpha 0.1.2, Device AI Score. Score, More Is Better. Sapphire AMD Radeon RX 6600 XT: 1952.
AI Benchmark Alpha 0.1.2, Device Training Score. Score, More Is Better. Sapphire AMD Radeon RX 6600 XT: 912.
AI Benchmark Alpha 0.1.2, Device Inference Score. Score, More Is Better. Sapphire AMD Radeon RX 6600 XT: 1040.
oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 4372.26 (SE +/- 31.52, N = 15; MIN: 4139.47). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
LeelaChessZero 0.28, Backend: BLAS. Nodes Per Second, More Is Better. Sapphire AMD Radeon RX 6600 XT: 755 (SE +/- 8.57, N = 3). (CXX) g++ options: -flto -pthread
PlaidML, FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 6.34 (SE +/- 0.01, N = 3).
Caffe 2020-02-13, Model: GoogleNet - Acceleration: CPU - Iterations: 200. Milli-Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 255985 (SE +/- 859.86, N = 3). (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Numpy Benchmark. Score, More Is Better. Sapphire AMD Radeon RX 6600 XT: 399.27 (SE +/- 0.08, N = 3).
Caffe 2020-02-13, Model: GoogleNet - Acceleration: CPU - Iterations: 100. Milli-Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 129917 (SE +/- 1340.99, N = 5). (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
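The three Caffe GoogleNet results above report total time for 100, 200, and 1000 iterations. A rough sketch of normalizing them to time per iteration, to check whether the cost scales linearly with iteration count, follows. The dictionary and function names are hypothetical; only the totals come from the results.

```python
# Total time in milliseconds reported for Caffe GoogleNet on CPU,
# keyed by the number of iterations in each test.
googlenet_total_ms = {100: 129917, 200: 255985, 1000: 1283400}

def ms_per_iteration(total_ms_by_iters):
    """Divide each total by its iteration count."""
    return {iters: total / iters for iters, total in total_ms_by_iters.items()}

per_iter = ms_per_iteration(googlenet_total_ms)
for iters, ms in sorted(per_iter.items()):
    print(f"{iters:>5} iterations: {ms:.1f} ms per iteration")

# Relative gap between the slowest and fastest per-iteration figure.
spread = max(per_iter.values()) / min(per_iter.values()) - 1.0
print(f"spread across iteration counts: {spread:.1%}")
```

If the spread is small, the longer runs add little information beyond the shorter ones for this model.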
TNN 0.3, Target: CPU - Model: DenseNet. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2930.58 (SE +/- 2.09, N = 3; MIN: 2768.75 / MAX: 3109.15). (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
PlaidML, FP16: No - Mode: Inference - Network: VGG16 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 14.50 (SE +/- 0.20, N = 3).
OpenCV 4.5.4, Test: DNN - Deep Neural Network. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 35232 (SE +/- 846.55, N = 15). (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
Mobile Neural Network 1.2, Model: inception-v3. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 38.56 (SE +/- 0.25, N = 4; MIN: 35.19 / MAX: 135.51). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2, Model: mobilenet-v1-1.0. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 3.545 (SE +/- 0.027, N = 4; MIN: 3.21 / MAX: 36.78). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2, Model: MobileNetV2_224. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 4.539 (SE +/- 0.251, N = 4; MIN: 3.74 / MAX: 126.01). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2, Model: SqueezeNetV1.0. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 7.641 (SE +/- 0.176, N = 4; MIN: 6.8 / MAX: 73.02). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2, Model: resnet-v2-50. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 28.46 (SE +/- 0.18, N = 4; MIN: 25.55 / MAX: 105.02). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2, Model: squeezenetv1.1. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 4.690 (SE +/- 0.045, N = 4; MIN: 4.19 / MAX: 44.08). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2, Model: mobilenetV3. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2.944 (SE +/- 0.032, N = 4; MIN: 2.57 / MAX: 31.92). (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TensorFlow Lite 2020-08-23, Model: Inception V4. Microseconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2848230 (SE +/- 8539.62, N = 3).
TensorFlow Lite 2020-08-23, Model: Inception ResNet V2. Microseconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2585110 (SE +/- 12212.78, N = 3).
Mlpack Benchmark, Benchmark: scikit_qda. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 73.21 (SE +/- 0.13, N = 3).
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: Max SP Flops. GFLOPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 1367870 (SE +/- 1637.11, N = 3). (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Numenta Anomaly Benchmark 1.1, Detector: Earthgecko Skyline. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 118.93 (SE +/- 0.91, N = 3).
Caffe 2020-02-13, Model: AlexNet - Acceleration: CPU - Iterations: 200. Milli-Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 101263 (SE +/- 703.95, N = 3). (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN 20210720, Target: CPU - Model: regnety_400m. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 12.02 (SE +/- 0.37, N = 3; MIN: 10.63 / MAX: 69.61). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: squeezenet_ssd. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 24.74 (SE +/- 0.22, N = 3; MIN: 22.79 / MAX: 118.36). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: yolov4-tiny. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 31.27 (SE +/- 1.44, N = 3; MIN: 28.05 / MAX: 170.39). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: resnet50. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 34.07 (SE +/- 0.96, N = 3; MIN: 30.66 / MAX: 202.43). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: alexnet. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 15.56 (SE +/- 0.15, N = 3; MIN: 14.66 / MAX: 47.71). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: resnet18. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 19.21 (SE +/- 0.20, N = 3; MIN: 17.9 / MAX: 58.54). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: vgg16. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 71.56 (SE +/- 0.61, N = 3; MIN: 67.2 / MAX: 118.58). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: googlenet. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 17.61 (SE +/- 0.04, N = 3; MIN: 16.12 / MAX: 37.13). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: blazeface. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2.09 (SE +/- 0.03, N = 3; MIN: 1.91 / MAX: 21.03). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: efficientnet-b0. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 10.03 (SE +/- 1.79, N = 3; MIN: 7.43 / MAX: 971.49). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: mnasnet. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 5.24 (SE +/- 0.09, N = 3; MIN: 4.6 / MAX: 67.12). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: shufflenet-v2. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 5.04 (SE +/- 0.24, N = 3; MIN: 4.4 / MAX: 145.63). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU-v3-v3 - Model: mobilenet-v3. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 5.25 (SE +/- 0.12, N = 3; MIN: 4.64 / MAX: 50.18). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU-v2-v2 - Model: mobilenet-v2. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 6.02 (SE +/- 0.11, N = 3; MIN: 5.4 / MAX: 25.96). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720, Target: CPU - Model: mobilenet. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 20.07 (SE +/- 0.17, N = 3; MIN: 17.66 / MAX: 48.4). (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 4409.83 (SE +/- 26.42, N = 3; MIN: 4165.49). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 4347.90 (SE +/- 36.82, N = 3; MIN: 4150.56). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: S3D. GFLOPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 63.37 (SE +/- 0.33, N = 3). (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2968.64 (SE +/- 10.48, N = 3; MIN: 2848.36). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2965.30 (SE +/- 8.58, N = 3; MIN: 2852.87). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2952.77 (SE +/- 16.34, N = 3; MIN: 2821.28). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Mlpack Benchmark, Benchmark: scikit_ica. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 55.38 (SE +/- 0.66, N = 4).
OpenVINO 2021.1, Model: Person Detection 0106 FP16 - Device: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2656.17 (SE +/- 4.17, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Person Detection 0106 FP16 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 1.49 (SE +/- 0.01, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Face Detection 0106 FP16 - Device: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2025.22 (SE +/- 12.35, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Face Detection 0106 FP16 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 1.98 (SE +/- 0.02, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Person Detection 0106 FP32 - Device: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2658.87 (SE +/- 2.59, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Person Detection 0106 FP32 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 1.48 (SE +/- 0.01, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Face Detection 0106 FP32 - Device: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 1983.98 (SE +/- 12.90, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Face Detection 0106 FP32 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 2.00 (SE +/- 0.03, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
TensorFlow Lite 2020-08-23, Model: SqueezeNet. Microseconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 195961 (SE +/- 654.02, N = 3).
TensorFlow Lite 2020-08-23, Model: NASNet Mobile. Microseconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 184144 (SE +/- 2290.48, N = 3).
TensorFlow Lite 2020-08-23, Model: Mobilenet Quant. Microseconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 137838 (SE +/- 479.84, N = 3).
TensorFlow Lite 2020-08-23, Model: Mobilenet Float. Microseconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 132751 (SE +/- 383.43, N = 3).
OpenVINO 2021.1, Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 0.6 (SE +/- 0.00, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 6345.96 (SE +/- 28.17, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 0.60 (SE +/- 0.00, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
OpenVINO 2021.1, Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU. FPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 6366.83 (SE +/- 69.46, N = 3). (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread
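Each OpenVINO model is reported twice, once as latency (ms) and once as throughput (FPS), and the two figures are not simple reciprocals. As a hedged sketch, Little's law (items in flight = throughput x latency) gives the average number of inference requests that would have to be in flight for both figures to hold. Reading that number as the benchmark's concurrency setting is an assumption; the export does not state how many requests were run in parallel. The names below are hypothetical.

```python
# (latency in ms, throughput in FPS) as reported for OpenVINO 2021.1 on CPU.
openvino_results = {
    "Person Detection 0106 FP16": (2656.17, 1.49),
    "Face Detection 0106 FP16": (2025.22, 1.98),
    "Person Detection 0106 FP32": (2658.87, 1.48),
    "Face Detection 0106 FP32": (1983.98, 2.00),
    "Age Gender Recognition Retail 0013 FP16": (0.6, 6345.96),
    "Age Gender Recognition Retail 0013 FP32": (0.60, 6366.83),
}

def implied_in_flight(latency_ms: float, fps: float) -> float:
    """Little's law: average requests in flight = throughput * latency (in seconds)."""
    return fps * latency_ms / 1000.0

implied = {
    model: implied_in_flight(latency_ms, fps)
    for model, (latency_ms, fps) in openvino_results.items()
}
for model, n in implied.items():
    print(f"{model}: about {n:.2f} requests in flight")
```

The sub-millisecond latencies are rounded to one or two digits in the report, so the figures for the Age Gender Recognition model are only approximate.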
Mlpack Benchmark, Benchmark: scikit_linearridgeregression. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2.27 (SE +/- 0.01, N = 3).
Caffe 2020-02-13, Model: AlexNet - Acceleration: CPU - Iterations: 100. Milli-Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 51271 (SE +/- 628.42, N = 3). (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
oneDNN 2.1.2, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 3.43890 (SE +/- 0.12256, N = 12; MIN: 2.9). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
DeepSpeech 0.6, Acceleration: CPU. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 63.62 (SE +/- 0.15, N = 3).
Numenta Anomaly Benchmark 1.1, Detector: Bayesian Changepoint. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 34.44 (SE +/- 0.42, N = 4).
oneDNN 2.1.2, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 5.81323 (SE +/- 0.04729, N = 9; MIN: 5.24). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
RNNoise 2020-06-28. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 18.67 (SE +/- 0.17, N = 6). (CC) gcc options: -O2 -pedantic -fvisibility=hidden
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL - Benchmark: GEMM SGEMM_N. GFLOPS, More Is Better. Sapphire AMD Radeon RX 6600 XT: 458.63 (SE +/- 16.50, N = 15). (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
oneDNN 2.1.2, Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 2.79836 (SE +/- 0.02844, N = 6; MIN: 2.54). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 9.89324 (SE +/- 0.11461, N = 4; MIN: 5.43). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Mlpack Benchmark, Benchmark: scikit_svm. Seconds, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 20.17 (SE +/- 0.10, N = 3).
oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU. ms, Fewer Is Better. Sapphire AMD Radeon RX 6600 XT: 3.83276 (SE +/- 0.03404, N = 3; MIN: 3.46). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
R Benchmark OpenBenchmarking.org Seconds, Fewer Is Better R Benchmark Sapphire AMD Radeon RX 6600 XT 0.0349 0.0698 0.1047 0.1396 0.1745 SE +/- 0.0014, N = 3 0.1552 1. R scripting front-end version 3.6.3 (2020-02-29)
Numenta Anomaly Benchmark Detector: Relative Entropy OpenBenchmarking.org Seconds, Fewer Is Better Numenta Anomaly Benchmark 1.1 Detector: Relative Entropy Sapphire AMD Radeon RX 6600 XT 5 10 15 20 25 SE +/- 0.26, N = 3 20.67
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: MobileNet v2 Sapphire AMD Radeon RX 6600 XT 60 120 180 240 300 SE +/- 0.32, N = 3 265.87 MIN: 260.08 / MAX: 314.74 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 Sapphire AMD Radeon RX 6600 XT 50 100 150 200 250 SE +/- 0.15, N = 3 240.43 MIN: 237.39 / MAX: 245.4 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 5.26409, SE +/- 0.06121, N = 4, MIN: 4.48. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 5.00423, SE +/- 0.08018, N = 15, MIN: 4.54. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Numenta Anomaly Benchmark 1.1 - Detector: Windowed Gaussian (Seconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 10.93, SE +/- 0.06, N = 3.
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 10.96, SE +/- 0.07, N = 3, MIN: 10.28. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 2.68005, SE +/- 0.01333, N = 3, MIN: 2.42. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Scikit-Learn 0.22.1 (Seconds, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 8.906, SE +/- 0.009, N = 3.
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 555.29, SE +/- 5.64, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 24.40, SE +/- 0.10, N = 3, MIN: 22.76. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 23.12, SE +/- 0.17, N = 3, MIN: 21.56. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
TNN 0.3 - Target: CPU - Model: SqueezeNet v2 (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 65.85, SE +/- 0.20, N = 3, MIN: 64 / MAX: 67.74. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Sapphire AMD Radeon RX 6600 XT: 6.96396, SE +/- 0.04387, N = 3, MIN: 6.65. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 11.55, SE +/- 0.06, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Reduction (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 226.95, SE +/- 0.14, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better). Sapphire AMD Radeon RX 6600 XT: 644.06, SE +/- 0.55, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 13.32, SE +/- 0.00, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 14.09, SE +/- 0.00, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better). Sapphire AMD Radeon RX 6600 XT: 14.15, SE +/- 0.00, N = 3. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
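The SE and N figures reported with each result can be turned into a rough noise estimate when comparing runs. The sketch below is illustrative only: it assumes SE is the standard error of the mean (sample standard deviation divided by the square root of N), which this export does not state, and the `summarize` helper is a hypothetical name. The three input rows are copied from results in this section.

```python
import math

def summarize(mean, se, n):
    """Return (relative SE in percent, implied sample standard deviation).

    Assumption: SE = s / sqrt(N), so s = SE * sqrt(N).
    """
    rel_se_pct = 100.0 * se / mean
    std_dev = se * math.sqrt(n)
    return rel_se_pct, std_dev

# (mean, SE, N) copied from three results in this section.
results = {
    "oneDNN Deconvolution shapes_1d f32 (ms)": (9.89324, 0.11461, 4),
    "oneDNN Deconvolution shapes_3d u8s8f32 (ms)": (5.00423, 0.08018, 15),
    "SHOC Texture Read Bandwidth (GB/s)": (555.29, 5.64, 3),
}

for name, (mean, se, n) in results.items():
    rel, sd = summarize(mean, se, n)
    print(f"{name}: relative SE {rel:.2f}%, implied std dev {sd:.4f}")
```

A relative SE makes results in different units (ms, GB/s, GFLOPS) comparable in terms of run-to-run spread, which the raw SE values are not.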
Phoronix Test Suite v10.8.4