Ryzen 5 3600XT 2021 AMD Ryzen 5 3600XT 6-Core testing with a MSI X470 GAMING M7 AC (MS-7B77) v1.0 (1.E0 BIOS) and MSI AMD Radeon R7 370 / R9 270/370 OEM 4GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101018-HA-RYZEN536000&grt&sor .
Ryzen 5 3600XT 2021 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Linux 5.8 Repeat 2 Repeat 3 AMD Ryzen 5 3600XT 6-Core @ 3.80GHz (6 Cores / 12 Threads) MSI X470 GAMING M7 AC (MS-7B77) v1.0 (1.E0 BIOS) AMD Starship/Matisse 16GB 500GB CT500P2SSD8 MSI AMD Radeon R7 370 / R9 270/370 OEM 4GB AMD Oland/Hainan/Cape G237HL Qualcomm Atheros Killer E2500 + Intel 8265 / 8275 Ubuntu 20.10 5.8.0-28-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 modesetting 1.20.9 4.5 Mesa 20.2.1 (LLVM 11.0.0) GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Ryzen 5 3600XT 2021 brl-cad: VGR Performance Metric build2: Time To Compile clomp: Static OMP Speedup coremark: CoreMark Size 666 - Iterations Per Second cryptsetup: PBKDF2-sha512 cryptsetup: PBKDF2-whirlpool cryptsetup: AES-XTS 256b Encryption cryptsetup: AES-XTS 256b Decryption cryptsetup: Serpent-XTS 256b Encryption cryptsetup: Serpent-XTS 256b Decryption cryptsetup: Twofish-XTS 256b Encryption cryptsetup: Twofish-XTS 256b Decryption cryptsetup: AES-XTS 512b Encryption cryptsetup: AES-XTS 512b Decryption cryptsetup: Serpent-XTS 512b Encryption cryptsetup: Serpent-XTS 512b Decryption cryptsetup: Twofish-XTS 512b Encryption cryptsetup: Twofish-XTS 512b Decryption encode-ape: WAV To APE ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m node-web-tooling: encode-ogg: WAV To Ogg onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU encode-opus: WAV To Opus Encode phpbench: PHP Benchmark Suite simdjson: Kostya simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID sqlite-speedtest: Timed Time - Size 1,000 build-clash: Time To Compile build-eigen: Time To Compile build-ffmpeg: Time To Compile hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA unpack-firefox: firefox-84.0.source.tar.xz encode-wavpack: WAV To WavPack Linux 5.8 Repeat 2 Repeat 3 93535 177.258 16.9 269797.669527 1800801 753350 2073.5 2080.6 733.5 722.0 429.5 425.4 1826.8 1828.5 737.8 726.3 429.4 428.4 11.422 20.14 6.12 5.29 7.76 5.27 8.30 2.53 17.94 68.66 18.52 14.94 34.89 30.78 24.87 17.66 11.31 18.719 5.82329 12.6446 3.49733 3.06815 24.6072 6.78426 9.18729 23.6698 8.78558 6.67768 5069.40 3385.70 5110.57 3389.04 5.04874 5135.69 3378.54 4.41400 7.138 663340 0.68 0.44 0.76 0.77 57.470 376.604 75.134 65.486 106.406 11.542 17.506 12.412 92847 176.752 17.1 271723.339322 1757676 735744 2095.4 2088.5 734.9 718.6 426.7 427.1 1840.9 1843.9 742.4 730.6 433.6 435.0 11.452 19.72 6.10 5.20 7.72 5.22 8.21 2.45 17.88 68.51 18.50 14.79 34.85 30.61 24.54 17.81 11.19 18.427 5.92486 12.5818 3.51698 3.01204 24.5346 6.79426 9.18130 23.7672 8.76683 6.68031 5078.51 3357.80 5109.67 3361.25 5.03612 5141.79 3385.30 4.39427 7.197 666507 0.67 0.44 0.76 0.79 56.652 375.108 75.400 65.425 106.978 11.347 17.458 12.404 92316 176.924 16.8 268921.700728 1768539 743090 2067.5 2068.2 735.7 721.0 426.2 430.9 1857.5 1858.4 746.7 731.4 434.6 433.3 11.392 19.89 6.02 5.36 7.74 5.23 8.35 2.43 17.71 69.12 18.65 15.19 34.62 30.71 24.58 17.74 11.29 18.645 5.92397 12.7571 3.50419 3.04147 24.5161 6.78786 9.15703 23.6542 8.69285 6.68541 5080.07 3366.59 5132.65 3387.66 5.08034 5162.11 3388.22 4.41212 7.081 659345 0.67 0.44 0.75 0.77 57.559 376.490 75.438 65.540 107.042 11.324 17.470 12.477 OpenBenchmarking.org
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric Linux 5.8 Repeat 2 Repeat 3 20K 40K 60K 80K 100K 93535 92847 92316 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Repeat 2 Repeat 3 Linux 5.8 40 80 120 160 200 SE +/- 0.69, N = 3 SE +/- 0.28, N = 3 SE +/- 0.55, N = 3 176.75 176.92 177.26
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup Repeat 2 Linux 5.8 Repeat 3 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.23, N = 3 SE +/- 0.21, N = 3 17.1 16.9 16.8 1. (CC) gcc options: -fopenmp -O3 -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Repeat 2 Linux 5.8 Repeat 3 60K 120K 180K 240K 300K SE +/- 1424.10, N = 3 SE +/- 48.96, N = 3 SE +/- 1381.97, N = 3 271723.34 269797.67 268921.70 1. (CC) gcc options: -O2 -lrt" -lrt
Cryptsetup PBKDF2-sha512 OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-sha512 Linux 5.8 Repeat 3 Repeat 2 400K 800K 1200K 1600K 2000K SE +/- 11811.84, N = 3 SE +/- 15858.95, N = 3 SE +/- 15938.76, N = 3 1800801 1768539 1757676
Cryptsetup PBKDF2-whirlpool OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-whirlpool Linux 5.8 Repeat 3 Repeat 2 160K 320K 480K 640K 800K SE +/- 4850.18, N = 3 SE +/- 6755.62, N = 3 SE +/- 5231.74, N = 3 753350 743090 735744
Cryptsetup AES-XTS 256b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Encryption Repeat 2 Linux 5.8 Repeat 3 400 800 1200 1600 2000 SE +/- 11.85, N = 3 SE +/- 19.49, N = 3 SE +/- 17.56, N = 3 2095.4 2073.5 2067.5
Cryptsetup AES-XTS 256b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Decryption Repeat 2 Linux 5.8 Repeat 3 400 800 1200 1600 2000 SE +/- 13.17, N = 3 SE +/- 9.96, N = 3 SE +/- 18.26, N = 3 2088.5 2080.6 2068.2
Cryptsetup Serpent-XTS 256b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Encryption Repeat 3 Repeat 2 Linux 5.8 160 320 480 640 800 SE +/- 7.00, N = 3 SE +/- 7.26, N = 3 SE +/- 6.74, N = 3 735.7 734.9 733.5
Cryptsetup Serpent-XTS 256b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Decryption Linux 5.8 Repeat 3 Repeat 2 160 320 480 640 800 SE +/- 5.38, N = 3 SE +/- 6.51, N = 3 SE +/- 7.77, N = 3 722.0 721.0 718.6
Cryptsetup Twofish-XTS 256b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Encryption Linux 5.8 Repeat 2 Repeat 3 90 180 270 360 450 SE +/- 3.09, N = 3 SE +/- 5.15, N = 3 SE +/- 4.25, N = 3 429.5 426.7 426.2
Cryptsetup Twofish-XTS 256b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Decryption Repeat 3 Repeat 2 Linux 5.8 90 180 270 360 450 SE +/- 2.68, N = 3 SE +/- 4.01, N = 3 SE +/- 4.33, N = 3 430.9 427.1 425.4
Cryptsetup AES-XTS 512b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Encryption Repeat 3 Repeat 2 Linux 5.8 400 800 1200 1600 2000 SE +/- 2.66, N = 3 SE +/- 17.19, N = 3 SE +/- 17.03, N = 3 1857.5 1840.9 1826.8
Cryptsetup AES-XTS 512b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Decryption Repeat 3 Repeat 2 Linux 5.8 400 800 1200 1600 2000 SE +/- 2.48, N = 3 SE +/- 16.96, N = 3 SE +/- 17.25, N = 3 1858.4 1843.9 1828.5
Cryptsetup Serpent-XTS 512b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Encryption Repeat 3 Repeat 2 Linux 5.8 160 320 480 640 800 SE +/- 0.33, N = 3 SE +/- 7.66, N = 3 SE +/- 6.76, N = 3 746.7 742.4 737.8
Cryptsetup Serpent-XTS 512b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Decryption Repeat 3 Repeat 2 Linux 5.8 160 320 480 640 800 SE +/- 0.18, N = 3 SE +/- 3.91, N = 3 SE +/- 5.24, N = 3 731.4 730.6 726.3
Cryptsetup Twofish-XTS 512b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Encryption Repeat 3 Repeat 2 Linux 5.8 90 180 270 360 450 SE +/- 0.15, N = 3 SE +/- 2.88, N = 3 SE +/- 4.82, N = 3 434.6 433.6 429.4
Cryptsetup Twofish-XTS 512b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Decryption Repeat 2 Repeat 3 Linux 5.8 90 180 270 360 450 SE +/- 0.27, N = 3 SE +/- 0.17, N = 3 SE +/- 4.97, N = 3 435.0 433.3 428.4
Monkey Audio Encoding WAV To APE OpenBenchmarking.org Seconds, Fewer Is Better Monkey Audio Encoding 3.99.6 WAV To APE Repeat 3 Linux 5.8 Repeat 2 3 6 9 12 15 SE +/- 0.06, N = 5 SE +/- 0.06, N = 5 SE +/- 0.12, N = 21 11.39 11.42 11.45 1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet Repeat 2 Repeat 3 Linux 5.8 5 10 15 20 25 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 19.72 19.89 20.14 MIN: 18.42 / MAX: 58.23 MIN: 18.82 / MAX: 52.28 MIN: 18.78 / MAX: 54.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 Repeat 3 Repeat 2 Linux 5.8 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 6.02 6.10 6.12 MIN: 5.47 / MAX: 14.76 MIN: 5.43 / MAX: 16.66 MIN: 5.4 / MAX: 58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 Repeat 2 Linux 5.8 Repeat 3 1.206 2.412 3.618 4.824 6.03 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 5.20 5.29 5.36 MIN: 4.65 / MAX: 25.68 MIN: 4.69 / MAX: 30.34 MIN: 4.81 / MAX: 14.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 Repeat 2 Repeat 3 Linux 5.8 2 4 6 8 10 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 7.72 7.74 7.76 MIN: 7.17 / MAX: 13.38 MIN: 7.17 / MAX: 15.91 MIN: 7.17 / MAX: 17.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet Repeat 2 Repeat 3 Linux 5.8 1.1858 2.3716 3.5574 4.7432 5.929 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 5.22 5.23 5.27 MIN: 4.63 / MAX: 10.82 MIN: 4.66 / MAX: 36.04 MIN: 4.67 / MAX: 10.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 Repeat 2 Linux 5.8 Repeat 3 2 4 6 8 10 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 8.21 8.30 8.35 MIN: 7.37 / MAX: 31.37 MIN: 7.42 / MAX: 26.44 MIN: 7.51 / MAX: 32.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface Repeat 3 Repeat 2 Linux 5.8 0.5693 1.1386 1.7079 2.2772 2.8465 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 2.43 2.45 2.53 MIN: 2.29 / MAX: 7.08 MIN: 2.29 / MAX: 7.26 MIN: 2.29 / MAX: 7.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet Repeat 3 Repeat 2 Linux 5.8 4 8 12 16 20 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.25, N = 3 17.71 17.88 17.94 MIN: 16.4 / MAX: 48.5 MIN: 16.43 / MAX: 56.75 MIN: 16.31 / MAX: 59.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 Repeat 2 Linux 5.8 Repeat 3 15 30 45 60 75 SE +/- 0.25, N = 3 SE +/- 0.29, N = 3 SE +/- 0.22, N = 3 68.51 68.66 69.12 MIN: 65.01 / MAX: 111.99 MIN: 65.3 / MAX: 107.34 MIN: 65.47 / MAX: 108.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 Repeat 2 Linux 5.8 Repeat 3 5 10 15 20 25 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 18.50 18.52 18.65 MIN: 17.04 / MAX: 37.4 MIN: 17.05 / MAX: 56.7 MIN: 17.29 / MAX: 40.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet Repeat 2 Linux 5.8 Repeat 3 4 8 12 16 20 SE +/- 0.22, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 14.79 14.94 15.19 MIN: 13.34 / MAX: 40.9 MIN: 13.91 / MAX: 38.48 MIN: 13.48 / MAX: 42.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 Repeat 3 Repeat 2 Linux 5.8 8 16 24 32 40 SE +/- 0.29, N = 3 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 34.62 34.85 34.89 MIN: 32.92 / MAX: 80.63 MIN: 32.75 / MAX: 81.04 MIN: 33.24 / MAX: 80.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny Repeat 2 Repeat 3 Linux 5.8 7 14 21 28 35 SE +/- 0.17, N = 3 SE +/- 0.14, N = 3 SE +/- 0.30, N = 3 30.61 30.71 30.78 MIN: 28.74 / MAX: 89.42 MIN: 29.37 / MAX: 65.42 MIN: 28.76 / MAX: 80.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd Repeat 2 Repeat 3 Linux 5.8 6 12 18 24 30 SE +/- 0.16, N = 3 SE +/- 0.16, N = 3 SE +/- 0.46, N = 3 24.54 24.58 24.87 MIN: 22.47 / MAX: 71.41 MIN: 22.83 / MAX: 56.42 MIN: 22.53 / MAX: 78.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m Linux 5.8 Repeat 3 Repeat 2 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.06, N = 3 SE +/- 0.23, N = 3 17.66 17.74 17.81 MIN: 17.1 / MAX: 57.9 MIN: 17.04 / MAX: 62.12 MIN: 16.9 / MAX: 53.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark Linux 5.8 Repeat 3 Repeat 2 3 6 9 12 15 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 11.31 11.29 11.19 1. Nodejs
v12.18.2
Ogg Audio Encoding WAV To Ogg OpenBenchmarking.org Seconds, Fewer Is Better Ogg Audio Encoding 1.3.4 WAV To Ogg Repeat 2 Repeat 3 Linux 5.8 5 10 15 20 25 SE +/- 0.12, N = 3 SE +/- 0.15, N = 3 SE +/- 0.07, N = 3 18.43 18.65 18.72 1. (CC) gcc options: -O2 -ffast-math -fsigned-char
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Linux 5.8 Repeat 3 Repeat 2 1.3331 2.6662 3.9993 5.3324 6.6655 SE +/- 0.01960, N = 3 SE +/- 0.00727, N = 3 SE +/- 0.01363, N = 3 5.82329 5.92397 5.92486 MIN: 5.31 MIN: 5.37 MIN: 5.36 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Repeat 2 Linux 5.8 Repeat 3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 12.58 12.64 12.76 MIN: 11.93 MIN: 12.06 MIN: 12.06 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Linux 5.8 Repeat 3 Repeat 2 0.7913 1.5826 2.3739 3.1652 3.9565 SE +/- 0.00924, N = 3 SE +/- 0.00745, N = 3 SE +/- 0.01058, N = 3 3.49733 3.50419 3.51698 MIN: 3.35 MIN: 3.36 MIN: 3.35 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Repeat 2 Repeat 3 Linux 5.8 0.6903 1.3806 2.0709 2.7612 3.4515 SE +/- 0.04368, N = 3 SE +/- 0.02149, N = 3 SE +/- 0.01517, N = 3 3.01204 3.04147 3.06815 MIN: 2.74 MIN: 2.76 MIN: 2.76 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Repeat 3 Repeat 2 Linux 5.8 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.17, N = 3 SE +/- 0.09, N = 3 24.52 24.53 24.61 MIN: 22.82 MIN: 22.85 MIN: 22.92 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Linux 5.8 Repeat 3 Repeat 2 2 4 6 8 10 SE +/- 0.03755, N = 3 SE +/- 0.02855, N = 3 SE +/- 0.03413, N = 3 6.78426 6.78786 6.79426 MIN: 6.33 MIN: 6.35 MIN: 6.3 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Repeat 3 Repeat 2 Linux 5.8 3 6 9 12 15 SE +/- 0.03988, N = 3 SE +/- 0.01634, N = 3 SE +/- 0.02921, N = 3 9.15703 9.18130 9.18729 MIN: 8.89 MIN: 8.9 MIN: 8.84 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Repeat 3 Linux 5.8 Repeat 2 6 12 18 24 30 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 23.65 23.67 23.77 MIN: 22.52 MIN: 22.69 MIN: 22.4 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Repeat 3 Repeat 2 Linux 5.8 2 4 6 8 10 SE +/- 0.01336, N = 3 SE +/- 0.07453, N = 3 SE +/- 0.12505, N = 3 8.69285 8.76683 8.78558 MIN: 7.91 MIN: 7.9 MIN: 7.9 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Linux 5.8 Repeat 2 Repeat 3 2 4 6 8 10 SE +/- 0.00919, N = 3 SE +/- 0.01083, N = 3 SE +/- 0.01027, N = 3 6.67768 6.68031 6.68541 MIN: 6.47 MIN: 6.47 MIN: 6.46 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Linux 5.8 Repeat 2 Repeat 3 1100 2200 3300 4400 5500 SE +/- 11.85, N = 3 SE +/- 21.00, N = 3 SE +/- 26.15, N = 3 5069.40 5078.51 5080.07 MIN: 4979.47 MIN: 4971.76 MIN: 4976 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Repeat 2 Repeat 3 Linux 5.8 700 1400 2100 2800 3500 SE +/- 4.46, N = 3 SE +/- 4.76, N = 3 SE +/- 14.40, N = 3 3357.80 3366.59 3385.70 MIN: 3305.01 MIN: 3318.41 MIN: 3314.6 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Repeat 2 Linux 5.8 Repeat 3 1100 2200 3300 4400 5500 SE +/- 0.87, N = 3 SE +/- 5.15, N = 3 SE +/- 0.83, N = 3 5109.67 5110.57 5132.65 MIN: 5045.96 MIN: 5045.23 MIN: 5073.41 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Repeat 2 Repeat 3 Linux 5.8 700 1400 2100 2800 3500 SE +/- 10.93, N = 3 SE +/- 6.23, N = 3 SE +/- 19.15, N = 3 3361.25 3387.66 3389.04 MIN: 3290.33 MIN: 3326.97 MIN: 3314.6 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Repeat 2 Linux 5.8 Repeat 3 1.1431 2.2862 3.4293 4.5724 5.7155 SE +/- 0.01578, N = 3 SE +/- 0.01204, N = 3 SE +/- 0.00785, N = 3 5.03612 5.04874 5.08034 MIN: 4.31 MIN: 4.36 MIN: 4.39 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Linux 5.8 Repeat 2 Repeat 3 1100 2200 3300 4400 5500 SE +/- 2.49, N = 3 SE +/- 12.15, N = 3 SE +/- 8.57, N = 3 5135.69 5141.79 5162.11 MIN: 5062.6 MIN: 5067.78 MIN: 5091.71 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Linux 5.8 Repeat 2 Repeat 3 700 1400 2100 2800 3500 SE +/- 14.47, N = 3 SE +/- 7.72, N = 3 SE +/- 15.22, N = 3 3378.54 3385.30 3388.22 MIN: 3306.98 MIN: 3320.3 MIN: 3306.01 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Repeat 2 Repeat 3 Linux 5.8 0.9932 1.9864 2.9796 3.9728 4.966 SE +/- 0.01709, N = 3 SE +/- 0.01222, N = 3 SE +/- 0.01035, N = 3 4.39427 4.41212 4.41400 MIN: 4.04 MIN: 3.97 MIN: 3.95 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode Repeat 3 Linux 5.8 Repeat 2 2 4 6 8 10 SE +/- 0.044, N = 5 SE +/- 0.040, N = 5 SE +/- 0.019, N = 5 7.081 7.138 7.197 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Repeat 2 Linux 5.8 Repeat 3 140K 280K 420K 560K 700K SE +/- 2399.07, N = 3 SE +/- 4754.65, N = 3 SE +/- 4604.42, N = 3 666507 663340 659345
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya Linux 5.8 Repeat 3 Repeat 2 0.153 0.306 0.459 0.612 0.765 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 0.68 0.67 0.67 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom Repeat 3 Repeat 2 Linux 5.8 0.099 0.198 0.297 0.396 0.495 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.44 0.44 0.44 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets Repeat 2 Linux 5.8 Repeat 3 0.171 0.342 0.513 0.684 0.855 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 0.76 0.76 0.75 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID Repeat 2 Repeat 3 Linux 5.8 0.1778 0.3556 0.5334 0.7112 0.889 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 0.79 0.77 0.77 1. (CXX) g++ options: -O3 -pthread
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 Repeat 2 Linux 5.8 Repeat 3 13 26 39 52 65 SE +/- 0.30, N = 3 SE +/- 0.13, N = 3 SE +/- 0.40, N = 3 56.65 57.47 57.56 1. (CC) gcc options: -O2 -ldl -lz -lpthread
Timed Clash Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Clash Compilation Time To Compile Repeat 2 Repeat 3 Linux 5.8 80 160 240 320 400 SE +/- 1.96, N = 3 SE +/- 1.29, N = 3 SE +/- 2.24, N = 3 375.11 376.49 376.60
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile Linux 5.8 Repeat 2 Repeat 3 20 40 60 80 100 SE +/- 0.39, N = 3 SE +/- 0.34, N = 3 SE +/- 0.22, N = 3 75.13 75.40 75.44
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile Repeat 2 Linux 5.8 Repeat 3 15 30 45 60 75 SE +/- 0.14, N = 3 SE +/- 0.29, N = 3 SE +/- 0.26, N = 3 65.43 65.49 65.54
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search Linux 5.8 Repeat 2 Repeat 3 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.30, N = 3 SE +/- 0.28, N = 3 106.41 106.98 107.04 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Repeat 3 Repeat 2 Linux 5.8 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 11.32 11.35 11.54 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Unpacking Firefox Extracting: firefox-84.0.source.tar.xz OpenBenchmarking.org Seconds, Fewer Is Better Unpacking Firefox 84.0 Extracting: firefox-84.0.source.tar.xz Repeat 2 Repeat 3 Linux 5.8 4 8 12 16 20 SE +/- 0.05, N = 4 SE +/- 0.04, N = 4 SE +/- 0.09, N = 4 17.46 17.47 17.51
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack Repeat 2 Linux 5.8 Repeat 3 3 6 9 12 15 SE +/- 0.07, N = 5 SE +/- 0.07, N = 5 SE +/- 0.06, N = 5 12.40 12.41 12.48 1. (CXX) g++ options: -rdynamic
Phoronix Test Suite v10.8.4