AMD Ryzen 9 3900X New Tests With Kernel Changes AMD Ryzen 9 3900X 12-Core testing with a ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009220-PTS-AMDRYZEN18&grs&rdt&rro .
AMD Ryzen 9 3900X New Tests With Kernel Changes Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution Ubuntu Default 5.4 Linux 5.8.10 Linux 5.9 Git AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads) ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) AMD Starship/Matisse 16GB Samsung SSD 970 EVO Plus 250GB MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz) AMD Ellesmere HDMI Audio G237HL Realtek RTL8111/8168/8411 + Intel-AC 9260 Ubuntu 20.04 5.4.0-42-generic (x86_64) GNOME Shell 3.36.3 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.2.0-devel (git-64cdc13 2020-07-02 focal-oibaf-ppa) (LLVM 10.0.0) 1.2.131 GCC 9.3.0 ext4 1920x1080 5.8.10-050810-generic (x86_64) 5.9.0-050900rc6daily20200922-generic (x86_64) 20200921 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701021 Graphics Details - GLAMOR Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
AMD Ryzen 9 3900X New Tests With Kernel Changes perf-bench: Sched Pipe perf-bench: Syscall Basic glmark2: 1920 x 1080 perf-bench: Epoll Wait lczero: Eigen system-decompress-gzip: perf-bench: Memset 1MB lczero: BLAS ncnn: CPU - resnet18_int8 incompact3d: Cylinder ncnn: CPU - mobilenetv2_yolov3 mnn: MobileNetV2_224 perf-bench: Futex Hash aom-av1: Speed 6 Two-Pass mnn: resnet-v2-50 lammps: 20k Atoms ncnn: CPU - resnet50_int8 ncnn: CPU - googlenet_int8 ncnn: CPU - vgg16_int8 lammps: Rhodopsin Protein aom-av1: Speed 4 Two-Pass ncnn: CPU - squeezenet ncnn: CPU - mnasnet mnn: SqueezeNetV1.0 ncnn: CPU - squeezenet_int8 mnn: mobilenet-v1-1.0 ncnn: CPU - blazeface namd: ATPase Simulation - 327,506 Atoms mocassin: Dust 2D tau100.0 gpaw: Carbon Nanotube aom-av1: Speed 6 Realtime gromacs: Water Benchmark build-llvm: Time To Compile mnn: inception-v3 perf-bench: Futex Lock-Pi compress-zstd: 19 ncnn: CPU - mobilenet_v3 aom-av1: Speed 8 Realtime ncnn: CPU - alexnet kripke: perf-bench: Memcpy 1MB Ubuntu Default 5.4 Linux 5.8.10 Linux 5.9 Git 404240 22276401 4640 43847 530 2.882 70.561859 511 19.73 262.275299 16.39 4.062 4820951 4.12 32.546 9.359 63.31 35.66 103.63 9.116 2.63 4.83 4.86 7.378 13.34 6.868 1.95 1.57422 212 309.675 19.39 1.121 444.847 31.428 669 38.8 4.84 37.57 16.64 4614547 15.142763 328023 20388216 4645 45922 507 2.758 70.589682 520 20.07 267.783132 16.49 3.979 4891875 4.08 33.105 9.201 63.72 36.19 104.48 9.025 2.61 4.81 4.91 7.403 13.37 6.865 1.96 1.56619 212 310.856 19.43 1.121 446.209 31.528 667 38.8 4.83 37.51 16.65 11727667 14.236737 312435 20858233 4903 46204 521 2.848 73.410713 504 20.28 269.579844 16.77 3.977 4798809 4.05 33.030 9.283 64.38 35.79 105.09 9.000 2.60 4.86 4.91 7.328 13.26 6.906 1.96 1.57290 213 310.991 19.47 1.117 444.772 31.472 668 38.7 4.84 37.56 16.64 16249158 14.697864 OpenBenchmarking.org
perf-bench Benchmark: Sched Pipe OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Sched Pipe Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 90K 180K 270K 360K 450K SE +/- 3676.30, N = 3 SE +/- 1251.89, N = 3 SE +/- 3596.98, N = 3 312435 328023 404240 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
perf-bench Benchmark: Syscall Basic OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Syscall Basic Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 5M 10M 15M 20M 25M SE +/- 241562.93, N = 3 SE +/- 214159.67, N = 7 SE +/- 25407.07, N = 3 20858233 20388216 22276401 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
GLmark2 Resolution: 1920 x 1080 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 1920 x 1080 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 1100 2200 3300 4400 5500 4903 4645 4640
perf-bench Benchmark: Epoll Wait OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Epoll Wait Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 10K 20K 30K 40K 50K SE +/- 122.75, N = 3 SE +/- 147.89, N = 3 SE +/- 237.60, N = 3 46204 45922 43847 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 110 220 330 440 550 SE +/- 5.36, N = 3 SE +/- 7.88, N = 3 SE +/- 8.65, N = 3 521 507 530 1. (CXX) g++ options: -flto -pthread
System GZIP Decompression OpenBenchmarking.org Seconds, Fewer Is Better System GZIP Decompression Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.6485 1.297 1.9455 2.594 3.2425 SE +/- 0.047, N = 3 SE +/- 0.005, N = 3 SE +/- 0.040, N = 3 2.848 2.758 2.882
perf-bench Benchmark: Memset 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memset 1MB Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 16 32 48 64 80 SE +/- 0.51, N = 3 SE +/- 0.85, N = 3 SE +/- 0.55, N = 3 73.41 70.59 70.56 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 110 220 330 440 550 SE +/- 6.08, N = 9 SE +/- 6.43, N = 3 SE +/- 7.62, N = 3 504 520 511 1. (CXX) g++ options: -flto -pthread
NCNN Target: CPU - Model: resnet18_int8 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18_int8 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 5 10 15 20 25 SE +/- 0.43, N = 3 SE +/- 0.16, N = 3 SE +/- 0.06, N = 3 20.28 20.07 19.73 MIN: 18.22 / MAX: 95.37 MIN: 18.74 / MAX: 25.55 MIN: 18.45 / MAX: 29.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Incompact3D Input: Cylinder OpenBenchmarking.org Seconds, Fewer Is Better Incompact3D 2020-09-17 Input: Cylinder Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 60 120 180 240 300 SE +/- 0.77, N = 3 SE +/- 1.03, N = 3 SE +/- 0.51, N = 3 269.58 267.78 262.28 1. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
NCNN Target: CPU - Model: mobilenetv2_yolov3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenetv2_yolov3 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 4 8 12 16 20 SE +/- 0.27, N = 3 SE +/- 0.19, N = 3 SE +/- 0.08, N = 3 16.77 16.49 16.39 MIN: 15.99 / MAX: 98.19 MIN: 15.89 / MAX: 20.78 MIN: 16.04 / MAX: 17.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: MobileNetV2_224 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.914 1.828 2.742 3.656 4.57 SE +/- 0.028, N = 3 SE +/- 0.012, N = 3 SE +/- 0.005, N = 3 3.977 3.979 4.062 MIN: 3.88 / MAX: 10.11 MIN: 3.89 / MAX: 8.58 MIN: 3.98 / MAX: 6.98 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
perf-bench Benchmark: Futex Hash OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Hash Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 1000K 2000K 3000K 4000K 5000K SE +/- 12288.80, N = 3 SE +/- 6026.98, N = 3 SE +/- 3705.18, N = 3 4798809 4891875 4820951 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Two-Pass Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.927 1.854 2.781 3.708 4.635 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.05 4.08 4.12 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: resnet-v2-50 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 8 16 24 32 40 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.24, N = 3 33.03 33.11 32.55 MIN: 32.07 / MAX: 43.69 MIN: 32.16 / MAX: 51.02 MIN: 31.64 / MAX: 42.77 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 24Aug2020 Model: 20k Atoms Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 3 6 9 12 15 SE +/- 0.008, N = 3 SE +/- 0.037, N = 3 SE +/- 0.007, N = 3 9.283 9.201 9.359 1. (CXX) g++ options: -O3 -pthread -lm
NCNN Target: CPU - Model: resnet50_int8 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50_int8 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 14 28 42 56 70 SE +/- 0.21, N = 3 SE +/- 0.58, N = 3 SE +/- 0.39, N = 3 64.38 63.72 63.31 MIN: 63.68 / MAX: 142.28 MIN: 62.27 / MAX: 129.81 MIN: 62.23 / MAX: 64.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet_int8 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet_int8 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 8 16 24 32 40 SE +/- 0.29, N = 3 SE +/- 0.26, N = 3 SE +/- 0.10, N = 3 35.79 36.19 35.66 MIN: 34.65 / MAX: 41.7 MIN: 34.82 / MAX: 40.62 MIN: 34.99 / MAX: 37.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16_int8 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16_int8 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 20 40 60 80 100 SE +/- 0.25, N = 3 SE +/- 0.29, N = 3 SE +/- 0.56, N = 3 105.09 104.48 103.63 MIN: 101.92 / MAX: 191.56 MIN: 101.26 / MAX: 155.23 MIN: 101.71 / MAX: 114.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 24Aug2020 Model: Rhodopsin Protein Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 3 6 9 12 15 SE +/- 0.091, N = 15 SE +/- 0.116, N = 14 SE +/- 0.117, N = 14 9.000 9.025 9.116 1. (CXX) g++ options: -O3 -pthread -lm
AOM AV1 Encoder Mode: Speed 4 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 4 Two-Pass Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.5918 1.1836 1.7754 2.3672 2.959 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 2.60 2.61 2.63 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 1.0935 2.187 3.2805 4.374 5.4675 SE +/- 0.03, N = 2 SE +/- 0.05, N = 3 SE +/- 0.00, N = 2 4.86 4.81 4.83 MIN: 4.79 / MAX: 5.43 MIN: 4.66 / MAX: 9.77 MIN: 4.79 / MAX: 4.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 1.1048 2.2096 3.3144 4.4192 5.524 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 2 4.91 4.91 4.86 MIN: 4.78 / MAX: 5.79 MIN: 4.82 / MAX: 5.44 MIN: 4.8 / MAX: 4.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: SqueezeNetV1.0 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 2 4 6 8 10 SE +/- 0.048, N = 3 SE +/- 0.023, N = 3 SE +/- 0.045, N = 3 7.328 7.403 7.378 MIN: 7.14 / MAX: 18.84 MIN: 7.25 / MAX: 18.15 MIN: 7.18 / MAX: 16.39 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN Target: CPU - Model: squeezenet_int8 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet_int8 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 13.26 13.37 13.34 MIN: 13.05 / MAX: 17.25 MIN: 13.07 / MAX: 48.91 MIN: 13.15 / MAX: 14.14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: mobilenet-v1-1.0 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 2 4 6 8 10 SE +/- 0.025, N = 3 SE +/- 0.010, N = 3 SE +/- 0.003, N = 3 6.906 6.865 6.868 MIN: 6.77 / MAX: 16.25 MIN: 6.77 / MAX: 18.04 MIN: 6.74 / MAX: 7.88 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.441 0.882 1.323 1.764 2.205 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.96 1.96 1.95 MIN: 1.91 / MAX: 2.05 MIN: 1.92 / MAX: 2.05 MIN: 1.91 / MAX: 2.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.3542 0.7084 1.0626 1.4168 1.771 SE +/- 0.00093, N = 3 SE +/- 0.00018, N = 3 SE +/- 0.00284, N = 3 1.57290 1.56619 1.57422
Monte Carlo Simulations of Ionised Nebulae Input: Dust 2D tau100.0 OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 50 100 150 200 250 213 212 212 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
GPAW Input: Carbon Nanotube OpenBenchmarking.org Seconds, Fewer Is Better GPAW 20.1 Input: Carbon Nanotube Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 70 140 210 280 350 SE +/- 0.32, N = 3 SE +/- 0.24, N = 3 SE +/- 0.20, N = 3 310.99 310.86 309.68 1. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi
AOM AV1 Encoder Mode: Speed 6 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Realtime Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 19.47 19.43 19.39 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.1 Water Benchmark Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 0.2522 0.5044 0.7566 1.0088 1.261 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 1.117 1.121 1.121 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
Timed LLVM Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 10.0 Time To Compile Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 100 200 300 400 500 SE +/- 4.89, N = 3 SE +/- 2.69, N = 3 SE +/- 3.65, N = 3 444.77 446.21 444.85
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: inception-v3 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 7 14 21 28 35 SE +/- 0.28, N = 3 SE +/- 0.18, N = 3 SE +/- 0.12, N = 3 31.47 31.53 31.43 MIN: 30.41 / MAX: 87.19 MIN: 30.68 / MAX: 85.71 MIN: 30.78 / MAX: 43.53 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
perf-bench Benchmark: Futex Lock-Pi OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Lock-Pi Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 140 280 420 560 700 SE +/- 2.85, N = 3 SE +/- 3.06, N = 3 SE +/- 2.73, N = 3 668 667 669 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
Zstd Compression Compression Level: 19 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 19 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 9 18 27 36 45 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 38.7 38.8 38.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
NCNN Target: CPU - Model: mobilenet_v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet_v3 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 1.089 2.178 3.267 4.356 5.445 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 4.84 4.83 4.84 MIN: 4.72 / MAX: 6.91 MIN: 4.76 / MAX: 5.01 MIN: 4.75 / MAX: 5.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 9 18 27 36 45 SE +/- 0.22, N = 3 SE +/- 0.25, N = 3 SE +/- 0.17, N = 3 37.56 37.51 37.57 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 16.64 16.65 16.64 MIN: 16.49 / MAX: 20.92 MIN: 16.49 / MAX: 21.18 MIN: 16.55 / MAX: 17.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 3M 6M 9M 12M 15M SE +/- 222150.06, N = 4 SE +/- 518659.22, N = 9 SE +/- 152326.40, N = 9 16249158 11727667 4614547 1. (CXX) g++ options: -O3 -fopenmp
perf-bench Benchmark: Memcpy 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memcpy 1MB Linux 5.9 Git Linux 5.8.10 Ubuntu Default 5.4 4 8 12 16 20 SE +/- 0.24, N = 3 SE +/- 0.09, N = 3 SE +/- 0.25, N = 15 14.70 14.24 15.14 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma
Phoronix Test Suite v10.8.5