3770k june Intel Core i7-3770K testing with a ECS Z77H2-A2X v1.0 (4.6.5 BIOS) and ECS Intel Xeon E3-1200 v2/3rd Gen Core on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2106234-IB-3770KJUNE62 .
3770k june Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i7-3770K @ 3.90GHz (4 Cores / 8 Threads) ECS Z77H2-A2X v1.0 (4.6.5 BIOS) Intel Xeon E3-1200 v2/3rd 8GB 160GB INTEL SSDSA2M160 ECS Intel Xeon E3-1200 v2/3rd Gen Core (1150MHz) Realtek ALC892 G237HL 2 x Realtek RTL8111/8168/8411 Ubuntu 20.04 5.8.0-55-generic (x86_64) GNOME Shell 3.36.9 X Server 1.20.9 1.2.145 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x21 - Thermald 1.9.1 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Vulnerable: No microcode + tsx_async_abort: Not affected
3770k june npb: BT.C npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: LU.C npb: MG.C npb: SP.B npb: SP.C compress-zstd: 3 - Compression Speed compress-zstd: 3 - Decompression Speed compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed srsran: OFDM_Test srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj svt-av1: Preset 4 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 4 - Bosphorus 1080p svt-av1: Preset 8 - Bosphorus 1080p vpxenc: Speed 0 - Bosphorus 4K vpxenc: Speed 5 - Bosphorus 4K vpxenc: Speed 0 - Bosphorus 1080p vpxenc: Speed 5 - Bosphorus 1080p oidn: RT.hdr_alb_nrm.3840x2160 oidn: RT.ldr_alb_nrm.3840x2160 oidn: RTLightmap.hdr.4096x4096 build-ffmpeg: Time To Compile build-gdb: Time To Compile gromacs: MPI CPU - water_GMX50_bare mnn: mobilenetV3 mnn: squeezenetv1.1 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m tnn: CPU - DenseNet tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v2 tnn: CPU - SqueezeNet v1.1 brl-cad: VGR Performance Metric 1 2 3 7427.17 1599.58 372.01 384.77 707.76 7609.70 2967.47 1808.82 1851.10 713.2 2325.2 70.1 2396.6 14.6 2107.6 505.2 2441.0 83.0 2502.1 13.9 2126.2 70166667 263.4 86.9 273.1 145.0 296.9 94.5 302.9 164.5 14.1 6.4 14.7 12.3 203.96 50.91 192.92 60.21 3.6750 3.8178 4.3717 4.0500 4.6913 4.2050 0.055 0.681 0.211 2.839 1.82 5.42 4.58 13.53 0.09 0.09 0.04 175.921 171.222 0.167 3.555 9.153 96.297 16.942 9.139 13.064 112.712 65.02 16.59 14.91 9.11 14.46 25.67 2.68 77.93 814.35 87.60 53.46 154.48 183.63 89.09 21.66 64.85 16.47 14.83 9.15 14.37 25.31 2.68 77.39 813.89 87.68 53.13 156.72 183.38 88.67 21.65 4569.707 354.820 85.912 316.961 39380 7391.15 1629.09 383.87 392.89 816.87 7618.95 2970.74 1806.68 1851.1 704.2 2326.8 65.9 2400.1 14.6 2101.6 504.4 2441.7 91.9 2503.3 13.9 2121.2 71600000 271.8 89.9 274.9 145.1 289.9 93.8 296.7 159 14.1 6.3 14.6 12.1 204.19 50.85 192.93 60.24 3.6625 3.8456 4.3491 4.0496 4.6225 4.1978 0.055 0.682 0.212 2.846 1.83 5.44 4.55 13.4 0.09 0.09 0.04 176.717 172.573 0.167 3.541 9.157 96.275 16.865 9.13 13.092 112.013 64.89 16.54 14.74 9.14 14.43 25.29 2.68 78.98 814.66 88.02 53.69 155.74 183.51 88.79 21.65 65.14 16.54 15.2 9.17 14.52 25.39 2.69 77.78 814.55 87.48 53.56 155.93 183.22 90.25 21.67 4566.472 353.927 85.951 318.374 39354 7421.29 1607.8 371.89 390.95 594.5 7612.27 2971.98 1804.38 1852.64 696 2329.9 65.8 2397.9 14.6 2097.7 506.3 2431 81.9 2501.5 13.9 2126.2 71300000 265.5 87 272.7 145.1 294.6 93.3 303.2 164.3 14.1 6.4 14.7 12.2 204.39 50.8 193.58 60.22 3.6719 3.849 4.3723 4.0469 4.6621 4.2015 0.055 0.685 0.212 2.847 1.81 5.45 4.5 13.39 0.09 0.09 0.04 176.608 172.023 0.166 3.565 9.271 97.288 16.952 9.168 13.234 112.672 65.09 17.49 15.06 9.11 14.74 25.25 2.7 78.75 813.24 87.37 53.58 156.5 183.24 90.07 21.85 65.84 16.23 14.7 9.12 14.41 25.36 2.68 77.39 816.5 87.31 52.81 153.6 184.28 89.42 21.56 4560.104 354.844 85.884 317.501 39592 OpenBenchmarking.org
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 1 2 3 1600 3200 4800 6400 8000 SE +/- 5.52, N = 3 7427.17 7391.15 7421.29 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 1 2 3 300 600 900 1200 1500 SE +/- 7.53, N = 3 1599.58 1629.09 1607.80 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 80 160 240 320 400 SE +/- 0.68, N = 3 372.01 383.87 371.89 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 90 180 270 360 450 SE +/- 4.40, N = 3 384.77 392.89 390.95 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 1 2 3 200 400 600 800 1000 SE +/- 62.45, N = 7 707.76 816.87 594.50 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 1600 3200 4800 6400 8000 SE +/- 15.03, N = 3 7609.70 7618.95 7612.27 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 1 2 3 600 1200 1800 2400 3000 SE +/- 3.97, N = 3 2967.47 2970.74 2971.98 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 1 2 3 400 800 1200 1600 2000 SE +/- 0.65, N = 3 1808.82 1806.68 1804.38 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 1 2 3 400 800 1200 1600 2000 SE +/- 1.23, N = 3 1851.10 1851.10 1852.64 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed 1 2 3 150 300 450 600 750 SE +/- 5.04, N = 3 713.2 704.2 696.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 2.45, N = 3 2325.2 2326.8 2329.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed 1 2 3 16 32 48 64 80 SE +/- 0.87, N = 15 70.1 65.9 65.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 0.69, N = 15 2396.6 2400.1 2397.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed 1 2 3 4 8 12 16 20 SE +/- 0.03, N = 3 14.6 14.6 14.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 0.50, N = 3 2107.6 2101.6 2097.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Compression Speed 1 2 3 110 220 330 440 550 SE +/- 6.83, N = 4 505.2 504.4 506.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Decompression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 1.05, N = 4 2441.0 2441.7 2431.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed 1 2 3 20 40 60 80 100 SE +/- 1.04, N = 15 83.0 91.9 81.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 0.66, N = 15 2502.1 2503.3 2501.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 13.9 13.9 13.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 0.44, N = 3 2126.2 2121.2 2126.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test 1 2 3 15M 30M 45M 60M 75M SE +/- 633333.33, N = 3 70166667 71600000 71300000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM 1 2 3 60 120 180 240 300 SE +/- 3.90, N = 3 263.4 271.8 265.5 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM 1 2 3 20 40 60 80 100 SE +/- 0.53, N = 3 86.9 89.9 87.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM 1 2 3 60 120 180 240 300 SE +/- 0.17, N = 3 273.1 274.9 272.7 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM 1 2 3 30 60 90 120 150 SE +/- 0.22, N = 3 145.0 145.1 145.1 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM 1 2 3 60 120 180 240 300 SE +/- 1.27, N = 3 296.9 289.9 294.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM 1 2 3 20 40 60 80 100 SE +/- 0.43, N = 3 94.5 93.8 93.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM 1 2 3 70 140 210 280 350 SE +/- 0.29, N = 3 302.9 296.7 303.2 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM 1 2 3 40 80 120 160 200 SE +/- 0.09, N = 3 164.5 159.0 164.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 14.1 14.1 14.1 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM 1 2 3 2 4 6 8 10 SE +/- 0.00, N = 3 6.4 6.3 6.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 14.7 14.6 14.7 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM 1 2 3 3 6 9 12 15 SE +/- 0.07, N = 3 12.3 12.1 12.2 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 1 2 3 40 80 120 160 200 SE +/- 0.15, N = 3 203.96 204.19 204.39 MIN: 155.54 / MAX: 334.76 MIN: 155.82 / MAX: 328.37 MIN: 155.69 / MAX: 339.89 1. (CC) gcc options: -pthread -lm
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 4K 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 50.91 50.85 50.80 MIN: 48.79 / MAX: 54.51 MIN: 48.78 / MAX: 54.4 MIN: 48.76 / MAX: 54.38 1. (CC) gcc options: -pthread -lm
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 1080p 1 2 3 40 80 120 160 200 SE +/- 0.03, N = 3 192.92 192.93 193.58 MIN: 173.09 / MAX: 208.53 MIN: 177.1 / MAX: 208.33 MIN: 174.38 / MAX: 209.41 1. (CC) gcc options: -pthread -lm
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 10-bit 1 2 3 13 26 39 52 65 SE +/- 0.06, N = 3 60.21 60.24 60.22 MIN: 41.19 / MAX: 127.46 MIN: 41.33 / MAX: 126.07 MIN: 41.29 / MAX: 127.76 1. (CC) gcc options: -pthread -lm
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown 1 2 3 0.8269 1.6538 2.4807 3.3076 4.1345 SE +/- 0.0247, N = 3 3.6750 3.6625 3.6719 MIN: 3.6 / MAX: 3.75 MIN: 3.64 / MAX: 3.71 MIN: 3.65 / MAX: 3.71
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown 1 2 3 0.866 1.732 2.598 3.464 4.33 SE +/- 0.0120, N = 3 3.8178 3.8456 3.8490 MIN: 3.77 / MAX: 3.89 MIN: 3.83 / MAX: 3.9 MIN: 3.83 / MAX: 3.9
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon 1 2 3 0.9838 1.9676 2.9514 3.9352 4.919 SE +/- 0.0110, N = 3 4.3717 4.3491 4.3723 MIN: 4.33 / MAX: 4.45 MIN: 4.31 / MAX: 4.42 MIN: 4.33 / MAX: 4.44
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Obj 1 2 3 0.9113 1.8226 2.7339 3.6452 4.5565 SE +/- 0.0053, N = 3 4.0500 4.0496 4.0469 MIN: 4.02 / MAX: 4.1 MIN: 4.04 / MAX: 4.09 MIN: 4.03 / MAX: 4.08
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon 1 2 3 1.0555 2.111 3.1665 4.222 5.2775 SE +/- 0.0165, N = 3 4.6913 4.6225 4.6621 MIN: 4.63 / MAX: 4.78 MIN: 4.57 / MAX: 4.71 MIN: 4.62 / MAX: 4.74
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 1 2 3 0.9461 1.8922 2.8383 3.7844 4.7305 SE +/- 0.0032, N = 3 4.2050 4.1978 4.2015 MIN: 4.19 / MAX: 4.25 MIN: 4.18 / MAX: 4.24 MIN: 4.19 / MAX: 4.24
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K 1 2 3 0.0124 0.0248 0.0372 0.0496 0.062 SE +/- 0.000, N = 3 0.055 0.055 0.055 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K 1 2 3 0.1541 0.3082 0.4623 0.6164 0.7705 SE +/- 0.001, N = 3 0.681 0.682 0.685 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p 1 2 3 0.0477 0.0954 0.1431 0.1908 0.2385 SE +/- 0.001, N = 3 0.211 0.212 0.212 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p 1 2 3 0.6406 1.2812 1.9218 2.5624 3.203 SE +/- 0.001, N = 3 2.839 2.846 2.847 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K 1 2 3 0.4118 0.8236 1.2354 1.6472 2.059 SE +/- 0.00, N = 3 1.82 1.83 1.81 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K 1 2 3 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.02, N = 3 5.42 5.44 5.45 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 1080p 1 2 3 1.0305 2.061 3.0915 4.122 5.1525 SE +/- 0.01, N = 3 4.58 4.55 4.50 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 1080p 1 2 3 3 6 9 12 15 SE +/- 0.13, N = 3 13.53 13.40 13.39 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.hdr_alb_nrm.3840x2160 1 2 3 0.0203 0.0406 0.0609 0.0812 0.1015 SE +/- 0.00, N = 3 0.09 0.09 0.09
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.ldr_alb_nrm.3840x2160 1 2 3 0.0203 0.0406 0.0609 0.0812 0.1015 SE +/- 0.00, N = 3 0.09 0.09 0.09
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RTLightmap.hdr.4096x4096 1 2 3 0.009 0.018 0.027 0.036 0.045 SE +/- 0.00, N = 3 0.04 0.04 0.04
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.4 Time To Compile 1 2 3 40 80 120 160 200 SE +/- 0.10, N = 3 175.92 176.72 176.61
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 10.2 Time To Compile 1 2 3 40 80 120 160 200 SE +/- 0.23, N = 3 171.22 172.57 172.02
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare 1 2 3 0.0376 0.0752 0.1128 0.1504 0.188 SE +/- 0.001, N = 3 0.167 0.167 0.166 1. (CXX) g++ options: -O3 -pthread
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenetV3 1 2 3 0.8021 1.6042 2.4063 3.2084 4.0105 SE +/- 0.005, N = 3 3.555 3.541 3.565 MIN: 3.52 / MAX: 4.05 MIN: 3.51 / MAX: 3.67 MIN: 3.53 / MAX: 4.6 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: squeezenetv1.1 1 2 3 3 6 9 12 15 SE +/- 0.104, N = 3 9.153 9.157 9.271 MIN: 8.92 / MAX: 11.11 MIN: 9.11 / MAX: 12.34 MIN: 9.23 / MAX: 9.72 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: resnet-v2-50 1 2 3 20 40 60 80 100 SE +/- 0.34, N = 3 96.30 96.28 97.29 MIN: 95.26 / MAX: 140.43 MIN: 95.75 / MAX: 151.03 MIN: 96.5 / MAX: 101.75 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: SqueezeNetV1.0 1 2 3 4 8 12 16 20 SE +/- 0.12, N = 3 16.94 16.87 16.95 MIN: 16.71 / MAX: 20.01 MIN: 16.77 / MAX: 19.57 MIN: 16.86 / MAX: 20.24 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: MobileNetV2_224 1 2 3 3 6 9 12 15 SE +/- 0.009, N = 3 9.139 9.130 9.168 MIN: 9.07 / MAX: 11.11 MIN: 9.05 / MAX: 10.46 MIN: 9.12 / MAX: 9.69 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenet-v1-1.0 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 13.06 13.09 13.23 MIN: 12.97 / MAX: 18.37 MIN: 12.97 / MAX: 15.02 MIN: 13.15 / MAX: 15.11 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: inception-v3 1 2 3 30 60 90 120 150 SE +/- 0.43, N = 3 112.71 112.01 112.67 MIN: 111.52 / MAX: 124.51 MIN: 111.45 / MAX: 116.63 MIN: 111.91 / MAX: 160.39 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: mobilenet 1 2 3 15 30 45 60 75 SE +/- 0.07, N = 3 65.02 64.89 65.09 MIN: 64.46 / MAX: 66.39 MIN: 64.53 / MAX: 65.75 MIN: 64.56 / MAX: 67.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 4 8 12 16 20 SE +/- 0.09, N = 3 16.59 16.54 17.49 MIN: 16.31 / MAX: 56.61 MIN: 16.41 / MAX: 18.16 MIN: 17.24 / MAX: 18.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 3 14.91 14.74 15.06 MIN: 14.71 / MAX: 15.34 MIN: 14.62 / MAX: 15.2 MIN: 14.73 / MAX: 15.45 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: shufflenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 9.11 9.14 9.11 MIN: 9.02 / MAX: 9.21 MIN: 9.07 / MAX: 10.06 MIN: 9.01 / MAX: 9.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: mnasnet 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 3 14.46 14.43 14.74 MIN: 14.26 / MAX: 16.61 MIN: 14.31 / MAX: 14.78 MIN: 14.57 / MAX: 15.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: efficientnet-b0 1 2 3 6 12 18 24 30 SE +/- 0.18, N = 3 25.67 25.29 25.25 MIN: 25.18 / MAX: 26.95 MIN: 25.18 / MAX: 25.9 MIN: 25.14 / MAX: 25.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: blazeface 1 2 3 0.6075 1.215 1.8225 2.43 3.0375 SE +/- 0.00, N = 3 2.68 2.68 2.70 MIN: 2.66 / MAX: 2.79 MIN: 2.65 / MAX: 2.96 MIN: 2.66 / MAX: 6.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: googlenet 1 2 3 20 40 60 80 100 SE +/- 0.17, N = 3 77.93 78.98 78.75 MIN: 76.44 / MAX: 80.3 MIN: 78.44 / MAX: 81.76 MIN: 78.43 / MAX: 79.91 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: vgg16 1 2 3 200 400 600 800 1000 SE +/- 0.11, N = 3 814.35 814.66 813.24 MIN: 795.97 / MAX: 846.53 MIN: 797.07 / MAX: 826.77 MIN: 797.51 / MAX: 824.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: resnet18 1 2 3 20 40 60 80 100 SE +/- 0.27, N = 3 87.60 88.02 87.37 MIN: 86.52 / MAX: 90.46 MIN: 87.51 / MAX: 89.09 MIN: 86.64 / MAX: 89.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: alexnet 1 2 3 12 24 36 48 60 SE +/- 0.07, N = 3 53.46 53.69 53.58 MIN: 53.06 / MAX: 56.97 MIN: 53.27 / MAX: 54.45 MIN: 53.14 / MAX: 54.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: resnet50 1 2 3 30 60 90 120 150 SE +/- 0.45, N = 3 154.48 155.74 156.50 MIN: 151.42 / MAX: 161.2 MIN: 154.89 / MAX: 159.24 MIN: 153.36 / MAX: 159.79 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: yolov4-tiny 1 2 3 40 80 120 160 200 SE +/- 0.22, N = 3 183.63 183.51 183.24 MIN: 180.52 / MAX: 189.31 MIN: 181.83 / MAX: 187.94 MIN: 181.66 / MAX: 185.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: squeezenet_ssd 1 2 3 20 40 60 80 100 SE +/- 0.39, N = 3 89.09 88.79 90.07 MIN: 87.58 / MAX: 136.06 MIN: 88.2 / MAX: 92.3 MIN: 89.14 / MAX: 129.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.05, N = 3 21.66 21.65 21.85 MIN: 21.51 / MAX: 23.4 MIN: 21.56 / MAX: 22.02 MIN: 21.63 / MAX: 22.06 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: mobilenet 1 2 3 15 30 45 60 75 SE +/- 0.14, N = 3 64.85 65.14 65.84 MIN: 64.26 / MAX: 68.21 MIN: 64.49 / MAX: 66.46 MIN: 65.45 / MAX: 67.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 16.47 16.54 16.23 MIN: 16.32 / MAX: 17.99 MIN: 16.4 / MAX: 17.82 MIN: 16 / MAX: 17.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 8 12 16 20 SE +/- 0.11, N = 3 14.83 15.20 14.70 MIN: 14.45 / MAX: 16.17 MIN: 15.12 / MAX: 15.41 MIN: 14.6 / MAX: 15.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 9.15 9.17 9.12 MIN: 9.05 / MAX: 9.72 MIN: 9.09 / MAX: 9.92 MIN: 9.06 / MAX: 9.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: mnasnet 1 2 3 4 8 12 16 20 SE +/- 0.04, N = 3 14.37 14.52 14.41 MIN: 14 / MAX: 14.6 MIN: 14.44 / MAX: 15.07 MIN: 14.31 / MAX: 14.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 25.31 25.39 25.36 MIN: 25.17 / MAX: 25.66 MIN: 25.26 / MAX: 25.55 MIN: 25.16 / MAX: 30.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: blazeface 1 2 3 0.6053 1.2106 1.8159 2.4212 3.0265 SE +/- 0.01, N = 3 2.68 2.69 2.68 MIN: 2.64 / MAX: 2.84 MIN: 2.66 / MAX: 2.83 MIN: 2.66 / MAX: 2.79 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: googlenet 1 2 3 20 40 60 80 100 SE +/- 0.16, N = 3 77.39 77.78 77.39 MIN: 76.45 / MAX: 78.76 MIN: 76.97 / MAX: 79.29 MIN: 76.27 / MAX: 79.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: vgg16 1 2 3 200 400 600 800 1000 SE +/- 1.19, N = 3 813.89 814.55 816.50 MIN: 793.12 / MAX: 832.83 MIN: 794.93 / MAX: 843.88 MIN: 800.81 / MAX: 826.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: resnet18 1 2 3 20 40 60 80 100 SE +/- 0.17, N = 3 87.68 87.48 87.31 MIN: 86.58 / MAX: 89.97 MIN: 86.56 / MAX: 90.53 MIN: 86.34 / MAX: 89.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: alexnet 1 2 3 12 24 36 48 60 SE +/- 0.11, N = 3 53.13 53.56 52.81 MIN: 52.47 / MAX: 55.38 MIN: 53.16 / MAX: 54.06 MIN: 52.39 / MAX: 54.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: resnet50 1 2 3 30 60 90 120 150 SE +/- 0.78, N = 3 156.72 155.93 153.60 MIN: 153.07 / MAX: 164.69 MIN: 155.03 / MAX: 157.25 MIN: 152.39 / MAX: 158.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 40 80 120 160 200 SE +/- 0.21, N = 3 183.38 183.22 184.28 MIN: 181.35 / MAX: 187.72 MIN: 181.82 / MAX: 187.1 MIN: 181.67 / MAX: 188.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: squeezenet_ssd 1 2 3 20 40 60 80 100 SE +/- 0.14, N = 3 88.67 90.25 89.42 MIN: 87.74 / MAX: 90.67 MIN: 89.51 / MAX: 92.14 MIN: 88.01 / MAX: 92.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: Vulkan GPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 21.65 21.67 21.56 MIN: 21.31 / MAX: 23.1 MIN: 21.49 / MAX: 22.31 MIN: 21.17 / MAX: 22.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
TNN Target: CPU - Model: DenseNet OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: DenseNet 1 2 3 1000 2000 3000 4000 5000 SE +/- 2.62, N = 3 4569.71 4566.47 4560.10 MIN: 4549.25 / MAX: 4609.34 MIN: 4553.48 / MAX: 4583.87 MIN: 4548.61 / MAX: 4574.83 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: MobileNet v2 1 2 3 80 160 240 320 400 SE +/- 0.86, N = 3 354.82 353.93 354.84 MIN: 349.69 / MAX: 361.61 MIN: 348.95 / MAX: 360.71 MIN: 351.03 / MAX: 360.44 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v2 1 2 3 20 40 60 80 100 SE +/- 0.01, N = 3 85.91 85.95 85.88 MIN: 85.8 / MAX: 86.42 MIN: 85.81 / MAX: 86.45 MIN: 85.8 / MAX: 86.22 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 70 140 210 280 350 SE +/- 0.39, N = 3 316.96 318.37 317.50 MIN: 314.88 / MAX: 318.72 MIN: 316.68 / MAX: 320.21 MIN: 316.23 / MAX: 319.2 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.32.2 VGR Performance Metric 1 2 3 8K 16K 24K 32K 40K 39380 39354 39592 1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm
Phoronix Test Suite v10.8.4