10900k june
Intel Core i9-10900K testing with a Gigabyte Z490 AORUS MASTER (F20d BIOS) and Gigabyte Intel UHD 630 CML GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2106300-IB-10900KJUN95&grs&sor .
System details (runs 1, 2, 3):
Processor: Intel Core i9-10900K @ 5.30GHz (10 Cores / 20 Threads)
Motherboard: Gigabyte Z490 AORUS MASTER (F20d BIOS)
Chipset: Intel Comet Lake PCH
Memory: 16GB
Disk: Samsung SSD 970 EVO 500GB
Graphics: Gigabyte Intel UHD 630 CML GT2 3GB (1200MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel Device 15f3 + Intel Wi-Fi 6 AX201
OS: Ubuntu 20.04
Kernel: 5.9.0-050900daily20201012-generic (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 20.0.8
OpenCL: OpenCL 2.1
Vulkan: 1.2.131
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080
Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2
Security Details: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Result overview:
Test | 1 | 2 | 3
npb: MG.C | 11137.19 | 11121.48 | 10054.56
npb: EP.D | 1838.96 | 1833.03 | 1669.82
srsran: OFDM_Test | 127100000 | 122966667 | 133566667
compress-zstd: 8 - Compression Speed | 177.6 | 189.0 | 186.3
ncnn: CPU-v2-v2 - mobilenet-v2 | 4.3 | 4.43 | 4.55
ncnn: CPU - shufflenet-v2 | 3.15 | 3.32 | 3.30
npb: FT.C | 12947.66 | 13116.82 | 12454.59
mnn: inception-v3 | 29.905 | 29.321 | 28.533
srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM [UE Mb/s] | 138.4 | 136.9 | 143.3
srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM [UE Mb/s] | 292.3 | 288.8 | 301.8
npb: SP.B | 5354.13 | 5290.88 | 5126.92
srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM [eNb Mb/s] | 423.4 | 413.2 | 431.5
srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM [eNb Mb/s] | 418.5 | 414.1 | 431.9
srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM [eNb Mb/s] | 461.2 | 457.5 | 476.5
srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM [UE Mb/s] | 244.1 | 238.6 | 248.1
compress-zstd: 8 - Decompression Speed | 4157.2 | 4318.1 | 4243.6
srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM [UE Mb/s] | 68 | 67.8 | 70.4
npb: LU.C | 26904.82 | 26820.57 | 25911.23
mnn: squeezenetv1.1 | 3.083 | 3.173 | 3.060
compress-zstd: 8, Long Mode - Compression Speed | 528.8 | 546.5 | 527.3
srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM [eNb Mb/s] | 461.1 | 456.3 | 472.3
compress-zstd: 3 - Compression Speed | 2270.9 | 2347.7 | 2306.9
srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM [eNb Mb/s] | 142.2 | 142.1 | 146.9
srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM [UE Mb/s] | 151.7 | 150.5 | 155.5
srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM [UE Mb/s] | 94.3 | 93.2 | 96.2
mnn: SqueezeNetV1.0 | 4.543 | 4.552 | 4.411
compress-zstd: 3 - Decompression Speed | 4136.7 | 4263.3 | 4248.7
compress-zstd: 19 - Decompression Speed | 3870.4 | 3812.8 | 3758.1
srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM [eNb Mb/s] | 158.3 | 156.4 | 161.0
ncnn: CPU-v3-v3 - mobilenet-v3 | 3.39 | 3.48 | 3.47
ncnn: CPU - efficientnet-b0 | 5.33 | 5.47 | 5.47
ncnn: CPU - regnety_400m | 8.43 | 8.64 | 8.53
compress-zstd: 19, Long Mode - Compression Speed | 29.7 | 29.3 | 29.0
blosc: blosclz | 19606.3 | 19796.8 | 20063.7
mnn: mobilenetV3 | 1.601 | 1.637 | 1.629
ncnn: CPU - mnasnet | 3.34 | 3.37 | 3.41
npb: SP.C | 4949.7 | 5047.56 | 5031.12
vpxenc: Speed 5 - Bosphorus 1080p | 30.42 | 30.99 | 30.39
embree: Pathtracer ISPC - Asian Dragon | 18.1986 | 18.5302 | 18.3283
mnn: mobilenet-v1-1.0 | 3.318 | 3.359 | 3.305
compress-zstd: 19, Long Mode - Decompression Speed | 3835.5 | 3823.6 | 3884.6
npb: EP.C | 1812.54 | 1790.82 | 1786.57
astcenc: Medium | 4.0492 | 4.0720 | 4.0145
embree: Pathtracer - Crown | 14.343 | 14.2603 | 14.1407
ncnn: CPU - blazeface | 1.42 | 1.44 | 1.44
npb: CG.C | 5554.07 | 5558.74 | 5627.28
compress-zstd: 3, Long Mode - Decompression Speed | 4555.9 | 4498.5 | 4546.2
compress-zstd: 19 - Compression Speed | 32.9 | 33.0 | 33.3
svt-av1: Preset 8 - Bosphorus 1080p | 69.441 | 69.231 | 70.032
mnn: resnet-v2-50 | 29.415 | 29.754 | 29.488
tnn: CPU - DenseNet | 2870.071 | 2882.364 | 2851.863
vpxenc: Speed 5 - Bosphorus 4K | 15.28 | 15.40 | 15.24
embree: Pathtracer - Asian Dragon | 15.9307 | 15.8030 | 15.9659
vpxenc: Speed 0 - Bosphorus 4K | 6.32 | 6.38 | 6.38
build-gdb: Time To Compile | 56.595 | 56.064 | 56.263
compress-zstd: 3, Long Mode - Compression Speed | 1319.6 | 1307.7 | 1311.5
svt-av1: Preset 8 - Bosphorus 4K | 18.379 | 18.248 | 18.217
ncnn: CPU - googlenet | 12.61 | 12.72 | 12.63
dav1d: Chimera 1080p | 820.43 | 819.80 | 813.38
tnn: CPU - MobileNet v2 | 286.226 | 284.087 | 285.361
npb: BT.C | 26301.99 | 26239.81 | 26422.40
svt-av1: Preset 4 - Bosphorus 1080p | 5.431 | 5.401 | 5.396
embree: Pathtracer ISPC - Crown | 16.0139 | 15.9522 | 15.9174
ncnn: CPU - vgg16 | 62.06 | 62.08 | 62.43
embree: Pathtracer ISPC - Asian Dragon Obj | 16.4077 | 16.3105 | 16.3384
ncnn: CPU - mobilenet | 15.69 | 15.77 | 15.78
ncnn: CPU - yolov4-tiny | 23.09 | 23.22 | 23.19
embree: Pathtracer - Asian Dragon Obj | 14.7441 | 14.8025 | 14.8244
astcenc: Thorough | 9.3677 | 9.3489 | 9.3285
dav1d: Summer Nature 1080p | 770.57 | 770.74 | 767.77
build-ffmpeg: Time To Compile | 48.51 | 48.332 | 48.405
dav1d: Chimera 1080p 10-bit | 491.13 | 490.72 | 492.52
ncnn: CPU - alexnet | 12.34 | 12.33 | 12.30
svt-av1: Preset 4 - Bosphorus 4K | 1.631 | 1.629 | 1.634
brl-cad: VGR Performance Metric | 187701 | 187547 | 187146
tnn: CPU - SqueezeNet v1.1 | 262.424 | 261.693 | 261.967
dav1d: Summer Nature 4K | 190.74 | 191.09 | 191.20
ncnn: CPU - resnet18 | 13.71 | 13.68 | 13.68
compress-zstd: 8, Long Mode - Decompression Speed | 4653.3 | 4653.7 | 4643.8
tnn: CPU - SqueezeNet v2 | 58.661 | 58.722 | 58.734
gromacs: MPI CPU - water_GMX50_bare | 0.963 | 0.964 | 0.963
ncnn: CPU - resnet50 | 22.5 | 22.50 | 22.52
vpxenc: Speed 0 - Bosphorus 1080p | 14.18 | 14.19 | 14.19
astcenc: Exhaustive | 50.2027 | 50.2359 | 50.2140
ncnn: CPU - squeezenet_ssd | 17.57 | 17.57 | 17.57
oidn: RTLightmap.hdr.4096x4096 | 0.23 | 0.23 | 0.23
oidn: RT.ldr_alb_nrm.3840x2160 | 0.46 | 0.46 | 0.46
oidn: RT.hdr_alb_nrm.3840x2160 | 0.46 | 0.46 | 0.46
mnn: MobileNetV2_224 | 3.151 | 2.909 | 2.783
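One rough way to read the overview above is to ask how far the slowest run trails the fastest for a given test. The Python sketch below does this for three rows copied from the overview. The names `RESULTS` and `spread_percent` are illustrative only and are not part of the Phoronix Test Suite; the higher-is-better flags are assumptions taken from the "More Is Better" / "Fewer Is Better" labels on the per-test results.

```python
# Values copied from the result overview, in run order (run 1, run 2, run 3).
# The boolean is True for "More Is Better" tests and False for "Fewer Is Better".
RESULTS = {
    "npb: MG.C": ((11137.19, 11121.48, 10054.56), True),
    "srsran: OFDM_Test": ((127100000, 122966667, 133566667), True),
    "ncnn: CPU-v2-v2 - mobilenet-v2": ((4.30, 4.43, 4.55), False),
}


def spread_percent(values, higher_is_better):
    """Return (best_run, worst_run, gap_percent).

    Run numbers are 1-based. gap_percent is the difference between the
    best and worst result, expressed as a percentage of the best result.
    """
    best = max(values) if higher_is_better else min(values)
    worst = min(values) if higher_is_better else max(values)
    gap = abs(best - worst) / best * 100.0
    return values.index(best) + 1, values.index(worst) + 1, gap


if __name__ == "__main__":
    for name, (values, higher_is_better) in RESULTS.items():
        best_run, worst_run, gap = spread_percent(values, higher_is_better)
        print(f"{name}: best run {best_run}, worst run {worst_run}, gap {gap:.2f}%")
```

A gap computed this way says nothing about significance on its own; it is best read next to the SE values reported for each test.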
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
Run 1: 11137.19; Run 2: 11121.48; Run 3: 10054.56
SE +/- 5.16, N = 3; SE +/- 119.21, N = 6
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
Run 1: 1838.96; Run 2: 1833.03; Run 3: 1669.82
SE +/- 2.47, N = 3; SE +/- 28.88, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
srsRAN 21.04, Test: OFDM_Test (Samples / Second, More Is Better)
Run 3: 133566667; Run 1: 127100000; Run 2: 122966667
SE +/- 437162.57, N = 3; SE +/- 1530068.99, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Zstd Compression 1.5.0, Compression Level: 8 - Compression Speed (MB/s, More Is Better)
Run 2: 189.0; Run 3: 186.3; Run 1: 177.6
SE +/- 3.24, N = 3; SE +/- 3.13, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
NCNN 20210525, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Run 1: 4.30 (MIN: 4.13 / MAX: 4.55); Run 2: 4.43 (MIN: 4.09 / MAX: 6.38); Run 3: 4.55 (MIN: 4.3 / MAX: 5.62)
SE +/- 0.04, N = 3; SE +/- 0.10, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210525, Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
Run 1: 3.15 (MIN: 3.02 / MAX: 3.29); Run 3: 3.30 (MIN: 3.05 / MAX: 4); Run 2: 3.32 (MIN: 3.09 / MAX: 4.08)
SE +/- 0.05, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
Run 2: 13116.82; Run 1: 12947.66; Run 3: 12454.59
SE +/- 12.29, N = 3; SE +/- 108.08, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
Mobile Neural Network 1.2, Model: inception-v3 (ms, Fewer Is Better)
Run 3: 28.53 (MIN: 28.11 / MAX: 41.18); Run 2: 29.32 (MIN: 28.88 / MAX: 40.74); Run 1: 29.91 (MIN: 29.05 / MAX: 39.9)
SE +/- 0.23, N = 3; SE +/- 0.17, N = 3
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM (UE Mb/s, More Is Better)
Run 3: 143.3; Run 1: 138.4; Run 2: 136.9
SE +/- 0.19, N = 3; SE +/- 0.44, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM (UE Mb/s, More Is Better)
Run 3: 301.8; Run 1: 292.3; Run 2: 288.8
SE +/- 0.72, N = 3; SE +/- 0.15, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
Run 1: 5354.13; Run 2: 5290.88; Run 3: 5126.92
SE +/- 3.99, N = 3; SE +/- 85.04, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM (eNb Mb/s, More Is Better)
Run 3: 431.5; Run 1: 423.4; Run 2: 413.2
SE +/- 3.74, N = 3; SE +/- 1.36, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM (eNb Mb/s, More Is Better)
Run 3: 431.9; Run 1: 418.5; Run 2: 414.1
SE +/- 0.59, N = 3; SE +/- 1.10, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM (eNb Mb/s, More Is Better)
Run 3: 476.5; Run 1: 461.2; Run 2: 457.5
SE +/- 0.84, N = 3; SE +/- 0.93, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM (UE Mb/s, More Is Better)
Run 3: 248.1; Run 1: 244.1; Run 2: 238.6
SE +/- 2.18, N = 3; SE +/- 0.52, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Zstd Compression 1.5.0, Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
Run 2: 4318.1; Run 3: 4243.6; Run 1: 4157.2
SE +/- 34.82, N = 3; SE +/- 22.34, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN 21.04, Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM (UE Mb/s, More Is Better)
Run 3: 70.4; Run 1: 68.0; Run 2: 67.8
SE +/- 0.06, N = 3; SE +/- 0.13, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
Run 1: 26904.82; Run 2: 26820.57; Run 3: 25911.23
SE +/- 44.76, N = 3; SE +/- 23.04, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
Mobile Neural Network 1.2, Model: squeezenetv1.1 (ms, Fewer Is Better)
Run 3: 3.060 (MIN: 2.98 / MAX: 3.82); Run 1: 3.083 (MIN: 3.01 / MAX: 3.14); Run 2: 3.173 (MIN: 2.97 / MAX: 5.58)
SE +/- 0.018, N = 3; SE +/- 0.050, N = 3
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Zstd Compression 1.5.0, Compression Level: 8, Long Mode - Compression Speed (MB/s, More Is Better)
Run 2: 546.5; Run 1: 528.8; Run 3: 527.3
SE +/- 7.74, N = 4; SE +/- 5.98, N = 7
1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM (eNb Mb/s, More Is Better)
Run 3: 472.3; Run 1: 461.1; Run 2: 456.3
SE +/- 2.21, N = 3; SE +/- 0.73, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Zstd Compression 1.5.0, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
Run 2: 2347.7; Run 3: 2306.9; Run 1: 2270.9
SE +/- 12.19, N = 3; SE +/- 18.18, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN 21.04, Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM (eNb Mb/s, More Is Better)
Run 3: 146.9; Run 1: 142.2; Run 2: 142.1
SE +/- 0.20, N = 3; SE +/- 0.12, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN 21.04, Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM (UE Mb/s, More Is Better)
Run 3: 155.5; Run 1: 151.7; Run 2: 150.5
SE +/- 0.96, N = 3; SE +/- 0.51, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN 21.04, Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM (UE Mb/s, More Is Better)
Run 3: 96.2; Run 1: 94.3; Run 2: 93.2
SE +/- 0.70, N = 3; SE +/- 0.12, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Mobile Neural Network 1.2, Model: SqueezeNetV1.0 (ms, Fewer Is Better)
Run 3: 4.411 (MIN: 4.29 / MAX: 5.37); Run 1: 4.543 (MIN: 4.41 / MAX: 5.05); Run 2: 4.552 (MIN: 4.41 / MAX: 6.47)
SE +/- 0.044, N = 3; SE +/- 0.020, N = 3
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Zstd Compression 1.5.0, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
Run 2: 4263.3; Run 3: 4248.7; Run 1: 4136.7
SE +/- 3.55, N = 3; SE +/- 15.19, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
Run 1: 3870.4; Run 2: 3812.8; Run 3: 3758.1
SE +/- 24.61, N = 3; SE +/- 7.49, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN 21.04, Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM (eNb Mb/s, More Is Better)
Run 3: 161.0; Run 1: 158.3; Run 2: 156.4
SE +/- 0.81, N = 3; SE +/- 0.12, N = 3
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
NCNN 20210525, Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Run 1: 3.39 (MIN: 3.27 / MAX: 3.56); Run 3: 3.47 (MIN: 3.32 / MAX: 4.33); Run 2: 3.48 (MIN: 3.32 / MAX: 4.74)
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210525, Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
Run 1: 5.33 (MIN: 5.2 / MAX: 5.46); Run 2: 5.47 (MIN: 5.23 / MAX: 6.69); Run 3: 5.47 (MIN: 5.26 / MAX: 6.64)
SE +/- 0.05, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210525, Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
Run 1: 8.43 (MIN: 8.22 / MAX: 8.97); Run 3: 8.53 (MIN: 8.23 / MAX: 9.86); Run 2: 8.64 (MIN: 8.31 / MAX: 9.23)
SE +/- 0.05, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
Run 1: 29.7; Run 2: 29.3; Run 3: 29.0
SE +/- 0.09, N = 3; SE +/- 0.27, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
C-Blosc 2.0, Compressor: blosclz (MB/s, More Is Better)
Run 3: 20063.7; Run 2: 19796.8; Run 1: 19606.3
SE +/- 10.14, N = 3; SE +/- 32.57, N = 3
1. (CC) gcc options: -std=gnu99 -O3 -pthread -lrt -lm
Mobile Neural Network 1.2, Model: mobilenetV3 (ms, Fewer Is Better)
Run 1: 1.601 (MIN: 1.56 / MAX: 2.29); Run 3: 1.629 (MIN: 1.57 / MAX: 1.75); Run 2: 1.637 (MIN: 1.58 / MAX: 2.42)
SE +/- 0.006, N = 3; SE +/- 0.016, N = 3
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN 20210525, Target: CPU - Model: mnasnet (ms, Fewer Is Better)
Run 1: 3.34 (MIN: 3.3 / MAX: 3.7); Run 2: 3.37 (MIN: 3.17 / MAX: 4.84); Run 3: 3.41 (MIN: 3.2 / MAX: 4.67)
SE +/- 0.05, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better)
Run 2: 5047.56; Run 3: 5031.12; Run 1: 4949.70
SE +/- 5.95, N = 3; SE +/- 1.40, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
VP9 libvpx Encoding 1.10.0, Speed: Speed 5 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Run 2: 30.99; Run 1: 30.42; Run 3: 30.39
SE +/- 0.03, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Embree 3.13, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
Run 2: 18.53 (MIN: 18.17 / MAX: 19.14); Run 3: 18.33 (MIN: 18.19 / MAX: 18.67); Run 1: 18.20 (MIN: 18.05 / MAX: 18.61)
SE +/- 0.13, N = 3; SE +/- 0.01, N = 3
Mobile Neural Network 1.2, Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
Run 3: 3.305 (MIN: 3.24 / MAX: 15.08); Run 1: 3.318 (MIN: 3.27 / MAX: 4.82); Run 2: 3.359 (MIN: 3.26 / MAX: 3.84)
SE +/- 0.011, N = 3; SE +/- 0.033, N = 3
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
Run 3: 3884.6; Run 1: 3835.5; Run 2: 3823.6
SE +/- 36.64, N = 3; SE +/- 10.65, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
Run 1: 1812.54; Run 2: 1790.82; Run 3: 1786.57
SE +/- 28.41, N = 3; SE +/- 10.88, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
ASTC Encoder 3.0, Preset: Medium (Seconds, Fewer Is Better)
Run 3: 4.0145; Run 1: 4.0492; Run 2: 4.0720
SE +/- 0.0314, N = 3; SE +/- 0.0030, N = 3
1. (CXX) g++ options: -O3 -flto -pthread
Embree 3.13, Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
Run 1: 14.34 (MIN: 14.23 / MAX: 14.6); Run 2: 14.26 (MIN: 13.99 / MAX: 14.72); Run 3: 14.14 (MIN: 13.94 / MAX: 14.5)
SE +/- 0.11, N = 3; SE +/- 0.07, N = 3
NCNN 20210525, Target: CPU - Model: blazeface (ms, Fewer Is Better)
Run 1: 1.42 (MIN: 1.37 / MAX: 1.64); Run 2: 1.44 (MIN: 1.36 / MAX: 1.68); Run 3: 1.44 (MIN: 1.37 / MAX: 2.73)
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
Run 3: 5627.28; Run 2: 5558.74; Run 1: 5554.07
SE +/- 4.50, N = 3; SE +/- 12.02, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
Zstd Compression 1.5.0, Compression Level: 3, Long Mode - Decompression Speed (MB/s, More Is Better)
Run 1: 4555.9; Run 3: 4546.2; Run 2: 4498.5
SE +/- 6.37, N = 3; SE +/- 12.77, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
Run 3: 33.3; Run 2: 33.0; Run 1: 32.9
SE +/- 0.00, N = 3; SE +/- 0.30, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
SVT-AV1 0.8.7, Encoder Mode: Preset 8 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Run 3: 70.03; Run 1: 69.44; Run 2: 69.23
SE +/- 0.14, N = 3; SE +/- 0.16, N = 3
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Mobile Neural Network 1.2, Model: resnet-v2-50 (ms, Fewer Is Better)
Run 1: 29.42 (MIN: 28.55 / MAX: 42.19); Run 3: 29.49 (MIN: 28.92 / MAX: 40.86); Run 2: 29.75 (MIN: 29.2 / MAX: 40.31)
SE +/- 0.27, N = 3; SE +/- 0.23, N = 3
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN 0.3, Target: CPU - Model: DenseNet (ms, Fewer Is Better)
Run 3: 2851.86 (MIN: 2826.92 / MAX: 2907.55); Run 1: 2870.07 (MIN: 2851.69 / MAX: 2906.41); Run 2: 2882.36 (MIN: 2814.73 / MAX: 5048.47)
SE +/- 1.97, N = 3; SE +/- 30.42, N = 12
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
VP9 libvpx Encoding 1.10.0, Speed: Speed 5 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Run 2: 15.40; Run 1: 15.28; Run 3: 15.24
SE +/- 0.17, N = 3; SE +/- 0.11, N = 3
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Embree 3.13, Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
Run 3: 15.97 (MIN: 15.66 / MAX: 16.58); Run 1: 15.93 (MIN: 15.85 / MAX: 16.15); Run 2: 15.80 (MIN: 15.63 / MAX: 16.19)
SE +/- 0.17, N = 3; SE +/- 0.08, N = 3
VP9 libvpx Encoding 1.10.0, Speed: Speed 0 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Run 3: 6.38; Run 2: 6.38; Run 1: 6.32
SE +/- 0.03, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Timed GDB GNU Debugger Compilation 10.2, Time To Compile (Seconds, Fewer Is Better)
Run 2: 56.06; Run 3: 56.26; Run 1: 56.60
SE +/- 0.09, N = 3; SE +/- 0.16, N = 3
Zstd Compression 1.5.0, Compression Level: 3, Long Mode - Compression Speed (MB/s, More Is Better)
Run 1: 1319.6; Run 3: 1311.5; Run 2: 1307.7
SE +/- 4.36, N = 3; SE +/- 4.76, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma
SVT-AV1 0.8.7, Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Run 1: 18.38; Run 2: 18.25; Run 3: 18.22
SE +/- 0.05, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
NCNN 20210525, Target: CPU - Model: googlenet (ms, Fewer Is Better)
Run 1: 12.61 (MIN: 12.52 / MAX: 12.69); Run 3: 12.63 (MIN: 12.28 / MAX: 12.8); Run 2: 12.72 (MIN: 12.43 / MAX: 12.91)
SE +/- 0.02, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
dav1d 0.9.0, Video Input: Chimera 1080p (FPS, More Is Better)
Run 1: 820.43 (MIN: 635.4 / MAX: 1129.6); Run 2: 819.80 (MIN: 635.09 / MAX: 1147.17); Run 3: 813.38 (MIN: 630.35 / MAX: 1137.18)
SE +/- 1.49, N = 3; SE +/- 1.09, N = 3
1. (CC) gcc options: -pthread -lm
TNN 0.3, Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
Run 2: 284.09 (MIN: 282.55 / MAX: 286.38); Run 3: 285.36 (MIN: 282.61 / MAX: 296.95); Run 1: 286.23 (MIN: 285.92 / MAX: 286.65)
SE +/- 0.57, N = 3; SE +/- 2.10, N = 3
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
Run 3: 26422.40; Run 1: 26301.99; Run 2: 26239.81
SE +/- 17.15, N = 3; SE +/- 4.43, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3
SVT-AV1 0.8.7, Encoder Mode: Preset 4 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Run 1: 5.431; Run 2: 5.401; Run 3: 5.396
SE +/- 0.034, N = 3; SE +/- 0.024, N = 3
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Embree 3.13, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
Run 1: 16.01 (MIN: 15.87 / MAX: 16.33); Run 2: 15.95 (MIN: 15.78 / MAX: 16.39); Run 3: 15.92 (MIN: 15.73 / MAX: 16.29)
SE +/- 0.03, N = 3; SE +/- 0.04, N = 3
NCNN 20210525, Target: CPU - Model: vgg16 (ms, Fewer Is Better)
Run 1: 62.06 (MIN: 61.88 / MAX: 62.6); Run 2: 62.08 (MIN: 61.89 / MAX: 64.13); Run 3: 62.43 (MIN: 62.27 / MAX: 62.95)
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Embree 3.13, Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
Run 1: 16.41 (MIN: 16.26 / MAX: 16.8); Run 3: 16.34 (MIN: 16.2 / MAX: 16.73); Run 2: 16.31 (MIN: 16.18 / MAX: 16.71)
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
NCNN 20210525, Target: CPU - Model: mobilenet (ms, Fewer Is Better)
Run 1: 15.69 (MIN: 15.36 / MAX: 17.55); Run 2: 15.77 (MIN: 15.35 / MAX: 16.37); Run 3: 15.78 (MIN: 15.25 / MAX: 25.89)
SE +/- 0.02, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210525, Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
Run 1: 23.09 (MIN: 22.93 / MAX: 23.33); Run 3: 23.19 (MIN: 22.9 / MAX: 26.04); Run 2: 23.22 (MIN: 22.94 / MAX: 24.13)
SE +/- 0.02, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Embree 3.13, Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
Run 3: 14.82 (MIN: 14.71 / MAX: 15.06); Run 2: 14.80 (MIN: 14.63 / MAX: 15.07); Run 1: 14.74 (MIN: 14.68 / MAX: 14.92)
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3
ASTC Encoder 3.0, Preset: Thorough (Seconds, Fewer Is Better)
Run 3: 9.3285; Run 2: 9.3489; Run 1: 9.3677
SE +/- 0.0027, N = 3; SE +/- 0.0103, N = 3
1. (CXX) g++ options: -O3 -flto -pthread
dav1d 0.9.0, Video Input: Summer Nature 1080p (FPS, More Is Better)
Run 2: 770.74 (MIN: 651.67 / MAX: 836.2); Run 1: 770.57 (MIN: 661.07 / MAX: 835.71); Run 3: 767.77 (MIN: 632.05 / MAX: 833.44)
SE +/- 0.13, N = 3; SE +/- 1.29, N = 3
1. (CC) gcc options: -pthread -lm
Timed FFmpeg Compilation 4.4, Time To Compile (Seconds, Fewer Is Better)
Run 2: 48.33; Run 3: 48.41; Run 1: 48.51
SE +/- 0.05, N = 3; SE +/- 0.08, N = 3
dav1d 0.9.0, Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
Run 3: 492.52 (MIN: 392.25 / MAX: 787.53); Run 1: 491.13 (MIN: 392.46 / MAX: 737.75); Run 2: 490.72 (MIN: 392.28 / MAX: 749.63)
SE +/- 0.13, N = 3; SE +/- 1.02, N = 3
1. (CC) gcc options: -pthread -lm
NCNN 20210525, Target: CPU - Model: alexnet (ms, Fewer Is Better). Results: run 3 = 12.30, run 2 = 12.33, run 1 = 12.34. SE +/- 0.01, N = 3; SE +/- 0.02, N = 3. MIN: 12.19 / MAX: 13.7; MIN: 12.21 / MAX: 20.83; MIN: 12.25 / MAX: 12.43. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
SVT-AV1 0.8.7, Encoder Mode: Preset 4 - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: run 3 = 1.634, run 1 = 1.631, run 2 = 1.629. SE +/- 0.003, N = 3; SE +/- 0.004, N = 3. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
BRL-CAD 7.32.2, VGR Performance Metric (VGR Performance Metric, More Is Better). Results: run 1 = 187701, run 2 = 187547, run 3 = 187146. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm
TNN 0.3, Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better). Results: run 2 = 261.69, run 3 = 261.97, run 1 = 262.42. SE +/- 0.04, N = 3; SE +/- 0.07, N = 3. MIN: 261.37 / MAX: 268.05; MIN: 261.39 / MAX: 268.58; MIN: 261.96 / MAX: 268.6. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
dav1d 0.9.0, Video Input: Summer Nature 4K (FPS, More Is Better). Results: run 3 = 191.20, run 2 = 191.09, run 1 = 190.74. SE +/- 0.08, N = 3; SE +/- 0.10, N = 3. MIN: 167.42 / MAX: 198.75; MIN: 165.76 / MAX: 198.45; MIN: 167.81 / MAX: 197.99. (CC) gcc options: -pthread -lm
NCNN 20210525, Target: CPU - Model: resnet18 (ms, Fewer Is Better). Results: run 2 = 13.68, run 3 = 13.68, run 1 = 13.71. SE +/- 0.02, N = 3; SE +/- 0.02, N = 3. MIN: 13.49 / MAX: 14.06; MIN: 13.49 / MAX: 13.88; MIN: 13.51 / MAX: 14.03. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Zstd Compression 1.5.0, Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better). Results: run 2 = 4653.7, run 1 = 4653.3, run 3 = 4643.8. SE +/- 3.98, N = 3; SE +/- 7.50, N = 7. (CC) gcc options: -O3 -pthread -lz -llzma
TNN 0.3, Target: CPU - Model: SqueezeNet v2 (ms, Fewer Is Better). Results: run 1 = 58.66, run 2 = 58.72, run 3 = 58.73. SE +/- 0.02, N = 3; SE +/- 0.04, N = 3. MIN: 58.6 / MAX: 58.91; MIN: 58.6 / MAX: 60.74; MIN: 58.61 / MAX: 59.06. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
GROMACS 2021.2, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better). Results: run 2 = 0.964, run 3 = 0.963, run 1 = 0.963. SE +/- 0.001, N = 3; SE +/- 0.000, N = 3. (CXX) g++ options: -O3 -pthread
NCNN 20210525, Target: CPU - Model: resnet50 (ms, Fewer Is Better). Results: run 1 = 22.50, run 2 = 22.50, run 3 = 22.52. SE +/- 0.03, N = 3; SE +/- 0.06, N = 3. MIN: 22.25 / MAX: 22.96; MIN: 21.98 / MAX: 22.91; MIN: 22.24 / MAX: 23.04. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
VP9 libvpx Encoding 1.10.0, Speed: Speed 0 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: run 3 = 14.19, run 2 = 14.19, run 1 = 14.18. SE +/- 0.03, N = 3; SE +/- 0.02, N = 3. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
ASTC Encoder 3.0, Preset: Exhaustive (Seconds, Fewer Is Better). Results: run 1 = 50.20, run 3 = 50.21, run 2 = 50.24. SE +/- 0.02, N = 3; SE +/- 0.02, N = 3. (CXX) g++ options: -O3 -flto -pthread
NCNN 20210525, Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better). Results: run 1 = 17.57, run 2 = 17.57, run 3 = 17.57. SE +/- 0.02, N = 3; SE +/- 0.03, N = 3. MIN: 17.33 / MAX: 27.34; MIN: 17.36 / MAX: 18.94; MIN: 17.31 / MAX: 18.24. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Intel Open Image Denoise 1.4.0, Run: RTLightmap.hdr.4096x4096 (Images / Sec, More Is Better). Results: run 3 = 0.23, run 2 = 0.23, run 1 = 0.23. SE +/- 0.00, N = 3; SE +/- 0.00, N = 3.
Intel Open Image Denoise 1.4.0, Run: RT.ldr_alb_nrm.3840x2160 (Images / Sec, More Is Better). Results: run 3 = 0.46, run 2 = 0.46, run 1 = 0.46. SE +/- 0.00, N = 3; SE +/- 0.00, N = 3.
Intel Open Image Denoise 1.4.0, Run: RT.hdr_alb_nrm.3840x2160 (Images / Sec, More Is Better). Results: run 3 = 0.46, run 2 = 0.46, run 1 = 0.46. SE +/- 0.00, N = 3; SE +/- 0.00, N = 3.
Mobile Neural Network 1.2, Model: MobileNetV2_224 (ms, Fewer Is Better). Results: run 3 = 2.783, run 2 = 2.909, run 1 = 3.151. SE +/- 0.007, N = 3; SE +/- 0.139, N = 3. MIN: 2.67 / MAX: 3.57; MIN: 2.7 / MAX: 4.09; MIN: 2.69 / MAX: 5.79. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
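Most results above differ between the three runs by well under one percent, while the Mobile Neural Network MobileNetV2_224 result shows a visibly larger gap. As a rough illustration only, the relative spread between the best and worst run can be computed as sketched below. The helper name `spread_percent` is hypothetical and not part of the Phoronix Test Suite; the input values are copied from the MobileNetV2_224 and NCNN squeezenet_ssd results.

```python
def spread_percent(values):
    """Return (max - min) / min * 100 for a sequence of positive results."""
    values = list(values)
    lowest, highest = min(values), max(values)
    return (highest - lowest) / lowest * 100.0


# Reported averages in ms (fewer is better), copied from the results above.
mnn_mobilenetv2_ms = [2.783, 2.909, 3.151]
ncnn_squeezenet_ssd_ms = [17.57, 17.57, 17.57]

mnn_spread = spread_percent(mnn_mobilenetv2_ms)
ncnn_spread = spread_percent(ncnn_squeezenet_ssd_ms)

print(f"MobileNetV2_224 spread: {mnn_spread:.1f}%")
print(f"squeezenet_ssd spread:  {ncnn_spread:.1f}%")
```

This is a simple best-to-worst ratio, not a significance test; the reported SE values would be needed to judge whether a gap is larger than run-to-run noise.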
Phoronix Test Suite v10.8.4