10900k june Intel Core i9-10900K testing with a Gigabyte Z490 AORUS MASTER (F20d BIOS) and Gigabyte Intel UHD 630 CML GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2106300-IB-10900KJUN95&grr&rdt .
10900k june Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i9-10900K @ 5.30GHz (10 Cores / 20 Threads) Gigabyte Z490 AORUS MASTER (F20d BIOS) Intel Comet Lake PCH 16GB Samsung SSD 970 EVO 500GB Gigabyte Intel UHD 630 CML GT2 3GB (1200MHz) Realtek ALC1220 G237HL Intel Device 15f3 + Intel Wi-Fi 6 AX201 Ubuntu 20.04 5.9.0-050900daily20201012-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.9 4.6 Mesa 20.0.8 OpenCL 2.1 1.2.131 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
10900k june tnn: CPU - DenseNet npb: SP.C brl-cad: VGR Performance Metric gromacs: MPI CPU - water_GMX50_bare srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM oidn: RTLightmap.hdr.4096x4096 npb: BT.C svt-av1: Preset 4 - Bosphorus 4K vpxenc: Speed 0 - Bosphorus 4K mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: squeezenetv1.1 mnn: mobilenetV3 npb: LU.C npb: EP.D npb: SP.B oidn: RT.hdr_alb_nrm.3840x2160 oidn: RT.ldr_alb_nrm.3840x2160 srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM build-gdb: Time To Compile compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed astcenc: Exhaustive embree: Pathtracer - Asian Dragon Obj srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed embree: Pathtracer ISPC - Asian Dragon Obj build-ffmpeg: Time To Compile embree: Pathtracer - Crown vpxenc: Speed 0 - Bosphorus 1080p vpxenc: Speed 5 - Bosphorus 4K embree: Pathtracer - Asian Dragon embree: Pathtracer ISPC - Crown compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed srsran: OFDM_Test compress-zstd: 3 - Decompression Speed compress-zstd: 3 - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed embree: Pathtracer ISPC - Asian Dragon svt-av1: Preset 8 - Bosphorus 4K npb: FT.C srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM svt-av1: Preset 4 - Bosphorus 1080p npb: CG.C npb: MG.C dav1d: Chimera 1080p 10-bit vpxenc: Speed 5 - Bosphorus 1080p tnn: CPU - MobileNet v2 dav1d: Summer Nature 4K tnn: CPU - SqueezeNet v1.1 blosc: blosclz dav1d: Chimera 1080p astcenc: Thorough svt-av1: Preset 8 - Bosphorus 1080p npb: EP.C dav1d: Summer Nature 1080p tnn: CPU - SqueezeNet v2 astcenc: Medium 1 2 3 2870.071 4949.7 187701 0.963 94.3 158.3 0.23 26301.99 1.631 6.32 29.905 3.318 3.151 4.543 29.415 3.083 1.601 26904.82 1838.96 5354.13 0.46 0.46 151.7 461.1 56.595 4653.3 528.8 8.43 17.57 23.09 22.5 12.34 13.71 62.06 12.61 1.42 5.33 3.34 3.15 3.39 4.3 15.69 3835.5 29.7 50.2027 14.7441 138.4 418.5 3870.4 32.9 16.4077 48.51 14.343 14.18 15.28 15.9307 16.0139 4157.2 177.6 127100000 4136.7 2270.9 4555.9 1319.6 18.1986 18.379 12947.66 292.3 461.2 68 142.2 244.1 423.4 5.431 5554.07 11137.19 491.13 30.42 286.226 190.74 262.424 19606.3 820.43 9.3677 69.441 1812.54 770.57 58.661 4.0492 2882.364 5047.56 187547 0.964 93.2 156.4 0.23 26239.81 1.629 6.38 29.321 3.359 2.909 4.552 29.754 3.173 1.637 26820.57 1833.03 5290.88 0.46 0.46 150.5 456.3 56.064 4653.7 546.5 8.64 17.57 23.22 22.50 12.33 13.68 62.08 12.72 1.44 5.47 3.37 3.32 3.48 4.43 15.77 3823.6 29.3 50.2359 14.8025 136.9 414.1 3812.8 33.0 16.3105 48.332 14.2603 14.19 15.40 15.8030 15.9522 4318.1 189.0 122966667 4263.3 2347.7 4498.5 1307.7 18.5302 18.248 13116.82 288.8 457.5 67.8 142.1 238.6 413.2 5.401 5558.74 11121.48 490.72 30.99 284.087 191.09 261.693 19796.8 819.80 9.3489 69.231 1790.82 770.74 58.722 4.0720 2851.863 5031.12 187146 0.963 96.2 161.0 0.23 26422.40 1.634 6.38 28.533 3.305 2.783 4.411 29.488 3.060 1.629 25911.23 1669.82 5126.92 0.46 0.46 155.5 472.3 56.263 4643.8 527.3 8.53 17.57 23.19 22.52 12.30 13.68 62.43 12.63 1.44 5.47 3.41 3.30 3.47 4.55 15.78 3884.6 29.0 50.2140 14.8244 143.3 431.9 3758.1 33.3 16.3384 48.405 14.1407 14.19 15.24 15.9659 15.9174 4243.6 186.3 133566667 4248.7 2306.9 4546.2 1311.5 18.3283 18.217 12454.59 301.8 476.5 70.4 146.9 248.1 431.5 5.396 5627.28 10054.56 492.52 30.39 285.361 191.20 261.967 20063.7 813.38 9.3285 70.032 1786.57 767.77 58.734 4.0145 OpenBenchmarking.org
TNN Target: CPU - Model: DenseNet OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: DenseNet 1 2 3 600 1200 1800 2400 3000 SE +/- 30.42, N = 12 SE +/- 1.97, N = 3 2870.07 2882.36 2851.86 MIN: 2851.69 / MAX: 2906.41 MIN: 2814.73 / MAX: 5048.47 MIN: 2826.92 / MAX: 2907.55 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 1 2 3 1100 2200 3300 4400 5500 SE +/- 5.95, N = 3 SE +/- 1.40, N = 3 4949.70 5047.56 5031.12 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.32.2 VGR Performance Metric 1 2 3 40K 80K 120K 160K 200K 187701 187547 187146 1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare 1 2 3 0.2169 0.4338 0.6507 0.8676 1.0845 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 0.963 0.964 0.963 1. (CXX) g++ options: -O3 -pthread
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM 1 2 3 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.70, N = 3 94.3 93.2 96.2 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM 1 2 3 40 80 120 160 200 SE +/- 0.12, N = 3 SE +/- 0.81, N = 3 158.3 156.4 161.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RTLightmap.hdr.4096x4096 1 2 3 0.0518 0.1036 0.1554 0.2072 0.259 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.23 0.23 0.23
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 1 2 3 6K 12K 18K 24K 30K SE +/- 4.43, N = 3 SE +/- 17.15, N = 3 26301.99 26239.81 26422.40 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K 1 2 3 0.3677 0.7354 1.1031 1.4708 1.8385 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 1.631 1.629 1.634 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K 1 2 3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 6.32 6.38 6.38 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: inception-v3 1 2 3 7 14 21 28 35 SE +/- 0.17, N = 3 SE +/- 0.23, N = 3 29.91 29.32 28.53 MIN: 29.05 / MAX: 39.9 MIN: 28.88 / MAX: 40.74 MIN: 28.11 / MAX: 41.18 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenet-v1-1.0 1 2 3 0.7558 1.5116 2.2674 3.0232 3.779 SE +/- 0.033, N = 3 SE +/- 0.011, N = 3 3.318 3.359 3.305 MIN: 3.27 / MAX: 4.82 MIN: 3.26 / MAX: 3.84 MIN: 3.24 / MAX: 15.08 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: MobileNetV2_224 1 2 3 0.709 1.418 2.127 2.836 3.545 SE +/- 0.139, N = 3 SE +/- 0.007, N = 3 3.151 2.909 2.783 MIN: 2.69 / MAX: 5.79 MIN: 2.7 / MAX: 4.09 MIN: 2.67 / MAX: 3.57 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: SqueezeNetV1.0 1 2 3 1.0242 2.0484 3.0726 4.0968 5.121 SE +/- 0.020, N = 3 SE +/- 0.044, N = 3 4.543 4.552 4.411 MIN: 4.41 / MAX: 5.05 MIN: 4.41 / MAX: 6.47 MIN: 4.29 / MAX: 5.37 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: resnet-v2-50 1 2 3 7 14 21 28 35 SE +/- 0.23, N = 3 SE +/- 0.27, N = 3 29.42 29.75 29.49 MIN: 28.55 / MAX: 42.19 MIN: 29.2 / MAX: 40.31 MIN: 28.92 / MAX: 40.86 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: squeezenetv1.1 1 2 3 0.7139 1.4278 2.1417 2.8556 3.5695 SE +/- 0.050, N = 3 SE +/- 0.018, N = 3 3.083 3.173 3.060 MIN: 3.01 / MAX: 3.14 MIN: 2.97 / MAX: 5.58 MIN: 2.98 / MAX: 3.82 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenetV3 1 2 3 0.3683 0.7366 1.1049 1.4732 1.8415 SE +/- 0.016, N = 3 SE +/- 0.006, N = 3 1.601 1.637 1.629 MIN: 1.56 / MAX: 2.29 MIN: 1.58 / MAX: 2.42 MIN: 1.57 / MAX: 1.75 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 6K 12K 18K 24K 30K SE +/- 44.76, N = 3 SE +/- 23.04, N = 3 26904.82 26820.57 25911.23 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 400 800 1200 1600 2000 SE +/- 2.47, N = 3 SE +/- 28.88, N = 3 1838.96 1833.03 1669.82 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 1 2 3 1100 2200 3300 4400 5500 SE +/- 3.99, N = 3 SE +/- 85.04, N = 3 5354.13 5290.88 5126.92 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.hdr_alb_nrm.3840x2160 1 2 3 0.1035 0.207 0.3105 0.414 0.5175 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.46 0.46 0.46
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.ldr_alb_nrm.3840x2160 1 2 3 0.1035 0.207 0.3105 0.414 0.5175 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.46 0.46 0.46
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM 1 2 3 30 60 90 120 150 SE +/- 0.51, N = 3 SE +/- 0.96, N = 3 151.7 150.5 155.5 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM 1 2 3 100 200 300 400 500 SE +/- 0.73, N = 3 SE +/- 2.21, N = 3 461.1 456.3 472.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 10.2 Time To Compile 1 2 3 13 26 39 52 65 SE +/- 0.09, N = 3 SE +/- 0.16, N = 3 56.60 56.06 56.26
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed 1 2 3 1000 2000 3000 4000 5000 SE +/- 3.98, N = 3 SE +/- 7.50, N = 7 4653.3 4653.7 4643.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed 1 2 3 120 240 360 480 600 SE +/- 7.74, N = 4 SE +/- 5.98, N = 7 528.8 546.5 527.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: regnety_400m 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 8.43 8.64 8.53 MIN: 8.22 / MAX: 8.97 MIN: 8.31 / MAX: 9.23 MIN: 8.23 / MAX: 9.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: squeezenet_ssd 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 17.57 17.57 17.57 MIN: 17.33 / MAX: 27.34 MIN: 17.36 / MAX: 18.94 MIN: 17.31 / MAX: 18.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: yolov4-tiny 1 2 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 23.09 23.22 23.19 MIN: 22.93 / MAX: 23.33 MIN: 22.94 / MAX: 24.13 MIN: 22.9 / MAX: 26.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: resnet50 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 22.50 22.50 22.52 MIN: 22.25 / MAX: 22.96 MIN: 21.98 / MAX: 22.91 MIN: 22.24 / MAX: 23.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 12.34 12.33 12.30 MIN: 12.25 / MAX: 12.43 MIN: 12.21 / MAX: 20.83 MIN: 12.19 / MAX: 13.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 13.71 13.68 13.68 MIN: 13.51 / MAX: 14.03 MIN: 13.49 / MAX: 14.06 MIN: 13.49 / MAX: 13.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: vgg16 1 2 3 14 28 42 56 70 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 62.06 62.08 62.43 MIN: 61.88 / MAX: 62.6 MIN: 61.89 / MAX: 64.13 MIN: 62.27 / MAX: 62.95 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: googlenet 1 2 3 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 12.61 12.72 12.63 MIN: 12.52 / MAX: 12.69 MIN: 12.43 / MAX: 12.91 MIN: 12.28 / MAX: 12.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: blazeface 1 2 3 0.324 0.648 0.972 1.296 1.62 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 1.42 1.44 1.44 MIN: 1.37 / MAX: 1.64 MIN: 1.36 / MAX: 1.68 MIN: 1.37 / MAX: 2.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: efficientnet-b0 1 2 3 1.2308 2.4616 3.6924 4.9232 6.154 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 5.33 5.47 5.47 MIN: 5.2 / MAX: 5.46 MIN: 5.23 / MAX: 6.69 MIN: 5.26 / MAX: 6.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: mnasnet 1 2 3 0.7673 1.5346 2.3019 3.0692 3.8365 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 3.34 3.37 3.41 MIN: 3.3 / MAX: 3.7 MIN: 3.17 / MAX: 4.84 MIN: 3.2 / MAX: 4.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: shufflenet-v2 1 2 3 0.747 1.494 2.241 2.988 3.735 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 3.15 3.32 3.30 MIN: 3.02 / MAX: 3.29 MIN: 3.09 / MAX: 4.08 MIN: 3.05 / MAX: 4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 0.783 1.566 2.349 3.132 3.915 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 3.39 3.48 3.47 MIN: 3.27 / MAX: 3.56 MIN: 3.32 / MAX: 4.74 MIN: 3.32 / MAX: 4.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 1.0238 2.0476 3.0714 4.0952 5.119 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 4.30 4.43 4.55 MIN: 4.13 / MAX: 4.55 MIN: 4.09 / MAX: 6.38 MIN: 4.3 / MAX: 5.62 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: mobilenet 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 15.69 15.77 15.78 MIN: 15.36 / MAX: 17.55 MIN: 15.35 / MAX: 16.37 MIN: 15.25 / MAX: 25.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed 1 2 3 800 1600 2400 3200 4000 SE +/- 10.65, N = 3 SE +/- 36.64, N = 3 3835.5 3823.6 3884.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed 1 2 3 7 14 21 28 35 SE +/- 0.09, N = 3 SE +/- 0.27, N = 3 29.7 29.3 29.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.0 Preset: Exhaustive 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 50.20 50.24 50.21 1. (CXX) g++ options: -O3 -flto -pthread
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Obj 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 14.74 14.80 14.82 MIN: 14.68 / MAX: 14.92 MIN: 14.63 / MAX: 15.07 MIN: 14.71 / MAX: 15.06
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM 1 2 3 30 60 90 120 150 SE +/- 0.44, N = 3 SE +/- 0.19, N = 3 138.4 136.9 143.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM 1 2 3 90 180 270 360 450 SE +/- 1.10, N = 3 SE +/- 0.59, N = 3 418.5 414.1 431.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed 1 2 3 800 1600 2400 3200 4000 SE +/- 24.61, N = 3 SE +/- 7.49, N = 3 3870.4 3812.8 3758.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed 1 2 3 8 16 24 32 40 SE +/- 0.30, N = 3 SE +/- 0.00, N = 3 32.9 33.0 33.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.41 16.31 16.34 MIN: 16.26 / MAX: 16.8 MIN: 16.18 / MAX: 16.71 MIN: 16.2 / MAX: 16.73
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.4 Time To Compile 1 2 3 11 22 33 44 55 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 48.51 48.33 48.41
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown 1 2 3 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 14.34 14.26 14.14 MIN: 14.23 / MAX: 14.6 MIN: 13.99 / MAX: 14.72 MIN: 13.94 / MAX: 14.5
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 1080p 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 14.18 14.19 14.19 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K 1 2 3 4 8 12 16 20 SE +/- 0.17, N = 3 SE +/- 0.11, N = 3 15.28 15.40 15.24 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.17, N = 3 15.93 15.80 15.97 MIN: 15.85 / MAX: 16.15 MIN: 15.63 / MAX: 16.19 MIN: 15.66 / MAX: 16.58
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown 1 2 3 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 16.01 15.95 15.92 MIN: 15.87 / MAX: 16.33 MIN: 15.78 / MAX: 16.39 MIN: 15.73 / MAX: 16.29
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed 1 2 3 900 1800 2700 3600 4500 SE +/- 34.82, N = 3 SE +/- 22.34, N = 3 4157.2 4318.1 4243.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed 1 2 3 40 80 120 160 200 SE +/- 3.24, N = 3 SE +/- 3.13, N = 3 177.6 189.0 186.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test 1 2 3 30M 60M 90M 120M 150M SE +/- 1530068.99, N = 3 SE +/- 437162.57, N = 3 127100000 122966667 133566667 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed 1 2 3 900 1800 2700 3600 4500 SE +/- 3.55, N = 3 SE +/- 15.19, N = 3 4136.7 4263.3 4248.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed 1 2 3 500 1000 1500 2000 2500 SE +/- 12.19, N = 3 SE +/- 18.18, N = 3 2270.9 2347.7 2306.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Decompression Speed 1 2 3 1000 2000 3000 4000 5000 SE +/- 12.77, N = 3 SE +/- 6.37, N = 3 4555.9 4498.5 4546.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Compression Speed 1 2 3 300 600 900 1200 1500 SE +/- 4.76, N = 3 SE +/- 4.36, N = 3 1319.6 1307.7 1311.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon 1 2 3 5 10 15 20 25 SE +/- 0.13, N = 3 SE +/- 0.01, N = 3 18.20 18.53 18.33 MIN: 18.05 / MAX: 18.61 MIN: 18.17 / MAX: 19.14 MIN: 18.19 / MAX: 18.67
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K 1 2 3 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 18.38 18.25 18.22 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 1 2 3 3K 6K 9K 12K 15K SE +/- 12.29, N = 3 SE +/- 108.08, N = 3 12947.66 13116.82 12454.59 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM 1 2 3 70 140 210 280 350 SE +/- 0.15, N = 3 SE +/- 0.72, N = 3 292.3 288.8 301.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM 1 2 3 100 200 300 400 500 SE +/- 0.93, N = 3 SE +/- 0.84, N = 3 461.2 457.5 476.5 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM 1 2 3 16 32 48 64 80 SE +/- 0.13, N = 3 SE +/- 0.06, N = 3 68.0 67.8 70.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM 1 2 3 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 142.2 142.1 146.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM 1 2 3 50 100 150 200 250 SE +/- 0.52, N = 3 SE +/- 2.18, N = 3 244.1 238.6 248.1 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM 1 2 3 90 180 270 360 450 SE +/- 1.36, N = 3 SE +/- 3.74, N = 3 423.4 413.2 431.5 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p 1 2 3 1.222 2.444 3.666 4.888 6.11 SE +/- 0.034, N = 3 SE +/- 0.024, N = 3 5.431 5.401 5.396 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 1 2 3 1200 2400 3600 4800 6000 SE +/- 12.02, N = 3 SE +/- 4.50, N = 3 5554.07 5558.74 5627.28 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 1 2 3 2K 4K 6K 8K 10K SE +/- 5.16, N = 3 SE +/- 119.21, N = 6 11137.19 11121.48 10054.56 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 10-bit 1 2 3 110 220 330 440 550 SE +/- 1.02, N = 3 SE +/- 0.13, N = 3 491.13 490.72 492.52 MIN: 392.46 / MAX: 737.75 MIN: 392.28 / MAX: 749.63 MIN: 392.25 / MAX: 787.53 1. (CC) gcc options: -pthread -lm
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 1080p 1 2 3 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 30.42 30.99 30.39 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: MobileNet v2 1 2 3 60 120 180 240 300 SE +/- 0.57, N = 3 SE +/- 2.10, N = 3 286.23 284.09 285.36 MIN: 285.92 / MAX: 286.65 MIN: 282.55 / MAX: 286.38 MIN: 282.61 / MAX: 296.95 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 4K 1 2 3 40 80 120 160 200 SE +/- 0.10, N = 3 SE +/- 0.08, N = 3 190.74 191.09 191.20 MIN: 167.81 / MAX: 197.99 MIN: 165.76 / MAX: 198.45 MIN: 167.42 / MAX: 198.75 1. (CC) gcc options: -pthread -lm
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 60 120 180 240 300 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 262.42 261.69 261.97 MIN: 261.96 / MAX: 268.6 MIN: 261.37 / MAX: 268.05 MIN: 261.39 / MAX: 268.58 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
C-Blosc Compressor: blosclz OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.0 Compressor: blosclz 1 2 3 4K 8K 12K 16K 20K SE +/- 32.57, N = 3 SE +/- 10.14, N = 3 19606.3 19796.8 20063.7 1. (CC) gcc options: -std=gnu99 -O3 -pthread -lrt -lm
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 1 2 3 200 400 600 800 1000 SE +/- 1.49, N = 3 SE +/- 1.09, N = 3 820.43 819.80 813.38 MIN: 635.4 / MAX: 1129.6 MIN: 635.09 / MAX: 1147.17 MIN: 630.35 / MAX: 1137.18 1. (CC) gcc options: -pthread -lm
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.0 Preset: Thorough 1 2 3 3 6 9 12 15 SE +/- 0.0103, N = 3 SE +/- 0.0027, N = 3 9.3677 9.3489 9.3285 1. (CXX) g++ options: -O3 -flto -pthread
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p 1 2 3 16 32 48 64 80 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 69.44 69.23 70.03 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 400 800 1200 1600 2000 SE +/- 28.41, N = 3 SE +/- 10.88, N = 3 1812.54 1790.82 1786.57 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 1080p 1 2 3 170 340 510 680 850 SE +/- 0.13, N = 3 SE +/- 1.29, N = 3 770.57 770.74 767.77 MIN: 661.07 / MAX: 835.71 MIN: 651.67 / MAX: 836.2 MIN: 632.05 / MAX: 833.44 1. (CC) gcc options: -pthread -lm
TNN Target: CPU - Model: SqueezeNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v2 1 2 3 13 26 39 52 65 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 58.66 58.72 58.73 MIN: 58.6 / MAX: 58.91 MIN: 58.6 / MAX: 60.74 MIN: 58.61 / MAX: 59.06 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.0 Preset: Medium 1 2 3 0.9162 1.8324 2.7486 3.6648 4.581 SE +/- 0.0030, N = 3 SE +/- 0.0314, N = 3 4.0492 4.0720 4.0145 1. (CXX) g++ options: -O3 -flto -pthread
Phoronix Test Suite v10.8.5