October Intel 6800K Intel Core i7-6800K testing with a MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) and Zotac NVIDIA NV137 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2110290-TJ-OCTOBERIN62 .
October Intel 6800K Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution A B C Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads) MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) Intel Xeon E7 v4/Xeon 16GB 120GB TOSHIBA TR150 Zotac NVIDIA NV137 2GB Realtek ALC1150 MX279 Intel I218-LM + Intel I210 Ubuntu 20.10 5.8.0-50-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 nouveau 4.3 Mesa 20.2.1 GCC 10.2.0 ext4 1920x1080 GCC 10.3.0 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - A: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - B: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - C: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-poYruo/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038 Python Details - A: Python 3.8.6 - B: Python 3.8.6 - C: Python 3.8.10 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
October Intel 6800K yquake2: Software CPU - 1920 x 1080 blosc: blosclz npb: BT.C npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: LU.C npb: MG.C npb: SP.B npb: SP.C lczero: BLAS lczero: Eigen simdjson: Kostya simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID compress-zstd: 3 - Compression Speed compress-zstd: 3 - Decompression Speed compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 3 - Compression Speed compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed jpegxl: PNG - 5 jpegxl: PNG - 7 jpegxl: PNG - 8 jpegxl: JPEG - 5 jpegxl: JPEG - 7 jpegxl: JPEG - 8 jpegxl-decode: 1 jpegxl-decode: All srsran: OFDM_Test srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit aom-av1: Speed 0 Two-Pass - Bosphorus 4K aom-av1: Speed 4 Two-Pass - Bosphorus 4K aom-av1: Speed 6 Realtime - Bosphorus 4K aom-av1: Speed 6 Two-Pass - Bosphorus 4K aom-av1: Speed 8 Realtime - Bosphorus 4K aom-av1: Speed 9 Realtime - Bosphorus 4K aom-av1: Speed 10 Realtime - Bosphorus 4K aom-av1: Speed 0 Two-Pass - Bosphorus 1080p aom-av1: Speed 4 Two-Pass - Bosphorus 1080p aom-av1: Speed 6 Realtime - Bosphorus 1080p aom-av1: Speed 6 Two-Pass - Bosphorus 1080p aom-av1: Speed 8 Realtime - Bosphorus 1080p aom-av1: Speed 9 Realtime - Bosphorus 1080p aom-av1: Speed 10 Realtime - Bosphorus 1080p kvazaar: Bosphorus 4K - Slow kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Slow kvazaar: Bosphorus 1080p - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast kvazaar: Bosphorus 1080p - Ultra Fast svt-av1: Preset 4 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 4 - Bosphorus 1080p svt-av1: Preset 8 - Bosphorus 1080p openvkl: vklBenchmark ISPC openvkl: vklBenchmark Scalar build-ffmpeg: Time To Compile build-gcc: Time To Compile build-gdb: Time To Compile build-linux-kernel: Time To Compile build-llvm: Ninja build-llvm: Unix Makefiles yafaray: Total Time For Sample Scene encode-flac: WAV To FLAC tachyon: Total Time synthmark: VoiceMark_100 cpuminer-opt: Magi cpuminer-opt: x25x cpuminer-opt: Deepcoin cpuminer-opt: Ringcoin cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Skeincoin cpuminer-opt: Myriad-Groestl cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin openssl: SHA256 openssl: RSA4096 openssl: RSA4096 openssl: openssl: gromacs: MPI CPU - water_GMX50_bare astcenc: Medium astcenc: Thorough astcenc: Exhaustive gimp: resize gimp: rotate gimp: auto-levels gimp: unsharp-mask stress-ng: MMAP stress-ng: NUMA stress-ng: MEMFD stress-ng: Atomic stress-ng: Crypto stress-ng: Malloc stress-ng: RdRand stress-ng: Forking stress-ng: IO_uring stress-ng: SENDFILE stress-ng: CPU Cache stress-ng: CPU Stress stress-ng: Semaphores stress-ng: Matrix Math stress-ng: Vector Math stress-ng: Memory Copying stress-ng: Socket Activity stress-ng: Context Switching stress-ng: Glibc C String Functions stress-ng: Glibc Qsort Data Sorting stress-ng: System V Message Passing redis: LPUSH and LPOP: lpop redis: LPUSH and LPOP: lpush redis: GET redis: MIX redis: SET mnn: mobilenetV3 mnn: squeezenetv1.1 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m tnn: CPU - DenseNet tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v2 tnn: CPU - SqueezeNet v1.1 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU ecp-candle: P1B2 ecp-candle: P3B1 ecp-candle: P3B2 nginx: 1 nginx: 20 nginx: 100 nginx: 200 nginx: 500 nginx: 1000 nginx: Long Connection - 100 nginx: Long Connection - 500 nginx: Long Connection - 1000 nginx: Short Connection - 100 nginx: Short Connection - 500 nginx: Short Connection - 1000 natron: Spaceship apache: 1 apache: 20 apache: 100 apache: 200 apache: 500 apache: 1000 compress-rar: Linux Source Tree Archiving To RAR brl-cad: VGR Performance Metric opencv: Features 2D opencv: Object Detection opencv: DNN - Deep Neural Network A B C 81.4 10968.6 15318.8 2226.81 517.66 714.76 7986.56 17628.14 11493.61 6921.11 6974.43 1025 947 2.56 0.86 3.56 4 1797.3 2994.7 313 3055.8 27.8 2706.8 666.8 3183.1 349.9 3242.9 21.5 2669.6 2095.9 347.5 2949.8 31.4 2732.8 270.1 3067.6 257.1 3150.2 13.6 2784.1 22.31 5.55 0.6 53.5 53.18 20.29 37.18 176.21 81800000 286.7 96.9 288.4 161.3 319 103.8 318.5 195.6 92.2 45.3 102.9 61.4 392.29 113.61 354.61 317.74 0.06 2.24 5.72 4.54 21.71 29.55 33.12 0.15 5.1 5.37 13.36 69.91 87.19 94.39 3.81 3.89 17.82 18.35 9.06 15.97 36.96 65.48 0.788 8.32 2.15 26.269 43 20 101.203 1646.508 116.071 177.022 1293.41 1326.17 259.219 20.571 161.0541 614.707 145.84 155.57 5052.16 1089.77 192010 1270.12 22050 7233.62 14780 27340 38570 1807041190 1584 106261.2 1591.9 105612.4 0.655 6.1541 11.9933 109.2221 12.326 20.784 16.525 19.105 30.72 121.55 452.15 208402.05 1104.65 49310079.9 243890.96 36131.52 27562.44 65515.94 112.4 11683.25 828934.47 28847.66 17747.32 2655.54 4471.96 2041838.46 757524.58 89.2 3883131.92 780156.69 798023.44 698504.3 650871.26 514913.77 2.401 5.041 47.305 6.798 3.943 5.374 48.224 19.21 5.86 5.26 4.92 5.31 7.96 2.24 17.12 60.17 17.77 14.12 31.63 32.49 25.15 13.1 3833.182 324.353 75.35 289.808 309 586 56 17608 3574 51.664 866.048 1007.873 25112.59 321094.09 286587.24 273601.82 243132.5 223356.35 81504.87 77954.06 76222.86 57682.81 54430.97 53603.43 1.6 4277.13 33372.71 56992.42 54884.91 48693.76 47762.41 88.669 69673 247660 114640 15097 82.4 10939.8 15433.95 2222.73 679.98 693.83 8068.20 17592.95 11439.54 6939.05 6946.70 993 951 2.55 0.86 3.53 4.01 1798.9 2991.1 315.8 3049.5 27.9 2685.0 675.6 3189.8 357.9 3246.0 21.6 2660.9 2111.5 346.5 2954.1 31.4 2730.7 275.7 3069.4 253.2 3135.2 13.9 2786.7 22.34 5.56 0.6 53.01 52.87 20.47 37.10 176.11 81333333 279.4 94.2 279.3 158.1 307.9 103.1 311.8 192.7 92.4 45.5 103.1 61.1 394.09 113.39 351.41 317.26 0.06 2.24 5.78 4.55 21.64 29.53 33.02 0.15 5.10 5.33 13.31 69.76 86.91 94.71 3.82 3.93 17.85 18.36 9.08 15.95 37.08 65.27 0.789 8.385 2.169 26.486 43 20 96.608 1636.952 107.449 153.543 1152.265 1181.922 265.968 22.941 161.0108 614.328 146.00 155.46 5006.64 1134.09 191597 1250.23 22291 7232.25 14987 27360 38643 1806881950 1580.1 106223.1 1568.9 105271.9 0.650 6.1277 11.9990 109.1390 11.879 16.277 16.584 19.121 37.26 123.02 451.73 209318.42 1104.18 49366861.97 243898.56 36321.52 27678.66 65492.37 102.24 11574.90 828486.37 29006.61 17752.73 2663.57 4370.16 2088361.07 746679.27 89.41 3870784.75 775337.29 790620.96 457091.28 461731.16 505314.52 2.395 5.033 46.385 6.927 3.976 5.407 49.273 19.68 5.79 5.26 4.94 5.09 7.82 2.25 17.22 59.65 17.78 14.31 31.69 32.22 25.16 13.10 3839.730 326.440 76.420 291.201 308 584 55 17553 3566 52.431 872.058 1010.621 25115.74 318974.04 285084.05 272542.89 243404.97 219957.57 82411.34 78363.69 77254.72 57269.88 54509.07 53572.83 1.6 4326.37 34939.76 54120.02 55366.14 81.9 10902.1 15469.12 2224.77 687.06 707.98 8062.68 17641.70 11494.86 6869.51 6955.91 999 956 2.55 0.86 3.55 3.98 1782.4 2989.2 313.0 3056.5 27.7 2665.3 674.7 3189.1 355.3 3248.3 21.8 2664.8 2094.0 346.2 2949.7 31.0 2728.8 276.4 3074.3 255.5 3153.3 13.9 2788.3 22.39 5.55 0.6 53.35 53.36 20.65 37.28 175.78 81800000 286.8 96.4 287.3 161.4 318.8 106.0 318.3 195.8 92.3 45.4 102.8 60.9 394.02 113.68 351.26 317.13 0.06 2.24 5.73 4.54 21.69 29.53 33.04 0.15 5.11 5.33 13.32 69.36 86.51 94.58 3.81 3.93 17.86 18.34 9.09 15.95 37.13 65.33 0.793 8.32 2.132 26.408 43 20 96.011 1637.862 109.083 167.569 1152.245 1186.233 259.747 20.744 160.8279 613.995 146.35 156 4997.89 1103.84 191750 1229.99 22030 7253.89 14960 27070 38640 1806806150 1591.6 106536.5 1587.9 105701.2 0.654 6.1442 12.0077 109.1981 13.423 22.398 16.634 18.933 51.18 124.2 452.15 208200.63 1104.48 49293980.41 243884.74 35865.65 29344.02 65182.98 108.02 11391.3 827924.4 28981.15 16065.59 2665.33 4271.88 2062807.92 748976.91 86.93 3878475.74 709680.81 786658.31 455100.67 453693.85 506867.26 2.406 5.039 46.915 6.815 3.98 5.432 48.871 20.28 6.02 5.25 5.01 5.12 7.84 2.28 17.57 60.5 18.08 14.11 32.26 32.59 25.03 13.22 3837.78 324.34 76.078 290.982 308 587 55 17708 3584 51.922 866.203 1002.728 24860.72 318873.1 284523.9 270441.17 240178.52 215715.37 81381.61 77784.73 76529.85 57080.76 54074.02 53298.47 1.6 107.592 69061 262540 106889 15556 OpenBenchmarking.org
yquake2 Renderer: Software CPU - Resolution: 1920 x 1080 OpenBenchmarking.org Frames Per Second, More Is Better yquake2 8.0 Renderer: Software CPU - Resolution: 1920 x 1080 A B C 20 40 60 80 100 SE +/- 0.49, N = 3 SE +/- 0.26, N = 3 81.4 82.4 81.9 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
C-Blosc Compressor: blosclz OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.0 Compressor: blosclz A B C 2K 4K 6K 8K 10K SE +/- 2.22, N = 3 SE +/- 58.36, N = 3 10968.6 10939.8 10902.1 1. (CC) gcc options: -std=gnu99 -O3 -pthread -lrt -lm
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C A B C 3K 6K 9K 12K 15K SE +/- 58.70, N = 3 SE +/- 44.61, N = 3 15318.80 15433.95 15469.12 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C A B C 500 1000 1500 2000 2500 SE +/- 3.20, N = 3 SE +/- 2.68, N = 3 2226.81 2222.73 2224.77 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C A B C 150 300 450 600 750 SE +/- 9.32, N = 12 SE +/- 9.12, N = 15 517.66 679.98 687.06 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D A B C 150 300 450 600 750 SE +/- 7.37, N = 12 SE +/- 6.96, N = 12 714.76 693.83 707.98 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C A B C 2K 4K 6K 8K 10K SE +/- 4.85, N = 3 SE +/- 14.32, N = 3 7986.56 8068.20 8062.68 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C A B C 4K 8K 12K 16K 20K SE +/- 61.10, N = 3 SE +/- 25.48, N = 3 17628.14 17592.95 17641.70 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C A B C 2K 4K 6K 8K 10K SE +/- 25.89, N = 3 SE +/- 8.58, N = 3 11493.61 11439.54 11494.86 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B A B C 1500 3000 4500 6000 7500 SE +/- 0.85, N = 3 SE +/- 24.49, N = 3 6921.11 6939.05 6869.51 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C A B C 1500 3000 4500 6000 7500 SE +/- 7.85, N = 3 SE +/- 13.16, N = 3 6974.43 6946.70 6955.91 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: BLAS A B C 200 400 600 800 1000 SE +/- 11.85, N = 3 SE +/- 9.49, N = 6 1025 993 999 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: Eigen A B C 200 400 600 800 1000 SE +/- 11.36, N = 3 SE +/- 8.63, N = 9 947 951 956 1. (CXX) g++ options: -flto -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 1.0 Throughput Test: Kostya A B C 0.576 1.152 1.728 2.304 2.88 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 2.56 2.55 2.55 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 1.0 Throughput Test: LargeRandom A B C 0.1935 0.387 0.5805 0.774 0.9675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.86 0.86 0.86 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 1.0 Throughput Test: PartialTweets A B C 0.801 1.602 2.403 3.204 4.005 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 3.56 3.53 3.55 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 1.0 Throughput Test: DistinctUserID A B C 0.9023 1.8046 2.7069 3.6092 4.5115 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 4.00 4.01 3.98 1. (CXX) g++ options: -O3 -pthread
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed A B C 400 800 1200 1600 2000 SE +/- 4.90, N = 3 SE +/- 11.54, N = 3 1797.3 1798.9 1782.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed A B C 600 1200 1800 2400 3000 SE +/- 4.15, N = 3 SE +/- 2.62, N = 3 2994.7 2991.1 2989.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed A B C 70 140 210 280 350 SE +/- 1.57, N = 3 SE +/- 0.38, N = 3 313.0 315.8 313.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed A B C 700 1400 2100 2800 3500 SE +/- 5.19, N = 3 SE +/- 1.51, N = 3 3055.8 3049.5 3056.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed A B C 7 14 21 28 35 SE +/- 0.15, N = 3 SE +/- 0.22, N = 3 27.8 27.9 27.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed A B C 600 1200 1800 2400 3000 SE +/- 17.19, N = 3 SE +/- 15.66, N = 3 2706.8 2685.0 2665.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Compression Speed A B C 150 300 450 600 750 SE +/- 7.77, N = 3 SE +/- 6.01, N = 3 666.8 675.6 674.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Decompression Speed A B C 700 1400 2100 2800 3500 SE +/- 2.00, N = 3 SE +/- 2.96, N = 3 3183.1 3189.8 3189.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed A B C 80 160 240 320 400 SE +/- 3.63, N = 3 SE +/- 3.58, N = 3 349.9 357.9 355.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed A B C 700 1400 2100 2800 3500 SE +/- 1.83, N = 3 SE +/- 2.34, N = 3 3242.9 3246.0 3248.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed A B C 5 10 15 20 25 SE +/- 0.12, N = 3 SE +/- 0.00, N = 3 21.5 21.6 21.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed A B C 600 1200 1800 2400 3000 SE +/- 9.85, N = 3 SE +/- 6.41, N = 3 2669.6 2660.9 2664.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 3 - Compression Speed A B C 500 1000 1500 2000 2500 SE +/- 4.60, N = 3 SE +/- 9.69, N = 3 2095.9 2111.5 2094.0 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 8 - Compression Speed A B C 80 160 240 320 400 SE +/- 0.35, N = 3 SE +/- 1.00, N = 3 347.5 346.5 346.2 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 8 - Decompression Speed A B C 600 1200 1800 2400 3000 SE +/- 2.80, N = 3 SE +/- 2.80, N = 3 2949.8 2954.1 2949.7 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 19 - Compression Speed A B C 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.37, N = 3 31.4 31.4 31.0 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 19 - Decompression Speed A B C 600 1200 1800 2400 3000 SE +/- 1.58, N = 3 SE +/- 0.19, N = 3 2732.8 2730.7 2728.8 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 3, Long Mode - Compression Speed A B C 60 120 180 240 300 SE +/- 1.55, N = 3 SE +/- 0.38, N = 3 270.1 275.7 276.4 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 3, Long Mode - Decompression Speed A B C 700 1400 2100 2800 3500 SE +/- 16.10, N = 3 SE +/- 1.53, N = 3 3067.6 3069.4 3074.3 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 8, Long Mode - Compression Speed A B C 60 120 180 240 300 SE +/- 1.37, N = 3 SE +/- 1.13, N = 3 257.1 253.2 255.5 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 8, Long Mode - Decompression Speed A B C 700 1400 2100 2800 3500 SE +/- 17.90, N = 3 SE +/- 3.06, N = 3 3150.2 3135.2 3153.3 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 19, Long Mode - Compression Speed A B C 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 13.6 13.9 13.9 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression Compression Level: 19, Long Mode - Decompression Speed A B C 600 1200 1800 2400 3000 SE +/- 1.06, N = 3 SE +/- 1.27, N = 3 2784.1 2786.7 2788.3 1. *** zstd command line interface 64-bits v1.4.5, by Yann Collet ***
JPEG XL libjxl Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL libjxl 0.5 Input: PNG - Encode Speed: 5 A B C 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 22.31 22.34 22.39 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie
JPEG XL libjxl Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL libjxl 0.5 Input: PNG - Encode Speed: 7 A B C 1.251 2.502 3.753 5.004 6.255 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 5.55 5.56 5.55 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie
JPEG XL libjxl Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL libjxl 0.5 Input: PNG - Encode Speed: 8 A B C 0.135 0.27 0.405 0.54 0.675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.6 0.6 0.6 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie
JPEG XL libjxl Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL libjxl 0.5 Input: JPEG - Encode Speed: 5 A B C 12 24 36 48 60 SE +/- 0.13, N = 3 SE +/- 0.21, N = 3 53.50 53.01 53.35 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie
JPEG XL libjxl Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL libjxl 0.5 Input: JPEG - Encode Speed: 7 A B C 12 24 36 48 60 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 53.18 52.87 53.36 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie
JPEG XL libjxl Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL libjxl 0.5 Input: JPEG - Encode Speed: 8 A B C 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 20.29 20.47 20.65 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie
JPEG XL Decoding libjxl CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding libjxl 0.5 CPU Threads: 1 A B C 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 37.18 37.10 37.28
JPEG XL Decoding libjxl CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding libjxl 0.5 CPU Threads: All A B C 40 80 120 160 200 SE +/- 0.07, N = 3 SE +/- 0.22, N = 3 176.21 176.11 175.78
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test A B C 20M 40M 60M 80M 100M SE +/- 366666.67, N = 3 SE +/- 986576.57, N = 3 81800000 81333333 81800000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM A B C 60 120 180 240 300 SE +/- 3.04, N = 5 SE +/- 0.34, N = 3 286.7 279.4 286.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM A B C 20 40 60 80 100 SE +/- 0.85, N = 5 SE +/- 0.35, N = 3 96.9 94.2 96.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM A B C 60 120 180 240 300 SE +/- 0.53, N = 3 SE +/- 0.55, N = 3 288.4 279.3 287.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM A B C 40 80 120 160 200 SE +/- 0.32, N = 3 SE +/- 1.06, N = 3 161.3 158.1 161.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM A B C 70 140 210 280 350 SE +/- 0.52, N = 3 SE +/- 0.10, N = 3 319.0 307.9 318.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM A B C 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.18, N = 3 103.8 103.1 106.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM A B C 70 140 210 280 350 SE +/- 1.61, N = 3 SE +/- 0.27, N = 3 318.5 311.8 318.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM A B C 40 80 120 160 200 SE +/- 0.50, N = 3 SE +/- 0.20, N = 3 195.6 192.7 195.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM A B C 20 40 60 80 100 SE +/- 0.25, N = 3 SE +/- 0.12, N = 3 92.2 92.4 92.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM A B C 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 45.3 45.5 45.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM A B C 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.29, N = 3 102.9 103.1 102.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM A B C 14 28 42 56 70 SE +/- 0.23, N = 3 SE +/- 0.00, N = 3 61.4 61.1 60.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.2 Video Input: Chimera 1080p A B C 90 180 270 360 450 SE +/- 0.87, N = 3 SE +/- 1.12, N = 3 392.29 394.09 394.02 MIN: 290.8 / MAX: 604.97 MIN: 289.72 / MAX: 617.14 MIN: 290.16 / MAX: 620.67 1. (CC) gcc options: -pthread -lm
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.2 Video Input: Summer Nature 4K A B C 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.16, N = 3 113.61 113.39 113.68 MIN: 107.63 / MAX: 126.7 MIN: 107.23 / MAX: 127.17 MIN: 107.09 / MAX: 127.86 1. (CC) gcc options: -pthread -lm
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.2 Video Input: Summer Nature 1080p A B C 80 160 240 320 400 SE +/- 0.54, N = 3 SE +/- 0.72, N = 3 354.61 351.41 351.26 MIN: 305.08 / MAX: 386.02 MIN: 295.98 / MAX: 384.45 MIN: 292.03 / MAX: 383.28 1. (CC) gcc options: -pthread -lm
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.2 Video Input: Chimera 1080p 10-bit A B C 70 140 210 280 350 SE +/- 0.55, N = 3 SE +/- 0.64, N = 3 317.74 317.26 317.13 MIN: 239.82 / MAX: 500.76 MIN: 239.07 / MAX: 514.5 MIN: 239.09 / MAX: 507.5 1. (CC) gcc options: -pthread -lm
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K A B C 0.0135 0.027 0.0405 0.054 0.0675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.06 0.06 0.06 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K A B C 0.504 1.008 1.512 2.016 2.52 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.24 2.24 2.24 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K A B C 1.3005 2.601 3.9015 5.202 6.5025 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.72 5.78 5.73 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K A B C 1.0238 2.0476 3.0714 4.0952 5.119 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 4.54 4.55 4.54 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K A B C 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 21.71 21.64 21.69 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K A B C 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 29.55 29.53 29.53 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K A B C 8 16 24 32 40 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 33.12 33.02 33.04 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p A B C 0.0338 0.0676 0.1014 0.1352 0.169 SE +/- 0.00, N = 3 0.15 0.15 0.15 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p A B C 1.1498 2.2996 3.4494 4.5992 5.749 SE +/- 0.00, N = 3 5.10 5.10 5.11 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p A B C 1.2083 2.4166 3.6249 4.8332 6.0415 SE +/- 0.02, N = 3 5.37 5.33 5.33 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p A B C 3 6 9 12 15 SE +/- 0.00, N = 3 13.36 13.31 13.32 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p A B C 16 32 48 64 80 SE +/- 0.06, N = 3 69.91 69.76 69.36 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p A B C 20 40 60 80 100 SE +/- 0.06, N = 3 87.19 86.91 86.51 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p A B C 20 40 60 80 100 SE +/- 0.19, N = 3 94.39 94.71 94.58 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Slow A B C 0.8595 1.719 2.5785 3.438 4.2975 SE +/- 0.02, N = 3 3.81 3.82 3.81 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Medium A B C 0.8843 1.7686 2.6529 3.5372 4.4215 SE +/- 0.00, N = 3 3.89 3.93 3.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Slow A B C 4 8 12 16 20 SE +/- 0.01, N = 3 17.82 17.85 17.86 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Medium A B C 5 10 15 20 25 SE +/- 0.01, N = 3 18.35 18.36 18.34 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Very Fast A B C 3 6 9 12 15 SE +/- 0.00, N = 3 9.06 9.08 9.09 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast A B C 4 8 12 16 20 SE +/- 0.01, N = 3 15.97 15.95 15.95 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Very Fast A B C 9 18 27 36 45 SE +/- 0.03, N = 3 36.96 37.08 37.13 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast A B C 15 30 45 60 75 SE +/- 0.13, N = 3 65.48 65.27 65.33 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K A B C 0.1784 0.3568 0.5352 0.7136 0.892 SE +/- 0.001, N = 3 0.788 0.789 0.793 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K A B C 2 4 6 8 10 SE +/- 0.013, N = 3 8.320 8.385 8.320 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p A B C 0.488 0.976 1.464 1.952 2.44 SE +/- 0.008, N = 3 2.150 2.169 2.132 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p A B C 6 12 18 24 30 SE +/- 0.04, N = 3 26.27 26.49 26.41 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenVKL Benchmark: vklBenchmark ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.0 Benchmark: vklBenchmark ISPC A B C 10 20 30 40 50 43 43 43 MIN: 3 / MAX: 530 MIN: 3 / MAX: 534 MIN: 3 / MAX: 531
OpenVKL Benchmark: vklBenchmark Scalar OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.0 Benchmark: vklBenchmark Scalar A B C 5 10 15 20 25 20 20 20 MIN: 2 / MAX: 361 MIN: 2 / MAX: 364 MIN: 2 / MAX: 366
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.4 Time To Compile A B C 20 40 60 80 100 SE +/- 1.03, N = 3 101.20 96.61 96.01
Timed GCC Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GCC Compilation 11.2.0 Time To Compile A B C 400 800 1200 1600 2000 SE +/- 6.23, N = 3 1646.51 1636.95 1637.86
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 10.2 Time To Compile A B C 30 60 90 120 150 SE +/- 1.21, N = 3 116.07 107.45 109.08
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.14 Time To Compile A B C 40 80 120 160 200 SE +/- 0.88, N = 3 177.02 153.54 167.57
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 13.0 Build System: Ninja A B C 300 600 900 1200 1500 SE +/- 0.86, N = 3 1293.41 1152.27 1152.25
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 13.0 Build System: Unix Makefiles A B C 300 600 900 1200 1500 SE +/- 3.12, N = 3 1326.17 1181.92 1186.23
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene A B C 60 120 180 240 300 SE +/- 2.67, N = 9 259.22 265.97 259.75 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.3 WAV To FLAC A B C 5 10 15 20 25 SE +/- 0.88, N = 25 20.57 22.94 20.74 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time A B C 40 80 120 160 200 SE +/- 0.09, N = 3 161.05 161.01 160.83 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 A B C 130 260 390 520 650 SE +/- 1.43, N = 3 614.71 614.33 614.00 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Magi A B C 30 60 90 120 150 SE +/- 0.15, N = 3 145.84 146.00 146.35 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: x25x A B C 30 60 90 120 150 SE +/- 0.04, N = 3 155.57 155.46 156.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Deepcoin A B C 1100 2200 3300 4400 5500 SE +/- 8.94, N = 3 5052.16 5006.64 4997.89 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Ringcoin A B C 200 400 600 800 1000 SE +/- 11.07, N = 3 1089.77 1134.09 1103.84 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Blake-2 S A B C 40K 80K 120K 160K 200K SE +/- 329.16, N = 3 192010 191597 191750 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Garlicoin A B C 300 600 900 1200 1500 SE +/- 6.54, N = 3 1270.12 1250.23 1229.99 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Skeincoin A B C 5K 10K 15K 20K 25K SE +/- 243.39, N = 13 22050 22291 22030 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Myriad-Groestl A B C 1600 3200 4800 6400 8000 SE +/- 13.60, N = 3 7233.62 7232.25 7253.89 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: LBC, LBRY Credits A B C 3K 6K 9K 12K 15K SE +/- 151.69, N = 3 14780 14987 14960 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Quad SHA-256, Pyrite A B C 6K 12K 18K 24K 30K SE +/- 20.00, N = 3 27340 27360 27070 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.18 Algorithm: Triple SHA-256, Onecoin A B C 8K 16K 24K 32K 40K SE +/- 3.33, N = 3 38570 38643 38640 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.0 Algorithm: SHA256 A B C 400M 800M 1200M 1600M 2000M SE +/- 92617.00, N = 3 1807041190 1806881950 1806806150 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 A B C 300 600 900 1200 1500 SE +/- 11.77, N = 3 1584.0 1580.1 1591.6 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 A B C 20K 40K 60K 80K 100K SE +/- 276.34, N = 3 106261.2 106223.1 106536.5 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL OpenBenchmarking.org sign/s, More Is Better OpenSSL A B C 300 600 900 1200 1500 SE +/- 11.06, N = 3 1591.9 1568.9 1587.9 1. OpenSSL 1.1.1f 31 Mar 2020
OpenSSL OpenBenchmarking.org verify/s, More Is Better OpenSSL A B C 20K 40K 60K 80K 100K SE +/- 253.28, N = 3 105612.4 105271.9 105701.2 1. OpenSSL 1.1.1f 31 Mar 2020
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare A B C 0.1474 0.2948 0.4422 0.5896 0.737 SE +/- 0.000, N = 3 0.655 0.650 0.654 1. (CXX) g++ options: -O3 -pthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Medium A B C 2 4 6 8 10 SE +/- 0.0070, N = 3 6.1541 6.1277 6.1442 1. (CXX) g++ options: -O3 -flto -pthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Thorough A B C 3 6 9 12 15 SE +/- 0.01, N = 3 11.99 12.00 12.01 1. (CXX) g++ options: -O3 -flto -pthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Exhaustive A B C 20 40 60 80 100 SE +/- 0.02, N = 3 109.22 109.14 109.20 1. (CXX) g++ options: -O3 -flto -pthread
GIMP Test: resize OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.18 Test: resize A B C 3 6 9 12 15 SE +/- 0.14, N = 4 12.33 11.88 13.42
GIMP Test: rotate OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.18 Test: rotate A B C 5 10 15 20 25 SE +/- 0.04, N = 3 20.78 16.28 22.40
GIMP Test: auto-levels OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.18 Test: auto-levels A B C 4 8 12 16 20 SE +/- 0.04, N = 3 16.53 16.58 16.63
GIMP Test: unsharp-mask OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.18 Test: unsharp-mask A B C 5 10 15 20 25 SE +/- 0.04, N = 3 19.11 19.12 18.93
Stress-NG Test: MMAP OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: MMAP A B C 12 24 36 48 60 SE +/- 3.52, N = 15 30.72 37.26 51.18 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: NUMA OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: NUMA A B C 30 60 90 120 150 SE +/- 0.70, N = 3 121.55 123.02 124.20 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: MEMFD OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: MEMFD A B C 100 200 300 400 500 SE +/- 0.97, N = 3 452.15 451.73 452.15 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Atomic OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Atomic A B C 40K 80K 120K 160K 200K SE +/- 472.73, N = 3 208402.05 209318.42 208200.63 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Crypto OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Crypto A B C 200 400 600 800 1000 SE +/- 0.47, N = 3 1104.65 1104.18 1104.48 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Malloc OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Malloc A B C 11M 22M 33M 44M 55M SE +/- 63327.73, N = 3 49310079.90 49366861.97 49293980.41 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: RdRand OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: RdRand A B C 50K 100K 150K 200K 250K SE +/- 13.53, N = 3 243890.96 243898.56 243884.74 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Forking OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Forking A B C 8K 16K 24K 32K 40K SE +/- 93.80, N = 3 36131.52 36321.52 35865.65 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: IO_uring OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: IO_uring A B C 6K 12K 18K 24K 30K SE +/- 331.89, N = 15 27562.44 27678.66 29344.02 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: SENDFILE OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: SENDFILE A B C 14K 28K 42K 56K 70K SE +/- 65.94, N = 3 65515.94 65492.37 65182.98 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: CPU Cache OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: CPU Cache A B C 30 60 90 120 150 SE +/- 1.66, N = 15 112.40 102.24 108.02 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: CPU Stress OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: CPU Stress A B C 3K 6K 9K 12K 15K SE +/- 108.55, N = 3 11683.25 11574.90 11391.30 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Semaphores OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Semaphores A B C 200K 400K 600K 800K 1000K SE +/- 136.35, N = 3 828934.47 828486.37 827924.40 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Matrix Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Matrix Math A B C 6K 12K 18K 24K 30K SE +/- 22.40, N = 3 28847.66 29006.61 28981.15 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Vector Math A B C 4K 8K 12K 16K 20K SE +/- 2.18, N = 3 17747.32 17752.73 16065.59 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Memory Copying OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Memory Copying A B C 600 1200 1800 2400 3000 SE +/- 3.09, N = 3 2655.54 2663.57 2665.33 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Socket Activity OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Socket Activity A B C 1000 2000 3000 4000 5000 SE +/- 72.96, N = 15 4471.96 4370.16 4271.88 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Context Switching OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Context Switching A B C 400K 800K 1200K 1600K 2000K SE +/- 9940.72, N = 3 2041838.46 2088361.07 2062807.92 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Glibc C String Functions OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Glibc C String Functions A B C 160K 320K 480K 640K 800K SE +/- 5092.80, N = 3 757524.58 746679.27 748976.91 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Glibc Qsort Data Sorting OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Glibc Qsort Data Sorting A B C 20 40 60 80 100 SE +/- 0.14, N = 3 89.20 89.41 86.93 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: System V Message Passing OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: System V Message Passing A B C 800K 1600K 2400K 3200K 4000K SE +/- 6179.16, N = 3 3883131.92 3870784.75 3878475.74 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Redis Memtier / Redis Benchmark Test: LPUSH and LPOP: lpop OpenBenchmarking.org Requests Per Second, More Is Better Redis Memtier / Redis Benchmark Test: LPUSH and LPOP: lpop A B C 200K 400K 600K 800K 1000K SE +/- 5077.18, N = 14 780156.69 775337.29 709680.81 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre 2. Redis server v=6.0.6 sha=00000000:0 malloc=jemalloc-5.2.1 bits=64 build=ca474e00afe358bb
Redis Memtier / Redis Benchmark Test: LPUSH and LPOP: lpush OpenBenchmarking.org Requests Per Second, More Is Better Redis Memtier / Redis Benchmark Test: LPUSH and LPOP: lpush A B C 200K 400K 600K 800K 1000K SE +/- 3929.33, N = 3 798023.44 790620.96 786658.31 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre 2. Redis server v=6.0.6 sha=00000000:0 malloc=jemalloc-5.2.1 bits=64 build=ca474e00afe358bb
Redis Memtier / Redis Benchmark Test: GET OpenBenchmarking.org Operations Per Second, More Is Better Redis Memtier / Redis Benchmark Test: GET A B C 150K 300K 450K 600K 750K SE +/- 3280.47, N = 3 698504.30 457091.28 455100.67 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre 2. Redis server v=6.0.6 sha=00000000:0 malloc=jemalloc-5.2.1 bits=64 build=ca474e00afe358bb
Redis Memtier / Redis Benchmark Test: MIX OpenBenchmarking.org Operations Per Second, More Is Better Redis Memtier / Redis Benchmark Test: MIX A B C 140K 280K 420K 560K 700K SE +/- 3215.23, N = 3 650871.26 461731.16 453693.85 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre 2. Redis server v=6.0.6 sha=00000000:0 malloc=jemalloc-5.2.1 bits=64 build=ca474e00afe358bb
Redis Memtier / Redis Benchmark Test: SET OpenBenchmarking.org Operations Per Second, More Is Better Redis Memtier / Redis Benchmark Test: SET A B C 110K 220K 330K 440K 550K SE +/- 2868.05, N = 3 514913.77 505314.52 506867.26 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre 2. Redis server v=6.0.6 sha=00000000:0 malloc=jemalloc-5.2.1 bits=64 build=ca474e00afe358bb
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenetV3 A B C 0.5414 1.0828 1.6242 2.1656 2.707 SE +/- 0.006, N = 3 2.401 2.395 2.406 MIN: 2.38 / MAX: 2.54 MIN: 2.36 / MAX: 3.31 MIN: 2.38 / MAX: 2.53 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: squeezenetv1.1 A B C 1.1342 2.2684 3.4026 4.5368 5.671 SE +/- 0.015, N = 3 5.041 5.033 5.039 MIN: 5.01 / MAX: 6.34 MIN: 4.93 / MAX: 17.84 MIN: 5.01 / MAX: 5.7 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: resnet-v2-50 A B C 11 22 33 44 55 SE +/- 0.55, N = 3 47.31 46.39 46.92 MIN: 47.04 / MAX: 103.49 MIN: 45.15 / MAX: 62.28 MIN: 46.76 / MAX: 62.76 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: SqueezeNetV1.0 A B C 2 4 6 8 10 SE +/- 0.051, N = 3 6.798 6.927 6.815 MIN: 6.74 / MAX: 7.68 MIN: 6.71 / MAX: 22.85 MIN: 6.77 / MAX: 7.65 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: MobileNetV2_224 A B C 0.8955 1.791 2.6865 3.582 4.4775 SE +/- 0.001, N = 3 3.943 3.976 3.980 MIN: 3.91 / MAX: 4.82 MIN: 3.93 / MAX: 6.42 MIN: 3.94 / MAX: 4.37 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenet-v1-1.0 A B C 1.2222 2.4444 3.6666 4.8888 6.111 SE +/- 0.003, N = 3 5.374 5.407 5.432 MIN: 5.34 / MAX: 7.92 MIN: 5.37 / MAX: 6.44 MIN: 5.37 / MAX: 6.15 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: inception-v3 A B C 11 22 33 44 55 SE +/- 0.35, N = 3 48.22 49.27 48.87 MIN: 48.04 / MAX: 58.82 MIN: 46.8 / MAX: 166.93 MIN: 48.58 / MAX: 107.35 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet A B C 5 10 15 20 25 SE +/- 0.13, N = 3 19.21 19.68 20.28 MIN: 19.13 / MAX: 19.83 MIN: 19.1 / MAX: 20.48 MIN: 20.19 / MAX: 21.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 A B C 2 4 6 8 10 SE +/- 0.12, N = 3 5.86 5.79 6.02 MIN: 5.78 / MAX: 6.97 MIN: 5.49 / MAX: 19.06 MIN: 5.87 / MAX: 17.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 A B C 1.1835 2.367 3.5505 4.734 5.9175 SE +/- 0.07, N = 3 5.26 5.26 5.25 MIN: 5.21 / MAX: 5.48 MIN: 5.08 / MAX: 17.98 MIN: 5.12 / MAX: 15.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 A B C 1.1273 2.2546 3.3819 4.5092 5.6365 SE +/- 0.06, N = 3 4.92 4.94 5.01 MIN: 4.89 / MAX: 5.09 MIN: 4.77 / MAX: 17.68 MIN: 4.81 / MAX: 17.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet A B C 1.1948 2.3896 3.5844 4.7792 5.974 SE +/- 0.03, N = 3 5.31 5.09 5.12 MIN: 5.27 / MAX: 5.52 MIN: 4.98 / MAX: 5.7 MIN: 5.07 / MAX: 5.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 A B C 2 4 6 8 10 SE +/- 0.05, N = 3 7.96 7.82 7.84 MIN: 7.72 / MAX: 19.11 MIN: 7.69 / MAX: 10.67 MIN: 7.78 / MAX: 8.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface A B C 0.513 1.026 1.539 2.052 2.565 SE +/- 0.01, N = 3 2.24 2.25 2.28 MIN: 2.21 / MAX: 2.37 MIN: 2.22 / MAX: 2.53 MIN: 2.25 / MAX: 2.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet A B C 4 8 12 16 20 SE +/- 0.02, N = 3 17.12 17.22 17.57 MIN: 17.03 / MAX: 17.83 MIN: 17.11 / MAX: 18.02 MIN: 17.48 / MAX: 18.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 A B C 14 28 42 56 70 SE +/- 0.06, N = 3 60.17 59.65 60.50 MIN: 59.33 / MAX: 71.39 MIN: 58.91 / MAX: 79.53 MIN: 59.97 / MAX: 68.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 A B C 4 8 12 16 20 SE +/- 0.10, N = 3 17.77 17.78 18.08 MIN: 17.57 / MAX: 26.98 MIN: 17.53 / MAX: 23.77 MIN: 18.01 / MAX: 18.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet A B C 4 8 12 16 20 SE +/- 0.16, N = 3 14.12 14.31 14.11 MIN: 14.05 / MAX: 14.68 MIN: 14.07 / MAX: 16.88 MIN: 14.04 / MAX: 14.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 A B C 7 14 21 28 35 SE +/- 0.01, N = 3 31.63 31.69 32.26 MIN: 31.51 / MAX: 32.27 MIN: 31.39 / MAX: 43.49 MIN: 31.97 / MAX: 35.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny A B C 8 16 24 32 40 SE +/- 0.05, N = 3 32.49 32.22 32.59 MIN: 31.15 / MAX: 39.76 MIN: 30.38 / MAX: 38.15 MIN: 30.67 / MAX: 34.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd A B C 6 12 18 24 30 SE +/- 0.11, N = 3 25.15 25.16 25.03 MIN: 24.74 / MAX: 34.99 MIN: 24.79 / MAX: 33.7 MIN: 24.8 / MAX: 31.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m A B C 3 6 9 12 15 SE +/- 0.05, N = 3 13.10 13.10 13.22 MIN: 13.01 / MAX: 13.83 MIN: 12.94 / MAX: 13.77 MIN: 13.13 / MAX: 13.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
TNN Target: CPU - Model: DenseNet OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: DenseNet A B C 800 1600 2400 3200 4000 SE +/- 4.66, N = 3 3833.18 3839.73 3837.78 MIN: 3767.87 / MAX: 3930.85 MIN: 3783.91 / MAX: 3957.64 MIN: 3801.83 / MAX: 3894.03 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: MobileNet v2 A B C 70 140 210 280 350 SE +/- 1.15, N = 3 324.35 326.44 324.34 MIN: 317.59 / MAX: 338.16 MIN: 318.49 / MAX: 346.83 MIN: 318.79 / MAX: 333.39 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v2 A B C 20 40 60 80 100 SE +/- 0.46, N = 3 75.35 76.42 76.08 MIN: 75.18 / MAX: 75.69 MIN: 75.78 / MAX: 77.88 MIN: 75.86 / MAX: 84.55 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 A B C 60 120 180 240 300 SE +/- 0.13, N = 3 289.81 291.20 290.98 MIN: 289.27 / MAX: 290.57 MIN: 290.41 / MAX: 292.96 MIN: 290.59 / MAX: 291.4 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: yolov4 - Device: OpenMP CPU A B C 70 140 210 280 350 SE +/- 0.17, N = 3 309 308 308 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: bertsquad-10 - Device: OpenMP CPU A B C 130 260 390 520 650 SE +/- 1.42, N = 3 586 584 587 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: fcn-resnet101-11 - Device: OpenMP CPU A B C 13 26 39 52 65 SE +/- 0.17, N = 3 56 55 55 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: shufflenet-v2-10 - Device: OpenMP CPU A B C 4K 8K 12K 16K 20K SE +/- 58.75, N = 3 17608 17553 17708 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: super-resolution-10 - Device: OpenMP CPU A B C 800 1600 2400 3200 4000 SE +/- 11.81, N = 3 3574 3566 3584 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ECP-CANDLE Benchmark: P1B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P1B2 A B C 12 24 36 48 60 51.66 52.43 51.92
ECP-CANDLE Benchmark: P3B1 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B1 A B C 200 400 600 800 1000 866.05 872.06 866.20
ECP-CANDLE Benchmark: P3B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B2 A B C 200 400 600 800 1000 1007.87 1010.62 1002.73
nginx Concurrent Requests: 1 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1 A B C 5K 10K 15K 20K 25K SE +/- 47.90, N = 3 25112.59 25115.74 24860.72 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 20 A B C 70K 140K 210K 280K 350K SE +/- 611.51, N = 3 321094.09 318974.04 318873.10 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 100 A B C 60K 120K 180K 240K 300K SE +/- 60.05, N = 3 286587.24 285084.05 284523.90 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 200 A B C 60K 120K 180K 240K 300K SE +/- 143.68, N = 3 273601.82 272542.89 270441.17 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 500 A B C 50K 100K 150K 200K 250K SE +/- 968.56, N = 3 243132.50 243404.97 240178.52 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1000 A B C 50K 100K 150K 200K 250K SE +/- 2198.07, N = 3 223356.35 219957.57 215715.37 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Nginx Test: Long Connection - Connections: 100 OpenBenchmarking.org Requests Per Second, More Is Better Nginx Test: Long Connection - Connections: 100 A B C 20K 40K 60K 80K 100K SE +/- 19.30, N = 3 81504.87 82411.34 81381.61 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2 2. nginx version: nginx/1.18.0 (Ubuntu)
Nginx Test: Long Connection - Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Nginx Test: Long Connection - Connections: 500 A B C 20K 40K 60K 80K 100K SE +/- 41.26, N = 3 77954.06 78363.69 77784.73 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2 2. nginx version: nginx/1.18.0 (Ubuntu)
Nginx Test: Long Connection - Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Nginx Test: Long Connection - Connections: 1000 A B C 17K 34K 51K 68K 85K SE +/- 44.06, N = 3 76222.86 77254.72 76529.85 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2 2. nginx version: nginx/1.18.0 (Ubuntu)
Nginx Test: Short Connection - Connections: 100 OpenBenchmarking.org Requests Per Second, More Is Better Nginx Test: Short Connection - Connections: 100 A B C 12K 24K 36K 48K 60K SE +/- 12.44, N = 3 57682.81 57269.88 57080.76 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2 2. nginx version: nginx/1.18.0 (Ubuntu)
Nginx Test: Short Connection - Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Nginx Test: Short Connection - Connections: 500 A B C 12K 24K 36K 48K 60K SE +/- 69.09, N = 3 54430.97 54509.07 54074.02 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2 2. nginx version: nginx/1.18.0 (Ubuntu)
Nginx Test: Short Connection - Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Nginx Test: Short Connection - Connections: 1000 A B C 11K 22K 33K 44K 55K SE +/- 229.71, N = 3 53603.43 53572.83 53298.47 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2 2. nginx version: nginx/1.18.0 (Ubuntu)
Natron Input: Spaceship OpenBenchmarking.org FPS, More Is Better Natron 2.4 Input: Spaceship A B C 0.36 0.72 1.08 1.44 1.8 SE +/- 0.01, N = 7 1.6 1.6 1.6
Apache HTTP Server Concurrent Requests: 1 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1 A B 900 1800 2700 3600 4500 SE +/- 13.10, N = 3 4277.13 4326.37 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 20 A B 7K 14K 21K 28K 35K SE +/- 162.08, N = 3 33372.71 34939.76 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 100 A B 12K 24K 36K 48K 60K SE +/- 463.07, N = 3 56992.42 54120.02 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 200 A B 12K 24K 36K 48K 60K SE +/- 141.79, N = 3 54884.91 55366.14 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 500 A 10K 20K 30K 40K 50K 48693.76 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1000 A 10K 20K 30K 40K 50K 47762.41 1. (CC) gcc options: -shared -fPIC -O2 -pthread
RAR Compression Linux Source Tree Archiving To RAR OpenBenchmarking.org Seconds, Fewer Is Better RAR Compression 6.0.2 Linux Source Tree Archiving To RAR A C 20 40 60 80 100 88.67 107.59
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.32.2 VGR Performance Metric A C 15K 30K 45K 60K 75K 69673 69061 1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm
OpenCV Test: Features 2D OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.5.4 Test: Features 2D A C 60K 120K 180K 240K 300K 247660 262540 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenCV Test: Object Detection OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.5.4 Test: Object Detection A C 20K 40K 60K 80K 100K 114640 106889 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenCV Test: DNN - Deep Neural Network OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.5.4 Test: DNN - Deep Neural Network A C 3K 6K 9K 12K 15K 15097 15556 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
Phoronix Test Suite v10.8.4