Ryzen 9 5950X Compiler Testing AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS) and AMD NAVY_FLOUNDER 12GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103149-PTS-RYZEN95931&grw .
Ryzen 9 5950X Compiler Testing Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 10.2 AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS) AMD Starship/Matisse 32GB 2000GB Corsair Force MP600 + 2000GB AMD NAVY_FLOUNDER 12GB (2855/1000MHz) AMD Device ab28 ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.6-051106-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.9 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.168 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - Python 3.8.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Ryzen 9 5950X Compiler Testing tscp: AI Chess Performance cryptopp: Unkeyed Algorithms compress-lz4: 1 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed crafty: Elapsed Time basis: ETC1S basis: UASTC Level 0 basis: UASTC Level 2 basis: UASTC Level 3 ngspice: C2670 ngspice: C7552 encode-opus: WAV To Opus Encode encode-wavpack: WAV To WavPack astcenc: Medium astcenc: Thorough astcenc: Exhaustive etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 jpegxl: PNG - 5 jpegxl: PNG - 7 jpegxl: PNG - 8 jpegxl: JPEG - 5 jpegxl: JPEG - 7 jpegxl: JPEG - 8 jpegxl-decode: 1 jpegxl-decode: All libraw: Post-Processing Benchmark webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression encode-ogg: WAV To Ogg synthmark: VoiceMark_100 gcrypt: quantlib: mrbayes: Primate Phylogeny Analysis rnnoise: mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU openfoam: Motorbike 30M build-llvm: Time To Compile compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed sysbench: CPU aom-av1: Speed 0 Two-Pass aom-av1: Speed 4 Two-Pass aom-av1: Speed 6 Realtime aom-av1: Speed 6 Two-Pass aom-av1: Speed 8 Realtime graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space tachyon: Total Time svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-vp9: Visual Quality Optimized - Bosphorus 1080p x264: H.264 Video Encoding dav1d: Summer Nature 4K dav1d: Summer Nature 1080p svt-av1: Enc Mode 4 - 1080p svt-av1: Enc Mode 8 - 1080p x265: Bosphorus 4K x265: Bosphorus 1080p c-ray: Total Time - 4K, 16 Rays Per Pixel povray: Trace Time avifenc: 0 avifenc: 2 avifenc: 6 avifenc: 10 avifenc: 6, Lossless avifenc: 10, Lossless build-godot: Time To Compile smallpt: Global Illumination Renderer; 128 Samples gnuradio: Five Back to Back FIR Filters gnuradio: Signal Source (Cosine) gnuradio: FIR Filter gnuradio: IIR Filter gnuradio: FM Deemphasis Filter gnuradio: Hilbert Transform liquid-dsp: 1 - 256 - 57 liquid-dsp: 16 - 256 - 57 liquid-dsp: 32 - 256 - 57 redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET sqlite-speedtest: Timed Time - Size 1,000 simdjson: Kostya simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID GCC 10.2 1965773 545.914609 12330.56 13771.1 72.36 13400.1 71.13 13397.7 11731249 19.896 5.157 15.902 28.126 71.603 62.816 5.484 10.149 4.0524 6.9922 52.9268 1546.299 386.561 245.041 74.12 11.20 1.14 87.35 87.07 38.13 56.53 210.99 78.66 2.274 111.802 203.811 6.414 367.371 1.042 1.652 13.990 5.242 28.813 13.578 966.298 171.186 3196.9 59.869 14.197 5.081 25.065 3.240 2.351 32.344 433 614 99 15049 6721 216.281 211.567 12.42 4.43 3.85 4.23 3.93 5.32 1.83 12.76 57.89 14.11 10.82 25.67 20.77 13.77 17.61 3.95979 9.25967 17.2905 4.46777 3.55467 2757.52 1773.67 0.638664 97.75 370.571 1057.4 4617.1 51.6 4251.7 1425.9 4737.1 1122.6 4886.2 36.6 4350.9 91743.72 0.37 9.20 35.13 29.43 121.13 1166 1056 375 439 2165 454 1115 44.3941 235.04 228.96 208.93 243.69 971.79 6.137 51.774 27.83 89.80 25.089 24.093 43.615 23.538 8.927 2.934 30.977 4.875 79.523 4.674 920.8 4715.4 1063.5 843.1 1055.0 515.8 81844000 1111200000 1164966667 3549910.50 3041527.37 2222217.52 3470419.90 2640316.17 42.599 3.72 1.22 5.64 5.73 OpenBenchmarking.org
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 10.2 400K 800K 1200K 1600K 2000K SE +/- 7442.75, N = 5 1965773 1. (CC) gcc options: -O3 -march=native
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms GCC 10.2 120 240 360 480 600 SE +/- 3.29, N = 15 545.91 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 76.55, N = 3 12330.56 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 38.97, N = 3 13771.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 10.2 16 32 48 64 80 SE +/- 0.86, N = 3 72.36 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 48.22, N = 3 13400.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 10.2 16 32 48 64 80 SE +/- 0.68, N = 6 71.13 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 35.65, N = 6 13397.7 1. (CC) gcc options: -O3
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 10.2 3M 6M 9M 12M 15M SE +/- 26371.45, N = 3 11731249 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S GCC 10.2 5 10 15 20 25 SE +/- 0.04, N = 3 19.90 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 GCC 10.2 1.1603 2.3206 3.4809 4.6412 5.8015 SE +/- 0.023, N = 3 5.157 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 GCC 10.2 4 8 12 16 20 SE +/- 0.05, N = 3 15.90 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 GCC 10.2 7 14 21 28 35 SE +/- 0.04, N = 3 28.13 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 10.2 16 32 48 64 80 SE +/- 0.21, N = 3 71.60 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 10.2 14 28 42 56 70 SE +/- 0.15, N = 3 62.82 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode GCC 10.2 1.2339 2.4678 3.7017 4.9356 6.1695 SE +/- 0.031, N = 5 5.484 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack GCC 10.2 3 6 9 12 15 SE +/- 0.10, N = 5 10.15 1. (CXX) g++ options: -O3 -march=native -rdynamic
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium GCC 10.2 0.9118 1.8236 2.7354 3.6472 4.559 SE +/- 0.0178, N = 3 4.0524 1. (CXX) g++ options: -O3 -march=native -flto -pthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough GCC 10.2 2 4 6 8 10 SE +/- 0.0057, N = 3 6.9922 1. (CXX) g++ options: -O3 -march=native -flto -pthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive GCC 10.2 12 24 36 48 60 SE +/- 0.09, N = 3 52.93 1. (CXX) g++ options: -O3 -march=native -flto -pthread
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 GCC 10.2 300 600 900 1200 1500 SE +/- 2.21, N = 3 1546.30 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 GCC 10.2 80 160 240 320 400 SE +/- 0.37, N = 3 386.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 GCC 10.2 50 100 150 200 250 SE +/- 1.65, N = 3 245.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 GCC 10.2 16 32 48 64 80 SE +/- 0.03, N = 3 74.12 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 GCC 10.2 3 6 9 12 15 SE +/- 0.03, N = 3 11.20 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 8 GCC 10.2 0.2565 0.513 0.7695 1.026 1.2825 SE +/- 0.00, N = 3 1.14 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 GCC 10.2 20 40 60 80 100 SE +/- 0.14, N = 3 87.35 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 87.07 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 8 GCC 10.2 9 18 27 36 45 SE +/- 0.02, N = 3 38.13 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 GCC 10.2 13 26 39 52 65 SE +/- 0.05, N = 3 56.53
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: All GCC 10.2 50 100 150 200 250 SE +/- 0.29, N = 3 210.99
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark GCC 10.2 20 40 60 80 100 SE +/- 0.16, N = 3 78.66 1. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default GCC 10.2 0.5117 1.0234 1.5351 2.0468 2.5585 SE +/- 0.005, N = 3 2.274 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 GCC 10.2 30 60 90 120 150 SE +/- 1.06, N = 3 111.80 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 GCC 10.2 40 80 120 160 200 SE +/- 0.04, N = 3 203.81 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 GCC 10.2 2 4 6 8 10 SE +/- 0.011, N = 3 6.414 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression GCC 10.2 80 160 240 320 400 SE +/- 0.42, N = 3 367.37 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default GCC 10.2 0.2345 0.469 0.7035 0.938 1.1725 SE +/- 0.008, N = 3 1.042 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 GCC 10.2 0.3717 0.7434 1.1151 1.4868 1.8585 SE +/- 0.018, N = 4 1.652 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 10.2 4 8 12 16 20 SE +/- 0.11, N = 3 13.99 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression GCC 10.2 1.1795 2.359 3.5385 4.718 5.8975 SE +/- 0.018, N = 3 5.242 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 10.2 7 14 21 28 35 SE +/- 0.08, N = 3 28.81 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
Ogg Audio Encoding WAV To Ogg OpenBenchmarking.org Seconds, Fewer Is Better Ogg Audio Encoding 1.3.4 WAV To Ogg GCC 10.2 3 6 9 12 15 SE +/- 0.04, N = 3 13.58 1. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 10.2 200 400 600 800 1000 SE +/- 1.26, N = 3 966.30 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 GCC 10.2 40 80 120 160 200 SE +/- 0.29, N = 3 171.19 1. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 10.2 700 1400 2100 2800 3500 SE +/- 33.41, N = 5 3196.9 1. (CXX) g++ options: -O3 -march=native -rdynamic
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 10.2 13 26 39 52 65 SE +/- 0.15, N = 3 59.87 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -lm -lreadline
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 GCC 10.2 4 8 12 16 20 SE +/- 0.04, N = 3 14.20 1. (CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 GCC 10.2 1.1432 2.2864 3.4296 4.5728 5.716 SE +/- 0.010, N = 3 5.081 MIN: 4.92 / MAX: 14.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 GCC 10.2 6 12 18 24 30 SE +/- 0.02, N = 3 25.07 MIN: 23.97 / MAX: 39.95 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 GCC 10.2 0.729 1.458 2.187 2.916 3.645 SE +/- 0.049, N = 3 3.240 MIN: 3.12 / MAX: 11.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 GCC 10.2 0.529 1.058 1.587 2.116 2.645 SE +/- 0.027, N = 3 2.351 MIN: 2.27 / MAX: 7.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 GCC 10.2 8 16 24 32 40 SE +/- 0.09, N = 3 32.34 MIN: 31.33 / MAX: 42.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU GCC 10.2 90 180 270 360 450 SE +/- 1.96, N = 3 433 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU GCC 10.2 130 260 390 520 650 SE +/- 6.71, N = 3 614 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU GCC 10.2 20 40 60 80 100 SE +/- 0.17, N = 3 99 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU GCC 10.2 3K 6K 9K 12K 15K SE +/- 134.84, N = 3 15049 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU GCC 10.2 1400 2800 4200 5600 7000 SE +/- 215.50, N = 12 6721 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 GCC 10.2 50 100 150 200 250 SE +/- 0.56, N = 3 216.28 MIN: 215.1 / MAX: 218.26 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 GCC 10.2 50 100 150 200 250 SE +/- 0.57, N = 3 211.57 MIN: 206.88 / MAX: 212.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet GCC 10.2 3 6 9 12 15 SE +/- 0.16, N = 15 12.42 MIN: 11.7 / MAX: 20.08 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 10.2 0.9968 1.9936 2.9904 3.9872 4.984 SE +/- 0.01, N = 15 4.43 MIN: 4.19 / MAX: 11.09 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 GCC 10.2 0.8663 1.7326 2.5989 3.4652 4.3315 SE +/- 0.02, N = 15 3.85 MIN: 3.74 / MAX: 10.85 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 GCC 10.2 0.9518 1.9036 2.8554 3.8072 4.759 SE +/- 0.01, N = 15 4.23 MIN: 4.15 / MAX: 9.05 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet GCC 10.2 0.8843 1.7686 2.6529 3.5372 4.4215 SE +/- 0.02, N = 15 3.93 MIN: 3.71 / MAX: 6.06 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 GCC 10.2 1.197 2.394 3.591 4.788 5.985 SE +/- 0.02, N = 15 5.32 MIN: 5.15 / MAX: 13.83 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface GCC 10.2 0.4118 0.8236 1.2354 1.6472 2.059 SE +/- 0.01, N = 15 1.83 MIN: 1.77 / MAX: 3.9 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet GCC 10.2 3 6 9 12 15 SE +/- 0.06, N = 15 12.76 MIN: 12.19 / MAX: 19.36 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 GCC 10.2 13 26 39 52 65 SE +/- 0.12, N = 15 57.89 MIN: 55.89 / MAX: 80.86 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 GCC 10.2 4 8 12 16 20 SE +/- 0.05, N = 15 14.11 MIN: 13.84 / MAX: 23.15 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet GCC 10.2 3 6 9 12 15 SE +/- 0.09, N = 15 10.82 MIN: 10.41 / MAX: 17.59 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 GCC 10.2 6 12 18 24 30 SE +/- 0.21, N = 15 25.67 MIN: 24.52 / MAX: 35.96 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny GCC 10.2 5 10 15 20 25 SE +/- 0.17, N = 15 20.77 MIN: 19.69 / MAX: 43.19 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd GCC 10.2 4 8 12 16 20 SE +/- 0.06, N = 15 13.77 MIN: 13.25 / MAX: 23.45 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m GCC 10.2 4 8 12 16 20 SE +/- 0.06, N = 15 17.61 MIN: 16.94 / MAX: 25.97 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU GCC 10.2 0.891 1.782 2.673 3.564 4.455 SE +/- 0.00506, N = 3 3.95979 MIN: 3.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU GCC 10.2 3 6 9 12 15 SE +/- 0.01340, N = 3 9.25967 MIN: 9.1 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU GCC 10.2 4 8 12 16 20 SE +/- 0.09, N = 3 17.29 MIN: 16.58 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU GCC 10.2 1.0052 2.0104 3.0156 4.0208 5.026 SE +/- 0.30276, N = 15 4.46777 MIN: 2.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU GCC 10.2 0.7998 1.5996 2.3994 3.1992 3.999 SE +/- 0.00753, N = 3 3.55467 MIN: 3.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU GCC 10.2 600 1200 1800 2400 3000 SE +/- 2.01, N = 3 2757.52 MIN: 2719.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU GCC 10.2 400 800 1200 1600 2000 SE +/- 5.00, N = 3 1773.67 MIN: 1750.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU GCC 10.2 0.1437 0.2874 0.4311 0.5748 0.7185 SE +/- 0.000722, N = 3 0.638664 MIN: 0.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 10.2 20 40 60 80 100 SE +/- 0.08, N = 3 97.75 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Timed LLVM Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 10.0 Time To Compile GCC 10.2 80 160 240 320 400 SE +/- 2.79, N = 3 370.57
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed GCC 10.2 200 400 600 800 1000 SE +/- 3.93, N = 3 1057.4 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 4617.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed GCC 10.2 12 24 36 48 60 SE +/- 0.20, N = 3 51.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed GCC 10.2 900 1800 2700 3600 4500 SE +/- 6.53, N = 3 4251.7 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed GCC 10.2 300 600 900 1200 1500 SE +/- 2.43, N = 3 1425.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 SE +/- 46.74, N = 3 4737.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed GCC 10.2 200 400 600 800 1000 SE +/- 2.15, N = 3 1122.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 SE +/- 29.99, N = 3 4886.2 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed GCC 10.2 8 16 24 32 40 SE +/- 0.03, N = 3 36.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed GCC 10.2 900 1800 2700 3600 4500 SE +/- 72.38, N = 3 4350.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU GCC 10.2 20K 40K 60K 80K 100K SE +/- 115.96, N = 3 91743.72 1. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm
AOM AV1 Encoder Mode: Speed 0 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 0 Two-Pass GCC 10.2 0.0833 0.1666 0.2499 0.3332 0.4165 SE +/- 0.00, N = 3 0.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 4 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 4 Two-Pass GCC 10.2 3 6 9 12 15 SE +/- 0.02, N = 3 9.20 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Realtime GCC 10.2 8 16 24 32 40 SE +/- 0.16, N = 3 35.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Two-Pass GCC 10.2 7 14 21 28 35 SE +/- 0.26, N = 3 29.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 8 Realtime GCC 10.2 30 60 90 120 150 SE +/- 0.75, N = 3 121.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 10.2 300 600 900 1200 1500 SE +/- 3.67, N = 3 1166 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 10.2 200 400 600 800 1000 SE +/- 3.51, N = 3 1056 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 10.2 80 160 240 320 400 SE +/- 1.00, N = 3 375 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 10.2 100 200 300 400 500 SE +/- 0.33, N = 3 439 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 10.2 500 1000 1500 2000 2500 SE +/- 1.45, N = 3 2165 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 10.2 100 200 300 400 500 SE +/- 0.67, N = 3 454 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 10.2 200 400 600 800 1000 SE +/- 1.33, N = 3 1115 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 44.39 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 10.2 50 100 150 200 250 SE +/- 2.40, N = 12 235.04 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p GCC 10.2 50 100 150 200 250 SE +/- 0.68, N = 3 228.96 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding GCC 10.2 50 100 150 200 250 SE +/- 1.66, N = 9 208.93 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K GCC 10.2 50 100 150 200 250 SE +/- 0.47, N = 3 243.69 MIN: 181.29 / MAX: 252.3 1. (CC) gcc options: -O3 -march=native -pthread -lm
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p GCC 10.2 200 400 600 800 1000 SE +/- 1.38, N = 3 971.79 MIN: 732.02 / MAX: 1055.82 1. (CC) gcc options: -O3 -march=native -pthread -lm
SVT-AV1 Encoder Mode: Enc Mode 4 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p GCC 10.2 2 4 6 8 10 SE +/- 0.014, N = 3 6.137 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-AV1 Encoder Mode: Enc Mode 8 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p GCC 10.2 12 24 36 48 60 SE +/- 0.24, N = 3 51.77 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 10.2 7 14 21 28 35 SE +/- 0.16, N = 3 27.83 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 89.80 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 10.2 6 12 18 24 30 SE +/- 0.07, N = 3 25.09 1. (CC) gcc options: -lm -lpthread -O3 -march=native
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 10.2 6 12 18 24 30 SE +/- 0.09, N = 3 24.09 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 0 GCC 10.2 10 20 30 40 50 SE +/- 0.21, N = 3 43.62 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 GCC 10.2 6 12 18 24 30 SE +/- 0.02, N = 3 23.54 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 GCC 10.2 2 4 6 8 10 SE +/- 0.048, N = 3 8.927 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 10.2 0.6602 1.3204 1.9806 2.6408 3.301 SE +/- 0.014, N = 3 2.934 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 10.2 7 14 21 28 35 SE +/- 0.06, N = 3 30.98 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless GCC 10.2 1.0969 2.1938 3.2907 4.3876 5.4845 SE +/- 0.022, N = 3 4.875 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 79.52
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 10.2 1.0517 2.1034 3.1551 4.2068 5.2585 SE +/- 0.015, N = 3 4.674 1. (CXX) g++ options: -fopenmp -O3 -march=native
GNU Radio Test: Five Back to Back FIR Filters OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters GCC 10.2 200 400 600 800 1000 SE +/- 19.67, N = 9 920.8 1. 3.8.1.0
GNU Radio Test: Signal Source (Cosine) OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) GCC 10.2 1000 2000 3000 4000 5000 SE +/- 16.39, N = 9 4715.4 1. 3.8.1.0
GNU Radio Test: FIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter GCC 10.2 200 400 600 800 1000 SE +/- 2.09, N = 9 1063.5 1. 3.8.1.0
GNU Radio Test: IIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter GCC 10.2 200 400 600 800 1000 SE +/- 1.32, N = 9 843.1 1. 3.8.1.0
GNU Radio Test: FM Deemphasis Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter GCC 10.2 200 400 600 800 1000 SE +/- 0.78, N = 9 1055.0 1. 3.8.1.0
GNU Radio Test: Hilbert Transform OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform GCC 10.2 110 220 330 440 550 SE +/- 0.63, N = 9 515.8 1. 3.8.1.0
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 20M 40M 60M 80M 100M SE +/- 828458.69, N = 5 81844000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 200M 400M 600M 800M 1000M SE +/- 5768882.04, N = 3 1111200000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 200M 400M 600M 800M 1000M SE +/- 497772.82, N = 3 1164966667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP GCC 10.2 800K 1600K 2400K 3200K 4000K SE +/- 26197.04, N = 3 3549910.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD GCC 10.2 700K 1400K 2100K 2800K 3500K SE +/- 39730.96, N = 15 3041527.37 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH GCC 10.2 500K 1000K 1500K 2000K 2500K SE +/- 23396.73, N = 15 2222217.52 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 10.2 700K 1400K 2100K 2800K 3500K SE +/- 36718.95, N = 15 3470419.90 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 10.2 600K 1200K 1800K 2400K 3000K SE +/- 26145.63, N = 15 2640316.17 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 42.60 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya GCC 10.2 0.837 1.674 2.511 3.348 4.185 SE +/- 0.03, N = 3 3.72 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom GCC 10.2 0.2745 0.549 0.8235 1.098 1.3725 SE +/- 0.01, N = 3 1.22 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets GCC 10.2 1.269 2.538 3.807 5.076 6.345 SE +/- 0.05, N = 3 5.64 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID GCC 10.2 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.02, N = 3 5.73 1. (CXX) g++ options: -O3 -march=native -pthread
Phoronix Test Suite v10.8.5