Ryzen 9 5950X Compiler Testing AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS) and AMD NAVY_FLOUNDER 12GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103149-PTS-RYZEN95931&grs .
Ryzen 9 5950X Compiler Testing Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 10.2 AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS) AMD Starship/Matisse 32GB 2000GB Corsair Force MP600 + 2000GB AMD NAVY_FLOUNDER 12GB (2855/1000MHz) AMD Device ab28 ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.6-051106-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.9 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.168 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - Python 3.8.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Ryzen 9 5950X Compiler Testing encode-wavpack: WAV To WavPack onnx: shufflenet-v2-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: yolov4 - OpenMP CPU sysbench: CPU tnn: CPU - SqueezeNet v1.1 tnn: CPU - MobileNet v2 ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 redis: SET redis: GET redis: LPUSH redis: SADD redis: LPOP sqlite-speedtest: Timed Time - Size 1,000 basis: UASTC Level 3 basis: UASTC Level 2 basis: UASTC Level 0 basis: ETC1S astcenc: Exhaustive astcenc: Thorough astcenc: Medium liquid-dsp: 32 - 256 - 57 liquid-dsp: 16 - 256 - 57 liquid-dsp: 1 - 256 - 57 synthmark: VoiceMark_100 webp2: Quality 100, Lossless Compression webp2: Quality 100, Compression Effort 5 webp2: Quality 95, Compression Effort 7 webp2: Quality 75, Compression Effort 7 webp2: Default tachyon: Total Time rnnoise: ngspice: C7552 ngspice: C2670 gcrypt: encode-opus: WAV To Opus Encode encode-ogg: WAV To Ogg onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - f32 - CPU smallpt: Global Illumination Renderer; 128 Samples povray: Trace Time c-ray: Total Time - 4K, 16 Rays Per Pixel build-llvm: Time To Compile build-godot: Time To Compile avifenc: 10, Lossless avifenc: 6, Lossless avifenc: 10 avifenc: 6 avifenc: 2 avifenc: 0 x265: Bosphorus 1080p x265: Bosphorus 4K x264: H.264 Video Encoding svt-vp9: Visual Quality Optimized - Bosphorus 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-av1: Enc Mode 8 - 1080p svt-av1: Enc Mode 4 - 1080p aom-av1: Speed 8 Realtime aom-av1: Speed 6 Two-Pass aom-av1: Speed 6 Realtime aom-av1: Speed 4 Two-Pass aom-av1: Speed 0 Two-Pass dav1d: Summer Nature 1080p dav1d: Summer Nature 4K graphics-magick: HWB Color Space graphics-magick: Noise-Gaussian graphics-magick: Resizing graphics-magick: Enhanced graphics-magick: Sharpen graphics-magick: Rotate graphics-magick: Swirl tscp: AI Chess Performance crafty: Elapsed Time libraw: Post-Processing Benchmark gnuradio: Hilbert Transform gnuradio: FM Deemphasis Filter gnuradio: IIR Filter gnuradio: FIR Filter gnuradio: Signal Source (Cosine) jpegxl-decode: All jpegxl-decode: 1 jpegxl: JPEG - 8 jpegxl: JPEG - 7 jpegxl: JPEG - 5 jpegxl: PNG - 8 jpegxl: PNG - 7 jpegxl: PNG - 5 compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed simdjson: DistinctUserID simdjson: PartialTweets simdjson: LargeRand simdjson: Kostya webp: Quality 100, Lossless, Highest Compression webp: Quality 100, Highest Compression webp: Quality 100, Lossless webp: Quality 100 webp: Default openfoam: Motorbike 30M mrbayes: Primate Phylogeny Analysis etcpak: ETC2 etcpak: ETC1 etcpak: DXT1 cryptopp: Unkeyed Algorithms quantlib: onnx: super-resolution-10 - OpenMP CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU gnuradio: Five Back to Back FIR Filters GCC 10.2 10.149 15049 99 614 433 91743.72 211.567 216.281 17.61 13.77 20.77 25.67 10.82 14.11 57.89 12.76 1.83 5.32 3.93 4.23 3.85 4.43 12.42 32.344 2.351 3.240 25.065 5.081 2640316.17 3470419.90 2222217.52 3041527.37 3549910.50 42.599 28.126 15.902 5.157 19.896 52.9268 6.9922 4.0524 1164966667 1111200000 81844000 966.298 367.371 6.414 203.811 111.802 2.274 44.3941 14.197 62.816 71.603 171.186 5.484 13.578 0.638664 1773.67 2757.52 3.55467 17.2905 9.25967 3.95979 4.674 24.093 25.089 370.571 79.523 4.875 30.977 2.934 8.927 23.538 43.615 89.80 27.83 208.93 228.96 235.04 51.774 6.137 121.13 29.43 35.13 9.20 0.37 971.79 243.69 1115 454 2165 439 375 1056 1166 1965773 11731249 78.66 515.8 1055.0 843.1 1063.5 4715.4 210.99 56.53 38.13 87.07 87.35 1.14 11.20 74.12 4350.9 36.6 4886.2 1122.6 4737.1 1425.9 4251.7 51.6 4617.1 1057.4 13397.7 71.13 13400.1 72.36 13771.1 12330.56 5.73 5.64 1.22 3.72 28.813 5.242 13.990 1.652 1.042 97.75 59.869 245.041 386.561 1546.299 545.914609 3196.9 6721 4.46777 920.8 OpenBenchmarking.org
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack GCC 10.2 3 6 9 12 15 SE +/- 0.10, N = 5 10.15 1. (CXX) g++ options: -O3 -march=native -rdynamic
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU GCC 10.2 3K 6K 9K 12K 15K SE +/- 134.84, N = 3 15049 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU GCC 10.2 20 40 60 80 100 SE +/- 0.17, N = 3 99 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU GCC 10.2 130 260 390 520 650 SE +/- 6.71, N = 3 614 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU GCC 10.2 90 180 270 360 450 SE +/- 1.96, N = 3 433 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU GCC 10.2 20K 40K 60K 80K 100K SE +/- 115.96, N = 3 91743.72 1. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 GCC 10.2 50 100 150 200 250 SE +/- 0.57, N = 3 211.57 MIN: 206.88 / MAX: 212.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 GCC 10.2 50 100 150 200 250 SE +/- 0.56, N = 3 216.28 MIN: 215.1 / MAX: 218.26 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m GCC 10.2 4 8 12 16 20 SE +/- 0.06, N = 15 17.61 MIN: 16.94 / MAX: 25.97 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd GCC 10.2 4 8 12 16 20 SE +/- 0.06, N = 15 13.77 MIN: 13.25 / MAX: 23.45 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny GCC 10.2 5 10 15 20 25 SE +/- 0.17, N = 15 20.77 MIN: 19.69 / MAX: 43.19 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 GCC 10.2 6 12 18 24 30 SE +/- 0.21, N = 15 25.67 MIN: 24.52 / MAX: 35.96 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet GCC 10.2 3 6 9 12 15 SE +/- 0.09, N = 15 10.82 MIN: 10.41 / MAX: 17.59 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 GCC 10.2 4 8 12 16 20 SE +/- 0.05, N = 15 14.11 MIN: 13.84 / MAX: 23.15 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 GCC 10.2 13 26 39 52 65 SE +/- 0.12, N = 15 57.89 MIN: 55.89 / MAX: 80.86 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet GCC 10.2 3 6 9 12 15 SE +/- 0.06, N = 15 12.76 MIN: 12.19 / MAX: 19.36 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface GCC 10.2 0.4118 0.8236 1.2354 1.6472 2.059 SE +/- 0.01, N = 15 1.83 MIN: 1.77 / MAX: 3.9 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 GCC 10.2 1.197 2.394 3.591 4.788 5.985 SE +/- 0.02, N = 15 5.32 MIN: 5.15 / MAX: 13.83 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet GCC 10.2 0.8843 1.7686 2.6529 3.5372 4.4215 SE +/- 0.02, N = 15 3.93 MIN: 3.71 / MAX: 6.06 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 GCC 10.2 0.9518 1.9036 2.8554 3.8072 4.759 SE +/- 0.01, N = 15 4.23 MIN: 4.15 / MAX: 9.05 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 GCC 10.2 0.8663 1.7326 2.5989 3.4652 4.3315 SE +/- 0.02, N = 15 3.85 MIN: 3.74 / MAX: 10.85 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 10.2 0.9968 1.9936 2.9904 3.9872 4.984 SE +/- 0.01, N = 15 4.43 MIN: 4.19 / MAX: 11.09 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet GCC 10.2 3 6 9 12 15 SE +/- 0.16, N = 15 12.42 MIN: 11.7 / MAX: 20.08 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 GCC 10.2 8 16 24 32 40 SE +/- 0.09, N = 3 32.34 MIN: 31.33 / MAX: 42.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 GCC 10.2 0.529 1.058 1.587 2.116 2.645 SE +/- 0.027, N = 3 2.351 MIN: 2.27 / MAX: 7.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 GCC 10.2 0.729 1.458 2.187 2.916 3.645 SE +/- 0.049, N = 3 3.240 MIN: 3.12 / MAX: 11.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 GCC 10.2 6 12 18 24 30 SE +/- 0.02, N = 3 25.07 MIN: 23.97 / MAX: 39.95 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 GCC 10.2 1.1432 2.2864 3.4296 4.5728 5.716 SE +/- 0.010, N = 3 5.081 MIN: 4.92 / MAX: 14.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 10.2 600K 1200K 1800K 2400K 3000K SE +/- 26145.63, N = 15 2640316.17 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 10.2 700K 1400K 2100K 2800K 3500K SE +/- 36718.95, N = 15 3470419.90 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH GCC 10.2 500K 1000K 1500K 2000K 2500K SE +/- 23396.73, N = 15 2222217.52 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD GCC 10.2 700K 1400K 2100K 2800K 3500K SE +/- 39730.96, N = 15 3041527.37 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP GCC 10.2 800K 1600K 2400K 3200K 4000K SE +/- 26197.04, N = 3 3549910.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 42.60 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 GCC 10.2 7 14 21 28 35 SE +/- 0.04, N = 3 28.13 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 GCC 10.2 4 8 12 16 20 SE +/- 0.05, N = 3 15.90 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 GCC 10.2 1.1603 2.3206 3.4809 4.6412 5.8015 SE +/- 0.023, N = 3 5.157 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S GCC 10.2 5 10 15 20 25 SE +/- 0.04, N = 3 19.90 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive GCC 10.2 12 24 36 48 60 SE +/- 0.09, N = 3 52.93 1. (CXX) g++ options: -O3 -march=native -flto -pthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough GCC 10.2 2 4 6 8 10 SE +/- 0.0057, N = 3 6.9922 1. (CXX) g++ options: -O3 -march=native -flto -pthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium GCC 10.2 0.9118 1.8236 2.7354 3.6472 4.559 SE +/- 0.0178, N = 3 4.0524 1. (CXX) g++ options: -O3 -march=native -flto -pthread
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 200M 400M 600M 800M 1000M SE +/- 497772.82, N = 3 1164966667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 200M 400M 600M 800M 1000M SE +/- 5768882.04, N = 3 1111200000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 20M 40M 60M 80M 100M SE +/- 828458.69, N = 5 81844000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 10.2 200 400 600 800 1000 SE +/- 1.26, N = 3 966.30 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression GCC 10.2 80 160 240 320 400 SE +/- 0.42, N = 3 367.37 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 GCC 10.2 2 4 6 8 10 SE +/- 0.011, N = 3 6.414 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 GCC 10.2 40 80 120 160 200 SE +/- 0.04, N = 3 203.81 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 GCC 10.2 30 60 90 120 150 SE +/- 1.06, N = 3 111.80 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default GCC 10.2 0.5117 1.0234 1.5351 2.0468 2.5585 SE +/- 0.005, N = 3 2.274 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 44.39 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 GCC 10.2 4 8 12 16 20 SE +/- 0.04, N = 3 14.20 1. (CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 10.2 14 28 42 56 70 SE +/- 0.15, N = 3 62.82 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 10.2 16 32 48 64 80 SE +/- 0.21, N = 3 71.60 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 GCC 10.2 40 80 120 160 200 SE +/- 0.29, N = 3 171.19 1. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode GCC 10.2 1.2339 2.4678 3.7017 4.9356 6.1695 SE +/- 0.031, N = 5 5.484 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
Ogg Audio Encoding WAV To Ogg OpenBenchmarking.org Seconds, Fewer Is Better Ogg Audio Encoding 1.3.4 WAV To Ogg GCC 10.2 3 6 9 12 15 SE +/- 0.04, N = 3 13.58 1. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU GCC 10.2 0.1437 0.2874 0.4311 0.5748 0.7185 SE +/- 0.000722, N = 3 0.638664 MIN: 0.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU GCC 10.2 400 800 1200 1600 2000 SE +/- 5.00, N = 3 1773.67 MIN: 1750.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU GCC 10.2 600 1200 1800 2400 3000 SE +/- 2.01, N = 3 2757.52 MIN: 2719.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU GCC 10.2 0.7998 1.5996 2.3994 3.1992 3.999 SE +/- 0.00753, N = 3 3.55467 MIN: 3.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU GCC 10.2 4 8 12 16 20 SE +/- 0.09, N = 3 17.29 MIN: 16.58 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU GCC 10.2 3 6 9 12 15 SE +/- 0.01340, N = 3 9.25967 MIN: 9.1 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU GCC 10.2 0.891 1.782 2.673 3.564 4.455 SE +/- 0.00506, N = 3 3.95979 MIN: 3.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 10.2 1.0517 2.1034 3.1551 4.2068 5.2585 SE +/- 0.015, N = 3 4.674 1. (CXX) g++ options: -fopenmp -O3 -march=native
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 10.2 6 12 18 24 30 SE +/- 0.09, N = 3 24.09 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 10.2 6 12 18 24 30 SE +/- 0.07, N = 3 25.09 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Timed LLVM Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 10.0 Time To Compile GCC 10.2 80 160 240 320 400 SE +/- 2.79, N = 3 370.57
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 79.52
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless GCC 10.2 1.0969 2.1938 3.2907 4.3876 5.4845 SE +/- 0.022, N = 3 4.875 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 10.2 7 14 21 28 35 SE +/- 0.06, N = 3 30.98 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 10.2 0.6602 1.3204 1.9806 2.6408 3.301 SE +/- 0.014, N = 3 2.934 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 GCC 10.2 2 4 6 8 10 SE +/- 0.048, N = 3 8.927 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 GCC 10.2 6 12 18 24 30 SE +/- 0.02, N = 3 23.54 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 0 GCC 10.2 10 20 30 40 50 SE +/- 0.21, N = 3 43.62 1. (CXX) g++ options: -O3 -fPIC -lm
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 89.80 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 10.2 7 14 21 28 35 SE +/- 0.16, N = 3 27.83 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding GCC 10.2 50 100 150 200 250 SE +/- 1.66, N = 9 208.93 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p GCC 10.2 50 100 150 200 250 SE +/- 0.68, N = 3 228.96 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 10.2 50 100 150 200 250 SE +/- 2.40, N = 12 235.04 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-AV1 Encoder Mode: Enc Mode 8 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p GCC 10.2 12 24 36 48 60 SE +/- 0.24, N = 3 51.77 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-AV1 Encoder Mode: Enc Mode 4 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p GCC 10.2 2 4 6 8 10 SE +/- 0.014, N = 3 6.137 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 8 Realtime GCC 10.2 30 60 90 120 150 SE +/- 0.75, N = 3 121.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Two-Pass GCC 10.2 7 14 21 28 35 SE +/- 0.26, N = 3 29.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Realtime GCC 10.2 8 16 24 32 40 SE +/- 0.16, N = 3 35.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 4 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 4 Two-Pass GCC 10.2 3 6 9 12 15 SE +/- 0.02, N = 3 9.20 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 0 Two-Pass GCC 10.2 0.0833 0.1666 0.2499 0.3332 0.4165 SE +/- 0.00, N = 3 0.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p GCC 10.2 200 400 600 800 1000 SE +/- 1.38, N = 3 971.79 MIN: 732.02 / MAX: 1055.82 1. (CC) gcc options: -O3 -march=native -pthread -lm
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K GCC 10.2 50 100 150 200 250 SE +/- 0.47, N = 3 243.69 MIN: 181.29 / MAX: 252.3 1. (CC) gcc options: -O3 -march=native -pthread -lm
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 10.2 200 400 600 800 1000 SE +/- 1.33, N = 3 1115 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 10.2 100 200 300 400 500 SE +/- 0.67, N = 3 454 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 10.2 500 1000 1500 2000 2500 SE +/- 1.45, N = 3 2165 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 10.2 100 200 300 400 500 SE +/- 0.33, N = 3 439 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 10.2 80 160 240 320 400 SE +/- 1.00, N = 3 375 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 10.2 200 400 600 800 1000 SE +/- 3.51, N = 3 1056 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 10.2 300 600 900 1200 1500 SE +/- 3.67, N = 3 1166 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 10.2 400K 800K 1200K 1600K 2000K SE +/- 7442.75, N = 5 1965773 1. (CC) gcc options: -O3 -march=native
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 10.2 3M 6M 9M 12M 15M SE +/- 26371.45, N = 3 11731249 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark GCC 10.2 20 40 60 80 100 SE +/- 0.16, N = 3 78.66 1. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm
GNU Radio Test: Hilbert Transform OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform GCC 10.2 110 220 330 440 550 SE +/- 0.63, N = 9 515.8 1. 3.8.1.0
GNU Radio Test: FM Deemphasis Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter GCC 10.2 200 400 600 800 1000 SE +/- 0.78, N = 9 1055.0 1. 3.8.1.0
GNU Radio Test: IIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter GCC 10.2 200 400 600 800 1000 SE +/- 1.32, N = 9 843.1 1. 3.8.1.0
GNU Radio Test: FIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter GCC 10.2 200 400 600 800 1000 SE +/- 2.09, N = 9 1063.5 1. 3.8.1.0
GNU Radio Test: Signal Source (Cosine) OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) GCC 10.2 1000 2000 3000 4000 5000 SE +/- 16.39, N = 9 4715.4 1. 3.8.1.0
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: All GCC 10.2 50 100 150 200 250 SE +/- 0.29, N = 3 210.99
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 GCC 10.2 13 26 39 52 65 SE +/- 0.05, N = 3 56.53
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 8 GCC 10.2 9 18 27 36 45 SE +/- 0.02, N = 3 38.13 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 87.07 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 GCC 10.2 20 40 60 80 100 SE +/- 0.14, N = 3 87.35 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 8 GCC 10.2 0.2565 0.513 0.7695 1.026 1.2825 SE +/- 0.00, N = 3 1.14 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 GCC 10.2 3 6 9 12 15 SE +/- 0.03, N = 3 11.20 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 GCC 10.2 16 32 48 64 80 SE +/- 0.03, N = 3 74.12 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed GCC 10.2 900 1800 2700 3600 4500 SE +/- 72.38, N = 3 4350.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed GCC 10.2 8 16 24 32 40 SE +/- 0.03, N = 3 36.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 SE +/- 29.99, N = 3 4886.2 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed GCC 10.2 200 400 600 800 1000 SE +/- 2.15, N = 3 1122.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 SE +/- 46.74, N = 3 4737.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed GCC 10.2 300 600 900 1200 1500 SE +/- 2.43, N = 3 1425.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed GCC 10.2 900 1800 2700 3600 4500 SE +/- 6.53, N = 3 4251.7 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed GCC 10.2 12 24 36 48 60 SE +/- 0.20, N = 3 51.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 4617.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed GCC 10.2 200 400 600 800 1000 SE +/- 3.93, N = 3 1057.4 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 35.65, N = 6 13397.7 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 10.2 16 32 48 64 80 SE +/- 0.68, N = 6 71.13 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 48.22, N = 3 13400.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 10.2 16 32 48 64 80 SE +/- 0.86, N = 3 72.36 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 38.97, N = 3 13771.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 76.55, N = 3 12330.56 1. (CC) gcc options: -O3
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID GCC 10.2 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.02, N = 3 5.73 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets GCC 10.2 1.269 2.538 3.807 5.076 6.345 SE +/- 0.05, N = 3 5.64 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom GCC 10.2 0.2745 0.549 0.8235 1.098 1.3725 SE +/- 0.01, N = 3 1.22 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya GCC 10.2 0.837 1.674 2.511 3.348 4.185 SE +/- 0.03, N = 3 3.72 1. (CXX) g++ options: -O3 -march=native -pthread
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 10.2 7 14 21 28 35 SE +/- 0.08, N = 3 28.81 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression GCC 10.2 1.1795 2.359 3.5385 4.718 5.8975 SE +/- 0.018, N = 3 5.242 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 10.2 4 8 12 16 20 SE +/- 0.11, N = 3 13.99 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 GCC 10.2 0.3717 0.7434 1.1151 1.4868 1.8585 SE +/- 0.018, N = 4 1.652 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default GCC 10.2 0.2345 0.469 0.7035 0.938 1.1725 SE +/- 0.008, N = 3 1.042 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 10.2 20 40 60 80 100 SE +/- 0.08, N = 3 97.75 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 10.2 13 26 39 52 65 SE +/- 0.15, N = 3 59.87 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -lm -lreadline
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 GCC 10.2 50 100 150 200 250 SE +/- 1.65, N = 3 245.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 GCC 10.2 80 160 240 320 400 SE +/- 0.37, N = 3 386.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 GCC 10.2 300 600 900 1200 1500 SE +/- 2.21, N = 3 1546.30 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms GCC 10.2 120 240 360 480 600 SE +/- 3.29, N = 15 545.91 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 10.2 700 1400 2100 2800 3500 SE +/- 33.41, N = 5 3196.9 1. (CXX) g++ options: -O3 -march=native -rdynamic
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU GCC 10.2 1400 2800 4200 5600 7000 SE +/- 215.50, N = 12 6721 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU GCC 10.2 1.0052 2.0104 3.0156 4.0208 5.026 SE +/- 0.30276, N = 15 4.46777 MIN: 2.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
GNU Radio Test: Five Back to Back FIR Filters OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters GCC 10.2 200 400 600 800 1000 SE +/- 19.67, N = 9 920.8 1. 3.8.1.0
Phoronix Test Suite v10.8.5