Ryzen 9 5950X Compiler Testing AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS) and AMD NAVY_FLOUNDER 12GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103149-PTS-RYZEN95931&grr .
Ryzen 9 5950X Compiler Testing Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 10.2 AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS) AMD Starship/Matisse 32GB 2000GB Corsair Force MP600 + 2000GB AMD NAVY_FLOUNDER 12GB (2855/1000MHz) AMD Device ab28 ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.6-051106-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.9 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.168 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - Python 3.8.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Ryzen 9 5950X Compiler Testing gnuradio: Hilbert Transform gnuradio: FM Deemphasis Filter gnuradio: IIR Filter gnuradio: FIR Filter gnuradio: Signal Source (Cosine) gnuradio: Five Back to Back FIR Filters onnx: super-resolution-10 - OpenMP CPU build-llvm: Time To Compile webp2: Quality 100, Lossless Compression cryptopp: Unkeyed Algorithms ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet jpegxl: PNG - 8 webp2: Quality 95, Compression Effort 7 gcrypt: onnx: fcn-resnet101-11 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: yolov4 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU webp2: Quality 75, Compression Effort 7 onednn: Deconvolution Batch shapes_1d - f32 - CPU openfoam: Motorbike 30M compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed jpegxl: PNG - 7 sysbench: CPU onednn: Recurrent Neural Network Training - f32 - CPU build-godot: Time To Compile onednn: Recurrent Neural Network Inference - f32 - CPU mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 redis: LPUSH ngspice: C2670 redis: SET aom-av1: Speed 4 Two-Pass redis: SADD redis: GET simdjson: DistinctUserID simdjson: PartialTweets ngspice: C7552 graphics-magick: Enhanced graphics-magick: Sharpen graphics-magick: Noise-Gaussian graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Resizing graphics-magick: HWB Color Space mrbayes: Primate Phylogeny Analysis astcenc: Exhaustive simdjson: Kostya aom-av1: Speed 0 Two-Pass simdjson: LargeRand compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed jpegxl-decode: 1 compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed tachyon: Total Time avifenc: 0 sqlite-speedtest: Timed Time - Size 1,000 compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed jpegxl-decode: All quantlib: compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed liquid-dsp: 1 - 256 - 57 compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed avifenc: 6, Lossless synthmark: VoiceMark_100 webp: Quality 100, Lossless, Highest Compression basis: UASTC Level 3 povray: Trace Time aom-av1: Speed 6 Two-Pass c-ray: Total Time - 4K, 16 Rays Per Pixel avifenc: 2 x265: Bosphorus 4K liquid-dsp: 32 - 256 - 57 liquid-dsp: 16 - 256 - 57 basis: ETC1S etcpak: ETC2 jpegxl: PNG - 5 aom-av1: Speed 6 Realtime crafty: Elapsed Time encode-wavpack: WAV To WavPack basis: UASTC Level 2 libraw: Post-Processing Benchmark jpegxl: JPEG - 5 tnn: CPU - MobileNet v2 onednn: IP Shapes 1D - f32 - CPU tnn: CPU - SqueezeNet v1.1 dav1d: Summer Nature 4K svt-av1: Enc Mode 4 - 1080p rnnoise: webp: Quality 100, Lossless encode-ogg: WAV To Ogg redis: LPOP etcpak: ETC1 jpegxl: JPEG - 7 onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p astcenc: Thorough onednn: IP Shapes 3D - f32 - CPU encode-opus: WAV To Opus Encode avifenc: 6 x264: H.264 Video Encoding jpegxl: JPEG - 8 svt-av1: Enc Mode 8 - 1080p x265: Bosphorus 1080p webp2: Quality 100, Compression Effort 5 onednn: Convolution Batch Shapes Auto - f32 - CPU webp: Quality 100, Highest Compression basis: UASTC Level 0 aom-av1: Speed 8 Realtime avifenc: 10, Lossless smallpt: Global Illumination Renderer; 128 Samples astcenc: Medium dav1d: Summer Nature 1080p etcpak: DXT1 onednn: Deconvolution Batch shapes_3d - f32 - CPU avifenc: 10 svt-vp9: Visual Quality Optimized - Bosphorus 1080p webp2: Default webp: Quality 100 tscp: AI Chess Performance webp: Default GCC 10.2 515.8 1055.0 843.1 1063.5 4715.4 920.8 6721 370.571 367.371 545.914609 17.61 13.77 20.77 25.67 10.82 14.11 57.89 12.76 1.83 5.32 3.93 4.23 3.85 4.43 12.42 1.14 203.811 171.186 99 614 433 15049 111.802 4.46777 97.75 13397.7 71.13 11.20 91743.72 2757.52 79.523 1773.67 32.344 2.351 3.240 25.065 5.081 2222217.52 71.603 2640316.17 9.20 3041527.37 3470419.90 5.73 5.64 62.816 439 375 454 1166 1056 2165 1115 59.869 52.9268 3.72 0.37 1.22 4350.9 36.6 56.53 13400.1 72.36 44.3941 43.615 42.599 4251.7 51.6 210.99 3196.9 4886.2 1122.6 4737.1 1425.9 4617.1 1057.4 81844000 13771.1 12330.56 30.977 966.298 28.813 28.126 24.093 29.43 25.089 23.538 27.83 1164966667 1111200000 19.896 245.041 74.12 35.13 11731249 10.149 15.902 78.66 87.35 216.281 3.95979 211.567 243.69 6.137 14.197 13.990 13.578 3549910.50 386.561 87.07 0.638664 235.04 6.9922 9.25967 5.484 8.927 208.93 38.13 51.774 89.80 6.414 17.2905 5.242 5.157 121.13 4.875 4.674 4.0524 971.79 1546.299 3.55467 2.934 228.96 2.274 1.652 1965773 1.042 OpenBenchmarking.org
GNU Radio Test: Hilbert Transform OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform GCC 10.2 110 220 330 440 550 SE +/- 0.63, N = 9 515.8 1. 3.8.1.0
GNU Radio Test: FM Deemphasis Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter GCC 10.2 200 400 600 800 1000 SE +/- 0.78, N = 9 1055.0 1. 3.8.1.0
GNU Radio Test: IIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter GCC 10.2 200 400 600 800 1000 SE +/- 1.32, N = 9 843.1 1. 3.8.1.0
GNU Radio Test: FIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter GCC 10.2 200 400 600 800 1000 SE +/- 2.09, N = 9 1063.5 1. 3.8.1.0
GNU Radio Test: Signal Source (Cosine) OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) GCC 10.2 1000 2000 3000 4000 5000 SE +/- 16.39, N = 9 4715.4 1. 3.8.1.0
GNU Radio Test: Five Back to Back FIR Filters OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters GCC 10.2 200 400 600 800 1000 SE +/- 19.67, N = 9 920.8 1. 3.8.1.0
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU GCC 10.2 1400 2800 4200 5600 7000 SE +/- 215.50, N = 12 6721 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
Timed LLVM Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 10.0 Time To Compile GCC 10.2 80 160 240 320 400 SE +/- 2.79, N = 3 370.57
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression GCC 10.2 80 160 240 320 400 SE +/- 0.42, N = 3 367.37 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms GCC 10.2 120 240 360 480 600 SE +/- 3.29, N = 15 545.91 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m GCC 10.2 4 8 12 16 20 SE +/- 0.06, N = 15 17.61 MIN: 16.94 / MAX: 25.97 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd GCC 10.2 4 8 12 16 20 SE +/- 0.06, N = 15 13.77 MIN: 13.25 / MAX: 23.45 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny GCC 10.2 5 10 15 20 25 SE +/- 0.17, N = 15 20.77 MIN: 19.69 / MAX: 43.19 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 GCC 10.2 6 12 18 24 30 SE +/- 0.21, N = 15 25.67 MIN: 24.52 / MAX: 35.96 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet GCC 10.2 3 6 9 12 15 SE +/- 0.09, N = 15 10.82 MIN: 10.41 / MAX: 17.59 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 GCC 10.2 4 8 12 16 20 SE +/- 0.05, N = 15 14.11 MIN: 13.84 / MAX: 23.15 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 GCC 10.2 13 26 39 52 65 SE +/- 0.12, N = 15 57.89 MIN: 55.89 / MAX: 80.86 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet GCC 10.2 3 6 9 12 15 SE +/- 0.06, N = 15 12.76 MIN: 12.19 / MAX: 19.36 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface GCC 10.2 0.4118 0.8236 1.2354 1.6472 2.059 SE +/- 0.01, N = 15 1.83 MIN: 1.77 / MAX: 3.9 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 GCC 10.2 1.197 2.394 3.591 4.788 5.985 SE +/- 0.02, N = 15 5.32 MIN: 5.15 / MAX: 13.83 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet GCC 10.2 0.8843 1.7686 2.6529 3.5372 4.4215 SE +/- 0.02, N = 15 3.93 MIN: 3.71 / MAX: 6.06 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 GCC 10.2 0.9518 1.9036 2.8554 3.8072 4.759 SE +/- 0.01, N = 15 4.23 MIN: 4.15 / MAX: 9.05 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 GCC 10.2 0.8663 1.7326 2.5989 3.4652 4.3315 SE +/- 0.02, N = 15 3.85 MIN: 3.74 / MAX: 10.85 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 10.2 0.9968 1.9936 2.9904 3.9872 4.984 SE +/- 0.01, N = 15 4.43 MIN: 4.19 / MAX: 11.09 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet GCC 10.2 3 6 9 12 15 SE +/- 0.16, N = 15 12.42 MIN: 11.7 / MAX: 20.08 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 8 GCC 10.2 0.2565 0.513 0.7695 1.026 1.2825 SE +/- 0.00, N = 3 1.14 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 GCC 10.2 40 80 120 160 200 SE +/- 0.04, N = 3 203.81 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 GCC 10.2 40 80 120 160 200 SE +/- 0.29, N = 3 171.19 1. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU GCC 10.2 20 40 60 80 100 SE +/- 0.17, N = 3 99 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU GCC 10.2 130 260 390 520 650 SE +/- 6.71, N = 3 614 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU GCC 10.2 90 180 270 360 450 SE +/- 1.96, N = 3 433 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU GCC 10.2 3K 6K 9K 12K 15K SE +/- 134.84, N = 3 15049 1. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 GCC 10.2 30 60 90 120 150 SE +/- 1.06, N = 3 111.80 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU GCC 10.2 1.0052 2.0104 3.0156 4.0208 5.026 SE +/- 0.30276, N = 15 4.46777 MIN: 2.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 10.2 20 40 60 80 100 SE +/- 0.08, N = 3 97.75 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 35.65, N = 6 13397.7 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 10.2 16 32 48 64 80 SE +/- 0.68, N = 6 71.13 1. (CC) gcc options: -O3
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 GCC 10.2 3 6 9 12 15 SE +/- 0.03, N = 3 11.20 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU GCC 10.2 20K 40K 60K 80K 100K SE +/- 115.96, N = 3 91743.72 1. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU GCC 10.2 600 1200 1800 2400 3000 SE +/- 2.01, N = 3 2757.52 MIN: 2719.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 79.52
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU GCC 10.2 400 800 1200 1600 2000 SE +/- 5.00, N = 3 1773.67 MIN: 1750.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 GCC 10.2 8 16 24 32 40 SE +/- 0.09, N = 3 32.34 MIN: 31.33 / MAX: 42.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 GCC 10.2 0.529 1.058 1.587 2.116 2.645 SE +/- 0.027, N = 3 2.351 MIN: 2.27 / MAX: 7.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 GCC 10.2 0.729 1.458 2.187 2.916 3.645 SE +/- 0.049, N = 3 3.240 MIN: 3.12 / MAX: 11.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 GCC 10.2 6 12 18 24 30 SE +/- 0.02, N = 3 25.07 MIN: 23.97 / MAX: 39.95 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 GCC 10.2 1.1432 2.2864 3.4296 4.5728 5.716 SE +/- 0.010, N = 3 5.081 MIN: 4.92 / MAX: 14.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH GCC 10.2 500K 1000K 1500K 2000K 2500K SE +/- 23396.73, N = 15 2222217.52 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 10.2 16 32 48 64 80 SE +/- 0.21, N = 3 71.60 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 10.2 600K 1200K 1800K 2400K 3000K SE +/- 26145.63, N = 15 2640316.17 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
AOM AV1 Encoder Mode: Speed 4 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 4 Two-Pass GCC 10.2 3 6 9 12 15 SE +/- 0.02, N = 3 9.20 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD GCC 10.2 700K 1400K 2100K 2800K 3500K SE +/- 39730.96, N = 15 3041527.37 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 10.2 700K 1400K 2100K 2800K 3500K SE +/- 36718.95, N = 15 3470419.90 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID GCC 10.2 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.02, N = 3 5.73 1. (CXX) g++ options: -O3 -march=native -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets GCC 10.2 1.269 2.538 3.807 5.076 6.345 SE +/- 0.05, N = 3 5.64 1. (CXX) g++ options: -O3 -march=native -pthread
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 10.2 14 28 42 56 70 SE +/- 0.15, N = 3 62.82 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 10.2 100 200 300 400 500 SE +/- 0.33, N = 3 439 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 10.2 80 160 240 320 400 SE +/- 1.00, N = 3 375 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 10.2 100 200 300 400 500 SE +/- 0.67, N = 3 454 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 10.2 300 600 900 1200 1500 SE +/- 3.67, N = 3 1166 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 10.2 200 400 600 800 1000 SE +/- 3.51, N = 3 1056 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 10.2 500 1000 1500 2000 2500 SE +/- 1.45, N = 3 2165 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 10.2 200 400 600 800 1000 SE +/- 1.33, N = 3 1115 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 10.2 13 26 39 52 65 SE +/- 0.15, N = 3 59.87 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -lm -lreadline
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive GCC 10.2 12 24 36 48 60 SE +/- 0.09, N = 3 52.93 1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya GCC 10.2 0.837 1.674 2.511 3.348 4.185 SE +/- 0.03, N = 3 3.72 1. (CXX) g++ options: -O3 -march=native -pthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 0 Two-Pass GCC 10.2 0.0833 0.1666 0.2499 0.3332 0.4165 SE +/- 0.00, N = 3 0.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom GCC 10.2 0.2745 0.549 0.8235 1.098 1.3725 SE +/- 0.01, N = 3 1.22 1. (CXX) g++ options: -O3 -march=native -pthread
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed GCC 10.2 900 1800 2700 3600 4500 SE +/- 72.38, N = 3 4350.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed GCC 10.2 8 16 24 32 40 SE +/- 0.03, N = 3 36.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 GCC 10.2 13 26 39 52 65 SE +/- 0.05, N = 3 56.53
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 48.22, N = 3 13400.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 10.2 16 32 48 64 80 SE +/- 0.86, N = 3 72.36 1. (CC) gcc options: -O3
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 44.39 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 0 GCC 10.2 10 20 30 40 50 SE +/- 0.21, N = 3 43.62 1. (CXX) g++ options: -O3 -fPIC -lm
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 42.60 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed GCC 10.2 900 1800 2700 3600 4500 SE +/- 6.53, N = 3 4251.7 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed GCC 10.2 12 24 36 48 60 SE +/- 0.20, N = 3 51.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: All GCC 10.2 50 100 150 200 250 SE +/- 0.29, N = 3 210.99
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 10.2 700 1400 2100 2800 3500 SE +/- 33.41, N = 5 3196.9 1. (CXX) g++ options: -O3 -march=native -rdynamic
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 SE +/- 29.99, N = 3 4886.2 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed GCC 10.2 200 400 600 800 1000 SE +/- 2.15, N = 3 1122.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 SE +/- 46.74, N = 3 4737.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed GCC 10.2 300 600 900 1200 1500 SE +/- 2.43, N = 3 1425.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed GCC 10.2 1000 2000 3000 4000 5000 4617.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed GCC 10.2 200 400 600 800 1000 SE +/- 3.93, N = 3 1057.4 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 20M 40M 60M 80M 100M SE +/- 828458.69, N = 5 81844000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 38.97, N = 3 13771.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed GCC 10.2 3K 6K 9K 12K 15K SE +/- 76.55, N = 3 12330.56 1. (CC) gcc options: -O3
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 10.2 7 14 21 28 35 SE +/- 0.06, N = 3 30.98 1. (CXX) g++ options: -O3 -fPIC -lm
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 10.2 200 400 600 800 1000 SE +/- 1.26, N = 3 966.30 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 10.2 7 14 21 28 35 SE +/- 0.08, N = 3 28.81 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 GCC 10.2 7 14 21 28 35 SE +/- 0.04, N = 3 28.13 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 10.2 6 12 18 24 30 SE +/- 0.09, N = 3 24.09 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Two-Pass GCC 10.2 7 14 21 28 35 SE +/- 0.26, N = 3 29.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 10.2 6 12 18 24 30 SE +/- 0.07, N = 3 25.09 1. (CC) gcc options: -lm -lpthread -O3 -march=native
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 GCC 10.2 6 12 18 24 30 SE +/- 0.02, N = 3 23.54 1. (CXX) g++ options: -O3 -fPIC -lm
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 10.2 7 14 21 28 35 SE +/- 0.16, N = 3 27.83 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 200M 400M 600M 800M 1000M SE +/- 497772.82, N = 3 1164966667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 200M 400M 600M 800M 1000M SE +/- 5768882.04, N = 3 1111200000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S GCC 10.2 5 10 15 20 25 SE +/- 0.04, N = 3 19.90 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 GCC 10.2 50 100 150 200 250 SE +/- 1.65, N = 3 245.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 GCC 10.2 16 32 48 64 80 SE +/- 0.03, N = 3 74.12 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
AOM AV1 Encoder Mode: Speed 6 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Realtime GCC 10.2 8 16 24 32 40 SE +/- 0.16, N = 3 35.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 10.2 3M 6M 9M 12M 15M SE +/- 26371.45, N = 3 11731249 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack GCC 10.2 3 6 9 12 15 SE +/- 0.10, N = 5 10.15 1. (CXX) g++ options: -O3 -march=native -rdynamic
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 GCC 10.2 4 8 12 16 20 SE +/- 0.05, N = 3 15.90 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark GCC 10.2 20 40 60 80 100 SE +/- 0.16, N = 3 78.66 1. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 GCC 10.2 20 40 60 80 100 SE +/- 0.14, N = 3 87.35 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 GCC 10.2 50 100 150 200 250 SE +/- 0.56, N = 3 216.28 MIN: 215.1 / MAX: 218.26 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU GCC 10.2 0.891 1.782 2.673 3.564 4.455 SE +/- 0.00506, N = 3 3.95979 MIN: 3.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 GCC 10.2 50 100 150 200 250 SE +/- 0.57, N = 3 211.57 MIN: 206.88 / MAX: 212.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K GCC 10.2 50 100 150 200 250 SE +/- 0.47, N = 3 243.69 MIN: 181.29 / MAX: 252.3 1. (CC) gcc options: -O3 -march=native -pthread -lm
SVT-AV1 Encoder Mode: Enc Mode 4 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p GCC 10.2 2 4 6 8 10 SE +/- 0.014, N = 3 6.137 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 GCC 10.2 4 8 12 16 20 SE +/- 0.04, N = 3 14.20 1. (CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 10.2 4 8 12 16 20 SE +/- 0.11, N = 3 13.99 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
Ogg Audio Encoding WAV To Ogg OpenBenchmarking.org Seconds, Fewer Is Better Ogg Audio Encoding 1.3.4 WAV To Ogg GCC 10.2 3 6 9 12 15 SE +/- 0.04, N = 3 13.58 1. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP GCC 10.2 800K 1600K 2400K 3200K 4000K SE +/- 26197.04, N = 3 3549910.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 GCC 10.2 80 160 240 320 400 SE +/- 0.37, N = 3 386.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 87.07 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU GCC 10.2 0.1437 0.2874 0.4311 0.5748 0.7185 SE +/- 0.000722, N = 3 0.638664 MIN: 0.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 10.2 50 100 150 200 250 SE +/- 2.40, N = 12 235.04 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough GCC 10.2 2 4 6 8 10 SE +/- 0.0057, N = 3 6.9922 1. (CXX) g++ options: -O3 -march=native -flto -pthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU GCC 10.2 3 6 9 12 15 SE +/- 0.01340, N = 3 9.25967 MIN: 9.1 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode GCC 10.2 1.2339 2.4678 3.7017 4.9356 6.1695 SE +/- 0.031, N = 5 5.484 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 GCC 10.2 2 4 6 8 10 SE +/- 0.048, N = 3 8.927 1. (CXX) g++ options: -O3 -fPIC -lm
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding GCC 10.2 50 100 150 200 250 SE +/- 1.66, N = 9 208.93 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 8 GCC 10.2 9 18 27 36 45 SE +/- 0.02, N = 3 38.13 1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl
SVT-AV1 Encoder Mode: Enc Mode 8 - Input: 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p GCC 10.2 12 24 36 48 60 SE +/- 0.24, N = 3 51.77 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 10.2 20 40 60 80 100 SE +/- 0.19, N = 3 89.80 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 GCC 10.2 2 4 6 8 10 SE +/- 0.011, N = 3 6.414 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU GCC 10.2 4 8 12 16 20 SE +/- 0.09, N = 3 17.29 MIN: 16.58 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression GCC 10.2 1.1795 2.359 3.5385 4.718 5.8975 SE +/- 0.018, N = 3 5.242 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 GCC 10.2 1.1603 2.3206 3.4809 4.6412 5.8015 SE +/- 0.023, N = 3 5.157 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 8 Realtime GCC 10.2 30 60 90 120 150 SE +/- 0.75, N = 3 121.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless GCC 10.2 1.0969 2.1938 3.2907 4.3876 5.4845 SE +/- 0.022, N = 3 4.875 1. (CXX) g++ options: -O3 -fPIC -lm
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 10.2 1.0517 2.1034 3.1551 4.2068 5.2585 SE +/- 0.015, N = 3 4.674 1. (CXX) g++ options: -fopenmp -O3 -march=native
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium GCC 10.2 0.9118 1.8236 2.7354 3.6472 4.559 SE +/- 0.0178, N = 3 4.0524 1. (CXX) g++ options: -O3 -march=native -flto -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p GCC 10.2 200 400 600 800 1000 SE +/- 1.38, N = 3 971.79 MIN: 732.02 / MAX: 1055.82 1. (CC) gcc options: -O3 -march=native -pthread -lm
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 GCC 10.2 300 600 900 1200 1500 SE +/- 2.21, N = 3 1546.30 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU GCC 10.2 0.7998 1.5996 2.3994 3.1992 3.999 SE +/- 0.00753, N = 3 3.55467 MIN: 3.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 10.2 0.6602 1.3204 1.9806 2.6408 3.301 SE +/- 0.014, N = 3 2.934 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p GCC 10.2 50 100 150 200 250 SE +/- 0.68, N = 3 228.96 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default GCC 10.2 0.5117 1.0234 1.5351 2.0468 2.5585 SE +/- 0.005, N = 3 2.274 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 GCC 10.2 0.3717 0.7434 1.1151 1.4868 1.8585 SE +/- 0.018, N = 4 1.652 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 10.2 400K 800K 1200K 1600K 2000K SE +/- 7442.75, N = 5 1965773 1. (CC) gcc options: -O3 -march=native
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default GCC 10.2 0.2345 0.469 0.7035 0.938 1.1725 SE +/- 0.008, N = 3 1.042 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
Phoronix Test Suite v10.8.5