TR 3990X GCC 11 Benchmarking
AMD Ryzen Threadripper 3990X 64-Core testing with a System76 Thelio Major (F4c Z5 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Pop 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103125-PTS-TR3990XG68&sro
Processor: AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)
Motherboard: System76 Thelio Major (F4c Z5 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: Samsung SSD 970 EVO Plus 500GB
Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (1750/875MHz)
Audio: AMD Navi 10 HDMI Audio
Monitor: DELL P2415Q
Network: Intel I211 + Intel Wi-Fi 6 AX200
OS: Pop 20.10
Kernel: 5.8.0-7630-generic (x86_64)
Desktop: GNOME Shell 3.38.2
Display Server: X Server 1.20.8
OpenGL: 4.6 Mesa 21.1.0-devel (git-96d7555 2021-01-22 groovy-oibaf-ppa) (LLVM 11.0.1)
Vulkan: 1.2.145
Compiler (GCC 10.2 run): GCC 10.2.0 + Clang 11.0.1-1~oibaf~g
Compiler (GCC 11.0.1 run): GCC 11.0.1 20210307 + Clang 11.0.1-1~oibaf~g
File-System: ext4
Screen Resolution: 3840x2160
Kernel Details: snd_usb_audio.ignore_ctl_error=1 - Transparent Huge Pages: madvise
Environment Details: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Details: --disable-multilib --enable-checking=release
Processor Details: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301025
Python Details: Python 3.8.6
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Test | GCC 10.2 | GCC 11.0.1
quantlib: | 2505.0 | -
lczero: BLAS | 1549 | -
lczero: Eigen | 1556 | -
toybrot: TBB | 6052 | -
toybrot: OpenMP | 6338 | -
toybrot: C++ Tasks | 6285 | -
toybrot: C++ Threads | 6081 | -
fftw: Stock - 2D FFT Size 4096 | 7853.1 | 8267.2
mrbayes: Primate Phylogeny Analysis | 95.914 | 92.056
openfoam: Motorbike 30M | 27.37 | 27.38
openfoam: Motorbike 60M | 441.90 | 442.63
simdjson: Kostya | 2.65 | -
simdjson: LargeRand | 0.91 | -
simdjson: PartialTweets | 4.31 | -
simdjson: DistinctUserID | 4.49 | -
compress-lz4: 1 - Compression Speed | 10427.81 | 10428.44
compress-lz4: 1 - Decompression Speed | 12031.6 | 12016.3
compress-lz4: 3 - Compression Speed | 56.59 | 54.81
compress-lz4: 3 - Decompression Speed | 11376.2 | 11429.7
compress-lz4: 9 - Compression Speed | 56.99 | 53.90
compress-lz4: 9 - Decompression Speed | 11382.9 | 11425.9
compress-zstd: 3 - Compression Speed | 5501.3 | 5539.4
compress-zstd: 8 - Compression Speed | 2460.4 | 2460.2
compress-zstd: 8 - Decompression Speed | 3653.6 | 3751.2
compress-zstd: 19 - Compression Speed | 86.3 | 86.7
compress-zstd: 19 - Decompression Speed | 3397.4 | 3491.3
compress-zstd: 3, Long Mode - Compression Speed | 599.2 | 591.2
compress-zstd: 3, Long Mode - Decompression Speed | 3786.5 | 3896.6
compress-zstd: 8, Long Mode - Compression Speed | 631.5 | 602.1
compress-zstd: 8, Long Mode - Decompression Speed | 3929.2 | 4019.7
compress-zstd: 19, Long Mode - Compression Speed | 44.3 | 44.8
compress-zstd: 19, Long Mode - Decompression Speed | 3434.8 | 3453.7
jpegxl: PNG - 5 | 79.56 | -
jpegxl: PNG - 7 | 9.80 | -
jpegxl: JPEG - 5 | 67.78 | -
jpegxl: JPEG - 7 | 67.84 | -
jpegxl-decode: 1 | 41.43 | -
jpegxl-decode: All | 164.75 | -
scimark2: Composite | 3211.78 | 3124.66
scimark2: Fast Fourier Transform | 249.03 | 252.57
scimark2: Sparse Matrix Multiply | 3495.05 | 3457.06
scimark2: Dense LU Matrix Factorization | 8213.68 | 8094.47
scimark2: Jacobi Successive Over-Relaxation | 2151.60 | 2251.68
libraw: Post-Processing Benchmark | 61.32 | 59.83
tscp: AI Chess Performance | 1340806 | 1345998
graphics-magick: Swirl | 2010 | 2016
graphics-magick: Rotate | 639 | 641
graphics-magick: Sharpen | 758 | 758
graphics-magick: Enhanced | 1014 | 1012
graphics-magick: Resizing | 1403 | 1404
graphics-magick: Noise-Gaussian | 813 | 790
graphics-magick: HWB Color Space | 1367 | 1348
x265: Bosphorus 4K | 25.03 | 24.00
x265: Bosphorus 1080p | 57.46 | 58.03
coremark: CoreMark Size 666 - Iterations Per Second | 2490073.059112 | 2500492.156623
compress-7zip: Compress Speed Test | 179376 | 180550
avifenc: 0 | 48.537 | 48.212
avifenc: 2 | 25.641 | 25.511
avifenc: 6 | 9.879 | 9.568
avifenc: 10 | 3.320 | 3.282
avifenc: 6, Lossless | 25.590 | 25.621
avifenc: 10, Lossless | 5.877 | 5.783
build-apache: Time To Compile | 21.304 | 21.529
build-godot: Time To Compile | 55.047 | 55.309
build-imagemagick: Time To Compile | 78.183 | 79.145
c-ray: Total Time - 4K, 16 Rays Per Pixel | 9.164 | 9.255
povray: Trace Time | 9.004 | -
aobench: 2048 x 2048 - Total Time | 28.038 | 30.215
build-eigen: Time To Compile | 77.464 | 88.807
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 | 20.084 | 20.368
encode-flac: WAV To FLAC | 8.466 | 8.473
encode-mp3: WAV To MP3 | 6.863 | 6.931
encode-opus: WAV To Opus Encode | 6.623 | 6.635
espeak: Text-To-Speech Synthesis | 28.808 | 28.990
gcrypt: | 201.805 | 206.363
ngspice: C2670 | 88.881 | 88.575
ngspice: C7552 | 76.562 | 78.112
tachyon: Total Time | 15.6495 | 15.0588
synthmark: VoiceMark_100 | 615.487 | -
liquid-dsp: 1 - 256 - 57 | 68181000 | 67124333
liquid-dsp: 2 - 256 - 57 | 136276667 | 135280000
liquid-dsp: 4 - 256 - 57 | 270570000 | 268723333
liquid-dsp: 8 - 256 - 57 | 526856667 | 522383333
liquid-dsp: 16 - 256 - 57 | 1013100000 | 1009633333
liquid-dsp: 32 - 256 - 57 | 1861233333 | 1865300000
liquid-dsp: 64 - 256 - 57 | 3131766667 | 3134500000
liquid-dsp: 128 - 256 - 57 | 3433533333 | 3469400000
financebench: Repo OpenMP | 34588.787760 | 33264.424479
financebench: Bonds OpenMP | 48718.346354 | 47567.308594
pgbench: 100 - 50 - Read Only | 813236 | 828618
pgbench: 100 - 50 - Read Only - Average Latency | 0.061 | 0.06
pgbench: 100 - 100 - Read Only | 1245297 | 1266789
pgbench: 100 - 100 - Read Only - Average Latency | 0.080 | 0.079
pgbench: 100 - 50 - Read Write | 8226 | 7892
pgbench: 100 - 50 - Read Write - Average Latency | 6.097 | 6.352
astcenc: Fast | 4.53 | -
astcenc: Medium | 5.25 | -
astcenc: Thorough | 9.40 | -
astcenc: Exhaustive | 36.43 | -
basis: ETC1S | 42.115 | -
basis: UASTC Level 0 | 6.551 | -
basis: UASTC Level 2 | 10.501 | -
basis: UASTC Level 3 | 14.759 | -
basis: UASTC Level 2 + RDO Post-Processing | 595.240 | -
sqlite-speedtest: Timed Time - Size 1,000 | 60.402 | 59.197
redis: SADD | 1914129.29 | 1905050.75
redis: LPUSH | 1422330.50 | 1447008.09
redis: GET | 2098930.04 | 2007263.79
redis: SET | 1642170.79 | 1667586.71
mnn: SqueezeNetV1.0 | 7.706 | -
mnn: resnet-v2-50 | 28.432 | -
mnn: MobileNetV2_224 | 5.655 | -
mnn: mobilenet-v1-1.0 | 3.967 | -
mnn: inception-v3 | 35.386 | -
tnn: CPU - MobileNet v2 | 289.206 | -
tnn: CPU - SqueezeNet v1.1 | 251.380 | -
onnx: yolov4 - OpenMP CPU | 230 | -
onnx: bertsquad-10 - OpenMP CPU | 415 | -
onnx: fcn-resnet101-11 - OpenMP CPU | 73 | -
onnx: shufflenet-v2-10 - OpenMP CPU | 8413 | -
onnx: super-resolution-10 - OpenMP CPU | 5178 | -
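When reading the two result columns side by side, a relative difference that accounts for the metric direction is often easier to compare than raw numbers. The following is a minimal sketch, not part of the Phoronix Test Suite: the helper name `percent_gain` and the `ROWS` list are illustrative assumptions, and the "more is better" / "fewer is better" flags are taken from the per-test sections of this result file.

```python
def percent_gain(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Return how much better the candidate is than the baseline, in percent.

    A positive value means the candidate improved on the baseline,
    regardless of whether the metric is "more is better" or "fewer is better".
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    # For time-like metrics a smaller value is better, so invert the ratio.
    ratio = candidate / baseline if higher_is_better else baseline / candidate
    return (ratio - 1.0) * 100.0


# (test, GCC 10.2 result, GCC 11.0.1 result, higher_is_better)
# Values copied from this result file; the list itself is only an illustration.
ROWS = [
    ("fftw: Stock - 2D FFT Size 4096", 7853.1, 8267.2, True),
    ("aobench: 2048 x 2048 - Total Time", 28.038, 30.215, False),
    ("build-eigen: Time To Compile", 77.464, 88.807, False),
]

if __name__ == "__main__":
    for name, gcc10, gcc11, higher_is_better in ROWS:
        gain = percent_gain(gcc10, gcc11, higher_is_better)
        print(f"{name}: GCC 11.0.1 vs GCC 10.2 = {gain:+.2f}%")
```

A negative value here means GCC 11.0.1 was slower than GCC 10.2 on that test.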
QuantLib 1.21 (MFLOPS, More Is Better)
  GCC 10.2: 2505.0 (SE +/- 0.38, N = 3)
  1. (CXX) g++ options: -O3 -march=native -rdynamic
LeelaChessZero 0.26 - Backend: BLAS (Nodes Per Second, More Is Better)
  GCC 10.2: 1549 (SE +/- 14.19, N = 3)
  1. (CXX) g++ options: -flto -O3 -march=native -pthread
LeelaChessZero 0.26 - Backend: Eigen (Nodes Per Second, More Is Better)
  GCC 10.2: 1556 (SE +/- 26.77, N = 3)
  1. (CXX) g++ options: -flto -O3 -march=native -pthread
toyBrot Fractal Generator 2020-11-18 - Implementation: TBB (ms, Fewer Is Better)
  GCC 10.2: 6052 (SE +/- 48.64, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator 2020-11-18 - Implementation: OpenMP (ms, Fewer Is Better)
  GCC 10.2: 6338 (SE +/- 16.01, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Tasks (ms, Fewer Is Better)
  GCC 10.2: 6285 (SE +/- 10.40, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Threads (ms, Fewer Is Better)
  GCC 10.2: 6081 (SE +/- 29.78, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -lpthread
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
  GCC 10.2: 7853.1 (SE +/- 69.69, N = 12)
  GCC 11.0.1: 8267.2 (SE +/- 107.36, N = 4)
  1. (CC) gcc options: -pthread -O3 -march=native -flto -lm
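Each result is reported with a standard error (SE) and a run count (N), which gives a rough way to judge whether a gap between the two compilers is larger than run-to-run noise. The sketch below is a heuristic only and is not something the Phoronix Test Suite reports: it assumes the two runs are independent and expresses the difference of the means in units of the combined standard error. The function name is hypothetical.

```python
import math


def difference_in_standard_errors(mean_a: float, se_a: float,
                                  mean_b: float, se_b: float) -> float:
    """Return (mean_b - mean_a) divided by the combined standard error.

    Assumes both means come from independent runs, so their standard
    errors combine as sqrt(se_a**2 + se_b**2).
    """
    combined_se = math.hypot(se_a, se_b)
    if combined_se == 0:
        raise ValueError("both standard errors are zero; the ratio is undefined")
    return (mean_b - mean_a) / combined_se


if __name__ == "__main__":
    # FFTW Stock, 2D FFT Size 4096: GCC 10.2 vs GCC 11.0.1 (values from this file).
    z = difference_in_standard_errors(7853.1, 69.69, 8267.2, 107.36)
    print(f"difference = {z:.2f} combined standard errors")
```

As a loose rule of thumb, a magnitude well above 2 suggests the gap is unlikely to be noise alone, although with N as small as 3 or 4 this should be read cautiously.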
Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  GCC 10.2: 95.91 (SE +/- 1.36, N = 3)
  GCC 11.0.1: 92.06 (SE +/- 0.79, N = 3)
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -flto -lm -lreadline
OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better)
  GCC 10.2: 27.37 (SE +/- 0.12, N = 3)
  GCC 11.0.1: 27.38 (SE +/- 0.03, N = 3)
  Additional linker flags listed with the results: -lfoamToVTK -lfileFormats -ldecompose -lmetisDecomp -lscotchDecomp -lregionModels
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenFOAM 8 - Input: Motorbike 60M (Seconds, Fewer Is Better)
  GCC 10.2: 441.90 (SE +/- 0.09, N = 3)
  GCC 11.0.1: 442.63 (SE +/- 0.26, N = 3)
  Additional linker flags listed with the results: -lfoamToVTK -lfileFormats -ldecompose -lmetisDecomp -lscotchDecomp -lregionModels
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm
simdjson 0.7.1 - Throughput Test: Kostya (GB/s, More Is Better)
  GCC 10.2: 2.65 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson 0.7.1 - Throughput Test: LargeRandom (GB/s, More Is Better)
  GCC 10.2: 0.91 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson 0.7.1 - Throughput Test: PartialTweets (GB/s, More Is Better)
  GCC 10.2: 4.31 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson 0.7.1 - Throughput Test: DistinctUserID (GB/s, More Is Better)
  GCC 10.2: 4.49 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -pthread
LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 10427.81 (SE +/- 32.58, N = 3)
  GCC 11.0.1: 10428.44 (SE +/- 23.26, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 12031.6 (SE +/- 25.74, N = 3)
  GCC 11.0.1: 12016.3 (SE +/- 24.40, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 56.59 (SE +/- 0.44, N = 3)
  GCC 11.0.1: 54.81 (SE +/- 0.62, N = 7)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 11376.2 (SE +/- 27.35, N = 3)
  GCC 11.0.1: 11429.7 (SE +/- 27.84, N = 7)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 56.99 (SE +/- 0.45, N = 3)
  GCC 11.0.1: 53.90 (SE +/- 0.48, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 11382.9 (SE +/- 37.20, N = 3)
  GCC 11.0.1: 11425.9 (SE +/- 27.71, N = 3)
  1. (CC) gcc options: -O3
Zstd Compression 1.4.9 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 5501.3 (SE +/- 17.22, N = 3)
  GCC 11.0.1: 5539.4 (SE +/- 1.94, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 8 - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 2460.4 (SE +/- 16.63, N = 3)
  GCC 11.0.1: 2460.2 (SE +/- 4.24, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 3653.6 (SE +/- 9.41, N = 3)
  GCC 11.0.1: 3751.2 (SE +/- 9.34, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 86.3 (SE +/- 0.61, N = 3)
  GCC 11.0.1: 86.7 (SE +/- 0.95, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 3397.4 (SE +/- 15.64, N = 3)
  GCC 11.0.1: 3491.3 (SE +/- 1.91, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 3, Long Mode - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 599.2 (SE +/- 3.12, N = 3)
  GCC 11.0.1: 591.2 (SE +/- 6.24, N = 8)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 3, Long Mode - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 3786.5 (SE +/- 10.15, N = 3)
  GCC 11.0.1: 3896.6 (SE +/- 3.65, N = 8)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 631.5 (SE +/- 8.94, N = 3)
  GCC 11.0.1: 602.1 (SE +/- 1.39, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 3929.2 (SE +/- 4.04, N = 3)
  GCC 11.0.1: 4019.7 (SE +/- 4.01, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  GCC 10.2: 44.3 (SE +/- 0.03, N = 3)
  GCC 11.0.1: 44.8 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
  GCC 10.2: 3434.8 (SE +/- 11.16, N = 3)
  GCC 11.0.1: 3453.7 (SE +/- 13.15, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
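To summarize a family of related results with a single number, a geometric mean of the per-test ratios is a common choice because it treats a 2x gain and a 2x loss symmetrically. The sketch below applies it to the five Zstd decompression results from this file; the dictionary layout and helper name are assumptions made for illustration, not an output of the Phoronix Test Suite.

```python
import math
from typing import Iterable


def geometric_mean(values: Iterable[float]) -> float:
    """Geometric mean of strictly positive numbers, computed via logarithms."""
    items = list(values)
    if not items or any(v <= 0 for v in items):
        raise ValueError("need at least one value, all strictly positive")
    return math.exp(sum(math.log(v) for v in items) / len(items))


# Zstd 1.4.9 decompression speed in MB/s (more is better): (GCC 10.2, GCC 11.0.1).
ZSTD_DECOMPRESSION = {
    "8": (3653.6, 3751.2),
    "19": (3397.4, 3491.3),
    "3, Long Mode": (3786.5, 3896.6),
    "8, Long Mode": (3929.2, 4019.7),
    "19, Long Mode": (3434.8, 3453.7),
}

# Ratio above 1.0 means GCC 11.0.1 was faster on that level.
RATIOS = [gcc11 / gcc10 for gcc10, gcc11 in ZSTD_DECOMPRESSION.values()]

if __name__ == "__main__":
    print(f"geometric mean ratio (GCC 11.0.1 / GCC 10.2): {geometric_mean(RATIOS):.4f}")
```

For "fewer is better" metrics the ratio would need to be inverted before averaging so that values above 1.0 still mean an improvement.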
JPEG XL 0.3.1 - Input: PNG - Encode Speed: 5 (MP/s, More Is Better)
  GCC 10.2: 79.56 (SE +/- 0.14, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
JPEG XL 0.3.1 - Input: PNG - Encode Speed: 7 (MP/s, More Is Better)
  GCC 10.2: 9.80 (SE +/- 0.25, N = 15)
  1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 5 (MP/s, More Is Better)
  GCC 10.2: 67.78 (SE +/- 0.33, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 7 (MP/s, More Is Better)
  GCC 10.2: 67.84 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
JPEG XL Decoding 0.3.1 - CPU Threads: 1 (MP/s, More Is Better)
  GCC 10.2: 41.43 (SE +/- 0.02, N = 3)
JPEG XL Decoding 0.3.1 - CPU Threads: All (MP/s, More Is Better)
  GCC 10.2: 164.75 (SE +/- 0.24, N = 3)
SciMark 2.0 - Computational Test: Composite (Mflops, More Is Better)
  GCC 10.2: 3211.78 (SE +/- 19.33, N = 3)
  GCC 11.0.1: 3124.66 (SE +/- 31.37, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark 2.0 - Computational Test: Fast Fourier Transform (Mflops, More Is Better)
  GCC 10.2: 249.03 (SE +/- 0.53, N = 3)
  GCC 11.0.1: 252.57 (SE +/- 0.73, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark 2.0 - Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
  GCC 10.2: 3495.05 (SE +/- 16.71, N = 3)
  GCC 11.0.1: 3457.06 (SE +/- 32.06, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark 2.0 - Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
  GCC 10.2: 8213.68 (SE +/- 84.17, N = 3)
  GCC 11.0.1: 8094.47 (SE +/- 126.25, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
  GCC 10.2: 2151.60 (SE +/- 7.85, N = 3)
  GCC 11.0.1: 2251.68 (SE +/- 1.81, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -lm
LibRaw 0.20 - Post-Processing Benchmark (Mpix/sec, More Is Better)
  GCC 10.2: 61.32 (SE +/- 0.25, N = 3)
  GCC 11.0.1: 59.83 (SE +/- 0.49, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ljpeg -lz -lm
TSCP 1.81 - AI Chess Performance (Nodes Per Second, More Is Better)
  GCC 10.2: 1340806 (SE +/- 4514.49, N = 5)
  GCC 11.0.1: 1345998 (SE +/- 1916.14, N = 5)
  1. (CC) gcc options: -O3 -march=native -flto
GraphicsMagick 1.3.33 - Operation: Swirl (Iterations Per Minute, More Is Better)
  GCC 10.2: 2010
  GCC 11.0.1: 2016
  Only one standard error is listed for this test: SE +/- 11.36, N = 3
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Rotate (Iterations Per Minute, More Is Better)
  GCC 10.2: 639 (SE +/- 1.00, N = 3)
  GCC 11.0.1: 641 (SE +/- 1.45, N = 3)
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Sharpen (Iterations Per Minute, More Is Better)
  GCC 10.2: 758 (SE +/- 3.53, N = 3)
  GCC 11.0.1: 758 (SE +/- 4.16, N = 3)
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Enhanced (Iterations Per Minute, More Is Better)
  GCC 10.2: 1014 (SE +/- 2.08, N = 3)
  GCC 11.0.1: 1012 (SE +/- 1.86, N = 3)
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Resizing (Iterations Per Minute, More Is Better)
  GCC 10.2: 1403 (SE +/- 12.14, N = 15)
  GCC 11.0.1: 1404 (SE +/- 18.34, N = 3)
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick 1.3.33 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
  GCC 10.2: 813 (SE +/- 0.58, N = 3)
  GCC 11.0.1: 790 (SE +/- 2.19, N = 3)
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick 1.3.33 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
  GCC 10.2: 1367 (SE +/- 8.67, N = 3)
  GCC 11.0.1: 1348 (SE +/- 10.37, N = 3)
  1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
x265 3.4 - Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
  GCC 10.2: 25.03 (SE +/- 0.09, N = 3)
  GCC 11.0.1: 24.00 (SE +/- 0.05, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
x265 3.4 - Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  GCC 10.2: 57.46 (SE +/- 0.14, N = 3)
  GCC 11.0.1: 58.03 (SE +/- 0.12, N = 3)
  1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
  GCC 10.2: 2490073.06 (SE +/- 10187.73, N = 3)
  GCC 11.0.1: 2500492.16 (SE +/- 8519.73, N = 3)
  1. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt
7-Zip Compression 16.02 - Compress Speed Test (MIPS, More Is Better)
  GCC 10.2: 179376 (SE +/- 375.42, N = 3)
  GCC 11.0.1: 180550 (SE +/- 284.33, N = 3)
  1. (CXX) g++ options: -pipe -lpthread
libavif avifenc 0.9.0 - Encoder Speed: 0 (Seconds, Fewer Is Better)
  GCC 10.2: 48.54 (SE +/- 0.20, N = 3)
  GCC 11.0.1: 48.21 (SE +/- 0.29, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0 - Encoder Speed: 2 (Seconds, Fewer Is Better)
  GCC 10.2: 25.64 (SE +/- 0.04, N = 3)
  GCC 11.0.1: 25.51 (SE +/- 0.14, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0 - Encoder Speed: 6 (Seconds, Fewer Is Better)
  GCC 10.2: 9.879 (SE +/- 0.038, N = 3)
  GCC 11.0.1: 9.568 (SE +/- 0.031, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0 - Encoder Speed: 10 (Seconds, Fewer Is Better)
  GCC 10.2: 3.320 (SE +/- 0.004, N = 3)
  GCC 11.0.1: 3.282 (SE +/- 0.004, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
  GCC 10.2: 25.59 (SE +/- 0.08, N = 3)
  GCC 11.0.1: 25.62 (SE +/- 0.05, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0 - Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
  GCC 10.2: 5.877 (SE +/- 0.013, N = 3)
  GCC 11.0.1: 5.783 (SE +/- 0.003, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
Timed Apache Compilation 2.4.41 - Time To Compile (Seconds, Fewer Is Better)
  GCC 10.2: 21.30 (SE +/- 0.00, N = 3)
  GCC 11.0.1: 21.53 (SE +/- 0.01, N = 3)
Timed Godot Game Engine Compilation 3.2.3 - Time To Compile (Seconds, Fewer Is Better)
  GCC 10.2: 55.05 (SE +/- 0.11, N = 3)
  GCC 11.0.1: 55.31 (SE +/- 0.14, N = 3)
Timed ImageMagick Compilation 6.9.0 - Time To Compile (Seconds, Fewer Is Better)
  GCC 10.2: 78.18 (SE +/- 0.03, N = 3)
  GCC 11.0.1: 79.15 (SE +/- 0.19, N = 3)
C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
  GCC 10.2: 9.164 (SE +/- 0.056, N = 3)
  GCC 11.0.1: 9.255 (SE +/- 0.066, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3 -march=native -flto
POV-Ray 3.7.0.7 - Trace Time (Seconds, Fewer Is Better)
  GCC 10.2: 9.004 (SE +/- 0.052, N = 3)
  1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
AOBench - Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
  GCC 10.2: 28.04 (SE +/- 0.10, N = 3)
  GCC 11.0.1: 30.22 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -lm -O3 -march=native -flto
Timed Eigen Compilation 3.3.9 - Time To Compile (Seconds, Fewer Is Better)
  GCC 10.2: 77.46 (SE +/- 0.10, N = 3)
  GCC 11.0.1: 88.81 (SE +/- 0.21, N = 3)
XZ Compression 5.2.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better)
  GCC 10.2: 20.08 (SE +/- 0.10, N = 3)
  GCC 11.0.1: 20.37 (SE +/- 0.04, N = 3)
  1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native -flto
FLAC Audio Encoding 1.3.2 - WAV To FLAC (Seconds, Fewer Is Better)
  GCC 10.2: 8.466 (SE +/- 0.014, N = 5)
  GCC 11.0.1: 8.473 (SE +/- 0.021, N = 5)
  1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
LAME MP3 Encoding 3.100 - WAV To MP3 (Seconds, Fewer Is Better)
  GCC 10.2: 6.863 (SE +/- 0.014, N = 3)
  GCC 11.0.1: 6.931 (SE +/- 0.021, N = 3)
  1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -flto -lncurses -lm
Opus Codec Encoding 1.3.1 - WAV To Opus Encode (Seconds, Fewer Is Better)
  GCC 10.2: 6.623 (SE +/- 0.014, N = 5)
  GCC 11.0.1: 6.635 (SE +/- 0.013, N = 5)
  1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis (Seconds, Fewer Is Better)
  GCC 10.2: 28.81 (SE +/- 0.05, N = 4)
  GCC 11.0.1: 28.99 (SE +/- 0.05, N = 4)
  1. (CC) gcc options: -O3 -march=native -flto -std=c99
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
  GCC 10.2: 201.81 (SE +/- 0.39, N = 3)
  GCC 11.0.1: 206.36 (SE +/- 0.60, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -fvisibility=hidden -lgpg-error
Ngspice 34 - Circuit: C2670 (Seconds, Fewer Is Better)
  GCC 10.2: 88.88 (SE +/- 0.23, N = 3)
  GCC 11.0.1: 88.58 (SE +/- 0.23, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice 34 - Circuit: C7552 (Seconds, Fewer Is Better)
  GCC 10.2: 76.56 (SE +/- 1.10, N = 4)
  GCC 11.0.1: 78.11 (SE +/- 0.33, N = 3)
  1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Tachyon 0.99b6 - Total Time (Seconds, Fewer Is Better)
  GCC 10.2: 15.65 (SE +/- 0.13, N = 3)
  GCC 11.0.1: 15.06 (SE +/- 0.12, N = 3)
  1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
  GCC 10.2: 615.49 (SE +/- 0.97, N = 3)
  1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Liquid-DSP 2021.01.31 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Threads  GCC 10.2                                   GCC 11.0.1
1        68181000   (SE +/- 246698.87, N = 3)       67124333   (SE +/- 640108.87, N = 9)
2        136276667  (SE +/- 283392.15, N = 3)       135280000  (SE +/- 255408.17, N = 3)
4        270570000  (SE +/- 492747.40, N = 3)       268723333  (SE +/- 361678.19, N = 3)
8        526856667  (SE +/- 4259226.59, N = 3)      522383333  (SE +/- 4136086.45, N = 3)
16       1013100000 (SE +/- 5391041.95, N = 3)      1009633333 (SE +/- 3681183.38, N = 3)
32       1861233333 (SE +/- 10038813.56, N = 3)     1865300000 (SE +/- 9960589.01, N = 3)
64       3131766667 (SE +/- 18565320.12, N = 3)     3134500000 (SE +/- 9971459.27, N = 3)
128      3433533333 (SE +/- 12549413.44, N = 3)     3469400000 (SE +/- 7629547.82, N = 3)
1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
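The Liquid-DSP figures above can be read as a thread-scaling curve. The following is a minimal sketch, not part of the original export, that compares each GCC 10.2 result with perfect linear scaling from the 1-thread result. The names `gcc10_samples_per_s` and `scaling_efficiency` are hypothetical, and "linear scaling from the single-thread run" is an assumed baseline that ignores boost-clock changes under load.

```python
# GCC 10.2 Liquid-DSP throughput (samples/s) by thread count,
# copied from the results above.
gcc10_samples_per_s = {
    1: 68181000,
    2: 136276667,
    4: 270570000,
    8: 526856667,
    16: 1013100000,
    32: 1861233333,
    64: 3131766667,
    128: 3433533333,
}


def scaling_efficiency(results):
    """Return measured throughput divided by (threads * 1-thread throughput)."""
    base = results[1]
    return {threads: value / (threads * base) for threads, value in results.items()}


efficiency = scaling_efficiency(gcc10_samples_per_s)
for threads, eff in efficiency.items():
    print(f"{threads:>3} threads: {eff:.1%} of linear scaling")
```

On this 64-core / 128-thread processor, the drop between 64 and 128 threads is the region where the extra threads are SMT siblings rather than additional physical cores.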
FinanceBench 2016-07-25
ms, Fewer Is Better
Benchmark     GCC 10.2                          GCC 11.0.1
Repo OpenMP   34588.79 (SE +/- 141.22, N = 3)   33264.42 (SE +/- 45.75, N = 3)
Bonds OpenMP  48718.35 (SE +/- 32.76, N = 3)    47567.31 (SE +/- 167.36, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp
PostgreSQL pgbench 13.0 - Scaling Factor: 100
TPS, More Is Better
Clients  Mode        GCC 10.2                         GCC 11.0.1
50       Read Only   813236  (SE +/- 1171.97, N = 3)  828618  (SE +/- 588.21, N = 3)
100      Read Only   1245297 (SE +/- 1515.50, N = 3)  1266789 (SE +/- 8011.04, N = 3)
50       Read Write  8226    (SE +/- 116.64, N = 15)  7892    (SE +/- 101.14, N = 15)
Average Latency, ms, Fewer Is Better
Clients  Mode        GCC 10.2                         GCC 11.0.1
50       Read Only   0.061 (SE +/- 0.000, N = 3)      0.060 (SE +/- 0.000, N = 3)
100      Read Only   0.080 (SE +/- 0.000, N = 3)      0.079 (SE +/- 0.001, N = 3)
50       Read Write  6.097 (SE +/- 0.086, N = 15)     6.352 (SE +/- 0.077, N = 15)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
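The pgbench latency and TPS figures above are closely tied together. As a rough sketch, assuming each client keeps exactly one transaction in flight, average latency in milliseconds is approximately clients / TPS * 1000. The code below checks that relation against the reported values; `pgbench_results` and `latency_from_tps` are hypothetical names, and the small residuals are expected because the export averages TPS and latency separately across runs.

```python
# (clients, mode, compiler) -> (reported TPS, reported average latency in ms),
# copied from the pgbench results above.
pgbench_results = {
    (50, "Read Only", "GCC 10.2"): (813236, 0.061),
    (50, "Read Only", "GCC 11.0.1"): (828618, 0.060),
    (100, "Read Only", "GCC 10.2"): (1245297, 0.080),
    (100, "Read Only", "GCC 11.0.1"): (1266789, 0.079),
    (50, "Read Write", "GCC 10.2"): (8226, 6.097),
    (50, "Read Write", "GCC 11.0.1"): (7892, 6.352),
}


def latency_from_tps(clients, tps):
    """Estimate average latency (ms), assuming one in-flight transaction per client."""
    return clients / tps * 1000.0


derived = {}
for (clients, mode, compiler), (tps, reported_ms) in pgbench_results.items():
    estimate_ms = latency_from_tps(clients, tps)
    derived[(clients, mode, compiler)] = estimate_ms
    print(f"{clients:>3} clients, {mode:<10} {compiler:<10} "
          f"derived {estimate_ms:.3f} ms, reported {reported_ms:.3f} ms")
```

Because the two metrics move together, the latency rows carry essentially the same information as the TPS rows and should not be counted as independent evidence for either compiler.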
ASTC Encoder 2.0
Seconds, Fewer Is Better
Preset      GCC 10.2
Fast        4.53  (SE +/- 0.01, N = 3)
Medium      5.25  (SE +/- 0.00, N = 3)
Thorough    9.40  (SE +/- 0.02, N = 3)
Exhaustive  36.43 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Basis Universal 1.12
Seconds, Fewer Is Better
Settings                              GCC 10.2
ETC1S                                 42.12  (SE +/- 0.13, N = 3)
UASTC Level 0                         6.551  (SE +/- 0.032, N = 3)
UASTC Level 2                         10.50  (SE +/- 0.03, N = 3)
UASTC Level 3                         14.76  (SE +/- 0.01, N = 3)
UASTC Level 2 + RDO Post-Processing   595.24 (SE +/- 0.50, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
SQLite Speedtest 3.30
Seconds, Fewer Is Better
Test                      GCC 10.2                      GCC 11.0.1
Timed Time - Size 1,000   60.40 (SE +/- 0.25, N = 3)    59.20 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -O3 -march=native -flto -ldl -lz -lpthread
Redis 6.0.9
Requests Per Second, More Is Better
Test   GCC 10.2                               GCC 11.0.1
SADD   1914129.29 (SE +/- 18458.26, N = 3)    1905050.75 (SE +/- 24998.90, N = 3)
LPUSH  1422330.50 (SE +/- 15117.80, N = 3)    1447008.09 (SE +/- 24877.75, N = 3)
GET    2098930.04 (SE +/- 24394.75, N = 3)    2007263.79 (SE +/- 20810.45, N = 3)
SET    1642170.79 (SE +/- 5296.10, N = 3)     1667586.71 (SE +/- 8022.37, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
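Several of the Redis differences above are small relative to their standard errors. One informal way to screen them, sketched here as an assumption rather than anything the Phoronix Test Suite itself reports, is to divide the difference in means by the combined standard error sqrt(SE1^2 + SE2^2). With N = 3 runs per compiler this is only a rough indicator, not a formal significance test; `redis_results` and `difference_in_se_units` are hypothetical names.

```python
import math

# test -> ((GCC 10.2 mean, SE), (GCC 11.0.1 mean, SE)) in requests per second,
# copied from the Redis results above.
redis_results = {
    "SADD": ((1914129.29, 18458.26), (1905050.75, 24998.90)),
    "LPUSH": ((1422330.50, 15117.80), (1447008.09, 24877.75)),
    "GET": ((2098930.04, 24394.75), (2007263.79, 20810.45)),
    "SET": ((1642170.79, 5296.10), (1667586.71, 8022.37)),
}


def difference_in_se_units(old, new):
    """Return (new mean - old mean) divided by the combined standard error."""
    (old_mean, old_se), (new_mean, new_se) = old, new
    return (new_mean - old_mean) / math.hypot(old_se, new_se)


ratios = {test: difference_in_se_units(old, new)
          for test, (old, new) in redis_results.items()}
for test, ratio in ratios.items():
    percent = (redis_results[test][1][0] / redis_results[test][0][0] - 1) * 100
    print(f"{test:<5} change {percent:+.2f}%, {ratio:+.2f} combined SEs")
```

Under this screen, the GET and SET differences exceed two combined standard errors, while the SADD and LPUSH differences fall within one.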
Mobile Neural Network 1.1.1
ms, Fewer Is Better
Model             GCC 10.2
SqueezeNetV1.0    7.706 (SE +/- 0.057, N = 3)  MIN: 7.49 / MAX: 9.85
resnet-v2-50      28.43 (SE +/- 0.43, N = 3)   MIN: 27.09 / MAX: 31.41
MobileNetV2_224   5.655 (SE +/- 0.023, N = 3)  MIN: 5.38 / MAX: 6.05
mobilenet-v1-1.0  3.967 (SE +/- 0.014, N = 3)  MIN: 3.62 / MAX: 4.65
inception-v3      35.39 (SE +/- 0.19, N = 3)   MIN: 34.32 / MAX: 39.11
1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN 0.2.3 - Target: CPU
ms, Fewer Is Better
Model            GCC 10.2
MobileNet v2     289.21 (SE +/- 0.78, N = 3)  MIN: 272.83 / MAX: 331
SqueezeNet v1.1  251.38 (SE +/- 0.16, N = 3)  MIN: 250.21 / MAX: 252.69
1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
ONNX Runtime 1.6 - Device: OpenMP CPU
Inferences Per Minute, More Is Better
Model                GCC 10.2
yolov4               230  (SE +/- 1.76, N = 3)
bertsquad-10         415  (SE +/- 5.22, N = 12)
fcn-resnet101-11     73   (SE +/- 0.17, N = 3)
shufflenet-v2-10     8413 (SE +/- 183.41, N = 10)
super-resolution-10  5178 (SE +/- 30.91, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
Phoronix Test Suite v10.8.4