TR 3990X GCC 11 Benchmarking AMD Ryzen Threadripper 3990X 64-Core testing with a System76 Thelio Major (F4c Z5 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Pop 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103125-PTS-TR3990XG68&rdt&grs .
TR 3990X GCC 11 Benchmarking Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 10.2 GCC 11.0.1 AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads) System76 Thelio Major (F4c Z5 BIOS) AMD Starship/Matisse 126GB Samsung SSD 970 EVO Plus 500GB AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (1750/875MHz) AMD Navi 10 HDMI Audio DELL P2415Q Intel I211 + Intel Wi-Fi 6 AX200 Pop 20.10 5.8.0-7630-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.8 4.6 Mesa 21.1.0-devel (git-96d7555 2021-01-22 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.145 GCC 10.2.0 + Clang 11.0.1-1~oibaf~g ext4 3840x2160 GCC 11.0.1 20210307 + Clang 11.0.1-1~oibaf~g OpenBenchmarking.org Kernel Details - snd_usb_audio.ignore_ctl_error=1 - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto" Compiler Details - --disable-multilib --enable-checking=release Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301025 Python Details - Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
TR 3990X GCC 11 Benchmarking build-eigen: Time To Compile aobench: 2048 x 2048 - Total Time compress-lz4: 9 - Compression Speed fftw: Stock - 2D FFT Size 4096 compress-zstd: 8, Long Mode - Compression Speed scimark2: Jacobi Successive Over-Relaxation redis: GET x265: Bosphorus 4K pgbench: 100 - 50 - Read Write mrbayes: Primate Phylogeny Analysis pgbench: 100 - 50 - Read Write - Average Latency financebench: Repo OpenMP tachyon: Total Time avifenc: 6 compress-lz4: 3 - Compression Speed graphics-magick: Noise-Gaussian compress-zstd: 3, Long Mode - Decompression Speed scimark2: Composite compress-zstd: 19 - Decompression Speed compress-zstd: 8 - Decompression Speed libraw: Post-Processing Benchmark financebench: Bonds OpenMP compress-zstd: 8, Long Mode - Decompression Speed gcrypt: sqlite-speedtest: Timed Time - Size 1,000 ngspice: C7552 pgbench: 100 - 50 - Read Only redis: LPUSH pgbench: 100 - 100 - Read Only pgbench: 100 - 50 - Read Only - Average Latency avifenc: 10, Lossless liquid-dsp: 1 - 256 - 57 redis: SET scimark2: Dense LU Matrix Factorization scimark2: Fast Fourier Transform compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 graphics-magick: HWB Color Space compress-zstd: 3, Long Mode - Compression Speed pgbench: 100 - 100 - Read Only - Average Latency build-imagemagick: Time To Compile avifenc: 10 compress-zstd: 19, Long Mode - Compression Speed scimark2: Sparse Matrix Multiply build-apache: Time To Compile liquid-dsp: 128 - 256 - 57 c-ray: Total Time - 4K, 16 Rays Per Pixel x265: Bosphorus 1080p encode-mp3: WAV To MP3 liquid-dsp: 8 - 256 - 57 liquid-dsp: 2 - 256 - 57 compress-zstd: 3 - Compression Speed liquid-dsp: 4 - 256 - 57 avifenc: 0 compress-7zip: Compress Speed Test espeak: Text-To-Speech Synthesis compress-zstd: 19, Long Mode - Decompression Speed avifenc: 2 redis: SADD build-godot: Time To Compile compress-lz4: 3 - Decompression Speed compress-zstd: 19 - Compression Speed coremark: CoreMark Size 666 - Iterations Per Second tscp: AI Chess Performance compress-lz4: 9 - Decompression Speed ngspice: C2670 liquid-dsp: 16 - 256 - 57 graphics-magick: Rotate graphics-magick: Swirl liquid-dsp: 32 - 256 - 57 graphics-magick: Enhanced encode-opus: WAV To Opus Encode openfoam: Motorbike 60M compress-lz4: 1 - Decompression Speed avifenc: 6, Lossless liquid-dsp: 64 - 256 - 57 encode-flac: WAV To FLAC graphics-magick: Resizing openfoam: Motorbike 30M compress-zstd: 8 - Compression Speed compress-lz4: 1 - Compression Speed onnx: super-resolution-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: yolov4 - OpenMP CPU tnn: CPU - SqueezeNet v1.1 tnn: CPU - MobileNet v2 mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 basis: UASTC Level 2 + RDO Post-Processing basis: UASTC Level 3 basis: UASTC Level 2 basis: UASTC Level 0 basis: ETC1S astcenc: Exhaustive astcenc: Thorough astcenc: Medium astcenc: Fast synthmark: VoiceMark_100 povray: Trace Time graphics-magick: Sharpen jpegxl-decode: All jpegxl-decode: 1 jpegxl: JPEG - 7 jpegxl: JPEG - 5 jpegxl: PNG - 5 simdjson: DistinctUserID simdjson: PartialTweets simdjson: LargeRand simdjson: Kostya toybrot: C++ Threads toybrot: C++ Tasks toybrot: OpenMP toybrot: TBB lczero: Eigen lczero: BLAS quantlib: onnx: shufflenet-v2-10 - OpenMP CPU jpegxl: PNG - 7 GCC 10.2 GCC 11.0.1 77.464 28.038 56.99 7853.1 631.5 2151.60 2098930.04 25.03 8226 95.914 6.097 34588.787760 15.6495 9.879 56.59 813 3786.5 3211.78 3397.4 3653.6 61.32 48718.346354 3929.2 201.805 60.402 76.562 813236 1422330.50 1245297 0.061 5.877 68181000 1642170.79 8213.68 249.03 20.084 1367 599.2 0.080 78.183 3.320 44.3 3495.05 21.304 3433533333 9.164 57.46 6.863 526856667 136276667 5501.3 270570000 48.537 179376 28.808 3434.8 25.641 1914129.29 55.047 11376.2 86.3 2490073.059112 1340806 11382.9 88.881 1013100000 639 2010 1861233333 1014 6.623 441.90 12031.6 25.590 3131766667 8.466 1403 27.37 2460.4 10427.81 5178 73 415 230 251.380 289.206 35.386 3.967 5.655 28.432 7.706 595.240 14.759 10.501 6.551 42.115 36.43 9.40 5.25 4.53 615.487 9.004 758 164.75 41.43 67.84 67.78 79.56 4.49 4.31 0.91 2.65 6081 6285 6338 6052 1556 1549 2505.0 8413 9.80 88.807 30.215 53.90 8267.2 602.1 2251.68 2007263.79 24.00 7892 92.056 6.352 33264.424479 15.0588 9.568 54.81 790 3896.6 3124.66 3491.3 3751.2 59.83 47567.308594 4019.7 206.363 59.197 78.112 828618 1447008.09 1266789 0.06 5.783 67124333 1667586.71 8094.47 252.57 20.368 1348 591.2 0.079 79.145 3.282 44.8 3457.06 21.529 3469400000 9.255 58.03 6.931 522383333 135280000 5539.4 268723333 48.212 180550 28.990 3453.7 25.511 1905050.75 55.309 11429.7 86.7 2500492.156623 1345998 11425.9 88.575 1009633333 641 2016 1865300000 1012 6.635 442.63 12016.3 25.621 3134500000 8.473 1404 27.38 2460.2 10428.44 758 OpenBenchmarking.org
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.21, N = 3 77.46 88.81
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 10.2 GCC 11.0.1 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 28.04 30.22 1. (CC) gcc options: -lm -O3 -march=native -flto
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 10.2 GCC 11.0.1 13 26 39 52 65 SE +/- 0.45, N = 3 SE +/- 0.48, N = 3 56.99 53.90 1. (CC) gcc options: -O3
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 69.69, N = 12 SE +/- 107.36, N = 4 7853.1 8267.2 1. (CC) gcc options: -pthread -O3 -march=native -flto -lm
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed GCC 10.2 GCC 11.0.1 140 280 420 560 700 SE +/- 8.94, N = 3 SE +/- 1.39, N = 3 631.5 602.1 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 10.2 GCC 11.0.1 500 1000 1500 2000 2500 SE +/- 7.85, N = 3 SE +/- 1.81, N = 3 2151.60 2251.68 1. (CC) gcc options: -O3 -march=native -flto -lm
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 10.2 GCC 11.0.1 400K 800K 1200K 1600K 2000K SE +/- 24394.75, N = 3 SE +/- 20810.45, N = 3 2098930.04 2007263.79 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 25.03 24.00 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Write GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 116.64, N = 15 SE +/- 101.14, N = 15 8226 7892 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 1.36, N = 3 SE +/- 0.79, N = 3 95.91 92.06 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -flto -lm -lreadline
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average Latency GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.086, N = 15 SE +/- 0.077, N = 15 6.097 6.352 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP GCC 10.2 GCC 11.0.1 7K 14K 21K 28K 35K SE +/- 141.22, N = 3 SE +/- 45.75, N = 3 34588.79 33264.42 1. (CXX) g++ options: -O3 -march=native -fopenmp
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 10.2 GCC 11.0.1 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.12, N = 3 15.65 15.06 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 GCC 10.2 GCC 11.0.1 3 6 9 12 15 SE +/- 0.038, N = 3 SE +/- 0.031, N = 3 9.879 9.568 1. (CXX) g++ options: -O3 -fPIC -lm
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 10.2 GCC 11.0.1 13 26 39 52 65 SE +/- 0.44, N = 3 SE +/- 0.62, N = 7 56.59 54.81 1. (CC) gcc options: -O3
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 10.2 GCC 11.0.1 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 2.19, N = 3 813 790 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed GCC 10.2 GCC 11.0.1 800 1600 2400 3200 4000 SE +/- 10.15, N = 3 SE +/- 3.65, N = 8 3786.5 3896.6 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 19.33, N = 3 SE +/- 31.37, N = 3 3211.78 3124.66 1. (CC) gcc options: -O3 -march=native -flto -lm
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 15.64, N = 3 SE +/- 1.91, N = 3 3397.4 3491.3 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed GCC 10.2 GCC 11.0.1 800 1600 2400 3200 4000 SE +/- 9.41, N = 3 SE +/- 9.34, N = 3 3653.6 3751.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark GCC 10.2 GCC 11.0.1 14 28 42 56 70 SE +/- 0.25, N = 3 SE +/- 0.49, N = 3 61.32 59.83 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ljpeg -lz -lm
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP GCC 10.2 GCC 11.0.1 10K 20K 30K 40K 50K SE +/- 32.76, N = 3 SE +/- 167.36, N = 3 48718.35 47567.31 1. (CXX) g++ options: -O3 -march=native -fopenmp
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed GCC 10.2 GCC 11.0.1 900 1800 2700 3600 4500 SE +/- 4.04, N = 3 SE +/- 4.01, N = 3 3929.2 4019.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 GCC 10.2 GCC 11.0.1 50 100 150 200 250 SE +/- 0.39, N = 3 SE +/- 0.60, N = 3 201.81 206.36 1. (CC) gcc options: -O3 -march=native -flto -fvisibility=hidden -lgpg-error
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 10.2 GCC 11.0.1 14 28 42 56 70 SE +/- 0.25, N = 3 SE +/- 0.08, N = 3 60.40 59.20 1. (CC) gcc options: -O3 -march=native -flto -ldl -lz -lpthread
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 1.10, N = 4 SE +/- 0.33, N = 3 76.56 78.11 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Only GCC 10.2 GCC 11.0.1 200K 400K 600K 800K 1000K SE +/- 1171.97, N = 3 SE +/- 588.21, N = 3 813236 828618 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH GCC 10.2 GCC 11.0.1 300K 600K 900K 1200K 1500K SE +/- 15117.80, N = 3 SE +/- 24877.75, N = 3 1422330.50 1447008.09 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only GCC 10.2 GCC 11.0.1 300K 600K 900K 1200K 1500K SE +/- 1515.50, N = 3 SE +/- 8011.04, N = 3 1245297 1266789 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency GCC 10.2 GCC 11.0.1 0.0137 0.0274 0.0411 0.0548 0.0685 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.061 0.060 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless GCC 10.2 GCC 11.0.1 1.3223 2.6446 3.9669 5.2892 6.6115 SE +/- 0.013, N = 3 SE +/- 0.003, N = 3 5.877 5.783 1. (CXX) g++ options: -O3 -fPIC -lm
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 15M 30M 45M 60M 75M SE +/- 246698.87, N = 3 SE +/- 640108.87, N = 9 68181000 67124333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 10.2 GCC 11.0.1 400K 800K 1200K 1600K 2000K SE +/- 5296.10, N = 3 SE +/- 8022.37, N = 3 1642170.79 1667586.71 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 84.17, N = 3 SE +/- 126.25, N = 3 8213.68 8094.47 1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 10.2 GCC 11.0.1 60 120 180 240 300 SE +/- 0.53, N = 3 SE +/- 0.73, N = 3 249.03 252.57 1. (CC) gcc options: -O3 -march=native -flto -lm
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 GCC 10.2 GCC 11.0.1 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 20.08 20.37 1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native -flto
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 10.2 GCC 11.0.1 300 600 900 1200 1500 SE +/- 8.67, N = 3 SE +/- 10.37, N = 3 1367 1348 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed GCC 10.2 GCC 11.0.1 130 260 390 520 650 SE +/- 3.12, N = 3 SE +/- 6.24, N = 8 599.2 591.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency GCC 10.2 GCC 11.0.1 0.018 0.036 0.054 0.072 0.09 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.080 0.079 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 78.18 79.15
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 10.2 GCC 11.0.1 0.747 1.494 2.241 2.988 3.735 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 3.320 3.282 1. (CXX) g++ options: -O3 -fPIC -lm
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed GCC 10.2 GCC 11.0.1 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 44.3 44.8 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 16.71, N = 3 SE +/- 32.06, N = 3 3495.05 3457.06 1. (CC) gcc options: -O3 -march=native -flto -lm
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile GCC 10.2 GCC 11.0.1 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 21.30 21.53
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 700M 1400M 2100M 2800M 3500M SE +/- 12549413.44, N = 3 SE +/- 7629547.82, N = 3 3433533333 3469400000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 10.2 GCC 11.0.1 3 6 9 12 15 SE +/- 0.056, N = 3 SE +/- 0.066, N = 3 9.164 9.255 1. (CC) gcc options: -lm -lpthread -O3 -march=native -flto
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 10.2 GCC 11.0.1 13 26 39 52 65 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 57.46 58.03 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.014, N = 3 SE +/- 0.021, N = 3 6.863 6.931 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -flto -lncurses -lm
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 110M 220M 330M 440M 550M SE +/- 4259226.59, N = 3 SE +/- 4136086.45, N = 3 526856667 522383333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 2 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 30M 60M 90M 120M 150M SE +/- 283392.15, N = 3 SE +/- 255408.17, N = 3 136276667 135280000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3 - Compression Speed GCC 10.2 GCC 11.0.1 1200 2400 3600 4800 6000 SE +/- 17.22, N = 3 SE +/- 1.94, N = 3 5501.3 5539.4 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Liquid-DSP Threads: 4 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 60M 120M 180M 240M 300M SE +/- 492747.40, N = 3 SE +/- 361678.19, N = 3 270570000 268723333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 0 GCC 10.2 GCC 11.0.1 11 22 33 44 55 SE +/- 0.20, N = 3 SE +/- 0.29, N = 3 48.54 48.21 1. (CXX) g++ options: -O3 -fPIC -lm
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test GCC 10.2 GCC 11.0.1 40K 80K 120K 160K 200K SE +/- 375.42, N = 3 SE +/- 284.33, N = 3 179376 180550 1. (CXX) g++ options: -pipe -lpthread
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis GCC 10.2 GCC 11.0.1 7 14 21 28 35 SE +/- 0.05, N = 4 SE +/- 0.05, N = 4 28.81 28.99 1. (CC) gcc options: -O3 -march=native -flto -std=c99
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 11.16, N = 3 SE +/- 13.15, N = 3 3434.8 3453.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 25.64 25.51 1. (CXX) g++ options: -O3 -fPIC -lm
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD GCC 10.2 GCC 11.0.1 400K 800K 1200K 1600K 2000K SE +/- 18458.26, N = 3 SE +/- 24998.90, N = 3 1914129.29 1905050.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile GCC 10.2 GCC 11.0.1 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.14, N = 3 55.05 55.31
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 27.35, N = 3 SE +/- 27.84, N = 7 11376.2 11429.7 1. (CC) gcc options: -O3
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.61, N = 3 SE +/- 0.95, N = 3 86.3 86.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 10.2 GCC 11.0.1 500K 1000K 1500K 2000K 2500K SE +/- 10187.73, N = 3 SE +/- 8519.73, N = 3 2490073.06 2500492.16 1. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 10.2 GCC 11.0.1 300K 600K 900K 1200K 1500K SE +/- 4514.49, N = 5 SE +/- 1916.14, N = 5 1340806 1345998 1. (CC) gcc options: -O3 -march=native -flto
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 37.20, N = 3 SE +/- 27.71, N = 3 11382.9 11425.9 1. (CC) gcc options: -O3
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.23, N = 3 88.88 88.58 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 200M 400M 600M 800M 1000M SE +/- 5391041.95, N = 3 SE +/- 3681183.38, N = 3 1013100000 1009633333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 10.2 GCC 11.0.1 140 280 420 560 700 SE +/- 1.00, N = 3 SE +/- 1.45, N = 3 639 641 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 10.2 GCC 11.0.1 400 800 1200 1600 2000 SE +/- 11.36, N = 3 2010 2016 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 400M 800M 1200M 1600M 2000M SE +/- 10038813.56, N = 3 SE +/- 9960589.01, N = 3 1861233333 1865300000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 10.2 GCC 11.0.1 200 400 600 800 1000 SE +/- 2.08, N = 3 SE +/- 1.86, N = 3 1014 1012 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.014, N = 5 SE +/- 0.013, N = 5 6.623 6.635 1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M GCC 10.2 GCC 11.0.1 100 200 300 400 500 SE +/- 0.09, N = 3 SE +/- 0.26, N = 3 441.90 442.63 -lfoamToVTK -lfileFormats -ldecompose -lmetisDecomp -lscotchDecomp -lregionModels 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed GCC 10.2 GCC 11.0.1 3K 6K 9K 12K 15K SE +/- 25.74, N = 3 SE +/- 24.40, N = 3 12031.6 12016.3 1. (CC) gcc options: -O3
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 25.59 25.62 1. (CXX) g++ options: -O3 -fPIC -lm
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 700M 1400M 2100M 2800M 3500M SE +/- 18565320.12, N = 3 SE +/- 9971459.27, N = 3 3131766667 3134500000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.014, N = 5 SE +/- 0.021, N = 5 8.466 8.473 1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 10.2 GCC 11.0.1 300 600 900 1200 1500 SE +/- 12.14, N = 15 SE +/- 18.34, N = 3 1403 1404 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 27.37 27.38 -lfoamToVTK -lfileFormats -ldecompose -lmetisDecomp -lscotchDecomp -lregionModels 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed GCC 10.2 GCC 11.0.1 500 1000 1500 2000 2500 SE +/- 16.63, N = 3 SE +/- 4.24, N = 3 2460.4 2460.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 32.58, N = 3 SE +/- 23.26, N = 3 10427.81 10428.44 1. (CC) gcc options: -O3
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU GCC 10.2 1100 2200 3300 4400 5500 SE +/- 30.91, N = 3 5178 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU GCC 10.2 16 32 48 64 80 SE +/- 0.17, N = 3 73 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU GCC 10.2 90 180 270 360 450 SE +/- 5.22, N = 12 415 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU GCC 10.2 50 100 150 200 250 SE +/- 1.76, N = 3 230 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 GCC 10.2 50 100 150 200 250 SE +/- 0.16, N = 3 251.38 MIN: 250.21 / MAX: 252.69 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 GCC 10.2 60 120 180 240 300 SE +/- 0.78, N = 3 289.21 MIN: 272.83 / MAX: 331 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 GCC 10.2 8 16 24 32 40 SE +/- 0.19, N = 3 35.39 MIN: 34.32 / MAX: 39.11 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 GCC 10.2 0.8926 1.7852 2.6778 3.5704 4.463 SE +/- 0.014, N = 3 3.967 MIN: 3.62 / MAX: 4.65 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 GCC 10.2 1.2724 2.5448 3.8172 5.0896 6.362 SE +/- 0.023, N = 3 5.655 MIN: 5.38 / MAX: 6.05 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 GCC 10.2 7 14 21 28 35 SE +/- 0.43, N = 3 28.43 MIN: 27.09 / MAX: 31.41 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 GCC 10.2 2 4 6 8 10 SE +/- 0.057, N = 3 7.706 MIN: 7.49 / MAX: 9.85 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Basis Universal Settings: UASTC Level 2 + RDO Post-Processing OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 + RDO Post-Processing GCC 10.2 130 260 390 520 650 SE +/- 0.50, N = 3 595.24 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 3 GCC 10.2 4 8 12 16 20 SE +/- 0.01, N = 3 14.76 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 GCC 10.2 3 6 9 12 15 SE +/- 0.03, N = 3 10.50 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 0 GCC 10.2 2 4 6 8 10 SE +/- 0.032, N = 3 6.551 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: ETC1S GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 42.12 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive GCC 10.2 8 16 24 32 40 SE +/- 0.25, N = 3 36.43 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough GCC 10.2 3 6 9 12 15 SE +/- 0.02, N = 3 9.40 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium GCC 10.2 1.1813 2.3626 3.5439 4.7252 5.9065 SE +/- 0.00, N = 3 5.25 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast GCC 10.2 1.0193 2.0386 3.0579 4.0772 5.0965 SE +/- 0.01, N = 3 4.53 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 10.2 130 260 390 520 650 SE +/- 0.97, N = 3 615.49 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 10.2 3 6 9 12 15 SE +/- 0.052, N = 3 9.004 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 10.2 GCC 11.0.1 160 320 480 640 800 SE +/- 3.53, N = 3 SE +/- 4.16, N = 3 758 758 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: All GCC 10.2 40 80 120 160 200 SE +/- 0.24, N = 3 164.75
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: 1 GCC 10.2 9 18 27 36 45 SE +/- 0.02, N = 3 41.43
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 7 GCC 10.2 15 30 45 60 75 SE +/- 0.19, N = 3 67.84 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 5 GCC 10.2 15 30 45 60 75 SE +/- 0.33, N = 3 67.78 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 5 GCC 10.2 20 40 60 80 100 SE +/- 0.14, N = 3 79.56 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID GCC 10.2 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.01, N = 3 4.49 1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets GCC 10.2 0.9698 1.9396 2.9094 3.8792 4.849 SE +/- 0.01, N = 3 4.31 1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom GCC 10.2 0.2048 0.4096 0.6144 0.8192 1.024 SE +/- 0.00, N = 3 0.91 1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya GCC 10.2 0.5963 1.1926 1.7889 2.3852 2.9815 SE +/- 0.01, N = 3 2.65 1. (CXX) g++ options: -O3 -march=native -flto -pthread
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads GCC 10.2 1300 2600 3900 5200 6500 SE +/- 29.78, N = 3 6081 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks GCC 10.2 1300 2600 3900 5200 6500 SE +/- 10.40, N = 3 6285 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP GCC 10.2 1400 2800 4200 5600 7000 SE +/- 16.01, N = 3 6338 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB GCC 10.2 1300 2600 3900 5200 6500 SE +/- 48.64, N = 3 6052 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen GCC 10.2 300 600 900 1200 1500 SE +/- 26.77, N = 3 1556 1. (CXX) g++ options: -flto -O3 -march=native -pthread
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS GCC 10.2 300 600 900 1200 1500 SE +/- 14.19, N = 3 1549 1. (CXX) g++ options: -flto -O3 -march=native -pthread
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 10.2 500 1000 1500 2000 2500 SE +/- 0.38, N = 3 2505.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU GCC 10.2 2K 4K 6K 8K 10K SE +/- 183.41, N = 10 8413 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 7 GCC 10.2 3 6 9 12 15 SE +/- 0.25, N = 15 9.80 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
Phoronix Test Suite v10.8.4