TR 3990X GCC 11 Benchmarking AMD Ryzen Threadripper 3990X 64-Core testing with a System76 Thelio Major (F4c Z5 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Pop 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103125-PTS-TR3990XG68&grr&rdt .
TR 3990X GCC 11 Benchmarking Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 10.2 GCC 11.0.1 AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads) System76 Thelio Major (F4c Z5 BIOS) AMD Starship/Matisse 126GB Samsung SSD 970 EVO Plus 500GB AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (1750/875MHz) AMD Navi 10 HDMI Audio DELL P2415Q Intel I211 + Intel Wi-Fi 6 AX200 Pop 20.10 5.8.0-7630-generic (x86_64) GNOME Shell 3.38.2 X Server 1.20.8 4.6 Mesa 21.1.0-devel (git-96d7555 2021-01-22 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.145 GCC 10.2.0 + Clang 11.0.1-1~oibaf~g ext4 3840x2160 GCC 11.0.1 20210307 + Clang 11.0.1-1~oibaf~g OpenBenchmarking.org Kernel Details - snd_usb_audio.ignore_ctl_error=1 - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto" Compiler Details - --disable-multilib --enable-checking=release Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301025 Python Details - Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
TR 3990X GCC 11 Benchmarking basis: UASTC Level 2 + RDO Post-Processing openfoam: Motorbike 60M jpegxl: PNG - 7 onnx: bertsquad-10 - OpenMP CPU fftw: Stock - 2D FFT Size 4096 onnx: shufflenet-v2-10 - OpenMP CPU lczero: Eigen lczero: BLAS gcrypt: graphics-magick: Resizing pgbench: 100 - 50 - Read Write - Average Latency pgbench: 100 - 50 - Read Write onnx: fcn-resnet101-11 - OpenMP CPU onnx: yolov4 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU mrbayes: Primate Phylogeny Analysis compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed ngspice: C7552 mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 ngspice: C2670 build-eigen: Time To Compile build-imagemagick: Time To Compile compress-7zip: Compress Speed Test jpegxl-decode: 1 compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed graphics-magick: Sharpen graphics-magick: Rotate graphics-magick: Enhanced graphics-magick: Noise-Gaussian graphics-magick: Swirl graphics-magick: HWB Color Space sqlite-speedtest: Timed Time - Size 1,000 simdjson: PartialTweets simdjson: DistinctUserID financebench: Bonds OpenMP compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed build-godot: Time To Compile jpegxl-decode: All simdjson: Kostya simdjson: LargeRand avifenc: 0 compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed basis: ETC1S astcenc: Exhaustive financebench: Repo OpenMP liquid-dsp: 1 - 256 - 57 espeak: Text-To-Speech Synthesis openfoam: Motorbike 30M compress-zstd: 3 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed pgbench: 100 - 50 - Read Only - Average Latency pgbench: 100 - 50 - Read Only pgbench: 100 - 100 - Read Only - Average Latency pgbench: 100 - 100 - Read Only synthmark: VoiceMark_100 compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed quantlib: aobench: 2048 x 2048 - Total Time scimark2: Composite avifenc: 6, Lossless coremark: CoreMark Size 666 - Iterations Per Second avifenc: 2 x265: Bosphorus 4K build-apache: Time To Compile libraw: Post-Processing Benchmark tnn: CPU - MobileNet v2 compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 liquid-dsp: 8 - 256 - 57 liquid-dsp: 128 - 256 - 57 liquid-dsp: 64 - 256 - 57 liquid-dsp: 32 - 256 - 57 liquid-dsp: 16 - 256 - 57 liquid-dsp: 4 - 256 - 57 liquid-dsp: 2 - 256 - 57 jpegxl: JPEG - 5 tnn: CPU - SqueezeNet v1.1 jpegxl: PNG - 5 redis: LPUSH redis: SET tachyon: Total Time redis: SADD redis: GET basis: UASTC Level 3 jpegxl: JPEG - 7 encode-flac: WAV To FLAC povray: Trace Time encode-opus: WAV To Opus Encode basis: UASTC Level 2 x265: Bosphorus 1080p avifenc: 6 c-ray: Total Time - 4K, 16 Rays Per Pixel astcenc: Thorough encode-mp3: WAV To MP3 basis: UASTC Level 0 toybrot: OpenMP toybrot: C++ Tasks toybrot: C++ Threads toybrot: TBB avifenc: 10, Lossless astcenc: Medium astcenc: Fast avifenc: 10 tscp: AI Chess Performance scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform GCC 10.2 GCC 11.0.1 595.240 441.90 9.80 415 7853.1 8413 1556 1549 201.805 1403 6.097 8226 73 230 5178 95.914 11376.2 56.59 76.562 35.386 3.967 5.655 28.432 7.706 88.881 77.464 78.183 179376 41.43 3786.5 599.2 758 639 1014 813 2010 1367 60.402 4.31 4.49 48718.346354 11382.9 56.99 55.047 164.75 2.65 0.91 48.537 3397.4 86.3 3434.8 44.3 42.115 36.43 34588.787760 68181000 28.808 27.37 5501.3 3653.6 2460.4 3929.2 631.5 0.061 813236 0.080 1245297 615.487 12031.6 10427.81 2505.0 28.038 3211.78 25.590 2490073.059112 25.641 25.03 21.304 61.32 289.206 20.084 526856667 3433533333 3131766667 1861233333 1013100000 270570000 136276667 67.78 251.380 79.56 1422330.50 1642170.79 15.6495 1914129.29 2098930.04 14.759 67.84 8.466 9.004 6.623 10.501 57.46 9.879 9.164 9.40 6.863 6.551 6338 6285 6081 6052 5.877 5.25 4.53 3.320 1340806 2151.60 8213.68 3495.05 249.03 442.63 8267.2 206.363 1404 6.352 7892 92.056 11429.7 54.81 78.112 88.575 88.807 79.145 180550 3896.6 591.2 758 641 1012 790 2016 1348 59.197 47567.308594 11425.9 53.90 55.309 48.212 3491.3 86.7 3453.7 44.8 33264.424479 67124333 28.990 27.38 5539.4 3751.2 2460.2 4019.7 602.1 0.06 828618 0.079 1266789 12016.3 10428.44 30.215 3124.66 25.621 2500492.156623 25.511 24.00 21.529 59.83 20.368 522383333 3469400000 3134500000 1865300000 1009633333 268723333 135280000 1447008.09 1667586.71 15.0588 1905050.75 2007263.79 8.473 6.635 58.03 9.568 9.255 6.931 5.783 3.282 1345998 2251.68 8094.47 3457.06 252.57 OpenBenchmarking.org
Basis Universal Settings: UASTC Level 2 + RDO Post-Processing OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 + RDO Post-Processing GCC 10.2 130 260 390 520 650 SE +/- 0.50, N = 3 595.24 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M GCC 10.2 GCC 11.0.1 100 200 300 400 500 SE +/- 0.09, N = 3 SE +/- 0.26, N = 3 441.90 442.63 -lfoamToVTK -lfileFormats -ldecompose -lmetisDecomp -lscotchDecomp -lregionModels 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 7 GCC 10.2 3 6 9 12 15 SE +/- 0.25, N = 15 9.80 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU GCC 10.2 90 180 270 360 450 SE +/- 5.22, N = 12 415 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 69.69, N = 12 SE +/- 107.36, N = 4 7853.1 8267.2 1. (CC) gcc options: -pthread -O3 -march=native -flto -lm
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU GCC 10.2 2K 4K 6K 8K 10K SE +/- 183.41, N = 10 8413 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen GCC 10.2 300 600 900 1200 1500 SE +/- 26.77, N = 3 1556 1. (CXX) g++ options: -flto -O3 -march=native -pthread
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS GCC 10.2 300 600 900 1200 1500 SE +/- 14.19, N = 3 1549 1. (CXX) g++ options: -flto -O3 -march=native -pthread
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 GCC 10.2 GCC 11.0.1 50 100 150 200 250 SE +/- 0.39, N = 3 SE +/- 0.60, N = 3 201.81 206.36 1. (CC) gcc options: -O3 -march=native -flto -fvisibility=hidden -lgpg-error
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 10.2 GCC 11.0.1 300 600 900 1200 1500 SE +/- 12.14, N = 15 SE +/- 18.34, N = 3 1403 1404 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average Latency GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.086, N = 15 SE +/- 0.077, N = 15 6.097 6.352 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Write GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 116.64, N = 15 SE +/- 101.14, N = 15 8226 7892 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU GCC 10.2 16 32 48 64 80 SE +/- 0.17, N = 3 73 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU GCC 10.2 50 100 150 200 250 SE +/- 1.76, N = 3 230 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU GCC 10.2 1100 2200 3300 4400 5500 SE +/- 30.91, N = 3 5178 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ffunction-sections -fdata-sections -ldl -lrt
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 1.36, N = 3 SE +/- 0.79, N = 3 95.91 92.06 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -flto -lm -lreadline
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 27.35, N = 3 SE +/- 27.84, N = 7 11376.2 11429.7 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 10.2 GCC 11.0.1 13 26 39 52 65 SE +/- 0.44, N = 3 SE +/- 0.62, N = 7 56.59 54.81 1. (CC) gcc options: -O3
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 1.10, N = 4 SE +/- 0.33, N = 3 76.56 78.11 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 GCC 10.2 8 16 24 32 40 SE +/- 0.19, N = 3 35.39 MIN: 34.32 / MAX: 39.11 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 GCC 10.2 0.8926 1.7852 2.6778 3.5704 4.463 SE +/- 0.014, N = 3 3.967 MIN: 3.62 / MAX: 4.65 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 GCC 10.2 1.2724 2.5448 3.8172 5.0896 6.362 SE +/- 0.023, N = 3 5.655 MIN: 5.38 / MAX: 6.05 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 GCC 10.2 7 14 21 28 35 SE +/- 0.43, N = 3 28.43 MIN: 27.09 / MAX: 31.41 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 GCC 10.2 2 4 6 8 10 SE +/- 0.057, N = 3 7.706 MIN: 7.49 / MAX: 9.85 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.23, N = 3 88.88 88.58 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.21, N = 3 77.46 88.81
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 78.18 79.15
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test GCC 10.2 GCC 11.0.1 40K 80K 120K 160K 200K SE +/- 375.42, N = 3 SE +/- 284.33, N = 3 179376 180550 1. (CXX) g++ options: -pipe -lpthread
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: 1 GCC 10.2 9 18 27 36 45 SE +/- 0.02, N = 3 41.43
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed GCC 10.2 GCC 11.0.1 800 1600 2400 3200 4000 SE +/- 10.15, N = 3 SE +/- 3.65, N = 8 3786.5 3896.6 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed GCC 10.2 GCC 11.0.1 130 260 390 520 650 SE +/- 3.12, N = 3 SE +/- 6.24, N = 8 599.2 591.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 10.2 GCC 11.0.1 160 320 480 640 800 SE +/- 3.53, N = 3 SE +/- 4.16, N = 3 758 758 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 10.2 GCC 11.0.1 140 280 420 560 700 SE +/- 1.00, N = 3 SE +/- 1.45, N = 3 639 641 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 10.2 GCC 11.0.1 200 400 600 800 1000 SE +/- 2.08, N = 3 SE +/- 1.86, N = 3 1014 1012 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 10.2 GCC 11.0.1 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 2.19, N = 3 813 790 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 10.2 GCC 11.0.1 400 800 1200 1600 2000 SE +/- 11.36, N = 3 2010 2016 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 10.2 GCC 11.0.1 300 600 900 1200 1500 SE +/- 8.67, N = 3 SE +/- 10.37, N = 3 1367 1348 1. (CC) gcc options: -fopenmp -O3 -march=native -flto -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 10.2 GCC 11.0.1 14 28 42 56 70 SE +/- 0.25, N = 3 SE +/- 0.08, N = 3 60.40 59.20 1. (CC) gcc options: -O3 -march=native -flto -ldl -lz -lpthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets GCC 10.2 0.9698 1.9396 2.9094 3.8792 4.849 SE +/- 0.01, N = 3 4.31 1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID GCC 10.2 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.01, N = 3 4.49 1. (CXX) g++ options: -O3 -march=native -flto -pthread
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP GCC 10.2 GCC 11.0.1 10K 20K 30K 40K 50K SE +/- 32.76, N = 3 SE +/- 167.36, N = 3 48718.35 47567.31 1. (CXX) g++ options: -O3 -march=native -fopenmp
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 37.20, N = 3 SE +/- 27.71, N = 3 11382.9 11425.9 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 10.2 GCC 11.0.1 13 26 39 52 65 SE +/- 0.45, N = 3 SE +/- 0.48, N = 3 56.99 53.90 1. (CC) gcc options: -O3
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile GCC 10.2 GCC 11.0.1 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.14, N = 3 55.05 55.31
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: All GCC 10.2 40 80 120 160 200 SE +/- 0.24, N = 3 164.75
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya GCC 10.2 0.5963 1.1926 1.7889 2.3852 2.9815 SE +/- 0.01, N = 3 2.65 1. (CXX) g++ options: -O3 -march=native -flto -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom GCC 10.2 0.2048 0.4096 0.6144 0.8192 1.024 SE +/- 0.00, N = 3 0.91 1. (CXX) g++ options: -O3 -march=native -flto -pthread
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 0 GCC 10.2 GCC 11.0.1 11 22 33 44 55 SE +/- 0.20, N = 3 SE +/- 0.29, N = 3 48.54 48.21 1. (CXX) g++ options: -O3 -fPIC -lm
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 15.64, N = 3 SE +/- 1.91, N = 3 3397.4 3491.3 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed GCC 10.2 GCC 11.0.1 20 40 60 80 100 SE +/- 0.61, N = 3 SE +/- 0.95, N = 3 86.3 86.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 11.16, N = 3 SE +/- 13.15, N = 3 3434.8 3453.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed GCC 10.2 GCC 11.0.1 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 44.3 44.8 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: ETC1S GCC 10.2 10 20 30 40 50 SE +/- 0.13, N = 3 42.12 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive GCC 10.2 8 16 24 32 40 SE +/- 0.25, N = 3 36.43 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP GCC 10.2 GCC 11.0.1 7K 14K 21K 28K 35K SE +/- 141.22, N = 3 SE +/- 45.75, N = 3 34588.79 33264.42 1. (CXX) g++ options: -O3 -march=native -fopenmp
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 15M 30M 45M 60M 75M SE +/- 246698.87, N = 3 SE +/- 640108.87, N = 9 68181000 67124333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis GCC 10.2 GCC 11.0.1 7 14 21 28 35 SE +/- 0.05, N = 4 SE +/- 0.05, N = 4 28.81 28.99 1. (CC) gcc options: -O3 -march=native -flto -std=c99
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 27.37 27.38 -lfoamToVTK -lfileFormats -ldecompose -lmetisDecomp -lscotchDecomp -lregionModels 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3 - Compression Speed GCC 10.2 GCC 11.0.1 1200 2400 3600 4800 6000 SE +/- 17.22, N = 3 SE +/- 1.94, N = 3 5501.3 5539.4 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed GCC 10.2 GCC 11.0.1 800 1600 2400 3200 4000 SE +/- 9.41, N = 3 SE +/- 9.34, N = 3 3653.6 3751.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed GCC 10.2 GCC 11.0.1 500 1000 1500 2000 2500 SE +/- 16.63, N = 3 SE +/- 4.24, N = 3 2460.4 2460.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed GCC 10.2 GCC 11.0.1 900 1800 2700 3600 4500 SE +/- 4.04, N = 3 SE +/- 4.01, N = 3 3929.2 4019.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed GCC 10.2 GCC 11.0.1 140 280 420 560 700 SE +/- 8.94, N = 3 SE +/- 1.39, N = 3 631.5 602.1 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency GCC 10.2 GCC 11.0.1 0.0137 0.0274 0.0411 0.0548 0.0685 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.061 0.060 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Only GCC 10.2 GCC 11.0.1 200K 400K 600K 800K 1000K SE +/- 1171.97, N = 3 SE +/- 588.21, N = 3 813236 828618 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency GCC 10.2 GCC 11.0.1 0.018 0.036 0.054 0.072 0.09 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.080 0.079 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only GCC 10.2 GCC 11.0.1 300K 600K 900K 1200K 1500K SE +/- 1515.50, N = 3 SE +/- 8011.04, N = 3 1245297 1266789 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 10.2 130 260 390 520 650 SE +/- 0.97, N = 3 615.49 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed GCC 10.2 GCC 11.0.1 3K 6K 9K 12K 15K SE +/- 25.74, N = 3 SE +/- 24.40, N = 3 12031.6 12016.3 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 32.58, N = 3 SE +/- 23.26, N = 3 10427.81 10428.44 1. (CC) gcc options: -O3
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 10.2 500 1000 1500 2000 2500 SE +/- 0.38, N = 3 2505.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 10.2 GCC 11.0.1 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 28.04 30.22 1. (CC) gcc options: -lm -O3 -march=native -flto
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 19.33, N = 3 SE +/- 31.37, N = 3 3211.78 3124.66 1. (CC) gcc options: -O3 -march=native -flto -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 25.59 25.62 1. (CXX) g++ options: -O3 -fPIC -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 10.2 GCC 11.0.1 500K 1000K 1500K 2000K 2500K SE +/- 10187.73, N = 3 SE +/- 8519.73, N = 3 2490073.06 2500492.16 1. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 25.64 25.51 1. (CXX) g++ options: -O3 -fPIC -lm
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 10.2 GCC 11.0.1 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 25.03 24.00 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile GCC 10.2 GCC 11.0.1 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 21.30 21.53
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark GCC 10.2 GCC 11.0.1 14 28 42 56 70 SE +/- 0.25, N = 3 SE +/- 0.49, N = 3 61.32 59.83 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -ljpeg -lz -lm
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 GCC 10.2 60 120 180 240 300 SE +/- 0.78, N = 3 289.21 MIN: 272.83 / MAX: 331 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 GCC 10.2 GCC 11.0.1 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 20.08 20.37 1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native -flto
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 110M 220M 330M 440M 550M SE +/- 4259226.59, N = 3 SE +/- 4136086.45, N = 3 526856667 522383333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 700M 1400M 2100M 2800M 3500M SE +/- 12549413.44, N = 3 SE +/- 7629547.82, N = 3 3433533333 3469400000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 700M 1400M 2100M 2800M 3500M SE +/- 18565320.12, N = 3 SE +/- 9971459.27, N = 3 3131766667 3134500000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 400M 800M 1200M 1600M 2000M SE +/- 10038813.56, N = 3 SE +/- 9960589.01, N = 3 1861233333 1865300000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 200M 400M 600M 800M 1000M SE +/- 5391041.95, N = 3 SE +/- 3681183.38, N = 3 1013100000 1009633333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 4 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 60M 120M 180M 240M 300M SE +/- 492747.40, N = 3 SE +/- 361678.19, N = 3 270570000 268723333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 2 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 GCC 10.2 GCC 11.0.1 30M 60M 90M 120M 150M SE +/- 283392.15, N = 3 SE +/- 255408.17, N = 3 136276667 135280000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 5 GCC 10.2 15 30 45 60 75 SE +/- 0.33, N = 3 67.78 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 GCC 10.2 50 100 150 200 250 SE +/- 0.16, N = 3 251.38 MIN: 250.21 / MAX: 252.69 1. (CXX) g++ options: -O3 -march=native -flto -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 5 GCC 10.2 20 40 60 80 100 SE +/- 0.14, N = 3 79.56 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH GCC 10.2 GCC 11.0.1 300K 600K 900K 1200K 1500K SE +/- 15117.80, N = 3 SE +/- 24877.75, N = 3 1422330.50 1447008.09 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 10.2 GCC 11.0.1 400K 800K 1200K 1600K 2000K SE +/- 5296.10, N = 3 SE +/- 8022.37, N = 3 1642170.79 1667586.71 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 10.2 GCC 11.0.1 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.12, N = 3 15.65 15.06 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD GCC 10.2 GCC 11.0.1 400K 800K 1200K 1600K 2000K SE +/- 18458.26, N = 3 SE +/- 24998.90, N = 3 1914129.29 1905050.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 10.2 GCC 11.0.1 400K 800K 1200K 1600K 2000K SE +/- 24394.75, N = 3 SE +/- 20810.45, N = 3 2098930.04 2007263.79 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 3 GCC 10.2 4 8 12 16 20 SE +/- 0.01, N = 3 14.76 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 7 GCC 10.2 15 30 45 60 75 SE +/- 0.19, N = 3 67.84 1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie -pthread
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.014, N = 5 SE +/- 0.021, N = 5 8.466 8.473 1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time GCC 10.2 3 6 9 12 15 SE +/- 0.052, N = 3 9.004 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -flto -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.014, N = 5 SE +/- 0.013, N = 5 6.623 6.635 1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 GCC 10.2 3 6 9 12 15 SE +/- 0.03, N = 3 10.50 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 10.2 GCC 11.0.1 13 26 39 52 65 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 57.46 58.03 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 GCC 10.2 GCC 11.0.1 3 6 9 12 15 SE +/- 0.038, N = 3 SE +/- 0.031, N = 3 9.879 9.568 1. (CXX) g++ options: -O3 -fPIC -lm
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 10.2 GCC 11.0.1 3 6 9 12 15 SE +/- 0.056, N = 3 SE +/- 0.066, N = 3 9.164 9.255 1. (CC) gcc options: -lm -lpthread -O3 -march=native -flto
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough GCC 10.2 3 6 9 12 15 SE +/- 0.02, N = 3 9.40 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 10.2 GCC 11.0.1 2 4 6 8 10 SE +/- 0.014, N = 3 SE +/- 0.021, N = 3 6.863 6.931 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -flto -lncurses -lm
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 0 GCC 10.2 2 4 6 8 10 SE +/- 0.032, N = 3 6.551 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP GCC 10.2 1400 2800 4200 5600 7000 SE +/- 16.01, N = 3 6338 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks GCC 10.2 1300 2600 3900 5200 6500 SE +/- 10.40, N = 3 6285 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads GCC 10.2 1300 2600 3900 5200 6500 SE +/- 29.78, N = 3 6081 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB GCC 10.2 1300 2600 3900 5200 6500 SE +/- 48.64, N = 3 6052 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless GCC 10.2 GCC 11.0.1 1.3223 2.6446 3.9669 5.2892 6.6115 SE +/- 0.013, N = 3 SE +/- 0.003, N = 3 5.877 5.783 1. (CXX) g++ options: -O3 -fPIC -lm
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium GCC 10.2 1.1813 2.3626 3.5439 4.7252 5.9065 SE +/- 0.00, N = 3 5.25 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast GCC 10.2 1.0193 2.0386 3.0579 4.0772 5.0965 SE +/- 0.01, N = 3 4.53 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 10.2 GCC 11.0.1 0.747 1.494 2.241 2.988 3.735 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 3.320 3.282 1. (CXX) g++ options: -O3 -fPIC -lm
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 10.2 GCC 11.0.1 300K 600K 900K 1200K 1500K SE +/- 4514.49, N = 5 SE +/- 1916.14, N = 5 1340806 1345998 1. (CC) gcc options: -O3 -march=native -flto
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 10.2 GCC 11.0.1 500 1000 1500 2000 2500 SE +/- 7.85, N = 3 SE +/- 1.81, N = 3 2151.60 2251.68 1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 10.2 GCC 11.0.1 2K 4K 6K 8K 10K SE +/- 84.17, N = 3 SE +/- 126.25, N = 3 8213.68 8094.47 1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 10.2 GCC 11.0.1 700 1400 2100 2800 3500 SE +/- 16.71, N = 3 SE +/- 32.06, N = 3 3495.05 3457.06 1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 10.2 GCC 11.0.1 60 120 180 240 300 SE +/- 0.53, N = 3 SE +/- 0.73, N = 3 249.03 252.57 1. (CC) gcc options: -O3 -march=native -flto -lm
Phoronix Test Suite v10.8.4