5950X comp
AMD Ryzen 9 5950X 16-Core testing with an ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101155-HA-5950XCOMP80&grw&sro .
5950X comp: System Details
Configurations: -O1, -O3, -Ofast

  Processor:          AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
  Motherboard:        ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS)
  Chipset:            AMD Starship/Matisse
  Memory:             16GB
  Disk:               2000GB Corsair Force MP600 + 2000GB
  Graphics:           AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)
  Audio:              AMD Navi 10 HDMI Audio
  Monitor:            ASUS MG28U
  Network:            Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
  OS:                 Ubuntu 20.10
  Kernel:             5.11.0-051100rc2daily20210108-generic (x86_64) 20210107
  Desktop:            GNOME Shell 3.38.1
  Display Server:     X Server 1.20.9
  Display Driver:     amdgpu 19.1.0
  OpenGL:             4.6 Mesa 21.0.0-devel (git-f01bca8 2021-01-08 groovy-oibaf-ppa) (LLVM 11.0.1)
  Vulkan:             1.2.164
  Compiler:           GCC 10.2.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Environment Details
  -O1:    CXXFLAGS=-O1 CFLAGS=-O1
  -O3:    CXXFLAGS=-O3 CFLAGS=-O3
  -Ofast: CXXFLAGS=-Ofast CFLAGS=-Ofast

Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
  CPU Microcode:    0xa201009

Security Details
  itlb_multihit:     Not affected
  l1tf:              Not affected
  mds:               Not affected
  meltdown:          Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1:        Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2:        Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
  srbds:             Not affected
  tsx_async_abort:   Not affected
5950X comp: Result Overview

Test                                                   -O1             -O3             -Ofast
compress-lz4: 3 - Compression Speed                    72.28           71.11           70.28
compress-lz4: 3 - Decompression Speed                  12834.8         12828.5         12828.1
compress-lz4: 9 - Compression Speed                    70.68           69.43           69.42
compress-lz4: 9 - Decompression Speed                  12852.4         12827.4         12794.5
encode-opus: WAV To Opus Encode                        6.947           6.025           -
encode-wavpack: WAV To WavPack                         10.184          10.272          10.143
astcenc: Thorough                                      12.53           12.51           12.53
astcenc: Exhaustive                                    99.81           99.75           99.70
lczero: BLAS                                           349             405             411
lczero: Eigen                                          355             410             424
mnn: SqueezeNetV1.0                                    5.176           5.104           5.106
mnn: resnet-v2-50                                      24.004          23.685          23.406
mnn: MobileNetV2_224                                   3.345           3.307           3.277
mnn: mobilenet-v1-1.0                                  2.516           2.447           2.410
mnn: inception-v3                                      30.753          30.757          30.845
tnn: CPU - MobileNet v2                                220.154         215.335         217.849
tnn: CPU - SqueezeNet v1.1                             205.857         211.790         210.221
amg:                                                   211605433       211971700       211981075
openfoam: Motorbike 30M                                96.97           99.22           96.68
coremark: CoreMark Size 666 - Iterations Per Second    672167.939285   865041.041461   868266.143661
simdjson: Kostya                                       0.69            0.65            0.66
simdjson: LargeRand                                    0.60            0.61            0.60
simdjson: PartialTweets                                0.91            0.87            0.86
simdjson: DistinctUserID                               0.90            0.89            0.88

A "-" marks a test with no result recorded for that configuration.
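The overview mixes "More Is Better" and "Fewer Is Better" metrics, so comparing configurations means normalizing the direction first. The following is a minimal Python sketch of that arithmetic, using a handful of figures copied from the overview. The function name `improvement_percent` and the `SAMPLES` table are illustrative assumptions and are not part of the Phoronix Test Suite.

```python
# Illustrative helper (not part of the Phoronix Test Suite): express how much
# better a candidate result is than a baseline, regardless of metric direction.

def improvement_percent(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Return a positive percentage when the candidate beats the baseline."""
    if higher_is_better:
        return (candidate / baseline - 1.0) * 100.0
    # For time-like metrics, a smaller candidate value is the better one.
    return (baseline / candidate - 1.0) * 100.0

# (test name, -O1 value, -O3 value, higher_is_better), copied from the overview.
SAMPLES = [
    ("coremark: Iterations Per Second", 672167.939285, 865041.041461, True),
    ("lczero: BLAS (nodes/s)", 349.0, 405.0, True),
    ("encode-opus (seconds)", 6.947, 6.025, False),
    ("compress-lz4: 3 - Compression Speed (MB/s)", 72.28, 71.11, True),
]

if __name__ == "__main__":
    for name, o1, o3, higher in SAMPLES:
        change = improvement_percent(o1, o3, higher)
        print(f"{name}: -O3 vs -O1 {change:+.1f}%")
```

A negative figure means the -O3 build was slower than the -O1 build on that test; whether such a gap is meaningful depends on the standard errors reported with each result.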
LZ4 Compression 1.9.3
Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  -O1:    72.28    (SE +/- 0.55, N = 10)
  -O3:    71.11    (SE +/- 0.84, N = 4)
  -Ofast: 70.28    (SE +/- 0.71, N = 5)
  1. (CC) gcc options: -O3

LZ4 Compression 1.9.3
Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  -O1:    12834.8  (SE +/- 9.67, N = 10)
  -O3:    12828.5  (SE +/- 14.38, N = 4)
  -Ofast: 12828.1  (SE +/- 15.70, N = 5)
  1. (CC) gcc options: -O3

LZ4 Compression 1.9.3
Compression Level: 9 - Compression Speed (MB/s, More Is Better)
  -O1:    70.68    (SE +/- 0.97, N = 3)
  -O3:    69.43    (SE +/- 0.63, N = 3)
  -Ofast: 69.42    (SE +/- 0.76, N = 5)
  1. (CC) gcc options: -O3

LZ4 Compression 1.9.3
Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
  -O1:    12852.4  (SE +/- 21.65, N = 3)
  -O3:    12827.4  (SE +/- 43.19, N = 3)
  -Ofast: 12794.5  (SE +/- 30.70, N = 5)
  1. (CC) gcc options: -O3

Opus Codec Encoding 1.3.1
WAV To Opus Encode (Seconds, Fewer Is Better)
  -O1:    6.947    (SE +/- 0.037, N = 5)
  -O3:    6.025    (SE +/- 0.050, N = 5)
  Reported per-result flags: -O1, -O3
  1. (CXX) g++ options: -fvisibility=hidden -logg -lm

WavPack Audio Encoding 5.3
WAV To WavPack (Seconds, Fewer Is Better)
  -O1:    10.18    (SE +/- 0.04, N = 5)
  -O3:    10.27    (SE +/- 0.01, N = 5)
  -Ofast: 10.14    (SE +/- 0.01, N = 5)
  Reported per-result flags: -O1, -O3, -Ofast
  1. (CXX) g++ options: -rdynamic

ASTC Encoder 2.0
Preset: Thorough (Seconds, Fewer Is Better)
  -O1:    12.53    (SE +/- 0.03, N = 3)
  -O3:    12.51    (SE +/- 0.03, N = 3)
  -Ofast: 12.53    (SE +/- 0.02, N = 3)
  1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder 2.0
Preset: Exhaustive (Seconds, Fewer Is Better)
  -O1:    99.81    (SE +/- 0.10, N = 3)
  -O3:    99.75    (SE +/- 0.14, N = 3)
  -Ofast: 99.70    (SE +/- 0.14, N = 3)
  1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LeelaChessZero 0.26
Backend: BLAS (Nodes Per Second, More Is Better)
  -O1:    349      (SE +/- 3.06, N = 3)
  -O3:    405      (SE +/- 4.10, N = 6)
  -Ofast: 411      (SE +/- 2.00, N = 3)
  Reported per-result flags: -O1, -O3, -Ofast
  1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.26
Backend: Eigen (Nodes Per Second, More Is Better)
  -O1:    355      (SE +/- 4.36, N = 3)
  -O3:    410      (SE +/- 2.03, N = 3)
  -Ofast: 424      (SE +/- 5.13, N = 3)
  Reported per-result flags: -O1, -O3, -Ofast
  1. (CXX) g++ options: -flto -pthread

Mobile Neural Network 1.1.1
Model: SqueezeNetV1.0 (ms, Fewer Is Better)
  -O1:    5.176    (SE +/- 0.075, N = 3)    -O1 - MIN: 4.93 / MAX: 8.12
  -O3:    5.104    (SE +/- 0.043, N = 15)   MIN: 4.6 / MAX: 13.78
  -Ofast: 5.106    (SE +/- 0.053, N = 5)    -Ofast - MIN: 4.8 / MAX: 6.57
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1
Model: resnet-v2-50 (ms, Fewer Is Better)
  -O1:    24.00    (SE +/- 0.00, N = 3)     -O1 - MIN: 22.91 / MAX: 32.98
  -O3:    23.69    (SE +/- 0.16, N = 15)    MIN: 21.64 / MAX: 51.01
  -Ofast: 23.41    (SE +/- 0.38, N = 5)     -Ofast - MIN: 21.71 / MAX: 31.91
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1
Model: MobileNetV2_224 (ms, Fewer Is Better)
  -O1:    3.345    (SE +/- 0.037, N = 3)    -O1 - MIN: 3.2 / MAX: 8.98
  -O3:    3.307    (SE +/- 0.025, N = 15)   MIN: 3.07 / MAX: 12.37
  -Ofast: 3.277    (SE +/- 0.061, N = 5)    -Ofast - MIN: 3.05 / MAX: 3.93
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1
Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
  -O1:    2.516    (SE +/- 0.040, N = 3)    -O1 - MIN: 2.42 / MAX: 3.08
  -O3:    2.447    (SE +/- 0.025, N = 15)   MIN: 2.24 / MAX: 11.79
  -Ofast: 2.410    (SE +/- 0.062, N = 5)    -Ofast - MIN: 2.26 / MAX: 4.06
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1
Model: inception-v3 (ms, Fewer Is Better)
  -O1:    30.75    (SE +/- 0.25, N = 3)     -O1 - MIN: 29.72 / MAX: 38.23
  -O3:    30.76    (SE +/- 0.13, N = 15)    MIN: 29.69 / MAX: 40.5
  -Ofast: 30.85    (SE +/- 0.28, N = 5)     -Ofast - MIN: 29.65 / MAX: 40.5
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

TNN 0.2.3
Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
  -O1:    220.15   (SE +/- 0.81, N = 3)     -O1 - MIN: 215.64 / MAX: 269.36
  -O3:    215.34   (SE +/- 1.91, N = 3)     MIN: 209.09 / MAX: 276.74
  -Ofast: 217.85   (SE +/- 0.41, N = 3)     -Ofast - MIN: 214.17 / MAX: 258.58
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN 0.2.3
Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  -O1:    205.86   (SE +/- 1.36, N = 3)     -O1 - MIN: 203.7 / MAX: 208.59
  -O3:    211.79   (SE +/- 0.73, N = 3)     MIN: 210.23 / MAX: 212.89
  -Ofast: 210.22   (SE +/- 0.13, N = 3)     -Ofast - MIN: 209.89 / MAX: 210.66
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Algebraic Multi-Grid Benchmark 1.2
(Figure Of Merit, More Is Better)
  -O1:    211605433  (SE +/- 2531189.55, N = 3)
  -O3:    211971700  (SE +/- 2500260.63, N = 3)
  -Ofast: 211981075  (SE +/- 2395455.32, N = 4)
  1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

OpenFOAM 8
Input: Motorbike 30M (Seconds, Fewer Is Better)
  -O1:    96.97    (SE +/- 0.09, N = 3)
  -O3:    99.22    (SE +/- 0.50, N = 3)
  -Ofast: 96.68    (SE +/- 0.15, N = 3)
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Coremark 1.0
CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
  -O1:    672167.94  (SE +/- 1727.87, N = 3)
  -O3:    865041.04  (SE +/- 3792.56, N = 3)
  -Ofast: 868266.14  (SE +/- 2256.07, N = 3)
  Reported per-result flags: -O1, -O3, -Ofast
  1. (CC) gcc options: -O2 -lrt" -lrt

simdjson 0.7.1
Throughput Test: Kostya (GB/s, More Is Better)
  -O1:    0.69     (SE +/- 0.03, N = 12)
  -O3:    0.65     (SE +/- 0.03, N = 12)
  -Ofast: 0.66     (SE +/- 0.02, N = 12)
  Reported per-result flags: -O1, -Ofast
  1. (CXX) g++ options: -O3 -pthread

simdjson 0.7.1
Throughput Test: LargeRandom (GB/s, More Is Better)
  -O1:    0.60     (SE +/- 0.00, N = 3)
  -O3:    0.61     (SE +/- 0.00, N = 3)
  -Ofast: 0.60     (SE +/- 0.01, N = 3)
  Reported per-result flags: -O1, -Ofast
  1. (CXX) g++ options: -O3 -pthread

simdjson 0.7.1
Throughput Test: PartialTweets (GB/s, More Is Better)
  -O1:    0.91     (SE +/- 0.01, N = 15)
  -O3:    0.87     (SE +/- 0.01, N = 13)
  -Ofast: 0.86     (SE +/- 0.00, N = 3)
  Reported per-result flags: -O1, -Ofast
  1. (CXX) g++ options: -O3 -pthread

simdjson 0.7.1
Throughput Test: DistinctUserID (GB/s, More Is Better)
  -O1:    0.90     (SE +/- 0.00, N = 3)
  -O3:    0.89     (SE +/- 0.01, N = 3)
  -Ofast: 0.88     (SE +/- 0.03, N = 12)
  Reported per-result flags: -O1, -Ofast
  1. (CXX) g++ options: -O3 -pthread
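
Each result above carries a standard error (SE) and a run count (N), which gives a rough way to judge whether a gap between two configurations stands out from run-to-run noise. The Python sketch below divides the difference of two means by their combined standard error. It is a heuristic only, not a formal significance test, particularly with N as low as 3; the function name `difference_in_se_units` is an illustrative assumption and not something the Phoronix Test Suite provides.

```python
import math

# Illustrative heuristic (not part of the Phoronix Test Suite): how many
# combined standard errors separate two reported means.

def difference_in_se_units(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Return (mean_a - mean_b) divided by sqrt(se_a^2 + se_b^2)."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined_se == 0.0:
        raise ValueError("both standard errors are zero; ratio is undefined")
    return (mean_a - mean_b) / combined_se

if __name__ == "__main__":
    # Coremark, -O3 vs -O1: a large gap relative to the reported noise.
    print(difference_in_se_units(865041.04, 3792.56, 672167.94, 1727.87))
    # ASTC Encoder Thorough, -O1 vs -O3: well within the reported noise.
    print(difference_in_se_units(12.53, 0.03, 12.51, 0.03))
```

Values far from zero suggest the configurations really differ on that test, while values near or below about 1 in magnitude suggest the difference is comparable to the measurement noise.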
Phoronix Test Suite v10.8.4