5950X comp AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101155-HA-5950XCOMP80&grr .
5950X comp Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution -O1 -O3 -Ofast AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz) AMD Navi 10 HDMI Audio ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.0-051100rc2daily20210108-generic (x86_64) 20210107 GNOME Shell 3.38.1 X Server 1.20.9 amdgpu 19.1.0 4.6 Mesa 21.0.0-devel (git-f01bca8 2021-01-08 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.164 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org Environment Details - -O1: CXXFLAGS=-O1 CFLAGS=-O1 - -O3: CXXFLAGS=-O3 CFLAGS=-O3 - -Ofast: CXXFLAGS=-Ofast CFLAGS=-Ofast Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201009 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
5950X comp lczero: BLAS lczero: Eigen simdjson: Kostya mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 simdjson: PartialTweets openfoam: Motorbike 30M astcenc: Exhaustive compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed simdjson: DistinctUserID compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed simdjson: LargeRand amg: coremark: CoreMark Size 666 - Iterations Per Second encode-wavpack: WAV To WavPack astcenc: Thorough tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 encode-opus: WAV To Opus Encode -O1 -O3 -Ofast 349 355 0.69 30.753 2.516 3.345 24.004 5.176 0.91 96.97 99.81 12834.8 72.28 0.90 12852.4 70.68 0.60 211605433 672167.939285 10.184 12.53 220.154 205.857 6.947 405 410 0.65 30.757 2.447 3.307 23.685 5.104 0.87 99.22 99.75 12828.5 71.11 0.89 12827.4 69.43 0.61 211971700 865041.041461 10.272 12.51 215.335 211.790 6.025 411 424 0.66 30.845 2.410 3.277 23.406 5.106 0.86 96.68 99.70 12828.1 70.28 0.88 12794.5 69.42 0.60 211981075 868266.143661 10.143 12.53 217.849 210.221 OpenBenchmarking.org
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS -O1 -O3 -Ofast 90 180 270 360 450 SE +/- 3.06, N = 3 SE +/- 4.10, N = 6 SE +/- 2.00, N = 3 349 405 411 -O1 -O3 -Ofast 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen -O1 -O3 -Ofast 90 180 270 360 450 SE +/- 4.36, N = 3 SE +/- 2.03, N = 3 SE +/- 5.13, N = 3 355 410 424 -O1 -O3 -Ofast 1. (CXX) g++ options: -flto -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya -O1 -O3 -Ofast 0.1553 0.3106 0.4659 0.6212 0.7765 SE +/- 0.03, N = 12 SE +/- 0.03, N = 12 SE +/- 0.02, N = 12 0.69 0.65 0.66 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 -O1 -O3 -Ofast 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.13, N = 15 SE +/- 0.28, N = 5 30.75 30.76 30.85 -O1 - MIN: 29.72 / MAX: 38.23 MIN: 29.69 / MAX: 40.5 -Ofast - MIN: 29.65 / MAX: 40.5 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 -O1 -O3 -Ofast 0.5661 1.1322 1.6983 2.2644 2.8305 SE +/- 0.040, N = 3 SE +/- 0.025, N = 15 SE +/- 0.062, N = 5 2.516 2.447 2.410 -O1 - MIN: 2.42 / MAX: 3.08 MIN: 2.24 / MAX: 11.79 -Ofast - MIN: 2.26 / MAX: 4.06 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 -O1 -O3 -Ofast 0.7526 1.5052 2.2578 3.0104 3.763 SE +/- 0.037, N = 3 SE +/- 0.025, N = 15 SE +/- 0.061, N = 5 3.345 3.307 3.277 -O1 - MIN: 3.2 / MAX: 8.98 MIN: 3.07 / MAX: 12.37 -Ofast - MIN: 3.05 / MAX: 3.93 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 -O1 -O3 -Ofast 6 12 18 24 30 SE +/- 0.00, N = 3 SE +/- 0.16, N = 15 SE +/- 0.38, N = 5 24.00 23.69 23.41 -O1 - MIN: 22.91 / MAX: 32.98 MIN: 21.64 / MAX: 51.01 -Ofast - MIN: 21.71 / MAX: 31.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 -O1 -O3 -Ofast 1.1646 2.3292 3.4938 4.6584 5.823 SE +/- 0.075, N = 3 SE +/- 0.043, N = 15 SE +/- 0.053, N = 5 5.176 5.104 5.106 -O1 - MIN: 4.93 / MAX: 8.12 MIN: 4.6 / MAX: 13.78 -Ofast - MIN: 4.8 / MAX: 6.57 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets -O1 -O3 -Ofast 0.2048 0.4096 0.6144 0.8192 1.024 SE +/- 0.01, N = 15 SE +/- 0.01, N = 13 SE +/- 0.00, N = 3 0.91 0.87 0.86 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M -O1 -O3 -Ofast 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.50, N = 3 SE +/- 0.15, N = 3 96.97 99.22 96.68 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive -O1 -O3 -Ofast 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 SE +/- 0.14, N = 3 99.81 99.75 99.70 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed -O1 -O3 -Ofast 3K 6K 9K 12K 15K SE +/- 9.67, N = 10 SE +/- 14.38, N = 4 SE +/- 15.70, N = 5 12834.8 12828.5 12828.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed -O1 -O3 -Ofast 16 32 48 64 80 SE +/- 0.55, N = 10 SE +/- 0.84, N = 4 SE +/- 0.71, N = 5 72.28 71.11 70.28 1. (CC) gcc options: -O3
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID -O1 -O3 -Ofast 0.2025 0.405 0.6075 0.81 1.0125 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 12 0.90 0.89 0.88 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed -O1 -O3 -Ofast 3K 6K 9K 12K 15K SE +/- 21.65, N = 3 SE +/- 43.19, N = 3 SE +/- 30.70, N = 5 12852.4 12827.4 12794.5 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed -O1 -O3 -Ofast 16 32 48 64 80 SE +/- 0.97, N = 3 SE +/- 0.63, N = 3 SE +/- 0.76, N = 5 70.68 69.43 69.42 1. (CC) gcc options: -O3
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom -O1 -O3 -Ofast 0.1373 0.2746 0.4119 0.5492 0.6865 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.60 0.61 0.60 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 -O1 -O3 -Ofast 50M 100M 150M 200M 250M SE +/- 2531189.55, N = 3 SE +/- 2500260.63, N = 3 SE +/- 2395455.32, N = 4 211605433 211971700 211981075 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second -O1 -O3 -Ofast 200K 400K 600K 800K 1000K SE +/- 1727.87, N = 3 SE +/- 3792.56, N = 3 SE +/- 2256.07, N = 3 672167.94 865041.04 868266.14 -O1 -O3 -Ofast 1. (CC) gcc options: -O2 -lrt" -lrt
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack -O1 -O3 -Ofast 3 6 9 12 15 SE +/- 0.04, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 10.18 10.27 10.14 -O1 -O3 -Ofast 1. (CXX) g++ options: -rdynamic
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough -O1 -O3 -Ofast 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 12.53 12.51 12.53 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 -O1 -O3 -Ofast 50 100 150 200 250 SE +/- 0.81, N = 3 SE +/- 1.91, N = 3 SE +/- 0.41, N = 3 220.15 215.34 217.85 -O1 - MIN: 215.64 / MAX: 269.36 MIN: 209.09 / MAX: 276.74 -Ofast - MIN: 214.17 / MAX: 258.58 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 -O1 -O3 -Ofast 50 100 150 200 250 SE +/- 1.36, N = 3 SE +/- 0.73, N = 3 SE +/- 0.13, N = 3 205.86 211.79 210.22 -O1 - MIN: 203.7 / MAX: 208.59 MIN: 210.23 / MAX: 212.89 -Ofast - MIN: 209.89 / MAX: 210.66 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode -O1 -O3 2 4 6 8 10 SE +/- 0.037, N = 5 SE +/- 0.050, N = 5 6.947 6.025 -O1 -O3 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
Phoronix Test Suite v10.8.4