5950X comp AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101155-HA-5950XCOMP80&grs .
5950X comp Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution -O1 -O3 -Ofast AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz) AMD Navi 10 HDMI Audio ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.0-051100rc2daily20210108-generic (x86_64) 20210107 GNOME Shell 3.38.1 X Server 1.20.9 amdgpu 19.1.0 4.6 Mesa 21.0.0-devel (git-f01bca8 2021-01-08 groovy-oibaf-ppa) (LLVM 11.0.1) 1.2.164 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org Environment Details - -O1: CXXFLAGS=-O1 CFLAGS=-O1 - -O3: CXXFLAGS=-O3 CFLAGS=-O3 - -Ofast: CXXFLAGS=-Ofast CFLAGS=-Ofast Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201009 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
5950X comp coremark: CoreMark Size 666 - Iterations Per Second lczero: Eigen lczero: BLAS encode-opus: WAV To Opus Encode mnn: mobilenet-v1-1.0 tnn: CPU - SqueezeNet v1.1 compress-lz4: 3 - Compression Speed openfoam: Motorbike 30M mnn: resnet-v2-50 tnn: CPU - MobileNet v2 mnn: MobileNetV2_224 compress-lz4: 9 - Compression Speed simdjson: LargeRand mnn: SqueezeNetV1.0 encode-wavpack: WAV To WavPack compress-lz4: 9 - Decompression Speed mnn: inception-v3 amg: astcenc: Thorough astcenc: Exhaustive compress-lz4: 3 - Decompression Speed simdjson: DistinctUserID simdjson: PartialTweets simdjson: Kostya -O1 -O3 -Ofast 672167.939285 355 349 6.947 2.516 205.857 72.28 96.97 24.004 220.154 3.345 70.68 0.60 5.176 10.184 12852.4 30.753 211605433 12.53 99.81 12834.8 0.90 0.91 0.69 865041.041461 410 405 6.025 2.447 211.790 71.11 99.22 23.685 215.335 3.307 69.43 0.61 5.104 10.272 12827.4 30.757 211971700 12.51 99.75 12828.5 0.89 0.87 0.65 868266.143661 424 411 2.410 210.221 70.28 96.68 23.406 217.849 3.277 69.42 0.60 5.106 10.143 12794.5 30.845 211981075 12.53 99.70 12828.1 0.88 0.86 0.66 OpenBenchmarking.org
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second -O1 -O3 -Ofast 200K 400K 600K 800K 1000K SE +/- 1727.87, N = 3 SE +/- 3792.56, N = 3 SE +/- 2256.07, N = 3 672167.94 865041.04 868266.14 -O1 -O3 -Ofast 1. (CC) gcc options: -O2 -lrt" -lrt
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen -O1 -O3 -Ofast 90 180 270 360 450 SE +/- 4.36, N = 3 SE +/- 2.03, N = 3 SE +/- 5.13, N = 3 355 410 424 -O1 -O3 -Ofast 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS -O1 -O3 -Ofast 90 180 270 360 450 SE +/- 3.06, N = 3 SE +/- 4.10, N = 6 SE +/- 2.00, N = 3 349 405 411 -O1 -O3 -Ofast 1. (CXX) g++ options: -flto -pthread
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode -O1 -O3 2 4 6 8 10 SE +/- 0.037, N = 5 SE +/- 0.050, N = 5 6.947 6.025 -O1 -O3 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 -O1 -O3 -Ofast 0.5661 1.1322 1.6983 2.2644 2.8305 SE +/- 0.040, N = 3 SE +/- 0.025, N = 15 SE +/- 0.062, N = 5 2.516 2.447 2.410 -O1 - MIN: 2.42 / MAX: 3.08 MIN: 2.24 / MAX: 11.79 -Ofast - MIN: 2.26 / MAX: 4.06 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 -O1 -O3 -Ofast 50 100 150 200 250 SE +/- 1.36, N = 3 SE +/- 0.73, N = 3 SE +/- 0.13, N = 3 205.86 211.79 210.22 -O1 - MIN: 203.7 / MAX: 208.59 MIN: 210.23 / MAX: 212.89 -Ofast - MIN: 209.89 / MAX: 210.66 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed -O1 -O3 -Ofast 16 32 48 64 80 SE +/- 0.55, N = 10 SE +/- 0.84, N = 4 SE +/- 0.71, N = 5 72.28 71.11 70.28 1. (CC) gcc options: -O3
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M -O1 -O3 -Ofast 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.50, N = 3 SE +/- 0.15, N = 3 96.97 99.22 96.68 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 -O1 -O3 -Ofast 6 12 18 24 30 SE +/- 0.00, N = 3 SE +/- 0.16, N = 15 SE +/- 0.38, N = 5 24.00 23.69 23.41 -O1 - MIN: 22.91 / MAX: 32.98 MIN: 21.64 / MAX: 51.01 -Ofast - MIN: 21.71 / MAX: 31.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 -O1 -O3 -Ofast 50 100 150 200 250 SE +/- 0.81, N = 3 SE +/- 1.91, N = 3 SE +/- 0.41, N = 3 220.15 215.34 217.85 -O1 - MIN: 215.64 / MAX: 269.36 MIN: 209.09 / MAX: 276.74 -Ofast - MIN: 214.17 / MAX: 258.58 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 -O1 -O3 -Ofast 0.7526 1.5052 2.2578 3.0104 3.763 SE +/- 0.037, N = 3 SE +/- 0.025, N = 15 SE +/- 0.061, N = 5 3.345 3.307 3.277 -O1 - MIN: 3.2 / MAX: 8.98 MIN: 3.07 / MAX: 12.37 -Ofast - MIN: 3.05 / MAX: 3.93 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed -O1 -O3 -Ofast 16 32 48 64 80 SE +/- 0.97, N = 3 SE +/- 0.63, N = 3 SE +/- 0.76, N = 5 70.68 69.43 69.42 1. (CC) gcc options: -O3
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom -O1 -O3 -Ofast 0.1373 0.2746 0.4119 0.5492 0.6865 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.60 0.61 0.60 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 -O1 -O3 -Ofast 1.1646 2.3292 3.4938 4.6584 5.823 SE +/- 0.075, N = 3 SE +/- 0.043, N = 15 SE +/- 0.053, N = 5 5.176 5.104 5.106 -O1 - MIN: 4.93 / MAX: 8.12 MIN: 4.6 / MAX: 13.78 -Ofast - MIN: 4.8 / MAX: 6.57 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack -O1 -O3 -Ofast 3 6 9 12 15 SE +/- 0.04, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 10.18 10.27 10.14 -O1 -O3 -Ofast 1. (CXX) g++ options: -rdynamic
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed -O1 -O3 -Ofast 3K 6K 9K 12K 15K SE +/- 21.65, N = 3 SE +/- 43.19, N = 3 SE +/- 30.70, N = 5 12852.4 12827.4 12794.5 1. (CC) gcc options: -O3
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 -O1 -O3 -Ofast 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.13, N = 15 SE +/- 0.28, N = 5 30.75 30.76 30.85 -O1 - MIN: 29.72 / MAX: 38.23 MIN: 29.69 / MAX: 40.5 -Ofast - MIN: 29.65 / MAX: 40.5 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 -O1 -O3 -Ofast 50M 100M 150M 200M 250M SE +/- 2531189.55, N = 3 SE +/- 2500260.63, N = 3 SE +/- 2395455.32, N = 4 211605433 211971700 211981075 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough -O1 -O3 -Ofast 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 12.53 12.51 12.53 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive -O1 -O3 -Ofast 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 SE +/- 0.14, N = 3 99.81 99.75 99.70 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed -O1 -O3 -Ofast 3K 6K 9K 12K 15K SE +/- 9.67, N = 10 SE +/- 14.38, N = 4 SE +/- 15.70, N = 5 12834.8 12828.5 12828.1 1. (CC) gcc options: -O3
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID -O1 -O3 -Ofast 0.2025 0.405 0.6075 0.81 1.0125 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 12 0.90 0.89 0.88 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets -O1 -O3 -Ofast 0.2048 0.4096 0.6144 0.8192 1.024 SE +/- 0.01, N = 15 SE +/- 0.01, N = 13 SE +/- 0.00, N = 3 0.91 0.87 0.86 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya -O1 -O3 -Ofast 0.1553 0.3106 0.4659 0.6212 0.7765 SE +/- 0.03, N = 12 SE +/- 0.03, N = 12 SE +/- 0.02, N = 12 0.69 0.65 0.66 -O1 -Ofast 1. (CXX) g++ options: -O3 -pthread
Phoronix Test Suite v10.8.4