CPU comparison AMD Ryzen Threadripper 1920X 12-Core testing with a ASRock X399M Taichi (P1.00 BIOS) and AMD Radeon RX 56/64 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012040-HA-CPUCOMPAR48&grr .
CPU comparison Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution TR1920X AMD Ryzen Threadripper 1920X 12-Core @ 3.50GHz (12 Cores / 24 Threads) ASRock X399M Taichi (P1.00 BIOS) AMD 17h 32GB 256GB THNSN5256GPUK TOSHIBA + 512GB SAMSUNG MZVKW512HMJP-000L7 AMD Radeon RX 56/64 8GB Realtek ALC1220 Optix MAG24C 2 x Intel I211 + Intel Dual Band-AC 3168NGW Ubuntu 20.04 5.4.0-56-generic (x86_64) Xfce 4.14 X Server 1.20.8 amdgpu 19.1.0 4.6 Mesa 20.0.8 (LLVM 10.0.0) 1.2.128 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8001129 - Python 3.8.5 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
CPU comparison core-latency: Average Latency Between CPU Cores npb: EP.D blender: BMW27 - CPU-Only parboil: OpenMP MRI Gridding kvazaar: Bosphorus 4K - Medium npb: BT.C build-ffmpeg: Time To Compile parboil: OpenMP LBM build-linux-kernel: Time To Compile parboil: OpenMP Stencil pmbench: 8 - 80% Reads 20% Writes pmbench: 24 - 80% Reads 20% Writes pmbench: 1 - 80% Reads 20% Writes pmbench: 24 - 50% pmbench: 8 - 50% pmbench: 1 - 50% pmbench: 1 - 100% Writes pmbench: 24 - 100% Writes pmbench: 8 - 100% Writes pmbench: 24 - 100% Reads pmbench: 8 - 100% Reads pmbench: 1 - 100% Reads npb: LU.C c-ray: Total Time - 4K, 16 Rays Per Pixel lzbench: XZ 0 - Decompression lzbench: XZ 0 - Compression compress-7zip: Compress Speed Test stream: Copy aobench: 2048 x 2048 - Total Time vpxenc: Speed 5 aom-av1: Speed 6 Two-Pass npb: SP.B npb: FT.C kvazaar: Bosphorus 1080p - Medium build-apache: Time To Compile lzbench: Brotli 2 - Decompression lzbench: Brotli 2 - Compression lzbench: Libdeflate 1 - Decompression lzbench: Libdeflate 1 - Compression botan: AES-256 lzbench: Crush 0 - Decompression lzbench: Crush 0 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 1 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 0 - Compression aom-av1: Speed 8 Realtime ipc-benchmark: FIFO Named Pipe - 1024 lzbench: Zstd 8 - Decompression lzbench: Zstd 8 - Compression botan: Blowfish npb: EP.C smallpt: Global Illumination Renderer; 128 Samples npb: MG.C ffmpeg: H.264 HD To NTSC DV ipc-benchmark: TCP Socket - 1024 ipc-benchmark: TCP Socket - 128 ipc-benchmark: Unnamed Pipe - 1024 ipc-benchmark: FIFO Named Pipe - 128 ipc-benchmark: Unnamed Pipe - 128 parboil: OpenMP CUTCP stream: Add stream: Triad stream: Scale TR1920X 382.28 701.36 138.10 137.447454 5.09 30423.90 57.024 81.600878 79.170 15.769507 0.1228 0.1509 0.1072 0.1450 0.1032 0.0860 0.0474 0.1225 0.0676 0.0468 0.0439 0.0393 37706.56 50.261 109 37 58222 33068.2 36.516 17.49 2.96 11196.39 13645.70 19.94 27.549 649 193 1166 236 5478.295 458 86 1305 494 556 470 28.22 2068182 1445 93 438.421 708.60 8.648 21997.24 7.896 1896825 2763312 2183690 2351114 2606605 2.925903 36465.7 36520.6 33721.5 OpenBenchmarking.org
Core-Latency Average Latency Between CPU Cores OpenBenchmarking.org ns, Fewer Is Better Core-Latency Average Latency Between CPU Cores TR1920X 80 160 240 320 400 382.28 MIN: 39.57 / MAX: 571.28 1. (CXX) g++ options: -std=c++11 -pthread -O3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D TR1920X 150 300 450 600 750 SE +/- 3.01, N = 3 701.36 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: CPU-Only TR1920X 30 60 90 120 150 SE +/- 0.84, N = 3 138.10
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding TR1920X 30 60 90 120 150 SE +/- 0.21, N = 3 137.45 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium TR1920X 1.1453 2.2906 3.4359 4.5812 5.7265 SE +/- 0.03, N = 3 5.09 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C TR1920X 7K 14K 21K 28K 35K SE +/- 116.41, N = 3 30423.90 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile TR1920X 13 26 39 52 65 SE +/- 0.71, N = 5 57.02
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM TR1920X 20 40 60 80 100 SE +/- 0.11, N = 3 81.60 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.4 Time To Compile TR1920X 20 40 60 80 100 SE +/- 0.91, N = 3 79.17
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil TR1920X 4 8 12 16 20 SE +/- 0.43, N = 12 15.77 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 80% Reads 20% Writes OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 80% Reads 20% Writes TR1920X 0.0276 0.0552 0.0828 0.1104 0.138 SE +/- 0.0003, N = 3 0.1228 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 80% Reads 20% Writes OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 80% Reads 20% Writes TR1920X 0.034 0.068 0.102 0.136 0.17 SE +/- 0.0013, N = 3 0.1509 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 80% Reads 20% Writes OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 80% Reads 20% Writes TR1920X 0.0241 0.0482 0.0723 0.0964 0.1205 SE +/- 0.0004, N = 3 0.1072 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 50% OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 50% TR1920X 0.0326 0.0652 0.0978 0.1304 0.163 SE +/- 0.0012, N = 3 0.1450 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 50% OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 50% TR1920X 0.0232 0.0464 0.0696 0.0928 0.116 SE +/- 0.0000, N = 3 0.1032 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 50% OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 50% TR1920X 0.0194 0.0388 0.0582 0.0776 0.097 SE +/- 0.0001, N = 3 0.0860 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 100% Writes OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 100% Writes TR1920X 0.0107 0.0214 0.0321 0.0428 0.0535 SE +/- 0.0001, N = 3 0.0474 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 100% Writes OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 100% Writes TR1920X 0.0276 0.0552 0.0828 0.1104 0.138 SE +/- 0.0020, N = 3 0.1225 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 100% Writes OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 100% Writes TR1920X 0.0152 0.0304 0.0456 0.0608 0.076 SE +/- 0.0000, N = 3 0.0676 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 100% Reads OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 24 - Read-Write Ratio: 100% Reads TR1920X 0.0105 0.021 0.0315 0.042 0.0525 SE +/- 0.0005, N = 3 0.0468 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 100% Reads OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 8 - Read-Write Ratio: 100% Reads TR1920X 0.0099 0.0198 0.0297 0.0396 0.0495 SE +/- 0.0004, N = 3 0.0439 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 100% Reads OpenBenchmarking.org us - Average Page Latency, Fewer Is Better pmbench Concurrent Worker Threads: 1 - Read-Write Ratio: 100% Reads TR1920X 0.0088 0.0176 0.0264 0.0352 0.044 SE +/- 0.0003, N = 3 0.0393 1. (CC) gcc options: -lm -luuid -lxml2 -m64 -pthread
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C TR1920X 8K 16K 24K 32K 40K SE +/- 75.11, N = 3 37706.56 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel TR1920X 11 22 33 44 55 SE +/- 0.14, N = 3 50.26 1. (CC) gcc options: -lm -lpthread -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression TR1920X 20 40 60 80 100 109 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression TR1920X 9 18 27 36 45 37 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test TR1920X 12K 24K 36K 48K 60K SE +/- 281.88, N = 3 58222 1. (CXX) g++ options: -pipe -lpthread
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy TR1920X 7K 14K 21K 28K 35K SE +/- 6.41, N = 5 33068.2 1. (CC) gcc options: -O3 -march=native -fopenmp
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time TR1920X 8 16 24 32 40 SE +/- 0.10, N = 3 36.52 1. (CC) gcc options: -lm -O3
VP9 libvpx Encoding Speed: Speed 5 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 TR1920X 4 8 12 16 20 SE +/- 0.08, N = 3 17.49 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Two-Pass TR1920X 0.666 1.332 1.998 2.664 3.33 SE +/- 0.01, N = 3 2.96 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B TR1920X 2K 4K 6K 8K 10K SE +/- 104.68, N = 3 11196.39 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C TR1920X 3K 6K 9K 12K 15K SE +/- 81.43, N = 3 13645.70 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium TR1920X 5 10 15 20 25 SE +/- 0.08, N = 3 19.94 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile TR1920X 6 12 18 24 30 SE +/- 0.11, N = 3 27.55
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression TR1920X 140 280 420 560 700 SE +/- 1.33, N = 3 649 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression TR1920X 40 80 120 160 200 SE +/- 0.58, N = 3 193 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Decompression TR1920X 300 600 900 1200 1500 SE +/- 1.53, N = 3 1166 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression TR1920X 50 100 150 200 250 SE +/- 1.73, N = 3 236 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: AES-256 TR1920X 1200 2400 3600 4800 6000 SE +/- 11.65, N = 3 5478.30 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression TR1920X 100 200 300 400 500 SE +/- 1.00, N = 3 458 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression TR1920X 20 40 60 80 100 86 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression TR1920X 300 600 900 1200 1500 SE +/- 2.52, N = 3 1305 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression TR1920X 110 220 330 440 550 494 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression TR1920X 120 240 360 480 600 SE +/- 0.88, N = 3 556 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression TR1920X 100 200 300 400 500 470 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime TR1920X 7 14 21 28 35 SE +/- 0.19, N = 3 28.22 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 1024 TR1920X 400K 800K 1200K 1600K 2000K SE +/- 16762.60, N = 15 2068182
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression TR1920X 300 600 900 1200 1500 SE +/- 4.48, N = 3 1445 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression TR1920X 20 40 60 80 100 SE +/- 0.33, N = 3 93 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Blowfish TR1920X 90 180 270 360 450 SE +/- 0.54, N = 3 438.42 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C TR1920X 150 300 450 600 750 SE +/- 0.23, N = 3 708.60 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples TR1920X 2 4 6 8 10 SE +/- 0.044, N = 3 8.648 1. (CXX) g++ options: -fopenmp -O3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C TR1920X 5K 10K 15K 20K 25K SE +/- 228.43, N = 3 21997.24 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 4.0.2 H.264 HD To NTSC DV TR1920X 2 4 6 8 10 SE +/- 0.054, N = 3 7.896 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
IPC_benchmark Type: TCP Socket - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: TCP Socket - Message Bytes: 1024 TR1920X 400K 800K 1200K 1600K 2000K SE +/- 5701.88, N = 3 1896825
IPC_benchmark Type: TCP Socket - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: TCP Socket - Message Bytes: 128 TR1920X 600K 1200K 1800K 2400K 3000K SE +/- 11614.85, N = 3 2763312
IPC_benchmark Type: Unnamed Pipe - Message Bytes: 1024 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: Unnamed Pipe - Message Bytes: 1024 TR1920X 500K 1000K 1500K 2000K 2500K SE +/- 30370.57, N = 3 2183690
IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: FIFO Named Pipe - Message Bytes: 128 TR1920X 500K 1000K 1500K 2000K 2500K SE +/- 18263.92, N = 3 2351114
IPC_benchmark Type: Unnamed Pipe - Message Bytes: 128 OpenBenchmarking.org Messages Per Second, More Is Better IPC_benchmark Type: Unnamed Pipe - Message Bytes: 128 TR1920X 600K 1200K 1800K 2400K 3000K SE +/- 19949.18, N = 3 2606605
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP TR1920X 0.6583 1.3166 1.9749 2.6332 3.2915 SE +/- 0.020765, N = 3 2.925903 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add TR1920X 8K 16K 24K 32K 40K SE +/- 7.03, N = 5 36465.7 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad TR1920X 8K 16K 24K 32K 40K SE +/- 6.15, N = 5 36520.6 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale TR1920X 7K 14K 21K 28K 35K SE +/- 22.71, N = 5 33721.5 1. (CC) gcc options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.5