AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.
SMT Enabled - Default
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.3.0-999-generic (x86_64) 20190725, Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, OpenGL: 4.5 Mesa 19.0.2 (LLVM 8.0.0), Compiler: GCC 7.4.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 2.7.15+ + Python 3.6.8
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling
SMT Disabled
Changed Processor to AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores).
Processor Change: Scaling Governor: acpi-cpufreq ondemand
Security Change: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
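The security strings for the two Ryzen 9 3900X configurations are long and differ in only one place. As a minimal sketch (the helper names `parse_security_notes` and `changed_keys` are hypothetical and not part of the Phoronix Test Suite), the two strings reported above can be split into key/value pairs and compared:

```python
def parse_security_notes(notes):
    """Split a 'key: value + key: value' security string into a dict."""
    result = {}
    for item in notes.split(" + "):
        # Only the first ': ' separates key from value; later ones
        # (e.g. 'IBPB: conditional') stay inside the value.
        key, _, value = item.partition(": ")
        result[key.strip()] = value.strip()
    return result


def changed_keys(a, b):
    """Return the sorted keys whose values differ between two dicts."""
    return sorted(k for k in a.keys() | b.keys() if a.get(k) != b.get(k))


COMMON = (
    "l1tf: Not affected + mds: Not affected + meltdown: Not affected + "
    "spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + "
    "spectre_v1: Mitigation of __user pointer sanitization + "
)
SMT_ENABLED = COMMON + (
    "spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional "
    "STIBP: always-on RSB filling"
)
SMT_DISABLED = COMMON + (
    "spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional "
    "STIBP: disabled RSB filling"
)

enabled = parse_security_notes(SMT_ENABLED)
disabled = parse_security_notes(SMT_DISABLED)

for key in changed_keys(enabled, disabled):
    print(f"{key}:")
    print(f"  SMT Enabled - Default: {enabled[key]}")
    print(f"  SMT Disabled:          {disabled[key]}")
```

Only the `spectre_v2` entry differs: STIBP is always-on with SMT enabled and disabled with SMT off.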
2700 - 4 GHz
Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 3.7.4
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
2700 - 4 GHz II
Processor: AMD Ryzen 7 2700 Eight-Core @ 4.00GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG STRIX B350-F GAMING (5008 BIOS), Chipset: AMD 17h, Memory: 16384MB, Disk: 250GB Western Digital WDS250G2X0C-00L350 + 2000GB Seagate ST2000DM006-2DM1 + 240GB Corsair Force GS + 500GB Western Digital WD5000BEKT-0 + 1000GB Seagate ST1000LM024 HN-M, Graphics: AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1266/2000MHz), Audio: AMD Ellesmere HDMI Audio, Monitor: LG ULTRAWIDE, Network: Intel I211 + Qualcomm Atheros AR93xx
OS: Arch Linux, Kernel: 5.2.5-arch1-1-ryzen (x86_64), Desktop: GNOME Shell 3.32.2, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, OpenGL: 4.5 Mesa 19.1.3 (LLVM 8.0.1), Vulkan: 1.1.90, Compiler: GCC 9.1.0 + Clang 8.0.1, File-System: xfs, Screen Resolution: 2560x1080
2700 - 4 GHz Arch kernel
OS: Arch Linux, Kernel: 5.2.5-arch1-1-ARCH (x86_64), Desktop: GNOME Shell 3.32.2, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, OpenGL: 4.5 Mesa 19.1.3 (LLVM 8.0.1), Vulkan: 1.1.90, Compiler: GCC 9.1.0 + Clang 8.0.1, File-System: xfs, Screen Resolution: 2560x1080
Ryzen 9 3900X Linux SMT Performance: per-configuration details
Compiler Details: SMT Enabled - Default and SMT Disabled share the configure options listed under SMT Enabled - Default above; 2700 - 4 GHz, 2700 - 4 GHz II and 2700 - 4 GHz Arch kernel share the configure options listed under 2700 - 4 GHz above.
Processor Details: Scaling Governor is acpi-cpufreq ondemand for SMT Disabled and acpi-cpufreq schedutil for the other four configurations.
Python Details: Python 2.7.15+ + Python 3.6.8 for SMT Enabled - Default and SMT Disabled; Python 3.7.4 for 2700 - 4 GHz, 2700 - 4 GHz II and 2700 - 4 GHz Arch kernel.
Security Details: identical across all five configurations except the spectre_v2 STIBP setting, which is always-on for SMT Enabled - Default and disabled for the other four.
[Figure: Logarithmic Result Overview (Phoronix Test Suite) comparing SMT Enabled - Default, SMT Disabled, 2700 - 4 GHz, 2700 - 4 GHz II and 2700 - 4 GHz Arch kernel on NAS Parallel Benchmarks BT.A, LU.A, FT.B, FT.A and EP.C]
Ryzen 9 3900X Linux SMT Performance
Columns: SMT Enabled - Default | SMT Disabled | 2700 - 4 GHz | 2700 - 4 GHz II | 2700 - 4 GHz Arch kernel ("-" marks a result that was not reported)
mkl-dnn: Convolution Batch conv_all - f32 | 2108 | 2011 | - | 43446 | 43401
gromacs: Water Benchmark | 0.99 | 0.92 | - | 0.20 | 0.20
mkl-dnn: Deconvolution Batch deconv_all - f32 | 5994 | 4559 | - | 40758 | 40949
blender: Barbershop - CPU-Only | 710 | 1027 | - | 1160 | 1160
appleseed: Emily | 272 | 370 | - | 424 | 426
npb: LU.C | 21265 | 21548 | - | 5309 | 5321
asmfish: 1024 Hash Memory, 26 Depth | 39682621 | 28280592 | - | 24094759 | 24266366
cp2k: Fayalite-FIST Data | 323 | 447 | - | 528 | 534
appleseed: Disney Material | 168 | 198 | - | 233 | 236
appleseed: Material Tester | 164 | 217 | - | 224 | 223
mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 | 113 | 109 | - | 2329 | 2330
parboil: OpenMP LBM | 151.13 | 74.27 | - | 140.11 | 142.90
npb: BT.A | 6417 | 6440 | 1121 | 1116 | 1121
build-linux-kernel: Time To Compile | 48.63 | 58.80 | - | 104.58 | 105.90
namd: ATPase Simulation - 327,506 Atoms | 1.44394 | 1.72944 | - | 2.24837 | 2.24891
npb: SP.A | 4294 | 4399 | - | 525 | 524
stockfish: Total Time | 39137992 | 28765888 | - | 23248448 | 22925851
graphics-magick: Noise-Gaussian | 169 | 145 | - | 172 | 169
nero2d: Total Time | 72.34 | 67.01 | - | - | -
v-ray: CPU | 20558 | 15709 | - | - | -
parboil: OpenMP MRI Gridding | 28.84 | 18.48 | - | 100.58 | 100.56
indigobench: Bedroom | 2.03 | 1.38 | - | 1.24 | 1.24
indigobench: Supercar | 4.35 | 3.06 | - | 2.64 | 2.62
graphics-magick: Enhanced | 197 | 203 | - | 164 | 164
graphics-magick: Sharpen | 165 | 173 | - | 144 | 144
graphics-magick: Swirl | 231 | 233 | - | 195 | 194
graphics-magick: Rotate | 263 | 269 | - | 241 | 241
graphics-magick: HWB Color Space | 285 | 283 | - | 249 | 248
graphics-magick: Resizing | 266 | 272 | - | 226 | 227
c-ray: Total Time - 4K, 16 Rays Per Pixel | 53.23 | 56.14 | - | 64.34 | 64.36
parboil: OpenMP Stencil | 15.12 | 28.89 | - | 25.62 | 15.74
mkl-dnn: IP Batch All - f32 | 210.10 | 89.54 | - | 920.89 | 916.77
himeno: Poisson Pressure Solver | 1387 | 1369 | - | 1256 | 1255
rodinia: OpenMP LavaMD | 52.26 | 62.20 | - | 30.30 | 30.21
npb: FT.B | 6193 | 6219 | 4042 | 4457 | 4484
mkl-dnn: Convolution Batch conv_3d - f32 | 18.70 | 17.56 | - | 206.21 | 204.87
npb: EP.C | 477 | 481 | 370 | 384 | 394
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 | 25.60 | 37.18 | - | 43.00 | 43.06
compress-7zip: Compress Speed Test | 77922 | 59108 | - | 37997 | 37588
rust-prime: Prime Number Test To 200,000,000 | 30.87 | 40.48 | - | - | -
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 644 | 483 | - | 427 | 427
coremark: CoreMark Size 666 - Iterations Per Second | 532588 | 376805 | - | 331952 | 330608
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 | 18.05 | 21.50 | - | 31.07 | 30.94
mkl-dnn: Convolution Batch conv_alexnet - f32 | 258 | 248 | - | 5398 | 5405
npb: LU.A | 50909 | 52829 | 4191 | 5614 | 5527
swet: Average | 852324587 | 860433765 | - | 672310326 | 669827962
mkl-dnn: Deconvolution Batch deconv_1d - f32 | 26.52 | 20.67 | - | 239.24 | 239.04
rodinia: OpenMP CFD Solver | 13.91 | 15.50 | - | 27.64 | 27.62
primesieve: 1e12 Prime Number Generation | 15.43 | 16.72 | - | 24.08 | 24.10
rodinia: OpenMP Streamcluster | 21.20 | 16.02 | - | 20.72 | 20.71
x265: H.265 1080p Video Encoding | 40.66 | 48.90 | - | 31.06 | 31.48
mkl-dnn: IP Batch 1D - f32 | 17.25 | 7.11 | - | 74.59 | 74.69
smallpt: Global Illumination Renderer; 128 Samples | 8.33 | 10.62 | - | 11.63 | 11.63
ffmpeg: H.264 HD To NTSC DV | 8.45 | 4.02 | - | 12.55 | 12.69
svt-av1: 1080p 8-bit YUV To AV1 Video Encode | 41.35 | 32.24 | - | 26.08 | 26.01
svt-hevc: 1080p 8-bit YUV To HEVC Video Encode | 251 | 201 | - | 153 | 153
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 3.79 | 3.27 | - | 5.02 | 5.23
mkl-dnn: Deconvolution Batch deconv_3d - f32 | 5.05 | 4.89 | - | 83.14 | 82.61
parboil: OpenMP CUTCP | 2.22 | 3.22 | - | 3.15 | 3.14
npb: FT.A | 5711 | 5702 | 4187 | 4216 | 4193
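Because some tests report times (fewer is better) and others report throughput (more is better), the raw numbers in the overview cannot be compared directly. The sketch below is one way to normalize a few of the SMT Enabled / SMT Disabled pairs into a "times faster with SMT" factor. The selection of four tests, the helper names, and the use of a geometric mean are illustrative choices, not something the result page itself computes; the better-direction flags are taken from the per-test charts.

```python
from math import prod

# (test, higher_is_better, SMT Enabled - Default, SMT Disabled)
# Values copied from the result overview; only four tests are sampled.
RESULTS = [
    ("blender: Barbershop - CPU-Only (seconds)", False, 710.0, 1027.0),
    ("stockfish: Total Time (nodes/s)", True, 39137992.0, 28765888.0),
    ("parboil: OpenMP LBM (seconds)", False, 151.13, 74.27),
    ("compress-7zip: Compress Speed Test (MIPS)", True, 77922.0, 59108.0),
]


def smt_speedup(higher_is_better, enabled, disabled):
    """Factor by which SMT-enabled beats SMT-disabled (> 1 favors SMT)."""
    if higher_is_better:
        return enabled / disabled
    return disabled / enabled


def geometric_mean(values):
    """Geometric mean, the usual way to average ratios."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


speedups = {
    name: smt_speedup(hib, on, off) for name, hib, on, off in RESULTS
}

for name, factor in speedups.items():
    print(f"{name}: {factor:.2f}x")
print(f"geometric mean: {geometric_mean(speedups.values()):.2f}x")
```

A factor below 1 (as for Parboil OpenMP LBM in this sample) means the run with SMT disabled was faster.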
MKL-DNN This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_all - Data Type: f32 (ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 43401 (SE +/- 4.07, N = 3)
2700 - 4 GHz II: 43446 (SE +/- 90.43, N = 3)
SMT Disabled: 2011 (SE +/- 0.71, N = 3)
SMT Enabled - Default: 2108 (SE +/- 0.80, N = 3)
MIN (as reported, not for every configuration): 43166.5, 43120.2
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
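Each result carries an "SE +/- x, N = n" annotation: a standard error over n runs. The sketch below assumes the common definition (sample standard deviation divided by the square root of N); the result page does not state the exact formula the Phoronix Test Suite uses, and the three run times are hypothetical, since the individual runs are not published here.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run times in ms; NOT taken from the result page.
runs_ms = [2107.0, 2108.0, 2109.5]

mean_ms = statistics.mean(runs_ms)
se_ms = standard_error(runs_ms)
print(f"{mean_ms:.0f} ms, SE +/- {se_ms:.2f}, N = {len(runs_ms)}")
```

A small SE relative to the mean indicates the runs agreed closely; results listed with N = 15 rather than N = 3 were run more times.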
GROMACS The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
GROMACS 2018.3, Water Benchmark (Ns Per Day, More Is Better)
2700 - 4 GHz Arch kernel: 0.20 (SE +/- 0.00, N = 3)
2700 - 4 GHz II: 0.20 (SE +/- 0.00, N = 3)
SMT Disabled: 0.92 (SE +/- 0.00, N = 3)
SMT Enabled - Default: 0.99 (SE +/- 0.00, N = 3)
Extra flags as reported: -march=core-avx2 -march=core-avx2
1. (CXX) g++ options: -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_all - Data Type: f32 (ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 40949 (SE +/- 43.57, N = 3, MIN: 40623.5)
2700 - 4 GHz II: 40758 (SE +/- 22.12, N = 3, MIN: 40477.6)
SMT Disabled: 4559 (SE +/- 1.43, N = 3, MIN: 4450.23)
SMT Enabled - Default: 5994 (SE +/- 22.04, N = 3, MIN: 5634.28)
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Blender Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.
Blender 2.79a, Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 1160
2700 - 4 GHz II: 1160
SMT Disabled: 1027
SMT Enabled - Default: 710
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.C (Total Mop/s, More Is Better)
2700 - 4 GHz Arch kernel: 5321 (SE +/- 22.51, N = 3)
2700 - 4 GHz II: 5309 (SE +/- 31.87, N = 3)
SMT Disabled: 21548 (SE +/- 18.22, N = 3)
SMT Enabled - Default: 21265 (SE +/- 16.81, N = 3)
1. Open MPI 2.1.1
CP2K Molecular Dynamics CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the OpenMP implementation with the Fayalite-FIST molecular dynamics run and measures the total time to complete. Learn more via the OpenBenchmarking.org test page.
CP2K Molecular Dynamics 6.1, Fayalite-FIST Data (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 534
2700 - 4 GHz II: 528
SMT Disabled: 447
SMT Enabled - Default: 323
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 (ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 2330 (SE +/- 0.55, N = 3)
2700 - 4 GHz II: 2329 (SE +/- 0.25, N = 3)
SMT Disabled: 109 (SE +/- 0.16, N = 3)
SMT Enabled - Default: 113 (SE +/- 0.21, N = 3)
MIN (as reported, not for every configuration): 2310.75, 2309.92
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Parboil The Parboil Benchmarks from the IMPACT Research Group at the University of Illinois are a set of throughput computing applications for studying computing architecture and compilers. Parboil test cases support OpenMP, OpenCL, and CUDA multi-processing environments. At this time, however, the test profile uses only the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
Parboil 2.5, Test: OpenMP LBM (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 142.90 (SE +/- 0.12, N = 3)
2700 - 4 GHz II: 140.11 (SE +/- 0.07, N = 3)
SMT Disabled: 74.27 (SE +/- 0.53, N = 3)
SMT Enabled - Default: 151.13 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
NAS Parallel Benchmarks 3.3.1, Test / Class: BT.A (Total Mop/s, More Is Better)
2700 - 4 GHz: 1121 (SE +/- 3.41, N = 3)
2700 - 4 GHz Arch kernel: 1121 (SE +/- 4.25, N = 3)
2700 - 4 GHz II: 1116 (SE +/- 4.37, N = 3)
SMT Disabled: 6440 (SE +/- 15.50, N = 3)
SMT Enabled - Default: 6417 (SE +/- 8.30, N = 3)
1. Open MPI 2.1.1
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
NAMD 2.13b1, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
2700 - 4 GHz Arch kernel: 2.24891 (SE +/- 0.00461, N = 3)
2700 - 4 GHz II: 2.24837 (SE +/- 0.00052, N = 3)
SMT Disabled: 1.72944 (SE +/- 0.00287, N = 3)
SMT Enabled - Default: 1.44394 (SE +/- 0.00061, N = 3)
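NAMD reports days per simulated nanosecond (fewer is better), whereas GROMACS above reports nanoseconds per day (more is better). The two units are reciprocals, so a short conversion makes the NAMD figures easier to read as throughput. This is only a unit conversion of the reported numbers; the function name `ns_per_day` is an illustrative choice, and the NAMD and GROMACS workloads remain different simulations that should not be compared with each other.

```python
# days/ns values as reported for the NAMD ATPase simulation.
NAMD_DAYS_PER_NS = {
    "2700 - 4 GHz Arch kernel": 2.24891,
    "2700 - 4 GHz II": 2.24837,
    "SMT Disabled": 1.72944,
    "SMT Enabled - Default": 1.44394,
}


def ns_per_day(days_per_ns):
    """Convert days per simulated nanosecond into nanoseconds per day."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


throughput = {name: ns_per_day(v) for name, v in NAMD_DAYS_PER_NS.items()}

for name, value in throughput.items():
    print(f"{name}: {value:.3f} ns/day")
```

After conversion, the ordering flips from "smallest wins" to "largest wins", with SMT Enabled - Default still on top.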
NAS Parallel Benchmarks 3.3.1, Test / Class: SP.A (Total Mop/s, More Is Better)
2700 - 4 GHz Arch kernel: 524 (SE +/- 2.22, N = 3)
2700 - 4 GHz II: 525 (SE +/- 0.64, N = 3)
SMT Disabled: 4399 (SE +/- 9.96, N = 3)
SMT Enabled - Default: 4294 (SE +/- 2.82, N = 3)
1. Open MPI 2.1.1
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
Stockfish 9, Total Time (Nodes Per Second, More Is Better)
2700 - 4 GHz Arch kernel: 22925851 (SE +/- 232342.01, N = 3)
2700 - 4 GHz II: 23248448 (SE +/- 244456.97, N = 3)
SMT Disabled: 28765888 (SE +/- 316141.73, N = 3)
SMT Enabled - Default: 39137992 (SE +/- 320057.07, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
GraphicsMagick 1.3.30, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 169
2700 - 4 GHz II: 172
SMT Disabled: 145
SMT Enabled - Default: 169
SE (as reported, not for every configuration): +/- 2.13, N = 5; +/- 0.88, N = 3; +/- 0.33, N = 3
Extra flags as reported: -llcms2 -ljasper -lxml2 -llcms2 -ljasper -lxml2 -ljbig -ljbig
1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
Open FMM Nero2D This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples. Learn more via the OpenBenchmarking.org test page.
Open FMM Nero2D 2.0.2, Total Time (Seconds, Fewer Is Better)
SMT Disabled: 67.01 (SE +/- 0.29, N = 3)
SMT Enabled - Default: 72.34 (SE +/- 0.29, N = 3)
1. (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi
Parboil 2.5, Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 100.56 (SE +/- 0.04, N = 3)
2700 - 4 GHz II: 100.58 (SE +/- 0.18, N = 3)
SMT Disabled: 18.48 (SE +/- 0.02, N = 3)
SMT Enabled - Default: 28.84 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
IndigoBench 4.0.64, Scene: Supercar (M samples/s, More Is Better)
2700 - 4 GHz Arch kernel: 2.62 (SE +/- 0.01, N = 3)
2700 - 4 GHz II: 2.64 (SE +/- 0.00, N = 3)
SMT Disabled: 3.06 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 4.35 (SE +/- 0.00, N = 3)
GraphicsMagick 1.3.30, Operation: Enhanced (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 164
2700 - 4 GHz II: 164
SMT Disabled: 203
SMT Enabled - Default: 197
SE (as reported, not for every configuration): +/- 0.33, N = 3
GraphicsMagick 1.3.30, Operation: Sharpen (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 144
2700 - 4 GHz II: 144
SMT Disabled: 173
SMT Enabled - Default: 165
SE (as reported, not for every configuration): +/- 0.33, N = 3
GraphicsMagick 1.3.30, Operation: Swirl (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 194
2700 - 4 GHz II: 195
SMT Disabled: 233
SMT Enabled - Default: 231
SE (as reported, not for every configuration): +/- 0.67, N = 3; +/- 1.53, N = 3; +/- 0.67, N = 3
GraphicsMagick 1.3.30, Operation: Rotate (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 241
2700 - 4 GHz II: 241
SMT Disabled: 269
SMT Enabled - Default: 263
SE (as reported, not for every configuration): +/- 1.00, N = 3; +/- 0.88, N = 3
GraphicsMagick 1.3.30, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 248
2700 - 4 GHz II: 249
SMT Disabled: 283
SMT Enabled - Default: 285
SE (as reported, not for every configuration): +/- 0.33, N = 3; +/- 1.20, N = 3; +/- 0.88, N = 3
GraphicsMagick 1.3.30, Operation: Resizing (Iterations Per Minute, More Is Better)
2700 - 4 GHz Arch kernel: 227
2700 - 4 GHz II: 226
SMT Disabled: 272
SMT Enabled - Default: 266
SE (as reported, not for every configuration): +/- 0.67, N = 3; +/- 1.20, N = 3; +/- 1.20, N = 3
For all six GraphicsMagick operations above:
Extra flags as reported: -llcms2 -ljasper -lxml2 -llcms2 -ljasper -lxml2 -ljbig -ljbig
1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 64.36 (SE +/- 0.03, N = 3)
2700 - 4 GHz II: 64.34 (SE +/- 0.04, N = 3)
SMT Disabled: 56.14 (SE +/- 0.04, N = 3)
SMT Enabled - Default: 53.23 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -lm -lpthread -O3
Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 15.74 (SE +/- 0.05, N = 3)
2700 - 4 GHz II: 25.62 (SE +/- 0.30, N = 15)
SMT Disabled: 28.89 (SE +/- 0.32, N = 3)
SMT Enabled - Default: 15.12 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
MKL-DNN 2019-04-16, Harness: IP Batch All - Data Type: f32 (ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 916.77 (SE +/- 0.23, N = 3)
2700 - 4 GHz II: 920.89 (SE +/- 2.97, N = 3)
SMT Disabled: 89.54 (SE +/- 0.14, N = 3)
SMT Enabled - Default: 210.10 (SE +/- 1.44, N = 3)
MIN (as reported, not for every configuration): 910.61, 911.49, 126.74
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 2.4, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 30.21 (SE +/- 0.01, N = 3)
2700 - 4 GHz II: 30.30 (SE +/- 0.14, N = 3)
SMT Disabled: 62.20 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 52.26 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.B (Total Mop/s, More Is Better)
2700 - 4 GHz: 4042 (SE +/- 53.21, N = 15)
2700 - 4 GHz Arch kernel: 4484 (SE +/- 8.00, N = 3)
2700 - 4 GHz II: 4457 (SE +/- 19.91, N = 3)
SMT Disabled: 6219 (SE +/- 15.63, N = 3)
SMT Enabled - Default: 6193 (SE +/- 7.75, N = 3)
1. Open MPI 2.1.1
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_3d - Data Type: f32 (ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 204.87 (SE +/- 1.24, N = 3)
2700 - 4 GHz II: 206.21 (SE +/- 1.69, N = 3)
SMT Disabled: 17.56 (SE +/- 0.09, N = 3)
SMT Enabled - Default: 18.70 (SE +/- 0.00, N = 3)
MIN (as reported, not for every configuration): 200.8, 200.63
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
NAS Parallel Benchmarks 3.3.1, Test / Class: EP.C (Total Mop/s, More Is Better)
2700 - 4 GHz: 370 (SE +/- 2.73, N = 3)
2700 - 4 GHz Arch kernel: 394 (SE +/- 5.42, N = 3)
2700 - 4 GHz II: 384 (SE +/- 5.46, N = 15)
SMT Disabled: 481 (SE +/- 0.52, N = 3)
SMT Enabled - Default: 477 (SE +/- 0.04, N = 3)
1. Open MPI 2.1.1
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
XZ Compression 5.2.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 43.06 (SE +/- 0.02, N = 3)
2700 - 4 GHz II: 43.00 (SE +/- 0.06, N = 3)
SMT Disabled: 37.18 (SE +/- 0.09, N = 3)
SMT Enabled - Default: 25.60 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -pthread -fvisibility=hidden -O2
7-Zip Compression This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.
7-Zip Compression 16.02, Compress Speed Test (OpenBenchmarking.org; MIPS, More Is Better)
2700 - 4 GHz Arch kernel: 37588 (SE +/- 49.84, N = 3)
2700 - 4 GHz II: 37997 (SE +/- 284.83, N = 3)
SMT Disabled: 59108 (SE +/- 393.22, N = 3)
SMT Enabled - Default: 77922 (SE +/- 323.94, N = 3)
1. (CXX) g++ options: -pipe -lpthread
Rust Prime Benchmark Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang. Learn more via the OpenBenchmarking.org test page.
Rust Prime Benchmark, Prime Number Test To 200,000,000 (OpenBenchmarking.org; Seconds, Fewer Is Better)
SMT Disabled: 40.48 (SE +/- 0.08, N = 3)
SMT Enabled - Default: 30.87 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping (OpenBenchmarking.org; FPS, More Is Better)
2700 - 4 GHz Arch kernel: 427 (SE +/- 0.31, N = 3)
2700 - 4 GHz II: 427 (SE +/- 0.38, N = 3)
SMT Disabled: 483 (SE +/- 1.29, N = 3)
SMT Enabled - Default: 644 (SE +/- 1.06, N = 3)
Additional options: -lpthread -lpthread
1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
Coremark 1.0, CoreMark Size 666 - Iterations Per Second (OpenBenchmarking.org; Iterations/Sec, More Is Better)
2700 - 4 GHz Arch kernel: 330608 (SE +/- 1406.14, N = 3)
2700 - 4 GHz II: 331952 (SE +/- 507.36, N = 3)
SMT Disabled: 376805 (SE +/- 731.59, N = 3)
SMT Enabled - Default: 532588 (SE +/- 2501.14, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 30.94 (SE +/- 0.14, N = 3)
2700 - 4 GHz II: 31.07 (SE +/- 0.04, N = 3)
SMT Disabled: 21.50 (SE +/- 0.02, N = 3)
SMT Enabled - Default: 18.05 (SE +/- 0.08, N = 3)
Additional options: -llzma -llz4 -llzma -llz4
1. (CC) gcc options: -O3 -pthread -lz
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_alexnet - Data Type: f32 (OpenBenchmarking.org; ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 5405 (SE +/- 1.39, N = 3, MIN: 5392.21)
2700 - 4 GHz II: 5398 (SE +/- 0.80, N = 3, MIN: 5387.93)
SMT Disabled: 248 (SE +/- 0.21, N = 3)
SMT Enabled - Default: 258 (SE +/- 0.49, N = 3)
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.A (OpenBenchmarking.org; Total Mop/s, More Is Better)
2700 - 4 GHz: 4191 (SE +/- 47.81, N = 3)
2700 - 4 GHz Arch kernel: 5527 (SE +/- 55.24, N = 9)
2700 - 4 GHz II: 5614 (SE +/- 6.33, N = 3)
SMT Disabled: 52829 (SE +/- 457.69, N = 3)
SMT Enabled - Default: 50909 (SE +/- 371.45, N = 3)
1. Open MPI 2.1.1
Swet Swet is a synthetic CPU/RAM benchmark, includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.
Swet 1.5.16, Average (OpenBenchmarking.org; Operations Per Second, More Is Better)
2700 - 4 GHz Arch kernel: 669827962 (SE +/- 2498094.73, N = 3)
2700 - 4 GHz II: 672310326 (SE +/- 7363504.33, N = 3)
SMT Disabled: 860433765 (SE +/- 10717369.00, N = 3)
SMT Enabled - Default: 852324587 (SE +/- 11310134.98, N = 4)
1. (CC) gcc options: -lm -lpthread -lcurses -lrt
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_1d - Data Type: f32 (OpenBenchmarking.org; ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 239.04 (SE +/- 0.37, N = 3, MIN: 237.71)
2700 - 4 GHz II: 239.24 (SE +/- 0.24, N = 3, MIN: 237.84)
SMT Disabled: 20.67 (SE +/- 0.15, N = 3)
SMT Enabled - Default: 26.52 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 2.4, Test: OpenMP CFD Solver (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 27.62 (SE +/- 0.03, N = 3)
2700 - 4 GHz II: 27.64 (SE +/- 0.16, N = 3)
SMT Disabled: 15.50 (SE +/- 0.07, N = 3)
SMT Enabled - Default: 13.91 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
Primesieve 7.4, 1e12 Prime Number Generation (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 24.10 (SE +/- 0.04, N = 3)
2700 - 4 GHz II: 24.08 (SE +/- 0.04, N = 3)
SMT Disabled: 16.72 (SE +/- 0.02, N = 3)
SMT Enabled - Default: 15.43 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -lpthread
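For readers unfamiliar with the sieve of Eratosthenes mentioned in the Primesieve description, a minimal textbook version is sketched below. It is not Primesieve's implementation, which is a highly optimized C++ program; the function name `primes_up_to` is illustrative only.

```python
def primes_up_to(limit):
    """Return all primes <= limit using the plain sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag per integer 0..limit; 1 means "still considered prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross out multiples of p, starting at p*p (smaller ones are already done).
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytearray(count)
        p += 1
    return [n for n, flag in enumerate(is_prime) if flag]


print(primes_up_to(50))
# → [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47]
```

The flag array is scanned and overwritten with a regular stride, which is why sieve performance tends to depend heavily on cache behavior, as the description notes.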
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 2.4, Test: OpenMP Streamcluster (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 20.71 (SE +/- 0.00, N = 3)
2700 - 4 GHz II: 20.72 (SE +/- 0.02, N = 3)
SMT Disabled: 16.02 (SE +/- 0.02, N = 3)
SMT Enabled - Default: 21.20 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
x265 3.0, H.265 1080p Video Encoding (OpenBenchmarking.org; Frames Per Second, More Is Better)
2700 - 4 GHz Arch kernel: 31.48 (SE +/- 0.22, N = 3)
2700 - 4 GHz II: 31.06 (SE +/- 0.44, N = 3)
SMT Disabled: 48.90 (SE +/- 0.19, N = 3)
SMT Enabled - Default: 40.66 (SE +/- 0.40, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: IP Batch 1D - Data Type: f32 (OpenBenchmarking.org; ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 74.69 (SE +/- 0.12, N = 3, MIN: 74.07)
2700 - 4 GHz II: 74.59 (SE +/- 0.05, N = 3, MIN: 73.94)
SMT Disabled: 7.11 (SE +/- 0.02, N = 3)
SMT Enabled - Default: 17.25 (SE +/- 0.29, N = 3, MIN: 11.06)
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
Smallpt 1.0, Global Illumination Renderer; 128 Samples (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 11.63 (SE +/- 0.01, N = 3)
2700 - 4 GHz II: 11.63 (SE +/- 0.00, N = 3)
SMT Disabled: 10.62 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 8.33 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fopenmp -O3
FFmpeg This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.
FFmpeg 4.0.2, H.264 HD To NTSC DV (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 12.69 (SE +/- 0.09, N = 3)
2700 - 4 GHz II: 12.55 (SE +/- 0.08, N = 3)
SMT Disabled: 4.02 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 8.45 (SE +/- 0.06, N = 3)
Additional options: -lxcb-shm -lva -lva-drm -lva-x11 -lvdpau -lxcb-shm -lva -lva-drm -lva-x11 -lvdpau -lsndio -lsndio
1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -pthread -lSDL2 -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-AV1 0.5, 1080p 8-bit YUV To AV1 Video Encode (OpenBenchmarking.org; Frames Per Second, More Is Better)
2700 - 4 GHz Arch kernel: 26.01 (SE +/- 0.28, N = 3)
2700 - 4 GHz II: 26.08 (SE +/- 0.12, N = 3)
SMT Disabled: 32.24 (SE +/- 0.02, N = 3)
SMT Enabled - Default: 41.35 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -pie -lpthread -lm
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-HEVC 2019-02-03, 1080p 8-bit YUV To HEVC Video Encode (OpenBenchmarking.org; Frames Per Second, More Is Better)
2700 - 4 GHz Arch kernel: 153 (SE +/- 1.62, N = 8)
2700 - 4 GHz II: 153 (SE +/- 1.53, N = 8)
SMT Disabled: 201 (SE +/- 1.35, N = 3)
SMT Enabled - Default: 251 (SE +/- 1.83, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 5.23 (SE +/- 0.01, N = 3)
2700 - 4 GHz II: 5.02 (SE +/- 0.01, N = 3)
SMT Disabled: 3.27 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 3.79 (SE +/- 0.00, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_3d - Data Type: f32 (OpenBenchmarking.org; ms, Fewer Is Better)
2700 - 4 GHz Arch kernel: 82.61 (SE +/- 0.05, N = 3, MIN: 81.75)
2700 - 4 GHz II: 83.14 (SE +/- 0.00, N = 3, MIN: 82.22)
SMT Disabled: 4.89 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 5.05 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
Parboil 2.5, Test: OpenMP CUTCP (OpenBenchmarking.org; Seconds, Fewer Is Better)
2700 - 4 GHz Arch kernel: 3.14 (SE +/- 0.05, N = 3)
2700 - 4 GHz II: 3.15 (SE +/- 0.04, N = 3)
SMT Disabled: 3.22 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 2.22 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.A (OpenBenchmarking.org; Total Mop/s, More Is Better)
2700 - 4 GHz: 4187 (SE +/- 9.80, N = 3)
2700 - 4 GHz Arch kernel: 4193 (SE +/- 8.39, N = 3)
2700 - 4 GHz II: 4216 (SE +/- 3.81, N = 3)
SMT Disabled: 5702 (SE +/- 4.89, N = 3)
SMT Enabled - Default: 5711 (SE +/- 3.40, N = 3)
1. Open MPI 2.1.1
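Because some tests report "More Is Better" and others "Fewer Is Better", comparing SMT Disabled against SMT Enabled is easier with a single speedup ratio. The sketch below does that for a handful of the results above, assuming the values are listed in the same order as the configuration labels; `smt_speedup` is an illustrative helper, not part of any benchmarking tool. Note that the system notes show the two 3900X runs also used different scaling governors (schedutil vs. ondemand), so these ratios may not reflect SMT alone.

```python
# (test, SMT Disabled, SMT Enabled, higher_is_better), copied from the results above.
RESULTS = [
    ("XZ Compression (s)", 37.18, 25.60, False),
    ("7-Zip Compression (MIPS)", 59108, 77922, True),
    ("x265 (FPS)", 48.90, 40.66, True),
    ("FFmpeg (s)", 4.02, 8.45, False),
    ("Rodinia Streamcluster (s)", 16.02, 21.20, False),
]


def smt_speedup(disabled, enabled, higher_is_better):
    """Return > 1.0 when SMT Enabled is faster and < 1.0 when it is slower."""
    if higher_is_better:
        return enabled / disabled
    return disabled / enabled


for name, off, on, higher in RESULTS:
    print(f"{name:28s} SMT speedup: {smt_speedup(off, on, higher):.2f}x")
```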
SMT Enabled - Default Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.3.0-999-generic (x86_64) 20190725, Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, OpenGL: 4.5 Mesa 19.0.2 (LLVM 8.0.0), Compiler: GCC 7.4.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 2.7.15+ + Python 3.6.8
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling
Testing initiated at 31 July 2019 09:29 by user phoronix.
SMT Disabled Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.3.0-999-generic (x86_64) 20190725, Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, OpenGL: 4.5 Mesa 19.0.2 (LLVM 8.0.0), Compiler: GCC 7.4.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq ondemand
Python Notes: Python 2.7.15+ + Python 3.6.8
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
Testing initiated at 31 July 2019 13:23 by user phoronix.
2700 - 4 GHz Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 3.7.4
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
Testing initiated at 2 August 2019 20:57 by user marcio.
2700 - 4 GHz II Processor: AMD Ryzen 7 2700 Eight-Core @ 4.00GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG STRIX B350-F GAMING (5008 BIOS), Chipset: AMD 17h, Memory: 16384MB, Disk: 250GB Western Digital WDS250G2X0C-00L350 + 2000GB Seagate ST2000DM006-2DM1 + 240GB Corsair Force GS + 500GB Western Digital WD5000BEKT-0 + 1000GB Seagate ST1000LM024 HN-M, Graphics: AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1266/2000MHz), Audio: AMD Ellesmere HDMI Audio, Monitor: LG ULTRAWIDE, Network: Intel I211 + Qualcomm Atheros AR93xx
OS: Arch Linux, Kernel: 5.2.5-arch1-1-ryzen (x86_64), Desktop: GNOME Shell 3.32.2, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, OpenGL: 4.5 Mesa 19.1.3 (LLVM 8.0.1), Vulkan: 1.1.90, Compiler: GCC 9.1.0 + Clang 8.0.1, File-System: xfs, Screen Resolution: 2560x1080
Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 3.7.4
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
Testing initiated at 3 August 2019 02:57 by user marcio.
2700 - 4 GHz Arch kernel Processor: AMD Ryzen 7 2700 Eight-Core @ 4.00GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG STRIX B350-F GAMING (5008 BIOS), Chipset: AMD 17h, Memory: 16384MB, Disk: 250GB Western Digital WDS250G2X0C-00L350 + 2000GB Seagate ST2000DM006-2DM1 + 240GB Corsair Force GS + 500GB Western Digital WD5000BEKT-0 + 1000GB Seagate ST1000LM024 HN-M, Graphics: AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1266/2000MHz), Audio: AMD Ellesmere HDMI Audio, Monitor: LG ULTRAWIDE, Network: Intel I211 + Qualcomm Atheros AR93xx
OS: Arch Linux, Kernel: 5.2.5-arch1-1-ARCH (x86_64), Desktop: GNOME Shell 3.32.2, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, OpenGL: 4.5 Mesa 19.1.3 (LLVM 8.0.1), Vulkan: 1.1.90, Compiler: GCC 9.1.0 + Clang 8.0.1, File-System: xfs, Screen Resolution: 2560x1080
Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 3.7.4
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
Testing initiated at 4 August 2019 03:13 by user marcio.