AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.
SMT Enabled - Default
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.3.0-999-generic (x86_64) 20190725, Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, OpenGL: 4.5 Mesa 19.0.2 (LLVM 8.0.0), Compiler: GCC 7.4.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 2.7.15+ + Python 3.6.8
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling
SMT Disabled
Changed Processor to AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores).
Processor Change: Scaling Governor: acpi-cpufreq ondemand
Security Change: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
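The notes above record the SMT state, scaling governor, and Spectre mitigation status for each run. On a Linux system these values can usually be read back from sysfs. The following is a minimal Python sketch, assuming the standard kernel sysfs paths are present (they may be missing on older kernels or other platforms). The helper names `read_sysfs` and `describe_cpu_state` are illustrative and not part of the Phoronix Test Suite. The source does not state how SMT was disabled for the second run, so the sketch only inspects state and changes nothing.

```python
from pathlib import Path

# Standard Linux sysfs location for CPU information (assumed to exist).
SYSFS_CPU = Path("/sys/devices/system/cpu")


def read_sysfs(path):
    """Return the stripped contents of a sysfs file, or None if unreadable."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return None


def describe_cpu_state(root=SYSFS_CPU):
    """Collect the settings that differ between the two benchmark runs."""
    root = Path(root)
    return {
        # "on", "off", "forceoff", "notsupported" or "notimplemented"
        "smt_control": read_sysfs(root / "smt" / "control"),
        # "1" if sibling threads are online, "0" otherwise
        "smt_active": read_sysfs(root / "smt" / "active"),
        # e.g. "schedutil" or "ondemand" (read for CPU 0 only)
        "scaling_governor": read_sysfs(root / "cpu0" / "cpufreq" / "scaling_governor"),
        # the line that carries the STIBP status in the Security Notes
        "spectre_v2": read_sysfs(root / "vulnerabilities" / "spectre_v2"),
    }


if __name__ == "__main__":
    for key, value in describe_cpu_state().items():
        print(f"{key}: {value if value is not None else 'unavailable'}")
```

Missing files are reported as unavailable rather than raising, so the script can be run on either configuration and the two outputs compared.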
SMT Enabled - Default vs. SMT Disabled Comparison (Phoronix Test Suite chart: per-test relative difference between the two configurations, plotted against a baseline)
Ryzen 9 3900X Linux SMT Performance
Test | SMT Enabled - Default | SMT Disabled
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 | 25.60 | 37.18
himeno: Poisson Pressure Solver | 1386.87 | 1369.03
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 3.79 | 3.27
gromacs: Water Benchmark | 0.99 | 0.92
parboil: OpenMP LBM | 151.13 | 74.27
parboil: OpenMP CUTCP | 2.22 | 3.22
parboil: OpenMP Stencil | 15.12 | 28.89
parboil: OpenMP MRI Gridding | 28.84 | 18.48
mkl-dnn: Deconvolution Batch deconv_3d - f32 | 5.05 | 4.89
npb: BT.A | 6417.48 | 6440.11
npb: EP.C | 477.09 | 480.69
npb: FT.A | 5711.26 | 5701.85
npb: FT.B | 6193.02 | 6218.67
npb: LU.A | 50909.30 | 52828.71
npb: LU.C | 21264.92 | 21548.38
mkl-dnn: Convolution Batch conv_3d - f32 | 18.70 | 17.56
npb: SP.A | 4293.52 | 4399.16
mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 | 112.71 | 108.94
rodinia: OpenMP LavaMD | 52.26 | 62.20
mkl-dnn: Convolution Batch conv_alexnet - f32 | 257.95 | 248.32
mkl-dnn: Deconvolution Batch deconv_all - f32 | 5993.69 | 4559.32
mkl-dnn: Deconvolution Batch deconv_1d - f32 | 26.52 | 20.67
rodinia: OpenMP CFD Solver | 13.91 | 15.50
mkl-dnn: Convolution Batch conv_all - f32 | 2107.82 | 2011.48
rodinia: OpenMP Streamcluster | 21.20 | 16.02
mkl-dnn: IP Batch All - f32 | 210.10 | 89.54
mkl-dnn: IP Batch 1D - f32 | 17.25 | 7.11
namd: ATPase Simulation - 327,506 Atoms | 1.44394 | 1.72944
cp2k: Fayalite-FIST Data | 323.08 | 446.70
coremark: CoreMark Size 666 - Iterations Per Second | 532587.52 | 376805.08
primesieve: 1e12 Prime Number Generation | 15.43 | 16.72
rust-prime: Prime Number Test To 200,000,000 | 30.87 | 40.48
stockfish: Total Time | 39137992 | 28765888
compress-7zip: Compress Speed Test | 77922 | 59108
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 | 18.05 | 21.50
nero2d: Total Time | 72.34 | 67.01
asmfish: 1024 Hash Memory, 26 Depth | 39682621 | 28280592
build-linux-kernel: Time To Compile | 48.63 | 58.80
swet: Average | 852324587 | 860433765
graphics-magick: Swirl | 231 | 233
graphics-magick: Rotate | 263 | 269
graphics-magick: Sharpen | 165 | 173
graphics-magick: Enhanced | 197 | 203
graphics-magick: Resizing | 266 | 272
graphics-magick: Noise-Gaussian | 169 | 145
graphics-magick: HWB Color Space | 285 | 283
svt-av1: 1080p 8-bit YUV To AV1 Video Encode | 41.35 | 32.24
x265: H.265 1080p Video Encoding | 40.66 | 48.90
c-ray: Total Time - 4K, 16 Rays Per Pixel | 53.23 | 56.14
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 643.90 | 483.15
v-ray: CPU | 20558 | 15709
svt-hevc: 1080p 8-bit YUV To HEVC Video Encode | 251.39 | 200.78
blender: Barbershop - CPU-Only | 710.12 | 1027.18
ffmpeg: H.264 HD To NTSC DV | 8.45 | 4.02
smallpt: Global Illumination Renderer; 128 Samples | 8.33 | 10.62
indigobench: Bedroom | 2.03 | 1.38
indigobench: Supercar | 4.35 | 3.06
appleseed: Emily | 271.85 | 370.45
appleseed: Disney Material | 167.95 | 198.39
appleseed: Material Tester | 163.74 | 216.56
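Relative differences between the two runs can be derived directly from the results listed above. The following is a minimal Python sketch, assuming the difference is expressed as how far the better result leads the worse one (ratio minus one), with the direction depending on whether the metric is "More Is Better" or "Fewer Is Better". The function name `advantage_percent` is illustrative and not part of the Phoronix Test Suite.

```python
def advantage_percent(smt_on, smt_off, higher_is_better):
    """Return (winning configuration, lead of the winner in percent)."""
    if higher_is_better:
        on_wins = smt_on >= smt_off
    else:
        on_wins = smt_on <= smt_off
    # For positive results the lead is always larger / smaller - 1.
    ratio = max(smt_on, smt_off) / min(smt_on, smt_off)
    winner = "SMT Enabled - Default" if on_wins else "SMT Disabled"
    return winner, round((ratio - 1.0) * 100.0, 1)


# (test, SMT enabled, SMT disabled, higher_is_better) taken from the results
results = [
    ("parboil: OpenMP LBM", 151.13, 74.27, False),      # Seconds
    ("parboil: OpenMP Stencil", 15.12, 28.89, False),   # Seconds
    ("stockfish: Total Time", 39137992, 28765888, True),  # Nodes Per Second
    ("gromacs: Water Benchmark", 0.99, 0.92, True),     # Ns Per Day
]

for name, on, off, higher in results:
    winner, pct = advantage_percent(on, off, higher)
    print(f"{name}: {winner} ahead by {pct}%")
```

For these four rows the sketch gives 103.5% (SMT Disabled), 91.1%, 36.1%, and 7.6% (SMT Enabled - Default).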
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.
CloverLeaf, Lagrangian-Eulerian Hydrodynamics. Seconds, Fewer Is Better. SMT Enabled - Default: 3.79 (SE +/- 0.00, N = 3); SMT Disabled: 3.27 (SE +/- 0.01, N = 3). (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
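Each result is reported with a standard error over N = 3 runs. The following Python sketch assumes the conventional definition (sample standard deviation divided by the square root of N). The individual run times are not part of this result file, so the three values below are hypothetical.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical run times in seconds; the real per-run values are not published here.
runs = [3.26, 3.27, 3.28]
print(f"{statistics.mean(runs):.2f} (SE +/- {standard_error(runs):.2f}, N = {len(runs)})")
```

A small standard error relative to the gap between the two configurations suggests the difference is not run-to-run noise.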
GROMACS This test runs the GROMACS molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
GROMACS 2018.3, Water Benchmark. Ns Per Day, More Is Better. SMT Disabled: 0.92 (SE +/- 0.00, N = 3); SMT Enabled - Default: 0.99 (SE +/- 0.00, N = 3). (CXX) g++ options: -march=core-avx2 -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm
Parboil The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.
Parboil 2.5, Test: OpenMP LBM. Seconds, Fewer Is Better. SMT Enabled - Default: 151.13 (SE +/- 0.04, N = 3); SMT Disabled: 74.27 (SE +/- 0.53, N = 3). (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP CUTCP. Seconds, Fewer Is Better. SMT Disabled: 3.22 (SE +/- 0.01, N = 3); SMT Enabled - Default: 2.22 (SE +/- 0.00, N = 3). (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP Stencil. Seconds, Fewer Is Better. SMT Disabled: 28.89 (SE +/- 0.32, N = 3); SMT Enabled - Default: 15.12 (SE +/- 0.02, N = 3). (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP MRI Gridding. Seconds, Fewer Is Better. SMT Enabled - Default: 28.84 (SE +/- 0.01, N = 3); SMT Disabled: 18.48 (SE +/- 0.02, N = 3). (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
MKL-DNN This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_3d - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 5.05 (SE +/- 0.01, N = 3, MIN: 4.97); SMT Disabled: 4.89 (SE +/- 0.01, N = 3, MIN: 4.79). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.3.1, Test / Class: BT.A. Total Mop/s, More Is Better. SMT Enabled - Default: 6417.48 (SE +/- 8.30, N = 3); SMT Disabled: 6440.11 (SE +/- 15.50, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
NAS Parallel Benchmarks 3.3.1, Test / Class: EP.C. Total Mop/s, More Is Better. SMT Enabled - Default: 477.09 (SE +/- 0.04, N = 3); SMT Disabled: 480.69 (SE +/- 0.52, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.A. Total Mop/s, More Is Better. SMT Disabled: 5701.85 (SE +/- 4.89, N = 3); SMT Enabled - Default: 5711.26 (SE +/- 3.40, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.B. Total Mop/s, More Is Better. SMT Enabled - Default: 6193.02 (SE +/- 7.75, N = 3); SMT Disabled: 6218.67 (SE +/- 15.63, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.A. Total Mop/s, More Is Better. SMT Enabled - Default: 50909.30 (SE +/- 371.45, N = 3); SMT Disabled: 52828.71 (SE +/- 457.69, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.C. Total Mop/s, More Is Better. SMT Enabled - Default: 21264.92 (SE +/- 16.81, N = 3); SMT Disabled: 21548.38 (SE +/- 18.22, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_3d - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 18.70 (SE +/- 0.00, N = 3, MIN: 18.37); SMT Disabled: 17.56 (SE +/- 0.09, N = 3, MIN: 17.1). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
NAS Parallel Benchmarks 3.3.1, Test / Class: SP.A. Total Mop/s, More Is Better. SMT Enabled - Default: 4293.52 (SE +/- 2.82, N = 3); SMT Disabled: 4399.16 (SE +/- 9.96, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 2.1.1
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 112.71 (SE +/- 0.21, N = 3, MIN: 111.1); SMT Disabled: 108.94 (SE +/- 0.16, N = 3, MIN: 107.13). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 2.4, Test: OpenMP LavaMD. Seconds, Fewer Is Better. SMT Disabled: 62.20 (SE +/- 0.01, N = 3); SMT Enabled - Default: 52.26 (SE +/- 0.06, N = 3). (CXX) g++ options: -O2 -lOpenCL
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_alexnet - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 257.95 (SE +/- 0.49, N = 3, MIN: 256.12); SMT Disabled: 248.32 (SE +/- 0.21, N = 3, MIN: 246.46). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_all - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 5993.69 (SE +/- 22.04, N = 3, MIN: 5634.28); SMT Disabled: 4559.32 (SE +/- 1.43, N = 3, MIN: 4450.23). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_1d - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 26.52 (SE +/- 0.04, N = 3, MIN: 25.61); SMT Disabled: 20.67 (SE +/- 0.15, N = 3, MIN: 20.19). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Rodinia 2.4, Test: OpenMP CFD Solver. Seconds, Fewer Is Better. SMT Disabled: 15.50 (SE +/- 0.07, N = 3); SMT Enabled - Default: 13.91 (SE +/- 0.02, N = 3). (CXX) g++ options: -O2 -lOpenCL
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_all - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 2107.82 (SE +/- 0.80, N = 3, MIN: 2091.72); SMT Disabled: 2011.48 (SE +/- 0.71, N = 3, MIN: 1994.1). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Rodinia 2.4, Test: OpenMP Streamcluster. Seconds, Fewer Is Better. SMT Enabled - Default: 21.20 (SE +/- 0.00, N = 3); SMT Disabled: 16.02 (SE +/- 0.02, N = 3). (CXX) g++ options: -O2 -lOpenCL
MKL-DNN 2019-04-16, Harness: IP Batch All - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 210.10 (SE +/- 1.44, N = 3, MIN: 126.74); SMT Disabled: 89.54 (SE +/- 0.14, N = 3, MIN: 88.32). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
MKL-DNN 2019-04-16, Harness: IP Batch 1D - Data Type: f32. ms, Fewer Is Better. SMT Enabled - Default: 17.25 (SE +/- 0.29, N = 3, MIN: 11.06); SMT Disabled: 7.11 (SE +/- 0.02, N = 3, MIN: 7.02). (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
NAMD 2.13b1, ATPase Simulation - 327,506 Atoms. days/ns, Fewer Is Better. SMT Disabled: 1.72944 (SE +/- 0.00287, N = 3); SMT Enabled - Default: 1.44394 (SE +/- 0.00061, N = 3).
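NAMD reports days/ns (fewer is better), which is the inverse of the ns/day unit used for the GROMACS result. The following is a small Python sketch of the conversion, applied to the two NAMD results above. The function name `ns_per_day` is illustrative.

```python
def ns_per_day(days_per_ns):
    """Convert a days/ns figure (fewer is better) to ns/day (more is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# NAMD ATPase results from this file, in days/ns
namd = {"SMT Enabled - Default": 1.44394, "SMT Disabled": 1.72944}

for config, value in namd.items():
    print(f"{config}: {value} days/ns = {ns_per_day(value):.4f} ns/day")
```

The conversion only makes the unit easier to read; NAMD and GROMACS simulate different systems, so their ns/day figures are not directly comparable.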
CP2K Molecular Dynamics CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently makes use of the OpenMP implementation, uses the Fayalite-FIST molecular dynamics run, and measures the total time to complete. Learn more via the OpenBenchmarking.org test page.
CP2K Molecular Dynamics 6.1, Fayalite-FIST Data. Seconds, Fewer Is Better. SMT Disabled: 446.70; SMT Enabled - Default: 323.08.
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
Primesieve 7.4, 1e12 Prime Number Generation. Seconds, Fewer Is Better. SMT Disabled: 16.72 (SE +/- 0.02, N = 3); SMT Enabled - Default: 15.43 (SE +/- 0.02, N = 3). (CXX) g++ options: -O3 -lpthread
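For readers unfamiliar with the sieve of Eratosthenes mentioned in the Primesieve description, the following is a plain textbook version in Python. It is only an illustration of the basic idea and not Primesieve's own optimized implementation; the function name `primes_up_to` is illustrative.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a basic sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag per integer 0..limit; 1 means "still assumed prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross out every multiple of p starting at p*p.
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytearray(count)
        p += 1
    return [n for n, flag in enumerate(is_prime) if flag]


print(primes_up_to(30))
```

The flag array for a limit of 1e12 would not fit in memory with this approach, which is one reason production sieves process the range in smaller segments.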
Rust Prime Benchmark Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang. Learn more via the OpenBenchmarking.org test page.
Rust Prime Benchmark, Prime Number Test To 200,000,000. Seconds, Fewer Is Better. SMT Disabled: 40.48 (SE +/- 0.08, N = 3); SMT Enabled - Default: 30.87 (SE +/- 0.07, N = 3). (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
Stockfish 9, Total Time. Nodes Per Second, More Is Better. SMT Disabled: 28765888 (SE +/- 316141.73, N = 3); SMT Enabled - Default: 39137992 (SE +/- 320057.07, N = 3). (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
Open FMM Nero2D This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples. Learn more via the OpenBenchmarking.org test page.
Open FMM Nero2D 2.0.2, Total Time. Seconds, Fewer Is Better. SMT Enabled - Default: 72.34 (SE +/- 0.29, N = 3); SMT Disabled: 67.01 (SE +/- 0.29, N = 3). (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
GraphicsMagick 1.3.30, Operation: Swirl. Iterations Per Minute, More Is Better. SMT Enabled - Default: 231 (SE +/- 0.67, N = 3); SMT Disabled: 233 (SE +/- 1.53, N = 3). (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30, Operation: Rotate. Iterations Per Minute, More Is Better. SMT Enabled - Default: 263 (SE +/- 0.88, N = 3); SMT Disabled: 269 (SE +/- 1.00, N = 3). (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30, Operation: Sharpen. Iterations Per Minute, More Is Better. SMT Enabled - Default: 165; SMT Disabled: 173. Only one standard error is reported: SE +/- 0.33, N = 3. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30, Operation: Enhanced. Iterations Per Minute, More Is Better. SMT Enabled - Default: 197; SMT Disabled: 203. Only one standard error is reported: SE +/- 0.33, N = 3. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30, Operation: Resizing. Iterations Per Minute, More Is Better. SMT Enabled - Default: 266 (SE +/- 1.20, N = 3); SMT Disabled: 272 (SE +/- 1.20, N = 3). (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30, Operation: Noise-Gaussian. Iterations Per Minute, More Is Better. SMT Disabled: 145; SMT Enabled - Default: 169. Only one standard error is reported: SE +/- 0.33, N = 3. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.30, Operation: HWB Color Space. Iterations Per Minute, More Is Better. SMT Disabled: 283 (SE +/- 1.20, N = 3); SMT Enabled - Default: 285 (SE +/- 0.88, N = 3). (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-AV1 0.5, 1080p 8-bit YUV To AV1 Video Encode. Frames Per Second, More Is Better. SMT Disabled: 32.24 (SE +/- 0.02, N = 3); SMT Enabled - Default: 41.35 (SE +/- 0.25, N = 3). (CXX) g++ options: -O3 -pie -lpthread -lm
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel. Seconds, Fewer Is Better. SMT Disabled: 56.14 (SE +/- 0.04, N = 3); SMT Enabled - Default: 53.23 (SE +/- 0.02, N = 3). (CC) gcc options: -lm -lpthread -O3
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping. FPS, More Is Better. SMT Disabled: 483.15 (SE +/- 1.29, N = 3); SMT Enabled - Default: 643.90 (SE +/- 1.06, N = 3). (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-HEVC 2019-02-03, 1080p 8-bit YUV To HEVC Video Encode. Frames Per Second, More Is Better. SMT Disabled: 200.78 (SE +/- 1.35, N = 3); SMT Enabled - Default: 251.39 (SE +/- 1.83, N = 3). (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
FFmpeg This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.
FFmpeg 4.0.2, H.264 HD To NTSC DV. Seconds, Fewer Is Better. SMT Enabled - Default: 8.45 (SE +/- 0.06, N = 3); SMT Disabled: 4.02 (SE +/- 0.01, N = 3). (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
Smallpt 1.0, Global Illumination Renderer; 128 Samples. Seconds, Fewer Is Better. SMT Disabled: 10.62 (SE +/- 0.01, N = 3); SMT Enabled - Default: 8.33 (SE +/- 0.01, N = 3). (CXX) g++ options: -fopenmp -O3
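The unbiased Monte Carlo estimation that the Smallpt description refers to can be illustrated on a one-dimensional integral. The Python sketch below is not Smallpt code and does no ray tracing. It only shows the underlying idea of averaging random samples to estimate an integral. The integrand, sample count, seed, and the function name `mc_integrate` are all chosen for illustration.

```python
import random


def mc_integrate(f, n_samples, seed=0):
    """Estimate the integral of f over [0, 1] by averaging uniform random samples."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    total = 0.0
    for _ in range(n_samples):
        total += f(rng.random())
    return total / n_samples


# The exact value of the integral of x^2 over [0, 1] is 1/3.
estimate = mc_integrate(lambda x: x * x, 100_000)
print(f"estimate = {estimate:.4f}, exact = {1 / 3:.4f}")
```

Each sample is independent of the others, which is the property that makes this kind of workload straightforward to spread across threads.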
SMT Enabled - Default
Testing initiated at 31 July 2019 09:29 by user phoronix.
SMT Disabled
Testing initiated at 31 July 2019 13:23 by user phoronix.