AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.
SMT Enabled - Default Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.3.0-999-generic (x86_64) 20190725, Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, OpenGL: 4.5 Mesa 19.0.2 (LLVM 8.0.0), Compiler: GCC 7.4.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil
Python Notes: Python 2.7.15+ + Python 3.6.8
Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling
SMT Disabled: Changed Processor to AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores).
Processor Change: Scaling Governor: acpi-cpufreq ondemand
Security Change: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
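The three settings that differ between the two runs (SMT state, scaling governor, STIBP status) are exposed by the Linux kernel through sysfs. The following is a minimal sketch for recording them before a run; it is not part of the Phoronix Test Suite, the helper names `read_sysfs` and `describe_cpu_config` are illustrative, and the source does not say how SMT was actually toggled for these tests.

```python
from pathlib import Path


def read_sysfs(path, default="unknown"):
    """Return the stripped contents of a sysfs file, or a default if unreadable."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return default


def describe_cpu_config(sysfs_root="/sys/devices/system/cpu"):
    """Collect SMT state, cpufreq governor and Spectre v2 mitigation status."""
    root = Path(sysfs_root)
    return {
        # "on", "off", "forceoff" or "notsupported" on kernels that expose it
        "smt_control": read_sysfs(root / "smt" / "control"),
        # "1" when sibling threads are online, "0" otherwise
        "smt_active": read_sysfs(root / "smt" / "active"),
        # e.g. "schedutil" or "ondemand"; only cpu0 is sampled in this sketch
        "governor": read_sysfs(root / "cpu0" / "cpufreq" / "scaling_governor"),
        # includes the STIBP state on affected CPUs
        "spectre_v2": read_sysfs(root / "vulnerabilities" / "spectre_v2"),
    }


if __name__ == "__main__":
    for key, value in describe_cpu_config().items():
        print(f"{key}: {value}")
```

Saving this output alongside each result set makes it easier to see that more than one variable changed between runs, as happened here with the governor.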
[Chart: SMT Enabled - Default vs. SMT Disabled Comparison (Phoronix Test Suite). Per-test relative difference against the baseline; the differences shown range from 2.3% to 142.6%.]
Ryzen 9 3900X Linux SMT Performance
SMT Enabled - Default  SMT Disabled  Test
643.90                 483.15        ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
41.35                  32.24         svt-av1: 1080p 8-bit YUV To AV1 Video Encode
251.39                 200.78        svt-hevc: 1080p 8-bit YUV To HEVC Video Encode
40.66                  48.90         x265: H.265 1080p Video Encoding
231                    233           graphics-magick: Swirl
263                    269           graphics-magick: Rotate
165                    173           graphics-magick: Sharpen
197                    203           graphics-magick: Enhanced
266                    272           graphics-magick: Resizing
169                    145           graphics-magick: Noise-Gaussian
285                    283           graphics-magick: HWB Color Space
532587.52              376805.08     coremark: CoreMark Size 666 - Iterations Per Second
20558                  15709         v-ray: CPU
2.03                   1.38          indigobench: Bedroom
4.35                   3.06          indigobench: Supercar
1386.87                1369.03       himeno: Poisson Pressure Solver
77922                  59108         compress-7zip: Compress Speed Test
39137992               28765888      stockfish: Total Time
39682621               28280592      asmfish: 1024 Hash Memory, 26 Depth
0.99                   0.92          gromacs: Water Benchmark
852324587              860433765     swet: Average
6417.48                6440.11       npb: BT.A
477.09                 480.69        npb: EP.C
5711.26                5701.85       npb: FT.A
6193.02                6218.67       npb: FT.B
50909.30               52828.71      npb: LU.A
21264.92               21548.38      npb: LU.C
4293.52                4399.16       npb: SP.A
1.44394                1.72944       namd: ATPase Simulation - 327,506 Atoms
17.25                  7.11          mkl-dnn: IP Batch 1D - f32
210.10                 89.54         mkl-dnn: IP Batch All - f32
18.70                  17.56         mkl-dnn: Convolution Batch conv_3d - f32
2107.82                2011.48       mkl-dnn: Convolution Batch conv_all - f32
26.52                  20.67         mkl-dnn: Deconvolution Batch deconv_1d - f32
5.05                   4.89          mkl-dnn: Deconvolution Batch deconv_3d - f32
257.95                 248.32        mkl-dnn: Convolution Batch conv_alexnet - f32
5993.69                4559.32       mkl-dnn: Deconvolution Batch deconv_all - f32
112.71                 108.94        mkl-dnn: Convolution Batch conv_googlenet_v3 - f32
151.13                 74.27         parboil: OpenMP LBM
2.22                   3.22          parboil: OpenMP CUTCP
15.12                  28.89         parboil: OpenMP Stencil
28.84                  18.48         parboil: OpenMP MRI Gridding
3.79                   3.27          cloverleaf: Lagrangian-Eulerian Hydrodynamics
323.08                 446.70        cp2k: Fayalite-FIST Data
52.26                  62.20         rodinia: OpenMP LavaMD
13.91                  15.50         rodinia: OpenMP CFD Solver
21.20                  16.02         rodinia: OpenMP Streamcluster
72.34                  67.01         nero2d: Total Time
48.63                  58.80         build-linux-kernel: Time To Compile
53.23                  56.14         c-ray: Total Time - 4K, 16 Rays Per Pixel
15.43                  16.72         primesieve: 1e12 Prime Number Generation
30.87                  40.48         rust-prime: Prime Number Test To 200,000,000
8.33                   10.62         smallpt: Global Illumination Renderer; 128 Samples
25.60                  37.18         compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
18.05                  21.50         compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19
8.45                   4.02          ffmpeg: H.264 HD To NTSC DV
710.12                 1027.18       blender: Barbershop - CPU-Only
271.85                 370.45        appleseed: Emily
167.95                 198.39        appleseed: Disney Material
163.74                 216.56        appleseed: Material Tester
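The relative difference between the two runs can be derived from any pair of values above. The following is a minimal sketch, assuming the percentage is the better result divided by the worse one, minus one, where "better" depends on whether the test is more-is-better or fewer-is-better. The function name `compare` is illustrative and not part of the Phoronix Test Suite. Under that assumption the MKL-DNN IP Batch 1D results (17.25 ms vs. 7.11 ms) give 142.6%, which is the largest difference in the comparison chart.

```python
def compare(smt_on, smt_off, higher_is_better):
    """Return (winner, percent_advantage) for one test result pair.

    percent_advantage is how much better the winning configuration is,
    expressed relative to the losing one and rounded to one decimal.
    """
    if smt_on == smt_off:
        return "Tie", 0.0
    ratio = max(smt_on, smt_off) / min(smt_on, smt_off)
    # SMT Enabled wins when it has the larger value on a more-is-better test,
    # or the smaller value on a fewer-is-better test.
    on_wins = (smt_on > smt_off) == higher_is_better
    winner = "SMT Enabled" if on_wins else "SMT Disabled"
    return winner, round((ratio - 1.0) * 100.0, 1)


if __name__ == "__main__":
    # (name, SMT enabled, SMT disabled, higher_is_better), values from the table
    results = [
        ("ttsiod-renderer (FPS)", 643.90, 483.15, True),
        ("mkl-dnn IP Batch 1D (ms)", 17.25, 7.11, False),
        ("ffmpeg H.264 HD To NTSC DV (s)", 8.45, 4.02, False),
    ]
    for name, on, off, hib in results:
        winner, pct = compare(on, off, hib)
        print(f"{name}: {winner} ahead by {pct}%")
```

Passing the direction explicitly matters because the table mixes units where larger is better (FPS, Mop/s) with units where smaller is better (seconds, ms, days/ns).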
TTSIOD 3D Renderer
A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based.
TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)
  SMT Disabled: 483.15 (SE +/- 1.29, N = 3)
  SMT Enabled - Default: 643.90 (SE +/- 1.06, N = 3)
  1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
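The "SE +/- x, N = 3" annotations on each result report a standard error over three runs. The following is a minimal sketch of that statistic, assuming the usual definition (sample standard deviation divided by the square root of N). The source does not state which estimator is used, and the run values below are hypothetical because the individual runs are not included in this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("at least two samples are needed")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical FPS values for three runs; NOT taken from the result file.
    runs = [642.1, 643.8, 645.8]
    print(f"mean = {statistics.mean(runs):.2f}")
    print(f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}")
```

With only three runs per test, a small SE mainly indicates that the runs agreed with each other; it says nothing about systematic differences between the two configurations.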
SVT-AV1
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file.
SVT-AV1 0.5, 1080p 8-bit YUV To AV1 Video Encode (Frames Per Second, More Is Better)
  SMT Disabled: 32.24 (SE +/- 0.02, N = 3)
  SMT Enabled - Default: 41.35 (SE +/- 0.25, N = 3)
  1. (CXX) g++ options: -O3 -pie -lpthread -lm
SVT-HEVC
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file.
SVT-HEVC 2019-02-03, 1080p 8-bit YUV To HEVC Video Encode (Frames Per Second, More Is Better)
  SMT Disabled: 200.78 (SE +/- 1.35, N = 3)
  SMT Enabled - Default: 251.39 (SE +/- 1.83, N = 3)
  1. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
GraphicsMagick
This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU.
GraphicsMagick 1.3.30, Operation: Swirl (Iterations Per Minute, More Is Better)
  SMT Enabled - Default: 231 (SE +/- 0.67, N = 3)
  SMT Disabled: 233 (SE +/- 1.53, N = 3)
GraphicsMagick 1.3.30, Operation: Rotate (Iterations Per Minute, More Is Better)
  SMT Enabled - Default: 263 (SE +/- 0.88, N = 3)
  SMT Disabled: 269 (SE +/- 1.00, N = 3)
GraphicsMagick 1.3.30, Operation: Sharpen (Iterations Per Minute, More Is Better)
  SMT Enabled - Default: 165
  SMT Disabled: 173
  SE +/- 0.33, N = 3 (the source gives a single SE value for this test)
GraphicsMagick 1.3.30, Operation: Enhanced (Iterations Per Minute, More Is Better)
  SMT Enabled - Default: 197
  SMT Disabled: 203
  SE +/- 0.33, N = 3 (the source gives a single SE value for this test)
GraphicsMagick 1.3.30, Operation: Resizing (Iterations Per Minute, More Is Better)
  SMT Enabled - Default: 266 (SE +/- 1.20, N = 3)
  SMT Disabled: 272 (SE +/- 1.20, N = 3)
GraphicsMagick 1.3.30, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
  SMT Disabled: 145
  SMT Enabled - Default: 169
  SE +/- 0.33, N = 3 (the source gives a single SE value for this test)
GraphicsMagick 1.3.30, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
  SMT Disabled: 283 (SE +/- 1.20, N = 3)
  SMT Enabled - Default: 285 (SE +/- 0.88, N = 3)
  1. (CC) gcc options (all GraphicsMagick operations): -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread
Stockfish
This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores.
Stockfish 9, Total Time (Nodes Per Second, More Is Better)
  SMT Disabled: 28765888 (SE +/- 316141.73, N = 3)
  SMT Enabled - Default: 39137992 (SE +/- 320057.07, N = 3)
  1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
GROMACS
This tests the GROMACS molecular dynamics package on the CPU with the water_GMX50 data.
GROMACS 2018.3, Water Benchmark (Ns Per Day, More Is Better)
  SMT Disabled: 0.92 (SE +/- 0.00, N = 3)
  SMT Enabled - Default: 0.99 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -march=core-avx2 -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm
NAS Parallel Benchmarks
NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB.
NAS Parallel Benchmarks 3.3.1, Test / Class: BT.A (Total Mop/s, More Is Better)
  SMT Enabled - Default: 6417.48 (SE +/- 8.30, N = 3)
  SMT Disabled: 6440.11 (SE +/- 15.50, N = 3)
NAS Parallel Benchmarks 3.3.1, Test / Class: EP.C (Total Mop/s, More Is Better)
  SMT Enabled - Default: 477.09 (SE +/- 0.04, N = 3)
  SMT Disabled: 480.69 (SE +/- 0.52, N = 3)
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.A (Total Mop/s, More Is Better)
  SMT Disabled: 5701.85 (SE +/- 4.89, N = 3)
  SMT Enabled - Default: 5711.26 (SE +/- 3.40, N = 3)
NAS Parallel Benchmarks 3.3.1, Test / Class: FT.B (Total Mop/s, More Is Better)
  SMT Enabled - Default: 6193.02 (SE +/- 7.75, N = 3)
  SMT Disabled: 6218.67 (SE +/- 15.63, N = 3)
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.A (Total Mop/s, More Is Better)
  SMT Enabled - Default: 50909.30 (SE +/- 371.45, N = 3)
  SMT Disabled: 52828.71 (SE +/- 457.69, N = 3)
NAS Parallel Benchmarks 3.3.1, Test / Class: LU.C (Total Mop/s, More Is Better)
  SMT Enabled - Default: 21264.92 (SE +/- 16.81, N = 3)
  SMT Disabled: 21548.38 (SE +/- 18.22, N = 3)
NAS Parallel Benchmarks 3.3.1, Test / Class: SP.A (Total Mop/s, More Is Better)
  SMT Enabled - Default: 4293.52 (SE +/- 2.82, N = 3)
  SMT Disabled: 4399.16 (SE +/- 9.96, N = 3)
  1. (F9X) gfortran options (all NPB tests): -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 2.1.1
NAMD
NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign.
NAMD 2.13b1, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  SMT Disabled: 1.72944 (SE +/- 0.00287, N = 3)
  SMT Enabled - Default: 1.44394 (SE +/- 0.00061, N = 3)
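NAMD reports days/ns (fewer is better) while GROMACS above reports ns/day (more is better); the two units are reciprocals of each other. The following is a small sketch of the conversion, using the NAMD values from this result file; the function name is illustrative. The two workloads simulate different systems, so the converted figures are not directly comparable with the GROMACS numbers.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days/ns figure (fewer is better) to ns/day (more is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


if __name__ == "__main__":
    # NAMD ATPase results from this file, in days/ns
    namd = {"SMT Enabled - Default": 1.44394, "SMT Disabled": 1.72944}
    for config, value in namd.items():
        print(f"{config}: {days_per_ns_to_ns_per_day(value):.4f} ns/day")
```

Because the conversion is a reciprocal, the ratio between the two configurations is unchanged; only the direction of "better" flips.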
MKL-DNN
This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, using its built-in benchdnn functionality. The result is the total perf time reported.
MKL-DNN 2019-04-16, Harness: IP Batch 1D - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 17.25 (SE +/- 0.29, N = 3; MIN: 11.06)
  SMT Disabled: 7.11 (SE +/- 0.02, N = 3; MIN: 7.02)
MKL-DNN 2019-04-16, Harness: IP Batch All - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 210.10 (SE +/- 1.44, N = 3; MIN: 126.74)
  SMT Disabled: 89.54 (SE +/- 0.14, N = 3; MIN: 88.32)
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_3d - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 18.70 (SE +/- 0.00, N = 3; MIN: 18.37)
  SMT Disabled: 17.56 (SE +/- 0.09, N = 3; MIN: 17.1)
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_all - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 2107.82 (SE +/- 0.80, N = 3; MIN: 2091.72)
  SMT Disabled: 2011.48 (SE +/- 0.71, N = 3; MIN: 1994.1)
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_1d - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 26.52 (SE +/- 0.04, N = 3; MIN: 25.61)
  SMT Disabled: 20.67 (SE +/- 0.15, N = 3; MIN: 20.19)
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_3d - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 5.05 (SE +/- 0.01, N = 3; MIN: 4.97)
  SMT Disabled: 4.89 (SE +/- 0.01, N = 3; MIN: 4.79)
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_alexnet - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 257.95 (SE +/- 0.49, N = 3; MIN: 256.12)
  SMT Disabled: 248.32 (SE +/- 0.21, N = 3; MIN: 246.46)
MKL-DNN 2019-04-16, Harness: Deconvolution Batch deconv_all - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 5993.69 (SE +/- 22.04, N = 3; MIN: 5634.28)
  SMT Disabled: 4559.32 (SE +/- 1.43, N = 3; MIN: 4450.23)
MKL-DNN 2019-04-16, Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 (ms, Fewer Is Better)
  SMT Enabled - Default: 112.71 (SE +/- 0.21, N = 3; MIN: 111.1)
  SMT Disabled: 108.94 (SE +/- 0.16, N = 3; MIN: 107.13)
  1. (CXX) g++ options (all MKL-DNN harnesses): -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Parboil
The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for studying computing architectures and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile only makes use of the OpenMP and OpenCL test workloads.
Parboil 2.5, Test: OpenMP LBM (Seconds, Fewer Is Better)
  SMT Enabled - Default: 151.13 (SE +/- 0.04, N = 3)
  SMT Disabled: 74.27 (SE +/- 0.53, N = 3)
Parboil 2.5, Test: OpenMP CUTCP (Seconds, Fewer Is Better)
  SMT Disabled: 3.22 (SE +/- 0.01, N = 3)
  SMT Enabled - Default: 2.22 (SE +/- 0.00, N = 3)
Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better)
  SMT Disabled: 28.89 (SE +/- 0.32, N = 3)
  SMT Enabled - Default: 15.12 (SE +/- 0.02, N = 3)
Parboil 2.5, Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)
  SMT Enabled - Default: 28.84 (SE +/- 0.01, N = 3)
  SMT Disabled: 18.48 (SE +/- 0.02, N = 3)
  1. (CXX) g++ options (all Parboil tests): -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
CloverLeaf
CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently uses CloverLeaf's OpenMP version and is benchmarked with the clover_bm8192.in input file.
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  SMT Enabled - Default: 3.79 (SE +/- 0.00, N = 3)
  SMT Disabled: 3.27 (SE +/- 0.01, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
CP2K Molecular Dynamics
CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the OpenMP implementation with the Fayalite-FIST molecular dynamics run and measures the total time to complete.
CP2K Molecular Dynamics 6.1, Fayalite-FIST Data (Seconds, Fewer Is Better)
  SMT Disabled: 446.70
  SMT Enabled - Default: 323.08
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment.
Rodinia 2.4, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
  SMT Disabled: 62.20 (SE +/- 0.01, N = 3)
  SMT Enabled - Default: 52.26 (SE +/- 0.06, N = 3)
Rodinia 2.4, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
  SMT Disabled: 15.50 (SE +/- 0.07, N = 3)
  SMT Enabled - Default: 13.91 (SE +/- 0.02, N = 3)
Rodinia 2.4, Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
  SMT Enabled - Default: 21.20 (SE +/- 0.00, N = 3)
  SMT Disabled: 16.02 (SE +/- 0.02, N = 3)
  1. (CXX) g++ options (all Rodinia tests): -O2 -lOpenCL
Open FMM Nero2D
This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples.
Open FMM Nero2D 2.0.2, Total Time (Seconds, Fewer Is Better)
  SMT Enabled - Default: 72.34 (SE +/- 0.29, N = 3)
  SMT Disabled: 67.01 (SE +/- 0.29, N = 3)
  1. (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi
C-Ray
This is a test of C-Ray, a simple raytracer designed to test floating-point CPU performance. The test is multi-threaded (16 threads per core); the result reported here is for a 4K render with 16 rays per pixel for anti-aliasing.
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
  SMT Disabled: 56.14 (SE +/- 0.04, N = 3)
  SMT Enabled - Default: 53.23 (SE +/- 0.02, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3
Primesieve
Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance.
Primesieve 7.4, 1e12 Prime Number Generation (Seconds, Fewer Is Better)
  SMT Disabled: 16.72 (SE +/- 0.02, N = 3)
  SMT Enabled - Default: 15.43 (SE +/- 0.02, N = 3)
  1. (CXX) g++ options: -O3 -lpthread
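For readers unfamiliar with the algorithm behind this test, the following is a minimal textbook sieve of Eratosthenes. It is only an illustration of the basic idea: it is not Primesieve's optimized implementation, and it would be impractical at the 1e12 scale used in the benchmark. The function name `primes_up_to` is illustrative.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a basic sieve of Eratosthenes."""
    if limit < 2:
        return []
    # is_prime[n] stays True until n is found to be a multiple of a smaller prime
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Start at p*p: smaller multiples were already crossed out
            for multiple in range(p * p, limit + 1, p):
                is_prime[multiple] = False
        p += 1
    return [n for n, flag in enumerate(is_prime) if flag]


if __name__ == "__main__":
    print(primes_up_to(50))
```

The inner loop walks the flag array with a stride of p, which is why the working set and cache behaviour dominate the running time of sieve-based benchmarks.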
Rust Prime Benchmark
Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang.
Rust Prime Benchmark, Prime Number Test To 200,000,000 (Seconds, Fewer Is Better)
  SMT Disabled: 40.48 (SE +/- 0.08, N = 3)
  SMT Enabled - Default: 30.87 (SE +/- 0.07, N = 3)
  1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
Smallpt
Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library.
Smallpt 1.0, Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better)
  SMT Disabled: 10.62 (SE +/- 0.01, N = 3)
  SMT Enabled - Default: 8.33 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -fopenmp -O3
FFmpeg
This test uses FFmpeg for testing the system's audio/video encoding performance.
FFmpeg 4.0.2, H.264 HD To NTSC DV (Seconds, Fewer Is Better)
  SMT Enabled - Default: 8.45 (SE +/- 0.06, N = 3)
  SMT Disabled: 4.02 (SE +/- 0.01, N = 3)
  1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
SMT Enabled - Default: Testing initiated at 31 July 2019 09:29 by user phoronix.
SMT Disabled: Testing initiated at 31 July 2019 13:23 by user phoronix.