Ryzen 9 3900X Linux SMT Performance

AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1908041-HV-1907314AS66&grt&sro.

Ryzen 9 3900X Linux SMT PerformanceProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionVulkanSMT Enabled - DefaultSMT Disabled2700 - 4 GHz2700 - 4 GHz II2700 - 4 GHz Arch kernelAMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 7.4.0ext43840x2160AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores)AMD Ryzen 7 2700 Eight-Core @ 4.00GHz (8 Cores / 16 Threads)ASUS ROG STRIX B350-F GAMING (5008 BIOS)AMD 17h250GB Western Digital WDS250G2X0C-00L350 + 2000GB Seagate ST2000DM006-2DM1 + 240GB Corsair Force GS + 500GB Western Digital WD5000BEKT-0 + 1000GB Seagate ST1000LM024 HN-MAMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1266/2000MHz)AMD Ellesmere HDMI AudioLG ULTRAWIDEIntel I211 + Qualcomm Atheros AR93xxArch Linux5.2.5-arch1-1-ryzen (x86_64)GNOME Shell 3.32.2X Server 1.20.5modesetting 1.20.54.5 Mesa 19.1.3 (LLVM 8.0.1)1.1.90GCC 9.1.0 + Clang 8.0.1xfs2560x10805.2.5-arch1-1-ARCH (x86_64)OpenBenchmarking.orgCompiler Details- SMT Enabled - Default: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - SMT Disabled: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - 2700 - 4 GHz: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu - 2700 - 4 GHz II: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu - 2700 - 4 GHz Arch kernel: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu Processor Details- SMT Enabled - Default: Scaling Governor: acpi-cpufreq schedutil- SMT Disabled: Scaling Governor: acpi-cpufreq ondemand- 2700 - 4 GHz: Scaling Governor: acpi-cpufreq schedutil- 2700 - 4 GHz II: Scaling Governor: acpi-cpufreq schedutil- 2700 - 4 GHz Arch kernel: Scaling Governor: acpi-cpufreq schedutilPython Details- SMT Enabled - Default: Python 2.7.15+ + Python 3.6.8- SMT Disabled: Python 2.7.15+ + Python 3.6.8- 2700 - 4 GHz: Python 3.7.4- 2700 - 4 GHz II: Python 3.7.4- 2700 - 4 GHz Arch kernel: Python 3.7.4Security Details- SMT Enabled - Default: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling- SMT Disabled: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling- 2700 - 4 GHz: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling- 2700 - 4 GHz II: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling- 2700 - 4 GHz Arch kernel: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling

Ryzen 9 3900X Linux SMT Performancecompress-7zip: Compress Speed Testappleseed: Emilyappleseed: Disney Materialappleseed: Material Testerasmfish: 1024 Hash Memory, 26 Depthblender: Barbershop - CPU-Onlyc-ray: Total Time - 4K, 16 Rays Per Pixelv-ray: CPUcloverleaf: Lagrangian-Eulerian Hydrodynamicscoremark: CoreMark Size 666 - Iterations Per Secondcp2k: Fayalite-FIST Dataffmpeg: H.264 HD To NTSC DVgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacegromacs: Water Benchmarkhimeno: Poisson Pressure Solverindigobench: Bedroomindigobench: Supercarmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32namd: ATPase Simulation - 327,506 Atomsnpb: BT.Anpb: EP.Cnpb: FT.Anpb: FT.Bnpb: LU.Anpb: LU.Cnpb: SP.Anero2d: Total Timeparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingprimesieve: 1e12 Prime Number Generationrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterrust-prime: Prime Number Test To 200,000,000smallpt: Global Illumination Renderer; 128 Samplesstockfish: Total Timesvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodeswet: Averagebuild-linux-kernel: Time To Compilettsiod-renderer: Phong Rendering With Soft-Shadow Mappingx265: H.265 1080p Video Encodingcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19SMT Enabled - DefaultSMT Disabled2700 - 4 GHz2700 - 4 GHz II2700 - 4 GHz Arch kernel779222721681643968262171053.23205583.795325883238.452312631651972661692850.9913872.034.3517.25210.1018.70210826.525.0525859941131.443946417477571161935090921265429472.34151.132.2215.1228.8415.4352.2613.9121.2030.878.333913799241.3525185232458748.6364440.6625.6018.055910837019821728280592102756.14157093.273768054474.022332691732032721452830.9213691.383.067.1189.5417.56201120.674.8924845591091.729446440481570262195282921548439967.0174.273.2228.8918.4816.7262.2015.5016.0240.4810.622876588832.2420186043376558.8048348.9037.1821.5011213704187404241913799742423322424094759116064.345.0233195252812.551952411441642261722490.2012561.242.6474.59920.89206.2143446239.2483.1453984075823292.2483711163844216445756145309525140.113.1525.62100.5824.0830.3027.6420.7211.632324844826.08153672310326104.5842731.0643.0031.073758842623622324266366116064.365.2333060853412.691942411441642271692480.2012551.242.6274.69916.77204.8743401239.0482.6154054094923302.2489111213944193448455275321524142.903.1415.74100.5624.1030.2127.6220.7111.632292585126.01153669827962105.9042731.4843.0630.94OpenBenchmarking.org

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default20K40K60K80K100KSE +/- 49.84, N = 3SE +/- 284.83, N = 3SE +/- 393.22, N = 3SE +/- 323.94, N = 3375883799759108779221. (CXX) g++ options: -pipe -lpthread

Appleseed

Scene: Emily

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Emily2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default90180270360450426424370272

Appleseed

Scene: Disney Material

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney Material2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default50100150200250236233198168

Appleseed

Scene: Material Tester

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material Tester2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default50100150200250223224217164

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default8M16M24M32M40MSE +/- 250611.74, N = 3SE +/- 191988.13, N = 3SE +/- 41988.40, N = 3SE +/- 201681.41, N = 324266366240947592828059239682621

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: CPU-Only2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default2004006008001000116011601027710

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default1428425670SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 364.3664.3456.1453.231. (CC) gcc options: -lm -lpthread -O3

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.03Mode: CPUSMT DisabledSMT Enabled - Default4K8K12K16K20KSE +/- 24.84, N = 3SE +/- 55.98, N = 31570920558

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default1.17682.35363.53044.70725.884SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.235.023.273.791. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default110K220K330K440K550KSE +/- 1406.14, N = 3SE +/- 507.36, N = 3SE +/- 731.59, N = 3SE +/- 2501.14, N = 33306083319523768055325881. (CC) gcc options: -O2 -lrt" -lrt

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 6.1Fayalite-FIST Data2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default120240360480600534528447323

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DV2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default3691215SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 312.6912.554.028.45-lxcb-shm -lva -lva-drm -lva-x11 -lvdpau-lxcb-shm -lva -lva-drm -lva-x11 -lvdpau-lsndio-lsndio1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -pthread -lSDL2 -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Swirl2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default50100150200250SE +/- 0.67, N = 3SE +/- 1.53, N = 3SE +/- 0.67, N = 3194195233231-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default60120180240300SE +/- 1.00, N = 3SE +/- 0.88, N = 3241241269263-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default4080120160200SE +/- 0.33, N = 3144144173165-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Enhanced2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default4080120160200SE +/- 0.33, N = 3164164203197-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default60120180240300SE +/- 0.67, N = 3SE +/- 1.20, N = 3SE +/- 1.20, N = 3227226272266-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-Gaussian2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default4080120160200SE +/- 2.13, N = 5SE +/- 0.88, N = 3SE +/- 0.33, N = 3169172145169-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color Space2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default60120180240300SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.88, N = 3248249283285-llcms2 -ljasper -lxml2-llcms2 -ljasper -lxml2-ljbig-ljbig1. (CC) gcc options: -fopenmp -O2 -pthread -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water Benchmark2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default0.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.200.200.920.99-march=core-avx2-march=core-avx21. (CXX) g++ options: -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default30060090012001500SE +/- 3.71, N = 3SE +/- 2.94, N = 3SE +/- 3.89, N = 3SE +/- 1.81, N = 312551256136913871. (CC) gcc options: -O3 -mavx2

IndigoBench

Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: Bedroom2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default0.45680.91361.37041.82722.284SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.241.241.382.03

IndigoBench

Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: Supercar2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default0.97881.95762.93643.91524.894SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.622.643.064.35

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default20406080100SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.29, N = 374.6974.597.1117.25MIN: 74.07MIN: 73.94MIN: 11.061. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default2004006008001000SE +/- 0.23, N = 3SE +/- 2.97, N = 3SE +/- 0.14, N = 3SE +/- 1.44, N = 3916.77920.8989.54210.10MIN: 910.61MIN: 911.49MIN: 126.741. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default50100150200250SE +/- 1.24, N = 3SE +/- 1.69, N = 3SE +/- 0.09, N = 3SE +/- 0.00, N = 3204.87206.2117.5618.70MIN: 200.8MIN: 200.631. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default9K18K27K36K45KSE +/- 4.07, N = 3SE +/- 90.43, N = 3SE +/- 0.71, N = 3SE +/- 0.80, N = 3434014344620112108MIN: 43166.5MIN: 43120.21. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default50100150200250SE +/- 0.37, N = 3SE +/- 0.24, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3239.04239.2420.6726.52MIN: 237.71MIN: 237.841. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default20406080100SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 382.6183.144.895.05MIN: 81.75MIN: 82.221. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default12002400360048006000SE +/- 1.39, N = 3SE +/- 0.80, N = 3SE +/- 0.21, N = 3SE +/- 0.49, N = 354055398248258MIN: 5392.21MIN: 5387.931. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default9K18K27K36K45KSE +/- 43.57, N = 3SE +/- 22.12, N = 3SE +/- 1.43, N = 3SE +/- 22.04, N = 3409494075845595994MIN: 40623.5MIN: 40477.6MIN: 4450.23MIN: 5634.281. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f322700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default5001000150020002500SE +/- 0.55, N = 3SE +/- 0.25, N = 3SE +/- 0.16, N = 3SE +/- 0.21, N = 323302329109113MIN: 2310.75MIN: 2309.921. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 Atoms2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default0.5061.0121.5182.0242.53SE +/- 0.00461, N = 3SE +/- 0.00052, N = 3SE +/- 0.00287, N = 3SE +/- 0.00061, N = 32.248912.248371.729441.44394

NAS Parallel Benchmarks

Test / Class: BT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.A2700 - 4 GHz2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default14002800420056007000SE +/- 3.41, N = 3SE +/- 4.25, N = 3SE +/- 4.37, N = 3SE +/- 15.50, N = 3SE +/- 8.30, N = 3112111211116644064171. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.C2700 - 4 GHz2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default100200300400500SE +/- 2.73, N = 3SE +/- 5.42, N = 3SE +/- 5.46, N = 15SE +/- 0.52, N = 3SE +/- 0.04, N = 33703943844814771. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: FT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.A2700 - 4 GHz2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default12002400360048006000SE +/- 9.80, N = 3SE +/- 8.39, N = 3SE +/- 3.81, N = 3SE +/- 4.89, N = 3SE +/- 3.40, N = 3418741934216570257111. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.B2700 - 4 GHz2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default13002600390052006500SE +/- 53.21, N = 15SE +/- 8.00, N = 3SE +/- 19.91, N = 3SE +/- 15.63, N = 3SE +/- 7.75, N = 3404244844457621961931. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.A2700 - 4 GHz2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default11K22K33K44K55KSE +/- 47.81, N = 3SE +/- 55.24, N = 9SE +/- 6.33, N = 3SE +/- 457.69, N = 3SE +/- 371.45, N = 341915527561452829509091. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.C2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default5K10K15K20K25KSE +/- 22.51, N = 3SE +/- 31.87, N = 3SE +/- 18.22, N = 3SE +/- 16.81, N = 35321530921548212651. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: SP.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.A2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default9001800270036004500SE +/- 2.22, N = 3SE +/- 0.64, N = 3SE +/- 9.96, N = 3SE +/- 2.82, N = 3524525439942941. Open MPI 2.1.1

Open FMM Nero2D

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total TimeSMT DisabledSMT Enabled - Default1632486480SE +/- 0.29, N = 3SE +/- 0.29, N = 367.0172.341. (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBM2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default306090120150SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.53, N = 3SE +/- 0.04, N = 3142.90140.1174.27151.131. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default0.72451.4492.17352.8983.6225SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 33.143.153.222.221. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default714212835SE +/- 0.05, N = 3SE +/- 0.30, N = 15SE +/- 0.32, N = 3SE +/- 0.02, N = 315.7425.6228.8915.121. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI Gridding2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default20406080100SE +/- 0.04, N = 3SE +/- 0.18, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3100.56100.5818.4828.841. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number Generation2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default612182430SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 324.1024.0816.7215.431. (CXX) g++ options: -O3 -lpthread

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMD2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default1428425670SE +/- 0.01, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 330.2130.3062.2052.261. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD Solver2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default714212835SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 327.6227.6415.5013.911. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP Streamcluster2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default510152025SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 320.7120.7216.0221.201. (CXX) g++ options: -O2 -lOpenCL

Rust Prime Benchmark

Prime Number Test To 200,000,000

OpenBenchmarking.orgSeconds, Fewer Is BetterRust Prime BenchmarkPrime Number Test To 200,000,000SMT DisabledSMT Enabled - Default918273645SE +/- 0.08, N = 3SE +/- 0.07, N = 340.4830.871. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.6311.6310.628.331. (CXX) g++ options: -fopenmp -O3

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default8M16M24M32M40MSE +/- 232342.01, N = 3SE +/- 244456.97, N = 3SE +/- 316141.73, N = 3SE +/- 320057.07, N = 3229258512324844828765888391379921. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video Encode2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default918273645SE +/- 0.28, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.25, N = 326.0126.0832.2441.351. (CXX) g++ options: -O3 -pie -lpthread -lm

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video Encode2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default50100150200250SE +/- 1.62, N = 8SE +/- 1.53, N = 8SE +/- 1.35, N = 3SE +/- 1.83, N = 31531532012511. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

Swet

Average

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16Average2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default200M400M600M800M1000MSE +/- 2498094.73, N = 3SE +/- 7363504.33, N = 3SE +/- 10717369.00, N = 3SE +/- 11310134.98, N = 46698279626723103268604337658523245871. (CC) gcc options: -lm -lpthread -lcurses -lrt

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To Compile2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default20406080100SE +/- 0.81, N = 3SE +/- 1.24, N = 5SE +/- 0.81, N = 3SE +/- 0.70, N = 3105.90104.5858.8048.63

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow Mapping2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default140280420560700SE +/- 0.31, N = 3SE +/- 0.38, N = 3SE +/- 1.29, N = 3SE +/- 1.06, N = 3427427483644-lpthread-lpthread1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding2700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default1122334455SE +/- 0.22, N = 3SE +/- 0.44, N = 3SE +/- 0.19, N = 3SE +/- 0.40, N = 331.4831.0648.9040.661. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 92700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default1020304050SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 343.0643.0037.1825.601. (CC) gcc options: -pthread -fvisibility=hidden -O2

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 192700 - 4 GHz Arch kernel2700 - 4 GHz IISMT DisabledSMT Enabled - Default714212835SE +/- 0.14, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 330.9431.0721.5018.05-llzma -llz4-llzma -llz41. (CC) gcc options: -O3 -pthread -lz


Phoronix Test Suite v10.8.5