Ryzen 9 3900X Linux SMT Performance

AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1907314-AS-RYZEN939048
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
SMT Enabled - Default
July 31 2019
  3 Hours, 22 Minutes
SMT Disabled
July 31 2019
  3 Hours, 33 Minutes
Invert Behavior (Only Show Selected Data)
  3 Hours, 28 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Ryzen 9 3900X Linux SMT PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 7.4.0ext43840x2160ProcessorsMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionRyzen 9 3900X Linux SMT Performance BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - SMT Enabled - Default: Scaling Governor: acpi-cpufreq schedutil- SMT Disabled: Scaling Governor: acpi-cpufreq ondemand- Python 2.7.15+ + Python 3.6.8- SMT Enabled - Default: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling - SMT Disabled: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling

SMT Enabled - Default vs. SMT Disabled ComparisonPhoronix Test SuiteBaseline+35.7%+35.7%+71.4%+71.4%+107.1%+107.1%142.6%134.6%110.2%103.5%56.1%32.3%31.5%28.3%20.3%15.9%8%6.5%4.8%4.8%3.9%3.8%3.5%3.3%3%2.5%2.3%2.3%IP Batch 1D - f32IP Batch All - f32H.2.H.T.N.DOpenMP LBMOpenMP Stencil91.1%O.M.GBedroom47.1%C.u.1.0.3.s.i.i.C.L.945.2%OpenMP CUTCP45%Barbershop - CPU-Only44.6%Supercar42.2%CoreMark Size 666 - I.P.S41.3%1.H.M.2.D40.3%Fayalite-FIST Data38.3%Emily36.3%Total Time36.1%P.R.W.S.S.M33.3%O.SMaterial Tester32.3%C.S.T31.8%D.B.d - f32P.N.T.T.2.0.031.1%CPU30.9%D.B.d - f321.8.b.Y.T.A.V.E28.3%G.I.R.1.S27.5%1.8.b.Y.T.H.V.E25.2%Time To Compile20.9%H.2.1.V.EATPase Simulation - 327,506 Atoms19.8%C.u.1.0.3.s.i.i.C.L.119.1%OpenMP LavaMD19%Disney Material18.1%Noise-Gaussian16.6%L.E.HOpenMP CFD Solver11.4%1.P.N.G8.4%Total TimeWater Benchmark7.6%C.B.c - f32Total Time - 4.1.R.P.P5.5%SharpenC.B.c - f32C.B.c - f32LU.AC.B.c - f32D.B.d - f32EnhancedSP.ARotateResizingMKL-DNNMKL-DNNFFmpegParboilParboilParboilIndigoBenchXZ CompressionParboilBlenderIndigoBenchCoremarkasmFishCP2K Molecular DynamicsAppleseedStockfishTTSIOD 3D RendererRodiniaAppleseed7-Zip CompressionMKL-DNNRust Prime BenchmarkChaos Group V-RAYMKL-DNNSVT-AV1SmallptSVT-HEVCTimed Linux Kernel Compilationx265NAMDZstd CompressionRodiniaAppleseedGraphicsMagickCloverLeafRodiniaPrimesieveOpen FMM Nero2DGROMACSMKL-DNNC-RayGraphicsMagickMKL-DNNMKL-DNNNAS Parallel BenchmarksMKL-DNNMKL-DNNGraphicsMagickNAS Parallel BenchmarksGraphicsMagickGraphicsMagickSMT Enabled - DefaultSMT Disabled

Ryzen 9 3900X Linux SMT Performancecompress-7zip: Compress Speed Testappleseed: Emilyappleseed: Disney Materialappleseed: Material Testerasmfish: 1024 Hash Memory, 26 Depthblender: Barbershop - CPU-Onlyc-ray: Total Time - 4K, 16 Rays Per Pixelv-ray: CPUcloverleaf: Lagrangian-Eulerian Hydrodynamicscoremark: CoreMark Size 666 - Iterations Per Secondcp2k: Fayalite-FIST Dataffmpeg: H.264 HD To NTSC DVgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacegromacs: Water Benchmarkhimeno: Poisson Pressure Solverindigobench: Bedroomindigobench: Supercarmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32namd: ATPase Simulation - 327,506 Atomsnpb: BT.Anpb: EP.Cnpb: FT.Anpb: FT.Bnpb: LU.Anpb: LU.Cnpb: SP.Anero2d: Total Timeparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingprimesieve: 1e12 Prime Number Generationrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterrust-prime: Prime Number Test To 200,000,000smallpt: Global Illumination Renderer; 128 Samplesstockfish: Total Timesvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodeswet: Averagebuild-linux-kernel: Time To Compilettsiod-renderer: Phong Rendering With Soft-Shadow Mappingx265: H.265 1080p Video Encodingcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19SMT Enabled - DefaultSMT Disabled77922271.85167.95163.7439682621710.1253.23205583.79532587.52323.088.452312631651972661692850.991386.872.034.3517.25210.1018.702107.8226.525.05257.955993.69112.711.443946417.48477.095711.266193.0250909.3021264.924293.5272.34151.132.2215.1228.8415.4352.2613.9121.2030.878.333913799241.35251.3985232458748.63643.9040.6625.6018.0559108370.45198.39216.56282805921027.1856.14157093.27376805.08446.704.022332691732032721452830.921369.031.383.067.1189.5417.562011.4820.674.89248.324559.32108.941.729446440.11480.695701.856218.6752828.7121548.384399.1667.0174.273.2228.8918.4816.7262.2015.5016.0240.4810.622876588832.24200.7886043376558.80483.1548.9037.1821.50OpenBenchmarking.org

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestSMT DisabledSMT Enabled - Default20K40K60K80K100KSE +/- 393.22, N = 3SE +/- 323.94, N = 359108779221. (CXX) g++ options: -pipe -lpthread

Appleseed

Appleseed is an open-source production renderer focused on physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilySMT DisabledSMT Enabled - Default80160240320400370.45271.85

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialSMT DisabledSMT Enabled - Default4080120160200198.39167.95

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterSMT DisabledSMT Enabled - Default50100150200250216.56163.74

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthSMT DisabledSMT Enabled - Default8M16M24M32M40MSE +/- 41988.40, N = 3SE +/- 201681.41, N = 32828059239682621

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: CPU-OnlySMT DisabledSMT Enabled - Default20040060080010001027.18710.12

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelSMT DisabledSMT Enabled - Default1326395265SE +/- 0.04, N = 3SE +/- 0.02, N = 356.1453.231. (CC) gcc options: -lm -lpthread -O3

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.03Mode: CPUSMT DisabledSMT Enabled - Default4K8K12K16K20KSE +/- 24.84, N = 3SE +/- 55.98, N = 31570920558

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsSMT DisabledSMT Enabled - Default0.85281.70562.55843.41124.264SE +/- 0.01, N = 3SE +/- 0.00, N = 33.273.791. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondSMT DisabledSMT Enabled - Default110K220K330K440K550KSE +/- 731.59, N = 3SE +/- 2501.14, N = 3376805.08532587.521. (CC) gcc options: -O2 -lrt" -lrt

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently makes use of the OpenMP implementation and using the Fayalite-FIST molecular dynamics run and measures the total time to complete. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 6.1Fayalite-FIST DataSMT DisabledSMT Enabled - Default100200300400500446.70323.08

FFmpeg

This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVSMT DisabledSMT Enabled - Default246810SE +/- 0.01, N = 3SE +/- 0.06, N = 34.028.451. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlSMT DisabledSMT Enabled - Default50100150200250SE +/- 1.53, N = 3SE +/- 0.67, N = 32332311. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateSMT DisabledSMT Enabled - Default60120180240300SE +/- 1.00, N = 3SE +/- 0.88, N = 32692631. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenSMT DisabledSMT Enabled - Default4080120160200SE +/- 0.33, N = 31731651. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedSMT DisabledSMT Enabled - Default4080120160200SE +/- 0.33, N = 32031971. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingSMT DisabledSMT Enabled - Default60120180240300SE +/- 1.20, N = 3SE +/- 1.20, N = 32722661. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianSMT DisabledSMT Enabled - Default4080120160200SE +/- 0.33, N = 31451691. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceSMT DisabledSMT Enabled - Default60120180240300SE +/- 1.20, N = 3SE +/- 0.88, N = 32832851. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GROMACS

The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water BenchmarkSMT DisabledSMT Enabled - Default0.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 30.920.991. (CXX) g++ options: -march=core-avx2 -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverSMT DisabledSMT Enabled - Default30060090012001500SE +/- 3.89, N = 3SE +/- 1.81, N = 31369.031386.871. (CC) gcc options: -O3 -mavx2

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: BedroomSMT DisabledSMT Enabled - Default0.45680.91361.37041.82722.284SE +/- 0.00, N = 3SE +/- 0.00, N = 31.382.03

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: SupercarSMT DisabledSMT Enabled - Default0.97881.95762.93643.91524.894SE +/- 0.01, N = 3SE +/- 0.00, N = 33.064.35

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32SMT DisabledSMT Enabled - Default48121620SE +/- 0.02, N = 3SE +/- 0.29, N = 37.1117.25MIN: 7.02MIN: 11.061. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32SMT DisabledSMT Enabled - Default50100150200250SE +/- 0.14, N = 3SE +/- 1.44, N = 389.54210.10MIN: 88.32MIN: 126.741. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32SMT DisabledSMT Enabled - Default510152025SE +/- 0.09, N = 3SE +/- 0.00, N = 317.5618.70MIN: 17.1MIN: 18.371. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32SMT DisabledSMT Enabled - Default5001000150020002500SE +/- 0.71, N = 3SE +/- 0.80, N = 32011.482107.82MIN: 1994.1MIN: 2091.721. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32SMT DisabledSMT Enabled - Default612182430SE +/- 0.15, N = 3SE +/- 0.04, N = 320.6726.52MIN: 20.19MIN: 25.611. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32SMT DisabledSMT Enabled - Default1.13632.27263.40894.54525.6815SE +/- 0.01, N = 3SE +/- 0.01, N = 34.895.05MIN: 4.79MIN: 4.971. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32SMT DisabledSMT Enabled - Default60120180240300SE +/- 0.21, N = 3SE +/- 0.49, N = 3248.32257.95MIN: 246.46MIN: 256.121. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32SMT DisabledSMT Enabled - Default13002600390052006500SE +/- 1.43, N = 3SE +/- 22.04, N = 34559.325993.69MIN: 4450.23MIN: 5634.281. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32SMT DisabledSMT Enabled - Default306090120150SE +/- 0.16, N = 3SE +/- 0.21, N = 3108.94112.71MIN: 107.13MIN: 111.11. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 AtomsSMT DisabledSMT Enabled - Default0.38910.77821.16731.55641.9455SE +/- 0.00287, N = 3SE +/- 0.00061, N = 31.729441.44394

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.ASMT DisabledSMT Enabled - Default14002800420056007000SE +/- 15.50, N = 3SE +/- 8.30, N = 36440.116417.481. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.CSMT DisabledSMT Enabled - Default100200300400500SE +/- 0.52, N = 3SE +/- 0.04, N = 3480.69477.091. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.ASMT DisabledSMT Enabled - Default12002400360048006000SE +/- 4.89, N = 3SE +/- 3.40, N = 35701.855711.261. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.BSMT DisabledSMT Enabled - Default13002600390052006500SE +/- 15.63, N = 3SE +/- 7.75, N = 36218.676193.021. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.ASMT DisabledSMT Enabled - Default11K22K33K44K55KSE +/- 457.69, N = 3SE +/- 371.45, N = 352828.7150909.301. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.CSMT DisabledSMT Enabled - Default5K10K15K20K25KSE +/- 18.22, N = 3SE +/- 16.81, N = 321548.3821264.921. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.ASMT DisabledSMT Enabled - Default9001800270036004500SE +/- 9.96, N = 3SE +/- 2.82, N = 34399.164293.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Open FMM Nero2D

This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total TimeSMT DisabledSMT Enabled - Default1632486480SE +/- 0.29, N = 3SE +/- 0.29, N = 367.0172.341. (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMSMT DisabledSMT Enabled - Default306090120150SE +/- 0.53, N = 3SE +/- 0.04, N = 374.27151.131. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPSMT DisabledSMT Enabled - Default0.72451.4492.17352.8983.6225SE +/- 0.01, N = 3SE +/- 0.00, N = 33.222.221. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilSMT DisabledSMT Enabled - Default714212835SE +/- 0.32, N = 3SE +/- 0.02, N = 328.8915.121. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingSMT DisabledSMT Enabled - Default714212835SE +/- 0.02, N = 3SE +/- 0.01, N = 318.4828.841. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationSMT DisabledSMT Enabled - Default48121620SE +/- 0.02, N = 3SE +/- 0.02, N = 316.7215.431. (CXX) g++ options: -O3 -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDSMT DisabledSMT Enabled - Default1428425670SE +/- 0.01, N = 3SE +/- 0.06, N = 362.2052.261. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverSMT DisabledSMT Enabled - Default48121620SE +/- 0.07, N = 3SE +/- 0.02, N = 315.5013.911. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterSMT DisabledSMT Enabled - Default510152025SE +/- 0.02, N = 3SE +/- 0.00, N = 316.0221.201. (CXX) g++ options: -O2 -lOpenCL

Rust Prime Benchmark

Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRust Prime BenchmarkPrime Number Test To 200,000,000SMT DisabledSMT Enabled - Default918273645SE +/- 0.08, N = 3SE +/- 0.07, N = 340.4830.871. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesSMT DisabledSMT Enabled - Default3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 310.628.331. (CXX) g++ options: -fopenmp -O3

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeSMT DisabledSMT Enabled - Default8M16M24M32M40MSE +/- 316141.73, N = 3SE +/- 320057.07, N = 328765888391379921. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeSMT DisabledSMT Enabled - Default918273645SE +/- 0.02, N = 3SE +/- 0.25, N = 332.2441.351. (CXX) g++ options: -O3 -pie -lpthread -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeSMT DisabledSMT Enabled - Default50100150200250SE +/- 1.35, N = 3SE +/- 1.83, N = 3200.78251.391. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

Swet

Swet is a synthetic CPU/RAM benchmark, includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageSMT DisabledSMT Enabled - Default200M400M600M800M1000MSE +/- 10717369.00, N = 3SE +/- 11310134.98, N = 48604337658523245871. (CC) gcc options: -lm -lpthread -lcurses -lrt

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To CompileSMT DisabledSMT Enabled - Default1326395265SE +/- 0.81, N = 3SE +/- 0.70, N = 358.8048.63

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingSMT DisabledSMT Enabled - Default140280420560700SE +/- 1.29, N = 3SE +/- 1.06, N = 3483.15643.901. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingSMT DisabledSMT Enabled - Default1122334455SE +/- 0.19, N = 3SE +/- 0.40, N = 348.9040.661. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9SMT DisabledSMT Enabled - Default918273645SE +/- 0.09, N = 3SE +/- 0.07, N = 337.1825.601. (CC) gcc options: -pthread -fvisibility=hidden -O2

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19SMT DisabledSMT Enabled - Default510152025SE +/- 0.02, N = 3SE +/- 0.08, N = 321.5018.051. (CC) gcc options: -O3 -pthread -lz